BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 000067
(2445 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224132582|ref|XP_002327831.1| SET domain protein [Populus trichocarpa]
gi|222837240|gb|EEE75619.1| SET domain protein [Populus trichocarpa]
Length = 2476
Score = 3212 bits (8327), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1684/2558 (65%), Positives = 1958/2558 (76%), Gaps = 195/2558 (7%)
Query: 1 MGDGGVACMPLQQQQQHNSIMERFPISDK------------TTICVGNSSNNSNKT---- 44
MG GGVACMPLQ + + ERFP+ ++ TT C G + NSN
Sbjct: 1 MGSGGVACMPLQHGSNNIIMEERFPVQEQPTAAAAAMTTTATTACGGGKTVNSNSNISSA 60
Query: 45 -----NNNSISNNNDNKTNNDSSNN----------------------------------- 64
NN S + DN N SSN
Sbjct: 61 DNDNNNNGSSGDKKDNGKVNASSNGVTGKLKRVKRIIKVKKVVRRVVLGEKKGVGLDKAV 120
Query: 65 NGSSSSKNNETNKSNVKKNGVSTKTVRKKIVKIKKVIAVKKKEVQKN-SGSSKSNNNGEN 123
G+ S + E K++G+ T+ K++ KK +KK++ K + K + +
Sbjct: 121 KGAGGSGSKEVAVLEKKESGLKTEEKSKEVAAEKKESGLKKEDKSKEVAAEKKESGLKSS 180
Query: 124 IDNKNVENGGAVGE---VVTVDKENLKNEEVEEGELGTLKW------ENGEFVQPEKSQP 174
+K VENG +G V N+K EEVEEGELGTL+W ENGEFV P +P
Sbjct: 181 SGSKTVENGDGLGSGDSKVQSGSNNIK-EEVEEGELGTLRWPSKGEIENGEFV-PTPEKP 238
Query: 175 QSQLQSQSKQIEKGEIIVFSSKCRRGETEKGE---SGLWRGN---KDDIEKGEFIPDRWH 228
+ +IE+GEI S K ++G+ EKGE WR +D+IEKGEFIPDRW+
Sbjct: 239 RRS------EIERGEI--GSGKWKKGDIEKGEIVSGNKWRKGEAVRDEIEKGEFIPDRWN 290
Query: 229 KEVVKDEYGYSKSR-RYDYKLERTPPSGKYSGEDVYRRKEFDRSGSQHSKSSSRWESGQE 287
+KDEYGY+KSR R+D ERTPPSGKYS EDVYRRKE RSG RWESGQE
Sbjct: 291 ---IKDEYGYNKSRGRHDMSSERTPPSGKYSSEDVYRRKELSRSGGM------RWESGQE 341
Query: 288 RNVRISSKIVDDEGLYKGEHNNGKNHGREYFHGNRFKRHGTDSDSGDRKYYGDYGDFAGL 347
R+ RISSKIVD+EG YK E++NGK+H RE+ GNR KRH TDSD+ +RKYYGDY A
Sbjct: 342 RSTRISSKIVDEEGSYKSEYSNGKSHEREHASGNRLKRHVTDSDNTERKYYGDY---AIS 398
Query: 348 KSRRLSDDYNSRSVHSEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRVIYDRH 407
KSRRLS+D SR +SEHYSRHSVE+F+++SS SR+SS DKYSSRHHEP+LSS+V+YDRH
Sbjct: 399 KSRRLSED-GSRYAYSEHYSRHSVERFYKSSSYSRVSSSDKYSSRHHEPTLSSKVVYDRH 457
Query: 408 GRSPSHSDRSPHDRGRYYDHRDRSPSR--------------HDRSPYTRDRSPYT----- 448
SHSDRSPHDR RYYDHRDRSP R H+RSPY R+RSPY
Sbjct: 458 ----SHSDRSPHDRPRYYDHRDRSPIRYEKSPYGREKTPFGHERSPYGRERSPYGRERSP 513
Query: 449 ---------FDRSPYSRERSPYNRDRSPYAREKSPYDRSRHYDHRNRSPFSAERSPQDRA 499
DRSPY RE+SPY R+RSPY EKSPYDRS + +HR RSP ERSPQDR
Sbjct: 514 YWRDRSPDGHDRSPYGREKSPYGRERSPYVLEKSPYDRSSYNEHRKRSPAYFERSPQDRT 573
Query: 500 RFHDRSDRTPNYLERSPLHRSRPNNHREASSKTGASEKRNARYDSKGHEDKLGPKDSNAR 559
R HDRSDRTP+YLERSP R+RP NHREAS K A EKR+++Y +K +DK+ KD +
Sbjct: 574 RHHDRSDRTPSYLERSPHDRARPTNHREASRKGAAHEKRSSQYGNKKQDDKISQKDPAVK 633
Query: 560 CSRSSAKESQDKSNVQDLNVSDEKTANCESHKEEQPQSSSVDCKEPPQVDGPPLEELVSM 619
+ SAKESQDKS+V +L+ DEK + E+ EE+ +S ++ KE P+VDGPP EEL SM
Sbjct: 634 DTELSAKESQDKSSVHNLDGLDEKNTSSETRLEEKSESPVINAKESPKVDGPPPEELQSM 693
Query: 620 EEDMDICDTPPHVPAVTDSSVGKWFYLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKHL 679
EEDMDICDTPPHVP V D+S G+WFYLDH G+ECGPS+LC+LK LV+EG+L+SDHFIKHL
Sbjct: 694 EEDMDICDTPPHVPVVADTSTGRWFYLDHFGVECGPSKLCELKALVDEGILMSDHFIKHL 753
Query: 680 DSNRWETVENAVSPLVTVNFPSITSDSVTQLVSPPEASGNLLADTGDTAQS---TGEEFP 736
DS+RW T+ENAVSPLVTVNFPS+ D +TQLVSPPEA GNLLADTGD QS GE P
Sbjct: 754 DSDRWLTIENAVSPLVTVNFPSVVPDVITQLVSPPEAPGNLLADTGDIVQSCSQIGEGVP 813
Query: 737 VTL-QSQCCPDGSAAAAESSEDLHIDVRVGALLDGFTVIPGKEIETLGEILQTTFERVDW 795
L Q CP+ SA A+E EDL ID RVGALL+GF+V+PG EIET+G
Sbjct: 814 GNLLQPLVCPNHSAVASEPLEDLQIDERVGALLEGFSVVPGSEIETVG------------ 861
Query: 796 QNNGGPTWHGACVGEQKPGDQKVDELY-ISDTKMKEAAELKSG---DKDH-WVVCFDSDE 850
G W+ A EQ+ DQ +EL SD KEA E G DKD + DS +
Sbjct: 862 ----GFAWYLASTAEQQ--DQNSNELLGHSDLITKEAVEAWPGSLADKDDGFASSVDSAD 915
Query: 851 WFSGRWSCKGGDWKRNDEAAQDRCSRKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYPS 910
WFSGRWSCKGGDWKRNDE+ QDR +R+K VLNDGFPLC M KSG EDPRW +KDDLY+PS
Sbjct: 916 WFSGRWSCKGGDWKRNDESVQDRFTRRKVVLNDGFPLCHMTKSGCEDPRWQRKDDLYFPS 975
Query: 911 HSRRLDLPPWAYACPDERNDGSGGSRSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSFV 970
SR+LDLPPWA++ DERND G S+ST +K RGVKGT+LPVVRINACVV DH V
Sbjct: 976 QSRKLDLPPWAFSSTDERNDTGGVSKSTLNKPPITRGVKGTVLPVVRINACVVQDH---V 1032
Query: 971 SEPRSKVRAKERHSSRSARSYSSANDVRRSSAESDSHSKARNNQDSQGSWKSIACINTPK 1030
SE R+KVR K+R+ SR+AR++S+ NDV+RSS ESDS SK N+ DS G WKS A +NTPK
Sbjct: 1033 SETRTKVRGKDRYHSRAARTHSATNDVKRSSVESDSQSKVVNDPDSHGCWKSTAPLNTPK 1092
Query: 1031 DRLCTVDDLQLQLGEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPL 1090
D LCT DDLQL LGEWYYLDGAGHE+GPSSFSELQ L D G IQK++SVFRKFD+VWVP+
Sbjct: 1093 DCLCTADDLQLNLGEWYYLDGAGHEQGPSSFSELQNLADIGTIQKYSSVFRKFDRVWVPI 1152
Query: 1091 TFATETSASTVRNHGEKIMPSGDSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYT 1150
T ATET ++V+ + P SSG T S+ ++ +S++FH++HPQFIG+T
Sbjct: 1153 TSATETFGASVKIQQSNVEPVIGSSG---TLSKSQTASNVESDRSSSSFHSLHPQFIGFT 1209
Query: 1151 RGKLHELVMKSYKNREFAAAINEVLDPWINAKQPKKETE-HVYRKS--EGDTRAGKRARL 1207
RGKLHELVMKSYKNREFAAAINE LDPWI AK+P KE + H+Y KS E D RAGKRAR+
Sbjct: 1210 RGKLHELVMKSYKNREFAAAINEALDPWIVAKRPPKEIDKHMYLKSGMEIDARAGKRARM 1269
Query: 1208 LVRESDGDEETEEELQTIQDESTFEDLCGDASFPGEESASSAIESGGWGLLDGHTLAHVF 1267
++D D E EE +DE+TFE LCGD +F EES S IE+G WGLLDGH LA VF
Sbjct: 1270 QPAQNDEDYEMEEGTLH-KDETTFEQLCGDTNFHREESMCSEIEAGSWGLLDGHMLARVF 1328
Query: 1268 HFLRSDMKSLAFASLTCRHWRAAVRFYKGISRQVDLSSVGPNCTDSLIRKTLNAFDKEKL 1327
HFLRSDMKSL FASLTC+ WR AV FYKGIS QVDLSS PNCTD ++R +N ++KEK+
Sbjct: 1329 HFLRSDMKSLVFASLTCKKWRCAVSFYKGISIQVDLSSGAPNCTDIMVRSIMNGYNKEKI 1388
Query: 1328 NSILLVGCTNITSGMLEEILQSFPHLSSIDIRGCGQFGELALKFPNINWVKSQKSRGAKF 1387
N+++L GC NITSGMLEEIL+SFP LSSIDIRGC QF ELAL+FPNI+W+KS+ +
Sbjct: 1389 NAMVLAGCKNITSGMLEEILRSFPCLSSIDIRGCTQFMELALRFPNISWLKSRTRISVES 1448
Query: 1388 NDSRSKIRSLKQITEKSSSAPKSKGLGDDMDDFGDLKDYFESVDKRDSANQSFRRSLYQR 1447
N SK+RSLKQI+E+ DDFG+LK+YF+SV+KRDSANQ FRRSLY+R
Sbjct: 1449 N---SKLRSLKQISER--------------DDFGELKEYFDSVNKRDSANQLFRRSLYKR 1491
Query: 1448 SKVFDARKSSSILSRDARMRRWSIKKSENGYKRMEEFLASSLKEIMRVNTFEFFVPKVAE 1507
SKVFDARKSSSIL RDARMRRW++KKSEN Y+RME FLAS LK+IM+ NTF+FFVPK+ E
Sbjct: 1492 SKVFDARKSSSILPRDARMRRWAVKKSENSYRRMEGFLASGLKDIMKENTFDFFVPKLTE 1551
Query: 1508 IEGRMKKGYYISHGLGSVKDDISRMCRDAIKAKNRGSAGDMNRITTLFIQLATRLEQGAK 1567
IE RMK GYY+ HGL +VK+DISRMCRDAIK KNRG AGDMN I TLF+QLA+RLE+ +K
Sbjct: 1552 IEDRMKSGYYVGHGLRAVKEDISRMCRDAIKVKNRG-AGDMNHIITLFLQLASRLEESSK 1610
Query: 1568 SSYYEREEMMKSWKDESPAGLYSATSKYKKKLSKMVSERKYMNRSNGTSLANGDFDYGEY 1627
SY ER+E+MKSWKD+ L SA K+KKK KYMNRSNGT LANG FD+GEY
Sbjct: 1611 FSY-ERDELMKSWKDDVSTALDSAPIKHKKKAIDK----KYMNRSNGTILANGSFDFGEY 1665
Query: 1628 ASDREIRKRLSKLNRKSLDSGSETSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARE 1687
ASD+EI+KR+SKLNRKS+DSGSETSDD SSEDG+S ST SDT+SD+DFRS+GR +
Sbjct: 1666 ASDQEIKKRISKLNRKSMDSGSETSDDR--SSEDGRSGGGSTASDTESDLDFRSEGRPGD 1723
Query: 1688 SRGAGDFTTDEGLDFSDDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVS 1747
SRG F TDE D+REWGARMT ASLVPPVTRKYEVIDQYVIVADEEDV+RKM VS
Sbjct: 1724 SRGDEYFMTDE-----DEREWGARMTNASLVPPVTRKYEVIDQYVIVADEEDVQRKMSVS 1778
Query: 1748 LPEDYAEKLNAQKNGSEELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMP 1807
LP+DYAEKL+AQKNG+EELDMELPEVKDYKPRKQLGD+V EQEVYGIDPYTHNLLLDSMP
Sbjct: 1779 LPDDYAEKLDAQKNGTEELDMELPEVKDYKPRKQLGDEVIEQEVYGIDPYTHNLLLDSMP 1838
Query: 1808 DELDWNLLEKHLFIEDVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRT 1867
+E+DW L +KH+FIEDVLL TLNKQVRH+TG GNTPM YPLQPV+EE+E+ A++DCD RT
Sbjct: 1839 EEVDWPLSQKHMFIEDVLLCTLNKQVRHYTGAGNTPMTYPLQPVVEELEQAAMEDCDTRT 1898
Query: 1868 MKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQD 1927
MK+CRGIL+A+DSRPDDKYVAYRKGLGVVCNKE GF +DDFVVEFLGEVYP WKWFEKQD
Sbjct: 1899 MKICRGILRAIDSRPDDKYVAYRKGLGVVCNKEAGFRDDDFVVEFLGEVYPAWKWFEKQD 1958
Query: 1928 GIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKV 1987
GIR LQK++++PAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSC+PNCEAKV
Sbjct: 1959 GIRLLQKDSKEPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCKPNCEAKV 2018
Query: 1988 TAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEG 2047
TAV G YQIGIY+VR I +GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEG
Sbjct: 2019 TAVGGQYQIGIYSVRKIQHGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEG 2078
Query: 2048 AFEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARL 2107
AF+KVLKE HGLLDRH LML ACELNSVSEEDYL+LGRAGLGSCLLGGLP+WVVAYSARL
Sbjct: 2079 AFQKVLKECHGLLDRHYLMLGACELNSVSEEDYLDLGRAGLGSCLLGGLPDWVVAYSARL 2138
Query: 2108 VRFINLERTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDK 2167
VRFINLERTKLPEEILRHNLEEK+KYF+DIC+EVE+SDAEVQAEGVYNQRLQNLAVTLDK
Sbjct: 2139 VRFINLERTKLPEEILRHNLEEKKKYFADICIEVERSDAEVQAEGVYNQRLQNLAVTLDK 2198
Query: 2168 VRYVMRCVFGDPKKAPPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKS 2227
VRYVMRC+FGDPK APPP+E+L+PEETVSFLWK EGSLVEEL+QCM+PH++ ++LNDLKS
Sbjct: 2199 VRYVMRCIFGDPKLAPPPLEKLTPEETVSFLWKEEGSLVEELLQCMSPHMDGEMLNDLKS 2258
Query: 2228 KIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQE 2287
KI AHDPS S+DI + ++KSLLWLRDEVR+LPCTYKCRHDAAADLIH+YAYTK FFRV+E
Sbjct: 2259 KIYAHDPSDSDDIPKAIQKSLLWLRDEVRSLPCTYKCRHDAAADLIHVYAYTKSFFRVRE 2318
Query: 2288 YKAFTSPPVYISPLDLGPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLA 2347
Y AFTSPPVYISPLDLGPK ADKLG Y+KTYGENYC+GQLIFWHIQTN +PD TLA
Sbjct: 2319 YDAFTSPPVYISPLDLGPKCADKLGGLPHKYQKTYGENYCMGQLIFWHIQTNTEPDSTLA 2378
Query: 2348 RASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSP 2407
+AS+GCLSLPDIGSFY+KVQKPS+ R+YGPKTV+ ML RMEK PQ+PWPKD+IW+FKSSP
Sbjct: 2379 KASKGCLSLPDIGSFYSKVQKPSQQRIYGPKTVKMMLGRMEKYPQKPWPKDQIWSFKSSP 2438
Query: 2408 RIFGSPMLDSSLTGCPLDREMVHWLKHRPAIFQAMWDR 2445
++FGSPMLD+ L PLDREMVHWLKHRP ++QAMWDR
Sbjct: 2439 KVFGSPMLDAVLNKSPLDREMVHWLKHRPTVYQAMWDR 2476
>gi|255549293|ref|XP_002515700.1| huntingtin interacting protein, putative [Ricinus communis]
gi|223545137|gb|EEF46647.1| huntingtin interacting protein, putative [Ricinus communis]
Length = 2430
Score = 3181 bits (8248), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1628/2346 (69%), Positives = 1886/2346 (80%), Gaps = 124/2346 (5%)
Query: 144 ENLKNEEVEEGELGTLKW-------ENGEFVQPEKSQPQSQLQSQSKQIEKGEIIVFSSK 196
+N EEVEEGELGTLKW ENGEFV PEK+ ++ +I+KGEI++ + K
Sbjct: 165 QNNNKEEVEEGELGTLKWPPKAAEVENGEFVPPEKT-------TRRTEIDKGEIVI-ADK 216
Query: 197 CRRGETEKGE----SGLWRG---NKDDIEKGEFIPDRWHKEVVKDEYGYSKSR-RYDYKL 248
R+ + EKGE SG WR ++D+IEKGEFIPDRWH K+E GY+KSR +YD
Sbjct: 217 WRKRDIEKGEGTAVSGRWRKGDFSRDEIEKGEFIPDRWHN---KEELGYNKSRTKYDISR 273
Query: 249 ERTPPSGKYSGEDVYRRKEFDRSGSQHSKSSS-RWESGQERNVRISSKIVDDEGLYKGEH 307
ERTPPSGKYS ED+YRRKEF RSGS SS RWESG ERN+RISSKI+D+E +YK E+
Sbjct: 274 ERTPPSGKYSNEDIYRRKEFSRSGSSQHSKSSSRWESGLERNIRISSKILDEESMYKSEY 333
Query: 308 NNGKNHGREYFHGNRFKRHGTDSDSGDRKYYGDYGDFAGLKSRRLSDDYNSRSVHSEHYS 367
+NGKNHGR+Y GNR KR+G DSDS +RK+YGDYGD+A KSRRLS+D +R +HSEHYS
Sbjct: 334 SNGKNHGRDYTSGNRLKRYGADSDSSERKHYGDYGDYACSKSRRLSED-TARPIHSEHYS 392
Query: 368 RHSVEKFHRNSSSSRISS--LDKYSSRHHEPSLSSRVIYDRHGRSPSHSDRSPHDRGRYY 425
RHSVE+F+RNSS++ LDKYSSRHHEP+LSS+V+YDRH RSP HS+RSP DR R+Y
Sbjct: 393 RHSVERFYRNSSTTSSRISSLDKYSSRHHEPTLSSKVVYDRHERSPGHSERSPRDRARHY 452
Query: 426 DHRDRSPSRHDRSPY--------------TRDRSPYTFDRSPYSRERSPYNRDRSPYARE 471
DHRDRSP R +RSPY R+RSPY +RSPY ERSPY R+RSPYAR+
Sbjct: 453 DHRDRSPVRRERSPYRLERSPFGRERSPYVRERSPYVRERSPYVHERSPYVRERSPYARD 512
Query: 472 KSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNNHREASSK 531
KSPYDRSRHYD+R RSP +ERS QDR +HDR DRTPN+LERSPL R RPNNHREAS K
Sbjct: 513 KSPYDRSRHYDYR-RSPAHSERSSQDR--YHDRRDRTPNFLERSPLDRGRPNNHREASRK 569
Query: 532 TGASEKRNARYDSKGHEDKLGPKDSNARCSRSSAKESQDKSNVQDLNVSDEKTANCESHK 591
G SEKRN++ +KG EDKL KD + R S+ KESQD+++V ++ +EK A+ +S K
Sbjct: 570 GGVSEKRNSQNANKGKEDKLNQKDCSERDSQFIVKESQDRNDVHNITGLEEKNASSDSLK 629
Query: 592 EEQPQSSSVDCKEPPQVDGPPLEELVSMEEDMDICDTPPHVPAVTDSSVGKWFYLDHCGM 651
E Q QS +D KE VDGPP EEL+SMEEDMDICDTPPHVPAVTDSS GKWFYLD+ G+
Sbjct: 630 EAQTQSPVMDVKESLPVDGPPPEELLSMEEDMDICDTPPHVPAVTDSSTGKWFYLDYFGL 689
Query: 652 ECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSITSDSVTQLV 711
ECGPS+LCDLK LV+ GVLV+DH +KHLDS+RW T+ENAVSPLV NFPSI SD+VT+LV
Sbjct: 690 ECGPSKLCDLKALVDGGVLVADHLVKHLDSDRWVTIENAVSPLVASNFPSIVSDTVTRLV 749
Query: 712 SPPEASGNLLADTGDTAQS---TGEEFPVTL-QSQCCPDGSAAAAESSEDLHIDVRVGAL 767
SPPEA GNLLADTGD QS GEE + L Q C + +AA +E EDLHID RVGAL
Sbjct: 750 SPPEAPGNLLADTGDMGQSGYKNGEEASMALPQPLGCLNDNAALSEPLEDLHIDQRVGAL 809
Query: 768 LDGFTVIPGKEIETLGEILQTTFERVDWQNNGGPTWHGACVGEQKPGDQKVDELYISDTK 827
L+G+T++PG+E+ET+GE+L TTFE V W+ G E++ G + SD K
Sbjct: 810 LEGYTIVPGRELETIGEVLLTTFELVPWERCGQ--------SEEQFGQSNDEPSRYSDLK 861
Query: 828 MKEAAELKS---GDKDHWVVCF-DSDEWFSGRWSCKGGDWKRNDEAAQDRCSRKKQVLND 883
+A E+ S D+D CF DS +WFSGRWSCKGGDWKRNDE QDR SR+K VL+D
Sbjct: 862 PNDAVEVSSSATSDRDQSCACFADSADWFSGRWSCKGGDWKRNDENVQDRFSRRKFVLSD 921
Query: 884 GFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYACPDERNDGSGGSRSTQSKLA 943
G+PLCQMPKSG EDPRW++KDDLYYPS SRRLDLPPWA++C DERN+ SR+T +K +
Sbjct: 922 GYPLCQMPKSGTEDPRWHRKDDLYYPSQSRRLDLPPWAFSCTDERNECGSASRTTLAKPS 981
Query: 944 AVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSSRSARSYSSANDVRRSSAE 1003
VRGVKGTMLPVVRINACVV DHGSFVSEPR KVR KER+ SRS+R YS+ANDV+R +AE
Sbjct: 982 VVRGVKGTMLPVVRINACVVKDHGSFVSEPRIKVRGKERYPSRSSRMYSAANDVKRLTAE 1041
Query: 1004 SDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGEWYYLDGAGHERGPSSFSE 1063
DS SK +QDS SWKSI+ +NTPKDRLCTVDDLQL LGEWYYLDG+GHE+GPSSFSE
Sbjct: 1042 GDSQSKI--DQDSHSSWKSISFVNTPKDRLCTVDDLQLHLGEWYYLDGSGHEQGPSSFSE 1099
Query: 1064 LQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRNHGEKIMPSGDSSGLPPTQSQ 1123
LQVL QG I+K +SVFRKFD+VWVP+T T +S +T + E + GDSS T S+
Sbjct: 1100 LQVLASQGAIKKWSSVFRKFDRVWVPVTPVTGSSEATFKTQEETVALPGDSST---TLSK 1156
Query: 1124 DAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNREFAAAINEVLDPWINAKQ 1183
S NN NS FH HPQFIGYTRGKLHELVMKS+K+REFAAAIN+VLDPWINAKQ
Sbjct: 1157 SQGAANSENNANSVPFHCQHPQFIGYTRGKLHELVMKSFKSREFAAAINDVLDPWINAKQ 1216
Query: 1184 PKKETE-HVYRKSEGDTRAGKRARLLVRESDGDEETEEELQTIQ-DESTFEDLCGDASFP 1241
PKKE + H+YRKSE D R+ KRARL V SD D +E++++IQ DE+TFE+LCGD+ F
Sbjct: 1217 PKKEVDSHIYRKSEIDGRSSKRARLQVDGSDDDYFIDEDVESIQKDETTFEELCGDSIFH 1276
Query: 1242 GEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRFYKGISRQV 1301
GE S S E G WGLLDGH LA VFH++RSDM+SL FASLTC+HWRAAV FYK ISRQV
Sbjct: 1277 GENSECSDSELGSWGLLDGHMLARVFHYMRSDMRSLVFASLTCKHWRAAVSFYKDISRQV 1336
Query: 1302 DLSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGMLEEILQSFPHLSSIDIRGC 1361
D S +G NCTDS+I LN ++KE++NS+ L+ + +P L+ +
Sbjct: 1337 DFSHLGSNCTDSMIWNILNGYNKERINSMALIYFA---------LSLVYPLLT---LEVA 1384
Query: 1362 GQFGELALKFPNINWVKSQKSRG-AKFNDSRSKIRSLKQITEKSSSAPKSKGLGDDMDDF 1420
LKFP++ W+K+Q SRG +S SKIRSLK I+E++ + K+KGLG D DDF
Sbjct: 1385 ANSRNWPLKFPDVRWIKTQSSRGIGIIEESSSKIRSLKHISERTPTFYKTKGLGSDADDF 1444
Query: 1421 GDLKDYFESVDKRDSANQSFRRSLYQRSKVFDARKSSSILSRDARMRRWSIKKSENGYKR 1480
GDLK+YF+SV+KRDSANQ FRRSLY+RSK+FDAR+SSSI+SRDAR+RRW+IKKSE+GYKR
Sbjct: 1445 GDLKEYFDSVNKRDSANQLFRRSLYKRSKLFDARRSSSIVSRDARVRRWAIKKSESGYKR 1504
Query: 1481 MEEFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGLGSVKDDISRMCRDAIKAK 1540
ME FLAS LK+IM+ NTF+FFVPKVAEIE RMK GYY+ HGL SVK+DISRMCRDAIK
Sbjct: 1505 MEGFLASGLKDIMKENTFDFFVPKVAEIEDRMKSGYYLGHGLRSVKEDISRMCRDAIK-- 1562
Query: 1541 NRGSAGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKDESPAGLYSATSKYKKKLS 1600
+E+MKSWKD+ AGL A+ K KKKL
Sbjct: 1563 ---------------------------------DELMKSWKDDLSAGLGCASMKSKKKL- 1588
Query: 1601 KMVSERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRKSLDSGSETSDDLDGSSE 1660
+ ++K NR+NG++ +NG FDYGEYASDREIR+RLSKLNRKS++SGSETSD LD SSE
Sbjct: 1589 --LIDKKNANRNNGSTFSNGGFDYGEYASDREIRRRLSKLNRKSMESGSETSDGLDKSSE 1646
Query: 1661 DGKSDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLD-FSDDREWGARMTKASLVP 1719
DG+S+S+ST SDT+SD+D R +GR ESRG G F DE LD D+REWGARMTKASLVP
Sbjct: 1647 DGRSESDSTSSDTESDLDIRLEGRIGESRGGGFFMEDEALDSMIDEREWGARMTKASLVP 1706
Query: 1720 PVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEELDMELPEVKDYKPR 1779
PVTRKYEVIDQYVIVADEEDV+RKM V+LP+DYAEKL+AQKNG+E DMELPEVK+YKPR
Sbjct: 1707 PVTRKYEVIDQYVIVADEEDVQRKMCVALPDDYAEKLDAQKNGTE--DMELPEVKEYKPR 1764
Query: 1780 KQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNKQVRHFTGT 1839
KQ GD+V EQEVYGIDPYTHNLLLDSMP+ELDW L +KH+FIED+LLRTLNKQVR FTGT
Sbjct: 1765 KQPGDEVLEQEVYGIDPYTHNLLLDSMPEELDWTLSDKHMFIEDMLLRTLNKQVRRFTGT 1824
Query: 1840 GNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAYRKGLGVVCNK 1899
GNTPM YPL+P+IEEIE A +DCDVRTMK+C+GILKA+DSR DD YVAYRKGLGVVCNK
Sbjct: 1825 GNTPMKYPLKPIIEEIEAAAEEDCDVRTMKICQGILKAIDSRRDDNYVAYRKGLGVVCNK 1884
Query: 1900 EGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYD 1959
EGGF EDDFVVEFLGEVYP WKWFEKQDGIRSLQK+++DPAPEFYNIYLERPKGDADGYD
Sbjct: 1885 EGGFAEDDFVVEFLGEVYPAWKWFEKQDGIRSLQKDSKDPAPEFYNIYLERPKGDADGYD 1944
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
LVVVDAMHKANYASRICHSCRPNCEAKVTAV G YQIGIYTVR I YGEEITFDYNSVTE
Sbjct: 1945 LVVVDAMHKANYASRICHSCRPNCEAKVTAVHGQYQIGIYTVREIQYGEEITFDYNSVTE 2004
Query: 2020 SKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEED 2079
SKEEYEASVCLCGSQVCRGSYLNLTGEGAF+KVLKE H +LDRH LMLEACELNSVSEED
Sbjct: 2005 SKEEYEASVCLCGSQVCRGSYLNLTGEGAFQKVLKEWHAMLDRHHLMLEACELNSVSEED 2064
Query: 2080 YLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICL 2139
YL+LGRAGLGSCLLGGLP+WVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICL
Sbjct: 2065 YLDLGRAGLGSCLLGGLPDWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICL 2124
Query: 2140 EVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLW 2199
EVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMR +FGDPKKAPPP+ERLSPEETVSF+W
Sbjct: 2125 EVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRSLFGDPKKAPPPLERLSPEETVSFIW 2184
Query: 2200 KGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLP 2259
K EGSLV+EL+QCMAPHVE DVLNDLKSKI A DP S++I++EL+KSLLWLRDEVR+LP
Sbjct: 2185 KEEGSLVDELLQCMAPHVEVDVLNDLKSKICARDPLNSDNIRKELQKSLLWLRDEVRSLP 2244
Query: 2260 CTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKLGADLQVYR 2319
CTYKCRHDAAADLIH+YAYT+CF+RV+EY FTSPPV+ISPLDLGPKYADKLGA + YR
Sbjct: 2245 CTYKCRHDAAADLIHVYAYTRCFYRVREYDTFTSPPVHISPLDLGPKYADKLGAGIHEYR 2304
Query: 2320 KTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKT 2379
KTYGENYC+GQLIFWHIQTNA+PDC+LA+ASRGCLSLPDIGSFYAKVQKPS+ RVYGP+T
Sbjct: 2305 KTYGENYCMGQLIFWHIQTNAEPDCSLAKASRGCLSLPDIGSFYAKVQKPSQQRVYGPRT 2364
Query: 2380 VRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSLTGCPLDREMVHWLKHRPAIF 2439
V+ ML RMEK PQ+PWPKD+IW+FKSSP++ GSPMLD+ L+ LDREMVHWLKHRP ++
Sbjct: 2365 VKLMLERMEKYPQKPWPKDQIWSFKSSPKVIGSPMLDAVLSNSSLDREMVHWLKHRPTVY 2424
Query: 2440 QAMWDR 2445
QAMWDR
Sbjct: 2425 QAMWDR 2430
>gi|359485692|ref|XP_002275342.2| PREDICTED: probable histone-lysine N-methyltransferase ATXR3-like
[Vitis vinifera]
Length = 2367
Score = 3131 bits (8117), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1633/2366 (69%), Positives = 1888/2366 (79%), Gaps = 137/2366 (5%)
Query: 129 VENGGAVGEVVTVDKENLKNEEVEEGELGTLKW-----ENGEFVQPEKSQPQSQLQSQSK 183
+ENG + + + EEVEEGELGTLKW ENGEF +PEK +
Sbjct: 90 IENG-------EICNDKIVKEEVEEGELGTLKWPKGEVENGEF-EPEKPR--------RS 133
Query: 184 QIEKGEIIVFSSKCRRGETEKGESGLWR-----GNKDDIEKGEFIPDRWHKEVVKDEYGY 238
IEKGE + S K R+G+ EKGE L R G+KD++EKGEFIPDRW ++V +D YG
Sbjct: 134 DIEKGEFV--SGKWRKGDIEKGELVLERFRKGDGSKDELEKGEFIPDRWQRDVGRDGYGC 191
Query: 239 SKSRR------------YDYKLERTPPSGKYSGEDVYRRKEFDRSGSQHSKSSSR--WES 284
SK RR YD++ ERTPPSGKYSG+DV +RKEF RSGSQ +K SSR WE+
Sbjct: 192 SKMRRHELAKDKGWKFEYDHERERTPPSGKYSGDDVSQRKEFSRSGSQFAKRSSRSRWEA 251
Query: 285 GQERNVRISSKIVDDEGLYKGEHNNGKNHGREYFHGNRFKRHGTDSDSGDRKYYGDYGDF 344
ERNVRISSKIVDDEG YK EHN+ KNHGRE R KR+GTDSD +RK++G+YGD
Sbjct: 252 VPERNVRISSKIVDDEGTYKTEHNSSKNHGRELVSRTRMKRYGTDSDGSERKHHGEYGDH 311
Query: 345 AGLKSRRLSDDYNSRSVHSEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRVIY 404
G K R+LSDD N R+VH EHYSR S+E+ +RNSSSSRISS D++SSRH+E S SS+V++
Sbjct: 312 MGSKIRKLSDDSN-RTVHLEHYSRRSMERSYRNSSSSRISSSDRFSSRHYESSFSSKVVH 370
Query: 405 DRHGRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRD 464
DRHGRSP HS+RSP DR RY+DH RDRSP
Sbjct: 371 DRHGRSPVHSERSPRDRARYHDH--------------RDRSPAY---------------- 400
Query: 465 RSPYAREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNN 524
RS R++SPYDRSRHYDHRNRSP ERSPQDR R+H+R DRTP YLERSPL SRPNN
Sbjct: 401 RSSPRRDRSPYDRSRHYDHRNRSPAPTERSPQDRPRYHERRDRTPTYLERSPLDHSRPNN 460
Query: 525 HREASSKTGASEKRNARYDSKGHEDKLGPKDSNARCSRSSAKESQDKSNVQDLNV--SDE 582
+REAS K GA EKR+ +Y +K E+KL +D+N R SAKESQD+S++ +N SDE
Sbjct: 461 YREASCKGGAGEKRHGQYGNKVQEEKLNQRDANGRDPHFSAKESQDRSSLHTVNGHGSDE 520
Query: 583 KTANCESHKEEQPQSSSVDCKEPPQVDGPPLEELVSMEEDMDICDTPPHVPAVTDSSVGK 642
K+AN + HKEE+PQS V+ +EPPQ+ P EEL SMEEDMDICDTPPHVP V DS+ GK
Sbjct: 521 KSANHQPHKEEKPQSPCVNLEEPPQITVAP-EELASMEEDMDICDTPPHVPLVADSTTGK 579
Query: 643 WFYLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSI 702
WFYLDH GME GPS+LCDLK LVEEGVLVSDH IKH+DS+RW T+ENA SPLV VNFPSI
Sbjct: 580 WFYLDHFGMERGPSKLCDLKKLVEEGVLVSDHLIKHVDSDRWLTIENAASPLVPVNFPSI 639
Query: 703 TSDSVTQLVSPPEASGNLLADTGDTAQST---GEEFPVTL-QSQCCPDGSAAAAESSEDL 758
SD+VTQLVSPPEA GNLLA+ GD +S+ EE P TL QS C + S+ A+E EDL
Sbjct: 640 VSDTVTQLVSPPEAPGNLLAEAGDATESSKLLDEETPATLLQSMSCNNDSSTASEPLEDL 699
Query: 759 HIDVRVGALLDGFTVIPGKEIETLGEILQTTFERVDWQNNGGPTWHGACVGEQKPGDQKV 818
ID RV ALL GFTVIPG+E+ETLG G +WH +GEQ DQ+
Sbjct: 700 QIDERVRALLKGFTVIPGRELETLG----------------GLSWHQPRIGEQ--FDQRT 741
Query: 819 DEL-YISDTKMKEAAELKSGDKDHWVVCF---DSDEWFSGRWSCKGGDWKRNDEAAQDRC 874
DE + KEA++ +S F D +WFS RW+ KGGDWKRNDE+AQDR
Sbjct: 742 DEFSRYPEITSKEASDSRSSTSSDKDYAFAFGDFSDWFSARWASKGGDWKRNDESAQDRL 801
Query: 875 SRKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYACPDERNDGSGG 934
SRKK VLNDG+PLCQMPKSGYEDPRW++KD+LYYPSH R+LDLP WA++ PDER+D +
Sbjct: 802 SRKKLVLNDGYPLCQMPKSGYEDPRWHRKDELYYPSHGRKLDLPIWAFSWPDERSDSNSA 861
Query: 935 SRSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSSRSARSYSSA 994
SR++Q K VRGVKG+MLPVVRINACV SEP +KVR K+R+SSRSAR+YSS
Sbjct: 862 SRASQIK-PVVRGVKGSMLPVVRINACV--------SEPPAKVRGKDRYSSRSARAYSST 912
Query: 995 NDVRRSSAESDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGEWYYLDGAGH 1054
DV+RSSAES SHSK+ + DSQGSWK I INTPKDRLCT +DLQL LG+WYYLDGAGH
Sbjct: 913 TDVKRSSAESASHSKSVSENDSQGSWKCITSINTPKDRLCTAEDLQLHLGDWYYLDGAGH 972
Query: 1055 ERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRNHGEKIMPSGDS 1114
E+GPSSFSELQ LVDQG IQKH+SVFRK DK+WVP+T A + + V+ + + S D
Sbjct: 973 EQGPSSFSELQALVDQGSIQKHSSVFRKNDKIWVPITSAADVPDAAVKIQPQNNVTSTDC 1032
Query: 1115 SGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNREFAAAINEV 1174
SG QS +G NN S + H++HPQFIGYT GKLHELVMKSYK+REFAAAINEV
Sbjct: 1033 SGPSLAQSLAGAIG--GNNTISRSLHSLHPQFIGYTCGKLHELVMKSYKSREFAAAINEV 1090
Query: 1175 LDPWINAKQPKKETEH--VYRKSEGDTR-----------AGKRARLLVRESDGDEETEEE 1221
LDPWIN+KQPKKE + V S D AG R R LV S+ D E EE+
Sbjct: 1091 LDPWINSKQPKKEMANSAVSNSSLHDLNKFRTSGMSHICAGIRGRWLVDGSEDDYEMEED 1150
Query: 1222 LQTIQ-DESTFEDLCGDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFA 1280
+ +Q DESTFEDLC DA+F E+ A + + S WGLLDG+ LA VFHFLR+D+KSLAFA
Sbjct: 1151 VLLVQKDESTFEDLCSDATFYQEDIALAEMGSENWGLLDGNVLARVFHFLRTDVKSLAFA 1210
Query: 1281 SLTCRHWRAAVRFYKGISRQVDLSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITS 1340
+LTC+HWRAAVRFYKG+SRQVDLSSVG CTDS I +N ++KE++ S++L+GCTNIT
Sbjct: 1211 ALTCKHWRAAVRFYKGVSRQVDLSSVGSLCTDSTIWSMINGYNKERITSMILIGCTNITP 1270
Query: 1341 GMLEEILQSFPHLSSIDIRGCGQFGELALKFPNINWVKSQKSRGAKFNDSRSKIRSLKQI 1400
GMLE++L SFP LSSIDIRGC QF ELA KF N+NW+KS+ F +S SKI++LKQI
Sbjct: 1271 GMLEDVLGSFPSLSSIDIRGCSQFWELADKFSNLNWIKSRIRVMKVFEESYSKIKALKQI 1330
Query: 1401 TEKSSSAPKSKGLGDDMDDFGDLKDYFESVDKRDSANQSFRRSLYQRSKVFDARKSSSIL 1460
TE+ S + KG+G +DD +LK+YF+SVD+R+SA+QSFRRS Y+RSK+FDAR+SSSIL
Sbjct: 1331 TERPSVSKPLKGMGSHVDDSSELKEYFDSVDRRESASQSFRRSYYKRSKLFDARRSSSIL 1390
Query: 1461 SRDARMRRWSIKKSENGYKRMEEFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISH 1520
SRDARMRRWSIK SENGYKRMEEFLASSL++IM+ NTF+FFVPKVAEIE RMK GYY H
Sbjct: 1391 SRDARMRRWSIKNSENGYKRMEEFLASSLRDIMKENTFDFFVPKVAEIEDRMKNGYYAGH 1450
Query: 1521 GLGSVKDDISRMCRDAIKAKNRGSAGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSW 1580
GL SVK+DISRMCRDAIKAKNRG +G+MNRI TLFI+LAT LE+G+KSS REEM++ W
Sbjct: 1451 GLSSVKEDISRMCRDAIKAKNRGDSGNMNRIITLFIRLATCLEEGSKSSN-GREEMVRRW 1509
Query: 1581 KDESPAGLYSATSKYKKKLSKMVSERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKL 1640
KDESP+GL S+ SKYKKKL+K+V+ERK+ RSNG S DYGEYASDREIR+RLSKL
Sbjct: 1510 KDESPSGLCSSGSKYKKKLNKIVTERKH--RSNGGS------DYGEYASDREIRRRLSKL 1561
Query: 1641 NRKSLDSGSETSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGL 1700
N+KS+DSGS+TSDDLD SSE G S SEST SDT+SD+DFRS+G ESR G FT DEGL
Sbjct: 1562 NKKSMDSGSDTSDDLDRSSEGGSSGSESTASDTESDLDFRSEGGVAESRVDGYFTADEGL 1621
Query: 1701 -DFSDDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQ 1759
+DDREWGARMTK SLVPPVTRKYEVI+QYVIVADE++V+RKM+VSLPE Y EKL AQ
Sbjct: 1622 YSMTDDREWGARMTKVSLVPPVTRKYEVIEQYVIVADEDEVQRKMKVSLPEHYNEKLTAQ 1681
Query: 1760 KNGSEELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHL 1819
KNG+EE DME+PEVKDYKPRKQLGD+V EQEVYGIDPYTHNLLLDSMP+ELDW LLEKHL
Sbjct: 1682 KNGTEESDMEIPEVKDYKPRKQLGDEVIEQEVYGIDPYTHNLLLDSMPEELDWPLLEKHL 1741
Query: 1820 FIEDVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMD 1879
FIE+VLL TLNKQVRHFTGTGNTPMMY LQPV+E+I+K A ++ D+RT+KMC+GILKAM+
Sbjct: 1742 FIEEVLLCTLNKQVRHFTGTGNTPMMYHLQPVVEDIQKTAEEELDLRTLKMCQGILKAMN 1801
Query: 1880 SRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDP 1939
SRPDD YVAYRKGLGVVCNKEGGF ++DFVVEFLGEVYP WKWFEKQDGIRSLQKN++DP
Sbjct: 1802 SRPDDNYVAYRKGLGVVCNKEGGFSQEDFVVEFLGEVYPAWKWFEKQDGIRSLQKNSKDP 1861
Query: 1940 APEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIY 1999
APEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAV+G YQIGIY
Sbjct: 1862 APEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVEGQYQIGIY 1921
Query: 2000 TVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGL 2059
TVR I YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF+KVLKE HG+
Sbjct: 1922 TVRQIQYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFQKVLKECHGI 1981
Query: 2060 LDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLP 2119
LDR+Q+M EACELN VSEEDY++LGRAGLGSCLLGGLP+W++AY+ARLVRFIN ERTKLP
Sbjct: 1982 LDRYQMMFEACELNMVSEEDYIDLGRAGLGSCLLGGLPDWLIAYAARLVRFINFERTKLP 2041
Query: 2120 EEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDP 2179
EEILRH+L+EKRKYF+DI LEVEKSDAE+QAEGVYNQRLQNLA+TLDKVRYVMRCVFGDP
Sbjct: 2042 EEILRHSLDEKRKYFADISLEVEKSDAELQAEGVYNQRLQNLALTLDKVRYVMRCVFGDP 2101
Query: 2180 KKAPPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSED 2239
KKAPPP+ERLS EE VSFLW GEGSLVEEL+QCMAPH+E+ +L++LK KI+AHDPSGS+D
Sbjct: 2102 KKAPPPLERLSAEEVVSFLWNGEGSLVEELLQCMAPHMEDGMLSELKPKIRAHDPSGSDD 2161
Query: 2240 IQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYIS 2299
I +EL+KSLLWLRDEVRNLPC YKCRHDAAADLIHIYAYTKCFFRV+EYK+ TSPPVYIS
Sbjct: 2162 IHKELQKSLLWLRDEVRNLPCNYKCRHDAAADLIHIYAYTKCFFRVREYKSVTSPPVYIS 2221
Query: 2300 PLDLGPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDI 2359
PLDLGPKY+DKLG+ +Q Y KTYGENYCLGQLI+WH QTNADPDC LARASRGCLSLPDI
Sbjct: 2222 PLDLGPKYSDKLGSGIQEYCKTYGENYCLGQLIYWHNQTNADPDCNLARASRGCLSLPDI 2281
Query: 2360 GSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSL 2419
GSFYAKVQKPSR RVYGP+T+RFML+RMEKQPQR WPKDRIW+FKS P+IFGSPMLD+ L
Sbjct: 2282 GSFYAKVQKPSRQRVYGPRTLRFMLARMEKQPQRQWPKDRIWSFKSCPKIFGSPMLDAVL 2341
Query: 2420 TGCPLDREMVHWLKHRPAIFQAMWDR 2445
PLDREM+HWLK+RPA FQAMWDR
Sbjct: 2342 HNSPLDREMLHWLKNRPATFQAMWDR 2367
>gi|449453666|ref|XP_004144577.1| PREDICTED: probable histone-lysine N-methyltransferase ATXR3-like
[Cucumis sativus]
Length = 2336
Score = 2979 bits (7722), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1573/2491 (63%), Positives = 1859/2491 (74%), Gaps = 201/2491 (8%)
Query: 1 MGDGGVACMPLQQQQQHNSIMERFPISDKTTICVGNSSNNSNKTNNNSISNNNDNKTNND 60
MGDGGVAC+PLQQQQQH IME FPI + +C G
Sbjct: 1 MGDGGVACIPLQQQQQH--IMETFPIPSEKMLCAGK------------------------ 34
Query: 61 SSNNNGSSSSKNNETNKSNVKKNGVSTKTVRKKIVKIKK--VIAVKKKEVQKNSGSSKSN 118
NNG +S KS VK ++ RK+ +K+KK V+A + + SG K
Sbjct: 35 ---NNGFNS-------KSTVK----FSEAERKQKMKLKKEEVVAKDVELGRTESGLDKPG 80
Query: 119 NNGENIDNKNVENGGAVGEVVTVDKENLKNEEVEEGELGTLKW-----ENGEFVQPEKSQ 173
+ + + ENG E +EVEEGE GTLKW ENGEFV PEKS+
Sbjct: 81 KSSREVGH--AENGVDSAE----------KDEVEEGEFGTLKWSRVEVENGEFV-PEKSR 127
Query: 174 PQSQLQSQSKQIEKGEIIVFSSKCRRGETEKGE------------SGLWRGNKDDIEKGE 221
+I+KGE + K RRG+ EKGE + R KD+IE+GE
Sbjct: 128 --------RTEIDKGENV--RGKWRRGDIEKGEIVPEKSRKGEVDNRSRRLAKDEIERGE 177
Query: 222 FIPDRWHK-EVVKDEYGYSKSRRYDYKLER--------TPPSGKYSGEDVYRRKEFDRSG 272
FIPDRW K +++KD++ YS++RRY+ + +R TPP KYS +D RRKE +RSG
Sbjct: 178 FIPDRWEKGDILKDDFRYSRTRRYEPEKDRAWKNVREPTPPLVKYSTDDT-RRKELNRSG 236
Query: 273 SQHSKSSSRWESGQERNVRISSKIVDDEGLYKGEHNNGKNHGREYFHGNRFKRHGTDSDS 332
+QH K++ RWE+GQ+R R SK+++DE ++ ++N+GKN G++Y NR KR+ +SD+
Sbjct: 237 NQHGKTTPRWETGQDRGSRYGSKLMNDEVTHRNDYNDGKNFGKDYSSCNRLKRYSLESDN 296
Query: 333 GDRKYYGDYGDFAGLKSRRLSDDYNSRSVHSEHYSRHSVEKFHRNSSSSRISSL-DKYSS 391
+RK+YGDYGD+AG KSRRLS+D +SR+ HS+HYS +E+ +NSSSS S DK+S+
Sbjct: 297 FERKHYGDYGDYAGSKSRRLSED-SSRTAHSDHYSIRPMERSCKNSSSSSRISSSDKFST 355
Query: 392 RHHEPS-LSSRVIYDRHGRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFD 450
RH+E S SSR Y RH SP HSDRSP ++GRY+DHRDRSP
Sbjct: 356 RHYESSSTSSREAYSRHVHSPGHSDRSPREKGRYHDHRDRSPGH---------------- 399
Query: 451 RSPYSRERSPYNRDRSPYAREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPN 510
R+RSP+ +RSPY R+KSPYDRSRHYDHR RSP + ERSPQDRAR H R DRTPN
Sbjct: 400 -----RDRSPFIGERSPYGRDKSPYDRSRHYDHRYRSPLT-ERSPQDRARCHSRRDRTPN 453
Query: 511 YLERSPLHRSRPNNHREASSKTGASEKRNARYDSKGHEDKLGPKDSNARCSRSSAKESQD 570
YL+RSPL RSR +NHRE S ++ + N S+ EDK PKD + R S AKES D
Sbjct: 454 YLDRSPLDRSRTSNHRETSRRSKGEKHNNG---SRAREDKTTPKDPDGR--ESVAKESYD 508
Query: 571 KSNVQDLNVSDEKTANCESHK-EEQPQSSSVDCKEPPQVDGPPLEELVSMEEDMDICDTP 629
+ N Q+ N S E +C S++ EE+ QS + E VDG P EEL SMEEDMDICDTP
Sbjct: 509 EINEQNTNGSIETVGDCRSYEGEEKSQSPNQTSIELSHVDGVP-EELPSMEEDMDICDTP 567
Query: 630 PHVPAVTDSSVGKWFYLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVEN 689
PH P VTD+S GKWFYLD+ G+E GP+RL DLK LVEEG L+SDHFIKHLDS+RW TVEN
Sbjct: 568 PHAPLVTDTSTGKWFYLDYYGLERGPTRLYDLKALVEEGSLMSDHFIKHLDSDRWVTVEN 627
Query: 690 AVSPLVTVNFPSITSDSVTQLVSPPEASGNLLADTGDTAQ---STGEEFPVTLQSQCCPD 746
AVSPLVT+NFPSI DSVTQLVSPPEA+GN+L D DT + G P + S
Sbjct: 628 AVSPLVTINFPSIVPDSVTQLVSPPEATGNVLVDITDTGKLDIQGGHFEPNQIPSGGSIL 687
>gi|449493199|ref|XP_004159219.1| PREDICTED: probable histone-lysine N-methyltransferase ATXR3-like
[Cucumis sativus]
Length = 2336
Score = 2978 bits (7720), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1573/2491 (63%), Positives = 1858/2491 (74%), Gaps = 201/2491 (8%)
Query: 1 MGDGGVACMPLQQQQQHNSIMERFPISDKTTICVGNSSNNSNKTNNNSISNNNDNKTNND 60
MGDGGVAC+PLQQQQQH IME FPI + +C G
Sbjct: 1 MGDGGVACIPLQQQQQH--IMETFPIPSEKMLCAGK------------------------ 34
Query: 61 SSNNNGSSSSKNNETNKSNVKKNGVSTKTVRKKIVKIKK--VIAVKKKEVQKNSGSSKSN 118
NNG +S KS VK ++ RK+ +K+KK V+A + + SG K
Sbjct: 35 ---NNGFNS-------KSTVK----FSEAERKQKMKLKKEEVVAKDVELGRTESGLDKPG 80
Query: 119 NNGENIDNKNVENGGAVGEVVTVDKENLKNEEVEEGELGTLKW-----ENGEFVQPEKSQ 173
+ + + ENG E +EVEEGE GTLKW ENGEFV PEKS+
Sbjct: 81 KSSREVGH--AENGVDSAE----------KDEVEEGEFGTLKWSRVEVENGEFV-PEKSR 127
Query: 174 PQSQLQSQSKQIEKGEIIVFSSKCRRGETEKGE------------SGLWRGNKDDIEKGE 221
+I+KGE + K RRG+ EKGE + R KD+IE+GE
Sbjct: 128 --------RTEIDKGENV--RGKWRRGDIEKGEIVPEKSRKGEVDNRSRRLAKDEIERGE 177
Query: 222 FIPDRWHK-EVVKDEYGYSKSRRYDYKLER--------TPPSGKYSGEDVYRRKEFDRSG 272
FIPDRW K +++KD++ YS++RRY+ + +R TPP KYS +D RRKE +RSG
Sbjct: 178 FIPDRWEKGDILKDDFRYSRTRRYEPEKDRAWKNVREPTPPLVKYSTDDT-RRKELNRSG 236
Query: 273 SQHSKSSSRWESGQERNVRISSKIVDDEGLYKGEHNNGKNHGREYFHGNRFKRHGTDSDS 332
+QH K++ RWE+GQ+R R SK+++DE ++ ++N+GKN G++Y NR KR+ +SD+
Sbjct: 237 NQHGKTTPRWETGQDRGSRYGSKLMNDEVSHRNDYNDGKNFGKDYSSCNRLKRYSLESDN 296
Query: 333 GDRKYYGDYGDFAGLKSRRLSDDYNSRSVHSEHYSRHSVEKFHRNSSSSRISSL-DKYSS 391
+RK+YGDYGD+AG KSRRLS+D +SR+ HS+HYS +E+ +NSSSS S DK+S+
Sbjct: 297 FERKHYGDYGDYAGSKSRRLSED-SSRTAHSDHYSIRPMERSCKNSSSSSRISSSDKFST 355
Query: 392 RHHEPS-LSSRVIYDRHGRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFD 450
RH+E S SSR Y RH SP HSDRSP ++GRY+DHRDRSP DRS
Sbjct: 356 RHYESSSTSSREAYSRHVHSPGHSDRSPREKGRYHDHRDRSPGHQDRS------------ 403
Query: 451 RSPYSRERSPYNRDRSPYAREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPN 510
P+ +RSPY R+KSPYDRSRHYDHR RSP + ERSPQDRAR H R DRTPN
Sbjct: 404 ---------PFIGERSPYGRDKSPYDRSRHYDHRYRSPLT-ERSPQDRARCHSRRDRTPN 453
Query: 511 YLERSPLHRSRPNNHREASSKTGASEKRNARYDSKGHEDKLGPKDSNARCSRSSAKESQD 570
YL+RSPL RSR +NHRE S ++ + N S+ EDK PKD + R S AKES D
Sbjct: 454 YLDRSPLDRSRTSNHRETSRRSKGEKHNNG---SRAREDKTTPKDPDGR--ESVAKESYD 508
Query: 571 KSNVQDLNVSDEKTANCESHK-EEQPQSSSVDCKEPPQVDGPPLEELVSMEEDMDICDTP 629
+ N Q+ N S E +C S++ EE+ QS + E VDG P EEL SMEEDMDICDTP
Sbjct: 509 EINEQNTNGSIETVGDCRSYEGEEKSQSPNQTSIELSHVDGVP-EELPSMEEDMDICDTP 567
Query: 630 PHVPAVTDSSVGKWFYLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVEN 689
PH P VTD+S GKWFYLD+ G+E GP+RL DLK LVEEG L+SDHFIKHLDS+RW TVEN
Sbjct: 568 PHAPLVTDTSTGKWFYLDYYGLERGPTRLYDLKALVEEGSLMSDHFIKHLDSDRWVTVEN 627
Query: 690 AVSPLVTVNFPSITSDSVTQLVSPPEASGNLLADTGDTAQ---STGEEFPVTLQSQCCPD 746
AVSPLVT+NFPSI DSVTQLVSPPEA+GN+L D DT + G P + S
Sbjct: 628 AVSPLVTINFPSIVPDSVTQLVSPPEATGNVLVDITDTGKLDIQGGHFEPNQIPSGGSIL 687
Query: 747 GSAAAAESSE---DLHIDVRVGALLDGFTVIPGKEIETLGEILQTTFERVDWQNNGGPTW 803
S E+SE DLHID R+GALL+ TVIPGKE+ET+ E+LQ T + W+
Sbjct: 688 PSDEGVEASEPLGDLHIDERIGALLEDITVIPGKELETIAEVLQMTLDGEQWERLAISEG 747
Query: 804 HGACVGEQKPGDQKVDELY-ISD--TKMKEAAELK-SGDKDHWVVCFDSDEWFSGRWSCK 859
VGEQ DQ D++ SD T + ++ S DKD V D +W SG WSCK
Sbjct: 748 FSDHVGEQL--DQSTDDVVEFSDFVTSVDSGSQKNVSSDKDFAV---DDGDWTSGPWSCK 802
Query: 860 GGDWKRNDEAAQDRCSRKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPP 919
GGDW+RNDE+AQ+R RKK VLNDGFPLCQM KSGYEDPRW+QKD+LYYPS S+RLDLPP
Sbjct: 803 GGDWRRNDESAQERNGRKKLVLNDGFPLCQMSKSGYEDPRWHQKDELYYPSQSKRLDLPP 862
Query: 920 WAYACPDERNDGSGGSRSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRA 979
WA+ C D+R+ +RG KGTMLPV+RINACVV DHGSFVSEPR KVR
Sbjct: 863 WAFTCLDDRS------------TLTIRGTKGTMLPVIRINACVVKDHGSFVSEPRMKVRG 910
Query: 980 KERHSSRSARSYSSANDVRRSSAESDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDL 1039
K SRS R +SS D +RS A+ DS SK + S+ S K+ A ++ PKDRLC+ DDL
Sbjct: 911 KGH--SRS-RLFSSNTDGKRS-ADGDSLSKIARDVSSERSLKATAFVSIPKDRLCSYDDL 966
Query: 1040 QLQLGEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSAS 1099
QL G+WYYLDGAGHE GPSSFSELQ+LVD G IQK++SVFRKFD+VWVP+T E S S
Sbjct: 967 QLHFGDWYYLDGAGHECGPSSFSELQLLVDHGIIQKNSSVFRKFDRVWVPVTSFAECSES 1026
Query: 1100 TVRNHGEKIMPSGDSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVM 1159
T R EKI G+++ P + S D G SN FH +HPQF+GYTRGKLHELVM
Sbjct: 1027 TRRIQREKIPLLGETTKNPVSVSGDNSFG--GLATTSNMFHELHPQFVGYTRGKLHELVM 1084
Query: 1160 KSYKNREFAAAINEVLDPWINAKQPKKETEH-VYRKSEGDTRAGKRARLLVRESDGDEET 1218
K YK+REFAAAIN+VLDPWINAKQPKKE E ++ KS+G RA KRAR+LV ESD D E
Sbjct: 1085 KFYKSREFAAAINDVLDPWINAKQPKKEMEKTMHWKSDGSARAAKRARVLVDESDDDYEV 1144
Query: 1219 EEEL--QTIQDESTFEDLCGDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKS 1276
+E+L +DE FEDLCGDA+FPGEES S +ES WG LDGH LA +FHFL+SD+KS
Sbjct: 1145 DEDLLHHRQKDEIAFEDLCGDATFPGEESTSLEVES--WGFLDGHILARIFHFLQSDLKS 1202
Query: 1277 LAFASLTCRHWRAAVRFYKGISRQVDLSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCT 1336
L+FAS+TC+HWRAAVRFYK IS+QVDLSS+GPNCT+S ++ +++EK+N I+LVGCT
Sbjct: 1203 LSFASVTCKHWRAAVRFYKDISKQVDLSSLGPNCTNSTFMNVMSTYNEEKVNFIVLVGCT 1262
Query: 1337 NITSGMLEEILQSFPHLSSIDIRGCGQFGELALKFPNINWVKSQKSRGAKFNDSRSKIRS 1396
NIT +LEEIL FP L+SID+RGC QF +L K+PNINWVK + ++ SK+RS
Sbjct: 1263 NITPVVLEEILGMFPQLASIDVRGCSQFNDLPSKYPNINWVKRSLNATKNNEETHSKMRS 1322
Query: 1397 LKQITEKSSSAPKSKGLGDDMDDFGDLKDYFESVDKRDSANQSFRRSLYQRSKVFDARKS 1456
LK +T+KS S K KGL ++DDFG+LK YFESVDKR+SANQ FRRSLY+RSKVFDARKS
Sbjct: 1323 LKHLTDKSYSLSKIKGLSSNVDDFGELKQYFESVDKRESANQLFRRSLYKRSKVFDARKS 1382
Query: 1457 SSILSRDARMRRWSIKKSENGYKRMEEFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGY 1516
SSI+SRDARMR+WSIKKSE GYKRM EFLASSLKEIMR NTFEFFVPKVAEI+ R++ GY
Sbjct: 1383 SSIVSRDARMRQWSIKKSEVGYKRMVEFLASSLKEIMRDNTFEFFVPKVAEIQDRIRNGY 1442
Query: 1517 YISHGLGSVKDDISRMCRDAIKAKNRGSAGDMNRITTLFIQLATRLEQGAKSSYYEREEM 1576
YI GLGSVK+DISRMCRDAIK +
Sbjct: 1443 YIKRGLGSVKEDISRMCRDAIKY-----------------------------------DE 1467
Query: 1577 MKSWKDESPAGL-YSATSKYKKKLSKMVSERKYMNRSNGTSLANGDFDYGEYASDREIRK 1635
+ SW+D+S L SA SKYK++L K+ +ERKY NRSNG+ NG D+GEYASDREIR+
Sbjct: 1468 VSSWEDDSSLRLGSSAASKYKRRLGKVGTERKYTNRSNGSIFGNGALDHGEYASDREIRR 1527
Query: 1636 RLSKLNRKSLDSGSETSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAGDFT 1695
RLS+LN+K + S SETSD+ D SS DGKS SE++ SDT+SD++F S GR E+RG F
Sbjct: 1528 RLSRLNKKPIGSESETSDEFDRSSGDGKSGSENSASDTESDLEF-SSGRI-ETRGDKCFI 1585
Query: 1696 TDEGLDFS-DDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAE 1754
DE D + DDREWGARMTKASLVPPVTRKYE+ID+YV++ADEE+VRRKMRVSLP+DY E
Sbjct: 1586 LDEAFDSTMDDREWGARMTKASLVPPVTRKYELIDEYVVIADEEEVRRKMRVSLPDDYVE 1645
Query: 1755 KLNAQKNGSEELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNL 1814
KLNAQKNG+EELDMELPEVKDYKPRK++GD+V EQEVYGIDPYTHNLLLDS+P+ELDW+L
Sbjct: 1646 KLNAQKNGAEELDMELPEVKDYKPRKKIGDEVLEQEVYGIDPYTHNLLLDSVPEELDWSL 1705
Query: 1815 LEKHLFIEDVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGI 1874
++KH+FIEDVLLRTLNKQ HFTGTGNTPM YPL PVIEEIEK A +CD+R M++C+GI
Sbjct: 1706 MDKHMFIEDVLLRTLNKQAIHFTGTGNTPMKYPLLPVIEEIEKVAAAECDIRIMRLCQGI 1765
Query: 1875 LKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQK 1934
LKA+ SRP+DKYVAYRKGLGVVCNK+ GFGEDDFVVEFLGEVYPVWKW+EKQDGIRSLQK
Sbjct: 1766 LKAIHSRPEDKYVAYRKGLGVVCNKQEGFGEDDFVVEFLGEVYPVWKWYEKQDGIRSLQK 1825
Query: 1935 NNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHY 1994
N++DPAPEFYNIYLERPKGD DGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHY
Sbjct: 1826 NDKDPAPEFYNIYLERPKGDGDGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHY 1885
Query: 1995 QIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLK 2054
QIGIYT+R I YGEEITFDYNSVTESKEEYEASVCLCGS VCRGSYLNLTG+GAF KVL+
Sbjct: 1886 QIGIYTLRKIQYGEEITFDYNSVTESKEEYEASVCLCGSHVCRGSYLNLTGDGAFLKVLE 1945
Query: 2055 ELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLE 2114
E HG+LD HQLMLEACELNSVSE+DYL+LGRAGLGSCLLGGLP+W+VAYSAR+VRFIN E
Sbjct: 1946 EWHGVLDCHQLMLEACELNSVSEDDYLDLGRAGLGSCLLGGLPDWLVAYSARVVRFINFE 2005
Query: 2115 RTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRC 2174
RTKLP+EIL HNLEEKRKYFSDICL+VEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRC
Sbjct: 2006 RTKLPQEILAHNLEEKRKYFSDICLDVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRC 2065
Query: 2175 VFGDPKKAPPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDP 2234
+FGDPK APPP++RLSPEE+VS++W GEGSLVEEL+ M PHVEED+++DLK KI+AHDP
Sbjct: 2066 IFGDPKNAPPPLKRLSPEESVSYIWNGEGSLVEELLLSMVPHVEEDLISDLKLKIRAHDP 2125
Query: 2235 SGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSP 2294
S+DIQ+EL++SLLWLRDEVRN+PCTYK R+DAAADLIHIYAYTK FFR+QEYKA TSP
Sbjct: 2126 LCSDDIQKELQQSLLWLRDEVRNIPCTYKSRNDAAADLIHIYAYTKNFFRIQEYKAVTSP 2185
Query: 2295 PVYISPLDLGPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCL 2354
PVYIS LDLGPKY DKLG Q Y KTYG NYCLGQLIFWH Q N DPDC+LA ASRGCL
Sbjct: 2186 PVYISSLDLGPKYVDKLGTGFQEYCKTYGPNYCLGQLIFWHNQQNIDPDCSLALASRGCL 2245
Query: 2355 SLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPM 2414
SLP+I SFYA+VQKPSR RVYGPKTV+FMLSRMEKQPQRPWPKDRIW+FK+SP++ GSPM
Sbjct: 2246 SLPEISSFYARVQKPSRQRVYGPKTVKFMLSRMEKQPQRPWPKDRIWSFKNSPKVIGSPM 2305
Query: 2415 LDSSLTGCPLDREMVHWLKHRPAIFQAMWDR 2445
LD L+ PL++++VHWLKHR IFQAMWDR
Sbjct: 2306 LDVVLSNSPLEKDLVHWLKHRTPIFQAMWDR 2336
>gi|356544844|ref|XP_003540857.1| PREDICTED: probable histone-lysine N-methyltransferase ATXR3-like
[Glycine max]
Length = 2331
Score = 2936 bits (7611), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1567/2476 (63%), Positives = 1857/2476 (75%), Gaps = 176/2476 (7%)
Query: 1 MGDGGVACMPLQQQQQHNSIMERFP-ISDKTTICVGNSSNNSNKTNNNSISNNNDNKTNN 59
MGDGGVACMPLQQQ ++ER P + + +C G S N +
Sbjct: 1 MGDGGVACMPLQQQH----VIERLPNAAAEKALCGGKSGNGFD----------------- 39
Query: 60 DSSNNNGSSSSKNNETNKSNVKKNGVSTKTVRKKIVKIKKVIAVKKKEVQKNSGSSKSNN 119
S + K KK+V K E+ + S+ N
Sbjct: 40 --------SGLLKVAGKRKKKVKVKKKVSPAAKKVV---------KSELTVDGVGSRGGN 82
Query: 120 NGENIDNKNVENGGAVGEVVTVDKENLKNEEVEEGELGTL--KWENGEFVQPEKSQPQSQ 177
+VE+G GE+ +EVEEGELGTL + ENGEFV PEK
Sbjct: 83 --------DVESGEVCGEM----------DEVEEGELGTLGCELENGEFV-PEK----PV 119
Query: 178 LQSQSKQIEKGEIIVFSSKCRRGETEKGE--SGLWRGNK-DDIEKGEFIPDRWHK-EVVK 233
+ + +IE GEI+ S + ++GE E+GE SG WR + DDIEKGEFIPDRWH+ ++ +
Sbjct: 120 MLMRRSEIENGEIV--SERWKKGEVERGEFVSGKWRKEEDDDIEKGEFIPDRWHRGDMGR 177
Query: 234 DEYGYSKSRRYD------YKLER--TPPSGK-YSGEDVYRRKEFDRSGSQHSKSSSRWES 284
D+YGY++ RRY +K ER TPPSG+ Y+G++ +R+KE +RSGSQH+KS+ RWES
Sbjct: 178 DDYGYARIRRYQPGRDKGWKNEREHTPPSGRYYTGDEHFRKKELNRSGSQHAKSAPRWES 237
Query: 285 GQERNVRISSKIVDDEGLYKGEHNNGKNHGREYFHGNRFKRHGTDSDSGDRKYYGDYGDF 344
GQERN+RISSKIVD+E K EH+N + H R+Y GNR KRHG +S+ +RK +YGD+
Sbjct: 238 GQERNIRISSKIVDEE---KNEHSNSRTHMRDYSSGNRLKRHGNESEGCERK---NYGDY 291
Query: 345 AGLKSRRLSDDYNSRSVHSEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRVIY 404
AG KSRRLSDD + R +SEHYSR SVE+ +RNSSS + DKYSSRHHE SL +R +Y
Sbjct: 292 AGSKSRRLSDD-SPRLAYSEHYSRLSVERSYRNSSSKSSA--DKYSSRHHE-SLPTRSVY 347
Query: 405 DRHGRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRD 464
D+HGRSP +S+RSPHDR RYYDH+DR+P R SPY+ DRSPY+ ++SP+ RERSPYNR+
Sbjct: 348 DKHGRSPGNSERSPHDRARYYDHKDRTPVR--PSPYSCDRSPYSSEKSPHGRERSPYNRN 405
Query: 465 RSPYAREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNN 524
+DRSRH+DH+ RSP AERSPQDR R HDR D TPN +E+SP R+R N
Sbjct: 406 ----------WDRSRHHDHKMRSPTHAERSPQDRGRHHDRRDPTPNLIEQSPHDRTRSNM 455
Query: 525 HREASSKTGASEKRNARYDSKGHEDKLGPKDSNARCSRSSAKESQDKSNVQDLNVSDEKT 584
HRE +SK +SEK N+++ K +EDK K++N S ESQ + NV + + S E
Sbjct: 456 HREINSKISSSEKHNSQHSCKDYEDKHVQKEANL-----SDVESQGERNVHNASKSFEID 510
Query: 585 ANCESHKEEQPQSSSVDCKEPPQVDGPPLEELVSMEEDMDICDTPPHVPAVTDSSVGKWF 644
E KE+Q + +V CK P ++ P EEL SMEEDMDICDTPPHVP V DSS GKWF
Sbjct: 511 VCSEPEKEQQSSNPTVSCKGSPCLEPLP-EELASMEEDMDICDTPPHVPVVVDSSSGKWF 569
Query: 645 YLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSITS 704
YLD+ G+E GPS+L D+K LV++GVL+SDHFIKH+DS+RW TVENAVSP+ +F S+ S
Sbjct: 570 YLDYNGVEHGPSKLSDIKVLVDDGVLMSDHFIKHIDSDRWLTVENAVSPVTAQSFLSVVS 629
Query: 705 DSVTQLVSPPEASGNLLADTGDTAQSTGEEF-----PVTLQSQCCPDGSAAAAESSEDLH 759
+++TQLV+PPEA GNLLADTGD QS E + P+ LQ C + S A+ EDLH
Sbjct: 630 ETITQLVNPPEAPGNLLADTGDILQSGPENYLGIPTPI-LQPMLCSEDSGIASVLLEDLH 688
Query: 760 IDVRVGALLDGFTVIPGKEIETLGEILQTTFERVDWQN----NGGPTWHGACVGEQKPGD 815
ID RVG LL+G+ VIPG+E E + E LQ FE W+ G P H C+ + D
Sbjct: 689 IDERVGVLLEGYDVIPGREFEAIKESLQMNFEYAKWEGLEECEGFPG-HDTCLRMEH--D 745
Query: 816 QKVDELYISDTKMKEAAELKSGDKDHWVVCFDSDEWFSGRWSCKGGDWKRNDEAAQDRCS 875
++D S + + + SG ++ + + D WFS +WSCKGGDWKRND+ AQDR
Sbjct: 746 SRID----SSREYESQVSIPSGKENGFTLGVPGD-WFSAQWSCKGGDWKRNDD-AQDRYC 799
Query: 876 RKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYACPDERNDGSGGS 935
KK VLNDGF LCQMPKSG EDPRW +KDDLYYPSHSRRLDLP WA+ C DER D S S
Sbjct: 800 NKKLVLNDGFSLCQMPKSGCEDPRWTRKDDLYYPSHSRRLDLPVWAF-CTDERGDCSTLS 858
Query: 936 RSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSSRSARSYSSAN 995
+ Q+KLA+VRGVKG +L VVRINACVV D GS VSE K R+K+R+ SRS S+SS +
Sbjct: 859 KPVQTKLASVRGVKGNILSVVRINACVVKDQGSLVSESCHKTRSKDRYPSRSTWSFSSTS 918
Query: 996 DVRRSSAESDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGEWYYLDGAGHE 1055
+RSS E DS SKA N+Q S GS +S+ IN PKD TV DLQL G WYYLDG+G E
Sbjct: 919 YSKRSSTEEDSQSKASNDQGSLGSCRSMEFINIPKDYCRTVHDLQLHSGNWYYLDGSGRE 978
Query: 1056 RGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETS--ASTVRNHGEKIMPSGD 1113
RGPSSFSELQ LVDQG ++K++SVFRK DK+WVP+T + ET ++R+H E SG+
Sbjct: 979 RGPSSFSELQRLVDQGIVKKYSSVFRKCDKLWVPVTSSAETYDFDVSLRSHQESSTLSGE 1038
Query: 1114 SSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNREFAAAINE 1173
SGLP Q A +GE ++ SN F+++ PQF+GYTRGKLHELVM+SYK+REFAA INE
Sbjct: 1039 CSGLPSKQIHGASVGEHDS--KSNLFNSLQPQFVGYTRGKLHELVMRSYKSREFAAVINE 1096
Query: 1174 VLDPWINAKQPKKETE-HVYRKSEGDTRAGKRARLLVRESDGDEETEE-ELQTIQDESTF 1231
VLDPWIN +QPKKETE Y KSEGD A KRAR+LV S+ D + E+ L +DESTF
Sbjct: 1097 VLDPWINTRQPKKETEKQTYWKSEGDGHASKRARMLVDYSEEDSDFEDGSLPNWKDESTF 1156
Query: 1232 EDLCGDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAV 1291
E LCGDA+F GE S + G GLLDG L+ VFH LRSD+KSLAFAS+TC+HWRA V
Sbjct: 1157 EALCGDATFSGEGSDITDPNVGSLGLLDGCMLSRVFHCLRSDLKSLAFASMTCKHWRATV 1216
Query: 1292 RFYKGISRQVDLSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGMLEEILQSFP 1351
RFYK +SR V+LSS+G +CTDS++ LNA++K+K+ SI+L+GCTNIT+GMLE+IL FP
Sbjct: 1217 RFYKKVSRHVNLSSLGHSCTDSIMWNILNAYEKDKIESIVLIGCTNITAGMLEKILLLFP 1276
Query: 1352 HLSSIDIRGCGQFGELALKFPNINWVKSQKSRGAKFNDSRSKIRSLKQITEKSSSAPKSK 1411
LS++DIRGC QFGEL LKF N+ W+KS S K KIRS+KQ E++SS K
Sbjct: 1277 GLSTVDIRGCSQFGELTLKFTNVKWIKSHSSHITKIASESHKIRSVKQFAEQTSSVSKVS 1336
Query: 1412 GLGDDMDDFGDLKDYFESVDKRDSANQSFRRSLYQRSKVFDARKSSSILSRDARMRRWSI 1471
LG DDFG+LKDYF+SVDKRD+A Q FR++LY+RSK++DAR SSSILSRDAR RRW I
Sbjct: 1337 ILG-IRDDFGELKDYFDSVDKRDTAKQLFRQNLYKRSKLYDARNSSSILSRDARTRRWPI 1395
Query: 1472 KKSENGYKRMEEFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGLGSVKDDISR 1531
KKSE+GYKRME+FLAS L+EIM+ N+ +FF+PKVAEIE +MK GYY HGL VK+DISR
Sbjct: 1396 KKSESGYKRMEQFLASRLREIMKANSCDFFMPKVAEIEAKMKNGYYSGHGLSYVKEDISR 1455
Query: 1532 MCRDAIKAKNRGSAGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKDESPAGLYSA 1591
MCRDAIK + +MK W ++ P+ L S
Sbjct: 1456 MCRDAIK-----------------------------------DALMKLWGNDPPSSLCST 1480
Query: 1592 TSKYKKKL-SKMVSERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRKSLDSGSE 1650
+SKYKK ++++SERK+ N +G D GEYASDREIR+RLSKLN+K +S SE
Sbjct: 1481 SSKYKKSKENRLLSERKHRNNE-----THGGLDNGEYASDREIRRRLSKLNKKYFNSESE 1535
Query: 1651 TSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLDF-SDDREWG 1709
TSDD D SSEDGKSDS++T +DT+SD D S+ R +SRG G FT D+GL F +D+REWG
Sbjct: 1536 TSDDFDRSSEDGKSDSDTTTTDTESDQDVHSESRIGDSRGDGYFTPDDGLHFITDEREWG 1595
Query: 1710 ARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEELDME 1769
ARMTKASLVPPVTRKY+VIDQY+IVADEEDVRRKMRVSLP+DYAEKL+AQKNG EE DME
Sbjct: 1596 ARMTKASLVPPVTRKYDVIDQYIIVADEEDVRRKMRVSLPDDYAEKLSAQKNGIEESDME 1655
Query: 1770 LPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTL 1829
LPEVKDYKPRKQL ++V EQEVYGIDPYTHNLLLDSMP ELDW+L EKHLFIED LLR L
Sbjct: 1656 LPEVKDYKPRKQLENEVVEQEVYGIDPYTHNLLLDSMPKELDWSLQEKHLFIEDKLLRML 1715
Query: 1830 NKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAY 1889
NKQV+HFTGTGNTPM YPLQP IEEIE+ A + CD RT++MC+GILKA+ SR DDKYVAY
Sbjct: 1716 NKQVKHFTGTGNTPMSYPLQPAIEEIERYAEEHCDARTVRMCQGILKAIKSRSDDKYVAY 1775
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
RKGLGVVCNKE GFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKN++DPAPEFYNIYLE
Sbjct: 1776 RKGLGVVCNKEEGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNSDDPAPEFYNIYLE 1835
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIY+VR I +GEE
Sbjct: 1836 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYSVREIQHGEE 1895
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEA 2069
ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKE HG+LDRH LMLEA
Sbjct: 1896 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKEWHGILDRHYLMLEA 1955
Query: 2070 CELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEE 2129
CELNSVSEEDY +LGRAGLGSCLLGGLP+W+V+Y+ARLVRFIN ERTKLPEEIL+HNLEE
Sbjct: 1956 CELNSVSEEDYNDLGRAGLGSCLLGGLPDWLVSYAARLVRFINFERTKLPEEILKHNLEE 2015
Query: 2130 KRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERL 2189
KRKYFSDICLEVE+SDAEVQAEGVYNQRLQNLAVTLDKVRYVMRC+FGDP KAPPP+E+L
Sbjct: 2016 KRKYFSDICLEVERSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCIFGDPLKAPPPLEKL 2075
Query: 2190 SPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLL 2249
SPE VSFLWKGE S VEEL+QC+AP+VEE LNDLKSKI AHDPS S DIQ+ ++KSLL
Sbjct: 2076 SPEAVVSFLWKGEDSFVEELLQCLAPYVEESTLNDLKSKIHAHDPSSSGDIQKAVQKSLL 2135
Query: 2250 WLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYAD 2309
WLRDEVRNLPCTYKCRHDAAADLIHIYAYTK FFR+Q+Y+ TSPPVYISPLDLGPKYAD
Sbjct: 2136 WLRDEVRNLPCTYKCRHDAAADLIHIYAYTKYFFRIQDYQTITSPPVYISPLDLGPKYAD 2195
Query: 2310 KLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKP 2369
KLGA Q YRK YGENYCLGQLIFWH Q+NA+PDCTLAR SRGCLSLPDI SFYAK QKP
Sbjct: 2196 KLGAGFQEYRKIYGENYCLGQLIFWHNQSNAEPDCTLARISRGCLSLPDISSFYAKAQKP 2255
Query: 2370 SRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSLTGCPLDREMV 2429
SRHRVYGP+TVR ML+RMEKQPQ+PWPKDRIW+FK+SP+ FGSPMLD+ + PLDREMV
Sbjct: 2256 SRHRVYGPRTVRSMLARMEKQPQKPWPKDRIWSFKNSPKYFGSPMLDAVINNSPLDREMV 2315
Query: 2430 HWLKHRPAIFQAMWDR 2445
HWLKHRPAIFQA+WD+
Sbjct: 2316 HWLKHRPAIFQALWDQ 2331
>gi|356547055|ref|XP_003541933.1| PREDICTED: probable histone-lysine N-methyltransferase ATXR3-like
[Glycine max]
Length = 2351
Score = 2898 bits (7513), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1568/2474 (63%), Positives = 1852/2474 (74%), Gaps = 152/2474 (6%)
Query: 1 MGDGGVACMPLQQQQQHNSIMERFPISDKTTICVGNSSNNSNKTNNNSISNNNDNKTNND 60
MGDGGVACMPLQ IMER P ++K T+C G S N N + + K
Sbjct: 1 MGDGGVACMPLQY------IMERLPSAEK-TVCRGKSGNGFN-SKLLKFAGKERRKMKPR 52
Query: 61 SSNNNGSSSSKNNETNKSNVKKNGVSTKTVRKKIVKIKKVIAVKKKEVQKNSGSSKSNNN 120
S SK N + SN +NG V+KK+ Q +
Sbjct: 53 KSELGLDRVSKRNSS--SNDVENGGE----------------VEKKQ-QHEKVQKEEVEE 93
Query: 121 GE----NIDNKNVENGGAVGEVVTVDKENLKNEEVEEGELGTLKWENGEFVQPEKSQPQS 176
GE ++ENG V E++ + + EVE GE+ + KW+ E + E
Sbjct: 94 GELGTLKWPRADLENGEFVPEMLPLPPP--RRGEVENGEIVSEKWKARELEKGEVGFG-- 149
Query: 177 QLQSQSKQIEKGEIIVFSSKCRRGETEKGESGLWRGNKDDIEKGEFIPDRWHKEVVKDEY 236
+ + +++E+ E IV R+GE E+GE G WRG KD+IEKGEFIPDRW+ K +Y
Sbjct: 150 --KWRKEEVERRE-IVSEKGGRKGEAERGEYGSWRGGKDEIEKGEFIPDRWY----KGDY 202
Query: 237 GYSKSRRYD------YKLER----TPPSGKYSGEDVYRRKEFDRSGSQHSKSSSRWE-SG 285
S++RR+ +K ER TP SG+Y+G+D +R+KE +RSGSQH KSS RWE G
Sbjct: 203 DNSRNRRHHSGRDKGWKAEREHESTPSSGRYTGDDFFRKKELNRSGSQHVKSSPRWEGGG 262
Query: 286 QERNVRISSKIVDDEGLYKGEHNNGKNHGREYFHGNRFKRHGTDSDSGDRKYYGDYGDFA 345
Q+RNVRISSKIV DE K H+NGK+H R+Y G+R KR G D+DS +RK DY A
Sbjct: 263 QQRNVRISSKIVHDE---KNVHSNGKDHTRDYSSGSRLKRLGNDTDSYERKQSADY---A 316
Query: 346 GLKSRRLSDDYNSRSVHSEHYSRH---SVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRV 402
GLKSRRLSDD + R V+SE+YS H SVE+ +RN++ +++S+ DKYS R+HE SLS+R
Sbjct: 317 GLKSRRLSDD-SCRQVYSENYSCHSPRSVERSYRNNNGTKLSA-DKYSCRNHESSLSTRP 374
Query: 403 IYDRHGRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYN 462
YDRHGRSP HS+RSP DRGRYYDHR+R+P R RSP RDRSPY +++SPY RE+SPY
Sbjct: 375 AYDRHGRSPGHSERSPRDRGRYYDHRERTPVR--RSPCGRDRSPYNWEKSPYGREKSPYM 432
Query: 463 RDRSPYAREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRP 522
R+ +DRSR +DH+ RSP AE+SP DR+R HDR D TPN E SPL R+R
Sbjct: 433 RN----------WDRSRQHDHKLRSPTHAEQSPPDRSRRHDRRDCTPNLAEASPLDRARK 482
Query: 523 NNHREASSKTGASEKRNARYDSKGHEDKLGPKDSNARCSRSSAKESQDKSNVQDLNVSDE 582
N+ E+SSKT +SEK +++ K EDK ++SN S+ ESQ + +VQ S E
Sbjct: 483 NSRHESSSKTLSSEKHDSQNSCKDREDKQIQRESNC-----SSTESQSEKSVQVTIKSVE 537
Query: 583 KTANCESHKEEQPQSSSVDCKEPPQVDGPPLEELVSMEEDMDICDTPPHVPAVTDSSVGK 642
K E KE+Q S +V KE P + PP EEL SMEEDMDICDTPPHVP VTD S GK
Sbjct: 538 KDICSEPVKEQQSCSPTVSHKESPHSEPPP-EELPSMEEDMDICDTPPHVPVVTDLSSGK 596
Query: 643 WFYLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSI 702
W+YLD+ G+E GP++LCD+K LV+EGVL+SDHFIKHLDS+RW TVENA SPLV +F SI
Sbjct: 597 WYYLDYGGVENGPAKLCDIKVLVDEGVLMSDHFIKHLDSDRWLTVENAASPLVRQSFASI 656
Query: 703 TSDSVTQLVSPPEASGNLLADTGDTAQSTGEEFPVTL----QSQCCPDGSAAAAESSEDL 758
SD++TQLV+PPEA GN+L+D D S + L Q + CP+ S E EDL
Sbjct: 657 ASDTITQLVNPPEAPGNILSDAADILHSAPDNHQEMLTPLRQPRVCPNDSVFTFELLEDL 716
Query: 759 HIDVRVGALLDGFTVIPGKEIETLGEILQTTFERVD---WQNNGGPTWHGACVGEQKPGD 815
HI+ RV LL+G+ V PG E+E + E LQ FE ++ G W +CVGE D
Sbjct: 717 HIEERVRNLLEGYDVTPGMELEAIKEALQMNFENAKGEGLEDYEGFLWSVSCVGED--WD 774
Query: 816 QKVDELYISDTKMKEAAELKSGDKDHWVVCFDSDEWFSGRWSCKGGDWKRNDEAAQDRCS 875
D L D+ E+ S DKD+ S +WFS RWSCKGGDWKRND+ AQDR S
Sbjct: 775 SSTD-LASRDS---ESQSSMSCDKDNGHAFGVSSDWFSTRWSCKGGDWKRNDD-AQDRYS 829
Query: 876 RKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYACPDERNDGSGGS 935
RKK VLN+GFPLCQMPKSG EDPRW QKDDLY+PS SR+LDLP WA+ C DER+D S S
Sbjct: 830 RKKLVLNNGFPLCQMPKSGCEDPRWPQKDDLYFPSQSRKLDLPLWAF-CADERDDCSVAS 888
Query: 936 RSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSSRSARSYSSAN 995
+S QSK A+VRGVKG +L VVRINACVV D GS VSE R K R KERH SR AR +SS +
Sbjct: 889 KSVQSKPASVRGVKGNVLSVVRINACVVKDQGSLVSESRHKTRVKERHHSRPARPFSSIS 948
Query: 996 DVRRSSAESDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGEWYYLDGAGHE 1055
D +RSS E D SKA ++ Q S++ + INTPKD CT+ +LQL LG+WYYLDG+G E
Sbjct: 949 DSKRSSTEQD-QSKAVSD---QVSYQILEFINTPKDHRCTIRELQLHLGDWYYLDGSGRE 1004
Query: 1056 RGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRNHGEKIMPSGDSS 1115
RGPSSFSELQ VDQG I+KH+SVFRK DK+WVP+T ATETS ++ + E SG S
Sbjct: 1005 RGPSSFSELQYFVDQGIIKKHSSVFRKSDKLWVPITSATETSDGSLMDQQESSSISGACS 1064
Query: 1116 GLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNREFAAAINEVL 1175
G P Q+Q GE NS+ F+++HPQF+GYTRGKLHELVMKSYK+REFAAAINEVL
Sbjct: 1065 GFPSKQTQVVSCGEP--YTNSSLFNSLHPQFVGYTRGKLHELVMKSYKSREFAAAINEVL 1122
Query: 1176 DPWINAKQPKKETE-HVYRKSEGDTRAGKRARLLVRESDGDEETEE-ELQTIQDESTFED 1233
DPWINA+QPKKE E +Y KSEGD A KRAR+LV +S+ D + E+ ++ +DESTFED
Sbjct: 1123 DPWINARQPKKEIEKQIYWKSEGDAHAAKRARMLVDDSEDDIDLEDGDVNIEKDESTFED 1182
Query: 1234 LCGDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRF 1293
LCGDA+FP EE + + G W LDGH LA VFHFL+SD+KSL FAS+TC+HWRAAVRF
Sbjct: 1183 LCGDATFPEEEIGITDTDLGSWSNLDGHVLARVFHFLKSDLKSLVFASMTCKHWRAAVRF 1242
Query: 1294 YKGISRQVDLSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGMLEEILQSFPHL 1353
YK +S QV+LSS+G +CTD+++ LNA++K+K+NS++L GC NIT+ MLE+IL SFP L
Sbjct: 1243 YKEVSIQVNLSSLGHSCTDTMLWNILNAYEKDKINSVILRGCVNITADMLEKILFSFPGL 1302
Query: 1354 SSIDIRGCGQFGELALKFPNINWVKSQKSRGAKFNDSRSKIRSLKQITEKSSSAPKSKGL 1413
+IDIRGC QFGEL LKF N+ W+KS+ S K + KIRSLK ITE +SS KS L
Sbjct: 1303 FTIDIRGCNQFGELTLKFANVKWIKSRSSHLTKIAEESHKIRSLKHITELTSSVSKSISL 1362
Query: 1414 GDDMDDFGDLKDYFESVDKRDSANQSFRRSLYQRSKVFDARKSSSILSRDARMRRWSIKK 1473
G +DDFG LKDYF+SVDKRD+ Q FR++LY+RSK++DARKSSSILSRDAR RRW+IKK
Sbjct: 1363 G--IDDFGQLKDYFDSVDKRDN-KQLFRQNLYKRSKLYDARKSSSILSRDARTRRWAIKK 1419
Query: 1474 SENGYKRMEEFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGLGSVKDDISRMC 1533
SE+GYKRMEEFLA L+EIM+ N+ +FFV KVAEIE +MK GYY S GL SVK+DISRMC
Sbjct: 1420 SESGYKRMEEFLALRLREIMKTNSCDFFVLKVAEIEAKMKSGYYSSRGLNSVKEDISRMC 1479
Query: 1534 RDAIKAKNRGSAGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKDESPAGLYSATS 1593
RDAIK ++KSW ++ PAG S S
Sbjct: 1480 RDAIK-----------------------------------NALLKSWDNDLPAGSCSTFS 1504
Query: 1594 KYKKKLSKMVSERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRKSLDSGSETS- 1652
KYKK +++V+ERKY RSNGT +G D EY SDREIR+RLSKLN+KS+DS SETS
Sbjct: 1505 KYKK--NRLVNERKY--RSNGT---HGGLDNVEYTSDREIRRRLSKLNKKSMDSESETSD 1557
Query: 1653 DDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLDF-SDDREWGAR 1711
DDLD S E+GKSD+++T SD++SD + + +RESRG G FT++E L F +DDREWGAR
Sbjct: 1558 DDLDKSYEEGKSDTDTTTSDSESDREVHPESLSRESRGDGYFTSEEELGFITDDREWGAR 1617
Query: 1712 MTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEELDMELP 1771
MTKASLVPPVTRKYEVIDQY IVADEEDVRRKMRVSLP+DYAEKL+AQKNG+EE DMELP
Sbjct: 1618 MTKASLVPPVTRKYEVIDQYCIVADEEDVRRKMRVSLPDDYAEKLSAQKNGTEESDMELP 1677
Query: 1772 EVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNK 1831
EVKDYKPRKQLG++V EQEVYGIDPYTHNLLLDSMP+ELDW+L EKHLFIED LLRTLNK
Sbjct: 1678 EVKDYKPRKQLGNEVIEQEVYGIDPYTHNLLLDSMPEELDWSLQEKHLFIEDTLLRTLNK 1737
Query: 1832 QVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAYRK 1891
QVR+FTG G+TPM Y L+ VIE+I+K A +DCD R +KMC+GILKA+DSRPDDKYVAYRK
Sbjct: 1738 QVRNFTGNGSTPMSYSLRSVIEDIKKFAEEDCDARMVKMCQGILKAIDSRPDDKYVAYRK 1797
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
GLGVVCNKE GF EDDFVVEFLGEVYPVWKWFEKQDGIRSLQK+++DPAPEFYNIYLERP
Sbjct: 1798 GLGVVCNKEEGFAEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKDSKDPAPEFYNIYLERP 1857
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
KGDADGYDLVVVDAMH ANYASRICHSCRPNCEAKVTAVDG YQIGIY++R I +GEEIT
Sbjct: 1858 KGDADGYDLVVVDAMHMANYASRICHSCRPNCEAKVTAVDGQYQIGIYSLREIQHGEEIT 1917
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACE 2071
FDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF+KVLK+ HG+LDRH LMLEACE
Sbjct: 1918 FDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFQKVLKDSHGILDRHCLMLEACE 1977
Query: 2072 LNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKR 2131
LNSVSEEDY +LGRAGLGSCLLGGLP+W+VAY+ARLVRFIN ERTKLPEEIL+HNLEEKR
Sbjct: 1978 LNSVSEEDYNDLGRAGLGSCLLGGLPDWLVAYAARLVRFINFERTKLPEEILKHNLEEKR 2037
Query: 2132 KYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSP 2191
KYFSDI LEVE+SDAEVQAEGVYNQRLQNLAVTLDKVRYVMRC+FGDP+KAPPP+E+LSP
Sbjct: 2038 KYFSDIILEVERSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCIFGDPRKAPPPLEKLSP 2097
Query: 2192 EETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLWL 2251
E TVSFLWKGEGS VEEL+QC+ PHVEE +LNDLK KI AHDPS S DIQ+ELRKSLLWL
Sbjct: 2098 EATVSFLWKGEGSFVEELVQCITPHVEEGILNDLKFKIHAHDPSNSGDIQKELRKSLLWL 2157
Query: 2252 RDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKL 2311
RDEVRNLPCTYKCRHDAAADLIHIYAYTK FFR++ Y+ TSPPVYISPLDLGPKY +KL
Sbjct: 2158 RDEVRNLPCTYKCRHDAAADLIHIYAYTKYFFRIRNYQTITSPPVYISPLDLGPKYTNKL 2217
Query: 2312 GADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSR 2371
GA+ Q YRK YGENYCLGQLIFWH Q+NADPD +LARASRGCLSLPD SFYAK QKPSR
Sbjct: 2218 GAEFQEYRKIYGENYCLGQLIFWHNQSNADPDRSLARASRGCLSLPDTNSFYAKAQKPSR 2277
Query: 2372 HRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSLTGCPLDREMVHW 2431
H VYGP+TVR ML+RMEK PQR WPKDRIW+FKSSP+ FGSPMLD+ + PLDREMVHW
Sbjct: 2278 HCVYGPRTVRSMLARMEKLPQRSWPKDRIWSFKSSPKFFGSPMLDAVVNNSPLDREMVHW 2337
Query: 2432 LKHRPAIFQAMWDR 2445
KHRPAIFQAMWDR
Sbjct: 2338 FKHRPAIFQAMWDR 2351
>gi|297739332|emb|CBI28983.3| unnamed protein product [Vitis vinifera]
Length = 2199
Score = 2776 bits (7197), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1493/2354 (63%), Positives = 1747/2354 (74%), Gaps = 281/2354 (11%)
Query: 129 VENGGAVGEVVTVDKENLKNEEVEEGELGTLKWENGEFVQPEKSQPQSQLQSQSKQIEKG 188
+ENG + + EEVEEGELGTLKW GE V+ + +P+ +S S EKG
Sbjct: 90 IENGEICNDKIV-------KEEVEEGELGTLKWPKGE-VENGEFEPEKPRRSDS---EKG 138
Query: 189 EIIVFSSKCRRGETEKGES------------GLWRGNKDDIEKGEFIPDRWHKEVVKDEY 236
EI+ + K R+GE EKGE G WRG+KD++EKGEFIPDRW ++V +D Y
Sbjct: 139 EIV--AEKSRKGEVEKGEFRFRKGDGEKADFGSWRGSKDELEKGEFIPDRWQRDVGRDGY 196
Query: 237 GYSKSRR------------YDYKLERTPPSGKYSGEDVYRRKEFDRSGSQHSKSSSRWES 284
G SK RR YD++ ERTPPSGK
Sbjct: 197 GCSKMRRHELAKDKGWKFEYDHERERTPPSGK---------------------------- 228
Query: 285 GQERNVRISSKIVDDEGLYKGEHNNGKNHGREYFHGNRFKRHGTDSDSGDRKYYGDYGDF 344
NVRISSKIVDDEG YK EHN+ KNHGRE R KR+GTDSD +RK++G+YGD
Sbjct: 229 ----NVRISSKIVDDEGTYKTEHNSSKNHGRELVSRTRMKRYGTDSDGSERKHHGEYGDH 284
Query: 345 AGLKSRRLSDDYNSRSVHSEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRVIY 404
G K R+LSDD N R+VH EHYSR S+E+ +RNSSSSRISS D++SSRH+E S SS+V++
Sbjct: 285 MGSKIRKLSDDSN-RTVHLEHYSRRSMERSYRNSSSSRISSSDRFSSRHYESSFSSKVVH 343
Query: 405 DRHGRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRD 464
DRHGRSP HS+RSP DR RY+DH RDRSP
Sbjct: 344 DRHGRSPVHSERSPRDRARYHDH--------------RDRSPAY---------------- 373
Query: 465 RSPYAREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNN 524
RS R++SPYDRSRHYDHRNRSP ERSPQDR R+H+R DRTP YLERSPL SRPNN
Sbjct: 374 RSSPRRDRSPYDRSRHYDHRNRSPAPTERSPQDRPRYHERRDRTPTYLERSPLDHSRPNN 433
Query: 525 HREASSKTGASEKRNARYDSKGHEDKLGPKDSNARCSRSSAKESQDKSNVQDLNV--SDE 582
+REAS K GA EKR+ +Y +K E+KL +D+N R SAKESQD+S++ +N SDE
Sbjct: 434 YREASCKGGAGEKRHGQYGNKVQEEKLNQRDANGRDPHFSAKESQDRSSLHTVNGHGSDE 493
Query: 583 KTANCESHKEEQPQSSSVDCKEPPQVDGPPLEELVSMEEDMDICDTPPHVPAVTDSSVGK 642
K+AN + HKEE+PQS V+ +EPPQ+ P EEL SMEEDMDI
Sbjct: 494 KSANHQPHKEEKPQSPCVNLEEPPQITVAP-EELASMEEDMDI----------------- 535
Query: 643 WFYLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSI 702
LVSDH IKH+D
Sbjct: 536 --------------------------FLVSDHLIKHVD---------------------- 547
Query: 703 TSDSVTQLVSPPEASGNLLADTGDTAQST---GEEFPVTL-QSQCCPDGSAAAAESSEDL 758
+A GNLLA+ GD +S+ EE P TL QS C + S+ A+E EDL
Sbjct: 548 ------------KAPGNLLAEAGDATESSKLLDEETPATLLQSMSCNNDSSTASEPLEDL 595
Query: 759 HIDVRVGALLDGFTVIPGKEIETLGEILQTTFERVDWQNNG--GPTWHGACVGEQKPGDQ 816
ID RV ALL GFTVIPG+E+ETLGE+LQ +FE W+ G G +WH +GEQ DQ
Sbjct: 596 QIDERVRALLKGFTVIPGRELETLGEVLQVSFEHAQWEKLGAEGLSWHQPRIGEQ--FDQ 653
Query: 817 KVDEL-YISDTKMKEAAELKSGDKDHWVVCF---DSDEWFSGRWSCKGGDWKRNDEAAQD 872
+ DE + KEA++ +S F D +WFS RW+ KGGDWKRNDE+AQD
Sbjct: 654 RTDEFSRYPEITSKEASDSRSSTSSDKDYAFAFGDFSDWFSARWASKGGDWKRNDESAQD 713
Query: 873 RCSRKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYACPDERNDGS 932
R SRKK VLNDG+PLCQMPKSGYEDPRW++KD+LYYPSH R+LDLP WA++ PDER+D +
Sbjct: 714 RLSRKKLVLNDGYPLCQMPKSGYEDPRWHRKDELYYPSHGRKLDLPIWAFSWPDERSDSN 773
Query: 933 GGSRSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSSRSARSYS 992
SR++Q K VRGVKG+MLPVVRINACV SEP +KVR K+R+SSRSAR+YS
Sbjct: 774 SASRASQIK-PVVRGVKGSMLPVVRINACV--------SEPPAKVRGKDRYSSRSARAYS 824
Query: 993 SANDVRRSSAESDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGEWYYLDGA 1052
S DV+RSSAES SHSK+ + DSQGSWK I INTPKDRLCT +DLQL LG+WYYLDGA
Sbjct: 825 STTDVKRSSAESASHSKSVSENDSQGSWKCITSINTPKDRLCTAEDLQLHLGDWYYLDGA 884
Query: 1053 GHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRNHGEKIMPSG 1112
GHE+GPSSFSELQ LVDQG IQKH+SVFRK DK+W +T + + N
Sbjct: 885 GHEQGPSSFSELQALVDQGSIQKHSSVFRKNDKIWNNVTSTDYHCTAYILN--------- 935
Query: 1113 DSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNREFAAAIN 1172
S + P + +N+ V++++ H ++ RG+ LV S + E +
Sbjct: 936 --SLVIPKEM-------ANSAVSNSSLHDLNKFRTSGIRGRW--LVDGSEDDYEMEEDV- 983
Query: 1173 EVLDPWINAKQPKKETEHVYRKSEGDTRAGKRARLLVRESDGDEETEEELQTIQDESTFE 1232
LLV++ DE T E+L
Sbjct: 984 ----------------------------------LLVQK---DESTFEDL---------- 996
Query: 1233 DLCGDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVR 1292
C DA+F E+ A + + S WGLLDG+ LA VFHFLR+D+KSLAFA+LTC+HWRAAVR
Sbjct: 997 --CSDATFYQEDIALAEMGSENWGLLDGNVLARVFHFLRTDVKSLAFAALTCKHWRAAVR 1054
Query: 1293 FYKGISRQVDLSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGMLEEILQSFPH 1352
FYKG+SRQVDLSSVG CTDS I +N ++KE++ S++L+GCTNIT GMLE++L SFP
Sbjct: 1055 FYKGVSRQVDLSSVGSLCTDSTIWSMINGYNKERITSMILIGCTNITPGMLEDVLGSFPS 1114
Query: 1353 LSSIDIRGCGQFGELALKFPNINWVKSQKSRGAKFNDSRSKIRSLKQITEKSSSAPKSKG 1412
LSSIDIRGC QF ELA KF N+NW+KS+ F +S SKI++LKQITE+ S + KG
Sbjct: 1115 LSSIDIRGCSQFWELADKFSNLNWIKSRIRVMKVFEESYSKIKALKQITERPSVSKPLKG 1174
Query: 1413 LGDDMDDFGDLKDYFESVDKRDSANQSFRRSLYQRSKVFDARKSSSILSRDARMRRWSIK 1472
+G +DD +LK+YF+SVD+R+SA+QSFRRS Y+RSK+FDAR+SSSILSRDARMRRWSIK
Sbjct: 1175 MGSHVDDSSELKEYFDSVDRRESASQSFRRSYYKRSKLFDARRSSSILSRDARMRRWSIK 1234
Query: 1473 KSENGYKRMEEFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGLGSVKDDISRM 1532
SENGYKRMEEFLASSL++IM+ NTF+FFVPKVAEIE RMK GYY HGL SVK+DISRM
Sbjct: 1235 NSENGYKRMEEFLASSLRDIMKENTFDFFVPKVAEIEDRMKNGYYAGHGLSSVKEDISRM 1294
Query: 1533 CRDAIKAKNRGSAGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKDESPAGLYSAT 1592
CRDAIKAKNRG +G+MNRI TLFI+LAT LE+G+KSS REEM++ WKDESP+GL S+
Sbjct: 1295 CRDAIKAKNRGDSGNMNRIITLFIRLATCLEEGSKSS-NGREEMVRRWKDESPSGLCSSG 1353
Query: 1593 SKYKKKLSKMVSERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRKSLDSGSETS 1652
SKYKKKL+K+V+ERK+ RSNG S DYGEYASDREIR+RLSKLN+KS+DSGS+TS
Sbjct: 1354 SKYKKKLNKIVTERKH--RSNGGS------DYGEYASDREIRRRLSKLNKKSMDSGSDTS 1405
Query: 1653 DDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGL-DFSDDREWGAR 1711
DDLD SSE G S SEST SDT+SD+DFRS+G ESR G FT DEGL +DDREWGAR
Sbjct: 1406 DDLDRSSEGGSSGSESTASDTESDLDFRSEGGVAESRVDGYFTADEGLYSMTDDREWGAR 1465
Query: 1712 MTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEELDMELP 1771
MTK SLVPPVTRKYEVI+QYVIVADE++V+RKM+VSLPE Y EKL AQKNG+EE DME+P
Sbjct: 1466 MTKVSLVPPVTRKYEVIEQYVIVADEDEVQRKMKVSLPEHYNEKLTAQKNGTEESDMEIP 1525
Query: 1772 EVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNK 1831
EVKDYKPRKQLGD+V EQEVYGIDPYTHNLLLDSMP+ELDW LLEKHLFIE+VLL TLNK
Sbjct: 1526 EVKDYKPRKQLGDEVIEQEVYGIDPYTHNLLLDSMPEELDWPLLEKHLFIEEVLLCTLNK 1585
Query: 1832 QVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAYRK 1891
QVRHFTGTGNTPMMY LQPV+E+I+K A ++ D+RT+KMC+GILKAM+SRPDD YVAYRK
Sbjct: 1586 QVRHFTGTGNTPMMYHLQPVVEDIQKTAEEELDLRTLKMCQGILKAMNSRPDDNYVAYRK 1645
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
GLGVVCNKEGGF ++DFVVEFLGEVYP WKWFEKQDGIRSLQKN++DPAPEFYNIYLERP
Sbjct: 1646 GLGVVCNKEGGFSQEDFVVEFLGEVYPAWKWFEKQDGIRSLQKNSKDPAPEFYNIYLERP 1705
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAV+G YQIGIYTVR I YGEEIT
Sbjct: 1706 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVEGQYQIGIYTVRQIQYGEEIT 1765
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACE 2071
FDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF+KVLKE HG+LDR+Q+M EACE
Sbjct: 1766 FDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFQKVLKECHGILDRYQMMFEACE 1825
Query: 2072 LNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKR 2131
LN VSEEDY++LGRAGLGSCLLGGLP+W++AY+ARLVRFIN ERTKLPEEILRH+L+EKR
Sbjct: 1826 LNMVSEEDYIDLGRAGLGSCLLGGLPDWLIAYAARLVRFINFERTKLPEEILRHSLDEKR 1885
Query: 2132 KYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSP 2191
KYF+DI LEVEKSDAE+QAEGVYNQRLQNLA+TLDKVRYVMRCVFGDPKKAPPP+ERLS
Sbjct: 1886 KYFADISLEVEKSDAELQAEGVYNQRLQNLALTLDKVRYVMRCVFGDPKKAPPPLERLSA 1945
Query: 2192 EETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLWL 2251
EE VSFLW GEGSLVEEL+QCMAPH+E+ +L++LK KI+AHDPSGS+DI +EL+KSLLWL
Sbjct: 1946 EEVVSFLWNGEGSLVEELLQCMAPHMEDGMLSELKPKIRAHDPSGSDDIHKELQKSLLWL 2005
Query: 2252 RDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKL 2311
RDEVRNLPC YKCRHDAAADLIHIYAYTKCFFRV+EYK+ TSPPVYISPLDLGPKY+DKL
Sbjct: 2006 RDEVRNLPCNYKCRHDAAADLIHIYAYTKCFFRVREYKSVTSPPVYISPLDLGPKYSDKL 2065
Query: 2312 GADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSR 2371
G+ +Q Y KTYGENYCLGQLI+WH QTNADPDC LARASRGCLSLPDIGSFYAKVQKPSR
Sbjct: 2066 GSGIQEYCKTYGENYCLGQLIYWHNQTNADPDCNLARASRGCLSLPDIGSFYAKVQKPSR 2125
Query: 2372 HRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSLTGCPLDREMVHW 2431
RVYGP+T+RFML+RMEKQPQR WPKDRIW+FKS P+IFGSPMLD+ L PLDREM+HW
Sbjct: 2126 QRVYGPRTLRFMLARMEKQPQRQWPKDRIWSFKSCPKIFGSPMLDAVLHNSPLDREMLHW 2185
Query: 2432 LKHRPAIFQAMWDR 2445
LK+RPA FQAMWDR
Sbjct: 2186 LKNRPATFQAMWDR 2199
>gi|297804746|ref|XP_002870257.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297316093|gb|EFH46516.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 2364
Score = 2711 bits (7026), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1450/2543 (57%), Positives = 1794/2543 (70%), Gaps = 279/2543 (10%)
Query: 1 MGDGGVACMPLQQQQQHNSIMERFPISDKTTICVGNSSNNSNKTNNNSISNNNDNKTNND 60
M DGGVACMPL +IME+ PI +KTT+C GN S + T+N
Sbjct: 1 MSDGGVACMPLL------NIMEKLPIVEKTTLCGGNESKSVGTTDNG------------- 41
Query: 61 SSNNNGSSSSKNNETNKSNVKKNGVSTKTVRKKIVK-IKKVIAVKKKEVQKNS------- 112
+ S SSK E+ ++ K S +K+IVK I+KV+ + K+ QK +
Sbjct: 42 ----HTSISSKVPESQPAD-NKPSASQPVKKKRIVKVIRKVVKRRPKQPQKQAEEQLKDQ 96
Query: 113 ---------------------GSSKSNNNGENIDNKNVENGGAVGEVVTVDKENLKNEEV 151
+ KS G +K VENGG G +EV
Sbjct: 97 PPSQVVQLPAESQLQLKEQEEQNKKSEVKGGTSGDKEVENGGDSG----------FKDEV 146
Query: 152 EEGELGTLK----WENGEFVQPEKSQPQSQLQSQSKQIEKGEIIV--------------- 192
EEGELGTLK ENGE + P KS Q +IEKGEI+
Sbjct: 147 EEGELGTLKPPGDLENGE-ISPVKSL-------QKSEIEKGEIVGESWKKDEPTKGEFSY 198
Query: 193 ------------FSS-KCRRGETEKGESGLWRGNKDDIEKGEFIPDRWHK-EVVKDEYGY 238
FS+ K +G E E WR + D+IEKGEFIPDRW K + VKD++ Y
Sbjct: 199 LKYHKGNVERRDFSADKNWKGGKEDREFRSWRDSGDEIEKGEFIPDRWQKMDAVKDDHSY 258
Query: 239 SKSRR----------YDYKLERTPPSGKYSGEDVYRRKEFDRSGSQHSKSSSRWESGQER 288
+SRR Y+Y+ ERTPP G+++ ED+Y ++EF SG +R
Sbjct: 259 IRSRRNGVDREKTWKYEYEYERTPPGGRFANEDIYHQREF--------------RSGHDR 304
Query: 289 NVRISSKIVDDEGLYKGEHNNGKNHGREYFHG-NRFKRHGTDSDSGDRKY-YGDYGDFAG 346
RISSKIV +E L+K E+NN N +EY NR KRHG + DS +RK+ Y DYGD+
Sbjct: 305 TTRISSKIVIEENLHKNEYNNPSNFVKEYSSTVNRLKRHGAEPDSVERKHSYADYGDYGS 364
Query: 347 LKSRRLSDDYNSRSVHSEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRVIYDR 406
K R+LSDD SRS+HS+HYS+HS E+ +R+S SS+ SSL+KY +H + S ++ D+
Sbjct: 365 SKCRKLSDDC-SRSLHSDHYSQHSAERLYRDSYSSKNSSLEKYHRKHQDASFPAKAFSDK 423
Query: 407 HGRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRS 466
HG SP+ SD SPHDR RY+++R DRSPY+RERSPY ++S
Sbjct: 424 HGHSPARSDWSPHDRSRYHENR---------------------DRSPYARERSPYIFEKS 462
Query: 467 PYAREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNNHR 526
+AR++SP DRSRH+D+R RSP +E SP DR+R DR D PNY+E + R+R N HR
Sbjct: 463 SHARKRSPRDRSRHHDYR-RSPSYSEWSPHDRSRPSDRRDSIPNYMEDTQNDRNRRNGHR 521
Query: 527 EASSKTGASEKRNARYDSKGHEDKLGPKDSNARCSRSSAKESQDKSNVQDLNVSDEKTAN 586
E S K+G E+R+++ ++ E+K KDSN + S SS+KE Q K+ + + N+ EK +
Sbjct: 522 EISRKSGVRERRDSQTGTE-LENKHRYKDSNGKESTSSSKELQGKNILYNNNLVVEKNSV 580
Query: 587 CESHKEEQPQSSSVDCKEPPQVDGPPLEELVSMEEDMDICDTPPHVPAVTDSSVGKWFYL 646
C+S K P ++ KEP QV P EEL SME DMDICDTPPH P DSS+GKWFYL
Sbjct: 581 CDSSKIPIPCATG---KEPVQVGEAPPEELPSMEVDMDICDTPPHEPMAADSSLGKWFYL 637
Query: 647 DHCGMECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSITSDS 706
D+ G E GP+RL +LK L+E+G+L SDH IKH D+NRW
Sbjct: 638 DYYGTEHGPARLSELKALMEQGILFSDHMIKHSDNNRW---------------------- 675
Query: 707 VTQLVSPPEASGNLLADTGDTA------QSTGEEFPVTLQSQCCPDGSAAAAESSEDLHI 760
L +PPEA GNLL D DT Q G+ P ++ PD + E ED I
Sbjct: 676 ---LANPPEAPGNLLEDITDTTEAVCIEQEAGDSLPESVSVMTIPDANEFLVEHLEDFQI 732
Query: 761 DVRVGALLDGFTVIPGKEIETLGEILQTT--------------FERVDWQNNGGPTWHGA 806
D R+ LL+G+T+ PG+E E+LGE L T FE V G + G
Sbjct: 733 DKRIANLLEGYTIAPGREFESLGEALNVTVEFKETRRCVTSEVFEVVQIWAFGMKSI-GK 791
Query: 807 CVGEQKPGDQKVDELYISDTKMKEAAELKSGDKDHWVVCFDSDE---WFSGRWSCKGGDW 863
C+ K ++ L S+ + E KS D V +SDE WFSGRWSCKGGDW
Sbjct: 792 CLMFVKDDEEL---LGCSEPIKRAIEEFKSDD----VYGSESDEIGSWFSGRWSCKGGDW 844
Query: 864 KRNDEAAQDRCSRKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYA 923
R DEA+QDR +KK VLNDGFPLC M KSGYEDPRW+ KDD+YYP S RL+LP WA++
Sbjct: 845 IRQDEASQDRYYKKKIVLNDGFPLCLMQKSGYEDPRWHHKDDMYYPLSSSRLELPLWAFS 904
Query: 924 CPDERNDGSGGSRSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERH 983
DERN RGVK +L VVR+N+ VVND V +PR+KVR KER
Sbjct: 905 GVDERNQA--------------RGVKANLLSVVRLNSLVVNDQVPPVPDPRAKVRGKERC 950
Query: 984 SSRSARSYSSANDVRRSSAESDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQL 1043
SR AR +++D +R S ES S S A N QDS G ++ A +NTP+DRLCTVDDLQL +
Sbjct: 951 PSRPARPSPASSDSKRESVESHSQSTASNGQDSHGLLRTDASVNTPRDRLCTVDDLQLHI 1010
Query: 1044 GEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRN 1103
G+W+Y DGAG E+GP FSELQ+LV++G I+ H+SVFRK DK+WVP+T T + + +
Sbjct: 1011 GDWFYTDGAGQEQGPLPFSELQILVEKGFIKSHSSVFRKSDKIWVPVTSITNSPETIAKL 1070
Query: 1104 HGEKIMPSGDSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYK 1163
G+ D L +++QD L S + + N+FH +HPQF+GY RGKLH+LVMK++K
Sbjct: 1071 RGKNPALPSDCQDLVVSETQD--LKRSEMDTSLNSFHGVHPQFLGYFRGKLHQLVMKTFK 1128
Query: 1164 NREFAAAINEVLDPWINAKQPKKETE-HVYRKSEGDTRAGKRARLLVRESDGDEETEEEL 1222
+R+F+AAIN+VLD WI+A+QPKKE+E ++Y+ SE D+ KRARL+ ES D E E+
Sbjct: 1129 SRDFSAAINDVLDSWIHARQPKKESEKYMYQSSELDSCFTKRARLMAGESGEDSEMEDTQ 1188
Query: 1223 QTIQDESTFEDLCGDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASL 1282
+DE TFEDLCGDA+F E S S+ WGLLDGH LA VFH LR D+KSLAFAS+
Sbjct: 1189 MFQKDELTFEDLCGDATFQIEGSGSAGTVGIYWGLLDGHALARVFHLLRYDVKSLAFASM 1248
Query: 1283 TCRHWRAAVRFYKGISRQVDLSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGM 1342
TCRHW+A + YK ISRQVDLSS+GPNCTDS +R +N ++KEK++SI+LVGCTN+T+ M
Sbjct: 1249 TCRHWKATINSYKEISRQVDLSSLGPNCTDSRLRSIMNTYNKEKIDSIILVGCTNVTASM 1308
Query: 1343 LEEILQSFPHLSSIDIRGCGQFGELALKFPNINWVKSQKSRGAKFNDSRSKIRSLKQITE 1402
LEEIL FP +SS+DI GC QFG+L++ + N++W++ Q +R + + S+IRSLKQ T+
Sbjct: 1309 LEEILHIFPRISSVDITGCSQFGDLSVNYKNVSWLRCQNTRSGELH---SRIRSLKQATD 1365
Query: 1403 KSSSAPKSKGLGDDMDDFGDLKDYFESVDKRDSANQSFRRSLYQRSKVFDARKSSSILSR 1462
S KSKG+G D DDFG+LKDYF+ V+KRDSANQ FRRSLY+RSK++DARKSS+ILSR
Sbjct: 1366 GS----KSKGVGGDTDDFGNLKDYFDRVEKRDSANQLFRRSLYKRSKLYDARKSSAILSR 1421
Query: 1463 DARMRRWSIKKSENGYKRMEEFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGL 1522
DAR+RRW+IKKSE+GYKR+EEFLA SL+ IM+ NTF+FF KV++IE +MK GYY+SHGL
Sbjct: 1422 DARIRRWAIKKSEHGYKRVEEFLALSLRGIMKQNTFDFFALKVSQIEEKMKNGYYVSHGL 1481
Query: 1523 GSVKDDISRMCRDAIKAKNRGSAGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKD 1582
SVK+DISRMCR+AIK +E+MKSW+D
Sbjct: 1482 RSVKEDISRMCREAIK-----------------------------------DELMKSWQD 1506
Query: 1583 ESPAGLYSATSKYKKKLSKMVSERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNR 1642
S GL SA SKY KKLSK V+E+KYM+R++ T NG DYGEYASDREI++RLSKLNR
Sbjct: 1507 GS--GLSSA-SKYNKKLSKTVTEKKYMSRTSDTFGVNGASDYGEYASDREIKRRLSKLNR 1563
Query: 1643 KSLDSGSETSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLD- 1701
KS SGSETS + S++GKSD+ S+ S ++S+ D RS+GR+++ R FT DE D
Sbjct: 1564 KSFSSGSETSSE---LSDNGKSDNYSSASASESESDIRSEGRSQDLRTERYFTADESFDS 1620
Query: 1702 FSDDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKN 1761
+++REWGARMTKASLVPPVTRKYEVI++Y IVADEE+V+RKMRVSLPEDY EKLNAQ+N
Sbjct: 1621 VTEEREWGARMTKASLVPPVTRKYEVIEKYAIVADEEEVQRKMRVSLPEDYGEKLNAQRN 1680
Query: 1762 GSEELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFI 1821
G EELDMELPEVK++KPRK LGD+V EQEVYGIDPYTHNLLLDSMP ELDW+L +KH FI
Sbjct: 1681 GIEELDMELPEVKEFKPRKLLGDEVLEQEVYGIDPYTHNLLLDSMPGELDWSLQDKHSFI 1740
Query: 1822 EDVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSR 1881
EDV+LRTLN+QVR FTG+GNTPM++PL+PVIEE+++ A ++CD+RT+KMC+ +LK ++SR
Sbjct: 1741 EDVVLRTLNRQVRLFTGSGNTPMVFPLRPVIEELKESAREECDIRTLKMCQVVLKEIESR 1800
Query: 1882 PDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAP 1941
DDKYV+YRKGLGVVCNKEGGFGE+DFVVEFLGEVYPVWKWFEKQDGIRSLQ+N DPAP
Sbjct: 1801 SDDKYVSYRKGLGVVCNKEGGFGEEDFVVEFLGEVYPVWKWFEKQDGIRSLQENKTDPAP 1860
Query: 1942 EFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTV 2001
EFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIY+V
Sbjct: 1861 EFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYSV 1920
Query: 2002 RGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLD 2061
R I YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF+KVLK+ HGLL+
Sbjct: 1921 RAIEYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFQKVLKDWHGLLE 1980
Query: 2062 RHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEE 2121
RH+LMLEAC LNSVSEEDYLELGRAGLGSCLLGGLP+WV+AYSARLVRFIN ERTKLPEE
Sbjct: 1981 RHRLMLEACILNSVSEEDYLELGRAGLGSCLLGGLPDWVIAYSARLVRFINFERTKLPEE 2040
Query: 2122 ILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKK 2181
IL+HNLEEKRKYFSDI L+VEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMR VFGDPK
Sbjct: 2041 ILKHNLEEKRKYFSDIHLDVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRHVFGDPKN 2100
Query: 2182 APPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQ 2241
APPP+ERL+PEETVSF+W G+GSLV+EL+Q ++PH+EE +LN+L+SKI +HDPSGS D+
Sbjct: 2101 APPPLERLTPEETVSFVWNGDGSLVDELVQSLSPHLEEGILNELRSKIHSHDPSGSADVL 2160
Query: 2242 RELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPL 2301
+EL++SLLWLRDE+R+LPCTYKCR+DAAADLIHIYAYTKCFF+V+EY++F S PV+ISPL
Sbjct: 2161 KELQRSLLWLRDEIRDLPCTYKCRNDAAADLIHIYAYTKCFFKVREYQSFISSPVHISPL 2220
Query: 2302 DLGPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGS 2361
DLG KYADKLG ++ YRKTYGENYCLGQLI+W+ QTN DPD TL +A+RGCLSLPD+ S
Sbjct: 2221 DLGAKYADKLGESIKEYRKTYGENYCLGQLIYWYNQTNTDPDLTLVKATRGCLSLPDVAS 2280
Query: 2362 FYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSLTG 2421
FYAK QKPS+HRVYGPKTV+ M+S+M KQPQRPWPKD+IW FKS+PR+FGSPM D+ L
Sbjct: 2281 FYAKAQKPSKHRVYGPKTVKTMVSQMSKQPQRPWPKDKIWTFKSTPRVFGSPMFDAVLNN 2340
Query: 2422 CPLDREMVHWLKHRPAIFQAMWD 2444
LDRE++ WL++R +FQA WD
Sbjct: 2341 SSLDRELLQWLRNRRHVFQATWD 2363
>gi|186511821|ref|NP_193253.4| putative histone-lysine N-methyltransferase ATXR3 [Arabidopsis
thaliana]
gi|229488102|sp|O23372.2|ATXR3_ARATH RecName: Full=Probable histone-lysine N-methyltransferase ATXR3;
AltName: Full=Protein SET DOMAIN GROUP 2; AltName:
Full=Trithorax-related protein 3; Short=TRX-related
protein 3
gi|332658165|gb|AEE83565.1| putative histone-lysine N-methyltransferase ATXR3 [Arabidopsis
thaliana]
Length = 2335
Score = 2681 bits (6950), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1432/2524 (56%), Positives = 1777/2524 (70%), Gaps = 270/2524 (10%)
Query: 1 MGDGGVACMPLQQQQQHNSIMERFPISDKTTICVGNSSNNSNKTNNNSISNNNDNKTNND 60
M DGGVACMPL +IME+ PI +KTT+C GN S KT
Sbjct: 1 MSDGGVACMPLL------NIMEKLPIVEKTTLCGGNES-----------------KTAAT 37
Query: 61 SSNNNGSSSSKNNETNKSNVKKNGVSTKTVRKKIVK-IKKVIAVKKKEVQKNSGSS---- 115
+ N + S ++K E+ +N K + S +K+IVK I+KV+ + K+ QK +
Sbjct: 38 TENGHTSIATKVPESQPAN-KPSASSQPVKKKRIVKVIRKVVKRRPKQPQKQADEQLKDQ 96
Query: 116 ---------------------KSNNNGENIDNKNVENGGAVGEVVTVDKENLKNEEVEEG 154
KS G K VENGG G +EVEEG
Sbjct: 97 PPSQVVQLPAESQLQIKEQDKKSEFKGGTSGVKEVENGGDSG----------FKDEVEEG 146
Query: 155 ELGTLKW----ENGEFVQPEKSQPQSQLQSQSKQIEKGEIIV------------------ 192
ELGTLK ENGE + P KS Q +IEKGEI+
Sbjct: 147 ELGTLKLHEDLENGE-ISPVKSL-------QKSEIEKGEIVGESWKKDEPTKGEFSHLKY 198
Query: 193 ---------FSS-KCRRGETEKGESGLWRGNKDDIEKGEFIPDRWHK-EVVKDEYGYSKS 241
FS+ K +G E+ E WR D+IEKGEFIPDRW K + KD++ Y +S
Sbjct: 199 HKGYVERRDFSADKNWKGGKEEREFRSWRDPSDEIEKGEFIPDRWQKMDTGKDDHSYIRS 258
Query: 242 RR----------YDYKLERTPPSGKYSGEDVYRRKEFDRSGSQHSKSSSRWESGQERNVR 291
RR Y+Y+ ERTPP G++ ED+Y ++EF SG +R R
Sbjct: 259 RRNGVDREKTWKYEYEYERTPPGGRFVNEDIYHQREF--------------RSGLDRTTR 304
Query: 292 ISSKIVDDEGLYKGEHNNGKNHGREYFH-GNRFKRHGTDSDSGDRKY-YGDYGDFAGLKS 349
ISSKIV +E L+K E+NN N +EY GNR KRHG + DS +RK+ Y DYGD+ K
Sbjct: 305 ISSKIVIEENLHKNEYNNSSNFVKEYSSTGNRLKRHGAEPDSIERKHSYADYGDYGSSKC 364
Query: 350 RRLSDDYNSRSVHSEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRVIYDRHGR 409
R+LSDD SRS+HS+HYS+HS E+ +R+S S+ SSL+KY +H + S ++ D+HG
Sbjct: 365 RKLSDDC-SRSLHSDHYSQHSAERLYRDSYPSKNSSLEKYPRKHQDASFPAKAFSDKHGH 423
Query: 410 SPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPYA 469
SPS SD SPHDR RY+++R DRSPY+RERSPY ++S +A
Sbjct: 424 SPSRSDWSPHDRSRYHENR---------------------DRSPYARERSPYIFEKSSHA 462
Query: 470 REKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNNHREAS 529
R++SP DR H RSP +E SP DR+R DR D PN++E + R+R N HRE S
Sbjct: 463 RKRSPRDRRHH--DYRRSPSYSEWSPHDRSRPSDRRDYIPNFMEDTQSDRNRRNGHREIS 520
Query: 530 SKTGASEKRNARYDSKGHEDKLGPKDSNARCSRSSAKESQDKSNVQDLNVSDEKTANCES 589
K+G E+R+ + ++ E K K+SN + S SS+KE Q K+ + + ++ EK + C+S
Sbjct: 521 RKSGVRERRDCQTGTE-LEIKHKYKESNGKESTSSSKELQGKNILYNNSLLVEKNSVCDS 579
Query: 590 HKEEQPQSSSVDCKEPPQVDGPPLEELVSMEEDMDICDTPPHVPAVTDSSVGKWFYLDHC 649
K P ++ KEP QV P EEL SME DMDICDTPPH P +DSS+GKWFYLD+
Sbjct: 580 SKIPVPCATG---KEPVQVGEAPTEELPSMEVDMDICDTPPHEPMASDSSLGKWFYLDYY 636
Query: 650 GMECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSITSDSVTQ 709
G E GP+RL DLK L+E+G+L SDH IKH D+NRW
Sbjct: 637 GTEHGPARLSDLKALMEQGILFSDHMIKHSDNNRW------------------------- 671
Query: 710 LVSPPEASGNLLADTGDTA------QSTGEEFPVTLQSQCCPDGSAAAAESSEDLHIDVR 763
LV+PPEA GNLL D DT Q G+ P + + PDG E+ ED ID+R
Sbjct: 672 LVNPPEAPGNLLEDIADTTEAVCIEQGAGDSLPELVSVRTLPDGKEIFVENREDFQIDMR 731
Query: 764 VGALLDGFTVIPGKEIETLGEILQTTFERVDWQNNGGPTWHGACVGEQKPGDQKVDELYI 823
V LLDG T+ PG+E ETLGE L+ V+++ VG +P + ++E
Sbjct: 732 VENLLDGRTITPGREFETLGEALKVN---VEFEETRRCVTSEGVVGMFRPMKRAIEEFKS 788
Query: 824 SDTKMKEAAELKSGDKDHWVVCFDSDEWFSGRWSCKGGDWKRNDEAAQDRCSRKKQVLND 883
D E+ E+ S WFSGRWSCKGGDW R DEA+QDR +KK VLND
Sbjct: 789 DDAYGSESDEIGS--------------WFSGRWSCKGGDWIRQDEASQDRYYKKKIVLND 834
Query: 884 GFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYACPDERNDGSGGSRSTQSKLA 943
GFPLC M KSG+EDPRW+ KDDLYYP S RL+LP WA++ DERN
Sbjct: 835 GFPLCLMQKSGHEDPRWHHKDDLYYPLSSSRLELPLWAFSVVDERNQ------------- 881
Query: 944 AVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSSRSARSYSSANDVRRSSAE 1003
RGVK ++L VVR+N+ VVND + +PR+KVR+KER SR AR +++D +R S E
Sbjct: 882 -TRGVKASLLSVVRLNSLVVNDQVPPIPDPRAKVRSKERCPSRPARPSPASSDSKRESVE 940
Query: 1004 SDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGEWYYLDGAGHERGPSSFSE 1063
S S S A QDSQG WK+ +NTP+DRLCTVDDLQL +G+W+Y DGAG E+GP SFSE
Sbjct: 941 SHSQSTASTGQDSQGLWKTDTSVNTPRDRLCTVDDLQLHIGDWFYTDGAGQEQGPLSFSE 1000
Query: 1064 LQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRNHGEKIMPSGDSSGLPPTQSQ 1123
LQ LV++G I+ H+SVFRK DK+WVP+T T++ + G+ GL +++Q
Sbjct: 1001 LQKLVEKGFIKSHSSVFRKSDKIWVPVTSITKSPETIAMLRGKTPALPSACQGLVVSETQ 1060
Query: 1124 DAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNREFAAAINEVLDPWINAKQ 1183
D E + ++NS FH +HPQF+GY RGKLH+LVMK++K+R+F+AAIN+V+D WI+A+Q
Sbjct: 1061 DFKYSEMDTSLNS--FHGVHPQFLGYFRGKLHQLVMKTFKSRDFSAAINDVVDSWIHARQ 1118
Query: 1184 PKKETE-HVYRKSEGDTRAGKRARLLVRESDGDEETEEELQTIQDESTFEDLCGDASFPG 1242
PKKE+E ++Y+ SE ++ KRARL+ ES D E E+ +DE TFEDLCGD +F
Sbjct: 1119 PKKESEKYMYQSSELNSCYTKRARLMAGESGEDSEMEDTQMFQKDELTFEDLCGDLTFNI 1178
Query: 1243 EESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRFYKGISRQVD 1302
E + S+ WGLLDGH LA VFH LR D+KSLAFAS+TCRHW+A + YK ISRQVD
Sbjct: 1179 EGNRSAGTVGIYWGLLDGHALARVFHMLRYDVKSLAFASMTCRHWKATINSYKDISRQVD 1238
Query: 1303 LSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGMLEEILQSFPHLSSIDIRGCG 1362
LSS+GP+CTDS +R +N ++KEK++SI+LVGCTN+T+ MLEEIL+ P +SS+DI GC
Sbjct: 1239 LSSLGPSCTDSRLRSIMNTYNKEKIDSIILVGCTNVTASMLEEILRLHPRISSVDITGCS 1298
Query: 1363 QFGELALKFPNINWVKSQKSRGAKFNDSRSKIRSLKQITEKSSSAPKSKGLGDDMDDFGD 1422
QFG+L + + N++W++ Q +R + + S+IRSLKQ T+ KSKGLG D DDFG+
Sbjct: 1299 QFGDLTVNYKNVSWLRCQNTRSGELH---SRIRSLKQTTD----VAKSKGLGGDTDDFGN 1351
Query: 1423 LKDYFESVDKRDSANQSFRRSLYQRSKVFDARKSSSILSRDARMRRWSIKKSENGYKRME 1482
LKDYF+ V+KRDSANQ FRRSLY+RSK++DAR+SS+ILSRDAR+RRW+IKKSE+GYKR+E
Sbjct: 1352 LKDYFDRVEKRDSANQLFRRSLYKRSKLYDARRSSAILSRDARIRRWAIKKSEHGYKRVE 1411
Query: 1483 EFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGLGSVKDDISRMCRDAIKAKNR 1542
EFLASSL+ IM+ NTF+FF KV++IE +MK GYY+SHGL SVK+DISRMCR+AIK
Sbjct: 1412 EFLASSLRGIMKQNTFDFFALKVSQIEEKMKNGYYVSHGLRSVKEDISRMCREAIK---- 1467
Query: 1543 GSAGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKDESPAGLYSATSKYKKKLSKM 1602
+E+MKSW+D S GL SAT KY KKLSK
Sbjct: 1468 -------------------------------DELMKSWQDGS--GLSSAT-KYNKKLSKT 1493
Query: 1603 VSERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRKSLDSGSETSDDLDGSSEDG 1662
V+E+KYM+R++ T NG DYGEYASDREI++RLSKLNRKS S S+TS + S++G
Sbjct: 1494 VAEKKYMSRTSDTFGVNGASDYGEYASDREIKRRLSKLNRKSFSSESDTSSE---LSDNG 1550
Query: 1663 KSDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLD-FSDDREWGARMTKASLVPPV 1721
KSD+ S+ S ++S+ D RS+GR+++ R FT D+ D +++REWGARMTKASLVPPV
Sbjct: 1551 KSDNYSSASASESESDIRSEGRSQDLRIEKYFTADDSFDSVTEEREWGARMTKASLVPPV 1610
Query: 1722 TRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEELDMELPEVKDYKPRKQ 1781
TRKYEVI++Y IVADEE+V+RKMRVSLPEDY EKLNAQ+NG EELDMELPEVK+YKPRK
Sbjct: 1611 TRKYEVIEKYAIVADEEEVQRKMRVSLPEDYGEKLNAQRNGIEELDMELPEVKEYKPRKL 1670
Query: 1782 LGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNKQVRHFTGTGN 1841
LGD+V EQEVYGIDPYTHNLLLDSMP ELDW+L +KH FIEDV+LRTLN+QVR FTG+G+
Sbjct: 1671 LGDEVLEQEVYGIDPYTHNLLLDSMPGELDWSLQDKHSFIEDVVLRTLNRQVRLFTGSGS 1730
Query: 1842 TPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEG 1901
TPM++PL+PVIEE+++ A ++CD+RTMKMC+G+LK ++SR DDKYV+YRKGLGVVCNKEG
Sbjct: 1731 TPMVFPLRPVIEELKESAREECDIRTMKMCQGVLKEIESRSDDKYVSYRKGLGVVCNKEG 1790
Query: 1902 GFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLV 1961
GFGE+DFVVEFLGEVYPVWKWFEKQDGIRSLQ+N DPAPEFYNIYLERPKGDADGYDLV
Sbjct: 1791 GFGEEDFVVEFLGEVYPVWKWFEKQDGIRSLQENKTDPAPEFYNIYLERPKGDADGYDLV 1850
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
VVDAMH ANYASRICHSCRPNCEAKVTAVDGHYQIGIY+VR I YGEEITFDYNSVTESK
Sbjct: 1851 VVDAMHMANYASRICHSCRPNCEAKVTAVDGHYQIGIYSVRAIEYGEEITFDYNSVTESK 1910
Query: 2022 EEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYL 2081
EEYEASVCLCGSQVCRGSYLNLTGEGAF+KVLK+ HGLL+RH+LMLEAC LNSVSEEDYL
Sbjct: 1911 EEYEASVCLCGSQVCRGSYLNLTGEGAFQKVLKDWHGLLERHRLMLEACVLNSVSEEDYL 1970
Query: 2082 ELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICLEV 2141
ELGRAGLGSCLLGGLP+W++AYSARLVRFIN ERTKLPEEIL+HNLEEKRKYFSDI L+V
Sbjct: 1971 ELGRAGLGSCLLGGLPDWMIAYSARLVRFINFERTKLPEEILKHNLEEKRKYFSDIHLDV 2030
Query: 2142 EKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLWKG 2201
EKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMR VFGDPK APPP+ERL+PEETVSF+W G
Sbjct: 2031 EKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRHVFGDPKNAPPPLERLTPEETVSFVWNG 2090
Query: 2202 EGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCT 2261
+GSLV+EL+Q ++PH+EE LN+L+SKI HDPSGS D+ +EL++SLLWLRDE+R+LPCT
Sbjct: 2091 DGSLVDELLQSLSPHLEEGPLNELRSKIHGHDPSGSADVLKELQRSLLWLRDEIRDLPCT 2150
Query: 2262 YKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKLGADLQVYRKT 2321
YKCR+DAAADLIHIYAYTKCFF+V+EY++F S PV+ISPLDLG KYADKLG ++ YRKT
Sbjct: 2151 YKCRNDAAADLIHIYAYTKCFFKVREYQSFISSPVHISPLDLGAKYADKLGESIKEYRKT 2210
Query: 2322 YGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVR 2381
YGENYCLGQLI+W+ QTN DPD TL +A+RGCLSLPD+ SFYAK QKPS+HRVYGPKTV+
Sbjct: 2211 YGENYCLGQLIYWYNQTNTDPDLTLVKATRGCLSLPDVASFYAKAQKPSKHRVYGPKTVK 2270
Query: 2382 FMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSL-TGCPLDREMVHWLKHRPAIFQ 2440
M+S+M KQPQRPWPKD+IW FKS+PR+FGSPM D+ L LDRE++ WL++R +FQ
Sbjct: 2271 TMVSQMSKQPQRPWPKDKIWTFKSTPRVFGSPMFDAVLNNSSSLDRELLQWLRNRRHVFQ 2330
Query: 2441 AMWD 2444
A WD
Sbjct: 2331 ATWD 2334
>gi|357453545|ref|XP_003597050.1| Histone-lysine N-methyltransferase E(z) [Medicago truncatula]
gi|355486098|gb|AES67301.1| Histone-lysine N-methyltransferase E(z) [Medicago truncatula]
Length = 2512
Score = 2550 bits (6608), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1325/2090 (63%), Positives = 1583/2090 (75%), Gaps = 72/2090 (3%)
Query: 394 HEPSLSSRVIYDRHGRSPSHSDRSPHDRG-RYYDHRDRSPSRHDRSPYTRD--------R 444
H P S +D RSP+ +++SP D+G R YD + RSP+R ++SP + R
Sbjct: 457 HSPQDPSWRQHDHKLRSPARAEQSPQDQGWRQYDPKLRSPARTEQSPRNQGWRHNDHKLR 516
Query: 445 SPYTFDRSPYSRE-RSPYNRDRSPYAREKSPYDRS-RHYDHRNRSPFSAERSPQDRARFH 502
SP ++SP + R ++ RSP E+SP + + DH+ RSP E+SP D+ R
Sbjct: 517 SPARTEQSPRGQGWRHNDHKLRSPACTEQSPRGQGWQQNDHKLRSPARTEQSPHDQGRRR 576
Query: 503 DRSDRTPNYLERSPLHRSRPNNHREASSKTGASEKRNARYDSKGHEDKLGPKDSNARCSR 562
D TPN E SP R+ + H E S K +SE N K EDK P++S
Sbjct: 577 GLRDCTPNLGEESPHVRTTKDVHEETSCKNSSSENLNFPNSCKSDEDKHIPRESAC---- 632
Query: 563 SSAKESQDKSNVQDLNVSDEK-TANCESHKEEQPQSSSVDCKEPPQVDG-PPLEELVSME 620
S ES+ + NVQ N S EK ++ + +Q S +VD KE PQ + PP +EL+SME
Sbjct: 633 -SVTESEGERNVQKTNESIEKDISSSQPVDTQQSCSPTVDHKESPQCEAQPPPDELLSME 691
Query: 621 EDMDICDTPPHVPAVTDSSVGKWFYLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKHLD 680
EDMDICDTPPHVP VTD S GKWFYLD+ G+E GP++LCD+K LV+EGVL+SDHFIKHLD
Sbjct: 692 EDMDICDTPPHVPVVTDLSSGKWFYLDYGGVENGPTKLCDIKALVDEGVLMSDHFIKHLD 751
Query: 681 SNRWETVENAVSPLVTVNFPSITSDSVTQLVSPPEASGNLLADTGDTAQSTGEEFPVTLQ 740
SNRW TVENAVSPLV FPS+ SD++TQLV+PPEASGNLLADT D QS P L
Sbjct: 752 SNRWLTVENAVSPLVAQIFPSVVSDTITQLVNPPEASGNLLADTADI-QSAPANNPEML- 809
Query: 741 SQCCPDG----SAAAAESSEDLHIDVRVGALLDGFTVIPGKEIETLGEILQTTFERVDWQ 796
+ P G + +E ++ +ID RV LL+G+ VIPG E+E + E LQ FE
Sbjct: 810 APSPPRGHLNDNVLTSELLDNFYIDERVQKLLEGYDVIPGMELEAIKEALQMKFEYPKED 869
Query: 797 NNG---GPTWHGACVGEQKPGDQKVDELYISDTKMKEAAELKSGDKDHWVVCFDSDEWFS 853
G G WH +C+ E D D ++ + + +KD +WFS
Sbjct: 870 GLGDYEGFPWHVSCLRED--CDSSTD---LASRDSESQLSMSCDNKDDGFGYGIPKDWFS 924
Query: 854 GRWSCKGGDWKRNDEAAQDRCSRKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYPSHSR 913
WSCKGGDWKRND+ QDR RKK VLN+GFPLCQ+PKSG EDPRW + DDLY PS SR
Sbjct: 925 TLWSCKGGDWKRNDDT-QDRFFRKKVVLNNGFPLCQLPKSGCEDPRWPEIDDLYCPSQSR 983
Query: 914 RLDLPPWAYACPDERNDGSGGSRSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSFVSEP 973
LDLP WA DE D + SRS QSK +++GVKG +L VVRINACVVND G +SE
Sbjct: 984 -LDLPLWAVGA-DELVDCNAASRSVQSKPPSIKGVKGNVLSVVRINACVVNDQGLLLSES 1041
Query: 974 RSKVRAKERHSSRSARSYSSANDVRRSSAESDSHSKARNNQDSQGSWKSIACINTPKDRL 1033
R + R K+R RS R ++S +D +RSS E S SKA ++Q GS++S+ I PKD L
Sbjct: 1042 RHQTRGKDRQHPRSTRPFTSTSDSKRSSTEESSQSKAVSDQ---GSYQSMEFIGVPKDHL 1098
Query: 1034 CTVDDLQLQLGEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFA 1093
CT+ +LQL LG+WYY+D +G E+GPSSFSELQ LVDQG I++H+SVFRK DK+WVP+ A
Sbjct: 1099 CTIQELQLHLGDWYYIDASGREKGPSSFSELQSLVDQGVIKRHSSVFRKRDKLWVPIASA 1158
Query: 1094 TETSASTVRNHGEKIMPSGDSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGK 1153
ET +H + G S P Q+Q GES +S+ F+ +HPQF+G+TRGK
Sbjct: 1159 AETLDVCPTSHQKSSSTLGACSDHPSQQTQGVSYGESC--TSSSLFNKIHPQFVGFTRGK 1216
Query: 1154 LHELVMKSYKNREFAAAINEVLDPWINAKQPKKETE-HVYRKSEGDTRAGKRARLLVRES 1212
LHELVMKSYK+RE AAAINEVLDPWINA+QPKK+ E +Y KSEGDTRA KRAR+LV +S
Sbjct: 1217 LHELVMKSYKSRELAAAINEVLDPWINARQPKKDIEKQIYWKSEGDTRAAKRARMLVDDS 1276
Query: 1213 DGDEETEEELQTIQDESTFEDLCGDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRS 1272
+ D E+ + ++E TFEDL GDA+FP +E + E G WGLLDG LA +FHFLRS
Sbjct: 1277 EEDSGLEDGVTIGKNEPTFEDLRGDATFPEKEIGITDSEVGSWGLLDGPVLARIFHFLRS 1336
Query: 1273 DMKSLAFASLTCRHWRAAVRFYKGISRQVDLSSVGPNCTDSLIRKTLNAFDKEKLNSILL 1332
D KSL FAS+TC+HW AAVRFYK IS Q++LSS+G +CTDS++ +NA++K+K+NSI+L
Sbjct: 1337 DFKSLVFASMTCKHWSAAVRFYKEISMQLNLSSLGHSCTDSVLWNIMNAYEKDKINSIIL 1396
Query: 1333 VGCTNITSGMLEEILQSFPHLSSIDIRGCGQFGELALKFPNINWVKSQKSRGAKFNDSRS 1392
+GC NIT+ MLE+IL SFP L +IDIRGC QFGEL KF N+ W+KS+ SR +
Sbjct: 1397 IGCNNITADMLEKILLSFPGLCTIDIRGCSQFGELTPKFTNVKWIKSRSSRMDGIAEEPH 1456
Query: 1393 KIRSLKQITEKSSSAPKSKGLGDDMDDFGDLKDYFESVDKRDSANQSFRRSLYQRSKVFD 1452
KIRSLK IT ++ SA KS LG +DDFG LK+YF+SVDKRDSA Q FR++LY+RSK++D
Sbjct: 1457 KIRSLKHITGQTLSASKSSNLG--IDDFGQLKEYFDSVDKRDSAKQLFRQNLYKRSKLYD 1514
Query: 1453 ARKSSSILSRDARMRRWSIKKSENGYKRMEEFLASSLKEIMRVNTFEFFVPKVAEIEGRM 1512
ARKSSSILSRDAR RRW+IKKSE+G+KRMEEFLAS LKEIM+ N+ +FFVPKVAEIE +M
Sbjct: 1515 ARKSSSILSRDARTRRWAIKKSESGFKRMEEFLASRLKEIMKTNSCDFFVPKVAEIEAKM 1574
Query: 1513 KKGYYISHGLGSVKDDISRMCRDAIKAKNRGSAGDMNRITTLFIQLATRLEQGAKSSYYE 1572
K GYY S GL SVK+DISRMCRDAIKAK+RG A DMN I TLFIQLA+RLE +K+
Sbjct: 1575 KSGYYSSRGLSSVKEDISRMCRDAIKAKSRGDASDMNHIVTLFIQLASRLEASSKN-VQG 1633
Query: 1573 REEMMKSWKDESPAGLYSATSKYKKKLSKMVSERKYMNRSNGTSLANGDFDYGEYASDRE 1632
R+ ++KSW ++SPA S +SKYKK +++V+ERKY RSNG + D +Y SD+E
Sbjct: 1634 RDVLLKSWDNDSPAMFSSTSSKYKK--NRLVNERKY--RSNG---KHNILDNLDYTSDKE 1686
Query: 1633 IRKRLSKLNRKSLDSGSETSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAG 1692
IR+RLSKLN+KS+ S SETSDDLD S ED KSDS+ST +++ SD + RS R+ R G
Sbjct: 1687 IRRRLSKLNKKSMGSESETSDDLDRSFEDDKSDSDSTTAESGSDHEVRSKITTRDPRD-G 1745
Query: 1693 DFTTDEGLDF-SDDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPED 1751
F+ + LDF +DDREWGARMTKASLVPPVTRKYEVID Y IVADEE+VRRKM+VSLP+D
Sbjct: 1746 CFSPEGELDFITDDREWGARMTKASLVPPVTRKYEVIDHYCIVADEEEVRRKMQVSLPDD 1805
Query: 1752 YAEKLNAQKNGSEELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELD 1811
YAEKL+AQKNG+EE DMELPEVK +KPRK+LG++V EQEVYGIDPYTHNLLLDSMP+ELD
Sbjct: 1806 YAEKLSAQKNGTEESDMELPEVKSFKPRKELGNEVIEQEVYGIDPYTHNLLLDSMPEELD 1865
Query: 1812 WNLLEKHLFIEDVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMC 1871
W+L EKHLFIED LL+TLNK VR TGTGNTPM YPLQP+I++I++ A + CD R ++MC
Sbjct: 1866 WSLQEKHLFIEDTLLQTLNKHVRSSTGTGNTPMSYPLQPIIDDIKRCAEEGCDARMLRMC 1925
Query: 1872 RGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEV--------------- 1916
+GILKAM+SRPDDKYVAYRKGLGVVCNKE GF +DDFVVEFLGEV
Sbjct: 1926 QGILKAMNSRPDDKYVAYRKGLGVVCNKEEGFSQDDFVVEFLGEVRHHICTVLIFNIFLQ 1985
Query: 1917 -YPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRI 1975
YPVWKWFEKQDGIRSLQK++ DPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRI
Sbjct: 1986 VYPVWKWFEKQDGIRSLQKDSTDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRI 2045
Query: 1976 CHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQV 2035
CHSCRPNCEAKVTAVDG YQIGIY+VR I +GEEITFDYNSVTESKEEYEASVCLCGSQV
Sbjct: 2046 CHSCRPNCEAKVTAVDGQYQIGIYSVRKIQHGEEITFDYNSVTESKEEYEASVCLCGSQV 2105
Query: 2036 CRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGG 2095
CRGSYLNLTGEGAF+KVLK+ HG+LDRH LMLEACE N VSEEDY +LGRAGLGSCLLGG
Sbjct: 2106 CRGSYLNLTGEGAFQKVLKDSHGILDRHYLMLEACESNIVSEEDYNDLGRAGLGSCLLGG 2165
Query: 2096 LPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYN 2155
LP+W+VAY+ARLVRFIN ERTKLPEEIL+HNL+EKRKYFSD+ LEVE+SDAEVQAEGVYN
Sbjct: 2166 LPDWLVAYAARLVRFINFERTKLPEEILKHNLDEKRKYFSDVHLEVERSDAEVQAEGVYN 2225
Query: 2156 QRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLWKGEGSLVEELIQCMAP 2215
QRLQNLAVTLDKVRYVMRC+FGDP+KAPPP+E+LSPEE VS LWKGEGS VEEL+Q +A
Sbjct: 2226 QRLQNLAVTLDKVRYVMRCIFGDPRKAPPPLEKLSPEEVVSSLWKGEGSFVEELLQGIAA 2285
Query: 2216 HVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHI 2275
HVEED+LNDLKSKI A DPS S DI +ELRKSLLWLRDE+R+L CTYKCRHDAAADL+HI
Sbjct: 2286 HVEEDILNDLKSKIHARDPSSSADILKELRKSLLWLRDEIRSLSCTYKCRHDAAADLLHI 2345
Query: 2276 YAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKLGADLQVYRKTYGENYCLGQLIFWH 2335
YAYTK FFR+QEY+ TSPPV+ISPLDLGPKY +KLGA++Q YRK YGENYCLGQLIFWH
Sbjct: 2346 YAYTKHFFRIQEYQTVTSPPVHISPLDLGPKYTNKLGAEIQEYRKVYGENYCLGQLIFWH 2405
Query: 2336 IQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPW 2395
Q+N DPD +L RASRGCLSLPDI SFYAK Q PS++RVYGP+TVR ML+RMEKQPQR W
Sbjct: 2406 NQSNTDPDRSLVRASRGCLSLPDINSFYAKAQNPSQNRVYGPRTVRSMLARMEKQPQRSW 2465
Query: 2396 PKDRIWAFKSSPRIFGSPMLDSSLTGCPLDREMVHWLKHRPAIFQAMWDR 2445
PKD+IW F+SSP+ FGSPMLD+ + LDREMVHWLKHRP + MWDR
Sbjct: 2466 PKDQIWLFRSSPKFFGSPMLDAVINNSTLDREMVHWLKHRPDV---MWDR 2512
Score = 340 bits (871), Expect = 8e-90, Method: Compositional matrix adjust.
Identities = 232/530 (43%), Positives = 313/530 (59%), Gaps = 85/530 (16%)
Query: 1 MGDGGVACMPLQQQQQHNSIMERFPISDKTTICVGNSSNNSNKTNNNSISNNNDNKTNND 60
MGDGGV CMPLQ IME+ S+K+ C G+ N ++ N S ++ D
Sbjct: 1 MGDGGVTCMPLQY------IMEKISSSEKSH-CGGSKFVNGDRKNMKS----RKSELGFD 49
Query: 61 SSNNNGSSSSKNNETNKSNVKKNGVSTKTVRKKIVKIKKVIAVKKKEVQKNSGSSKSNNN 120
N +GS ++ K V++ + T + EV+
Sbjct: 50 RVNKSGSDVENGDKVLKEEVEEGELVTN------------FKWPRSEVE----------- 86
Query: 121 GENIDNKNVENGGAVGEVVTVDKENLKNEEVEEGELGTLKWENGEFVQPEKSQPQSQLQS 180
+ENG V E V + E+E GE+ +W+ EF + EK + +S
Sbjct: 87 --------IENGEIVPENVMS-----RRSEIENGEIVGERWKTREFEKFEKGEFRSG-NW 132
Query: 181 QSKQIEKGEIIVFSSKCRRGETEKGESGLWRGNKDDIEKGEFIPDRWHKEVV--KDEYGY 238
+ +E+GEI+ S K RRGE E G G WRG KDD EKGEF+PDRW+K + K++YG
Sbjct: 133 RRDDVERGEIV--SEKGRRGENEYG-PGSWRGGKDDYEKGEFVPDRWYKGEMGGKNDYGN 189
Query: 239 SKSRR--------YDYKLERT-PPSGKYSGEDVYRRKEF-DRSGSQHSKSSSRWESGQER 288
+RR + ++ ERT PPS +Y+GED +R+KEF +RSG+QH+K+SSRWE+ Q R
Sbjct: 190 ISNRRNYPGKDKGWKFQRERTPPPSWRYTGEDSFRKKEFINRSGNQHAKNSSRWENAQPR 249
Query: 289 NVRISSKIVDDEGLYKGEHNNGKNHGREYF-HGNRFKRHGTDSDSGDRKYYGDYGDFAGL 347
NVR SSKIVDDE K ++NGK+H R+Y G+R KR G D D +RK+ Y DF L
Sbjct: 250 NVRTSSKIVDDE---KNAYSNGKDHTRDYTSSGSRLKRPGNDFDGYERKH---YADFTNL 303
Query: 348 KSRRLSDDYNSRSVHSEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRVIYDRH 407
KSRRLSDD N R +SE+YSR VE+ +RN++S+R+S+ +KYSSR+HE SLS+R YDRH
Sbjct: 304 KSRRLSDD-NYRCAYSENYSRRPVEQSYRNNNSTRLSA-EKYSSRNHESSLSTRPAYDRH 361
Query: 408 GRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSP 467
RSP HS+ SP DR RYYD R+R+P R RSP+ R+RSPY+ D+SP++RERSPY R
Sbjct: 362 ERSPVHSEWSPRDRSRYYDQRERTPVR--RSPFGRERSPYSRDKSPHARERSPYMRS--- 416
Query: 468 YAREKSPYDRSRHYDHRNRSPFSAERSPQDRA-RFHDRSDRTPNYLERSP 516
+DRSR +DH+ RSP E+SPQD+ R HD R+P E SP
Sbjct: 417 -------WDRSRQHDHKLRSPVRTEQSPQDQGWRQHDHKLRSPARTEHSP 459
>gi|224095776|ref|XP_002310475.1| SET domain protein [Populus trichocarpa]
gi|222853378|gb|EEE90925.1| SET domain protein [Populus trichocarpa]
Length = 2350
Score = 2293 bits (5943), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1129/1522 (74%), Positives = 1295/1522 (85%), Gaps = 39/1522 (2%)
Query: 926 DERNDGSGGSRSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSS 985
D+RND G SRST +K RGVKGT+LPVVRINACVV DH VSE R+KVR K+R+ S
Sbjct: 866 DDRNDTGGVSRSTLNKPPITRGVKGTVLPVVRINACVVQDH--VVSETRTKVRGKDRYHS 923
Query: 986 RSARSYSSANDVRRSSAESDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGE 1045
RSAR++S+ NDV+ SS E DS S+ N+QDS G WKS A +NTPKDRLCT DDLQL LG+
Sbjct: 924 RSARTHSATNDVKSSSVECDSQSRVVNDQDSHGCWKSTASLNTPKDRLCTADDLQLNLGD 983
Query: 1046 WYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRNHG 1105
WYYLDG+GHERGP SFSELQ L D+G IQK++SVFRKFD+VWVP+ ATETS + VR
Sbjct: 984 WYYLDGSGHERGPLSFSELQNLADKGTIQKYSSVFRKFDRVWVPVASATETSEAAVRIQQ 1043
Query: 1106 EKIMPSGDSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNR 1165
+ S SSG +SQ A ESN + S++FH++HPQFIG+TRGKLHELVMKSYKNR
Sbjct: 1044 SNVELSVGSSGTL-LKSQTAANIESNKD--SSSFHSLHPQFIGFTRGKLHELVMKSYKNR 1100
Query: 1166 EFAAAINEVLDPWINAKQPKKETE-HVYRKSEGDTRAGKRARLLVRESDGDEETEEE-LQ 1223
EFA AINE LDPWI AKQP+KE + H+Y KSE D R GKRA + + D E EE+ L
Sbjct: 1101 EFAVAINEALDPWIVAKQPQKELDKHMYLKSEIDVRVGKRAWMQPDQIVKDNEMEEDTLH 1160
Query: 1224 TIQDESTFEDLCGDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLT 1283
+ E+TFE LCGD +F EES S IE+G WGLLDGH LA +FHFLRSD+KSL FASLT
Sbjct: 1161 KV--ETTFEQLCGDTNFHREESMCSEIEAGSWGLLDGHMLARIFHFLRSDLKSLVFASLT 1218
Query: 1284 CRHWRAAVRFYKGISRQVDLSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGML 1343
C+HWRAAV FYKGIS QVDLSSVG NCTD ++R +N ++KEK+N+++L GCTN+TSGML
Sbjct: 1219 CKHWRAAVSFYKGISIQVDLSSVGLNCTDLMVRSIMNGYNKEKINAMVLTGCTNVTSGML 1278
Query: 1344 EEILQSFPHLSSIDIRGCGQFGELALKFPNINWVKSQKSRGAKFNDSRSKIRSLKQITEK 1403
EEIL S P LSSIDIRGC QF EL +FP ++W+KS R +S SK+RSLKQI+ +
Sbjct: 1279 EEILCSLPCLSSIDIRGCTQFMELVHQFPRVSWLKS---RTRIPEESNSKLRSLKQISGR 1335
Query: 1404 SSSAPKSKGLGDDMDDFGDLKDYFESVDKRDSANQSFRRSLYQRSKVFDARKSSSILSRD 1463
DDFG+LK+YF+SV+KRDSANQ FRRSLY+RSKVFDARKSSSILSRD
Sbjct: 1336 --------------DDFGELKEYFDSVNKRDSANQLFRRSLYKRSKVFDARKSSSILSRD 1381
Query: 1464 ARMRRWSIKKSENGYKRMEEFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGLG 1523
ARMRRW++KKSEN Y RME FLA+ LK+IM+ N F+FFVPKVAEIE RMK GYY+ HGL
Sbjct: 1382 ARMRRWAVKKSENSYTRMEGFLAAGLKDIMKENIFDFFVPKVAEIEDRMKNGYYVGHGLR 1441
Query: 1524 SVKDDISRMCRDAIKAKNRGSAGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKDE 1583
SVK+DISRMCRDAIK KNRG AGDMN I TLF QLA+RLE+ +K SY ER+E+MKSWKD+
Sbjct: 1442 SVKEDISRMCRDAIKVKNRG-AGDMNHIITLFFQLASRLEESSKFSY-ERDELMKSWKDD 1499
Query: 1584 SPAGLYSATSKYKKKLSKMVSERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRK 1643
A L SA K K + +KYMNRSNGT ANG FDYGEYASD+EI+KR+SKLNRK
Sbjct: 1500 LSAALDSAP----MKHKKKATGKKYMNRSNGTIPANGSFDYGEYASDQEIKKRISKLNRK 1555
Query: 1644 SLDSGSETSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLDFS 1703
S+DSGSETSDD SSEDG+S S+ST SDT+SD+DFRS+GR ESRG TDE
Sbjct: 1556 SMDSGSETSDDR--SSEDGRSGSDSTASDTESDLDFRSEGRTGESRGDRYCMTDE----- 1608
Query: 1704 DDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGS 1763
D+REWGARMTK SLVPPVTRKYEVIDQY+IVADEEDV+RKM VSLP+DYAEKL+AQKNG+
Sbjct: 1609 DEREWGARMTKVSLVPPVTRKYEVIDQYLIVADEEDVQRKMSVSLPDDYAEKLDAQKNGT 1668
Query: 1764 EELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIED 1823
EELDMELPEVKDYKPRKQLGD+V EQEVYGIDPYTHNLLLDSMP+E+DW LL+KH+FIED
Sbjct: 1669 EELDMELPEVKDYKPRKQLGDEVIEQEVYGIDPYTHNLLLDSMPEEVDWPLLQKHMFIED 1728
Query: 1824 VLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPD 1883
VLL TLNKQVRHFTG GNTPM Y +QPV+EEIE+ A++DCD+R MK+CRGIL+A+DSRPD
Sbjct: 1729 VLLCTLNKQVRHFTGAGNTPMTYAIQPVVEEIEQAAMEDCDIRKMKICRGILRAIDSRPD 1788
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
DKYVAYRKGLGVVCNKEGGFG+DDFVVEFLGEVYP WKWFEKQDGIR LQK++++PAPEF
Sbjct: 1789 DKYVAYRKGLGVVCNKEGGFGDDDFVVEFLGEVYPAWKWFEKQDGIRLLQKDSKEPAPEF 1848
Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSC+PNCEAKVTAVDG YQIGIYTVR
Sbjct: 1849 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCKPNCEAKVTAVDGQYQIGIYTVRE 1908
Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRH 2063
I +GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF+KVLKE HGLLDRH
Sbjct: 1909 IQHGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFQKVLKEWHGLLDRH 1968
Query: 2064 QLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEIL 2123
LML ACELNSVSEEDYL+LGRAGLGSCLLGGLP+WVVAYSARLVRFINLERTKLPEEIL
Sbjct: 1969 YLMLGACELNSVSEEDYLDLGRAGLGSCLLGGLPDWVVAYSARLVRFINLERTKLPEEIL 2028
Query: 2124 RHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAP 2183
RHNL+EKRKYF+D CLEVE+SDAEVQAEGVYNQRLQNLAVTLDKVRYVMRC+FGDPK+AP
Sbjct: 2029 RHNLKEKRKYFADTCLEVERSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCIFGDPKQAP 2088
Query: 2184 PPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRE 2243
PP+E+L+PEETVSFLWKG+GSLV+EL+QCM+P+++ED+LNDLKSK+ AHDPS +DIQ+
Sbjct: 2089 PPLEKLTPEETVSFLWKGDGSLVDELLQCMSPYMDEDMLNDLKSKVCAHDPSDCDDIQKA 2148
Query: 2244 LRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDL 2303
L+KSLLWLRDEVR+LPCTYKCRHDAAADLIH+YAYTK FFRV++Y AFTSPPV+ISPLDL
Sbjct: 2149 LQKSLLWLRDEVRSLPCTYKCRHDAAADLIHVYAYTKSFFRVRDYDAFTSPPVHISPLDL 2208
Query: 2304 GPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFY 2363
GPK ADKLG Y+KTYG +YC+GQLIFWH+QTN +PD TLA+AS+GCLSLP+IGSFY
Sbjct: 2209 GPKCADKLGGLPHKYQKTYGGSYCMGQLIFWHVQTNTEPDFTLAKASKGCLSLPEIGSFY 2268
Query: 2364 AKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSLTGCP 2423
AKVQKPS+ R+YGPKTV+ ML RMEK PQ+PWPKD+IW+FK+SP++FGSPMLD+ L P
Sbjct: 2269 AKVQKPSQQRIYGPKTVKMMLERMEKYPQKPWPKDQIWSFKNSPKVFGSPMLDAVLNNAP 2328
Query: 2424 LDREMVHWLKHRPAIFQAMWDR 2445
LDREMVHWLKHRP ++QA+WDR
Sbjct: 2329 LDREMVHWLKHRPTVYQAVWDR 2350
Score = 795 bits (2053), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 489/890 (54%), Positives = 579/890 (65%), Gaps = 130/890 (14%)
Query: 1 MGDGGVACMPLQQQQQHNSIMERFPISDK-------------TTICVGNSSNNSNKTNNN 47
MGDGGVACMPLQ + + ERFP+ +K TT V +NNSN +
Sbjct: 1 MGDGGVACMPLQHSSNNIIMEERFPVQEKNTTTTVTAAVPSTTTTKVETVNNNSNSGSGG 60
Query: 48 SISNNNDNKTNNDSSNNNGSSSSKNNETNKSNVKKN------------------------ 83
SNNN+N ++ NG S+S NN K
Sbjct: 61 GSSNNNNNNVSSGDKKGNGKSNSSNNGVTGKVKKVKRIVKVKKVVRKVVVGEKKGVGLVR 120
Query: 84 ------GVSTKTV---------RKKIVKIKKVIAVKKKEVQKNSGSSKSNNNGENID--N 126
G +K V KK K K+V A KK+ K +++ +G I +
Sbjct: 121 EVKSACGSGSKEVVVLEKKESGLKKEEKSKEVTAEKKESGWKKELAAEKKESGLKISSGS 180
Query: 127 KNVENGGAVGEVVTVDKENLKN--EEVEEGELGTLKW------ENGEFVQ-PEKSQPQSQ 177
K VENG +G T + N EEVEEGELGTLKW ENGEFV PEK
Sbjct: 181 KTVENGDGLGSGDTKLQSGSNNIKEEVEEGELGTLKWPTKGEIENGEFVPIPEK------ 234
Query: 178 LQSQSKQIEKGEIIVFSSKCRRGETEKGESGLWRGNK--------DDIEKGEFIPDRWHK 229
+ +IE+GEI S K ++G+ EKGE + GNK D+IEKGEFIPDRW+
Sbjct: 235 --PRRSEIERGEI--GSEKWKKGDIEKGE--IVSGNKWQRGEVVRDEIEKGEFIPDRWNG 288
Query: 230 EVVKDEYGYSKSR-RYDYKLERTPPSGKYSGEDVYRRKEFDRS-GSQHSKSSSRWESGQE 287
KDEYGY +SR RYD ERTPPSGKYS EDV RRKE RS GS HSKSS RWESGQE
Sbjct: 289 ---KDEYGYIRSRGRYDMSRERTPPSGKYSCEDVNRRKELTRSGGSLHSKSSMRWESGQE 345
Query: 288 RNVRISSKIVDDEGLYKGEHNNGKNHGREYFHGNRFKRHGTDSDSGDRKYYGDYGDFAGL 347
R+ RISSKIVD+EG YK E++NGKN GREY GNR KRHGTDSDS +RK+YGDY +
Sbjct: 346 RSTRISSKIVDEEGSYKSEYSNGKNPGREYSSGNRLKRHGTDSDSTERKHYGDY---SSS 402
Query: 348 KSRRLSDDYNSRSVHSEHYSRHSVEKFHRN-SSSSRISSLDKYSSRHHEPSLSSRVIYDR 406
KSRRLS+D SR +SEHYSRHSVE+F++N SSSSR+S DKYSSRHHE +L S+V+YDR
Sbjct: 403 KSRRLSED-GSRYAYSEHYSRHSVERFYKNSSSSSRVSLSDKYSSRHHESTLPSKVVYDR 461
Query: 407 HGRSPSHSDRSPHDRGRYYDHRDRSPSRH----------------------------DRS 438
H HSD SPH+R RY DHRDRSP RH +RS
Sbjct: 462 H----VHSDWSPHERPRYNDHRDRSPIRHEKSPYGRERTPYGLERSPYGRERSPYGRERS 517
Query: 439 PYTRDRSPYTFDRSPYSRERSPYNRDRSPYAREKSPYDRSRHYDHRNRSPFSAERSPQDR 498
PY RDRSPY DRSPY RE+SPY R+RSPY EKSPYDRSRHY+HR RSP ERSPQDR
Sbjct: 518 PYWRDRSPYGHDRSPYGREKSPYGRERSPYGLEKSPYDRSRHYEHRKRSPSYVERSPQDR 577
Query: 499 ARFHDRSDRTPNYLERSPLHRSRPNNHREASSKTGASEKRNARYDSKGHEDKLGPKDSNA 558
AR HDRSDRTPNYLERSP R++PNN+REA K GA+EKRN++Y +K EDK+ KD +A
Sbjct: 578 ARHHDRSDRTPNYLERSPHDRAKPNNYREA-RKGGATEKRNSQYGNKQQEDKISQKDPDA 636
Query: 559 RCSRSSAKESQDKSNVQDLNVSDEKTANCESHKEEQPQSSSVDCKEPPQVDGPPLEELVS 618
R + SAKESQDKS+V +L+ DEK A+ E+ EE+ +S ++ KEPPQVDGPP EEL S
Sbjct: 637 RDTEPSAKESQDKSSVLNLDGLDEKNASSETRIEEKSESPRINVKEPPQVDGPPPEELQS 696
Query: 619 MEEDMDICDTPPHVPAVTDSSVGKWFYLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKH 678
MEEDMDICDTPPHVPAV D+S GKWFYLDH G+ECGPS+LC+LK LV+EG L+SDHFIKH
Sbjct: 697 MEEDMDICDTPPHVPAVADTSTGKWFYLDHFGVECGPSKLCELKALVDEGSLMSDHFIKH 756
Query: 679 LDSNRWETVENAVSPLVTVNFPSITSDSVTQLVSPPEASGNLLADTGDTAQS---TGEEF 735
L S+RW T+ENA+SP V VNFPS+ D++TQLVSPPEA GNLLADTGD QS GE
Sbjct: 757 LHSDRWLTIENALSPFVPVNFPSVVPDAITQLVSPPEAPGNLLADTGDIGQSCAQIGEGV 816
Query: 736 PVT-LQSQCCPDGSAAAAESSEDLHIDVRVGALLDGFTVIPGKEIETLGE 784
L+ CPD S A+ES EDL ID RVGALL+GF+V+PG E+ET+G+
Sbjct: 817 SGNFLKPPVCPDHSEIASESLEDLQIDERVGALLEGFSVVPGSELETVGD 866
>gi|222640020|gb|EEE68152.1| hypothetical protein OsJ_26262 [Oryza sativa Japonica Group]
Length = 2255
Score = 2122 bits (5499), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1179/2335 (50%), Positives = 1501/2335 (64%), Gaps = 187/2335 (8%)
Query: 150 EVEEGELGTLKWENGEFVQPEKSQPQSQLQ-----SQSKQIEKGEIIVFSSKCRRGETEK 204
E+EEGEL + +N E+S P + + S + ++E GEI++ S K R+
Sbjct: 61 ELEEGELLNGEADNSSSRDLERSMPPKKWRKVLAASSAAEVEPGEIVMPSKKARKN---- 116
Query: 205 GESGLWRGNKDDIEKGEFIPDRWHKEVVKDEYGYSKSRRYDYKLERTPPSGKYSGEDVYR 264
GE +EKGE P+R K+ KS R K E P GE
Sbjct: 117 GE----------LEKGEIAPERQRKD------KSDKSGRKSNKDEVEP------GEVAPP 154
Query: 265 RKEFDRSGSQHSKSSSRWESGQERNVRISSKIVDDEGLYKGEHNN-----GKNHGREYFH 319
K+ DR ++ SS++ V D+G KG + G+
Sbjct: 155 DKKQDRDHNKKLGSSAQ---------------VRDDGSKKGSSRDSDEEPGEIRPESSST 199
Query: 320 GNRFKRHGTDSDSGDRKYYGDYGDFAGLKSRRLSDDYNSRSVHSEHYSRHSVEKFHRNSS 379
G+ K T+ ++ + K+ D D G KSRR + +S RH
Sbjct: 200 GSARKSRATEPENSNHKHQADTCDQTGSKSRRKGEAKSS--------GRH---------- 241
Query: 380 SSRISSLDKYSSRHHEPSLSSRVIYDRHGRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSP 439
S R+ + S +R DRH RSP R PHDR R+ DRSPSR + SP
Sbjct: 242 ---------LSGRNRDISPMTR---DRHERSPGILGRFPHDRLRH----DRSPSRLEPSP 285
Query: 440 YTRDRSPYTFDRSPYSRERSPYNRDRSPYAREKSPYDRSRHYDHRNRSPFSAERSPQDRA 499
R R DRSPY SP +R R + R+ +P R + HR+ +P + SP+ R+
Sbjct: 286 RDRGRHYDNRDRSPYI---SPRHRMRPSHYRDNTP-SRGEMHHHRDNTPSRVDSSPR-RS 340
Query: 500 RFHDRSDRTPNYLERSPLHRSRPNNHREASSKTGASEKRNARYDSKGHEDKLGPKD---- 555
+ D DR+P+ ++SP R R EA K+ ++ N + H+ K +
Sbjct: 341 QHEDFRDRSPSRRDKSPSERGRTTESHEAGKKSRGAKLENNSLEKAQHKSKSTKQSTKSK 400
Query: 556 -----SNARCSRSSAKESQDKSNVQDLNVSDEKTANCESHKEEQPQSSSVDCKEPPQV-- 608
SN + S+ A E+ + + P + PP+
Sbjct: 401 SSSNGSNEKISKEKATETIQYTELPPPPPLPPPPPPPPPPPPPLPPNMPPPLPPPPEPEL 460
Query: 609 DGPPLEELVSMEEDMDICDTPPHV----PAVTD---SSVGKWFYLDHCGMECGPSRLCDL 661
+G P E+ VSMEEDMDICDTPPH P T+ S VGKWFYLDH G+E GPS+L DL
Sbjct: 461 NGAPAED-VSMEEDMDICDTPPHTTSSAPGPTEPPASDVGKWFYLDHYGIEQGPSKLADL 519
Query: 662 KTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSITSDSVTQLVSPPEASGNLL 721
K LVE+G L+SDH IKH DSNRW TVENA SPLV FPS+ SD TQLVSPPEA GNLL
Sbjct: 520 KKLVEDGYLLSDHLIKHADSNRWVTVENAASPLVPSEFPSVYSDVSTQLVSPPEAPGNLL 579
Query: 722 ADTGDTAQSTGEEFPVTLQSQCCPDGSAAAAESSEDLHIDVRVGALLDGFTVIPGKEIET 781
+ + A T E A+AE ED +ID RV AL+DG ++ G+E+E
Sbjct: 580 DEAREEASGTDHE-----------QMKEASAEEQEDFYIDDRVDALMDGSIMVDGQELEI 628
Query: 782 LGEILQTTFERVDWQNNGGPTWHGACVGEQKPGDQKVDELYISDTKMKEAAELKSGDKDH 841
LGE+L FE V+W++ + E+ G ++ E D++ + ++D
Sbjct: 629 LGELLNAHFEPVNWESEDLSRFQVKL--ERDDGTKRSTEF--PDSRTAHIYGVVPAERDT 684
Query: 842 WVVCFDSDEWFSGRWSCKGGDWKRNDEAAQDRCSRKKQVLNDGFPLCQMPKSGYEDPRWN 901
+ +S EW+SGRWSCKGGDWKRND+ +QD+ RKK VLN+G+PLCQMPK +EDPRW
Sbjct: 685 YQPHIESSEWYSGRWSCKGGDWKRNDDFSQDKPYRKKLVLNEGYPLCQMPKGNHEDPRWG 744
Query: 902 QKDDLYYPSHSRRLDLPPWAYACPDERND--------GSGGSRSTQSKLAAVRGVKGTML 953
KDDLYYP +++LDLP WA++ +E +D G RS Q+K +GVKGT L
Sbjct: 745 CKDDLYYPLRAKKLDLPLWAFSSTEENDDTVDDASKSGVMPGRSGQTKQPP-KGVKGTTL 803
Query: 954 PVVRINACVVNDHGSFVSEPRSKVRAKERHSSRSARSYSSANDVRRSSAESDSHSKARNN 1013
PVV+INA VV D S SE R K + +R SRS+RS+S D R S+ E SHSK +
Sbjct: 804 PVVKINARVVKDQSS--SELRIKPKVADRPPSRSSRSHSIGTD-RSSTHEGSSHSKKHHE 860
Query: 1014 QDSQGSWKSIACINTPKDRLCTVDDLQLQLGEWYYLDGAGHERGPSSFSELQVLVDQGCI 1073
DSQ KS + N PKD +CTV++L +++G+WYYLDG GHERGP S+SELQ L +G I
Sbjct: 861 HDSQSLHKSKSVPNIPKDHVCTVEELSVKVGDWYYLDGTGHERGPFSYSELQELAKKGTI 920
Query: 1074 QKHTSVFRKFDKVWVPLTFATETSASTVRNHGEKIMPSGDSSGLPPTQSQDAVLGESNNN 1133
+ +SVFRK D W+P+ K + SG S+ S + L SN +
Sbjct: 921 LEGSSVFRKIDNTWLPVL---------------KDLKSGCSARNGEAGSSTSALTHSNQS 965
Query: 1134 VNSNAFHTMHPQFIGYTRGKLHELVMKSYKNREFAAAINEVLDPWINAKQPKKETEHVYR 1193
FH MHPQF+GYTRGKLHELVMK +K+RE AINEVL+PWI KQP+KE E +
Sbjct: 966 ----NFHEMHPQFVGYTRGKLHELVMKYFKSRELTLAINEVLEPWIATKQPRKELETFFS 1021
Query: 1194 KSEG-------DTRAGKRARLLVRESDG-DEETEEELQTIQDESTFEDLCGDASFPGEES 1245
S D + KRARLL +SD + +E+ L + +D+ FEDL A+ E
Sbjct: 1022 HSSASKNFVQEDGGSTKRARLLPDQSDEYTDMSEDILASQKDDCCFEDLFEGAAHVKESP 1081
Query: 1246 ASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRFYKGISRQVDLSS 1305
+S ES WGLL+ H LA +FHFLR+D+KSL ++ TC W A ++Y+ + R +DLSS
Sbjct: 1082 LNSRTESESWGLLNEHVLARIFHFLRADVKSLISSAATCSWWNTAAKYYRSVCRFIDLSS 1141
Query: 1306 VGPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGMLEEILQSFPHLSSIDIRGCGQFG 1365
+GP CTD++ + +D + + +++L GC+N++S L E+L+ FPH+S + I+GC Q G
Sbjct: 1142 LGPQCTDNVFHDIMAGYDMQNIRTLVLTGCSNLSSLALAEVLKRFPHISYVHIQGCSQLG 1201
Query: 1366 ELALKFPNINWVKSQKSRGAKFNDSRSKIRSLKQITEKSSSAPKS-KGLGDDMDDFGDLK 1424
+L KF ++ W+KS + A + KIRSLKQI + S+S K+ + L M +L
Sbjct: 1202 DLKNKFQHVKWIKSSLNPDASYQ----KIRSLKQIDDGSNSTSKAGRILTSQMGGSDELD 1257
Query: 1425 DYFESVDKRDSANQSFRRSLYQRSKVFDARKSSSILSRDARMRRWSIKKSENGYKRMEEF 1484
YF + R+S+ SF + Y+RSK D RKSS++LSRDA+MRR +K+EN Y++MEEF
Sbjct: 1258 GYFADISNRESSTLSFGQGFYKRSKWLDIRKSSAVLSRDAQMRRLMQRKAENSYRKMEEF 1317
Query: 1485 LASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGLGSVKDDISRMCRDAIKAKNRGS 1544
+ + LKEIM+ + F+FFVPKVA+IE R+K GYY HG +K+DI MCRDA++ K R
Sbjct: 1318 VINKLKEIMKSSRFDFFVPKVAKIEVRLKNGYYARHGFSYIKNDIRSMCRDALRYKGRSD 1377
Query: 1545 AGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKDESPAGLYSATSKYKKKLSKMVS 1604
GDM +I FIQLA +LE S + + K D S YS+ K KKK SK +S
Sbjct: 1378 LGDMKQIVVAFIQLAKKLENPRLISDRDGTAVQK---DSSDMSQYSSDLKLKKKQSKTMS 1434
Query: 1605 ERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRKSLDSGSETSDDLDGSSEDGKS 1664
ER+ G + D A DREI++ LSKL ++ +DSGSETSDD DG SE ++
Sbjct: 1435 ERR------GANWTTAGADPSSRAFDREIKRSLSKLKKRDIDSGSETSDDDDGYSEGDET 1488
Query: 1665 DSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLDFSDDREWGARMTKASLVPPVTRK 1724
+SE+TVSDT+SD+D S + G F + E L +DDR WGARMTKASLVPPVTRK
Sbjct: 1489 ESETTVSDTESDLDVNSGAWDLKGNGMKLFESSESL--TDDRGWGARMTKASLVPPVTRK 1546
Query: 1725 YEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEELDMELPEVKDYKPRKQLGD 1784
YEVI++Y+IVADEE+V RKMRV+LP+DY+EKL +QKNG+E L ELPEVKDY+PRK GD
Sbjct: 1547 YEVIEKYLIVADEEEVLRKMRVALPDDYSEKLLSQKNGTENL--ELPEVKDYQPRKVPGD 1604
Query: 1785 QVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNKQVRHFTGTGNTPM 1844
+V EQEVYGIDPYTHNLLL+ MP ELDW +KH F+E++LL TLNKQVR FTG+GNTPM
Sbjct: 1605 EVLEQEVYGIDPYTHNLLLEMMPTELDWPSSDKHTFVEELLLNTLNKQVRQFTGSGNTPM 1664
Query: 1845 MYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFG 1904
+YPL+PVIEEI+K A + D RT KMC G+LKAM + P+ Y GLGVVCNK GGFG
Sbjct: 1665 VYPLKPVIEEIQKSAEESGDRRTSKMCLGMLKAMRNHPE-----YNYGLGVVCNKTGGFG 1719
Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
DDFV+EF GEVYP W+W+EKQDGI+ +Q N++D APEFYNI LERPKGD DGYDLV VD
Sbjct: 1720 VDDFVIEFFGEVYPSWRWYEKQDGIKHIQNNSDDQAPEFYNIMLERPKGDRDGYDLVFVD 1779
Query: 1965 AMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY 2024
AMHKANYASRICHSC PNCEAKVTAVDGHYQIGIYTVR I GEEITFDYNSVTESKEE+
Sbjct: 1780 AMHKANYASRICHSCNPNCEAKVTAVDGHYQIGIYTVRPIAEGEEITFDYNSVTESKEEH 1839
Query: 2025 EASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELG 2084
EASVCLCGSQ+CRGSYLN +GEGAFEKVL E HG+LDRH L+L+ACE NSVS++D ++LG
Sbjct: 1840 EASVCLCGSQICRGSYLNFSGEGAFEKVLMEFHGVLDRHSLLLQACEANSVSQQDLIDLG 1899
Query: 2085 RAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICLEVEKS 2144
RAGLG+CLL GLP W+VAY+A LVRFI ER KLP EI +HN++EKR++F+DI ++ EK+
Sbjct: 1900 RAGLGTCLLAGLPGWLVAYTAHLVRFIFFERQKLPHEIFKHNVDEKRQFFTDINMDSEKN 1959
Query: 2145 DAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLWKGEGS 2204
DAEVQAEGV N RLQNL TLDKVRYVMRC+FGDPK APPP+ RL+ VS +WKGEGS
Sbjct: 1960 DAEVQAEGVLNSRLQNLTHTLDKVRYVMRCIFGDPKNAPPPLVRLTGRSLVSAIWKGEGS 2019
Query: 2205 LVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKC 2264
LV+EL++ M PHVEEDVL DLK+KI+AHDPSGSEDI+ E+R SLLWLRDE+R L CTYKC
Sbjct: 2020 LVDELLESMEPHVEEDVLTDLKAKIRAHDPSGSEDIEGEIRSSLLWLRDELRTLSCTYKC 2079
Query: 2265 RHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKLGADLQVYRKTYGE 2324
RHDAAADLIH+YAYTKCFFRV++YK SPPV ISPLDLGPKYADKLG Q Y KTY E
Sbjct: 2080 RHDAAADLIHMYAYTKCFFRVRDYKTVKSPPVLISPLDLGPKYADKLGPGFQEYCKTYPE 2139
Query: 2325 NYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFML 2384
NYCLGQLI+W+ Q NA+P+ L RA +GC+SLPD+ SFY K KP++ RVYG +TVRFML
Sbjct: 2140 NYCLGQLIYWYSQ-NAEPESRLTRARKGCMSLPDVSSFYVKSVKPTQERVYGSRTVRFML 2198
Query: 2385 SRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSLTGCPLDREMVHWLKHRPAIF 2439
+RME Q QRPWPKDRIW FKS PR FG+PM+D+ L PLD+EMVHWLK R +F
Sbjct: 2199 ARMENQAQRPWPKDRIWVFKSDPRFFGTPMMDAVLNNSPLDKEMVHWLKTRSNVF 2253
>gi|2244876|emb|CAB10297.1| hypothetical protein [Arabidopsis thaliana]
gi|7268264|emb|CAB78560.1| hypothetical protein [Arabidopsis thaliana]
Length = 2351
Score = 2066 bits (5352), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1071/1765 (60%), Positives = 1298/1765 (73%), Gaps = 161/1765 (9%)
Query: 709 QLVSPPEASGNLLADTGDTA------QSTGEEFPVTLQSQCCPDGSAAAAESSEDLHIDV 762
+LV+PPEA GNLL D DT Q G+ P + + PDG E+ ED ID+
Sbjct: 570 ELVNPPEAPGNLLEDIADTTEAVCIEQGAGDSLPELVSVRTLPDGKEIFVENREDFQIDM 629
Query: 763 RVGALLDGFTVIPGKEIETLGEILQTTFE----------RVDWQNNG--GPTWHG---AC 807
RV LLDG T+ PG+E ETLGE L+ E V NN P
Sbjct: 630 RVENLLDGRTITPGREFETLGEALKVNVEFEETRRCVTSEVFAPNNTKFSPKQKAEPNKF 689
Query: 808 VGEQKPGDQKVDELYISDTKMKEAAELKSGDKDHWVVCFDSDEWFSGRWSCKGGDWKRND 867
VG +P + ++E D E+ E+ S WFSGRWSCKGGDW R D
Sbjct: 690 VGMFRPMKRAIEEFKSDDAYGSESDEIGS--------------WFSGRWSCKGGDWIRQD 735
Query: 868 EAAQDRCSRKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYACPDE 927
EA+QDR +KK VLNDGFPLC M KSG+EDPRW+ KDDLYYP S RL+LP WA++ DE
Sbjct: 736 EASQDRYYKKKIVLNDGFPLCLMQKSGHEDPRWHHKDDLYYPLSSSRLELPLWAFSVVDE 795
Query: 928 RNDGSGGSRSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSSRS 987
RN RGVK ++L VVR+N+ VVND + +PR+KVR+KER SR
Sbjct: 796 RNQ--------------TRGVKASLLSVVRLNSLVVNDQVPPIPDPRAKVRSKERCPSRP 841
Query: 988 ARSYSSANDVRRSSAESDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGEWY 1047
AR +++D +R S ES S S A QDSQG WK+ +NTP+DRLCTVDDLQL +G+W+
Sbjct: 842 ARPSPASSDSKRESVESHSQSTASTGQDSQGLWKTDTSVNTPRDRLCTVDDLQLHIGDWF 901
Query: 1048 YLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRNHGEK 1107
Y DGAG E+GP SFSELQ LV++G I+ H+SVFRK DK+WVP+T T++ + G+
Sbjct: 902 YTDGAGQEQGPLSFSELQKLVEKGFIKSHSSVFRKSDKIWVPVTSITKSPETIAMLRGKT 961
Query: 1108 IMPSGDSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNREF 1167
GL +++QD E + ++NS FH +HPQF+GY RGKLH+LVMK++K+R+F
Sbjct: 962 PALPSACQGLVVSETQDFKYSEMDTSLNS--FHGVHPQFLGYFRGKLHQLVMKTFKSRDF 1019
Query: 1168 AAAINEVLDPWINAKQPKKETEHVYRKSEGDTR-----------------------AGKR 1204
+AAIN+V+D WI+A+QPKKE+E +S G + R
Sbjct: 1020 SAAINDVVDSWIHARQPKKESEKYMYQSSGMHNYQNLNFPLTYWFLNLGGCLLLFFSPSR 1079
Query: 1205 ARLLVRESDGDEETEEELQTIQDESTFEDLCGDASFPGEESASSAIESGGWGLLDGHTLA 1264
ARL+ ES D E E+ +DE TFEDLCGD +F E + S+ WGLLDGH LA
Sbjct: 1080 ARLMAGESGEDSEMEDTQMFQKDELTFEDLCGDLTFNIEGNRSAGTVGIYWGLLDGHALA 1139
Query: 1265 HVFHFLRSDMKSLAFASLTCRHWRAAVRFYKGISRQVDLSSVGPNCTDSLIRKTLNAFDK 1324
VFH LR D+KSLAFAS+TCRHW+A + YK ISRQVDLSS+GP+CTDS +R +N ++K
Sbjct: 1140 RVFHMLRYDVKSLAFASMTCRHWKATINSYKDISRQVDLSSLGPSCTDSRLRSIMNTYNK 1199
Query: 1325 EKLNSILLVGCTNITSGMLEEILQSFPHLSSIDIRGCGQFGELALKFPNINWVKSQKSRG 1384
EK++SI+LVGCTN+T+ MLEEIL+ P +SS+DI GC QFG+L + + N++W++ Q +R
Sbjct: 1200 EKIDSIILVGCTNVTASMLEEILRLHPRISSVDITGCSQFGDLTVNYKNVSWLRCQNTR- 1258
Query: 1385 AKFNDSRSKIRSLKQITEKSSSAPKSKGLGDDMDDFGDLKDYFESVDKRDSANQSFRRSL 1444
S KSKGLG D DDFG+LKDYF+ V+KRDSANQ FRRSL
Sbjct: 1259 --------------------SDVAKSKGLGGDTDDFGNLKDYFDRVEKRDSANQLFRRSL 1298
Query: 1445 YQRSKVFDARKSSSILSRDARMRRWSIKKSENGYKRMEEFLASSLKEIMRVNTFEFFVPK 1504
Y+RSK++DAR+SS+ILSRDAR+RRW+IKKSE+GYKR+EEFLASSL+ IM+ NTF+FF K
Sbjct: 1299 YKRSKLYDARRSSAILSRDARIRRWAIKKSEHGYKRVEEFLASSLRGIMKQNTFDFFALK 1358
Query: 1505 V------AEIEGRMKKGYYISHGLGSVKDDISRMCRDAIK-------------AKNRGSA 1545
V ++IE +MK GYY+SHGL SVK+DISRMCR+AI G +
Sbjct: 1359 VLSGTCVSQIEEKMKNGYYVSHGLRSVKEDISRMCREAINFVIFLLTLLCIQGGGIEGGS 1418
Query: 1546 GDMNRITTLFIQLATRLEQGAK-SSYYEREEMMKSWKDESPAGLYSATSKYKKKLSKMVS 1604
DMNRI LFIQLATRLE+ + +S Y R+E+MKSW+D S GL SAT KY KKLSK V+
Sbjct: 1419 KDMNRIIALFIQLATRLEEVSMITSSYGRDELMKSWQDGS--GLSSAT-KYNKKLSKTVA 1475
Query: 1605 ERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRKSLDSGSETSDDLDGSSEDGKS 1664
E+KYM+R++ T NG DYGEYASDREI++RLSKLNRKS S S+TS + S++GKS
Sbjct: 1476 EKKYMSRTSDTFGVNGASDYGEYASDREIKRRLSKLNRKSFSSESDTSSE---LSDNGKS 1532
Query: 1665 DSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLD-FSDDREWGARMTKASLVPPVTR 1723
D+ S+ S ++S+ D RS+GR+++ R FT D+ D +++REWGARMTKASLVPPVTR
Sbjct: 1533 DNYSSASASESESDIRSEGRSQDLRIEKYFTADDSFDSVTEEREWGARMTKASLVPPVTR 1592
Query: 1724 KYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEELDMELPEVKDYKPRKQLG 1783
KYEVI++Y IVADEE+V+RKMRVSLPEDY EKLNAQ+NG EELDMELPEVK+YKPRK LG
Sbjct: 1593 KYEVIEKYAIVADEEEVQRKMRVSLPEDYGEKLNAQRNGIEELDMELPEVKEYKPRKLLG 1652
Query: 1784 DQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNKQVRHFTGTGNTP 1843
D+V EQEVYGIDPYTHNLLLDSMP ELDW QVR FTG+G+TP
Sbjct: 1653 DEVLEQEVYGIDPYTHNLLLDSMPGELDW-------------------QVRLFTGSGSTP 1693
Query: 1844 MMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGF 1903
M++PL+PVIEE+++ A ++CD+RTMKMC+G+LK ++SR DDKYV+YRKGLGVVCNKEGGF
Sbjct: 1694 MVFPLRPVIEELKESAREECDIRTMKMCQGVLKEIESRSDDKYVSYRKGLGVVCNKEGGF 1753
Query: 1904 GEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPK------GDADG 1957
GE+DFVVEFLGEVYPVWKWFEKQDGIRSLQ+N DPAPEFYNIYLERPK GDADG
Sbjct: 1754 GEEDFVVEFLGEVYPVWKWFEKQDGIRSLQENKTDPAPEFYNIYLERPKVWRKYDGDADG 1813
Query: 1958 YDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSV 2017
YDLVVVDAMH ANYASRICHSCRPNCEAKVTAVDGHYQIGIY+VR I YGEEITFDYNSV
Sbjct: 1814 YDLVVVDAMHMANYASRICHSCRPNCEAKVTAVDGHYQIGIYSVRAIEYGEEITFDYNSV 1873
Query: 2018 TESKEEYEASVCLCG-------SQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEAC 2070
TE +C QVCRGSYLNLTGEGAF+KVLK+ HGLL+RH+LMLEAC
Sbjct: 1874 TEVCSLLSLLLCSSTVGKYYFVGQVCRGSYLNLTGEGAFQKVLKDWHGLLERHRLMLEAC 1933
Query: 2071 ELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEK 2130
LNSVSEEDYLELGRAGLGSCLLGGLP+W++AYSARLVRFIN ERTKLPEEIL+HNLEEK
Sbjct: 1934 VLNSVSEEDYLELGRAGLGSCLLGGLPDWMIAYSARLVRFINFERTKLPEEILKHNLEEK 1993
Query: 2131 RKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLS 2190
RKYFSDI L+VEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMR VFGDPK APPP+ERL+
Sbjct: 1994 RKYFSDIHLDVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRHVFGDPKNAPPPLERLT 2053
Query: 2191 PEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLW 2250
PEETVSF+W G+GSLV+EL+Q ++PH+EE LN+L+SKI HDPSGS D+ +EL++SLLW
Sbjct: 2054 PEETVSFVWNGDGSLVDELLQSLSPHLEEGPLNELRSKIHGHDPSGSADVLKELQRSLLW 2113
Query: 2251 LRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRV-------QEYKAFTSPPVYISPLDL 2303
LRDE+R+LPCTYKCR+DAAADLIHIYAYTKCFF+V QEY++F S PV+ISPLDL
Sbjct: 2114 LRDEIRDLPCTYKCRNDAAADLIHIYAYTKCFFKVRMGLDMLQEYQSFISSPVHISPLDL 2173
Query: 2304 GPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFY 2363
G KYADKLG ++ YRKTYGENYCLGQLI+W+ QTN DPD TL +A+RGCLSLPD+ SFY
Sbjct: 2174 GAKYADKLGESIKEYRKTYGENYCLGQLIYWYNQTNTDPDLTLVKATRGCLSLPDVASFY 2233
Query: 2364 AKVQKPSRHRVYGPKTVRFMLSRME 2388
AK QKPS+HRVYGPKTV+ M+S+M+
Sbjct: 2234 AKAQKPSKHRVYGPKTVKTMVSQMQ 2258
Score = 337 bits (863), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 261/686 (38%), Positives = 349/686 (50%), Gaps = 159/686 (23%)
Query: 1 MGDGGVACMPLQQQQQHNSIMERFPISDKTTICVGNSSNNSNKTNNNSISNNNDNKTNND 60
M DGGVACMPL +IME+ PI +KTT+C GN S KT
Sbjct: 1 MSDGGVACMPLL------NIMEKLPIVEKTTLCGGNES-----------------KTAAT 37
Query: 61 SSNNNGSSSSKNNETNKSNVKKNGVSTKTVRKKIVK-IKKVIAVKKKEVQKNSGSS---- 115
+ N + S ++K E+ +N K + S +K+IVK I+KV+ + K+ QK +
Sbjct: 38 TENGHTSIATKVPESQPAN-KPSASSQPVKKKRIVKVIRKVVKRRPKQPQKQADEQLKDQ 96
Query: 116 ---------------------KSNNNGENIDNKNVENGGAVGEVVTVDKENLKNEEVEEG 154
KS G K VENGG G +EVEEG
Sbjct: 97 PPSQVVQLPAESQLQIKEQDKKSEFKGGTSGVKEVENGGDSG----------FKDEVEEG 146
Query: 155 ELGTLKW----ENGEFVQPEKSQPQSQLQSQSKQIEKGEII---------VFSSKCRRGE 201
ELGTLK ENGE + P KS Q +IEKGEI+ + K +G
Sbjct: 147 ELGTLKLHEDLENGE-ISPVKSL-------QKSEIEKGEIVGESWKKDEPTKADKNWKGG 198
Query: 202 TEKGESGLWRGNKDDIEKGEFIPDRWHK-EVVKDEYGYSKSRR----------YDYKLER 250
E+ E WR D+IEKGEFIPDRW K + KD++ Y +SRR Y+Y+ ER
Sbjct: 199 KEEREFRSWRDPSDEIEKGEFIPDRWQKMDTGKDDHSYIRSRRNGVDREKTWKYEYEYER 258
Query: 251 TPPSGKYSGEDVYRRKEFDRSGSQHSKSSSRWESGQERNVRISSKIVDDEGLYKGEHNNG 310
TPP G R RISSKIV +E L+K E+NN
Sbjct: 259 TPPGG--------------------------------RTTRISSKIVIEENLHKNEYNNS 286
Query: 311 KNHGREYFH-GNRFKRHGTDSDSGDRKY-YGDYGDFAGLKSRRLSDDYNSRSVHSEHYSR 368
N +EY GNR KRHG + DS +RK+ Y DYGD+ K R+LSDD SRS+HS+HYS+
Sbjct: 287 SNFVKEYSSTGNRLKRHGAEPDSIERKHSYADYGDYGSSKCRKLSDDC-SRSLHSDHYSQ 345
Query: 369 HSVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRVIYDRHGRSPSHSDRSPHDRGRYYDHR 428
HS E+ +R+S S+ SSL+KY +H + S ++ D+HG SPS SD SPHDR RY+++R
Sbjct: 346 HSAERLYRDSYPSKNSSLEKYPRKHQDASFPAKAFSDKHGHSPSRSDWSPHDRSRYHENR 405
Query: 429 DRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPYAREKSPYDRSRHYDHRNRSP 488
DRSPY R+RSPY F++S ++R+RSP +R RSP
Sbjct: 406 -------DRSPYARERSPYIFEKSSHARKRSPRDRRHH----------------DYRRSP 442
Query: 489 FSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNNHREASSKTGASEKRNARYDSKGHE 548
+E SP DR+R DR D PN++E + R+R N HRE S K+G E+R+ + ++ E
Sbjct: 443 SYSEWSPHDRSRPSDRRDYIPNFMEDTQSDRNRRNGHREISRKSGVRERRDCQTGTE-LE 501
Query: 549 DKLGPKDSNARCSRSSAKESQDKSNVQDLNVSDEKTANCESHKEEQPQSSSVDCKEPPQV 608
K K+SN + S SS+KE Q K+ + + ++ EK + C+S K P ++ KEP QV
Sbjct: 502 IKHKYKESNGKESTSSSKELQGKNILYNNSLLVEKNSVCDSSKIPVPCATG---KEPVQV 558
Query: 609 DGPPLEELVSMEEDMDICDTPPHVPA 634
P EEL SME PP P
Sbjct: 559 GEAPTEELPSME-----LVNPPEAPG 579
>gi|357139674|ref|XP_003571404.1| PREDICTED: probable histone-lysine N-methyltransferase ATXR3-like
[Brachypodium distachyon]
Length = 2214
Score = 2041 bits (5287), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1111/2110 (52%), Positives = 1403/2110 (66%), Gaps = 140/2110 (6%)
Query: 360 SVHSEHYSR---HSVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRVIYDRHGRSPSHSDR 416
S H +H++ S K R + S+ S R+HE S R +DR RSP R
Sbjct: 213 SNHRKHHAETCDQSGSKSRRKGEAKSTSAGRHLSGRNHEISTPIRDRHDRLERSPGILGR 272
Query: 417 SPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPYAREKSPYD 476
PHDR R+ H +RSPSR +RSP R R +D NRDRSPY SP
Sbjct: 273 FPHDRVRHEKH-ERSPSRLERSPRDRGRH---YD-----------NRDRSPYI---SPRH 314
Query: 477 RSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNNHREASSKTGASE 536
+ R HR+ +P + SP+ R + D DRTP +RSP R R + EAS K+
Sbjct: 315 KVRQPHHRDSTPSRIDNSPRGRIQHEDIRDRTPLRTDRSPSERGRTTDSHEASKKS---- 370
Query: 537 KRNARYDSKGHEDKLGPKDSNARCSRSSAKESQDKSNVQDLNVSDEKTANCESHKEEQPQ 596
R A+ +SK E NA+ S K+S EQP
Sbjct: 371 -RGAKLESKNLE--------NAQHKNKSMKQSL----------------------PEQPN 399
Query: 597 SSSVDCKEPPQVDGPPLEELVSMEEDMDICDTPPHV-----PAVTDSSV-GKWFYLDHCG 650
V E VSMEEDMDICDTPPH P+V S+V GKWFYLD G
Sbjct: 400 DVVV--------------EDVSMEEDMDICDTPPHTSEAPKPSVEPSTVMGKWFYLDQFG 445
Query: 651 MECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSITSDSVTQL 710
+E GPS+L DLK LV++G L+SDH IKH D NRW TVENA +PLV + + SD TQL
Sbjct: 446 VEQGPSKLADLKKLVDDGYLLSDHLIKHADCNRWVTVENAATPLVPSDISLVYSDGTTQL 505
Query: 711 VSPPEASGNLLADTGDTAQSTGEEFPVTLQSQCCPDGSAAAAESSEDLHIDVRVGALLDG 770
VSPPEA GNLL D A+ EE S A+ E EDL+ID RVGAL+ G
Sbjct: 506 VSPPEAPGNLL----DEAR---EEASALASSADNEQMEEASEEPKEDLYIDNRVGALMYG 558
Query: 771 FTVIPGKEIETLGEILQTTFERVDWQNNGGPTWHGACVGEQKPGDQKVDELYISDTKMKE 830
++ G E+E LG+ L T F RVD + P + D + +D +
Sbjct: 559 SVLVEGHELEILGDALATHFNRVDLERWDQPEDFPRFQAQPAREDVINGGIEFADNSATD 618
Query: 831 AAELKSGDKDHWVVCFDSDEWFSGRWSCKGGDWKRNDEAAQDRCSRKKQVLNDGFPLCQM 890
+ ++D + +S EWFSGRWSCKGGDWKRNDE +QD+ RKK VLN+G+ LCQM
Sbjct: 619 IYGVGPIERDTFYHNVESSEWFSGRWSCKGGDWKRNDEFSQDKPYRKKLVLNEGYALCQM 678
Query: 891 PKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYACPDERND-----GSGG---SRSTQSKL 942
PK +EDPRW+ KDDLYY +++LDLP WA++ +E D GG RS Q +
Sbjct: 679 PKGSHEDPRWHCKDDLYYHVPAKKLDLPLWAFSSTEESTDTVDDTSKGGIMPGRSGQVR- 737
Query: 943 AAVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSSRSARSYSSANDVRRSSA 1002
+ +GVKG LPVVRINA VV D S EP K R +R SRS+RS+S D R S+
Sbjct: 738 QSTKGVKGMTLPVVRINARVVKDQSSV--EPCIKPRGADRSLSRSSRSHSIGAD-RSSAH 794
Query: 1003 ESDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGEWYYLDGAGHERGPSSFS 1062
E S+SK + D Q KS + +N P+D +CTV++L ++LG+WYYLDG HE GP S+S
Sbjct: 795 EGLSYSKKHHEHDLQSFHKSKSVLNIPEDHVCTVEELSVKLGDWYYLDGTAHEHGPFSYS 854
Query: 1063 ELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRNHGEKIMPSGDSSGLPPTQS 1122
ELQ LV +G I++ +SVFRK D W+P+ + +++ S +S L +
Sbjct: 855 ELQKLVRRGTIRERSSVFRKIDNTWLPVVKDMKFDSASRNGGSGS---SNSTSALVHSDQ 911
Query: 1123 QDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNREFAAAINEVLDPWINAK 1182
+ V+ N S +FH +HPQF+GYTRGKLHELVMK +K+RE AINEVLDPWI AK
Sbjct: 912 SNVVV-----NHGSGSFHELHPQFVGYTRGKLHELVMKYFKSRELTLAINEVLDPWIAAK 966
Query: 1183 QPKKETEHVYRKSEGDTR--------AGKRARLLVRESDGDEETEEELQTI-QDESTFED 1233
QPKKE E Y + TR + KRAR L SD D + E++ T +D+ FED
Sbjct: 967 QPKKEIE-TYVANNSATRNLLPEDAGSAKRARFLPDRSDEDIDMYEDILTSHKDDCCFED 1025
Query: 1234 LCGDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRF 1293
L +A+ +S ES W LL+GH LA +FHFLR+DMKSL ++ TCR W A +
Sbjct: 1026 LFQEAAL-----TNSIAESESWDLLNGHVLARIFHFLRADMKSLISSAATCRRWNTAAKC 1080
Query: 1294 YKGISRQVDLSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGMLEEILQSFPHL 1353
Y+ R VDLSSVGP CTDS+ R + ++K+ + +++LVGC++++ LE++L PH+
Sbjct: 1081 YRNTCRFVDLSSVGPRCTDSVFRGIMAGYEKQNIKTLVLVGCSSLSPLALEKVLVQLPHI 1140
Query: 1354 SSIDIRGCGQFGELALKFPNINWVKSQKSRGAKFNDSRSKIRSLKQITEKSSSAPK-SKG 1412
S + I+GC Q ++ +F +I W+ S + +S KI+SLKQI + S K ++
Sbjct: 1141 SYVHIQGCSQLEDMKSRFQHIKWITSSLNP----EESLQKIKSLKQIDDGSGHPSKVARN 1196
Query: 1413 LGDDMDDFGDLKDYFESVDKRDSANQSFRRSLYQRSKVFDARKSSSILSRDARMRRWSIK 1472
+ + +L YF + R++AN SF + Y+RSK DARKSS++LS+DA++RR +
Sbjct: 1197 MTSQLGGSDELDGYFADISNRENANLSFGQGFYKRSKWLDARKSSAVLSKDAQLRRLMQR 1256
Query: 1473 KSENGYKRMEEFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGLGSVKDDISRM 1532
+EN Y++MEEF+ S L+EIM+ + F+FF PKV +IE R++ GYY HG S+KDDI M
Sbjct: 1257 NAENSYRKMEEFVISRLREIMKSSRFDFFDPKVEKIEARLRSGYYARHGFSSLKDDIRSM 1316
Query: 1533 CRDAIKAKNRGSAGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKDESPAGLYSAT 1592
CRDA+++K R DM +I FIQLA RL G ER + KD S Y++
Sbjct: 1317 CRDALRSKGRSE--DMKQIVVSFIQLAKRL--GNPRVISERNGAVIQ-KDNSDMVQYTSD 1371
Query: 1593 SKYKKKLSKMVSERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRKSLDSGSETS 1652
+K KKK +K ER+ N + T+ A D A DREI++ LSKL ++ +DSGSETS
Sbjct: 1372 TKLKKKQNKTTGERRGANWTAATAGA----DTSSRAFDREIKRSLSKLKKRDVDSGSETS 1427
Query: 1653 DDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLDFSDDREWGARM 1712
DD DG SE +++SE+TVSDT+SD+D S A + +G G + G +DDR WGARM
Sbjct: 1428 DDDDGYSEGDETESETTVSDTESDLDLNS--VAWDLKGNGMKLFESGDSVTDDRGWGARM 1485
Query: 1713 TKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEELDMELPE 1772
TKASLVPPVTRKYEVI++Y+IVADEE+V+RKMRV+LP+DY+EKL +QKNG+E L E+PE
Sbjct: 1486 TKASLVPPVTRKYEVIEKYLIVADEEEVQRKMRVALPDDYSEKLLSQKNGTENL--EIPE 1543
Query: 1773 VKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNKQ 1832
VK+Y+ RK GD++ EQEVYGIDP+THNLL D MP +L W+ ++H FIE++LL TLNKQ
Sbjct: 1544 VKEYQRRKVPGDEILEQEVYGIDPFTHNLLRDIMPADLGWSAADQHTFIEELLLNTLNKQ 1603
Query: 1833 VRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRP--DDK-YVAY 1889
V+ FTG+GNTPM+Y L+PVIEEI+K A + D RT+KMC G+LKAM SRP D K YVAY
Sbjct: 1604 VKDFTGSGNTPMVYHLKPVIEEIQKSAEESGDRRTVKMCLGMLKAMRSRPGPDHKHYVAY 1663
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
RKGLGVVCNK+GGFG DDFV+EF GEVYP W+W+EKQDGI+ +Q N+ED APEFYNI LE
Sbjct: 1664 RKGLGVVCNKKGGFGVDDFVIEFFGEVYPSWRWYEKQDGIKHIQNNSEDQAPEFYNIMLE 1723
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
RPKGD DGYDLV VDAMHKANYASRICHSC PNCEAKVTAVDG YQIG+YTVR I GEE
Sbjct: 1724 RPKGDRDGYDLVFVDAMHKANYASRICHSCNPNCEAKVTAVDGQYQIGVYTVRPIAEGEE 1783
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEA 2069
ITFDYNSVTESKEE+EASVCLCGSQVCRGSYLN +GEGAFEKVL E HG+LDRH L+L+A
Sbjct: 1784 ITFDYNSVTESKEEHEASVCLCGSQVCRGSYLNFSGEGAFEKVLMEFHGVLDRHSLLLQA 1843
Query: 2070 CELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEE 2129
CE NSVS++D ++LGRAGLG+CLL GLP W+VAY+A LVRFI ER KLP EI +HN++E
Sbjct: 1844 CEANSVSQQDLIDLGRAGLGTCLLAGLPGWLVAYTAHLVRFIFFERQKLPNEIFKHNVDE 1903
Query: 2130 KRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERL 2189
KR++F+DI ++ E++DAEVQAEGV N RLQNL TLDKVRYVMRCVFGDPK APPP+ RL
Sbjct: 1904 KRQFFTDINMDSERNDAEVQAEGVLNSRLQNLTHTLDKVRYVMRCVFGDPKNAPPPLVRL 1963
Query: 2190 SPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLL 2249
+ VS +WKGEGSLVEEL+Q M PHVEEDVL DLK KI+ HDPS SEDI+ ++R SLL
Sbjct: 1964 TGRSLVSAIWKGEGSLVEELLQSMEPHVEEDVLADLKDKIRDHDPSDSEDIEGDIRNSLL 2023
Query: 2250 WLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYAD 2309
WLRDE+R+L CTYKCRHDAAADLIH+YAYTKCFFR ++YK SPPV+ISPLDLGPKYAD
Sbjct: 2024 WLRDELRSLSCTYKCRHDAAADLIHMYAYTKCFFRARDYKTVKSPPVHISPLDLGPKYAD 2083
Query: 2310 KLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKP 2369
KLG Q YRKTY ENYCL QLI+W+ Q NA+P+ L RA +GC+SLPD+ SFY K
Sbjct: 2084 KLGPGFQEYRKTYPENYCLAQLIYWYSQ-NAEPESRLTRARKGCMSLPDVSSFYVTSVKQ 2142
Query: 2370 SRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSLTGCPLDREMV 2429
++ RVYG +TVRFML+RMEKQ QR WPKDRIW FK+ PR FG+PM+D+ L LD+EMV
Sbjct: 2143 TQERVYGTRTVRFMLTRMEKQAQRQWPKDRIWVFKNHPRFFGTPMMDAVLNNSSLDKEMV 2202
Query: 2430 HWLKHRPAIF 2439
HWLK R +F
Sbjct: 2203 HWLKTRSNVF 2212
>gi|413921170|gb|AFW61102.1| hypothetical protein ZEAMMB73_524379 [Zea mays]
Length = 2278
Score = 1967 bits (5097), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1081/2140 (50%), Positives = 1384/2140 (64%), Gaps = 143/2140 (6%)
Query: 362 HSEHYSRHSVEKFHRNS--SSSRISSLDKYSSRHHEPSLSSRVIYDRHGRSPSHSDRSPH 419
H S S K HR + S+ S R+ E S +R DRH RSP R PH
Sbjct: 218 HHTDTSDQSGSKSHRKGPKGEGKRSAARHLSGRNREISSPTRDRRDRHERSPGILGRFPH 277
Query: 420 DRGRYYDHRDRSPSRHDRSPY--------TRDRSPYTFDRSPYSRERSPYNRDRSPYARE 471
+R R+ D DRSPSR +RSP+ +RD SPY SP R R P+ RD +P +
Sbjct: 278 ERSRH-DRYDRSPSRLERSPHRERARHYESRDHSPYV---SPRHRARQPHFRDNTPSRVD 333
Query: 472 KSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNNHREAS-- 529
P R + D R+RSPF HDRS P R RP + EAS
Sbjct: 334 NFPRGRVQREDVRDRSPF-----------LHDRS----------PSERFRPTDTHEASKK 372
Query: 530 SKTGASEKRNARYDSKGHEDKL---GPKDSNARCSRSSAKESQDKSNVQDLNVSDEKTAN 586
S++G + +++ + G + N + S ES + +
Sbjct: 373 SRSGNNSEKSQHKSKSAKQSSKTKSGSNEKNEKISNEKPTESSKYTELPPPPPLPLPPPP 432
Query: 587 CESHKEEQPQSSSVDCKEPPQVDGPPLEELVSMEEDMDICDTPPHVPAVTDSS-----VG 641
+ PP P M EDMDICDTPPH A + + +G
Sbjct: 433 PPPPPPPPLPPAVPPPLPPPPEPEPTGVLAEDMIEDMDICDTPPHTSAAPEPTDPIYDIG 492
Query: 642 KWFYLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPS 701
+WFYLDH G+E GPS+L LK LVE+G L+SDH IKH DSNRW TVENA SPLV +FPS
Sbjct: 493 RWFYLDHFGIEQGPSKLAVLKKLVEDGYLLSDHLIKHADSNRWVTVENAASPLVPSDFPS 552
Query: 702 ITSDSVTQLVSPPEASGNLLADTGDTAQ--STGEEFPVTLQSQCCPDGSAAAAESSEDLH 759
SD+ TQ+V+PPEA GNLL + + A ++G E ++ A+AE SE+ +
Sbjct: 553 FYSDTSTQMVNPPEAPGNLLDEALEEASNLASGSEDKQMVE---------ASAEDSEEFY 603
Query: 760 IDVRVGALLDGFTVIPGKEIETLGEILQTTFERVDWQNNGGPT----WHGACVGEQKPGD 815
I+ RV AL+DG ++ G+E+E +GE+L F+ DWQ P +H G+ D
Sbjct: 604 INDRVEALMDGSILVHGQELEIIGELLGADFQPADWQRLSHPEDFTRFHVHIEGD----D 659
Query: 816 QKVDELYISDTKMKEAAELKSGDKDHWVVCFDSDEWFSGRWSCKGGDWKRNDEAAQDRCS 875
+ + + + +A L S D H V +S EWFSGRWSCKGGDW+RNDE QD
Sbjct: 660 EIIGGTEFLENRTTDAYGLVSVDNFHHYV--ESSEWFSGRWSCKGGDWRRNDELGQDTPF 717
Query: 876 RKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYACPDERNDGSGGS 935
RKK VLN+G+PLCQ+PK YEDPR KD+LYYP ++ DLP WA++ +E DG +
Sbjct: 718 RKKLVLNEGYPLCQIPKGSYEDPRRPCKDELYYPVRGKKHDLPLWAFSSTEEDIDGVNDT 777
Query: 936 RSTQSKLAAVRGVKG----------TMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSS 985
+K V G G ML VV IN V+ D S EPR+K R +R S
Sbjct: 778 ----TKNTVVPGRPGQTRQPPSEVKVMLQVVSINYHVIKDQSSV--EPRTKPRGTDRPPS 831
Query: 986 RSARSYSSANDVRRSSAESDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGE 1045
RS+RS+S + R S + SH + ++ DSQ KS + N PKD +CTVD+L + G+
Sbjct: 832 RSSRSHSIGAE-RSSIHDGSSHFRKHHDHDSQSFHKSKSVPNIPKDHVCTVDELSVNRGD 890
Query: 1046 WYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRNHG 1105
WYYLDG GH++GP S+SELQ LV + I + +SVFRK D W P+ + +S
Sbjct: 891 WYYLDGTGHDQGPFSYSELQELVKKDTIIEQSSVFRKIDNTWFPVLKDLKPGSSVPSAAP 950
Query: 1106 EKIMPSGDSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNR 1165
+ + + P Q V N S++FH +HPQF GYTRGKLHELVMK +K+R
Sbjct: 951 SSNLIAAFTH---PDQYNFGV------NQGSSSFHELHPQFAGYTRGKLHELVMKYFKSR 1001
Query: 1166 EFAAAINEVLDPWINAKQPKKETEHVYRKSEG-------DTRAGKRARLLVRESDGDEET 1218
E AINEVLDPWI+AKQPKKE E + + D + KRARLL +SD +
Sbjct: 1002 ELTLAINEVLDPWISAKQPKKEFEAYFSHNSASRNFLPEDGGSAKRARLLPDQSDENIHL 1061
Query: 1219 EEELQTIQDEST-FEDLCGDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSL 1277
E++ + E FE+LC AS +S + + WGLL+ H LA +FHF+R+D+KSL
Sbjct: 1062 SEDIIASRKEDICFEELCDGASSVDNDSVNPRAGNASWGLLNDHLLARIFHFMRADLKSL 1121
Query: 1278 AFASLTCRHWRAAVRFYKGISRQVDLSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCTN 1337
++ TC+ W AA ++Y+ + R +DLSSVGP CTDS+ + F+K+ + +++L GC+N
Sbjct: 1122 ISSAATCKSWNAAAKYYRNMCRFIDLSSVGPLCTDSVFCDIMAGFEKQNIRTLILAGCSN 1181
Query: 1338 ITSGMLEEILQSFPHLSSIDIRGCGQFGELALKFPNINWVKSQKSRGAKFNDSRSKIRSL 1397
++S L +L+ P +S + I+GC G+L KF ++ W++S + + K+++L
Sbjct: 1182 LSSHALGRVLEHLPQISYVHIQGCSHLGDLKNKFQHVKWIRSSLNPEGSYR----KMKTL 1237
Query: 1398 KQITEKSSSAPKSKGLGDDMDDFGDLKDYFESVDKRDSANQ-SFRRSLYQRSKVFDARKS 1456
KQI + ++ A K D +DD +L YF + K + A+ SF + Y+RSK+ DARKS
Sbjct: 1238 KQIGDGNNYASKVARNFDQLDDSDELDGYFADISKIEGASLFSFGQGFYKRSKLLDARKS 1297
Query: 1457 SSILSRDARMRRWSIKKSENGYKRMEEFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGY 1516
S++LSRDA MRR +++EN Y++MEEF+ + L+EIMR N F+FF+PKV++IEGR+K GY
Sbjct: 1298 SAVLSRDAEMRRLMQRQAENSYRKMEEFVINRLREIMRCNRFDFFIPKVSKIEGRLKNGY 1357
Query: 1517 YISHGLGSVKDDISRMCRDAIKAKNRGSAGDMNRITTLFIQLATRLEQGAKSSYYEREEM 1576
Y HG ++K DI MC+DA++ K+ D+ +I FIQLA RL + Y E
Sbjct: 1358 YARHGFRTIKHDIRTMCQDALRYKDGNDLDDVKQIVVSFIQLAKRL----GNPRYISERN 1413
Query: 1577 MKSWKDESPAGLYSATSKYKKKLSKMVSERKYMNRSNGTSLANGDFDYGEYASDREIRKR 1636
+ +D YS +K KKK +K R+N L D A D EI++
Sbjct: 1414 GAAAQDSLDISQYSFDTKLKKKQNK--------TRAN---LVAAGADNSSRAFDLEIKRS 1462
Query: 1637 LSKLNRKSLDSGSETSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAGDFTT 1696
LSKL +K + SGSETSDD DG SE +++SE+TVSDT+SD D S A + +G T
Sbjct: 1463 LSKLKKKDVCSGSETSDD-DGYSEGDETESETTVSDTESDFDVNSG--AWDLKGNCLKLT 1519
Query: 1697 DEGLDFSDDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKL 1756
+ G DDR GARMTKASLVPPVTRKYEVI++Y+IVAD E+V+RKMRVSLP+DY+EKL
Sbjct: 1520 EHGESVIDDRILGARMTKASLVPPVTRKYEVIEEYLIVADVEEVQRKMRVSLPDDYSEKL 1579
Query: 1757 NAQKNGSEELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLE 1816
+QKNG+E L ELPEVKDY+PRK GD++ EQEVYGIDPYTHNLL D MP +L+ + +
Sbjct: 1580 LSQKNGTENL--ELPEVKDYQPRKVAGDEILEQEVYGIDPYTHNLLSDIMPSDLELSPTD 1637
Query: 1817 KHLFIEDVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILK 1876
KH+FIE++LL LNKQVRHFTG GNTPM Y ++PVIEEI++ A D D RT+KMC G+LK
Sbjct: 1638 KHIFIEELLLNALNKQVRHFTGLGNTPMTYNIRPVIEEIQRSAEDSGDRRTLKMCLGMLK 1697
Query: 1877 AMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNN 1936
+M +R D +VAYRKGLGVVCNK+GGFG DDFVVEF GEVYP W+W+EKQDGI+ +Q N+
Sbjct: 1698 SMRNRSDQNFVAYRKGLGVVCNKKGGFGVDDFVVEFFGEVYPSWRWYEKQDGIKHIQNNS 1757
Query: 1937 EDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAK---------- 1986
ED APEFYNI LERPKGD GYDLV VDAMHKANYASRICHSC PNCEAK
Sbjct: 1758 EDQAPEFYNIMLERPKGDRHGYDLVFVDAMHKANYASRICHSCNPNCEAKKKRIYTYDYA 1817
Query: 1987 -------VTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGS 2039
VTAVDG YQIG+YT+R I GEEITFDYNSVTESKEE+EASVCLCGSQVCRGS
Sbjct: 1818 KADMRAIVTAVDGKYQIGVYTLRPIAEGEEITFDYNSVTESKEEHEASVCLCGSQVCRGS 1877
Query: 2040 YLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNW 2099
YLN +GEGAFEKVL E HG+LDRH L+L+ACE +SVS++D ++LGRAGLG+CLL GLP W
Sbjct: 1878 YLNFSGEGAFEKVLMEFHGVLDRHSLLLQACETDSVSQQDLIDLGRAGLGTCLLAGLPVW 1937
Query: 2100 VVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQ 2159
+VAY+A LVRFI LER KLP+EILRHN++EKR++ +I ++ EK+DAEVQAEGV N RLQ
Sbjct: 1938 LVAYTAHLVRFIYLERQKLPDEILRHNVDEKRQFLIEINMDSEKNDAEVQAEGVLNSRLQ 1997
Query: 2160 NLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEE 2219
+ TLDKVRYVMRC+FGDPK APPP+ RLS + VS +WKG+ S+V EL+Q M PHVEE
Sbjct: 1998 QIVHTLDKVRYVMRCIFGDPKNAPPPMVRLSGKSLVSAIWKGDSSIVAELLQSMEPHVEE 2057
Query: 2220 DVLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYT 2279
+VL+DLK+KI AHDPS SEDI+ +R SLLWLRDE+R LPCTYKCRHDAAADLIH+YAYT
Sbjct: 2058 EVLSDLKAKICAHDPSDSEDIEGGIRNSLLWLRDELRTLPCTYKCRHDAAADLIHLYAYT 2117
Query: 2280 KCFFRVQEYKAFTSPPVYISPLDLGPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTN 2339
KCFFRV++YK SPPV+ISPLDLGPKYADKLG Q Y KTY ENYCL QLI+W+ Q N
Sbjct: 2118 KCFFRVRDYKTVKSPPVHISPLDLGPKYADKLGPGFQEYCKTYPENYCLAQLIYWYSQ-N 2176
Query: 2340 ADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPWPKDR 2399
++P+ L RA +GC+SLPD+ SFY K KP + RVYG +TVRFMLSRMEKQ QRPWPKDR
Sbjct: 2177 SEPESRLTRARKGCMSLPDVSSFYVKSLKPLQERVYGNRTVRFMLSRMEKQAQRPWPKDR 2236
Query: 2400 IWAFKSSPRIFGSPMLDSSLTGCPLDREMVHWLKHRPAIF 2439
IW FKS PR FGSPM+D+ L PLD+EMVHWLK RP +F
Sbjct: 2237 IWVFKSDPRYFGSPMMDAVLNNSPLDKEMVHWLKTRPNVF 2276
>gi|218200574|gb|EEC83001.1| hypothetical protein OsI_28046 [Oryza sativa Indica Group]
Length = 2000
Score = 1368 bits (3540), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 657/994 (66%), Positives = 783/994 (78%), Gaps = 19/994 (1%)
Query: 1446 QRSKVFDARKSSSILSRDARMRRWSIKKSENGYKRMEEFLASSLKEIMRVNTFEFFVPKV 1505
+RSK D RKSS++LSRDA+MRR +K+EN Y++MEEF+ + LKEIM+ + F+FFVPKV
Sbjct: 1024 ERSKWLDIRKSSAVLSRDAQMRRLMQRKAENSYRKMEEFVINKLKEIMKSSRFDFFVPKV 1083
Query: 1506 AEIEGRMKKGYYISHGLGSVKDDISRMCRDAIKAKNRGSAGDMNRITTLFIQLATRLEQG 1565
A+IE R+K GYY HG +K+DI MCRDA++ K R GDM +I FIQLA +LE
Sbjct: 1084 AKIEVRLKNGYYARHGFSYIKNDIRSMCRDALRYKGRSDLGDMKQIVVAFIQLAKKLENP 1143
Query: 1566 AKSSYYEREEMMKSWKDESPAGLYSATSKYKKKLSKMVSERKYMNRSNGTSLANGDFDYG 1625
S + + K D S YS+ K KKK SK +SER+ G + D
Sbjct: 1144 RLISDRDGTAVQK---DSSDMSQYSSDLKLKKKQSKTMSERR------GANWTTAGADPS 1194
Query: 1626 EYASDREIRKRLSKLNRKSLDSGSETSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRA 1685
A DREI++ LSKL ++ +DSGSETSDD DG SE +++SE+TVSDT+SD+D S
Sbjct: 1195 SRAFDREIKRSLSKLKKRDIDSGSETSDDDDGYSEGDETESETTVSDTESDLDVNSGAWD 1254
Query: 1686 RESRGAGDFTTDEGLDFSDDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMR 1745
+ G F + E L +DDR WGARMTKASLVPPVTRKYEVI++Y+IVADEE+V RKMR
Sbjct: 1255 LKGNGMKLFESSESL--TDDRGWGARMTKASLVPPVTRKYEVIEKYLIVADEEEVLRKMR 1312
Query: 1746 VSLPEDYAEKLNAQKNGSEELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDS 1805
V+LP+DY+EKL +QKNG+E L ELPEVKDY+PRK GD+V EQEVYGIDPYTHNLLL+
Sbjct: 1313 VALPDDYSEKLLSQKNGTENL--ELPEVKDYQPRKVPGDEVLEQEVYGIDPYTHNLLLEM 1370
Query: 1806 MPDELDWNLLEKHLFIEDVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDV 1865
MP ELDW +KH F+E++LL TLNKQVR FTG+GNTPM+YPL+PVIEEI+K A + D
Sbjct: 1371 MPTELDWPSSDKHTFVEELLLNTLNKQVRQFTGSGNTPMVYPLKPVIEEIQKSAEESGDR 1430
Query: 1866 RTMKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEK 1925
RT KMC G+LKAM + P+ Y GLGVVCNK GGFG DDFV+EF GEVYP W+W+EK
Sbjct: 1431 RTSKMCLGMLKAMRNHPE-----YNYGLGVVCNKTGGFGVDDFVIEFFGEVYPSWRWYEK 1485
Query: 1926 QDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEA 1985
QDGI+ +Q N++D APEFYNI LERPKGD DGYDLV VDAMHKANYASRICHSC PNCEA
Sbjct: 1486 QDGIKHIQNNSDDQAPEFYNIMLERPKGDRDGYDLVFVDAMHKANYASRICHSCNPNCEA 1545
Query: 1986 KVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTG 2045
KVTAVDGHYQIGIYTVR I GEEITFDYNSVTESKEE+EASVCLCGSQ+CRGSYLN +G
Sbjct: 1546 KVTAVDGHYQIGIYTVRPIAEGEEITFDYNSVTESKEEHEASVCLCGSQICRGSYLNFSG 1605
Query: 2046 EGAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSA 2105
EGAFEKVL E HG+LDRH L+L+ACE NSVS++D ++LGRAGLG+CLL GLP W+VAY+A
Sbjct: 1606 EGAFEKVLMEFHGVLDRHSLLLQACEANSVSQQDLIDLGRAGLGTCLLAGLPGWLVAYTA 1665
Query: 2106 RLVRFINLERTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTL 2165
LVRFI ER KLP EI +HN++EKR++F+DI ++ EK+DAEVQAEGV N RLQNL TL
Sbjct: 1666 HLVRFIFFERQKLPHEIFKHNVDEKRQFFTDINMDSEKNDAEVQAEGVLNSRLQNLTHTL 1725
Query: 2166 DKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDL 2225
DKVRYVMRC+FGDPK APPP+ RL+ VS +WKGEGSLV+EL++ M PHVEEDVL DL
Sbjct: 1726 DKVRYVMRCIFGDPKNAPPPLVRLTGRSLVSAIWKGEGSLVDELLESMEPHVEEDVLTDL 1785
Query: 2226 KSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRV 2285
K+KI+AHDPSGSEDI+ E+R SLLWLRDE+R L CTYKCRHDAAADLIH+YAYTKCFFRV
Sbjct: 1786 KAKIRAHDPSGSEDIEGEIRSSLLWLRDELRTLSCTYKCRHDAAADLIHMYAYTKCFFRV 1845
Query: 2286 QEYKAFTSPPVYISPLDLGPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCT 2345
++YK SPPV ISPLDLGPKYADKLG Q Y KTY ENYCLGQLI+W+ Q NA+P+
Sbjct: 1846 RDYKTVKSPPVLISPLDLGPKYADKLGPGFQEYCKTYPENYCLGQLIYWYSQ-NAEPESR 1904
Query: 2346 LARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKS 2405
L RA +GC+SLPD+ SFY K KP++ RVYG +TVRFML+RME Q QRPWPKDRIW FKS
Sbjct: 1905 LTRARKGCMSLPDVSSFYVKSVKPTQERVYGSRTVRFMLARMENQAQRPWPKDRIWVFKS 1964
Query: 2406 SPRIFGSPMLDSSLTGCPLDREMVHWLKHRPAIF 2439
PR FG+PM+D+ L PLD+EM+HWLK R +F
Sbjct: 1965 DPRFFGTPMMDAVLNNSPLDKEMMHWLKTRSNVF 1998
Score = 610 bits (1573), Expect = e-171, Method: Compositional matrix adjust.
Identities = 349/741 (47%), Positives = 456/741 (61%), Gaps = 62/741 (8%)
Query: 609 DGPPLEELVSMEEDMDICDTPPHV----PAVTD---SSVGKWFYLDHCGMECGPSRLCDL 661
+G P E+ VSMEEDMDICDTPPH P T+ S VGKWFYLDH G+E GPS+L DL
Sbjct: 326 NGAPAED-VSMEEDMDICDTPPHTTSSAPEPTEPPASDVGKWFYLDHYGIEQGPSKLADL 384
Query: 662 KTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSITSDSVTQLVSPPEASGNLL 721
K LVE+G L+SDH IKH DSNRW TVENA SPLV FPS+ SD TQLVSPPEA GNLL
Sbjct: 385 KKLVEDGYLLSDHLIKHADSNRWVTVENAASPLVPSEFPSVYSDVSTQLVSPPEAPGNLL 444
Query: 722 ADTGDTAQSTGEEFPVTLQSQCCPDGSAAAAESSEDLHIDVRVGALLDGFTVIPGKEIET 781
+ + A T E A+AE ED +ID RV AL+DG ++ G+E+E
Sbjct: 445 DEAREEASGTDHE-----------QMKEASAEEQEDFYIDDRVDALMDGSIMVDGQELEI 493
Query: 782 LGEILQTTFERVDWQNNGGPTWHGACVGEQKPGDQKVDELYISDTKMKEAAELKSGDKDH 841
LGE+L FE V+W++ + E+ G ++ E D++ + ++D
Sbjct: 494 LGELLNAHFEPVNWESEDLSRFQVKL--ERDDGTKRSTEF--PDSRTAHIYGVVPAERDT 549
Query: 842 WVVCFDSDEWFSGRWSCKGGDWKRNDEAAQDRCSRKKQVLNDGFPLCQMPKSGYEDPRWN 901
+ +S EW+SGRWSCKGGDWKRND+ +QD+ RKK VLN+G+PLCQMPK +EDPRW
Sbjct: 550 YQPHIESSEWYSGRWSCKGGDWKRNDDFSQDKPYRKKLVLNEGYPLCQMPKGNHEDPRWV 609
Query: 902 QKDDLYYPSHSRRLDLPPWAYACPDERND--------GSGGSRSTQSKLAAVRGVKGTML 953
KDDLYYP +++LDLP WA++ +E +D G RS Q+K +GVKGT L
Sbjct: 610 CKDDLYYPLRAKKLDLPLWAFSSTEENDDTVDDASKSGVIPGRSGQTKQPP-KGVKGTTL 668
Query: 954 PVVRINACVVNDHGSFVSEPRSKVRAKERHSSRSARSYSSANDVRRSSAESDSHSKARNN 1013
PVV+INA VV D S SE R K + +R SRS+RS+S D R S+ E SHSK +
Sbjct: 669 PVVKINARVVKDQSS--SEHRIKPKVADRPPSRSSRSHSIGTD-RSSTHEGSSHSKKHHE 725
Query: 1014 QDSQGSWKSIACINTPKDRLCTVDDLQLQLGEWYYLDGAGHERGPSSFSELQVLVDQGCI 1073
DSQ KS + N PKD +CTV++L +++G+WYYLDG GHER P S+SELQ L +G I
Sbjct: 726 HDSQSLHKSKSVPNIPKDHVCTVEELSVKVGDWYYLDGTGHERVPFSYSELQELAKKGTI 785
Query: 1074 QKHTSVFRKFDKVWVPLTFATETSASTVRNHGEKIMPSGDSSGLPPTQSQDAVLGESNNN 1133
+ +SVFRK D W+P+ K + SG S+ S + L SN
Sbjct: 786 LEGSSVFRKIDNTWLPVL---------------KDLKSGCSARNGEAGSSTSALTHSNQ- 829
Query: 1134 VNSNAFHTMHPQFIGYTRGKLHELVMKSYKNREFAAAINEVLDPWINAKQPKKETEHVYR 1193
+ FH MHPQF+GYTRGKLHELVMK +K+RE AINEVL+PWI KQP+KE E +
Sbjct: 830 ---SNFHEMHPQFVGYTRGKLHELVMKYFKSRELTLAINEVLEPWIATKQPRKELETFFS 886
Query: 1194 KSEG-------DTRAGKRARLLVRESDG-DEETEEELQTIQDESTFEDLCGDASFPGEES 1245
S D + KRARLL +SD + +E+ L + +D+ FEDL A+ E
Sbjct: 887 HSSASKNFVQEDGGSTKRARLLPDQSDEYTDMSEDILASQKDDCCFEDLFEGAAHVKESP 946
Query: 1246 ASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRFYKGISRQVDLSS 1305
+S ES WGLL+ H LA +FHFLR+D+KSL ++ TC W A ++Y+ + R +DLSS
Sbjct: 947 LNSRTESESWGLLNEHVLARIFHFLRADVKSLISSAATCSWWNTAAKYYRSVCRFIDLSS 1006
Query: 1306 VGPNCTDSLIRKTLNAFDKEK 1326
+GP CTD++ + + ++ K
Sbjct: 1007 LGPQCTDNVFHDIMGSIERSK 1027
>gi|242078371|ref|XP_002443954.1| hypothetical protein SORBIDRAFT_07g005020 [Sorghum bicolor]
gi|241940304|gb|EES13449.1| hypothetical protein SORBIDRAFT_07g005020 [Sorghum bicolor]
Length = 2166
Score = 1021 bits (2639), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 579/1247 (46%), Positives = 791/1247 (63%), Gaps = 86/1247 (6%)
Query: 619 MEEDMDICDTPPHVPAVTDSS-----VGKWFYLDHCGMECGPSRLCDLKTLVEEGVLVSD 673
M EDMDICDTPPH + + +G+WFYLDH G+E GPS+L +LK LVE+G L+SD
Sbjct: 433 MIEDMDICDTPPHTSGAPEPTEPICDIGRWFYLDHFGIEQGPSKLAELKKLVEDGYLLSD 492
Query: 674 HFIKHLDSNRWETVENAVSPLVTVNFPSITSDSVTQLVSPPEASGNLLADTGDTAQ--ST 731
H IKH DSNRW TVENA SPLV +FPS+ SD+ TQ+V+PPEA GNLL + + A ++
Sbjct: 493 HLIKHADSNRWVTVENAASPLVPSDFPSLYSDTSTQMVNPPEAPGNLLDEALEEASNLAS 552
Query: 732 GEEFPVTLQSQCCPDGSAAAAESSEDLHIDVRVGALLDGFTVIPGKEIETLGEILQTTFE 791
G E Q A+AE SE+ +ID RV AL+DG ++ G+E+E +GE+L F+
Sbjct: 553 GAE-----DKQM----DEASAEDSEEFYIDDRVEALMDGSILVHGQELEIIGELLGADFQ 603
Query: 792 RVDWQNNGGPT----WHGACVGEQKPGDQKVDELYISDTKMKEAAELKSGDKDHWVVCFD 847
DWQ+ P +H G+ G E + + +A L S +K+++ +
Sbjct: 604 PADWQSWSHPEDFTRFHVHTEGDD--GINGGTEFL--ENRATDAYGLVSVEKNNFHHYVE 659
Query: 848 SDEWFSGRWSCKGGDWKRNDEAAQDRCSRKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLY 907
S EWFSGRWSCKGGDW RNDE +QD RKK VLN+G+PLCQMPK YEDPR KD+LY
Sbjct: 660 SSEWFSGRWSCKGGDWMRNDELSQDTPFRKKLVLNEGYPLCQMPKGSYEDPRRPCKDELY 719
Query: 908 YPSHSRRLDLPPWAYACPDERND--------GSGGSRSTQSKLAAVRGVKGTMLPVVRIN 959
YP +++ DLP WA++ +E D G R Q++ RGVKG MLPVVRIN
Sbjct: 720 YPVRAKKHDLPLWAFSSTEEDTDSVNDTTKSGVVPGRPGQTRQPP-RGVKGMMLPVVRIN 778
Query: 960 ACVVNDHGSFVSEPRSKVRAKERHSSRSARSYSSANDVRRSSAESDSHSKARNNQDSQGS 1019
+ VV D S EPR+K R +R SRS+RS+S RSS S + ++ DSQ
Sbjct: 779 SRVVKDQSSV--EPRTKPRGTDRPLSRSSRSHSIG--AERSSVHEGSTHRKHHDHDSQSL 834
Query: 1020 WKSIACINTPKDRLCTVDDLQLQLGEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSV 1079
KS + N PKDR+CTVD+L + G+WYYLDG GHE GP S+SELQ LV +G I + +SV
Sbjct: 835 HKSKSVPNIPKDRVCTVDELSVNRGDWYYLDGTGHEHGPFSYSELQELVKKGTIIEQSSV 894
Query: 1080 FRKFDKVWVPLTFATETSASTVRNHGEKIMPSGDSSGLPPTQSQDAVL--GESNNNVN-- 1135
FRK D W P+ + +S +PS S + S A++ + N VN
Sbjct: 895 FRKIDNTWFPVLKDLKPGSS---------VPSAARS----SNSTAALMHPDQYNFGVNQG 941
Query: 1136 SNAFHTMHPQFIGYTRGKLHELVMKSYKNREFAAAINEVLDPWINAKQPKKETEHVYRKS 1195
S +FH +HPQF+GYTRGKLHELVMK +K+RE AINEVLDPWI+AKQPKKE E + +
Sbjct: 942 SGSFHELHPQFVGYTRGKLHELVMKYFKSRELTLAINEVLDPWISAKQPKKEFEAYFSHN 1001
Query: 1196 EG-------DTRAGKRARLLVRESDGDEETEEE-LQTIQDESTFEDLC-GDASFPGEESA 1246
D + KRA+LL +SD D E+ L + +++ FE+LC G +S +S
Sbjct: 1002 SASRNFLPEDGGSAKRAKLLPDQSDEDIHLSEDILASRKEDICFEELCDGASSSVDNDSV 1061
Query: 1247 SSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRFYKGISRQVDLSSV 1306
+ + WGLL+GH LA +FHF+R+D+KSL ++ TCR W AA ++Y+ + R +DLSSV
Sbjct: 1062 NPRAGNESWGLLNGHVLARIFHFMRADVKSLISSAATCRSWNAAAKYYRNMCRFIDLSSV 1121
Query: 1307 GPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGMLEEILQSFPHLSSIDIRGCGQFGE 1366
GP CTDS+ + ++K+ + +++L GC+N++S L +L+ P +S + I+GCG G+
Sbjct: 1122 GPLCTDSVFCDIMAGYEKQNIRTLILAGCSNLSSHALGRVLEQLPQISYVHIQGCGHLGD 1181
Query: 1367 LALKFPNINWVKSQKSRGAKFNDSRSKIRSLKQITEKSSSAPK-SKGLGDDMDDFGDLKD 1425
L KF ++ W++S + +S K+++LKQI + ++ K ++ +D +L
Sbjct: 1182 LKSKFQHVKWIRSSLNP----EESYQKMKTLKQIGDGNNYTSKVARNFTSQLDGSDELDG 1237
Query: 1426 YFESVDKRDSANQSFRRSLYQRSKVFDARKSSSILSRDARMRRWSIKKSENGYKRMEEFL 1485
YF + R++AN SF + Y+RSK+ DARKSS++LSRDA MRR +++EN Y++MEEF+
Sbjct: 1238 YFADISNRENANLSFGQGFYKRSKLLDARKSSAVLSRDAEMRRLMQRQAENSYRKMEEFV 1297
Query: 1486 ASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGLGSVKDDISRMCRDAIKAKNRGSA 1545
+ L+EIMR N F+FF+PKVA+IEGR+K GYY HG ++K DI MC+DA++ K+ +
Sbjct: 1298 INRLREIMRSNRFDFFIPKVAKIEGRLKNGYYARHGFRTIKHDIRTMCQDALRYKDGNDS 1357
Query: 1546 GDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKDESPAGLYSATSKYKKKLSKMVSE 1605
GD+ +I FIQLA RL G ER + D YS +K KKK
Sbjct: 1358 GDIKQIVVSFIQLAKRL--GNPRHISERNGA--AAHDSLDISQYSFDTKLKKK------- 1406
Query: 1606 RKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRKSLDSGSETSDDLDGSSEDGKSD 1665
N++ G +L D A D EI++ LSKL +K + SGSETSDD D SE +++
Sbjct: 1407 ---QNKTRGANLVAAGADNSSRAFDLEIKRSLSKLKKKDVYSGSETSDDDDVYSEGDETE 1463
Query: 1666 SESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLDFSDDREWGARMTKASLVPPVTRKY 1725
SE+TVSDT+SD+D S A + +G G + G +DDR GARMTKASLVPPVTRKY
Sbjct: 1464 SETTVSDTESDLDVNSG--AWDLKGNGLKLIEPGESVTDDRILGARMTKASLVPPVTRKY 1521
Query: 1726 EVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEELDMELPEVKDYKPRKQLGDQ 1785
EVI++Y+IVAD E+V+RKMRV+LP+DY+EKL +QKNG+E L ELPEVKDY+PRK GD+
Sbjct: 1522 EVIEEYLIVADVEEVQRKMRVALPDDYSEKLLSQKNGTENL--ELPEVKDYQPRKVAGDE 1579
Query: 1786 VFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNKQ 1832
+ EQEVYGIDPYTHNLL D MP +L+ + +KH+FIE+ L NK+
Sbjct: 1580 ILEQEVYGIDPYTHNLLSDIMPADLELSPTDKHIFIEEGLGVVCNKK 1626
Score = 901 bits (2329), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 426/645 (66%), Positives = 507/645 (78%), Gaps = 18/645 (2%)
Query: 1812 WNLLEKHLFIEDV--LLRTL---------NKQVRHFTGTGNT--PMMYPLQP---VIEEI 1855
+ ++E++L + DV + R + K + GT N P + QP +EI
Sbjct: 1521 YEVIEEYLIVADVEEVQRKMRVALPDDYSEKLLSQKNGTENLELPEVKDYQPRKVAGDEI 1580
Query: 1856 EKEAVDDCDVRTMKMCRGILKA-MDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLG 1914
++ V D T + I+ A ++ P DK++ +GLGVVCNK+GGFG DDFVVEF G
Sbjct: 1581 LEQEVYGIDPYTHNLLSDIMPADLELSPTDKHIFIEEGLGVVCNKKGGFGVDDFVVEFFG 1640
Query: 1915 EVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASR 1974
EVYP W+W+EKQDGI+ +Q N+ED APEFYNI LERPKGD DGYDLV VDAMHKANYASR
Sbjct: 1641 EVYPSWRWYEKQDGIKHIQNNSEDQAPEFYNIMLERPKGDRDGYDLVFVDAMHKANYASR 1700
Query: 1975 ICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQ 2034
ICHSC PNCEAKVTAVDG YQIG+YT+R I GEEITFDYNSVTESKEE+EASVCLCGSQ
Sbjct: 1701 ICHSCNPNCEAKVTAVDGKYQIGVYTLRPIAEGEEITFDYNSVTESKEEHEASVCLCGSQ 1760
Query: 2035 VCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLG 2094
VCRGSYLN +GEGAFEKVL E HG+LDRH L+L+ACE +SVS++D ++LGRAGLG+CLL
Sbjct: 1761 VCRGSYLNFSGEGAFEKVLMEFHGVLDRHSLLLQACETDSVSQQDLIDLGRAGLGTCLLA 1820
Query: 2095 GLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVY 2154
GLP W+VAY+A LVRFI LER KLP+EILRHN++EKR++ +I ++ EK+DAEVQAEGV
Sbjct: 1821 GLPGWLVAYTANLVRFIYLERQKLPDEILRHNVDEKRQFLIEINMDSEKNDAEVQAEGVL 1880
Query: 2155 NQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLWKGEGSLVEELIQCMA 2214
N RLQ + TLDKVRYVMRCVFGDPK APPP+ RLS + VS +WKG+ S+V EL+Q M
Sbjct: 1881 NSRLQQIVHTLDKVRYVMRCVFGDPKNAPPPLVRLSGKSLVSAIWKGDSSIVAELLQSME 1940
Query: 2215 PHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIH 2274
PHVEE+VL+DLK KI+AHDP SEDI+ +R SLLWLRDE+R LPCTYKCRHDAAADLIH
Sbjct: 1941 PHVEEEVLSDLKVKIRAHDPPDSEDIEGGIRNSLLWLRDELRTLPCTYKCRHDAAADLIH 2000
Query: 2275 IYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKLGADLQVYRKTYGENYCLGQLIFW 2334
+YAYTKCFFRV++YK SPPV+ISPLDLGPKYADKLG Q Y KTY ENYCL QLI+W
Sbjct: 2001 LYAYTKCFFRVRDYKTVKSPPVHISPLDLGPKYADKLGPGFQEYCKTYPENYCLAQLIYW 2060
Query: 2335 HIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRP 2394
+ Q N++P+ L RA +GC+SLPD+ SFY K KPS+ RVYG +TVRFMLSRMEKQ QRP
Sbjct: 2061 YSQ-NSEPESRLTRARKGCMSLPDVSSFYVKSAKPSQERVYGNRTVRFMLSRMEKQAQRP 2119
Query: 2395 WPKDRIWAFKSSPRIFGSPMLDSSLTGCPLDREMVHWLKHRPAIF 2439
WPKDRIW FKS PR FGSPM+D+ L PLD+EMVHWLK RP +F
Sbjct: 2120 WPKDRIWVFKSDPRFFGSPMMDAVLNNSPLDKEMVHWLKTRPNVF 2164
>gi|168059519|ref|XP_001781749.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666751|gb|EDQ53397.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 2661
Score = 846 bits (2186), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 543/1419 (38%), Positives = 762/1419 (53%), Gaps = 214/1419 (15%)
Query: 1199 TRAGKRARLLVRESDGDEETEEELQTIQDESTFEDLCGDASFPGEESASSAIESGGWGLL 1258
T GK AR E D+ET E ++ S DA +SS I GW L
Sbjct: 1282 TSTGKNAR----EVSSDQETSETGAVHENLSNL-----DAG-----GSSSQI---GWAWL 1324
Query: 1259 DGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRFYKGISRQVDLSSVGPNCTDSLIRKT 1318
L V LR D KSL A TC+ W+ + K ++ VDLS +G CTD+ +
Sbjct: 1325 PTRILTKVLRLLRGDPKSLVAAMGTCQSWKDCAQDIKVSTKHVDLSGLGLRCTDA-VLGG 1383
Query: 1319 LNAFDKEKLNSILLVGCTNITSGMLEEILQSFPHLSSIDIRGCGQFGELALKFPNINWVK 1378
L F +L I L C N++S LE +L+S+P + + I GC + EL +P + WV
Sbjct: 1384 LLGFGGGQLKHITLDHCLNVSSKGLERLLKSYPSIREVGICGCARLIELVELYPQVRWVG 1443
Query: 1379 ---------------------SQKSRGAK--FNDSRSKIRSLKQI------TEKSSSAPK 1409
S KS G K + D + SL + TEKS S
Sbjct: 1444 NPFAVAHGIDTQRHGLRYNKLSSKSSGGKREYGDDVNSGGSLDETRYRDKSTEKSESPGT 1503
Query: 1410 SKGLGDDMD----------------DFGDLKD---------YFESVDKRDSANQSF--RR 1442
+ + + +D DF LK+ + S K + + +
Sbjct: 1504 IRRVVNSLDPLGKDVHASQMGYPCRDFKRLKENSMSGTRNGFCTSAGKHGISKRKLNSKS 1563
Query: 1443 SLYQRSKVFDARKSSSILSRDA----RMRRWSIKKSENGYKRMEEFLASSLKEIMRVNTF 1498
++ +S V + K + + S A + W++K +E K E+ +A +L+ +M ++
Sbjct: 1564 TIKSQSSVRGSLKGTPVSSEKADNVAKEVSWNVKDAE---KNPEKAMARALRVVMEADSE 1620
Query: 1499 EFFVPKVAE-------------------IEGRMKKGYYISH-GLGSVKDDISRMCRDAIK 1538
F E ++ ++K G+Y G+ K+D+ + R+A +
Sbjct: 1621 HLFHRMANEQVSEGRGAPTKSGQVDFCTVQKKLKLGHYGGRDGVKLFKEDLLQPLRNAFR 1680
Query: 1539 AKNRG----SAGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKDESPAGLYSATSK 1594
++ +AG + ++ Q L S + E+ + + SA +
Sbjct: 1681 LEHDSVIYKTAGRLFKVAHQVGQHLFNL-----LSKPQIRELADGQSKALSSCVTSARTL 1735
Query: 1595 YKKKLSKMVS-------ERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRKSL-- 1645
K K SK ++ +R + + + S+ +Y SD ++ R S L
Sbjct: 1736 LKSKESKDLATEKQRGPKRSWDSETGARSVKRKVLNYMNSRSDGDVLSRDSDLQGSIDRD 1795
Query: 1646 -----------------DSGSETSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARES 1688
++GS +D+D + D D+E++ SD S +D +
Sbjct: 1796 ERESRRDRMRRSQWAESEAGSSDEEDMDEALYD--EDTETSGSDVASKSGI-ADELVYDY 1852
Query: 1689 RGAGDF-TTDEGLDFSDDREW-GARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMR- 1745
R A D G R+W GARMTKA++VPP+TRKYEVI++Y IV D E V KM+
Sbjct: 1853 RDASDSDNIYSGFGEGSSRDWWGARMTKAAMVPPLTRKYEVIEEYRIVDDFERVASKMKR 1912
Query: 1746 ------------VSLPEDYAEKL-NAQKNGSEELDMELPEVKDYKPRKQLGDQVFEQEVY 1792
V LP+DY EKL A+K G +++PE+K+ +PRK+LG +V EQEVY
Sbjct: 1913 HCEVTSVNFWRKVILPDDYEEKLWAAKKVGDRYAHLDVPELKECRPRKRLGKEVLEQEVY 1972
Query: 1793 GIDPYTHNLLLDSMPDELD-WNLLEKHLFIEDVLLRTLNKQVRHFTGTGNTPMMYPLQPV 1851
GIDPYT+NLLL++MP + + + +K LFIE+ LLR LN++V FTG+G PM Y L+ V
Sbjct: 1973 GIDPYTYNLLLNTMPADTELFTEKQKQLFIEEKLLRALNREVSSFTGSGKAPMEYSLEKV 2032
Query: 1852 IEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVE 1911
I I +A D ++ CR +LK M +DKYVAYRKGLGVVCNK GF + DFVVE
Sbjct: 2033 IAHICGDAHADQPLQVF--CRSLLKNMQRHLNDKYVAYRKGLGVVCNKPEGFDDGDFVVE 2090
Query: 1912 FLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANY 1971
F GEVYP W+W+EKQDGIR+LQK ++PAPEFYNI ERPKGD+ GYD++VVDAMHKAN+
Sbjct: 2091 FFGEVYPPWRWYEKQDGIRALQKKEKEPAPEFYNIVFERPKGDSWGYDVLVVDAMHKANF 2150
Query: 1972 ASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLC 2031
ASR+CHSCRPNCEAKVTAV+G Y IG+YT+R I +GEE+TFDY VTESKEE+++SVCLC
Sbjct: 2151 ASRLCHSCRPNCEAKVTAVNGKYMIGVYTLRKIEFGEELTFDYCCVTESKEEHDSSVCLC 2210
Query: 2032 GSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSC 2091
GSQ C+GSYL TG GA+++VLKE HG+LDRH L+L+AC +V+ + +L +AGLG C
Sbjct: 2211 GSQGCKGSYLCYTGPGAYDEVLKECHGILDRHNLLLQACTSGAVTFREQEDLKQAGLGPC 2270
Query: 2092 LLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQA- 2150
LL GLP WV+ Y+A +V ++N ER +LP+E++ K + +++++ D EVQA
Sbjct: 2271 LLDGLPQWVIKYAAGIVSYLNFERQRLPDELM------KAEMLKHTGIDLDRQDVEVQAH 2324
Query: 2151 ---------------EGVYNQRLQNLAVTLDK--------------------------VR 2169
EGVYNQRLQNLA+TLDK VR
Sbjct: 2325 AALLSEGIPLETWMTEGVYNQRLQNLAITLDKVMPPPTISLCTGIVGVAEILSPLFLQVR 2384
Query: 2170 YVMRCVFG-DPKKAPPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSK 2228
+V+ ++G + KA PP+ L P E V ++W G+ S+V EL+QCMA H E L DL +
Sbjct: 2385 HVLTKLYGEEASKASPPLRMLEPHELVDYIWTGKDSVVGELLQCMAVHSPEG-LADLTRQ 2443
Query: 2229 IQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEY 2288
IQ H+P DI+ LR+SLLWLRD +R +P T RHDAAADLIH+YAYTK FF +Y
Sbjct: 2444 IQDHNPPPGGDIEENLRRSLLWLRDTLRKVPATCMGRHDAAADLIHLYAYTKHFFTNNDY 2503
Query: 2289 KAFTSPPVYISPLDLGPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLAR 2348
SPP+ I DLGPK++ GA ++RK+Y +NY GQLI W QT+ DP +L +
Sbjct: 2504 GLVDSPPILIYACDLGPKHS---GAGPYMWRKSYSKNYVWGQLISWFRQTSVDPGASLVQ 2560
Query: 2349 ASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPWPKD---RIWAFKS 2405
RGCL LPDI S YA+ + Y K + M+ ME PQ+ W + +W FKS
Sbjct: 2561 DRRGCLMLPDISSCYARTIQHDFRCGYSDKDRKKMIMHMETYPQKKWTRKLTPELWNFKS 2620
Query: 2406 SPRIFGSPMLDSSLTGCPLDREMVHWLKHRPAIFQAMWD 2444
+FGSPMLD+++ L++E + WLK R +F WD
Sbjct: 2621 DRGLFGSPMLDAAVAKTKLNKECMQWLKTRDTVFHGPWD 2659
Score = 107 bits (266), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/165 (36%), Positives = 89/165 (53%), Gaps = 23/165 (13%)
Query: 1038 DLQLQLGEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFAT--- 1094
+LQL+ G W+YLD AGHERGP + S L+ +V +G + SV RK D +WVP++
Sbjct: 1039 ELQLESGVWHYLDAAGHERGPFTLSALKGIVAEGGLPAGASVLRKRDNLWVPVSHLVQYY 1098
Query: 1095 ----------------ETSASTVRNHGEKIMPSGDSSGLPPTQSQDAVLGESNNNVNSNA 1138
E SA+ VR+ + + P + A+ G ++V+S+
Sbjct: 1099 DAHSPAFLSKLQPDYLERSANLVRSAASTVASTVQDPAHP---ANLALKGLDVHSVSSST 1155
Query: 1139 FHTMHPQFIGYTRGKLHELVMKSYKNREFAAAINEVLDPWINAKQ 1183
FH PQF+GYT GKLHE VMKS++ FA N+ LD W ++K+
Sbjct: 1156 FHNELPQFLGYTNGKLHEYVMKSFRG-SFAGFFNDALDVWSSSKR 1199
>gi|168035499|ref|XP_001770247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678464|gb|EDQ64922.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 2852
Score = 835 bits (2158), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/800 (52%), Positives = 543/800 (67%), Gaps = 36/800 (4%)
Query: 1664 SDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLDFSDDREWGARMTKASLVPPVTR 1723
SD S D MD +DG + D + G + S WGARMTKA++VPP+TR
Sbjct: 2068 SDVASKSGIADEAMDDYNDGSESD-----DAYSGYGGEGSSRDWWGARMTKAAMVPPLTR 2122
Query: 1724 KYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEEL-DMELPEVKDYKPRKQL 1782
KYEVI++Y IV D E V KM+V LP+DY EKL K G + +++PE+K++KPRK+L
Sbjct: 2123 KYEVIEEYRIVDDYERVVAKMKVELPDDYEEKLRVAKKGGDRFAHLDVPELKEFKPRKRL 2182
Query: 1783 GDQVFEQEVYGIDPYTHNLLLDSMP-DELDWNLLEKHLFIEDV-------------LLRT 1828
G++V EQEVYGIDPYT+NLLL++MP D + +K LFIE+ LLR
Sbjct: 2183 GEEVLEQEVYGIDPYTYNLLLNTMPADTESFTDKQKQLFIEEAFSDEDCSLDMLQKLLRA 2242
Query: 1829 LNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVA 1888
LN++V FTG+G PM Y L+ VI I + D ++ CR +LK M S +DKYVA
Sbjct: 2243 LNREVSSFTGSGKAPMEYSLEKVISHICSDVHADQPLQVF--CRSLLKNMKSHLNDKYVA 2300
Query: 1889 YRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYL 1948
YRKGLGVVCNK GF + DFVVEF GEVYP W+W+EKQDGIR+LQK ++PAPEFYNI
Sbjct: 2301 YRKGLGVVCNKPEGFDDGDFVVEFFGEVYPPWRWYEKQDGIRALQKKEKEPAPEFYNIVF 2360
Query: 1949 ERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGE 2008
ERPKGD+ GYD++VVDAMHKAN+ASR+CHSCRPNCEAKVTAV+G Y IG+YT+R I +GE
Sbjct: 2361 ERPKGDSLGYDVLVVDAMHKANFASRLCHSCRPNCEAKVTAVNGKYMIGVYTLRKIEFGE 2420
Query: 2009 EITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLE 2068
E+TFDY VTESKEE+++SVCLCGSQ C+GSYL TG GA+++VLKE HG+LDRH L+L+
Sbjct: 2421 ELTFDYCCVTESKEEHDSSVCLCGSQGCKGSYLCYTGPGAYDEVLKEYHGILDRHNLLLQ 2480
Query: 2069 ACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLE 2128
AC +V+ + +L +AGLG CLL GLP WV+ Y+A +V ++N ER +LP+E++
Sbjct: 2481 ACTSGAVTFREQEDLKQAGLGPCLLDGLPQWVIKYAAGIVSYLNFERQRLPDELM----- 2535
Query: 2129 EKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFG-DPKKAPPPVE 2187
K + +++++ D EVQ EGVYNQRLQNLA+TLDKVR+++ ++G + KAPPP+
Sbjct: 2536 -KAEMLKQTGIDIDRQDIEVQTEGVYNQRLQNLAITLDKVRHILVKLYGEEASKAPPPLR 2594
Query: 2188 RLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKS 2247
L P E V +W G+ S+V EL+QCMA H E L DL +IQ H+P DI+ LRKS
Sbjct: 2595 MLEPHELVKCIWTGKDSIVGELLQCMALHSPEG-LADLTKQIQEHNPPPGGDIEENLRKS 2653
Query: 2248 LLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKY 2307
LLWLRD +R +P T RHDAAADLIH+YAYTK FF +Y SPP+ I DLGPK+
Sbjct: 2654 LLWLRDTLRKVPATCMGRHDAAADLIHLYAYTKHFFTNYDYGMVDSPPILIYACDLGPKH 2713
Query: 2308 ADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQ 2367
+ GA ++RK+Y +NY GQLI W QT+ADP +L + RGCL LPDI S YA+
Sbjct: 2714 S---GAGPYMWRKSYSKNYIWGQLISWFRQTSADPGASLVQDRRGCLMLPDISSCYARTI 2770
Query: 2368 KPSRHRVYGPKTVRFMLSRMEKQPQRPWPKD---RIWAFKSSPRIFGSPMLDSSLTGCPL 2424
+ Y K + M+ ME PQ+ W + +W FKS +FGSPMLD+++ L
Sbjct: 2771 QHDFRCGYSDKDRKRMILHMETHPQKKWTRKFTPELWNFKSDRGLFGSPMLDAAVAKTKL 2830
Query: 2425 DREMVHWLKHRPAIFQAMWD 2444
++E +HWLK R +F WD
Sbjct: 2831 NKECIHWLKTRETVFHGPWD 2850
Score = 113 bits (282), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/181 (37%), Positives = 91/181 (50%), Gaps = 20/181 (11%)
Query: 1022 SIACINTPKDRLCTVDDLQLQLGEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFR 1081
+ C P + +LQL+ G W+YLD AGHERGP S LQ V +G + SVFR
Sbjct: 1174 AFKCTKNPASVVLRKHELQLESGVWHYLDAAGHERGPFLMSALQGFVAEGGLPAGASVFR 1233
Query: 1082 KFDKVWVPLTFATETSASTVRNHGEKIMP-SGDSSGLP--PTQSQDAVLGESN------- 1131
K D +WVP++ + + K +P S + SG P P A+ E
Sbjct: 1234 KRDNLWVPVSHLIQLHNAHAPAFASKPLPHSLERSGYPVRPAVPSGAISCEDPAHTAHPA 1293
Query: 1132 ---------NNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNREFAAAINEVLDPWINAK 1182
+V+S+ FH HPQF+GYT GKLHE MKS++ FA N+ LD W N+K
Sbjct: 1294 HPDLMELDVRSVSSSIFHDDHPQFLGYTNGKLHEHAMKSFRG-SFAGFFNDALDVWSNSK 1352
Query: 1183 Q 1183
+
Sbjct: 1353 R 1353
Score = 79.3 bits (194), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/169 (30%), Positives = 81/169 (47%), Gaps = 12/169 (7%)
Query: 1254 GWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRFYKGISRQVDLSSVGPNCTDS 1313
G L L V LR D KSL A TC+ W+ + + ++ VDLS +G +C D+
Sbjct: 1547 GRAWLPPRILTKVLRLLRGDPKSLVAAMATCQSWKNCAQSIRMSTKHVDLSGLGSHCNDA 1606
Query: 1314 LIRKTLNAFDKEKLNSILLVGCTNITSGMLEEILQSFPHLSSIDIRGCGQFGELALKFPN 1373
+I L KL I L C N++S L +L+S+P + + I GC Q EL +P
Sbjct: 1607 IIGGLLGF-GGGKLRRITLDYCLNVSSKALGRLLKSYPSIREVSISGCVQLSELVELYPQ 1665
Query: 1374 INWVKSQKSRGAKFNDSRSKIRSLKQITEKSSSAPKSKGL----GDDMD 1418
++WV + + R +++ K + PKS G+ GDD++
Sbjct: 1666 VSWVGNPFAIPHGLESQRHNLKNNK-------TNPKSSGIKREFGDDVN 1707
>gi|115475081|ref|NP_001061137.1| Os08g0180100 [Oryza sativa Japonica Group]
gi|46805056|dbj|BAD17037.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|113623106|dbj|BAF23051.1| Os08g0180100 [Oryza sativa Japonica Group]
Length = 494
Score = 791 bits (2042), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/492 (74%), Positives = 418/492 (84%), Gaps = 1/492 (0%)
Query: 1948 LERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYG 2007
LERPKGD DGYDLV VDAMHKANYASRICHSC PNCEAKVTAVDGHYQIGIYTVR I G
Sbjct: 2 LERPKGDRDGYDLVFVDAMHKANYASRICHSCNPNCEAKVTAVDGHYQIGIYTVRPIAEG 61
Query: 2008 EEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLML 2067
EEITFDYNSVTESKEE+EASVCLCGSQ+CRGSYLN +GEGAFEKVL E HG+LDRH L+L
Sbjct: 62 EEITFDYNSVTESKEEHEASVCLCGSQICRGSYLNFSGEGAFEKVLMEFHGVLDRHSLLL 121
Query: 2068 EACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNL 2127
+ACE NSVS++D ++LGRAGLG+CLL GLP W+VAY+A LVRFI ER KLP EI +HN+
Sbjct: 122 QACEANSVSQQDLIDLGRAGLGTCLLAGLPGWLVAYTAHLVRFIFFERQKLPHEIFKHNV 181
Query: 2128 EEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVE 2187
+EKR++F+DI ++ EK+DAEVQAEGV N RLQNL TLDKVRYVMRC+FGDPK APPP+
Sbjct: 182 DEKRQFFTDINMDSEKNDAEVQAEGVLNSRLQNLTHTLDKVRYVMRCIFGDPKNAPPPLV 241
Query: 2188 RLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKS 2247
RL+ VS +WKGEGSLV+EL++ M PHVEEDVL DLK+KI+AHDPSGSEDI+ E+R S
Sbjct: 242 RLTGRSLVSAIWKGEGSLVDELLESMEPHVEEDVLTDLKAKIRAHDPSGSEDIEGEIRSS 301
Query: 2248 LLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKY 2307
LLWLRDE+R L CTYKCRHDAAADLIH+YAYTKCFFRV++YK SPPV ISPLDLGPKY
Sbjct: 302 LLWLRDELRTLSCTYKCRHDAAADLIHMYAYTKCFFRVRDYKTVKSPPVLISPLDLGPKY 361
Query: 2308 ADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQ 2367
ADKLG Q Y KTY ENYCLGQLI+W+ Q NA+P+ L RA +GC+SLPD+ SFY K
Sbjct: 362 ADKLGPGFQEYCKTYPENYCLGQLIYWYSQ-NAEPESRLTRARKGCMSLPDVSSFYVKSV 420
Query: 2368 KPSRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSLTGCPLDRE 2427
KP++ RVYG +TVRFML+RME Q QRPWPKDRIW FKS PR FG+PM+D+ L PLD+E
Sbjct: 421 KPTQERVYGSRTVRFMLARMENQAQRPWPKDRIWVFKSDPRFFGTPMMDAVLNNSPLDKE 480
Query: 2428 MVHWLKHRPAIF 2439
MVHWLK R +F
Sbjct: 481 MVHWLKTRSNVF 492
>gi|302821685|ref|XP_002992504.1| hypothetical protein SELMODRAFT_448778 [Selaginella moellendorffii]
gi|300139706|gb|EFJ06442.1| hypothetical protein SELMODRAFT_448778 [Selaginella moellendorffii]
Length = 1806
Score = 672 bits (1733), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/525 (61%), Positives = 406/525 (77%), Gaps = 8/525 (1%)
Query: 1704 DDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQK-NG 1762
D REWGARMTKA++VPPVTRKYE+I+ Y IV D++ V RKM+V +P+DY EKL A K
Sbjct: 1260 DVREWGARMTKAAMVPPVTRKYEIIEDYWIVIDKDLVERKMKVEVPDDYEEKLRASKLKR 1319
Query: 1763 SEELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIE 1822
E +++P++K+Y PR++LG +V EQEVYGIDPYTHNLLLD+MP L EK F+E
Sbjct: 1320 GEYSHLDIPDIKEYHPRRELGLEVMEQEVYGIDPYTHNLLLDTMPKIPAMTLQEKLQFME 1379
Query: 1823 DVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDC-DVRTMKMCRGILKAMDSR 1881
+ LL+ +NK+V+ FTGTG P+ + L+PVI+ I VDD D + +G+L M +R
Sbjct: 1380 ETLLQAINKEVKQFTGTGKAPIDFSLEPVIQRI----VDDAQDTSMQQFAQGLLSNMRNR 1435
Query: 1882 PDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAP 1941
+KY+AYRKGLGVVCNK+GGF EDDFVVEF GEVYP W+W+EKQDG R LQK +++P P
Sbjct: 1436 TKEKYLAYRKGLGVVCNKDGGFKEDDFVVEFFGEVYPAWRWYEKQDGCRYLQKKDKEPLP 1495
Query: 1942 EFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTV 2001
EFYNI LERPKGDA GYDLVVVDAMHKAN+ASRICHSCRPNCEAKVTAV G Y IG+Y +
Sbjct: 1496 EFYNILLERPKGDAAGYDLVVVDAMHKANFASRICHSCRPNCEAKVTAVKGRYIIGVYAL 1555
Query: 2002 RGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLD 2061
R I GEE+TFDYNSVTESKEEY S+CLCGSQ CRGSYLNL GA + V+KE HGLLD
Sbjct: 1556 RPIQNGEELTFDYNSVTESKEEYNNSICLCGSQCCRGSYLNLANAGASQDVIKERHGLLD 1615
Query: 2062 RHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEE 2121
RH L+LEAC V+ + E+ +AG+GSCLL GLP+W++ Y+ARLV F+NLER LP+E
Sbjct: 1616 RHVLLLEACCEGPVTRLELEEMRQAGVGSCLLDGLPDWLLKYTARLVEFMNLERQLLPDE 1675
Query: 2122 ILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKK 2181
++R ++++KRK +D+ E+ + DAE QAEGVYNQRLQN+A+TLDKVRYV+R +F DP++
Sbjct: 1676 LMR-SVKKKRKD-ADLSYELGRVDAENQAEGVYNQRLQNIAITLDKVRYVLRQLFTDPRE 1733
Query: 2182 APPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLK 2226
APP +++L +E VS LW E S+V EL+ CM PH+ D L +LK
Sbjct: 1734 APPLLKKLDQKELVSRLWSAENSIVNELLSCMMPHIPADRLAELK 1778
Score = 208 bits (529), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 219/824 (26%), Positives = 330/824 (40%), Gaps = 143/824 (17%)
Query: 643 WFYLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSI 702
W Y++ G GP L LK L + DH + +N W T+E+A SP
Sbjct: 420 WVYINKNGDTQGPMELAALKLLASREFMPDDHLVMRCGTNAWITLEHAQSP-------DN 472
Query: 703 TSDSVTQLVSPPEASGNLLADTGDTAQSTGEEFPVTLQSQCCPDGSAA-AAESSEDLHID 761
S ++ +LV+P A+ N A +ST DG+A ES E L ID
Sbjct: 473 GSSALQKLVNPVAATRN--AKDATLVEST-------------IDGNAQFQNESFEFLDID 517
Query: 762 VRVGALLDGFTVIPGKEIETLGEILQTTFERVDWQNNGGPTWHGACVGEQKPGDQKVDEL 821
RV +L G L++ +D Q P A P +
Sbjct: 518 GRVERILATCRA------SDQGNELKSVLSALDAQRQHSPDEGSAPPELSNPWKDSYNLG 571
Query: 822 YISDTKMK--EAAELKSGDKDHWVVCFDSDE-------------WFSGRWSCKGGDWKRN 866
+ D ++ +A + ++E W GRW+ KGGDWK
Sbjct: 572 FGVDADLECLDAVPRVVPPEPVPAPPPSTEEDLHHHDHNPSPLKWKPGRWTSKGGDWKLL 631
Query: 867 DEAAQDRCSRKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYACPD 926
Q+ K VLN+G LC+ P G DPR + S + +LP WA
Sbjct: 632 HPDGQNYV---KVVLNEGSLLCERPHYGV-DPRRQVQ-----VSERPKFELPQWAL---- 678
Query: 927 ERNDGSGGSRSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSSR 986
+RN + S T +A R K A DHG ++ R+ + ER SS
Sbjct: 679 DRNQKADSSAETTKSASATRPAK---------TATRAFDHGKEIAPERASM---ERPSSF 726
Query: 987 SARSYSSANDVRRSSAESDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGEW 1046
+ +S +D R + + H+ + S+ S +C K + C D L G+W
Sbjct: 727 LTKK-ASFSDTRPKTLPPERHTPSARTFASKAQRPS-SCSADIKAK-CNHD---LGRGDW 780
Query: 1047 YYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRNHGE 1106
+Y DG G ERGP SF+ELQ ++ + + TS +RK D +WVPL
Sbjct: 781 FYKDGGGRERGPYSFAELQAMIGRELLIPGTSAYRKSDDLWVPLP--------------- 825
Query: 1107 KIMPSGDSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNRE 1166
P D T+ + + V + + +T G+LHE VMK YKN+
Sbjct: 826 --RPEMDDGNFNVTEVTTSSF-DGARRVRKVVTTNIQQTIMAFTSGQLHEHVMKHYKNQV 882
Query: 1167 FAAAINEVLDP-------------------------WI----NAKQPKKETEHVYRKSEG 1197
+A + E LD WI + P + + +G
Sbjct: 883 MSAILFEGLDARAKLIESNRRLSTCSTVALLSGEDSWIGYGRSHPSPSSSNDTSDEREDG 942
Query: 1198 DTRAGKRARLLVRE----------------SDGDEETEEELQTIQDESTFEDLCGDASFP 1241
D R L R +D E +E+EL T + + D
Sbjct: 943 DQDHRPNRRPLFRSNGLSQEQTSRKRRLVYNDDVESSEDELPTGRRTRQRCLIRNDVFDS 1002
Query: 1242 GEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRFYKGISRQV 1301
+E ++ W L L ++H L+ D+KSLA S+TC+ WRAAV +K + +
Sbjct: 1003 SDEHLYEEGQNNSWETLGQLMLMRIYHHLKGDLKSLALISMTCKSWRAAVEKFKPKVKCL 1062
Query: 1302 DLSSVGPNCTDSLIRKTLNA-FDKEKLNSILLVGCTNITSGMLEEILQSFPHLSSIDIRG 1360
D +S+G +CTD+++ + L ILL CT ++ L + L++ P + +DI G
Sbjct: 1063 DFTSIGLHCTDAVLSSVQQQNYGGGNLKQILLKDCTVLSPDALGKFLEACPTIQDVDING 1122
Query: 1361 CGQFGELALKFPNINWV--KSQKSRGAK--FNDSRSKIRSLKQI 1400
C QFG+L+ FP +NWV S+ S A +DS K++SL I
Sbjct: 1123 CDQFGDLSHSFPQVNWVYDDSEVSDSATQGSDDSHRKMKSLNSI 1166
>gi|302817012|ref|XP_002990183.1| hypothetical protein SELMODRAFT_447947 [Selaginella moellendorffii]
gi|300142038|gb|EFJ08743.1| hypothetical protein SELMODRAFT_447947 [Selaginella moellendorffii]
Length = 1749
Score = 669 bits (1727), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 321/525 (61%), Positives = 405/525 (77%), Gaps = 8/525 (1%)
Query: 1704 DDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQK-NG 1762
D REWGARM KA++VPPVTRKYE+I+ Y IV D++ V RKM+V +P+DY EKL A K
Sbjct: 1203 DVREWGARMPKAAMVPPVTRKYEIIEDYWIVIDKDLVERKMKVEVPDDYEEKLRASKLKR 1262
Query: 1763 SEELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIE 1822
E +++P++K+Y PR++LG +V EQEVYGIDPYTHNLLLD+MP L EK F+E
Sbjct: 1263 GEYSHLDIPDIKEYHPRRELGLEVMEQEVYGIDPYTHNLLLDTMPKIPAMTLQEKLQFME 1322
Query: 1823 DVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDC-DVRTMKMCRGILKAMDSR 1881
+ LL+ +NK+V+ FTGTG P+ + L+PVI+ I VDD D + +G+L M +R
Sbjct: 1323 ETLLQAINKEVKQFTGTGKAPIDFSLEPVIQRI----VDDAQDTSMQQFAQGLLSNMRNR 1378
Query: 1882 PDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAP 1941
+KY+AYRKGLGVVCNK+GGF EDDFVVEF GEVYP W+W+EKQDG R LQK +++P P
Sbjct: 1379 TKEKYLAYRKGLGVVCNKDGGFKEDDFVVEFFGEVYPAWRWYEKQDGCRYLQKKDKEPLP 1438
Query: 1942 EFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTV 2001
EFYNI LERPKGDA GYDLVVVDAMHKAN+ASRICHSCRPNCEAKVTAV G Y IG+Y +
Sbjct: 1439 EFYNILLERPKGDAAGYDLVVVDAMHKANFASRICHSCRPNCEAKVTAVKGRYIIGVYAL 1498
Query: 2002 RGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLD 2061
R I GEE+TFDYNSVTESKEEY S+CLCGSQ CRGSYLNL GA + V+KE HGLLD
Sbjct: 1499 RPIQNGEELTFDYNSVTESKEEYNNSICLCGSQCCRGSYLNLANAGASQDVIKERHGLLD 1558
Query: 2062 RHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEE 2121
RH L+LEAC V+ + E+ +AG+GSCLL GLP+W++ Y+ARLV F+NLER LP+E
Sbjct: 1559 RHVLLLEACCEGPVTRLELEEMRQAGVGSCLLDGLPDWLLKYTARLVEFMNLERQLLPDE 1618
Query: 2122 ILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKK 2181
++R ++++KRK +D+ E+ + DAE QAEGVYNQRLQN+A+TLDKVRYV+R +F DP++
Sbjct: 1619 LMR-SVKKKRKD-ADLSYELGRVDAENQAEGVYNQRLQNIAITLDKVRYVLRQLFTDPRE 1676
Query: 2182 APPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLK 2226
APP +++L +E VS LW E S+V EL+ CM PH+ D L +LK
Sbjct: 1677 APPLLKKLDQKELVSRLWSAENSIVNELLSCMMPHIPADRLAELK 1721
Score = 196 bits (497), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 163/574 (28%), Positives = 244/574 (42%), Gaps = 95/574 (16%)
Query: 850 EWFSGRWSCKGGDWKRNDEAAQDRCSRKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYP 909
+W GRW+ KGGDWK Q+ K VLN+G LC+ P G DPR +
Sbjct: 587 KWKPGRWTSKGGDWKLLHPDGQNYV---KVVLNEGSLLCERPHYGV-DPRRQVQ-----V 637
Query: 910 SHSRRLDLPPWAYACPDERNDGSGGSRSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSF 969
S + +LP WA +RN + S T +A R K A DHG
Sbjct: 638 SERPKFELPQWAL----DRNQKADSSAETTKSASATRPAK---------TATRAFDHGKE 684
Query: 970 VSEPRSKVRAKERHSSRSARSYSSANDVRRSSAESDSHSKARNNQDSQGSWKSIACINTP 1029
++ R+ + ER SS + +S +D R + + H+ + S+ S +C
Sbjct: 685 IAPERASM---ERPSSFLTKK-ASFSDTRPKTLPPERHTPSARTFASKAQRPS-SCSADI 739
Query: 1030 KDRLCTVDDLQLQLGEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVP 1089
K + C D L G+W+Y DG G ERGP SF+ELQ +V + + TS +RK D +WVP
Sbjct: 740 KAK-CNHD---LGRGDWFYKDGGGRERGPYSFAELQAMVGRELLIPGTSAYRKSDDLWVP 795
Query: 1090 LTFATETSASTVRNHGEKIMPSGDSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGY 1149
L P D T+ + + V + + +
Sbjct: 796 LP-----------------RPEMDDGNFNVTEVTTSSF-DGARRVRKVVTTNIQQTIMAF 837
Query: 1150 TRGKLHELVMKSYKNREFAAAINEVLDP-------------------------WI----N 1180
T G+LHE VMK YKN+ +A + E LD WI +
Sbjct: 838 TSGQLHEHVMKHYKNQVMSAILFEGLDARAKLIESNRRLSTCSTVALLSGEDSWIGYGRS 897
Query: 1181 AKQPKKETEHVYRKSEGDTRAGKRARLLVRE----------------SDGDEETEEELQT 1224
P + + +GD R L R +D E +E+EL T
Sbjct: 898 HPSPSSSNDTSDEREDGDQDHRPNRRPLFRSNGLSQEQTSRKRRLVYNDDVESSEDELPT 957
Query: 1225 IQDESTFEDLCGDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTC 1284
+ + D +E A ++ W L L ++H L+ D+KSLA S+TC
Sbjct: 958 GRRTRQRRLIRNDVFDSSDEHLYEAGQNNSWETLGQLMLMRIYHHLKGDLKSLALISMTC 1017
Query: 1285 RHWRAAVRFYKGISRQVDLSSVGPNCTDSLIRKTLNA-FDKEKLNSILLVGCTNITSGML 1343
+ WRAAV +K + +D +S+G +CTD+++ + L ILL CT ++ L
Sbjct: 1018 KSWRAAVEKFKPKVKCLDFTSIGVHCTDAVLSSVQQQNYGGGNLKQILLKDCTVLSPDAL 1077
Query: 1344 EEILQSFPHLSSIDIRGCGQFGELALKFPNINWV 1377
+ L++ P + +DI GC QFG+L+ FP +NWV
Sbjct: 1078 GKFLEACPTIQDVDINGCDQFGDLSHSFPQVNWV 1111
>gi|302825692|ref|XP_002994441.1| hypothetical protein SELMODRAFT_432361 [Selaginella moellendorffii]
gi|300137625|gb|EFJ04494.1| hypothetical protein SELMODRAFT_432361 [Selaginella moellendorffii]
Length = 1531
Score = 586 bits (1511), Expect = e-164, Method: Compositional matrix adjust.
Identities = 294/524 (56%), Positives = 372/524 (70%), Gaps = 62/524 (11%)
Query: 1704 DDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQK-NG 1762
D REWGARMTKA++VPPVTRKYE+I+ Y IV D++ V RKM+V +P+DY EKL A K
Sbjct: 1041 DVREWGARMTKAAMVPPVTRKYEIIEDYWIVIDKDLVERKMQVEVPDDYEEKLRASKLKR 1100
Query: 1763 SEELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIE 1822
E +++P++K+Y PR++LG +V EQEVYGIDPYTHNLLLD+MP
Sbjct: 1101 GEYSHLDIPDIKNYHPRRELGVEVMEQEVYGIDPYTHNLLLDTMP--------------- 1145
Query: 1823 DVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRP 1882
P M LQ ++ +E+ +L+A++
Sbjct: 1146 ------------------KIPAM-TLQEKLQFMEET---------------LLQAIN--- 1168
Query: 1883 DDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPE 1942
++GLGVVCNK+GGF EDDFVVEF GEVYP W+W+EKQDG R LQK +++P PE
Sbjct: 1169 -------KEGLGVVCNKDGGFKEDDFVVEFFGEVYPAWRWYEKQDGCRYLQKKDKEPLPE 1221
Query: 1943 FYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVR 2002
FYNI LERPKGDA GYDLVVVDAMHKAN+ASRICHSCRPNCEAKVTAV G Y IG+Y +R
Sbjct: 1222 FYNILLERPKGDAAGYDLVVVDAMHKANFASRICHSCRPNCEAKVTAVKGRYIIGVYALR 1281
Query: 2003 GIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDR 2062
I GEE+TFDYNSVTESKEEY S+CLCGSQ CRGSYLNL GA + V+KE HGLLDR
Sbjct: 1282 PIQNGEELTFDYNSVTESKEEYNNSICLCGSQCCRGSYLNLANAGASQDVIKERHGLLDR 1341
Query: 2063 HQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEI 2122
H L+LEAC V+ + E+ +AG+GSCLL GLP+W++ Y+ARLV F+NLER LP+E+
Sbjct: 1342 HVLLLEACCEGPVTRLELEEMRQAGVGSCLLDGLPDWLLKYTARLVEFMNLERQLLPDEL 1401
Query: 2123 LRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKA 2182
+R ++++KRK +D+ E+ + DAE QAEGVYNQRLQN+A+TLDKVRYV+R +F DP++A
Sbjct: 1402 MR-SVKKKRKD-ADLSYELGRVDAENQAEGVYNQRLQNIAITLDKVRYVLRQLFTDPREA 1459
Query: 2183 PPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLK 2226
PP +++L +E VS LW E S+V EL+ CM PH+ D L +LK
Sbjct: 1460 PPLLKKLDQKELVSRLWSAENSIVNELLSCMMPHIPADRLAELK 1503
Score = 162 bits (410), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 118/410 (28%), Positives = 178/410 (43%), Gaps = 68/410 (16%)
Query: 1041 LQLGEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSAST 1100
L G+W+Y DG G ERGP SF+ELQ +V + + TS +RK D +WVPL
Sbjct: 556 LGRGDWFYKDGGGRERGPYSFAELQAMVGRELLIPGTSAYRKSDDLWVPLP--------- 606
Query: 1101 VRNHGEKIMPSGDSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMK 1160
P D T+ + + V + + +T G+LHE VMK
Sbjct: 607 --------RPEMDDGNFNVTEVTTSSF-DGARRVRKVVTTNIQQTIMAFTSGQLHEHVMK 657
Query: 1161 SYKNREFAAAINEVLDP-------------------------WINAKQ----PKKETEHV 1191
YKN+ +A + E LD WI + P +
Sbjct: 658 HYKNQVMSAILFEGLDARAKLIESNRRLSTCSTVALLSGEDSWIGYGRSHPSPSSSNDTS 717
Query: 1192 YRKSEGDTRAGKRARLLVRES----------------DGDEETEEELQTIQDESTFEDLC 1235
+ +GD R L R + D E +E+EL T + +
Sbjct: 718 DEREDGDQDHRPNRRPLFRSNGLSQEQTSRKRRLVYNDDVESSEDELPTGRRTRQRRLIR 777
Query: 1236 GDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRFYK 1295
D +E A ++ W L L ++H L+ D+KSLA S+TC+ WRAAV +K
Sbjct: 778 NDVFDSSDEHLYEAGQNNSWETLGQLMLMRIYHHLKGDLKSLALISMTCKSWRAAVEKFK 837
Query: 1296 GISRQVDLSSVGPNCTDSLIRKTLNA-FDKEKLNSILLVGCTNITSGMLEEILQSFPHLS 1354
+ +D +S+G +CTD+++ + L ILL CT ++ L + L++ P +
Sbjct: 838 PKVKCLDFTSIGVHCTDAVLSSVQQQNYGGGNLKQILLKDCTVLSPDALGKFLEACPTIQ 897
Query: 1355 SIDIRGCGQFGELALKFPNINWV--KSQKSRGAK--FNDSRSKIRSLKQI 1400
+DI GC QFG+L+ FP +NWV S+ S A +DS K++SL I
Sbjct: 898 DVDINGCDQFGDLSHSFPQVNWVYDDSEVSDSATQGSDDSHRKMKSLNSI 947
Score = 42.7 bits (99), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 54/129 (41%), Gaps = 27/129 (20%)
Query: 643 WFYLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSI 702
W Y++ G GP L LK L + DH + +N W T+E+A S P
Sbjct: 361 WVYINKNGDTQGPMELAALKLLASREFMPDDHLVMRCGTNAWITLEHAQS-------PDN 413
Query: 703 TSDSVTQLVSPPEASGNLLADTGDTAQST---GEEFPVTLQSQCCPDGSAAAAESSEDLH 759
S ++ +LV+P A+ N A A+ST E+F ES E L
Sbjct: 414 GSSALQRLVNPVAATRN--AKDATLAESTIDGNEQF---------------QNESFEFLD 456
Query: 760 IDVRVGALL 768
ID RV +L
Sbjct: 457 IDGRVERIL 465
>gi|212723442|ref|NP_001132870.1| hypothetical protein [Zea mays]
gi|194695622|gb|ACF81895.1| unknown [Zea mays]
gi|413916953|gb|AFW56885.1| hypothetical protein ZEAMMB73_718091 [Zea mays]
Length = 302
Score = 456 bits (1174), Expect = e-125, Method: Compositional matrix adjust.
Identities = 211/301 (70%), Positives = 246/301 (81%), Gaps = 1/301 (0%)
Query: 2139 LEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVSFL 2198
++ EK+DAEVQAEGV N RLQ + TLDKVRYVMRC+FGDPK APPP+ RLS + VS +
Sbjct: 1 MDSEKNDAEVQAEGVLNSRLQQIVHTLDKVRYVMRCIFGDPKNAPPPLVRLSGKSLVSAI 60
Query: 2199 WKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNL 2258
WKG+ S+V ELIQ M PHVEE+VL+DLK+KI+AHDPS SEDI+ +R SLLWLRDE+R L
Sbjct: 61 WKGDSSIVAELIQSMEPHVEEEVLSDLKAKIRAHDPSESEDIEGGIRNSLLWLRDELRTL 120
Query: 2259 PCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKLGADLQVY 2318
CTYKCRHDAAADLIH+YAYTKCFFRV++YK SPPV+ISPLDLGPKYADKLG Q Y
Sbjct: 121 SCTYKCRHDAAADLIHLYAYTKCFFRVRDYKTVKSPPVHISPLDLGPKYADKLGPGFQEY 180
Query: 2319 RKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPK 2378
KTY ENYCL QLI+W+ Q N++P+ L RA +GC+SLPD+ SFY K KPS+ R YG +
Sbjct: 181 CKTYPENYCLAQLIYWYSQ-NSEPESRLTRARKGCMSLPDVSSFYVKSAKPSQERAYGNR 239
Query: 2379 TVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSLTGCPLDREMVHWLKHRPAI 2438
TVRFMLSRMEKQ QRPWPKDRIW FKS PR FGSPM+D+ L PLD+EMVHWLK RP +
Sbjct: 240 TVRFMLSRMEKQAQRPWPKDRIWVFKSDPRFFGSPMMDTVLNNSPLDKEMVHWLKTRPNV 299
Query: 2439 F 2439
F
Sbjct: 300 F 300
>gi|237506940|gb|ACQ99221.1| hypothetical protein [Tragopogon dubius]
Length = 199
Score = 367 bits (942), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 168/199 (84%), Positives = 189/199 (94%)
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
LVVVDAMHKANYASRICHSCRPNCEAKVTAVDG YQIGIY+VR I YGEE+TFDYNSVTE
Sbjct: 1 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGQYQIGIYSVRPIVYGEEVTFDYNSVTE 60
Query: 2020 SKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEED 2079
SKEEYEASVCLCGSQVCRGS+LNLTGEGAF+KVLKE HG+L+RHQLMLEACELN VSEED
Sbjct: 61 SKEEYEASVCLCGSQVCRGSFLNLTGEGAFQKVLKECHGILNRHQLMLEACELNCVSEED 120
Query: 2080 YLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICL 2139
Y+ELG+AGLGSCLL GLP+W+VAY+ARLVRFI+ ERTKLP+ IL HNLEEKRKYF+DIC+
Sbjct: 121 YIELGKAGLGSCLLSGLPDWLVAYAARLVRFIHFERTKLPQVILTHNLEEKRKYFTDICM 180
Query: 2140 EVEKSDAEVQAEGVYNQRL 2158
+ EK++AE+QAEGV+NQR+
Sbjct: 181 DTEKNEAEIQAEGVFNQRI 199
>gi|237506942|gb|ACQ99222.1| hypothetical protein [Tragopogon pratensis]
Length = 199
Score = 365 bits (936), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 167/199 (83%), Positives = 188/199 (94%)
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
LVVVDAMHKANYASRICHSCRPNCEAKVTAVDG YQIGIY+VR I YGEE+TFDYNSVTE
Sbjct: 1 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGQYQIGIYSVRPIVYGEEVTFDYNSVTE 60
Query: 2020 SKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEED 2079
SKEEYEASVCLCGSQVCRGS+LNLTGEGAF+KVLKE HG+L+RHQLMLEACELN VSEED
Sbjct: 61 SKEEYEASVCLCGSQVCRGSFLNLTGEGAFQKVLKECHGILNRHQLMLEACELNCVSEED 120
Query: 2080 YLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICL 2139
Y+EL +AGLGSCLL GLP+W+VAY+ARLVRFI+ ERTKLP+ IL HNLEEKRKYF+DIC+
Sbjct: 121 YIELSKAGLGSCLLSGLPDWLVAYAARLVRFIHFERTKLPQVILTHNLEEKRKYFTDICM 180
Query: 2140 EVEKSDAEVQAEGVYNQRL 2158
+ EK++AE+QAEGV+NQR+
Sbjct: 181 DTEKNEAEIQAEGVFNQRI 199
>gi|308809269|ref|XP_003081944.1| SET domain-containing protein (ISS) [Ostreococcus tauri]
gi|116060411|emb|CAL55747.1| SET domain-containing protein (ISS) [Ostreococcus tauri]
Length = 1744
Score = 323 bits (828), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 237/766 (30%), Positives = 377/766 (49%), Gaps = 70/766 (9%)
Query: 1724 KYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEELDME----LPEVKDYKPR 1779
++ VI++YV DE + + + V+L +++ + + + D P + R
Sbjct: 987 QFTVIEEYVDRKDELECQIERSVTLAKNHPYVKGSTTSTANVDDGHPPGWWPSLGVKAER 1046
Query: 1780 KQLGDQVFEQEVYGIDPYT----HNLLLDSMPD--ELDWNLLEKHLFIEDVLLRTLNKQV 1833
K+L +V EQE YG D T L +PD E D L KHL L +N+
Sbjct: 1047 KKLVTEVIEQETYGCDFVTGRDATTTLQKVLPDFSEDDVWALYKHL------LSQVNESY 1100
Query: 1834 RHFT--GTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILK-AMDSRPDDK-YVAY 1889
T + + + E+ E + V D++++ + + K A +SR + Y +
Sbjct: 1101 GAMTPDTLATQSLALAAEDLAEKFENQGVRMNDMKSIAFSKALWKLASESRVSPEFYAVH 1160
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNN---EDPAPEFYNI 1946
RKG GVVC + GE F+++FLGE+YP W W EKQD IR +QKN + PEFYN+
Sbjct: 1161 RKGFGVVCKEPIKKGE--FLIDFLGEIYPPWAWAEKQDAIRLVQKNRGLRDKGPPEFYNM 1218
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
+ERP GD +GY ++ DAMH+ NYA R+ H+C PN E + A++G Y+I T R I
Sbjct: 1219 QIERPGGDEEGYSVLFCDAMHENNYAGRLSHTCDPNVEVNLKAINGKYEIHFITNRDIEP 1278
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLM 2066
GEE+ ++Y+S T++ +E EA+ CLCG+++CRGSYLN GE +VL H L+DR +
Sbjct: 1279 GEELAYNYHSCTDNMKEVEAAFCLCGARMCRGSYLNFVGEDNNSQVLNTKHKLIDRQVIA 1338
Query: 2067 LEACELNS--VSEEDYLELGRAGL--GSCLLGGLPNWVVAYSARLVRFINLERTKLPEEI 2122
+A + + ++ + L + G G LL P+W++ + L ++ E +LP+ I
Sbjct: 1339 FKAIDRAAEPLNPKQVRCLEQVGFYPGKGLLKCCPSWLLHFVGDLAIYMEEEVNQLPKHI 1398
Query: 2123 LRHNLEEKRKYF---SDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVM-RCVFGD 2178
L +E K + + A++ A V R Q++A+ L K+R V+ R G
Sbjct: 1399 LAAAKQEHEKLLLKSPGMEFTYNEKFAKIDALAVRENRTQSIAIMLSKLRRVLTRARDGG 1458
Query: 2179 PKK-----------APPPVERLSPEETVSFLWKGEG------SLVEELIQCMAPH---VE 2218
+K APPP RL+ +E W G G S++ L+ M PH +
Sbjct: 1459 AQKSVYECLDKFESAPPPFVRLTDDEVAVQFW-GTGTDGFDRSVIRGLLNAMGPHERKRD 1517
Query: 2219 EDVLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAY 2278
D SK++A + + +LR+SLLWLRDE+ LP RHD AA L+H YA
Sbjct: 1518 ADEFVKWTSKVEAIADKARKG-KMDLRESLLWLRDELIKLPRDKCARHDLAAALVHFYAM 1576
Query: 2279 TKCFFR---VQEYKAFTSPPVYISPLDLGP------KYADKLGADLQVYRKTYGENYCLG 2329
T+ F++ E+ +TS V + ++ DK+ A ++ KTY Y
Sbjct: 1577 TEQFWQPSPAPEHMGYTSDKVAVREDEVNAWGVGAGGGGDKIVARVE---KTYRPGYSGA 1633
Query: 2330 QLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEK 2389
++ WH Q ADP L +G L++PDI Y+ R G + ++
Sbjct: 1634 TMLQWHKQEVADPTQHLVANRKGNLTMPDIACCYSSRPNQPLARA-GSLEHETWIGHLQN 1692
Query: 2390 QPQRPWPK-DRIWAFKSSPRIFGSPMLDSSLTGC-PLDREMVHWLK 2433
P+ WP W + ++ GSP++D+ + G + +++ W+K
Sbjct: 1693 WPEETWPNLSGPWGIGNPQKLIGSPIIDAWMQGKRSIPVKVLAWIK 1738
>gi|145351886|ref|XP_001420292.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580526|gb|ABO98585.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1361
Score = 313 bits (801), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 235/771 (30%), Positives = 374/771 (48%), Gaps = 84/771 (10%)
Query: 1724 KYEVIDQYVIVADEEDVRRKMRVSL----PEDYAEKLNAQKNGSEELDMELPEVKDYKPR 1779
++ VI +YV DE++ + + V+L P D K+ P + R
Sbjct: 610 QFTVITEYVDRKDEKECQIERSVTLAANHPFDKKSKMKTAHVDDGTPPGWWPSLGVKAER 669
Query: 1780 KQLGDQVFEQEVYGIDPYT--------HNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNK 1831
K+L +V EQE YG+D T +L D D++ W L ++ LL +N+
Sbjct: 670 KKLVTEVIEQETYGVDFVTGRDATETLKRVLPDYSEDDV-WGLYKQ-------LLAQVNE 721
Query: 1832 QVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPD--DKYVAY 1889
+ T +T L E++ + D +++ + + K + + + Y +
Sbjct: 722 S--YGAMTPDTLATQNLAVAAEDLAVKLERKADAKSLAFSKALWKLAAAAIETPEYYYVH 779
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNN---EDPAPEFYNI 1946
RKG GVVCN+ GE F+++FLGE+YP W W EKQD IR +QK + PEFYN+
Sbjct: 780 RKGFGVVCNQPIKKGE--FLIDFLGEIYPPWAWAEKQDAIRQVQKARGLRDRGPPEFYNM 837
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
+ERP GDA+GY ++ DAMH+ NYA R+ H+C PN E + A++G Y+I T R I
Sbjct: 838 QIERPGGDAEGYSVLFCDAMHENNYAGRLSHTCDPNVEVNLKAINGKYEIHFITTRDIAP 897
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLM 2066
GEE+ ++Y+S T++ +E E + CLCG+++CRGSYLN GE +VL+ H L+DR +
Sbjct: 898 GEELAYNYHSCTDNMKEVEMAFCLCGARMCRGSYLNFVGEDHHSQVLESKHKLIDRQVMS 957
Query: 2067 LEACE--LNSVSEEDYLELGRAGL--GSCLLGGLPNWVVAYSARLVRFINLERTKLPEEI 2122
+A + + ++ + L G G LL P W++ + + +++ E +LP+ I
Sbjct: 958 FKAIDKAADPLTSKQERSLAAVGFYPGKGLLRNCPGWLLQFVGDVAVYMDTELNELPKHI 1017
Query: 2123 LRHNLEEKRKYFS---DICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVM------- 2172
L +E K + A++ A + R Q +A+ L K+R ++
Sbjct: 1018 LAAAKKEHAKLLEKNPQAEFSYTEKFAKIDALAMRENRTQCVAIMLSKLRRLLTRARDDG 1077
Query: 2173 ------RCVFGDPKKAPPPVERLSPEETVSFLWKG-----EGSLVEELIQCMAPHVEEDV 2221
C+ K APP V L+ E + W E S+V LI+ M PH +
Sbjct: 1078 PQKSVYECMDVFEKSAPPYVT-LTEAEIAAHFWGSGPENFEKSIVCGLIRAMGPHERK-- 1134
Query: 2222 LNDLKSKIQAHDPSGSEDIQRELRK-------SLLWLRDEVRNLPCTYKCRHDAAADLIH 2274
ND I+ S E + E+RK SLLWLRDE++ L T RHD AA LIH
Sbjct: 1135 -NDADKFIKW--TSMVESVAVEVRKGKMTRKESLLWLRDELKKLKQTDGARHDLAAGLIH 1191
Query: 2275 IYAYTKCFFR---VQEYKAFTSPPVYISPLDLGP------KYADKLGADLQVYRKTYGEN 2325
+YA T F++ E++ + S V + ++ DK+ A ++ KTY
Sbjct: 1192 LYAETNRFWQPSSAPEHQVYKSDKVAVREDEVNAWGVGAGGGGDKIVARVE---KTYRPG 1248
Query: 2326 YCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGP-KTVRFML 2384
+ ++ WH Q ADP + RG LS+PDI Y+ +P + + L
Sbjct: 1249 FSAATMLQWHKQEMADPTQYITANRRGNLSMPDIACCYS--SRPGQPLARSSDREHETWL 1306
Query: 2385 SRMEKQPQRPWPKDR-IWAFKSSPRIFGSPMLDSSLTGC-PLDREMVHWLK 2433
+ ++ P+ PWP+ W +S ++ GSP+LD+ + G + + + WLK
Sbjct: 1307 AHLQSWPEEPWPQSSGPWGVANSQKLIGSPVLDAWMKGQRSIPAKCLAWLK 1357
>gi|308805538|ref|XP_003080081.1| SET domain-containing protein (ISS) [Ostreococcus tauri]
gi|116058540|emb|CAL53729.1| SET domain-containing protein (ISS) [Ostreococcus tauri]
Length = 844
Score = 306 bits (784), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 189/574 (32%), Positives = 292/574 (50%), Gaps = 60/574 (10%)
Query: 1882 PDDKYVAYR---KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNED 1938
P + V +R KG GVVC G V ++GE+YP W+W+E+QD I+ N
Sbjct: 271 PKEHRVQFRIHPKGTGVVCINPNGLKAGTLVNYYIGEMYPPWQWYERQDAIKKSFPNMN- 329
Query: 1939 PAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
P F+NI LERP D G ++ V+AMHK ++ASR+ HSC PNC+ DG +G+
Sbjct: 330 -LPSFFNITLERPAHDERGRHVIFVEAMHKGSFASRLSHSCEPNCQTVTFTKDGKLTLGM 388
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHG 2058
+TVR I YGEE+T+DY+ +TES EEY CLC S CRGS+L G GAF ++ + H
Sbjct: 389 FTVRDIAYGEEMTWDYSCITESAEEYRTGFCLCSSPGCRGSFLTYAGNGAFTAIVNKKHS 448
Query: 2059 LLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKL 2118
L R+ ++ A +++ + L AG+ C L P+WVV ++A +++I LE +L
Sbjct: 449 FLHRNAILFVA-STTPLTKAESESLYVAGIRQCALEKCPDWVVKWAALTLQYIKLEEKEL 507
Query: 2119 PEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGD 2178
P+ +++ + E +Y ++ A+ +A GV + R+ NL VTLDK+RYV+
Sbjct: 508 PDVLMKLPVTEYGRY--------DEIGAKYEAAGVASTRITNLVVTLDKIRYVL----NR 555
Query: 2179 PKKAPPPVER-LSPEETVSFLWKGEGSLVEELIQCM------------------APHVEE 2219
P + R LS +E + LW GE S+ I M A E+
Sbjct: 556 PGQRRDSFFRALSDDEVIDHLWSGEASVFRRFIITMVNSGGDKRNEARSASMSTAAMFEK 615
Query: 2220 D-----VLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIH 2274
V + LK+ ++ + + + R LL +R + + K H A DL+
Sbjct: 616 TWTDTRVASALKAIKKSVNVVERPETAEQARARLLQVRAALEH--AGDKAFHAQARDLLW 673
Query: 2275 IYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKLGADL---------QVYRKTYGEN 2325
++A T +F ++++ SPPV I D+ + + ++ L ++ +K YG
Sbjct: 674 LHANTLHYFTIEKFDLVLSPPVNID--DMKSQISCEMRTKLPNAVKGNRDKLLQKKYGPL 731
Query: 2326 YCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFMLS 2385
Y GQL+ W+ QT PD +L+ RG LSLPD S Y+ V P++ Y R ++
Sbjct: 732 YVWGQLVTWYKQTVYAPDASLSADRRGSLSLPDPESCYSAV--PTK---YTSSERRSLIK 786
Query: 2386 RMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSL 2419
M WP W+FK+ +++GSPM D +L
Sbjct: 787 LMRSNIHAMWPTTMSWSFKNPTKVYGSPMFDEAL 820
>gi|110741769|dbj|BAE98829.1| hypothetical protein [Arabidopsis thaliana]
Length = 517
Score = 301 bits (770), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 231/597 (38%), Positives = 307/597 (51%), Gaps = 151/597 (25%)
Query: 1 MGDGGVACMPLQQQQQHNSIMERFPISDKTTICVGNSSNNSNKTNNNSISNNNDNKTNND 60
M DGGVACMPL +IME+ PI +KTT+C GN S KT
Sbjct: 1 MSDGGVACMPLL------NIMEKLPIVEKTTLCGGNES-----------------KTAAT 37
Query: 61 SSNNNGSSSSKNNETNKSNVKKNGVSTKTVRKKIVK-IKKVIAVKKKEVQKNSGSS---- 115
+ N + S ++K E+ +N K + S +K+IVK I+KV+ + K+ QK +
Sbjct: 38 TENGHTSIATKVPESQPAN-KPSASSQPVKKKRIVKVIRKVVKRRPKQPQKQADEQLKDQ 96
Query: 116 ---------------------KSNNNGENIDNKNVENGGAVGEVVTVDKENLKNEEVEEG 154
KS G K VENGG G +EVEEG
Sbjct: 97 PPSQVVQLPAESQLQIKEQDKKSEFKGGTSGVKEVENGGDSG----------FKDEVEEG 146
Query: 155 ELGTLKW----ENGEFVQPEKSQPQSQLQSQSKQIEKGEIIV------------------ 192
ELGTLK ENGE + P KS Q +IEKGEI+
Sbjct: 147 ELGTLKLHEDLENGE-ISPVKSL-------QKSEIEKGEIVGESWKKDEPTKGEFSHLKY 198
Query: 193 ---------FSS-KCRRGETEKGESGLWRGNKDDIEKGEFIPDRWHK-EVVKDEYGYSKS 241
FS+ K +G E+ E WR D+IEKGEFIPDRW K + KD++ Y +S
Sbjct: 199 HKGYVERRDFSADKNWKGGKEEREFRSWRDPSDEIEKGEFIPDRWQKMDTGKDDHSYIRS 258
Query: 242 RR----------YDYKLERTPPSGKYSGEDVYRRKEFDRSGSQHSKSSSRWESGQERNVR 291
RR Y+Y+ ERTPP G++ ED+Y ++EF SG +R R
Sbjct: 259 RRNGVDREKTWKYEYEYERTPPGGRFVNEDIYHQREF--------------RSGLDRTTR 304
Query: 292 ISSKIVDDEGLYKGEHNNGKNHGREYFH-GNRFKRHGTDSDSGDRKY-YGDYGDFAGLKS 349
ISSKIV +E L+K E+NN N +EY GNR KRHG + DS +RK+ Y DYGD+ K
Sbjct: 305 ISSKIVIEENLHKNEYNNSSNFVKEYSSTGNRLKRHGAEPDSIERKHSYADYGDYGSSKC 364
Query: 350 RRLSDDYNSRSVHSEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRVIYDRHGR 409
R+LSDD SRS+HS+HYS+HS E+ +R+S S+ SSL+KY +H + S ++ D+HG
Sbjct: 365 RKLSDDC-SRSLHSDHYSQHSAERLYRDSYPSKNSSLEKYPRKHQDASFPAKAFSDKHGH 423
Query: 410 SPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPYA 469
SPS SD SPHDR RY+++R DRSPY R+RSPY F++S ++R+RSP +R
Sbjct: 424 SPSRSDWSPHDRSRYHENR-------DRSPYARERSPYIFEKSSHARKRSPRDRRHH--- 473
Query: 470 REKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNNHR 526
RSP +E SP DR+R DR D PN++E + R+R N HR
Sbjct: 474 -------------DYRRSPSYSEWSPHDRSRPSDRRDYIPNFMEDTQSDRNRRNGHR 517
>gi|145347753|ref|XP_001418326.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578555|gb|ABO96619.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 931
Score = 288 bits (736), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 181/556 (32%), Positives = 282/556 (50%), Gaps = 36/556 (6%)
Query: 1874 ILKAMDSRPD-DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSL 1932
I+++ D R +++ + KG GVVC G FV ++GE+Y WKW+E+QD I+
Sbjct: 376 IVESHDPRARRERFRIHPKGTGVVCINPNGLKAGTFVNHYIGEIYSPWKWYERQDAIKKC 435
Query: 1933 QKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDG 1992
E P F+NI LERP D G + V+AMHK +ASR+ HSC PNC+ A G
Sbjct: 436 YPGME--LPSFFNITLERPPHDDRGRHVSFVEAMHKGCFASRLSHSCEPNCQTVTFAKGG 493
Query: 1993 HYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKV 2052
+G++T + I YGEE+T+DY+ +TES EEY CLC S CRGS+L +G GAF V
Sbjct: 494 KLTLGMFTTQDIAYGEEMTWDYSCITESAEEYRTGFCLCSSPTCRGSFLTYSGSGAFTAV 553
Query: 2053 LKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFIN 2112
+ + H L R+ ++ +A ++++ D L +G+ C L P+WVV ++A + +I
Sbjct: 554 VNKKHAFLHRNAILFKA-STSALTNVDRKMLHDSGIRECALEHCPDWVVKWAALTLEYIK 612
Query: 2113 LERTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVM 2172
LE +LP E++ +Y + A+ +A GV R+ NL VTLDK+RYV+
Sbjct: 613 LEENELPNELMSLPATRFGRY--------NELGAKSEATGVAATRITNLIVTLDKIRYVL 664
Query: 2173 RCVFGDPKKAPPPVERLSPEETVSFLWKGEGSLVEELIQ-CMAPHVEEDVLNDL-KSKIQ 2230
++ P L+ E + LW G+ S++ +++ +A + N + KS++
Sbjct: 665 T---RSGQERAPFFRVLTESEVIEHLWSGDESILRRILRSILAGAGAKKGSNSVGKSRLV 721
Query: 2231 AHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKA 2290
+ D + + + R ++ P T A+DL+ YA T+ +F +
Sbjct: 722 MAKMPKTGDARVDAAMKAIQERIDIDERPKT-------ASDLLWFYANTRNWFTHAKLDN 774
Query: 2291 FTSPPVYISPL-DLGPKYADK------LGADLQVYRKTYGENYCLGQLIFWHIQTNADPD 2343
SP V I + + P K G + +K YG Y GQL+ W+ QT PD
Sbjct: 775 VISPAVNIDDVASVIPTTQRKHIPNAFRGNREAMLQKRYGALYVWGQLVTWYKQTIYSPD 834
Query: 2344 CTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAF 2403
+L+ RG LSLPD S + PS Y K + + + K + WP W+F
Sbjct: 835 SSLSADRRGTLSLPDPESCCSAA--PS---AYVNKERKELFKALRKNKHQSWPTATSWSF 889
Query: 2404 KSSPRIFGSPMLDSSL 2419
K+ +++GSPM D +L
Sbjct: 890 KNPAKVYGSPMFDDAL 905
>gi|145494033|ref|XP_001433011.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124400127|emb|CAK65614.1| unnamed protein product [Paramecium tetraurelia]
Length = 1065
Score = 284 bits (727), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 203/716 (28%), Positives = 350/716 (48%), Gaps = 76/716 (10%)
Query: 1733 IVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEE----LDMELPEVKDYKPRKQLGDQVFE 1788
I D E RK RV E G+E+ + +++ + + K +++ V E
Sbjct: 281 ITWDSECPNRKNRVECLE---------HEGTEQPCMNMSIQMKQHQVNKMKQEENADVEE 331
Query: 1789 QEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNKQVRHFTGTGNTPMMYPL 1848
+GID YT ++++ +P L+++ +K+ F+E +LL +N+ + Y +
Sbjct: 332 TPCWGIDAYTRKVIINILP--LNYDDAQKNKFLEKLLL-AINR-------PSDKENAYDM 381
Query: 1849 QPVIEEIEKEA---VDDCDVRTMKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGE 1905
+ I KE+ + KM + I K + + + + KG G+VC + G
Sbjct: 382 SLACDYIIKESKLMSSHYNKEDRKMAKQIQKVL-KYDTEGFRIHTKGFGLVCVNKQGIKN 440
Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQK--NNEDPAPEFYNIYLERPKGDADGYDLVVV 1963
+ ++ +LGE+Y W+W+EKQD I+ K N +D P+FYNI L+ + D G D + V
Sbjct: 441 NSLIIPYLGEIYQPWRWYEKQDFIKKQMKEHNQKDILPDFYNIMLDIHRDDIKGIDFLFV 500
Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
D ++K NY+SR+ HSC PNC T +G Y IG+Y +R I YGEE+TFDY S TESK+E
Sbjct: 501 DPINKGNYSSRLSHSCNPNCGTVTTVSNGTYVIGMYAMREIQYGEELTFDYCSFTESKQE 560
Query: 2024 YEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELN-SVSEEDYLE 2082
++CLCGS+ C+ YL L+ + +L + H L R+ ++L++C N S ED
Sbjct: 561 QLQALCLCGSEKCKIYYLQLSNCKEYNGILDKEHCFLTRNAILLKSCSDNVDKSNEDSEL 620
Query: 2083 LGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICLEVE 2142
+ +GS +L P W+ + +++FI+ +E++ Y S++ L+ E
Sbjct: 621 YSKYRIGSSVLNDCPLWLKNWVGYILKFID---------------QERQTYKSELNLKYE 665
Query: 2143 KSDAEVQAEGVYNQ------RLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVS 2196
++ AEV+ + R+QNL TLDK+++ + + + PP+ + + +
Sbjct: 666 QT-AEVEQWNHFTATQHSEDRIQNLIFTLDKIKFFL----NNSDSSEPPISIIGDSDLLD 720
Query: 2197 FLWK-------GEGSLVEELIQCMAPHVEEDVLNDL----KSKIQAHD-PSGSEDIQRE- 2243
WK E S EL Q H ++ ++ + K K Q HD +I R+
Sbjct: 721 SFWKDYSSGTSSECSFFNELYQLFQKHNQKKMIELIHVIYKKKQQLHDYKENIHNIHRQE 780
Query: 2244 -LRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLD 2302
L +L+L + Y H+A + ++++ AYT +F+ EY F+SPP+ D
Sbjct: 781 LLITRMLFLTLSHMLMQQQYTFHHEALSLILYMMAYTYTYFKPYEYTGFSSPPI----ED 836
Query: 2303 LGPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSF 2362
L + + + Y + GQLI W QT A P +L++ RG LS P I SF
Sbjct: 837 LEWRKVGAFKKKCKSEGRAYSSQFVWGQLIGWFKQTVAAPQASLSQDRRGTLSYPAISSF 896
Query: 2363 YAKVQKPSRHRVYGPKTVR-FMLSRMEKQPQRPWPKDRI-WAFKSSPRIFGSPMLD 2416
K + ++ R + + + +P WP + W++K++ +I+G+ + +
Sbjct: 897 DKAGDKAYPFQQSKAQSSRSYFIQHLLDKPSYMWPPETASWSYKNTYKIYGTILFE 952
>gi|403375645|gb|EJY87798.1| hypothetical protein OXYTRI_23635 [Oxytricha trifallax]
Length = 1691
Score = 275 bits (704), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 178/607 (29%), Positives = 308/607 (50%), Gaps = 53/607 (8%)
Query: 1781 QLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNKQVRHFTGTG 1840
+LG V E+ +GID T L+ +P D + + FIE L+ + +Q G
Sbjct: 960 KLGIDVIEKISWGIDMGTAVNLMTLLPK--DMPMKAQSDFIEKRLVFAIQQQ-----GDQ 1012
Query: 1841 NTPMMYPLQPVIEEIEKEAVDDCDVRTMK-MCRGILKAMDSRPDDKYVAYRKGLGVVCNK 1899
+ L+ +I + E D D + M +GI D+ + + + KG+G+ C +
Sbjct: 1013 GYDVREALKFIINDRENPRFRDIDRELAQIMLQGITMVKDN-VERHFRVHSKGIGIFCKR 1071
Query: 1900 EGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNN--EDPAPEFYNIYLERPKGDADG 1957
G + ++E+ GE+Y W W+EKQD ++ Q P+FYNI ER D G
Sbjct: 1072 NEGIKASNLIIEYFGEIYQPWNWYEKQDVLKQGQNKQTLSKDLPDFYNITFERHHDDPQG 1131
Query: 1958 YDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSV 2017
YD+++VD + NY+SR+ HSC PNC + D Y IG++ ++ + +GEE+ F+Y S+
Sbjct: 1132 YDILMVDPILYGNYSSRLSHSCNPNCSTIIHVRDNQYSIGMFAIKDVSFGEELCFNYCSL 1191
Query: 2018 TESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSE 2077
TES++EYE+++CLCG++VC+G YL L + ++K+ H +DR+ L+ +AC+ ++E
Sbjct: 1192 TESEKEYESAICLCGTEVCQGKYLQLANDKKHMAIMKKYHTFVDRNYLLYKACKFPEITE 1251
Query: 2078 EDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDI 2137
ED L G+ +L +P+W+ +++ + +I E P +F +I
Sbjct: 1252 EDEKRLNDFGIKESVLKDVPDWLKKWASLICEYIIFEEDIYP------------SFFKEI 1299
Query: 2138 CLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRC--VFGDPKKAPPPVERLSPEETV 2195
++ D ++A+ + ++ N+A+T+DKV +V++ VF PP++ LS +E
Sbjct: 1300 YPTFKEEDLRIEAKNQRDSKIWNIAITIDKVMHVLKSMGVF------EPPIKDLSYKERC 1353
Query: 2196 SFLWKGEGSLVEELIQCMAPHVEE--DVLNDLKSKIQA--------HDPSGS--EDIQRE 2243
LW + SL E LI + H+E + L+ L+ I D +G ++ +E
Sbjct: 1354 IRLWDSKDSLRESLIDVLN-HIENYPEKLDALQKVISVPLVEAQFDEDENGRYYKEKYQE 1412
Query: 2244 LRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQE-YKAFTSPPVYISPLD 2302
++ + + +R + + K A D + +YA T +F E YK I D
Sbjct: 1413 IQSVIALISGILRPIK-SIKVMIPALCDSLWLYANTHTYFTPNENYKKCKGDEQKIRKCD 1471
Query: 2303 LGPKYADK-----LGADLQVYR--KTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLS 2355
+ + ++ + + QVYR K Y +Y GQL+ W+ QT P+ +L+ RG LS
Sbjct: 1472 VRIENQNQASLNPVEQEKQVYRGFKEYDPSYVWGQLVGWYKQTVDKPNASLSADRRGTLS 1531
Query: 2356 LPDIGSF 2362
+PD+ SF
Sbjct: 1532 MPDLESF 1538
>gi|384247471|gb|EIE20958.1| hypothetical protein COCSUDRAFT_57497 [Coccomyxa subellipsoidea
C-169]
Length = 1198
Score = 275 bits (703), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 175/520 (33%), Positives = 269/520 (51%), Gaps = 59/520 (11%)
Query: 1820 FIEDVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMD 1879
++E L+ +NKQ G ++ LQ V E + A D C V ++ +
Sbjct: 456 WVERELMPAINKQ-----GASGWDILLALQDVKEHAQA-AGDMCSVEAANAVEKRVQKVG 509
Query: 1880 SRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDP 1939
S + + + KG+G+ C + G FV E+LGEV+ W+WFE QD IR K D
Sbjct: 510 S---NYFRIHPKGVGLKCCRSEGLPPLTFVEEYLGEVHTPWRWFEMQDIIR---KTMGDE 563
Query: 1940 APEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIY 1999
P+FYNI LERP+ D GYD++ +DA K YASR+ HSC PNC+A V A G I +Y
Sbjct: 564 LPDFYNIVLERPRDDPTGYDVMFIDAAAKGAYASRMSHSCTPNCQAVVMACGGRLTIAVY 623
Query: 2000 TVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGL 2059
T+R ++ GEE+TFDY VTES++E+ ++CLCG++ CRGS+L G AF +++ + H +
Sbjct: 624 TLRHVYPGEELTFDYACVTESEKEFRTAICLCGTRNCRGSFLTFAGSRAFMQIMTQRHSM 683
Query: 2060 LDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGG------LPNWVVAYSARLVRFINL 2113
L R L++ A ++E D L GL C LGG +P W+ ++A ++ F+
Sbjct: 684 LRRQALLVRAGA-EPLTERDRARLQEFGLRECALGGGGGQSRVPAWLEKWTALILEFVQE 742
Query: 2114 ERTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMR 2173
E+ LPE++L S+I + A +A+GV + RLQN+ +TLDKV+
Sbjct: 743 EQRLLPEQLL--------ALPSNIVAYTPFTAAS-EAKGVSDNRLQNVVITLDKVKL--- 790
Query: 2174 CVFGDPKKAPPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHV-------------EED 2220
C+ + P+ L+ E +LW G SL ++ A + +ED
Sbjct: 791 CLSKPGQCLNAPLRLLTDSEVAEYLWTGTKSLARRWLRTAASQLANPSVARALSSAEDED 850
Query: 2221 VLNDLKSKIQAH-----------DPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAA 2269
+ + ++ + H P+ E R L+ L ++VR L H AA
Sbjct: 851 DIQAVLARHRGHKHLEELAVLVLQPAAD---AAEGRARLMALAEKVRALDVACGGGHTAA 907
Query: 2270 ADLIHIYAYTKCFFRVQ-EYKAFTSPPVYISPLDLGPKYA 2308
AD++ +YA ++ +F + EYK FTSPPV+++ DL K A
Sbjct: 908 ADMLLLYASSQTWFTSEREYKGFTSPPVHLNLDDLMLKRA 947
Score = 73.2 bits (178), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 69/134 (51%), Gaps = 11/134 (8%)
Query: 2309 DKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQK 2368
DK G K YG + GQL W+ QT DP +L+ RG +S+PDI S Y
Sbjct: 1055 DKHGRTQVGLAKKYGPGFVWGQLNGWYKQTVFDPTASLSAERRGTISMPDIESCYGG--- 1111
Query: 2369 PSRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDS----SLTGCPL 2424
SR R Y K ++ +EK+P+ W IW+F++ +++GSPM D+ ++ G P
Sbjct: 1112 -SRSR-YTVKDRNHLIDHIEKRPEGMWKIGTIWSFRNEAKVYGSPMFDAVWAQTMPGAPP 1169
Query: 2425 D--REMVHWLKHRP 2436
D E++ L+ P
Sbjct: 1170 DPMPELLQKLRSAP 1183
>gi|357486421|ref|XP_003613498.1| Elongation factor 1-alpha [Medicago truncatula]
gi|355514833|gb|AES96456.1| Elongation factor 1-alpha [Medicago truncatula]
Length = 1488
Score = 273 bits (698), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 168/347 (48%), Positives = 187/347 (53%), Gaps = 111/347 (31%)
Query: 1641 NRKSLDSGSETSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGL 1700
N+KS+DS SETSDDLD SSE + ++ T + G + A + GL
Sbjct: 1058 NKKSIDSDSETSDDLDVSSEVKLAIVMMILTQTKIE-----PGNQKVMDTA--YLPYNGL 1110
Query: 1701 DF-SDDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQ 1759
DF SD+ EWG MTKASLV PVTRKY+VIDQYVIVA
Sbjct: 1111 DFISDECEWGHCMTKASLVSPVTRKYDVIDQYVIVA------------------------ 1146
Query: 1760 KNGSEELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHL 1819
D+V EQEVYGIDP THNLLLDSMP ELDW+L EK
Sbjct: 1147 ------------------------DEVIEQEVYGIDPSTHNLLLDSMPAELDWSLQEK-- 1180
Query: 1820 FIEDVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMD 1879
+ H+ P + MC+GILKAMD
Sbjct: 1181 ----------TQSFGHWICKLGVP-----------------------RISMCQGILKAMD 1207
Query: 1880 SRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDP 1939
RPDDKYVAYRKGLGVVCNKE GF EDDFVVEFLGEV +D
Sbjct: 1208 KRPDDKYVAYRKGLGVVCNKEEGFAEDDFVVEFLGEV--------------------KDS 1247
Query: 1940 APEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAK 1986
APEFYNIYLERPKGDADGYDLVVVDA HKAN+ASRICHSCRPNCEA+
Sbjct: 1248 APEFYNIYLERPKGDADGYDLVVVDATHKANHASRICHSCRPNCEAE 1294
Score = 182 bits (461), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 124/343 (36%), Positives = 163/343 (47%), Gaps = 129/343 (37%)
Query: 1041 LQLGEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSAST 1100
L LG+WY+LDG G ERGPSSF +LQ VDQ I+K ++F +
Sbjct: 764 LHLGDWYFLDGLGRERGPSSFLDLQSSVDQCIIKK-----KQF----------------S 802
Query: 1101 VRNHGEKIMPSGDSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMK 1160
V N + + P Q +GYTRGK+HELV+K
Sbjct: 803 VANFLDSLYP----------------------------------QVVGYTRGKVHELVIK 828
Query: 1161 SYKNREFAAAINEVLDPWINAKQPKKE-TEHVYRKSE----------------------- 1196
SYK+REFAA INEVL PWINA+QPKKE + +Y ++
Sbjct: 829 SYKSREFAAVINEVLYPWINARQPKKEFKKQIYIGNQVTIYFLKFTTIPVFVSNSYEEMI 888
Query: 1197 ---------GDTRAGKRARLLVRESDGDEETEEELQTIQ-DESTFEDLCGDASFPGEESA 1246
GDT A KRAR+LV +SD + E+ I+ +EST E L GD +F EES
Sbjct: 889 CALSSLTLKGDTHASKRARVLVDDSDEEGGFEDCSFIIENNESTVEALSGDVTFSREESG 948
Query: 1247 SSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRFYKGISRQVDLSSV 1306
+ + G WGLLDG LA +FHFLRSD+KSL F
Sbjct: 949 ITVSKEGRWGLLDGRMLARIFHFLRSDLKSLVF--------------------------- 981
Query: 1307 GPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGMLEEILQS 1349
NA++K+K+ S++L+GCTNIT+ +LE+ L S
Sbjct: 982 -------------NAYEKDKIKSMILMGCTNITADILEKFLVS 1011
Score = 60.8 bits (146), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 31/58 (53%), Positives = 38/58 (65%), Gaps = 12/58 (20%)
Query: 1481 MEEFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGLGSVKDDISRMCRDAIK 1538
+E+FL S L+EIM+ N +FFVPKV E HGL SVK+ ISRMCRDA+K
Sbjct: 1005 LEKFLVSRLREIMKANACDFFVPKVPE------------HGLSSVKEGISRMCRDAMK 1050
>gi|118383419|ref|XP_001024864.1| SET domain containing protein [Tetrahymena thermophila]
gi|89306631|gb|EAS04619.1| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 2631
Score = 256 bits (654), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 170/581 (29%), Positives = 291/581 (50%), Gaps = 66/581 (11%)
Query: 1882 PDDKYV-AYR---KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNE 1937
P++ Y A+R KG+G++C G +++F+ E++GE++P W+WFEKQD I+ K N
Sbjct: 1050 PENLYSEAFRIHTKGMGLICINPKGIEQNEFITEYIGEIFPPWRWFEKQDTIKKYMKENN 1109
Query: 1938 --DPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVD-GHY 1994
D P+F+NI LE K D GYD++ VD + K N++SR+ HSC PNC T + G+Y
Sbjct: 1110 KRDILPDFWNIMLEIHKDDPKGYDILFVDPILKGNFSSRLSHSCEPNCGTVPTITNTGNY 1169
Query: 1995 QIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLK 2054
I ++ + I YGEE++FDY +VTES +E++ ++CLCGS CRG YL L+ K
Sbjct: 1170 VIAMFAMHPIEYGEELSFDYMAVTESIQEHKRAICLCGSSKCRGRYLELSN--------K 1221
Query: 2055 ELHGLLDRHQLMLEAC--ELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFIN 2112
++H L+R + AC ELN EED L + S + P W+ ++A ++R IN
Sbjct: 1222 KIHCFLNRTYTLYIACTEELN---EEDENILEQYSFRSNIRENSPKWLQKWAALVLRIIN 1278
Query: 2113 LERTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQ-------AEGVYNQRLQNLAVTL 2165
E EE++ ++ +K S + L E+ + ++ A+ + R+QN+ +++
Sbjct: 1279 QEYDLFLEELVEAEKKKVQKEESLVNLTQEQINQKIDLPYLKYLAQSRKDARIQNIVISI 1338
Query: 2166 DKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLW-KGEGSLVEELIQCMAPHVEEDVLN- 2223
DK++Y + + PP+ + ++ + W K E +L ++L++ + + V+N
Sbjct: 1339 DKIKYYTNQI----NDSSPPLLNMQNDQLLENYWIKTENTLKDDLLEVLQ-QIHSKVINY 1393
Query: 2224 -----------DLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADL 2272
+ KIQ G D + + ++ + + + DA + +
Sbjct: 1394 QLVEYAHIIITNATKKIQIFQKFGQFDHGLLVVRVVVLIISDFFLQMKNCNLQSDALSLI 1453
Query: 2273 IHIYAYTKCFFRVQEYKAFTSPPVYISPLDL--GPKYADKLGADLQVY-----RKTYGEN 2325
+H +A+T +F+ YK FTS I D+ + + ++Q + +KTY
Sbjct: 1454 LHFHAFTHKYFKTHSYKKFTSEEQIIQKEDIINVELFDEDQQGNIQEFTAYSDKKTYTSL 1513
Query: 2326 YCLGQLIFWHIQTNADPDCTLARASRGCLSLPDI-------GSFYAKVQKPSRHRVYGPK 2378
+ GQL W+ Q+ +P TL+ RG L P + + Y V + PK
Sbjct: 1514 FVWGQLNMWYKQSVTNPATTLSLERRGPLIYPQLSNSFKESSTLYPFV---DNKQAENPK 1570
Query: 2379 TVRFMLSRMEKQPQRPWPKDRI--WAFKSSPRIFGSPMLDS 2417
V + ++ +P+ WP D W+FK+ +GS M DS
Sbjct: 1571 QV--FMDHLKTKPECYWPGDNFNKWSFKNQMSQYGSFMYDS 1609
>gi|145532427|ref|XP_001451969.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124419646|emb|CAK84572.1| unnamed protein product [Paramecium tetraurelia]
Length = 1024
Score = 255 bits (652), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 181/594 (30%), Positives = 297/594 (50%), Gaps = 44/594 (7%)
Query: 1779 RKQLGDQ--VFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLL---RTLNKQV 1833
R GD V E +GID YT N++++ +P L++ +K+ FIE ++L R +K+
Sbjct: 321 RTNFGDNADVEETLCWGIDVYTRNVIINILP--LNYVESQKNQFIEKLILAINRPNDKER 378
Query: 1834 RHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAYRKGL 1893
+ G ++ + + K+ D + K + ++K +D + + KG
Sbjct: 379 GYDMGLACDYIIRESRMLSSLYNKD-----DRKMAKSIKRVIK-LDG---GGFRIHTKGC 429
Query: 1894 GVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQK--NNEDPAPEFYNIYLERP 1951
G+VC + G + ++ +LGE+Y W+W+EKQD I+ K N +D P+FYNI L+R
Sbjct: 430 GLVCVNKFGIKTNSLIIPYLGEIYQPWRWYEKQDFIKKQMKEQNKKDILPDFYNIMLDRH 489
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
D DG D++ VD ++K N++SR+ HSC PNC T +G Y IG+Y +R I +GEE+T
Sbjct: 490 LDDEDGIDILFVDPINKGNFSSRLSHSCNPNCGTVTTVSNGTYVIGMYAMRDIQFGEELT 549
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACE 2071
FDY S TESK+E ++CLCGS+ C+ YL L+ + + +L H L R+ ++ +C
Sbjct: 550 FDYCSFTESKQEQLQALCLCGSENCKKYYLGLSNQREYNAILDRTHCFLKRNAILFNSCL 609
Query: 2072 LNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKR 2131
N ++ L+ + +GS LL G P W+ + +L+ FI+ EE + + E
Sbjct: 610 DNFKIDQSLLD--KYKIGSSLLTGCPFWLKCWICQLLVFID-------EEYIIYKAELDT 660
Query: 2132 KYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSP 2191
K+ + E EK + + A+ +R+QNL TLDK+++ ++ PP+ +++
Sbjct: 661 KFILN--EETEKWN-QFTAQLHSEERIQNLIFTLDKIKFFLK----QSDTVEPPLTKITN 713
Query: 2192 EETVSFLW--KGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLL 2249
E+ + W E L EL Q H + ++ + + D D+Q +L + L
Sbjct: 714 EDLIMNFWGMTNESLLSNELYQLFQKHGLKKLMELI---VLIQDKRHLYDVQEQLLLTRL 770
Query: 2250 WLRDEVRNLPCTYKCRHDAAADLI-HIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYA 2308
L + + LI + A+T +F+ EYK F SPP I L+ G
Sbjct: 771 LFLVLSHLLLQQKQSFYYEGLSLILQMMAFTYTYFKPTEYKGFQSPP--IDDLEWGK--V 826
Query: 2309 DKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSF 2362
+ + KTY + GQL+ W+ QT P +L RG L P I SF
Sbjct: 827 GLIKRKCKAEGKTYSSLFAWGQLVGWYKQTVLAPQLSLCVDRRGTLLYPQISSF 880
>gi|412987959|emb|CCO19355.1| predicted protein [Bathycoccus prasinos]
Length = 2064
Score = 245 bits (625), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 156/464 (33%), Positives = 239/464 (51%), Gaps = 41/464 (8%)
Query: 1771 PEVKDYKPRKQLGDQVFEQEVYGIDPYTHN----LLLDSMPDELDWNLLEKHLFIEDVLL 1826
P V + PR + G V EQE YG D T +L +++P+ D ++H I L+
Sbjct: 1120 PCVGERNPRLKSGRDVREQETYGCDFVTGRDAVAVLAEALPEFSD----DEHWGIYAKLM 1175
Query: 1827 RTLNKQVRHFT--GTGNTPMMYPLQPVIEEIEKEAVDDC-DVRTMKMCRGILKAMDSRPD 1883
+N+ T + + + E+ E+ + D C L +
Sbjct: 1176 NQVNESYGKMTPDTLATQSLALAAEDLAEKFERANITSPKDGMKNLACAKALWTFSKKAR 1235
Query: 1884 DK---YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNN---E 1937
+ +V +RKG GVV +E + +FVV+FLGE+YP W W EKQD I+ QK +
Sbjct: 1236 ENPNLFVVHRKGYGVVNIREKNIQKGEFVVDFLGEIYPPWAWMEKQDAIKQAQKAKGLKD 1295
Query: 1938 DPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIG 1997
APEFYN+ +ERP GD G+ L+ DAMH N+A+R+ HSC PN + +T VDG Y+I
Sbjct: 1296 IGAPEFYNMQMERPGGDKHGFGLLFCDAMHYNNFAARMSHSCEPNVQVILTVVDGKYEIH 1355
Query: 1998 IYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELH 2057
Y R I GEE+ ++Y+S ++S +E EA+ CLCG++ CRGSYL+ GE +V H
Sbjct: 1356 FYATREIQKGEELCYNYHSCSDSMKEVEAAFCLCGAKKCRGSYLSFVGENNNSQVFDSEH 1415
Query: 2058 GLLDRHQLMLEAC-------ELNSVSEE---DYLELGRAGLGSCLLGGLPNWVVAYSARL 2107
+LDR+ ++L+A E N ++E LE +G +L P W+ Y A++
Sbjct: 1416 RILDRYAMLLDAIDEAKEKREGNDDNDEVVKTRLESLGFRIGCGILADAPKWLTNYYAKV 1475
Query: 2108 VRFINLERTKLP----EEILRHNLEEKRK----YFSDICLEVEKSDAEVQAEGVYNQRLQ 2159
FI+ ER LP E H++ +++ Y + + +AE++A V R+Q
Sbjct: 1476 ASFIDHERETLPPLIYEAAKEHHINRRKRGDPGYRGEFVY--TEKNAEIEAMAVRENRIQ 1533
Query: 2160 NLAVTLDKVRYVMRC--VFGDPK--KAPPPVERLSPEETVSFLW 2199
LAV + K+R ++ F PK +PPP +LS +ET+ W
Sbjct: 1534 ALAVCMSKIRRLLTLGEGFDSPKYGTSPPPYAKLSAKETIEKFW 1577
Score = 90.9 bits (224), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 63/205 (30%), Positives = 96/205 (46%), Gaps = 14/205 (6%)
Query: 2245 RKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLG 2304
R +LLWLRD + +LP T RHD AD++H++A T+ F++ I +D+
Sbjct: 1733 RAALLWLRDALLDLPVTPCARHDLCADIVHLFANTEHFYKFDHLAPCYQTQAGIQ-IDVR 1791
Query: 2305 PKYADKLGADLQV--------YRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSL 2356
G Q + K Y +Y L+ WH Q ADP + + +GC L
Sbjct: 1792 EDEVMAFGVGAQAASHKIASSHSKKYKHDYIPAALLSWHKQELADPTKLVHVSFKGCAYL 1851
Query: 2357 PDIGSFYAKVQKPSRHRVYGP--KTVRFMLSRMEKQPQRPW-PKDRIWAFKSSPRIFGSP 2413
PDI Y V+ ++ V G + LS + ++ PW PK WA ++ ++ GSP
Sbjct: 1852 PDISCCYG-VRAEAKPIVNGCDLENRSKWLSCLTEKINEPWEPKTGPWAGTNAQKLIGSP 1910
Query: 2414 MLDS-SLTGCPLDREMVHWLKHRPA 2437
MLD+ LD ++ WL+ R A
Sbjct: 1911 MLDAWRKKQSMLDESVLDWLRTRKA 1935
>gi|340508154|gb|EGR33923.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 935
Score = 243 bits (620), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 158/536 (29%), Positives = 278/536 (51%), Gaps = 59/536 (11%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQK--NNEDPAPEFYNIYL 1948
KG+G+ C G +++F+ E++GE+Y W+WFEKQD ++ K N ++ P+F+NI L
Sbjct: 413 KGIGLTCINSQGIQKNEFITEYVGEIYEPWRWFEKQDLLKKFIKENNQQNILPDFWNIML 472
Query: 1949 ERPKGDADGYDLVVVDAMHKANYASRICHSCRPNC-EAKVTAVDGHYQIGIYTVRGIHYG 2007
E K D GYD++ +D + K N++SR+ HSC+ NC V +G Y IG+Y ++ I YG
Sbjct: 473 EIHKDDPKGYDILFIDPIIKGNFSSRLNHSCQANCGTVPVINNEGKYVIGLYAMQQISYG 532
Query: 2008 EEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNL--TGEGAFEKVLKELHGLLDRHQL 2065
EE+TFDY +VTESK+E+ ++CLCGS CRG YL L TG + ++L+++ L R +
Sbjct: 533 EELTFDYMAVTESKQEHNRALCLCGSSKCRGKYLELSTTGIKEYNQILEDISCFLHRTYI 592
Query: 2066 MLEACELN-SVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILR 2124
+ +C N ++ ED L S + G P W++ + + +R IN E +E
Sbjct: 593 LEYSCRKNVQLNAEDEQLLESESFRSNIKQGCPIWLLKWICQSLRIINQEYNIFLQE--- 649
Query: 2125 HNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPP 2184
L K KY + + K A+ + + R+QNL +T++KV+Y + DPK P
Sbjct: 650 --LRNKNKYTN---FRILKYQAQTKKDN----RIQNLVITINKVKYFINKT-NDPK---P 696
Query: 2185 PVERLSPEETVSFLWKGEGSL-----VEELIQCMAPH--------VEEDVLNDLKSKIQA 2231
P+++L+ E ++ LW + + E++Q + + +++N + ++Q
Sbjct: 697 PLQQLNQEYILNILWLNDKQYSIKEGINEILQNIPDNETNYQYVIYSRNLINSIDKQVQI 756
Query: 2232 HD--PSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYK 2289
++ S+ + +R LL + D ++ + + L+H +A+T +F YK
Sbjct: 757 YNSFKQYSQSLLL-IRFGLLTISDFLQQIQ-----NQQITSSLLHFHAFTHIYFTNYAYK 810
Query: 2290 AFTSPPVYISPLDL--GPKYADK--------LGADLQVYRKTYGENYCLGQLIFWHIQTN 2339
FTS + I D+ Y +K L L+ KTY + GQL W+ Q+
Sbjct: 811 QFTSEEILIQKGDVINVELYEEKQSQNQEQGLDNFLKKLSKTYQSLFVWGQLNIWYKQSV 870
Query: 2340 ADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPW 2395
A+P L+ RG + P + +F + +Q +++ + ++++++KQ + W
Sbjct: 871 ANPGNLLSAERRGTIVYPCLQNFIS-IQNDKKYQ-----CSKSIINQIQKQRNKYW 920
>gi|412993322|emb|CCO16855.1| predicted protein [Bathycoccus prasinos]
Length = 1476
Score = 231 bits (589), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 123/335 (36%), Positives = 190/335 (56%), Gaps = 21/335 (6%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
Y + KG+GVVC ++ G FV +LGE+Y W+W+E+ D ++ N E P F+N
Sbjct: 477 YRLHPKGVGVVCIRKEGLQPGMFVNHYLGEMYSPWRWYERCDAMKKRNPNQE--LPSFFN 534
Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
I LERPK D G D V V+AMH+ +ASR+ HSC NC+ V + +G IG+YT I
Sbjct: 535 ITLERPKDDVRGKDTVFVEAMHECEFASRMSHSCAGNCQTTVISHEGKLSIGVYTNSKIE 594
Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQL 2065
GEE+ +DY+ VTES++E+ A++CLC S CRGS+L+ G F V+ E H L R+ +
Sbjct: 595 CGEELCWDYSCVTESEKEFRAAICLCSSPNCRGSFLSYAGSSTFTAVMNEKHNFLHRNAM 654
Query: 2066 MLEACELNSVSEEDYLELGRAGLGSCLLGGL-----PNWVVAYSARLVRFINLERTKLPE 2120
+ AC +++ED L G+ L L P+W+V +++ ++R++ LE LPE
Sbjct: 655 LCRACS-EPLTDEDLALLSDYGIRDSALNTLSGERAPDWLVKWASLILRYVQLEEKLLPE 713
Query: 2121 EILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPK 2180
+ +++ KY +E + AE GV RLQN+ VTLDK++Y +R P+
Sbjct: 714 ALCNLPMQKGVKY------NLEGAKAETY--GVVATRLQNIVVTLDKIKYFLR----QPE 761
Query: 2181 KAPPPVERLSPE-ETVSFLWKGEGSLVEELIQCMA 2214
++ P R + E + + LW G S++ I ++
Sbjct: 762 QSDKPFMREATEADIIEHLWTGSESILVRAIGALS 796
Score = 88.6 bits (218), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 67/202 (33%), Positives = 102/202 (50%), Gaps = 19/202 (9%)
Query: 2221 VLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTK 2280
VL+D+ +K + + SE ++ L + D +RNL H A ADL+ +YA T+
Sbjct: 994 VLDDIIAKSKMTPSTASE-----AKQWLAEVSDSIRNL----GIEHCACADLLLMYARTQ 1044
Query: 2281 CFFRVQEYKAFTSPPVYISPLDLGPK-YADKLGADLQ-VYRKTYGENYCLGQLIFWHIQT 2338
+F +++ F SPPV + D G K A K+ ++ K Y +Y GQ+ W QT
Sbjct: 1045 RWFTPEKFVGFMSPPVQLREHDPGCKQTASKISIHVKNTLTKKYQPHYPWGQMCSWFKQT 1104
Query: 2339 NADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQ-PQRPWPK 2397
DP +L+ RG LSLPD+ S Y + Y KT R L R+ ++ R WP
Sbjct: 1105 IYDPTASLSADRRGTLSLPDVESAY------NNGGAYV-KTDRKQLFRILRENASRNWPT 1157
Query: 2398 DRIWAFKSSPRIFGSPMLDSSL 2419
W+FK+ +++GSP D +L
Sbjct: 1158 TMQWSFKNYAKMYGSPWFDDAL 1179
>gi|159486133|ref|XP_001701098.1| histone methyltransferase [Chlamydomonas reinhardtii]
gi|158271992|gb|EDO97800.1| histone methyltransferase [Chlamydomonas reinhardtii]
Length = 1028
Score = 224 bits (570), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 112/284 (39%), Positives = 170/284 (59%), Gaps = 11/284 (3%)
Query: 1848 LQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDD 1907
L V+E + EA + D + ++ + + + + KG GV+C GG
Sbjct: 725 LLKVLETVAAEAAERGDAPCAQAAEAVIARLRQIGWNYFRLHPKGRGVICRVPGGLEPFT 784
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
FV E+LGE++ W+WFE QD I+ L + P+FYNI LERP+ D DGYD++ V+A
Sbjct: 785 FVEEYLGELHSPWRWFEIQDAIKKLTQQE---LPDFYNITLERPRDDPDGYDVLFVEAAF 841
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
A++ASR+ HSC PNC A V +V+G I +YT R I GEE+TFDY SVTES++EY +
Sbjct: 842 MASFASRMSHSCTPNCAAVVVSVNGRLTIAMYTKRRIEAGEELTFDYRSVTESEKEYREA 901
Query: 2028 VCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAG 2087
+CLCG++ CRGSYL +G AF +V++E H L R L+L A ++E+D+ L
Sbjct: 902 ICLCGTRSCRGSYLYYSGSDAFTQVMEEKHNFLHRQVLLLRASA-EDLTEDDHTRLRAHA 960
Query: 2088 LGSCLLGG-------LPNWVVAYSARLVRFINLERTKLPEEILR 2124
+G LG P+W+V ++A +++++ LE+ +LP +L+
Sbjct: 961 IGPTSLGDGSPGNNRAPDWLVKWAALVLQYVELEKRELPSFLLK 1004
>gi|340503864|gb|EGR30374.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 827
Score = 223 bits (569), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 178/645 (27%), Positives = 301/645 (46%), Gaps = 81/645 (12%)
Query: 1777 KPRKQLGDQVFEQEVYGIDPYTH-NLLLDSMPDELDWNLLEKHLFIEDVLLRTLNKQVRH 1835
K +K + V E +GID YT NL +E D ++KH FI+ LL+ N
Sbjct: 185 KLQKIIDQDVQETLCWGIDLYTKKNLHYILHENECD---IKKHNFIQRSLLKAAN----- 236
Query: 1836 FTGTGNTPMMYPLQPVI-------EEIEKEAVDDCDVRTMKMCRGILKAMDSRPD-DKYV 1887
G M + +I EE K+ + + R K + ILK + D + +
Sbjct: 237 LCGNNGWDMQKVCEFIIQNSKKKDEENNKDYIFNNQDR--KFSKVILKTLKINVDPEAFR 294
Query: 1888 AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQK--NNEDPAPEFYN 1945
+ KG+GV+C G ++D ++++ GE+Y ++WFE+QD ++ K N +D P+FYN
Sbjct: 295 IHSKGMGVICLNRQGIEKNDLIIQYFGEIYRPYRWFERQDFVKKFMKENNQKDVLPDFYN 354
Query: 1946 IYLERPKGDADGYDLVV-------------VDAMHKANYASRICHSCRPNCEAKVTAVDG 1992
I LE K D GYD++V VD M K NY+SR+ HSC PNC T DG
Sbjct: 355 IMLEIHKNDPKGYDILVKKQKKQQNNIKKYVDPMQKGNYSSRLSHSCDPNCGTVATISDG 414
Query: 1993 HYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGA--FE 2050
Y I +Y ++ I YGEE+ FDY++VTESK+E+ + CLCG+ CRG Y+ + +
Sbjct: 415 KYNISMYAMKSIEYGEELAFDYSAVTESKQEHMQATCLCGTYKCRGKYIEFSNNNLKEYN 474
Query: 2051 KVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRF 2110
+L+++H L R+ +L C ++ ED L + + + P+W++ + + +++
Sbjct: 475 FILEKMHCFLKRNSDLLR-CSNEILNSEDLKLLEKHNMRKNITENCPSWLMKWISIILKT 533
Query: 2111 INLERTKLPEEILRHN--LEEKRKYFSDICLEVEKSDAEVQ------------------- 2149
I+ E++ E + N L +K D+ + E+ D +Q
Sbjct: 534 IDEEKSLFLEHQMNTNIFLLHSQKELRDLEEKNEEEDQSLQIKKEEKIKEIQKHVQFINY 593
Query: 2150 -AEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLW-KGEGSLVE 2207
A R+QNL +++DKV+Y ++ V P++ L+ ++ L K + S+++
Sbjct: 594 LANSKVENRIQNLVISIDKVKYFLKKV----NDFQAPLDYLNFDQIFENLCGKNKESILD 649
Query: 2208 ELI--------QCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLP 2259
E+ QC V ++ KS + + + R L + + + +
Sbjct: 650 EIYDLITSYKNQCGQILVYFNIFR--KSFLPKYASISKKQGLLAFRLFCLNISEFFKKIQ 707
Query: 2260 CTYKCRHDAAADL-IHIYAYTKCFFRVQEYKAFTSPPVYISPLDL-GPKYADKLGADLQV 2317
+ H +A + ++ Y++T +F EY + S + IS ++ D +
Sbjct: 708 SNF---HSSATFITLYFYSFTHTYFTPHEYASVCSEKMKISETEMQNLHLLDTEKKKKKH 764
Query: 2318 Y--RKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIG 2360
Y ++ Y + GQL W QT A P TL++ RG LS P I
Sbjct: 765 YEEQRIYSPQFIWGQLTVWFKQTIASPQATLSQDRRGTLSFPSIN 809
>gi|300175979|emb|CBK22196.2| unnamed protein product [Blastocystis hominis]
Length = 671
Score = 216 bits (551), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 120/371 (32%), Positives = 198/371 (53%), Gaps = 24/371 (6%)
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
PEFYNI LERP GYD++ +D + + N+ SR+ HSC PNC V+G I +
Sbjct: 6 PEFYNIMLERPPDSRGGYDVLYIDPIFRGNFGSRMSHSCSPNCATTTITVNGRLAIVLVA 65
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLL 2060
+R I +GEE+ FDY V+ESK E+E + CLCGS CRGS+++ +F +V+ + +
Sbjct: 66 LRPIAWGEELCFDYACVSESKTEFEMATCLCGSLQCRGSFVSYADGNSFMQVMAKRFPFV 125
Query: 2061 DRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPE 2120
R ++L++C ++VS++D L + G+ +L G P W+ + A ++RF+ E LP
Sbjct: 126 KRTAVLLDSCN-SAVSDDDARRLAKHGIKCSMLEGAPAWLQKWIASILRFMEFEEASLPA 184
Query: 2121 EIL-RHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDP 2179
E+ + + Y SD L +E + GV+ RLQN+A+T D+V++ + P
Sbjct: 185 ELRGMKDCLGRDLYPSDAALRLE-------SHGVFATRLQNVAITADRVKHFLA---QQP 234
Query: 2180 K--KAPPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKI-----QAH 2232
+A PP L+ E + FLW G S+++ L++ + + + + Q
Sbjct: 235 AELRAVPPFRLLTDAEVLDFLWFGAHSVMKRLLRAALAEISDLPAVQFFTALDRALQQPR 294
Query: 2233 DPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFT 2292
D + E ++ ++ +RD + LP +C H AAADL+ +Y +T FF EY++
Sbjct: 295 DAATLEWVRLQMTS----VRDGLLTLPAQRRC-HQAAADLLTMYLHTAVFFVATEYRSVK 349
Query: 2293 SPPVYISPLDL 2303
P+ + DL
Sbjct: 350 GAPISLHCYDL 360
Score = 61.2 bits (147), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 52/100 (52%), Gaps = 6/100 (6%)
Query: 2319 RKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRV-YGP 2377
+++Y Y GQL W Q DP +L +GC+ LPD+ S Y + R + Y P
Sbjct: 531 QRSYPSGYVWGQLAMWFKQAGNDPSLSLTNERKGCVLLPDVESCY----ESKRFDLNYTP 586
Query: 2378 KTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDS 2417
+ ++R+E P +PW W F+ S ++G+PM+D+
Sbjct: 587 EERGKWIARLEHCPAQPWLSTH-WTFRRSAHVYGTPMMDA 625
>gi|255073265|ref|XP_002500307.1| set domain protein [Micromonas sp. RCC299]
gi|226515569|gb|ACO61565.1| set domain protein [Micromonas sp. RCC299]
Length = 1496
Score = 213 bits (543), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 151/490 (30%), Positives = 230/490 (46%), Gaps = 96/490 (19%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG+G+VC + G ++ ++LGE+Y W+WFE+QD I+ + + E P+F+NI LER
Sbjct: 823 KGIGIVCIRPEGLPPGTYIQDYLGELYSPWRWFERQDAIKKREPDKE--LPDFFNITLER 880
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGH----------------- 1993
P DA G+D++ V+A H+ +ASR+ HSC PNC+ AV
Sbjct: 881 PAEDAAGHDVLFVEAAHRCTFASRLSHSCAPNCQTVGVAVADQTDQKLDQKLDQNNLDQK 940
Query: 1994 -----------YQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
I YT R + YGEE+ ++Y+ VTES++EY A++CLC S C+G++L+
Sbjct: 941 LGQTADPPRTKLSIAQYTTRHVSYGEELCWNYSCVTESEKEYRAAICLCSSTTCKGAFLD 1000
Query: 2043 LTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGG------- 2095
G AF V+ H LDR+ L++ AC ++ +D L AG+ S L
Sbjct: 1001 YAGSSAFTAVMNVRHNFLDRNALLIRACS-EPLTSDDRARLATAGIKSAALTMPGERTRT 1059
Query: 2096 -----LPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQA 2150
P W++ +++ + +I +E+ LP + ++ + + A A
Sbjct: 1060 GERVECPEWLIKWASLTLEYIEMEKELLPAALTAKPIDG---------IVYDAGFAAATA 1110
Query: 2151 EGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVER-LSPEETVSFL----------- 2198
GV R+ NL VTLDK++YVMR P + P R LS E V L
Sbjct: 1111 AGVVATRISNLVVTLDKIKYVMR----QPGQNRAPFLRHLSDNEVVDHLLGDILKRAADT 1166
Query: 2199 -------------WKGEGSL---VEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQR 2242
+ G+G+ E + E DVL + + A P SE +
Sbjct: 1167 FAKKVGVKAGLPFFGGKGARNAGAEAKMPAAVGQREGDVLRFILG-VLAKPP--SEFTPQ 1223
Query: 2243 ELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLD 2302
E ++L ++R+L H A ADL+ +YA T + + Y F SPPV + PL
Sbjct: 1224 EASQTLETCSRKIRDLGAV----HCAMADLLLLYARTAHWCTPEAYAGFQSPPVRLVPL- 1278
Query: 2303 LGPKYADKLG 2312
PK DKLG
Sbjct: 1279 --PK--DKLG 1284
Score = 67.0 bits (162), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 57/103 (55%), Gaps = 7/103 (6%)
Query: 2317 VYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYG 2376
V +K Y ++ GQL+ W QT DP +L+ RG +SLPD S Y ++ V G
Sbjct: 1356 VMKKKYQPHFAWGQLVSWFKQTIYDPSASLSAERRGAMSLPDPESAYG-----DKNYVTG 1410
Query: 2377 PKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSL 2419
+ R ML ++ + P + WP W+F++ +++GSP +D ++
Sbjct: 1411 DR--RSMLRQIARDPSKMWPTTWAWSFRNPGKVYGSPFIDDAI 1451
>gi|340507839|gb|EGR33721.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 667
Score = 202 bits (514), Expect = 2e-48, Method: Composition-based stats.
Identities = 152/518 (29%), Positives = 252/518 (48%), Gaps = 67/518 (12%)
Query: 1889 YRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDP--APEFYNI 1946
+ KG G++C + G ++DF+ E++G++Y W+WFEKQ+ I+ + K P+F+NI
Sbjct: 122 HTKGKGLICINKKGIKQNDFITEYIGQIYQPWRWFEKQNFIKKIIKEKYKNYILPDFWNI 181
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEA-KVTAVDGHYQIGIYTVRGIH 2005
LE K D GYD++ +D++ K N++S I HSC+PNC +Y I +Y ++ I
Sbjct: 182 MLEIHKDDQKGYDILYIDSISKGNFSSSINHSCQPNCGTFSFITNQKNYVIAVYAIQQIE 241
Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNL--TGEGAFEKVLKELHGLLDRH 2063
YG+E+TFDY ++TES +E + S CLC S CRG YL+L T F ++L ++H LDR
Sbjct: 242 YGQELTFDYMAITESIKEQQLSKCLCMSPNCRGLYLDLQNTNFKQFNQILDKIHNFLDRT 301
Query: 2064 QLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEIL 2123
++ +AC + ED L L ++ P W+ + + ++R IN E +++L
Sbjct: 302 LIIQKAC-FEQFTNEDKLILEEFSFRFNIINDSPEWLQKWISYILRIINQENELFLKQLL 360
Query: 2124 -----RHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGD 2178
+ N +E+++ F+ A+ +QR+QN+ +++DKV+Y + +
Sbjct: 361 GPESEKLNKKEQKQKFN-------------LAQYRKDQRIQNIVISIDKVKYYINQL--- 404
Query: 2179 PKKAPPPVERLSPE-------------------------ETVSFLWKGEGSLVEEL-IQC 2212
+ PP +L+ E FL K +L+E I
Sbjct: 405 -QDFTPPFIKLNNEVLKYNIYIYIYIKQKIRRFFQNIWGNQKGFLKKDFLNLIEYFQINS 463
Query: 2213 MAPHVEEDVLNDLKSKIQ-------AHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCR 2265
P ++ +KS I+ D +I +LR LL L D + L K
Sbjct: 464 KNPK-NVKIIEQIKSLIKNTSECVFKEDNENFSNILIQLRFLLLILSDLIFQLQKDEKII 522
Query: 2266 H-DAAADLIHIYAYTKCFFRVQEYKAFTSPPVYI---SPLDLGPKYADKLGADLQVYRKT 2321
+ D A ++H YA+T FF +YK S P+ I ++L D+ L+ +++
Sbjct: 523 NIDGFAIILHFYAFTHEFFTAYKYKQHQSEPIKIFKDEIINLQFLQDDQQNTFLE-EQQS 581
Query: 2322 YGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDI 2359
Y + GQL W+ QT A P TL +G L P++
Sbjct: 582 YSSLFVWGQLNTWYKQTVAFPATTLGIERKGTLIYPNL 619
>gi|146185998|ref|XP_001032856.2| SET domain containing protein [Tetrahymena thermophila]
gi|146143072|gb|EAR85193.2| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 2057
Score = 193 bits (490), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 98/273 (35%), Positives = 159/273 (58%), Gaps = 21/273 (7%)
Query: 1869 KMCRGILKAMDSRPD-DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQD 1927
K + + KA++ + D + + + KG+GV+C G ++D ++E++GE+Y ++WFE+QD
Sbjct: 819 KFAKILNKAIELKIDQEAFRIHPKGMGVICINRNGIDQNDLIIEYIGEIYRPYRWFERQD 878
Query: 1928 GIRSLQKNN--EDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEA 1985
I+ K+N +D P+FYNI LE K + G D++ VD M K NY+SR+ HSC PNC
Sbjct: 879 FIKKYMKDNNQQDVLPDFYNIMLELHKDEVKGIDILYVDPMQKGNYSSRLSHSCDPNCGT 938
Query: 1986 KVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSY--LNL 2043
T G+Y I ++ ++ I YGEE+ FDY++VTESK E++ S CLCG+Q CRG Y LN
Sbjct: 939 VATISKGYYNISMFALKSIEYGEELAFDYSAVTESKNEHKQSTCLCGTQKCRGKYIELNN 998
Query: 2044 TGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAY 2103
+ + +L ++H L R+ +L + ++E+D L + L + G W
Sbjct: 999 NNQKEYNYILDKIHCFLKRNSDLLRSGS-EPLTEDDMNLLEKYNLKQNVQKGCEKW---- 1053
Query: 2104 SARLVRFINLERTKLPEEILRHNLEEKRKYFSD 2136
L+++I++ IL+ EE+ +FSD
Sbjct: 1054 ---LLKWISI--------ILKSVGEEQELFFSD 1075
Score = 88.2 bits (217), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 79/294 (26%), Positives = 135/294 (45%), Gaps = 42/294 (14%)
Query: 2157 RLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLW----------KGEG--- 2203
R+QN+ ++LDKV++ + V K PP+ LS +E LW K E
Sbjct: 1372 RIQNIIISLDKVKFYLNNV----KDIRPPLSYLSQKEIFENLWGRVNKFQKRRKPENQVY 1427
Query: 2204 SLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYK 2263
S+VEEL+ + + + I+ DP ++ ++ K+LL R N+ +
Sbjct: 1428 SIVEELMDLIKYYSHYKECQYTQKFIELFDPFMTKYVEESYEKALLAWRLFCLNIHSVFN 1487
Query: 2264 CRHD----AAADLI--HIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKLGADLQV 2317
D A A LI + YA+T +F EY+ F S + IS ++ + L D +
Sbjct: 1488 DIIDPQFHAGALLIVLYFYAFTHTYFTPHEYQPFNSEKMTISETEMFN--LELLDEDKKN 1545
Query: 2318 Y-----------RKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIG-SFYAK 2365
+++Y + GQL W QT A P TL++ RG LS P + SF
Sbjct: 1546 TKKPNKKKNYEEQRSYSSQFIWGQLTVWFKQTVASPQATLSQDRRGTLSYPQLNQSFKTN 1605
Query: 2366 VQK-PSRHRVYGPKTVR-FMLSRMEKQPQRPWPKDRI-WAFKSSPRIFGSPMLD 2416
+ P + + KT R L+ M+++P+ WP ++ W+FK++ + +G+ + +
Sbjct: 1606 ILTYPFQEK--NDKTGRQTFLNHMKEKPKDMWPPEQAKWSFKNALKNYGTLLFE 1657
>gi|397573767|gb|EJK48860.1| hypothetical protein THAOC_32309, partial [Thalassiosira oceanica]
Length = 1092
Score = 192 bits (488), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 101/284 (35%), Positives = 145/284 (51%), Gaps = 22/284 (7%)
Query: 1871 CRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIR 1930
R + A D+ DD + + KG G V +GG + + + GEVYP W+W EK D I
Sbjct: 512 ARLVRLASDAVDDDFFRIHPKGHGSVVIGDGGLKANSLITYYRGEVYPAWRWCEKLDAIE 571
Query: 1931 SLQK--NNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVT 1988
+QK N P+FYN+ +ERPK D GY L+ VDA K+ S HSC P CE +V
Sbjct: 572 RVQKEKNLRPNLPDFYNMAMERPKKDPRGYCLLFVDASRKSGLGSSFSHSCNPTCEVRVV 631
Query: 1989 AVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGA 2048
++ G Q+ + T+R + GEE+TFDYN+VTES +EY +VCLCG + CRGS+L+
Sbjct: 632 SLHGKLQLSMTTLRDLEQGEELTFDYNAVTESLDEYRFAVCLCGQRRCRGSFLHYATADC 691
Query: 2049 FEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGG------------- 2095
+++VL + R ++ C +S ED L R G + G
Sbjct: 692 YQQVLSRNSPMAARFANLVRGCTKQVMSREDSAILARHGFNTAAFGAVSFNHHAAATSLV 751
Query: 2096 -------LPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRK 2132
+P W+ Y A +R+I ER LP +L + +E K
Sbjct: 752 SRDSIDNVPIWLRTYVADCLRYIEYERRALPVALLCNQMERMSK 795
>gi|147775274|emb|CAN61590.1| hypothetical protein VITISV_033129 [Vitis vinifera]
Length = 576
Score = 190 bits (482), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 94/144 (65%), Positives = 112/144 (77%), Gaps = 1/144 (0%)
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+P+ + ++LVV DA+HKANYASRICH CRPN EAK+TAV+G YQIGIYTVR I GEE
Sbjct: 319 QPQNKPNPWNLVV-DAIHKANYASRICHLCRPNREAKITAVEGQYQIGIYTVRQIQCGEE 377
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEA 2069
I FDYNSVTESK+EYE SVCLCGSQVCR SYLNLTGEGAF+KVLK HG+LD++QLM E
Sbjct: 378 IIFDYNSVTESKKEYEVSVCLCGSQVCRMSYLNLTGEGAFQKVLKGCHGILDQYQLMSEL 437
Query: 2070 CELNSVSEEDYLELGRAGLGSCLL 2093
L+++ + R LG +L
Sbjct: 438 YTLSAMLRKFIENHTRLSLGKTVL 461
Score = 85.5 bits (210), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 36/60 (60%), Positives = 46/60 (76%)
Query: 1043 LGEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVR 1102
+WYYLDGAGHE+ PSSFSELQ LVDQ IQKH+SV K +K+W+P+TFA + + V+
Sbjct: 258 FSDWYYLDGAGHEQWPSSFSELQSLVDQDSIQKHSSVLGKINKIWIPITFAADVPDAAVK 317
Score = 71.2 bits (173), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 58/103 (56%), Gaps = 8/103 (7%)
Query: 756 EDLHIDVRVGALLDGFTVIPGKEIETLGEILQTTFERVDWQNNG--GPTWHGACVGEQKP 813
E L ID RV ALL FT IPG+E+ETLGE+LQ +FE W+ G G +WH +G Q
Sbjct: 162 EGLQIDERVRALLKSFTFIPGRELETLGEVLQASFEHAQWEKLGAEGLSWHQLRIGGQP- 220
Query: 814 GDQKVDELY-ISDTKMKEAAELKS---GDKDHWVVCFDSDEWF 852
DQ++D + + KEA + + DKD+ D +W+
Sbjct: 221 -DQRIDRFFRYPEITSKEALDSRLSTFSDKDYAFAFGDFSDWY 262
>gi|307109213|gb|EFN57451.1| hypothetical protein CHLNCDRAFT_142939 [Chlorella variabilis]
Length = 865
Score = 186 bits (473), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 163/567 (28%), Positives = 244/567 (43%), Gaps = 154/567 (27%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG+G++C ++GG F +++K D P+FYNI LER
Sbjct: 383 KGVGLICKQQGGIPPLTF---------------------DAVKKITGDELPDFYNIVLER 421
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
PK D DGYD++ +DA K ASR+ HSC PNC+A V A G I +YT+R EE+
Sbjct: 422 PKDDPDGYDVLFIDAAAKGALASRMSHSCTPNCQAIVMACGGRLTIALYTLR-----EEL 476
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEAC 2070
TFDY+SVTES++E GSYL TG AF++V+ H ++
Sbjct: 477 TFDYSSVTESEKE--------------GSYLYFTGSRAFQQVMNTKHTVM---------- 512
Query: 2071 ELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARL-----VRFINLERTKLPEEILRH 2125
W +A A L +I E L E++L H
Sbjct: 513 ---------------------------GWAIARLAALSTEIICEYIEEEEAHLKEDLLGH 545
Query: 2126 NLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPP 2185
L ++ A +A+GV RLQN+ +TLDKV+ V++ ++
Sbjct: 546 PLG-----------IYNEASATAEAKGVVINRLQNVVITLDKVKMVLQAPNQTDEELQSA 594
Query: 2186 VERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELR 2245
V+R S E G+L + M P ++ D + K
Sbjct: 595 VQRHSAELP--------GALSKMCSLVMQPALD---FADARCK----------------- 626
Query: 2246 KSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQE-YKAFTSPPVYISPLDLG 2304
L+ L +++R L A ADL+ +YA T +F + YK TSPPV ++ DL
Sbjct: 627 --LMQLYEQLRALDVENNGGLTAVADLLLLYASTLHWFTCERGYKGVTSPPVPLNLADLA 684
Query: 2305 --------PKY--------------ADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADP 2342
P +DKL + RK Y Y GQL W QT DP
Sbjct: 685 LDRTQEQTPAAAAAAVAAAAAAVVDSDKLLGSSNL-RKVYRPLYLWGQLSGWFKQTVNDP 743
Query: 2343 DCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTV---RFMLSRMEKQPQRPWPKDR 2399
+L+ RG +SLPD+ S +A + +R+ T+ ++ +++K+P W
Sbjct: 744 TASLSAERRGTISLPDVDSSFAGGK--TRYTAKASSTLCDRGDLIDQLDKRPDAMWRTGT 801
Query: 2400 IWAFKSSPRIFGSPMLDSSLTGCPLDR 2426
+W+F++ +++GSPMLD+ C L R
Sbjct: 802 LWSFRNEAKVYGSPMLDA--VWCELSR 826
>gi|147814949|emb|CAN70304.1| hypothetical protein VITISV_006637 [Vitis vinifera]
Length = 694
Score = 182 bits (461), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 105/213 (49%), Positives = 134/213 (62%), Gaps = 14/213 (6%)
Query: 651 MECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSITSDSVTQL 710
ME GPS+LCDLK VE GVLVSDH IKH+DS+RW T++NA S LV VNFP + D+VTQL
Sbjct: 1 MERGPSKLCDLKKFVE-GVLVSDHLIKHIDSDRWLTIKNAASLLVPVNFPLLVLDTVTQL 59
Query: 711 VSPPEASGNLLADTGDTAQSTG---EEFPVT--LQSQCCPDGSAAAAESSEDLHIDVRVG 765
VSPPEA GN LA+ GDT +S EE P LQS C + ++ A+E E L ID RV
Sbjct: 60 VSPPEAPGNPLAEAGDTTESNKLLEEETPAATLLQSMSCNNDNSIASEPLEGLQIDERVR 119
Query: 766 ALLDGFTVIPGKEIETLGEILQTTFERVDWQNNG--GPTWHGACVGEQKPGDQKVDELY- 822
ALL F IPG+E+ETLGE+LQ +FE W+ G G +WH +G Q DQ++D +
Sbjct: 120 ALLKSFAFIPGRELETLGEVLQASFEHAQWEKLGAEGLSWHRLRIGGQP--DQRIDRFFR 177
Query: 823 ISDTKMKEAAELKS---GDKDHWVVCFDSDEWF 852
+ KEA + + DKD+ D +W+
Sbjct: 178 YPEITSKEALDSRLSTFSDKDYAFDFGDFSDWY 210
Score = 122 bits (305), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 55/72 (76%), Positives = 62/72 (86%)
Query: 1954 DADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFD 2013
+ + ++LVVVDA+HKANYASRICH CRPN EAKVTAV+G YQIGIYT+R I GEEI D
Sbjct: 214 ELNPWNLVVVDAIHKANYASRICHLCRPNREAKVTAVEGQYQIGIYTIRQIQCGEEIILD 273
Query: 2014 YNSVTESKEEYE 2025
YNSVTESKEEYE
Sbjct: 274 YNSVTESKEEYE 285
>gi|255084155|ref|XP_002508652.1| set domain protein [Micromonas sp. RCC299]
gi|226523929|gb|ACO69910.1| set domain protein [Micromonas sp. RCC299]
Length = 1342
Score = 178 bits (451), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 194/760 (25%), Positives = 322/760 (42%), Gaps = 157/760 (20%)
Query: 1783 GDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNKQVRHFTGTGNT 1842
G V E+ G D +T + ++MP N + +H D + + + +R G
Sbjct: 166 GVDVAMVELKGFDAHTREKIAENMP-----NAVAEHDV--DEFIDLVAQTMRLDAVAGAD 218
Query: 1843 PMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGG 1902
P + I E + + ++K P + + A KG+G+V K+GG
Sbjct: 219 PSLELAAKTIAESKAATN-----AARACAKALVKLCAKDPKE-FKAKSKGVGLVVIKDGG 272
Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKN--NEDPAPEFYNIYLERPKGDADGYDL 1960
+D ++ + GE+YP W+WFEK+ ++++++ +D P FYN +ER D GYD+
Sbjct: 273 IPKDAYLGAYCGELYPGWRWFEKEAAAQAVRRDVKRDDEVPTFYNAAVERDLHDPRGYDV 332
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
+ +D M K + +R HSC+PN E +V +G Y + + T R + GEEI +DY T+S
Sbjct: 333 LFIDGMVKGSVLTRASHSCQPNAEMRVRVREGKYSVEMVTTREVRTGEEICWDYRCQTDS 392
Query: 2021 KEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLL-DRHQLMLEACELNSVSEED 2079
++E ++CLCGS+ CR SYL+ GE E +L D +L+ + +++ +
Sbjct: 393 EKEMRRAICLCGSKNCRVSYLHYNGESELAAFADERCAVLHDAARLLASCVDADALRLPE 452
Query: 2080 YLEL---GRAGLGS------------------------------CLLGGLPNWVVAYSAR 2106
L L G G G+ +L GLP W ++++
Sbjct: 453 TLNLNPKGTPGKGTPGKRGNNGGKRADDHWKSALIAAGVRADDEGMLAGLPQWARKFASK 512
Query: 2107 LVRFINLERTKLPEEILRHNLEE--KRKYF------------------------------ 2134
V + E+ ++L H+L E KR+ +
Sbjct: 513 CVATAHEEK-----KVLTHSLYESAKRRAYEAIDAARAEAAEYETDPEAWKKRFPRTVST 567
Query: 2135 --------SDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVM--RCVFGDPKK--- 2181
SD L +DA+ +A G++ R+Q+LAVT+DKVR V+ GD +K
Sbjct: 568 PPTVPHEPSDADL---GADAKAEASGIHAARIQSLAVTMDKVRRVLAVHARGGDAEKPAA 624
Query: 2182 ------APPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEE--DVLNDLKSKIQAHD 2233
A PP+ L+ E+ + L L A +V+ D L+ + A D
Sbjct: 625 GVDVSAAAPPLRLLNDEDAAAHLVAYASRLAS------AANVDNPFDALS-VDRLSAADD 677
Query: 2234 PSGS-EDIQRELRKSL-----------LWLRD-EVRNLPCTYKCRHDAAADLIHIYAYTK 2280
+GS ED +R +R LW R E T + A DL ++ + T+
Sbjct: 678 ENGSDEDARRWVRSDAARRILREMADNLWKRSLEESGASDTDRIARLACGDLAYLASQTR 737
Query: 2281 CFFR-VQEYKAFTSPPVYISPLDLGPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTN 2339
FF V + F SP V + + + GA + V R Y ++ LG L W +
Sbjct: 738 NFFAPVAGGERFKSPYVAMG------SHGSRDGAGV-VRRMDYPKHSALGFLCTWREEYL 790
Query: 2340 ADPDCTLARASRGCLSLP--DIGSFYA-KVQKPSRHRVYGPKTVRFMLSRM-----EKQP 2391
P LA +RG L LP D G+ Q P ++R ++ + + + +
Sbjct: 791 ERPCDRLATDARGGLCLPRFDRGALANYSTQNPGKNRRRAHASIDALSAHLCGEMAARGG 850
Query: 2392 QRPW-PKDRIWAF---------KSSPRIFGSPMLDSSLTG 2421
+ W P D W + + P + GSP++D++L G
Sbjct: 851 GKAWAPIDGCWRWDFDLDAGDERDGP-VLGSPVVDAALGG 889
>gi|147866108|emb|CAN83034.1| hypothetical protein VITISV_019861 [Vitis vinifera]
Length = 343
Score = 177 bits (449), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 93/160 (58%), Positives = 112/160 (70%), Gaps = 7/160 (4%)
Query: 651 MECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSITSDSVTQL 710
ME GPS+LCDLK VE GVLVSDH IKH+DS+RW T++NA S LV +NFP + SD+VTQL
Sbjct: 1 MERGPSKLCDLKKFVE-GVLVSDHLIKHVDSDRWLTIKNAASLLVPMNFPPLVSDTVTQL 59
Query: 711 VSPPEASGNLLADTGDTAQSTG---EEFPVT-LQSQCCPDGSAAAAESSEDLHIDVRVGA 766
VSPPEA GN L + GDT +S EE P T LQS C + S+ A+E E L ID RV A
Sbjct: 60 VSPPEAPGNPLVEAGDTTESNKLMEEETPATLLQSMSCNNDSSIASEPLEGLQIDERVRA 119
Query: 767 LLDGFTVIPGKEIETLGEILQTTFERVDWQNNG--GPTWH 804
LL F IPG+E+ETLGE+LQ +FE W+ G G +WH
Sbjct: 120 LLKSFAFIPGRELETLGEVLQASFEHAQWEKLGAEGLSWH 159
>gi|219116062|ref|XP_002178826.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409593|gb|EEC49524.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 2187
Score = 173 bits (438), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 126/405 (31%), Positives = 184/405 (45%), Gaps = 62/405 (15%)
Query: 1785 QVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLN-----------KQV 1833
+V EQ V+G+D YT + + E D++ FIE LL +N
Sbjct: 698 EVVEQPVWGMDCYTRRNIASCL--ETDFDPATALHFIEKWLLPAINACPIDLAHKISNAA 755
Query: 1834 RHFTGTGNTPM-------------------MYPLQPVIEEIEKEAVDDCDVRTMKMCRGI 1874
R G M ++ P+ + + ++ V +
Sbjct: 756 RILEGLPFESMEDGEYGEKENINDRKTPEKLWAYSPLGKALREKIKVAAPVWLTAAAYLL 815
Query: 1875 LKAMDSRPDDKYVAYRKGLG-VVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQ 1933
KA + D + + KG G V+ N + G + V + GEVYP W+W EK D I Q
Sbjct: 816 RKAYTALGPDFFRVHPKGHGSVLLNSK--VGANTLVTFYRGEVYPSWRWGEKMDAIEITQ 873
Query: 1934 -KNNEDPA-PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVD 1991
+ PA P+FYN+ LERP+ D GY L+ VDA KA + S + HSC P CE +VTAV+
Sbjct: 874 SRKALKPALPDFYNMALERPQIDPRGYGLLFVDASRKAGHGSSLSHSCAPTCEVRVTAVN 933
Query: 1992 GHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEK 2051
G + + T+R + GEE+TFDYN+VTES EY ++VCLCG CRGS+L+ ++
Sbjct: 934 GELTLAMTTLRELEMGEELTFDYNAVTESLNEYRSAVCLCGYGKCRGSFLHFATADCYQL 993
Query: 2052 VLK-----------------------ELHGLLDRHQLMLEACELNSVSEEDYLELGRAGL 2088
VL E +L H + A SV+ + LE G+ G+
Sbjct: 994 VLNRNAPIATRFANLVKGSMKQVMSDEDTRVLHNHGFLTAAFGAISVNRRNLLEGGQKGV 1053
Query: 2089 GSCLLGGLPNWVVAYSARLVRFINLERTKLPEEIL-RHNLEEKRK 2132
L +P W+ + A +R+I ER LP ++ H KRK
Sbjct: 1054 LDT-LDIVPVWLRTFVADTLRYIEYERRALPIALICDHVSSAKRK 1097
>gi|299473409|emb|CBN77807.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 3474
Score = 172 bits (437), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 162/634 (25%), Positives = 279/634 (44%), Gaps = 101/634 (15%)
Query: 1675 SDMDFRSDGRARESRGAGDFTTDEGLDFSDDREWGARMTKASLVPPVTRKYEVIDQYVIV 1734
+ + +G +ES G G + G +F +D G R ++L+ K + I
Sbjct: 711 TTLAIAGEGATQESVGGGASGAENGGNFLNDE--GIRAQVSNLL----EKMCTMVSINIK 764
Query: 1735 ADEEDVRRK----MRVSLPEDY-AEKLNAQKNGSEELDMELPEVKDYKPRKQLGD-QVFE 1788
D+ +V+++ R + D N++ G + D+ +P +D LG Q+ E
Sbjct: 765 NDKNEVQKQPLTLTRYRVKSDSKGTSANSKGAGDSKDDVVVPAQED----GSLGQAQLEE 820
Query: 1789 QEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNKQVRHFTGTGNTPMMYPL 1848
+ V+GID YT + + + L + + +I LL LN+Q G P + L
Sbjct: 821 RSVWGIDCYTRSNVEHMLDLTLGLSKEQAQHWITTTLLPALNRQ-NGPRGVDMLPALRDL 879
Query: 1849 QPVIEEIEKE-----------------------------AVDDCDVRTMKMCRGILKAMD 1879
VI + E E + +++ +A +
Sbjct: 880 CKVIPDGETEEELEASRARAEAEEFELGPKAEGEDLLAAQALLHAIEGLQLLHQEHRATE 939
Query: 1880 SRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIR--SLQKNNE 1937
S + ++ KG GV+C + G D FV E+LG++YP W+W EK I L+ +
Sbjct: 940 SLVRCYFHSHPKGTGVICKAKEGLKADTFVSEYLGDLYPSWRWNEKLSAIEEAKLKHGLK 999
Query: 1938 DPAPEFYNIYLERPKGDADGYDLVVVDAM-HKANYASRICHSCRPNCEAKVTAVDGHYQI 1996
P+FYN +ERPK DA G+ L+ V+A H N++S + HSC NC + +G +
Sbjct: 1000 PDLPDFYNFMMERPKEDARGFGLLHVEAGNHVGNFSSSLSHSCNSNCTTATSVRNGRLCV 1059
Query: 1997 GIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKEL 2056
+ T R I +GEE+T +Y ++T + EY +VCLCGS +C+ S++ TG ++ K+L+ L
Sbjct: 1060 TLSTTRAIAFGEELTMNYGAITSCETEYGKAVCLCGSNLCQQSFMTFTGMDSYSKILRGL 1119
Query: 2057 HGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGL-PNWVVAYSARLVRFINLER 2115
L+ L+ A + +S + L + GL S LG L P W+ ++A +R++ ER
Sbjct: 1120 GPLMVFRGLIQSAAD-TPISSGELETLQKFGLKSSALGDLCPIWLKKFAAMQLRYVEFER 1178
Query: 2116 TKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMR-- 2173
KLP ++ Y S A++++ V ++R+++L L V + ++
Sbjct: 1179 RKLPPTLMASG---DHTYQS----------ADIESHQVMDRRIRSLVEVLSGVYHFLQEQ 1225
Query: 2174 ------------------------CVFGDPKKAP--PPVERLSPEETVSFLWKGEGSLVE 2207
DP PP++ L +E V LW G S++
Sbjct: 1226 KTAHSRPLPEGFEPPAPAAGGRAGAANEDPATLAERPPLKLLVDDEVVEALWSGRQSMMR 1285
Query: 2208 ELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQ 2241
L++ + + + K ++ DP EDIQ
Sbjct: 1286 RLLRRL------EAIYCAKLVVETPDP---EDIQ 1310
>gi|303285194|ref|XP_003061887.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226456298|gb|EEH53599.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 1561
Score = 169 bits (428), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 82/227 (36%), Positives = 122/227 (53%), Gaps = 27/227 (11%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG+GVVC + G +V ++LGE+Y W+W+E+QD I+ + E P+F+NI LER
Sbjct: 762 KGVGVVCIRPEGLPAGTYVNDYLGEIYAPWRWYERQDAIKKREPGKE--LPDFFNITLER 819
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDG------------------ 1992
P DA GYD + V+A H+ ++SR+ HSC PN VD
Sbjct: 820 PAEDAAGYDTLFVEAAHRCTFSSRLSHSCAPNVHTVGVVVDASESGGDTSDDKAAEEKKA 879
Query: 1993 ------HYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGE 2046
I YT R + YGEE+ ++Y+ VTES++EY A++CLC + C+G++L+ G
Sbjct: 880 RESDAAKLTIAQYTTRRVEYGEELCWNYSCVTESEKEYRAAICLCSAPTCKGAFLDYAGS 939
Query: 2047 GAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLL 2093
AF V+ H LDR+ +++ AC V+ D L G+ S L
Sbjct: 940 SAFTVVMARRHNFLDRNAILMRACS-EPVTPADRALLAANGIKSSAL 985
Score = 114 bits (286), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 105/369 (28%), Positives = 163/369 (44%), Gaps = 68/369 (18%)
Query: 2097 PNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQ 2156
P W+V ++A + +++LER LP+ +L L+ + + + A A GV
Sbjct: 1166 PEWLVKWAALTLEYVDLERALLPDALLEQPLDG---------IAYDAAFASATAAGVVAT 1216
Query: 2157 RLQNLAVTLDKVRYVMRCVFGDPKKAPPPVER-LSPEETVSFLWKGEGSLV-----EELI 2210
RLQN+ +TLDK++YV+R P + P R L+ EE V LW GE ++ E I
Sbjct: 1217 RLQNIIITLDKIKYVLR----QPGQCRAPFLRPLTEEEVVDHLWSGEHGVLKRAAEEATI 1272
Query: 2211 QCMAPHVEEDVLNDLKSKIQAHDPSGSED--------IQREL------------RKSLLW 2250
D A P S D R+L R LL
Sbjct: 1273 AAKCKGALAKRQRDRGGGATAAPPKPSVDRPDAASCAALRDLLDGPRPRDAKAARAGLLT 1332
Query: 2251 LRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADK 2310
+ +R+ P AAAD + +YA+ + + +++ FTSPPV++ PL G + A K
Sbjct: 1333 ASNILRDAPSGAH---AAAADALFMYAHVEHWCTPEKFNGFTSPPVHLEPLPPGERRA-K 1388
Query: 2311 L-----GADLQVYRKTYGE---------------NYCLGQLIFWHIQTNADPDCTLARAS 2350
L G + +K Y ++ GQL+ W QT DP +L+
Sbjct: 1389 LPMFCKGDAANIAKKKYQARSISHWSPYDRVGVPHFAWGQLVSWFKQTVYDPSASLSAER 1448
Query: 2351 RGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIF 2410
RG +SLPD SFYA R R + ML +E++P WP ++F++ +++
Sbjct: 1449 RGTMSLPDPESFYACAPGEYRKR-----ERKAMLKSLERKPDAMWPTTWSFSFRNPAKVY 1503
Query: 2411 GSPMLDSSL 2419
GSP LD ++
Sbjct: 1504 GSPWLDDAI 1512
>gi|224002090|ref|XP_002290717.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220974139|gb|EED92469.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 3070
Score = 166 bits (419), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 109/348 (31%), Positives = 158/348 (45%), Gaps = 36/348 (10%)
Query: 1785 QVFEQEVYGIDPYTH----NLLLDSMPDELDWNLLEKHLF-------------------- 1820
+V EQEV+GID YT L+ E+ LEK L
Sbjct: 760 EVAEQEVWGIDCYTRRNVMTLIETEFSSEIATEFLEKWLLPAINACPIDLAHKMSTAAKI 819
Query: 1821 IEDVLLRT-------LNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMC-R 1872
+E + + T ++ Q R + N P + + + +K R
Sbjct: 820 LEGLPISTDTEDCPSISMQTRQNSPDKNKPKSPESSVFLRTALESKIKQFGPPWLKAAAR 879
Query: 1873 GILKAMDSRPDDK--YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIR 1930
I A DS +D + + KG G V E G + V + GEVYP W+W EK D I
Sbjct: 880 LIRLASDSLDEDDGFFRIHPKGHGSVVIGEEGLKANSLVTYYRGEVYPAWRWCEKLDAIE 939
Query: 1931 SLQKNN--EDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVT 1988
QK P+FYN+ +ERPK D GY L+ VDA K+ S HSC P CE +V
Sbjct: 940 LTQKQLGLRPNLPDFYNMAMERPKKDPRGYGLLFVDASRKSGLGSSFSHSCNPTCEVRVV 999
Query: 1989 AVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGA 2048
A++G + + T+R + GEE+TFDYN+VTES EY ++CLCG + CRGS+L+
Sbjct: 1000 ALNGKLSLSMTTLRDLEQGEELTFDYNAVTESLNEYRFAICLCGHKKCRGSFLHFATADC 1059
Query: 2049 FEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGL 2096
+++VL + R ++ +S ED L + G + G +
Sbjct: 1060 YQQVLSRNSPIAARFANLVRGSMKQVMSREDSELLLKHGFNTAAFGAV 1107
>gi|302839886|ref|XP_002951499.1| hypothetical protein VOLCADRAFT_117846 [Volvox carteri f.
nagariensis]
gi|300263108|gb|EFJ47310.1| hypothetical protein VOLCADRAFT_117846 [Volvox carteri f.
nagariensis]
Length = 1516
Score = 159 bits (402), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 92/253 (36%), Positives = 146/253 (57%), Gaps = 20/253 (7%)
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
+A++ASR+ HSC PNC A V +V+G I +Y R I GEE+TFDY+SVTES++EY +
Sbjct: 821 QASFASRMSHSCTPNCAAVVVSVNGRLTIAMYAKRRIEPGEELTFDYSSVTESEKEYREA 880
Query: 2028 VCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAG 2087
+CLCGS+ CRGSYL +G AF +V+++ H L R ++L A + E D+ L
Sbjct: 881 ICLCGSRNCRGSYLYYSGSTAFTQVMEQRHNFLHRQTILLRAST-EPLLESDWTRLKSHS 939
Query: 2088 LGSCLLG-------GLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICLE 2140
LG LG P+W+V ++A ++ ++ LE+ +LP+ +L+ + R
Sbjct: 940 LGPTSLGDGGPGNNKAPDWLVKWAALVLEYVELEKRELPQVLLQLPPQLGR--------- 990
Query: 2141 VEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLWK 2200
A ++AE + R+Q + +TLDKV+ V+R G + A P+ LS E V+ LW
Sbjct: 991 YTSESAAIEAEAIAQNRVQQIVITLDKVKQVLRQP-GQLQTA--PMRLLSESEVVAHLWS 1047
Query: 2201 GEGSLVEELIQCM 2213
G S+ + +++ +
Sbjct: 1048 GSNSIAKRVLKAV 1060
Score = 58.5 bits (140), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 47/95 (49%), Gaps = 6/95 (6%)
Query: 2322 YGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVR 2381
YG + GQL W+ QT DP +L+ RG LSLPD+ S Y K Y K
Sbjct: 1392 YGPWFMWGQLSGWYKQTVYDPTASLSAERRGTLSLPDVESCYGARAK------YTFKDRA 1445
Query: 2382 FMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLD 2416
+L +E+ P W + F++ +I+GSPM D
Sbjct: 1446 AVLRHLEQCPDAQWKTSLPFGFRNDAKIYGSPMFD 1480
>gi|303286928|ref|XP_003062753.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455389|gb|EEH52692.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 1401
Score = 147 bits (371), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 126/489 (25%), Positives = 210/489 (42%), Gaps = 83/489 (16%)
Query: 1790 EVYGIDPYTHNLLLDSM---PDELDWNLLEKHLFIEDVLLRTLNKQVRHFTGTGNTPMMY 1846
E+ G+D YT +L++M ++ L E E VL + + +R G P ++
Sbjct: 183 ELEGVDAYTRERVLEAMTASAEDGGAGLSEDD--AEKVLAKVF-QTMRLRAVAGKDPGLH 239
Query: 1847 PLQPVIEEIEKEAVDDCDVRTMKMC------RGILKAMDSRPDDKYVAYRKGLGVVCNKE 1900
+ + K D + ++ C +L + ++ + A KG G+VC +
Sbjct: 240 HAAKTVANVPKRTPRDTEDLNLRDCGWMEAPALVLAKLCAKEPKEIRARSKGHGLVCVRA 299
Query: 1901 GGF--GEDDFV----------VEFLGEVYPVWKWFEKQDGIRSLQKN--NEDPAPEFYNI 1946
G G F+ V + +YP W+WFEK+ + ++++ +++ P FYN
Sbjct: 300 DGIPKGASSFLAPPDWSPYDRVRVVNALYPGWRWFEKEVAAQRVRRDVRDDEDVPVFYNA 359
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
+ER D GYD++ VD M K + +R HSC PN E +V +G Y + + + I
Sbjct: 360 AVERDVADPKGYDMLFVDGMVKGSLLTRASHSCEPNAEMRVRVREGSYAVEMVSTCHIAR 419
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLK-ELHGLLDRHQL 2065
GEE+ +DYN T+S+ E + ++C CG++ CR SYL+ G+ F L+ H L
Sbjct: 420 GEEVCWDYNCQTDSEREMKRAICCCGAKRCRVSYLHYAGDDDFASYLRARQHVAATTAAL 479
Query: 2066 MLEACELNSVSEEDYL----------ELGRAGLG---------SCLLGGLPNWVVAYSAR 2106
+ +C S +L AGL +L GLP W + ++A
Sbjct: 480 LRASCTSTSRPPPSSSSSITMSDIIRQLSDAGLKLGDASDDTERGVLSGLPEWTLRFAAS 539
Query: 2107 LVRFINLERTKLPEEILRHNLEEKRKYFSDICLEVEKS--------------------DA 2146
V++I E+ L + L + K + + DA
Sbjct: 540 AVKYIADEKAALRVTLASQALVKAAKARAAAAKDGGAGGGDGSAAAAAAAKAAKAAHLDA 599
Query: 2147 EVQAEGVYNQRLQNLAVTLDKVRYVM-----------------RCVFGDPKKAPPPVERL 2189
E +A GV RLQ+L VTLDKVR+V+ + V + APPP++ L
Sbjct: 600 ESEASGVAAGRLQSLVVTLDKVRHVLSTDAAGGTDPVRGMANEKMVADGVRAAPPPLKAL 659
Query: 2190 SPEETVSFL 2198
+ ++ L
Sbjct: 660 TAAHALTHL 668
>gi|147823106|emb|CAN66333.1| hypothetical protein VITISV_000601 [Vitis vinifera]
Length = 333
Score = 135 bits (341), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 63/80 (78%), Positives = 70/80 (87%), Gaps = 1/80 (1%)
Query: 1973 SRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCG 2032
+ ICH RPNC+A +TAV+G YQI IYTVR I YGEEITFDYNSVTESK+EYE SVCLCG
Sbjct: 29 TTICHLRRPNCKA-ITAVEGQYQIRIYTVRQIQYGEEITFDYNSVTESKKEYEESVCLCG 87
Query: 2033 SQVCRGSYLNLTGEGAFEKV 2052
SQVCR SYLNLTGEGAF+K+
Sbjct: 88 SQVCRMSYLNLTGEGAFQKL 107
>gi|147855182|emb|CAN83840.1| hypothetical protein VITISV_023231 [Vitis vinifera]
Length = 533
Score = 135 bits (341), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 89/206 (43%), Positives = 122/206 (59%), Gaps = 30/206 (14%)
Query: 1573 REEMMKSWKDESPAGLYSATSKYKKKLSKMVSERKYMNRSNGTSLANGDFDYGEYASDRE 1632
+EE+ + WK+ESP+GL S+ SK+K KL+K+V+ERKY ++S DYG+ ASD E
Sbjct: 157 KEEITRGWKNESPSGLRSSGSKHKNKLNKIVTERKYRSKSGS--------DYGQNASDGE 208
Query: 1633 IRKRLSKLNRKSLDSGSETSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAG 1692
IR+RLSKLN+K +DS S++ +DLD SS S D + R+ G
Sbjct: 209 IRRRLSKLNKKFMDSASDSCEDLDRSS----EGGSSGSEGYDQFVMERNPGF----NWLF 260
Query: 1693 DFTTDEGLDFSDDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDY 1752
F T L + +KYEVI+QY IVADE++V+RKM+VSLPE +
Sbjct: 261 PFYTQNSLCVCSEE--------------FVQKYEVIEQYAIVADEDEVQRKMKVSLPEGH 306
Query: 1753 AEKLNAQKNGSEELDMELPEVKDYKP 1778
EKL+AQKNG+EE DME+P + P
Sbjct: 307 NEKLSAQKNGTEESDMEIPNLISGTP 332
>gi|147856971|emb|CAN81810.1| hypothetical protein VITISV_020891 [Vitis vinifera]
Length = 682
Score = 132 bits (333), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/128 (55%), Positives = 85/128 (66%), Gaps = 6/128 (4%)
Query: 683 RWETVENAVSPLVTVNFPSITSDSVTQLVSPPEASGNLLADTGDTAQSTG---EEFPVT- 738
RW T++NA S LV VNFP SD+VTQLVSPPEA GN LA+ GDT +S EE P T
Sbjct: 365 RWLTIKNAASLLVPVNFPPFVSDTVTQLVSPPEAPGNPLAEAGDTTESNKLLEEETPATS 424
Query: 739 LQSQCCPDGSAAAAESSEDLHIDVRVGALLDGFTVIPGKEIETLGEILQTTFERVDWQNN 798
LQS C + ++ A+E E L ID RV ALL F IPGKE+ETLGE+LQ +FE W+
Sbjct: 425 LQSMSCNNDNSIASEPLEGLQIDERVRALLKSFAFIPGKELETLGEVLQASFEHAQWEKL 484
Query: 799 G--GPTWH 804
G G +WH
Sbjct: 485 GAEGLSWH 492
>gi|224095774|ref|XP_002310474.1| hypothetical protein POPTRDRAFT_562330 [Populus trichocarpa]
gi|222853377|gb|EEE90924.1| hypothetical protein POPTRDRAFT_562330 [Populus trichocarpa]
Length = 80
Score = 117 bits (293), Expect = 9e-23, Method: Composition-based stats.
Identities = 48/64 (75%), Positives = 57/64 (89%)
Query: 2323 GENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRF 2382
G YC+GQLIFWH+QTN +PD TLA+AS+GCLSLP+IGSFYAKVQKPS+ R+YGPKTV+
Sbjct: 3 GAIYCMGQLIFWHVQTNTEPDFTLAKASKGCLSLPEIGSFYAKVQKPSQQRIYGPKTVKM 62
Query: 2383 MLSR 2386
ML R
Sbjct: 63 MLER 66
>gi|145485412|ref|XP_001428714.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124395802|emb|CAK61316.1| unnamed protein product [Paramecium tetraurelia]
Length = 844
Score = 94.4 bits (233), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 117/554 (21%), Positives = 222/554 (40%), Gaps = 99/554 (17%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGI-RSLQKNN-----EDPAPEFY 1944
KG G+VC G ++F+ + GEVY +WFEKQ + +Q N + P EF+
Sbjct: 291 KGKGMVCCLNEGLAGNEFICFYFGEVYTPQRWFEKQTIFHKRMQDGNRKTCSQSPYAEFF 350
Query: 1945 NIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGI 2004
I+ + + + +D N A I +SC PNC V+ + + T R I
Sbjct: 351 -IHDDLLVMFKNRFKF--IDPTRYGNMAQHISYSCDPNCRLIAVTVNQQNLLAVITSRKI 407
Query: 2005 HYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQ 2064
+Y EE+T + ++ + CLCGS C+ NL E A + + ++R+
Sbjct: 408 NYFEELTLPFPYTSQDQ-------CLCGSIHCKRKQ-NLELENAHQ--ISIYSNYIERNV 457
Query: 2065 LMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSA--RLVRFINL----ERTKL 2118
++L++ + S + ++ +P W+ + +IN+ ++ K
Sbjct: 458 ILLQSTLITSQNTQN---------------DIPEWLSNWQELNHQQNYINILSCVDKVKF 502
Query: 2119 PEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGD 2178
+L+H + F ++NQ +N K++
Sbjct: 503 ---VLQHLKTIQPPIFL--------------VTNIFNQFWKNCETNTQKIQ--------- 536
Query: 2179 PKKAPPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSE 2238
+E E V FL + +L QC + +++N +K I +
Sbjct: 537 -------MESSIMNEIVVFLKRH-----SQLHQC---QIGLEIINQMKKII-----DQNT 576
Query: 2239 DIQRELRKSLLWLRDE-VRNL-PCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPV 2296
D +L + L L E + N+ C++ + A + +++ ++T +F +Y+ F P
Sbjct: 577 DYALQLTRMLFLLLSEIILNIESCSF--NNKAFSTILYFMSFTHTYFSSTQYQGFDGKPF 634
Query: 2297 YISPLDLGPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSL 2356
+ + P+ +K L K Y + GQLI W+ QT +P ++A+ RG L
Sbjct: 635 EETEFEYIPQPKNKSKLSL---SKQYTPQFIWGQLINWNKQTLQNPQSSMAQERRGVLCY 691
Query: 2357 PDIGSFYAKVQKPSRHRVYG-PKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPML 2415
P + + K ++ K + + S+ E QP W++K+ I+G+
Sbjct: 692 PSLLLSFDNKHKTFPYQCKTREKYLEYFQSKKEIQPDLS-----TWSYKNQHNIYGTIFF 746
Query: 2416 DSSLTGCPLDREMV 2429
+ + + + V
Sbjct: 747 EQYFSLSKVGEDFV 760
>gi|255086485|ref|XP_002509209.1| set domain protein [Micromonas sp. RCC299]
gi|226524487|gb|ACO70467.1| set domain protein [Micromonas sp. RCC299]
Length = 283
Score = 93.6 bits (231), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 61/238 (25%), Positives = 112/238 (47%), Gaps = 21/238 (8%)
Query: 2204 SLVEELIQCMAPH-------------VEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLW 2250
S+V+ L+Q M PH V N L +I D ++ L+ L+
Sbjct: 7 SMVQGLLQAMKPHARSAKDDDDHSAEVRHAEFNALVEQIARESTEPENDPEKSLKSGLIR 66
Query: 2251 LRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQ---EYKAFTSPPVYISPLDLGPKY 2307
LRD + ++P T RHD AA+L+H++A+T+ ++ V+ + AFT+ + + ++
Sbjct: 67 LRDALASMPSTPSARHDVAAELVHLHAHTRRYWSVRRGDHHGAFTAEEIPVRENEVNSFG 126
Query: 2308 ADKLGADLQVYRKT---YGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYA 2364
GA Q+ ++ Y G L+ W+ Q +DP + RGC+ +PD+ Y+
Sbjct: 127 IGAEGASEQIVKQVRPEYKAGTAGGALLVWYKQEMSDPLQWVNANRRGCVVIPDVSCAYS 186
Query: 2365 KVQKPSRHRVYGPKTVRFMLSRMEKQPQRPWPKDR-IWAFKSSPRIFGSPMLDSSLTG 2421
+ + G + L+ + + P+ PWP+ W ++ R+ GSP+LD+ +
Sbjct: 187 PRPGVAVAKC-GAREREAWLAHLAEHPEDPWPQHTGPWGPANAQRLIGSPVLDAFMAA 243
>gi|361069841|gb|AEW09232.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
gi|383148794|gb|AFG56252.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
gi|383148795|gb|AFG56253.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
gi|383148796|gb|AFG56254.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
gi|383148797|gb|AFG56255.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
gi|383148798|gb|AFG56256.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
gi|383148799|gb|AFG56257.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
gi|383148800|gb|AFG56258.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
gi|383148801|gb|AFG56259.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
gi|383148802|gb|AFG56260.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
gi|383148803|gb|AFG56261.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
gi|383148804|gb|AFG56262.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
gi|383148805|gb|AFG56263.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
gi|383148806|gb|AFG56264.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
gi|383148807|gb|AFG56265.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
Length = 82
Score = 91.3 bits (225), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 43/59 (72%), Positives = 51/59 (86%)
Query: 1706 REWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSE 1764
REWGARMT ASLVPPVTRKYEVI++YV+VADE++V RKMRV LP+DY +KL A K+ E
Sbjct: 15 REWGARMTSASLVPPVTRKYEVIEEYVVVADEDEVSRKMRVCLPKDYEKKLAAAKDRRE 73
>gi|170573421|ref|XP_001892464.1| SET domain containing protein [Brugia malayi]
gi|158601976|gb|EDP38706.1| SET domain containing protein [Brugia malayi]
Length = 1603
Score = 91.3 bits (225), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 53/151 (35%), Positives = 82/151 (54%), Gaps = 19/151 (12%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ + + F+ E++GEV + + + Q+N+ Y + L
Sbjct: 919 GCGIGVKTDVNIDKGQFICEYIGEVVSMETFNIRSRTDYRYQRNH-------YALNL--- 968
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
G+ VVDA HK N A I HSC PNCE + +V+GHY+IG++ +RGIH GEE+T
Sbjct: 969 ---CPGF---VVDAYHKGNIARFINHSCAPNCEMQRWSVNGHYRIGLFALRGIHEGEELT 1022
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DYN ++ E + ++C CG+ CR +LN
Sbjct: 1023 YDYN--WDAFEFDDVTICCCGAXNCR-HFLN 1050
>gi|402585708|gb|EJW79647.1| hypothetical protein WUBG_09444, partial [Wuchereria bancrofti]
Length = 511
Score = 89.4 bits (220), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 53/151 (35%), Positives = 82/151 (54%), Gaps = 19/151 (12%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ + + F+ E++GEV + + + Q+N+ Y + L
Sbjct: 191 GCGIGVKTDVNIDKGQFICEYIGEVVSMETFNIRSRTDYRYQRNH-------YALNL--- 240
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
G+ VVDA HK N A I HSC PNCE + +V+GHY+IG++ +RGIH GEE+T
Sbjct: 241 ---CPGF---VVDAYHKGNIARFINHSCAPNCEMQRWSVNGHYRIGLFALRGIHEGEELT 294
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DYN ++ E + ++C CG+ CR +LN
Sbjct: 295 YDYN--WDAFEFDDVTICCCGAPNCR-HFLN 322
>gi|312116897|ref|XP_003151349.1| hypothetical protein LOAG_15812 [Loa loa]
gi|307753486|gb|EFO12720.1| hypothetical protein LOAG_15812 [Loa loa]
Length = 213
Score = 88.6 bits (218), Expect = 5e-14, Method: Composition-based stats.
Identities = 41/81 (50%), Positives = 57/81 (70%), Gaps = 3/81 (3%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
VVDA HK N A I HSC PNCE + +V+GHY+IG++ +RGIH GEE+T+DYN ++
Sbjct: 70 VVDAYHKGNIARFINHSCAPNCEMQRWSVNGHYRIGLFALRGIHEGEELTYDYN--WDAF 127
Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
E + ++C CG+ CR +LN
Sbjct: 128 EFDDVTICCCGAPNCR-HFLN 147
>gi|145521184|ref|XP_001446447.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124413925|emb|CAK79050.1| unnamed protein product [Paramecium tetraurelia]
Length = 828
Score = 88.6 bits (218), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 90/181 (49%), Gaps = 19/181 (10%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGI-RSLQKNNEDPAPEFYNIYLE 1949
KG GVVC GF ++F+ + GEVY +WFEKQ + +Q N F + Y E
Sbjct: 283 KGKGVVCCNFDGFVTNEFINFYFGEVYTPQRWFEKQTVFNKRMQDGNRKSG--FQSPYAE 340
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
D +L+ +D N A I +SC PNC+ ++ YQ+ I+T++ I+Y EE
Sbjct: 341 FHIND----ELLFIDPTRYGNIALHISYSCDPNCKFVTVQINSSYQLAIFTLKKINYLEE 396
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELH-GLLDRHQLMLE 2068
+T + S + +CLCGS C+ L+ AF L + + + R+ L+L+
Sbjct: 397 LTLPFPSTSN-------DLCLCGSIYCK----RLSQLEAFNNRLTQNYPNYIQRNALLLQ 445
Query: 2069 A 2069
+
Sbjct: 446 S 446
Score = 49.7 bits (117), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 35/162 (21%), Positives = 73/162 (45%), Gaps = 21/162 (12%)
Query: 2266 HDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKLGADLQVYRKTYGEN 2325
++ + +++ ++T +F +Y+ F P + + P+ +K L K Y
Sbjct: 588 NEGLSTILYFMSFTHTYFSSTQYEGFNGKPFEENEFEYIPQPKNKQKLAL---SKMYTPQ 644
Query: 2326 YCLGQLIFWHIQTNADPDCTLARASRGCLSLPD-IGSFYAKVQKPSRHRVYG------PK 2378
Y GQLI W+ QT +P ++A+ RG L P I SF ++H+++ K
Sbjct: 645 YIWGQLINWNKQTLQNPQSSMAQERRGVLCYPSLILSF------DNKHKLFPYQCKTREK 698
Query: 2379 TVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSLT 2420
+ + ++ + QP IW++K+ ++G+ + +
Sbjct: 699 FLEYFYTKSDIQPDL-----SIWSYKNQYNVYGTIFFEQCFS 735
>gi|296085302|emb|CBI29034.3| unnamed protein product [Vitis vinifera]
Length = 195
Score = 85.9 bits (211), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 40/77 (51%), Positives = 51/77 (66%), Gaps = 5/77 (6%)
Query: 1043 LGEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVR 1102
+WYYLDGAGHE+ PSSFSELQ LVDQ IQKH+SV K +K+W+P+TFA + + V
Sbjct: 111 FSDWYYLDGAGHEQWPSSFSELQSLVDQDSIQKHSSVLGKINKIWIPITFAADVPDAAV- 169
Query: 1103 NHGEKIMPSGDSSGLPP 1119
KI P + + P
Sbjct: 170 ----KIQPQNKVTFIEP 182
Score = 75.9 bits (185), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 45/115 (39%), Positives = 65/115 (56%), Gaps = 8/115 (6%)
Query: 744 CPDGSAAAAESSEDLHIDVRVGALLDGFTVIPGKEIETLGEILQTTFERVDWQNNG--GP 801
C + ++ A+E E L ID RV ALL FT IPG+E+ETLGE+LQ +FE W+ G G
Sbjct: 3 CNNDNSIASEPLEGLQIDERVRALLKSFTFIPGRELETLGEVLQASFEHAQWEKLGAEGL 62
Query: 802 TWHGACVGEQKPGDQKVDELY-ISDTKMKEAAELK---SGDKDHWVVCFDSDEWF 852
+WH +G Q DQ++D + + KEA + + DKD+ D +W+
Sbjct: 63 SWHQLRIGGQP--DQRIDRFFRYPEITSKEALDSRLSTFSDKDYAFAFGDFSDWY 115
>gi|328865276|gb|EGG13662.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 1418
Score = 85.9 bits (211), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 56/155 (36%), Positives = 74/155 (47%), Gaps = 26/155 (16%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
+ A +KG G+ G F++E+ GEV K E+ S + FY
Sbjct: 1063 FSAEKKGWGL--KAVDNIGAKTFIIEYCGEVISKQKCLERMTESESEK--------YFYF 1112
Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
+ L+R L +DA K N A I HSC PNCE + VDG +IGI+ +R I
Sbjct: 1113 LTLDR---------LECLDASRKGNLARFINHSCDPNCETQKWNVDGEVRIGIFAIRDIK 1163
Query: 2006 YGEEITFDYNSVTESKEEYEAS--VCLCGSQVCRG 2038
GEE+TFDYN E + S VC CG+ CRG
Sbjct: 1164 RGEELTFDYNY-----ERFGTSKQVCYCGAANCRG 1193
>gi|308487582|ref|XP_003105986.1| CRE-SET-2 protein [Caenorhabditis remanei]
gi|308254560|gb|EFO98512.1| CRE-SET-2 protein [Caenorhabditis remanei]
Length = 1505
Score = 85.5 bits (210), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 54/146 (36%), Positives = 79/146 (54%), Gaps = 17/146 (11%)
Query: 1903 FGEDDFVVEFLGEVYPVWKWFEK---QDGIRSLQKNNEDPAPEFYNI---YLERPKGDAD 1956
+D+ +VE++G+ V++ F IRSL + + A E I YL R ++
Sbjct: 1371 IAQDEMIVEYIGQTVIVFQNFSSILFHLQIRSLVADEREKAYERRGIGSSYLFRIDENS- 1429
Query: 1957 GYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNS 2016
V+DA + N+A I HSC+PNC AKV ++G +I IY+ I+ GEEIT+DY
Sbjct: 1430 -----VIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIYSRSVINKGEEITYDYKF 1484
Query: 2017 VTESKEEYEASVCLCGSQVCRGSYLN 2042
E + CLCG++ CRG YLN
Sbjct: 1485 PIED----DKIDCLCGAKACRG-YLN 1505
>gi|422293956|gb|EKU21256.1| set domain protein, partial [Nannochloropsis gaditana CCMP526]
Length = 92
Score = 82.4 bits (202), Expect = 3e-12, Method: Composition-based stats.
Identities = 45/103 (43%), Positives = 57/103 (55%), Gaps = 14/103 (13%)
Query: 1936 NEDPA-PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHY 1994
N PA P+FYNI LERP+GDA+GY + + H +V G
Sbjct: 3 NLKPALPDFYNILLERPRGDANGYG--------PPGAGADLVHG-----HQQVVGQAGKL 49
Query: 1995 QIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
I + + R I YGEE+T DY S T S+EEY A+VCLCGS +CR
Sbjct: 50 TIAVCSDREIAYGEELTMDYCSFTHSEEEYLAAVCLCGSHICR 92
>gi|19075312|ref|NP_587812.1| histone lysine methyltransferase Set1 [Schizosaccharomyces pombe
972h-]
gi|74698592|sp|Q9Y7R4.1|SET1_SCHPO RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component set1; AltName:
Full=Lysine N-methyltransferase 2; AltName: Full=SET
domain-containing protein 1; AltName: Full=Set1 complex
component set1; Short=Set1C component set1; AltName:
Full=Spset1
gi|4704279|emb|CAB41652.1| histone lysine methyltransferase Set1 [Schizosaccharomyces pombe]
Length = 920
Score = 82.0 bits (201), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 51/141 (36%), Positives = 74/141 (52%), Gaps = 26/141 (18%)
Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL---V 1961
++D V+E++GE+ IR +N + Y+ GD+ + + V
Sbjct: 803 KNDMVIEYIGEI------------IRQRVADNREKN------YVREGIGDSYLFRIDEDV 844
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
+VDA K N A I HSC PNC A++ V+G +I IY R I +GEE+T+DY +
Sbjct: 845 IVDATKKGNIARFINHSCAPNCIARIIRVEGKRKIVIYADRDIMHGEELTYDY----KFP 900
Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
EE + CLCG+ CRG YLN
Sbjct: 901 EEADKIPCLCGAPTCRG-YLN 920
>gi|147863201|emb|CAN80485.1| hypothetical protein VITISV_032461 [Vitis vinifera]
Length = 508
Score = 81.6 bits (200), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 39/68 (57%), Positives = 50/68 (73%), Gaps = 2/68 (2%)
Query: 1237 DASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAF--ASLTCRHWRAAVRFY 1294
+A+F E+ + + S WGLLDG LA VFHFL++D+KSL F A+LTC H RAAVRF+
Sbjct: 178 NATFYQEDIVLAEMGSENWGLLDGDVLARVFHFLKTDVKSLVFFLAALTCEHRRAAVRFF 237
Query: 1295 KGISRQVD 1302
KG+ RQVD
Sbjct: 238 KGVPRQVD 245
>gi|341896007|gb|EGT51942.1| hypothetical protein CAEBREN_26218 [Caenorhabditis brenneri]
Length = 1670
Score = 81.3 bits (199), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 53/144 (36%), Positives = 77/144 (53%), Gaps = 28/144 (19%)
Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYD 1959
+D+ ++E++G+ IRSL + + A E I YL R D +
Sbjct: 1551 IAQDEMIIEYIGQ------------KIRSLVADEREKAYERRGIGSSYLFR----IDEH- 1593
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYN-SVT 2018
V+DA + N+A I HSC+PNC AKV ++G +I IY+ I+ GEEIT+DY +
Sbjct: 1594 -TVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIYSRSTINKGEEITYDYKFPIE 1652
Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
E K + CLCG++ CRG YLN
Sbjct: 1653 EDKID-----CLCGAKTCRG-YLN 1670
>gi|213402529|ref|XP_002172037.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
yFS275]
gi|212000084|gb|EEB05744.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
yFS275]
Length = 977
Score = 81.3 bits (199), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 46/96 (47%), Positives = 55/96 (57%), Gaps = 11/96 (11%)
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
YL R DA +VDA K N A I HSC PNC AK+ V+GH +I IY R I
Sbjct: 893 YLFRIDKDA------IVDATKKGNIARFINHSCAPNCIAKIIRVEGHQKIVIYADRDIEE 946
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY + EE + CLCG+ CRG YLN
Sbjct: 947 GEELTYDY----KFPEEVDKIPCLCGAPTCRG-YLN 977
>gi|390342260|ref|XP_003725626.1| PREDICTED: uncharacterized protein LOC578079 isoform 1
[Strongylocentrotus purpuratus]
Length = 3023
Score = 80.9 bits (198), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 79/153 (51%), Gaps = 20/153 (13%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
+ KG G+ +E +++FV+E++GEV ++ + ++ ++D FY
Sbjct: 1668 FYTEEKGHGLKAKEE--LKDNEFVMEYVGEVLNFHEFKHR------AKQYSKDKNLHFYF 1719
Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
+ L K D ++DA K N + + HSC PNCE + V+G ++G +T R +
Sbjct: 1720 MAL---KSDE------IIDATEKGNVSRFMNHSCDPNCETQKWTVNGQLRVGFFTKRQVK 1770
Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
GEE+TFDY + EA CLCGS+ CRG
Sbjct: 1771 PGEELTFDYQFEVYGQ---EAQKCLCGSEKCRG 1800
>gi|390342258|ref|XP_783359.3| PREDICTED: uncharacterized protein LOC578079 isoform 2
[Strongylocentrotus purpuratus]
Length = 3024
Score = 80.9 bits (198), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 79/153 (51%), Gaps = 20/153 (13%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
+ KG G+ +E +++FV+E++GEV ++ + ++ ++D FY
Sbjct: 1668 FYTEEKGHGLKAKEE--LKDNEFVMEYVGEVLNFHEFKHR------AKQYSKDKNLHFYF 1719
Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
+ L K D ++DA K N + + HSC PNCE + V+G ++G +T R +
Sbjct: 1720 MAL---KSDE------IIDATEKGNVSRFMNHSCDPNCETQKWTVNGQLRVGFFTKRQVK 1770
Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
GEE+TFDY + EA CLCGS+ CRG
Sbjct: 1771 PGEELTFDYQFEVYGQ---EAQKCLCGSEKCRG 1800
>gi|198418893|ref|XP_002124393.1| PREDICTED: similar to SET domain containing 2 [Ciona intestinalis]
Length = 2228
Score = 80.5 bits (197), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 56/162 (34%), Positives = 80/162 (49%), Gaps = 25/162 (15%)
Query: 1882 PDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSL--QKNNEDP 1939
P + + KG G+ + G V+E+ GEV + ++ G RSL + N+
Sbjct: 1064 PTEVFQTKWKGWGIRATENLSPGM--LVMEYCGEVLDLQEF-----GRRSLLYSRGNQQ- 1115
Query: 1940 APEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIY 1999
FY + L + + ++DA K N + I HSC PNCE + V+G ++G +
Sbjct: 1116 --HFYFMALSQDE---------IIDATTKGNTSRFINHSCDPNCETQKWTVNGRLRVGFF 1164
Query: 2000 TVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
T+R I+ GEEITFDY K EA C CGS CRG YL
Sbjct: 1165 TMRDINKGEEITFDYQFQRYGK---EAQACYCGSSNCRG-YL 1202
>gi|326488341|dbj|BAJ93839.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1070
Score = 79.7 bits (195), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 56/164 (34%), Positives = 82/164 (50%), Gaps = 22/164 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ +E E F++E++GEV + + +Q S K + FY + L
Sbjct: 353 KKGYGLQLLEE--VSEGRFLIEYVGEVLDITTYESRQRDYASKGKKH------FYFMAL- 403
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
DG + V+DA K N I HSC PNC + V+G IGI+ +R I GEE
Sbjct: 404 ------DGGE--VIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFAMRNIKKGEE 455
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL--NLTGEGAFEK 2051
+TFDYN V S + C CG+ CRG Y+ +++G G +
Sbjct: 456 LTFDYNYVRVSGAAPQK--CFCGTAKCRG-YIGGDISGSGIITQ 496
>gi|340372263|ref|XP_003384664.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Amphimedon queenslandica]
Length = 1171
Score = 79.7 bits (195), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 56/177 (31%), Positives = 86/177 (48%), Gaps = 24/177 (13%)
Query: 1879 DSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNED 1938
+S P + +G G+ + G DFV+E++GE+ + E+ L+K E
Sbjct: 774 ESVPTQTFYTGNRGWGLKTMRSLSPG--DFVIEYVGEIVDMAAVQER------LKKTQEA 825
Query: 1939 PAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
FY + LER +++DA K+N+A I HSC PNCE + V+G +IGI
Sbjct: 826 SVSSFYFLTLERN---------LIIDARVKSNHARFINHSCDPNCETQKWTVNGETRIGI 876
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKE 2055
+ ++ I E+TFDY E+ CLCG+Q C G GE ++ LK+
Sbjct: 877 FAIKDIKEDTELTFDYQFDCLGNEK---KACLCGAQNCSG----FLGEKPKQEKLKQ 926
>gi|17552320|ref|NP_498039.1| Protein SET-2, isoform c [Caenorhabditis elegans]
gi|351058302|emb|CCD65736.1| Protein SET-2, isoform c [Caenorhabditis elegans]
Length = 1510
Score = 79.3 bits (194), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 54/145 (37%), Positives = 72/145 (49%), Gaps = 28/145 (19%)
Query: 1902 GFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGY 1958
D+ +VE++G+ IRSL + A E I YL R
Sbjct: 1390 SIAPDEMIVEYIGQT------------IRSLVAEEREKAYERRGIGSSYLFR-------I 1430
Query: 1959 DLV-VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSV 2017
DL V+DA + N+A I HSC+PNC AKV ++G +I IY+ I GEEIT+DY
Sbjct: 1431 DLHHVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIYSRTIIKKGEEITYDYKFP 1490
Query: 2018 TESKEEYEASVCLCGSQVCRGSYLN 2042
E + CLCG++ CRG YLN
Sbjct: 1491 IED----DKIDCLCGAKTCRG-YLN 1510
>gi|17552318|ref|NP_498040.1| Protein SET-2, isoform a [Caenorhabditis elegans]
gi|30173238|sp|Q18221.2|SET2_CAEEL RecName: Full=Probable histone-lysine N-methyltransferase set-2;
AltName: Full=SET domain-containing protein 2
gi|351058300|emb|CCD65734.1| Protein SET-2, isoform a [Caenorhabditis elegans]
Length = 1507
Score = 79.3 bits (194), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 54/145 (37%), Positives = 72/145 (49%), Gaps = 28/145 (19%)
Query: 1902 GFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGY 1958
D+ +VE++G+ IRSL + A E I YL R
Sbjct: 1387 SIAPDEMIVEYIGQT------------IRSLVAEEREKAYERRGIGSSYLFR-------I 1427
Query: 1959 DLV-VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSV 2017
DL V+DA + N+A I HSC+PNC AKV ++G +I IY+ I GEEIT+DY
Sbjct: 1428 DLHHVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIYSRTIIKKGEEITYDYKFP 1487
Query: 2018 TESKEEYEASVCLCGSQVCRGSYLN 2042
E + CLCG++ CRG YLN
Sbjct: 1488 IED----DKIDCLCGAKTCRG-YLN 1507
>gi|17552316|ref|NP_498041.1| Protein SET-2, isoform b [Caenorhabditis elegans]
gi|351058301|emb|CCD65735.1| Protein SET-2, isoform b [Caenorhabditis elegans]
Length = 739
Score = 79.0 bits (193), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 54/145 (37%), Positives = 72/145 (49%), Gaps = 28/145 (19%)
Query: 1902 GFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGY 1958
D+ +VE++G+ IRSL + A E I YL R
Sbjct: 619 SIAPDEMIVEYIGQT------------IRSLVAEEREKAYERRGIGSSYLFR-------I 659
Query: 1959 DLV-VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSV 2017
DL V+DA + N+A I HSC+PNC AKV ++G +I IY+ I GEEIT+DY
Sbjct: 660 DLHHVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIYSRTIIKKGEEITYDYKFP 719
Query: 2018 TESKEEYEASVCLCGSQVCRGSYLN 2042
E + CLCG++ CRG YLN
Sbjct: 720 IED----DKIDCLCGAKTCRG-YLN 739
>gi|189237403|ref|XP_973596.2| PREDICTED: similar to AGAP011688-PA [Tribolium castaneum]
gi|270007628|gb|EFA04076.1| hypothetical protein TcasGA2_TC014310 [Tribolium castaneum]
Length = 1569
Score = 79.0 bits (193), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 77/157 (49%), Gaps = 20/157 (12%)
Query: 1882 PDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAP 1941
P + + +KGLG+ +GE F++E++GEV ++ + D + D
Sbjct: 574 PVEVFKTEKKGLGLRAAANIPYGE--FILEYVGEVLDPEEFDNRADDYSN------DKNK 625
Query: 1942 EFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTV 2001
+Y + L + DA ++DA K N + I HSC PN E + V+G +IG ++
Sbjct: 626 HYYFMSL---RADA------IIDATMKGNISRFINHSCDPNAETQKWTVNGELRIGFFST 676
Query: 2002 RGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
R I GEEITFDY K EA C C S +CRG
Sbjct: 677 RTILAGEEITFDYRFQRYGK---EAQKCYCESSLCRG 710
>gi|25395700|pir||H88444 protein C26E6.12 [imported] - Caenorhabditis elegans
Length = 1802
Score = 78.6 bits (192), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 54/144 (37%), Positives = 72/144 (50%), Gaps = 28/144 (19%)
Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYD 1959
D+ +VE++G+ IRSL + A E I YL R D
Sbjct: 1683 IAPDEMIVEYIGQT------------IRSLVAEEREKAYERRGIGSSYLFR-------ID 1723
Query: 1960 LV-VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
L V+DA + N+A I HSC+PNC AKV ++G +I IY+ I GEEIT+DY
Sbjct: 1724 LHHVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIYSRTIIKKGEEITYDYKFPI 1783
Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
E + CLCG++ CRG YLN
Sbjct: 1784 ED----DKIDCLCGAKTCRG-YLN 1802
>gi|291227185|ref|XP_002733567.1| PREDICTED: HSPC069-like [Saccoglossus kowalevskii]
Length = 2376
Score = 78.2 bits (191), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 52/151 (34%), Positives = 73/151 (48%), Gaps = 26/151 (17%)
Query: 1891 KGLGV-VCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
KG G+ C E FV+E++GEV + E + + K+N +Y + L
Sbjct: 1175 KGFGLRTC---AEIPEGKFVLEYVGEV---LNYSEFKSRTKHYNKDNRK---HYYFMALT 1225
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+ ++DA K N + I HSC PNCE + V+GH ++G +T R I GEE
Sbjct: 1226 SDE---------IIDATKKGNVSRFINHSCDPNCETQKWTVNGHIRVGFFTKRAIPAGEE 1276
Query: 2010 ITFDYNSVTESKEEY--EASVCLCGSQVCRG 2038
+TFDY E Y EA C CG+ CRG
Sbjct: 1277 LTFDYQF-----ERYGKEAQKCYCGASNCRG 1302
>gi|344241969|gb|EGV98072.1| putative histone-lysine N-methyltransferase ASH1L [Cricetulus
griseus]
Length = 1546
Score = 77.8 bits (190), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 50/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV +Q EF
Sbjct: 1239 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEV------VSEQ---------------EF 1275
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 1276 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 1335
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 1336 YALKDVLAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 1373
>gi|384499018|gb|EIE89509.1| hypothetical protein RO3G_14220 [Rhizopus delemar RA 99-880]
Length = 962
Score = 77.8 bits (190), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 74/156 (47%), Gaps = 30/156 (19%)
Query: 1893 LGVVCNKEGGFG--------EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFY 1944
+ V+ ++ GFG + F++E++GEV P Q+ IR + E A
Sbjct: 168 VDVIRTEKKGFGLRALTDLPTNSFIMEYIGEVIP------NQEFIR---RTKEYEASGLE 218
Query: 1945 NIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGI 2004
+ Y K D ++DA K A I HSC PNC + V + +IGI+T RGI
Sbjct: 219 HYYFMTLKTDE------IIDATKKGCLARFINHSCNPNCVTQKWVVGKNMRIGIFTNRGI 272
Query: 2005 HYGEEITFDYNSVTESKEEY--EASVCLCGSQVCRG 2038
GEE+TFDY E Y +A VC CG C+G
Sbjct: 273 KAGEELTFDYKF-----ERYGAQAQVCYCGEFACKG 303
>gi|328856222|gb|EGG05344.1| hypothetical protein MELLADRAFT_78094 [Melampsora larici-populina
98AG31]
Length = 1098
Score = 77.8 bits (190), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 81/180 (45%), Gaps = 27/180 (15%)
Query: 1864 DVRTMKMCRGI-LKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKW 1922
+ R ++MC+ + P + RKG GV + D FV E++GEV
Sbjct: 265 ECRCLQMCQNQRFQKRQYAPIEIVATERKGFGV--RLKSDVPADSFVYEYIGEVV----- 317
Query: 1923 FEKQDGIRSLQKNNEDPAPE----FYNIYLERPKGDADGYDLVVVDAMHKANYASRICHS 1978
G ++ Q+ ++ A E FY + L+R + +DA K + HS
Sbjct: 318 -----GEKAFQRRIKEYAQEGLKHFYFMQLQREE---------YIDATKKGGLGRFLNHS 363
Query: 1979 CRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
C PNC V H ++GI+T R + GEE+TF+YN V + YEA C CG C G
Sbjct: 364 CNPNCYIGKWVVGRHLRMGIFTKRAVKGGEELTFNYN-VDRYGQVYEAQECFCGEAQCVG 422
>gi|324507672|gb|ADY43247.1| Histone-lysine N-methyltransferase set-2 [Ascaris suum]
Length = 539
Score = 77.4 bits (189), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 51/78 (65%), Gaps = 3/78 (3%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
V+DA + N+A I HSC+PNC AKV VDG +I IY+ I+ G+EIT+DY E +
Sbjct: 463 VIDATNMGNFARFINHSCQPNCYAKVVVVDGEKRIVIYSKTPINKGDEITYDYKFPIEEE 522
Query: 2022 EEYEASVCLCGSQVCRGS 2039
++ + CLCG+ CRG+
Sbjct: 523 DKID---CLCGAPSCRGT 537
>gi|440470515|gb|ELQ39582.1| histone-lysine N-methyltransferase [Magnaporthe oryzae Y34]
gi|440488496|gb|ELQ68221.1| histone-lysine N-methyltransferase [Magnaporthe oryzae P131]
Length = 1278
Score = 77.4 bits (189), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/139 (37%), Positives = 70/139 (50%), Gaps = 19/139 (13%)
Query: 1905 EDDFVVEFLGE-VYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVV 1963
+DD ++E++GE V P + RS + YL R DA V+
Sbjct: 1158 KDDMIIEYVGEEVRPSVAQVREARYDRS----------GIGSSYLFRIDEDA------VI 1201
Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E KEE
Sbjct: 1202 DATKKGGIARFINHSCMPNCTAKIIRVEGTKRIVIYALRDIARNEELTYDYKFELEEKEE 1261
Query: 2024 YEASVCLCGSQVCRGSYLN 2042
+ CLCG+ C+G +LN
Sbjct: 1262 -DRVPCLCGTTNCKG-FLN 1278
>gi|389634753|ref|XP_003715029.1| histone-lysine N-methyltransferase [Magnaporthe oryzae 70-15]
gi|351647362|gb|EHA55222.1| histone-lysine N-methyltransferase [Magnaporthe oryzae 70-15]
Length = 1278
Score = 77.4 bits (189), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/139 (37%), Positives = 70/139 (50%), Gaps = 19/139 (13%)
Query: 1905 EDDFVVEFLGE-VYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVV 1963
+DD ++E++GE V P + RS + YL R DA V+
Sbjct: 1158 KDDMIIEYVGEEVRPSVAQVREARYDRS----------GIGSSYLFRIDEDA------VI 1201
Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E KEE
Sbjct: 1202 DATKKGGIARFINHSCMPNCTAKIIRVEGTKRIVIYALRDIARNEELTYDYKFELEEKEE 1261
Query: 2024 YEASVCLCGSQVCRGSYLN 2042
+ CLCG+ C+G +LN
Sbjct: 1262 -DRVPCLCGTTNCKG-FLN 1278
>gi|357149500|ref|XP_003575133.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like
[Brachypodium distachyon]
Length = 1022
Score = 77.4 bits (189), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 53/168 (31%), Positives = 82/168 (48%), Gaps = 22/168 (13%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
+ + +KG G+ +E E F++E++GEV + + +Q S + + FY
Sbjct: 251 FCSGKKGFGLQLKEE--VTEGRFLIEYVGEVLDITAYECRQRYYASKGQKH------FYF 302
Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
+ L + V+DA K N I HSC PNC + V+G IGI+ +R I
Sbjct: 303 MALNGGE---------VIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFAMRNIK 353
Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYL--NLTGEGAFEK 2051
GEE+TFDYN V S + C CG+ CRG Y+ +++G G +
Sbjct: 354 KGEELTFDYNYVRVSGAAPQK--CFCGTAKCRG-YIGGDISGSGIIAQ 398
>gi|443894422|dbj|GAC71770.1| histone H3 (Lys4) methyltransferase complex, subunit SET1 [Pseudozyma
antarctica T-34]
Length = 1366
Score = 77.4 bits (189), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 49/133 (36%), Positives = 67/133 (50%), Gaps = 20/133 (15%)
Query: 1907 DFVVEFLGEVY--PVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
D V+E++GEV V EKQ Q N ++ YL R D +VVD
Sbjct: 1249 DMVIEYVGEVVRQQVADEREKQ---YERQGN--------FSTYLFRVDDD------LVVD 1291
Query: 1965 AMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY 2024
A HK N A + H C PNC AK+ ++G +I ++ I GEE+T+DY S ++
Sbjct: 1292 ATHKGNIARLMNHCCTPNCNAKILTLNGEKRIVLFAKSPIRPGEELTYDYK-FQSSADDE 1350
Query: 2025 EASVCLCGSQVCR 2037
+A CLCGS CR
Sbjct: 1351 DAIPCLCGSPGCR 1363
>gi|354478852|ref|XP_003501628.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase ASH1L-like [Cricetulus griseus]
Length = 2962
Score = 77.4 bits (189), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2139 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2175
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2176 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2235
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2236 YALKDVLAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2273
>gi|326933478|ref|XP_003212830.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L-like
[Meleagris gallopavo]
Length = 2974
Score = 77.0 bits (188), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2154 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2190
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2191 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCNPNCEMQKWSVNGVYRIGL 2250
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2251 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2288
>gi|363742848|ref|XP_422858.3| PREDICTED: probable histone-lysine N-methyltransferase ASH1L [Gallus
gallus]
Length = 2954
Score = 77.0 bits (188), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2134 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2170
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2171 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCNPNCEMQKWSVNGVYRIGL 2230
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2231 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2268
>gi|157818737|ref|NP_001101159.1| probable histone-lysine N-methyltransferase ASH1L [Rattus norvegicus]
gi|149048100|gb|EDM00676.1| ash1 (absent, small, or homeotic)-like (Drosophila) (predicted)
[Rattus norvegicus]
Length = 2918
Score = 77.0 bits (188), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2098 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2134
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2135 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2194
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2195 YALKDVPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2232
>gi|71015569|ref|XP_758824.1| hypothetical protein UM02677.1 [Ustilago maydis 521]
gi|74702458|sp|Q4PB36.1|SET1_USTMA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|46098614|gb|EAK83847.1| hypothetical protein UM02677.1 [Ustilago maydis 521]
Length = 1468
Score = 77.0 bits (188), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 46/137 (33%), Positives = 67/137 (48%), Gaps = 28/137 (20%)
Query: 1907 DFVVEFLGEVYPVW------KWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL 1960
D V+E++GEV K +E+Q ++ YL R D
Sbjct: 1351 DMVIEYVGEVVRQQVADEREKQYERQGN---------------FSTYLFRVDDD------ 1389
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
+VVDA HK N A + H C PNC AK+ ++G +I ++ I GEE+T+DY + +
Sbjct: 1390 LVVDATHKGNIARLMNHCCTPNCNAKILTLNGEKRIVLFAKTAIRAGEELTYDYKFQSSA 1449
Query: 2021 KEEYEASVCLCGSQVCR 2037
+E +A CLCGS CR
Sbjct: 1450 DDE-DAIPCLCGSPGCR 1465
>gi|343429488|emb|CBQ73061.1| related to regulatory protein SET1 [Sporisorium reilianum SRZ2]
Length = 1453
Score = 77.0 bits (188), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 68/131 (51%), Gaps = 16/131 (12%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
D V+E++GEV V + + + ++ N ++ YL R D +VVDA
Sbjct: 1336 DMVIEYVGEV--VRQQVADEREKQYERQGN-------FSTYLFRVDDD------LVVDAT 1380
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
HK N A + H C PNC AK+ ++G +I ++ I GEE+T+DY S ++ +A
Sbjct: 1381 HKGNIARLMNHCCTPNCNAKILTLNGEKRIVLFAKSPIRAGEELTYDYK-FQSSADDEDA 1439
Query: 2027 SVCLCGSQVCR 2037
CLCGS CR
Sbjct: 1440 IPCLCGSPGCR 1450
>gi|222623047|gb|EEE57179.1| hypothetical protein OsJ_07116 [Oryza sativa Japonica Group]
Length = 1963
Score = 77.0 bits (188), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 50/154 (32%), Positives = 74/154 (48%), Gaps = 19/154 (12%)
Query: 1885 KYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFY 1944
K+ +KG G+ ++ E F++E++GEV + + +Q S + + FY
Sbjct: 1297 KFHTGKKGYGLQLKED--VSEGRFLIEYVGEVLDITAYESRQRYYASKGQKH------FY 1348
Query: 1945 NIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGI 2004
+ L + V+DA K N I HSC PNC + V+G IGI+ +R I
Sbjct: 1349 FMALNGGE---------VIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFAMRNI 1399
Query: 2005 HYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
GEE+TFDYN V S + C CG+ CRG
Sbjct: 1400 KKGEELTFDYNYVRVSGAAPQK--CFCGTAKCRG 1431
>gi|218190961|gb|EEC73388.1| hypothetical protein OsI_07633 [Oryza sativa Indica Group]
Length = 1906
Score = 76.6 bits (187), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 50/154 (32%), Positives = 74/154 (48%), Gaps = 19/154 (12%)
Query: 1885 KYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFY 1944
K+ +KG G+ ++ E F++E++GEV + + +Q S + + FY
Sbjct: 1312 KFHTGKKGYGLQLKED--VSEGRFLIEYVGEVLDITAYESRQRYYASKGQKH------FY 1363
Query: 1945 NIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGI 2004
+ L + V+DA K N I HSC PNC + V+G IGI+ +R I
Sbjct: 1364 FMALNGGE---------VIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFAMRNI 1414
Query: 2005 HYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
GEE+TFDYN V S + C CG+ CRG
Sbjct: 1415 KKGEELTFDYNYVRVSGAAPQK--CFCGTAKCRG 1446
>gi|195566590|ref|XP_002106863.1| GD17127 [Drosophila simulans]
gi|194204255|gb|EDX17831.1| GD17127 [Drosophila simulans]
Length = 2246
Score = 76.6 bits (187), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/149 (34%), Positives = 77/149 (51%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ GE F++E++GEV + FE++ + S +N +Y + L
Sbjct: 1371 KKGCGITAELLIPPGE--FIMEYVGEVIDSEE-FERRQHLYSKDRNRH-----YYFMAL- 1421
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+G+A V+DA K N + I HSC PN E + V+G +IG ++V+ I GEE
Sbjct: 1422 --RGEA------VIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQPGEE 1473
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
ITFDY + + +A C C S CRG
Sbjct: 1474 ITFDYQYLRYGR---DAQRCYCESTNCRG 1499
>gi|332219957|ref|XP_003259124.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
ASH1L [Nomascus leucogenys]
Length = 2892
Score = 76.6 bits (187), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2143 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2179
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2180 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2239
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2240 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2277
>gi|426216789|ref|XP_004002640.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Ovis aries]
Length = 2965
Score = 76.6 bits (187), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2144 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2180
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2181 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2240
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2241 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2278
>gi|73622271|ref|NP_619620.3| histone-lysine N-methyltransferase ASH1L [Mus musculus]
gi|341940590|sp|Q99MY8.3|ASH1L_MOUSE RecName: Full=Histone-lysine N-methyltransferase ASH1L; AltName:
Full=ASH1-like protein; AltName: Full=Absent small and
homeotic disks protein 1 homolog
Length = 2958
Score = 76.6 bits (187), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2138 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2174
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2175 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2234
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2235 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2272
>gi|417407091|gb|JAA50172.1| Putative histone-lysine n-methyltransferase ash1l isoform 1 [Desmodus
rotundus]
Length = 2962
Score = 76.6 bits (187), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2141 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2177
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2178 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2237
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2238 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2275
>gi|338724967|ref|XP_001499134.2| PREDICTED: probable histone-lysine N-methyltransferase ASH1L isoform
1 [Equus caballus]
Length = 2963
Score = 76.6 bits (187), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2142 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2178
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2179 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2238
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2239 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2276
>gi|195478285|ref|XP_002100470.1| GE17076 [Drosophila yakuba]
gi|194187994|gb|EDX01578.1| GE17076 [Drosophila yakuba]
Length = 2397
Score = 76.6 bits (187), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 77/149 (51%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ + GE F++E++GEV + FE++ + S +N +Y + L
Sbjct: 1437 KKGCGITAELQIPPGE--FIMEYVGEVIDSEE-FERRQHLYSKDRNRH-----YYFMAL- 1487
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+G+A ++DA K N + I HSC PN E + V+G +IG ++V+ I GEE
Sbjct: 1488 --RGEA------IIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQPGEE 1539
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
ITFDY + +A C C + CRG
Sbjct: 1540 ITFDYQYQRYGR---DAQRCYCEAANCRG 1565
>gi|417515828|gb|JAA53722.1| histone-lysine N-methyltransferase ASH1L [Sus scrofa]
Length = 2951
Score = 76.6 bits (187), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2130 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2166
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2167 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2226
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2227 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2264
>gi|440903623|gb|ELR54260.1| Putative histone-lysine N-methyltransferase ASH1L [Bos grunniens
mutus]
Length = 2965
Score = 76.6 bits (187), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2144 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2180
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2181 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2240
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2241 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2278
>gi|326676505|ref|XP_692254.4| PREDICTED: probable histone-lysine N-methyltransferase ASH1L [Danio
rerio]
Length = 2933
Score = 76.6 bits (187), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 83/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2064 ERFRAEGKGWGIRTKQPLRAGQ--FIIEYLGEVV-------------SEQ--------EF 2100
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
+ +E+ + Y L +V+D+ N A + HSC PNCE + +V+G Y+IG+
Sbjct: 2101 RSRMMEQYFSHSGHYCLNLDSGMVIDSYRMGNEARFVNHSCEPNCEMQKWSVNGVYRIGL 2160
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ ++ I+ G E+T+DYN + + EE + VC CGS+ CRG
Sbjct: 2161 FALKDINSGTELTYDYNFHSFNTEEQQ--VCKCGSEGCRG 2198
>gi|395845197|ref|XP_003795328.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Otolemur
garnettii]
Length = 2961
Score = 76.6 bits (187), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2140 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2176
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2177 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2236
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2237 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2274
>gi|327286108|ref|XP_003227773.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L-like
[Anolis carolinensis]
Length = 2957
Score = 76.6 bits (187), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2136 ERFRAEEKGWGIRTKESLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2172
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2173 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCNPNCEMQKWSVNGVYRIGL 2232
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2233 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2270
>gi|299746032|ref|XP_002910994.1| Setd1a protein [Coprinopsis cinerea okayama7#130]
gi|298406870|gb|EFI27500.1| Setd1a protein [Coprinopsis cinerea okayama7#130]
Length = 1614
Score = 76.6 bits (187), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 39/81 (48%), Positives = 46/81 (56%), Gaps = 4/81 (4%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
VVDA K N I HSC PNC AK+ + G +I IY + I GEEIT+DY+ E
Sbjct: 1538 VVDATKKGNLGRLINHSCDPNCTAKIITISGVKKIVIYAKQDIELGEEITYDYHFPIEQD 1597
Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
+ CLCGS CRG YLN
Sbjct: 1598 NKIP---CLCGSARCRG-YLN 1614
>gi|242061944|ref|XP_002452261.1| hypothetical protein SORBIDRAFT_04g022620 [Sorghum bicolor]
gi|241932092|gb|EES05237.1| hypothetical protein SORBIDRAFT_04g022620 [Sorghum bicolor]
Length = 1840
Score = 76.6 bits (187), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 77/156 (49%), Gaps = 20/156 (12%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
+ + +KG G+ ++ E F++E++GEV + + +Q S + + FY
Sbjct: 1136 FYSGKKGYGLQLQED--VTEGRFLIEYVGEVLDITSYESRQRYYASKGQKH------FYF 1187
Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
+ L + V+DA K N I HSC PNC + V+G IGI+++R I
Sbjct: 1188 MALNGGE---------VIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFSLRNIK 1238
Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
GEE+TFDYN V S + C CG+ CRG YL
Sbjct: 1239 KGEELTFDYNYVRVSGAAPQK--CFCGTAKCRG-YL 1271
>gi|345567899|gb|EGX50801.1| hypothetical protein AOL_s00054g887 [Arthrobotrys oligospora ATCC
24927]
Length = 1338
Score = 76.6 bits (187), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 39/84 (46%), Positives = 49/84 (58%), Gaps = 2/84 (2%)
Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
+ V+DA K A I HSC PNC AK+ V+G +I IY +R IH EE+T+DY
Sbjct: 1257 ETTVIDATKKGGIARFINHSCTPNCTAKIIKVEGTKRIVIYALRDIHKDEELTYDYKFER 1316
Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
E E E CLCGS C+G +LN
Sbjct: 1317 EIDSE-ERIPCLCGSSGCKG-FLN 1338
>gi|148683294|gb|EDL15241.1| ash1 (absent, small, or homeotic)-like (Drosophila) [Mus musculus]
Length = 2918
Score = 76.6 bits (187), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2098 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2134
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2135 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2194
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2195 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2232
>gi|417407083|gb|JAA50168.1| Putative histone-lysine n-methyltransferase ash1l isoform 1 [Desmodus
rotundus]
Length = 2832
Score = 76.6 bits (187), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2141 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2177
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2178 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2237
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2238 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2275
>gi|403293713|ref|XP_003937857.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Saimiri
boliviensis boliviensis]
Length = 2970
Score = 76.6 bits (187), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2149 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2185
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2186 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2245
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2246 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2283
>gi|73960946|ref|XP_537251.2| PREDICTED: probable histone-lysine N-methyltransferase ASH1L isoform
1 [Canis lupus familiaris]
Length = 2965
Score = 76.6 bits (187), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2144 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2180
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2181 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2240
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2241 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2278
>gi|301785832|ref|XP_002928328.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L-like
[Ailuropoda melanoleuca]
Length = 2965
Score = 76.6 bits (187), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2144 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2180
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2181 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2240
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2241 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2278
>gi|291397821|ref|XP_002715465.1| PREDICTED: absent, small, or homeotic 1-like [Oryctolagus cuniculus]
Length = 2961
Score = 76.6 bits (187), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2140 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2176
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2177 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2236
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2237 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2274
>gi|13442965|gb|AAK26242.1|AF247132_1 putative chromatin remodeling factor [Mus musculus]
Length = 2669
Score = 76.6 bits (187), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 1849 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 1885
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 1886 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 1945
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 1946 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 1983
>gi|380814664|gb|AFE79206.1| putative histone-lysine N-methyltransferase ASH1L [Macaca mulatta]
gi|383419979|gb|AFH33203.1| putative histone-lysine N-methyltransferase ASH1L [Macaca mulatta]
Length = 2963
Score = 76.6 bits (187), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2142 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2178
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2179 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2238
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2239 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2276
>gi|351696657|gb|EHA99575.1| Putative histone-lysine N-methyltransferase ASH1L [Heterocephalus
glaber]
Length = 2930
Score = 76.6 bits (187), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2109 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2145
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2146 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2205
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2206 YALKDMTAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2243
>gi|300795068|ref|NP_001179672.1| probable histone-lysine N-methyltransferase ASH1L [Bos taurus]
gi|296489728|tpg|DAA31841.1| TPA: ash1 (absent, small, or homeotic)-like [Bos taurus]
Length = 2965
Score = 76.6 bits (187), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2144 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2180
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2181 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2240
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2241 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2278
>gi|388853505|emb|CCF52904.1| related to regulatory protein SET1 [Ustilago hordei]
Length = 1489
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 68/131 (51%), Gaps = 16/131 (12%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
D V+E++GE+ +Q + +K E ++ YL R D +VVDA
Sbjct: 1372 DMVIEYVGEMV-------RQQVADNREKQYERQG--NFSTYLFRVDDD------LVVDAT 1416
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
HK N A + H C PNC AK+ V+G +I ++ I GEE+T+DY + + +E +A
Sbjct: 1417 HKGNIARLMNHCCTPNCNAKILTVNGEKRIVLFAKSPIKAGEELTYDYKFQSSADDE-DA 1475
Query: 2027 SVCLCGSQVCR 2037
CLCGS CR
Sbjct: 1476 IPCLCGSDGCR 1486
>gi|355745722|gb|EHH50347.1| hypothetical protein EGM_01160 [Macaca fascicularis]
Length = 2904
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2144 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2180
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2181 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2240
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2241 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2278
>gi|110349788|ref|NP_060959.2| histone-lysine N-methyltransferase ASH1L [Homo sapiens]
gi|225000936|gb|AAI72595.1| Ash1 (absent, small, or homeotic)-like (Drosophila) [synthetic
construct]
Length = 2964
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2143 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2179
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2180 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2239
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2240 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2277
>gi|117949323|sp|Q9NR48.2|ASH1L_HUMAN RecName: Full=Histone-lysine N-methyltransferase ASH1L; AltName:
Full=ASH1-like protein; Short=huASH1; AltName:
Full=Absent small and homeotic disks protein 1 homolog;
AltName: Full=Lysine N-methyltransferase 2H
Length = 2969
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2148 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2184
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2185 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2244
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2245 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2282
>gi|350583322|ref|XP_003125756.3| PREDICTED: probable histone-lysine N-methyltransferase ASH1L-like,
partial [Sus scrofa]
Length = 2824
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 1997 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2033
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2034 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2093
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2094 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2131
>gi|344286471|ref|XP_003414981.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L
[Loxodonta africana]
Length = 2917
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2096 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2132
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2133 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2192
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2193 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2230
>gi|397492363|ref|XP_003817092.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
ASH1L [Pan paniscus]
Length = 2964
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2143 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2179
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2180 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2239
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2240 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2277
>gi|281338719|gb|EFB14303.1| hypothetical protein PANDA_018255 [Ailuropoda melanoleuca]
Length = 2981
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2160 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2196
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2197 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2256
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2257 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2294
>gi|7739725|gb|AAF68983.1|AF257305_1 ASH1 [Homo sapiens]
Length = 2969
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2148 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2184
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2185 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2244
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2245 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2282
>gi|390476801|ref|XP_002760038.2| PREDICTED: histone-lysine N-methyltransferase ASH1L [Callithrix
jacchus]
Length = 2970
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2149 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2185
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2186 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2245
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2246 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2283
>gi|324500453|gb|ADY40214.1| Histone-lysine N-methyltransferase lin-59 [Ascaris suum]
Length = 1467
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 55/170 (32%), Positives = 79/170 (46%), Gaps = 20/170 (11%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+GLGV + + FV E++GEV + + N F N Y
Sbjct: 799 RGLGV--RTDVPLQKGQFVCEYVGEVV----------SMETFDARNAHSYRAFRNHYA-- 844
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
GY V+DA K N A + HSC PNCE + +V+G ++IG++ +R + GEE+
Sbjct: 845 -LNLCPGY---VIDAYQKGNIARFVNHSCVPNCEMQRWSVNGQHRIGLFALRVVAKGEEL 900
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLL 2060
T+DYN +S + Y + C CG CRG A EK L G+L
Sbjct: 901 TYDYN--WDSFDFYGVTPCSCGVPNCRGFLNKNVLMNAKEKELARSSGVL 948
>gi|119573453|gb|EAW53068.1| ash1 (absent, small, or homeotic)-like (Drosophila) [Homo sapiens]
Length = 2969
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2148 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2184
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2185 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2244
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2245 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2282
>gi|195376627|ref|XP_002047094.1| GJ13235 [Drosophila virilis]
gi|194154252|gb|EDW69436.1| GJ13235 [Drosophila virilis]
Length = 2005
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 77/149 (51%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ + GE F++E++GEV + FE++ + S +N +Y + L
Sbjct: 1082 KKGCGITAELQIPPGE--FIMEYVGEVIDSEE-FERRQHLYSEDRNRH-----YYFMAL- 1132
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+G+A ++DA K N + I HSC PN E + V+G +IG ++V+ I GEE
Sbjct: 1133 --RGEA------IIDATTKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKTILPGEE 1184
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
ITFDY + +A C C S CRG
Sbjct: 1185 ITFDYQYQRYGR---DAQRCYCESANCRG 1210
>gi|195126250|ref|XP_002007587.1| GI12297 [Drosophila mojavensis]
gi|193919196|gb|EDW18063.1| GI12297 [Drosophila mojavensis]
Length = 1972
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 77/149 (51%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ + GE F++E++GEV + FE++ + S +N +Y + L
Sbjct: 1058 KKGCGITAELQIQPGE--FIMEYVGEVIDSEE-FERRQHLYSEDRNRH-----YYFMAL- 1108
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+G+A ++DA K N + I HSC PN E + V+G +IG ++V+ I GEE
Sbjct: 1109 --RGEA------IIDATTKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKTIMPGEE 1160
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
ITFDY + +A C C S CRG
Sbjct: 1161 ITFDYQYQRYGR---DAQRCYCESANCRG 1186
>gi|194895514|ref|XP_001978270.1| GG17783 [Drosophila erecta]
gi|190649919|gb|EDV47197.1| GG17783 [Drosophila erecta]
Length = 2384
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 77/149 (51%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ + GE F++E++GEV + FE++ + S +N +Y + L
Sbjct: 1424 KKGCGITAELQIPPGE--FIMEYVGEVIDSEE-FERRQHLYSKDRNRH-----YYFMAL- 1474
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+G+A ++DA K N + I HSC PN E + V+G +IG ++V+ I GEE
Sbjct: 1475 --RGEA------IIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQPGEE 1526
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
ITFDY + +A C C + CRG
Sbjct: 1527 ITFDYQYQRYGR---DAQRCYCEAANCRG 1552
>gi|410986772|ref|XP_003999683.1| PREDICTED: histone-lysine N-methyltransferase ASH1L isoform 1 [Felis
catus]
Length = 2965
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2144 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2180
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2181 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2240
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2241 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2278
>gi|410986774|ref|XP_003999684.1| PREDICTED: histone-lysine N-methyltransferase ASH1L isoform 2 [Felis
catus]
Length = 2974
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2153 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2189
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2190 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2249
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2250 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2287
>gi|241753587|ref|XP_002401135.1| huntingtin interacting protein, putative [Ixodes scapularis]
gi|215508354|gb|EEC17808.1| huntingtin interacting protein, putative [Ixodes scapularis]
Length = 1594
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/155 (32%), Positives = 78/155 (50%), Gaps = 20/155 (12%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+K++ +KG G+ + G FV+E++GEV + F K+ ++ ++N +
Sbjct: 621 EKFLTEKKGWGLRTVETLASGA--FVMEYVGEVL-TPEDFRKR--VKQYARDNHQ---HY 672
Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
Y + L + D ++DA K N + I HSC PNCE + V+G +IG +T R
Sbjct: 673 YFMAL---RSDE------IIDATQKGNVSRFINHSCDPNCETQKWTVNGELRIGFFTRRP 723
Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ GEE+TFDY K EA C C S CRG
Sbjct: 724 LRAGEELTFDYQFQRYGK---EAQKCYCESSKCRG 755
>gi|410033849|ref|XP_003949641.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Pan troglodytes]
Length = 2964
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2143 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2179
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2180 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2239
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2240 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2277
>gi|410226116|gb|JAA10277.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
gi|410264036|gb|JAA19984.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
gi|410264040|gb|JAA19986.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
gi|410306368|gb|JAA31784.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
gi|410355463|gb|JAA44335.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
Length = 2964
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2143 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2179
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2180 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2239
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2240 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2277
>gi|355558542|gb|EHH15322.1| hypothetical protein EGK_01394 [Macaca mulatta]
Length = 2796
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2068 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2104
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2105 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2164
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2165 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2202
>gi|224084984|ref|XP_002307459.1| SET domain protein [Populus trichocarpa]
gi|222856908|gb|EEE94455.1| SET domain protein [Populus trichocarpa]
Length = 594
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 52/149 (34%), Positives = 74/149 (49%), Gaps = 19/149 (12%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ +++ G+ F++E++GEV V + +Q S + FY + L
Sbjct: 170 KKGFGLRLDEDISRGQ--FLIEYVGEVLDVHAYEARQKDYASKGHKH------FYFMTL- 220
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
DG + V+DA K N I HSC PNC + V+G IG++ +R I GEE
Sbjct: 221 ------DGSE--VIDACAKGNLGRFINHSCDPNCRTEKWVVNGEICIGLFALRDIKMGEE 272
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+TFDYN V A C CGS CRG
Sbjct: 273 VTFDYNYVRVVGA--AAKRCYCGSPQCRG 299
>gi|348579791|ref|XP_003475662.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase ASH1L-like [Cavia porcellus]
Length = 2964
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2143 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2179
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2180 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2239
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 2240 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2277
>gi|449490008|ref|XP_004176439.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
ASH1L-like [Taeniopygia guttata]
Length = 2968
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 81/160 (50%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2148 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2184
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2185 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCNPNCEMQKWSVNGVYRIGL 2244
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG CRG
Sbjct: 2245 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFDKCRG 2282
>gi|410911836|ref|XP_003969396.1| PREDICTED: histone-lysine N-methyltransferase ASH1L-like [Takifugu
rubripes]
Length = 2782
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 61/205 (29%), Positives = 97/205 (47%), Gaps = 44/205 (21%)
Query: 1853 EEIEKEAVDDCDVR-TMKMCRGILKAMDSRPDDKYVA----------YR---KGLGVVCN 1898
+ IEK +DDC R + C + D++++ +R KG G+
Sbjct: 1937 DRIEKSCLDDCLNRMSFAECSPSTCPSADQCDNQHIQRHDWVQCLERFRTEGKGWGIRTK 1996
Query: 1899 KEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY 1958
+ G+ F++E+LGEV S Q EF + +E+ + Y
Sbjct: 1997 EPLRAGQ--FIIEYLGEVV-------------SEQ--------EFRSRMMEQYFSHSGNY 2033
Query: 1959 DL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFD 2013
L +V+D+ N A I HSC PNCE + +V+G Y+IG++ + I G E+T+D
Sbjct: 2034 CLNLDSGMVIDSYRMGNEARFINHSCEPNCEMQKWSVNGVYRIGLFALGEIPSGTELTYD 2093
Query: 2014 YNSVTESKEEYEASVCLCGSQVCRG 2038
YN + + EE +A C+CGS+ CRG
Sbjct: 2094 YNFHSFNTEEQQA--CMCGSESCRG 2116
>gi|195352880|ref|XP_002042939.1| GM11634 [Drosophila sechellia]
gi|194126986|gb|EDW49029.1| GM11634 [Drosophila sechellia]
Length = 1965
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 78/153 (50%), Gaps = 20/153 (13%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
+ +KG G+ GE F++E++GEV + FE++ + S +N +Y
Sbjct: 1272 FRTEKKGCGITAELLIPPGE--FIMEYVGEVIDSEE-FERRQHLYSKDRNRH-----YYF 1323
Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
+ L +G+A V+DA K N + I HSC PN E + V+G +IG ++V+ I
Sbjct: 1324 MAL---RGEA------VIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQ 1374
Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
GEEITFDY + + +A C C + CRG
Sbjct: 1375 PGEEITFDYQYLRYGR---DAQRCYCEATNCRG 1404
>gi|168044865|ref|XP_001774900.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673794|gb|EDQ60312.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1980
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 69/133 (51%), Gaps = 21/133 (15%)
Query: 1908 FVVEFLGEVY--PVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDA 1965
F++E++GEV P ++ +K+ + S QK+ FY + L + ++DA
Sbjct: 928 FIIEYVGEVLDMPSFEARQKEYSMNS-QKH-------FYFMTLSANE---------IIDA 970
Query: 1966 MHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE 2025
K N I HSC PNC+ + VDG IG++ +R + GEE+TFDYN V +
Sbjct: 971 CSKGNLGRFINHSCEPNCQTEKWMVDGEVCIGLFAIRDVKKGEEVTFDYNFVRVGGA--D 1028
Query: 2026 ASVCLCGSQVCRG 2038
A C CG+ CRG
Sbjct: 1029 AKKCECGANKCRG 1041
>gi|403167549|ref|XP_003327326.2| histone-lysine N-methyltransferase SETD2 [Puccinia graminis f. sp.
tritici CRL 75-36-700-3]
gi|375167080|gb|EFP82907.2| histone-lysine N-methyltransferase SETD2 [Puccinia graminis f. sp.
tritici CRL 75-36-700-3]
Length = 974
Score = 76.3 bits (186), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/184 (30%), Positives = 86/184 (46%), Gaps = 38/184 (20%)
Query: 1893 LGVVCNKEGGFG--------EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPE-- 1942
+ +V + GFG +D FV E+LGEV G+++L K +D E
Sbjct: 200 IEIVLTPKKGFGMRLQADVPKDTFVYEYLGEVI----------GVKALHKRLKDYGQEGI 249
Query: 1943 --FYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + L++ D Y +DA K + + HSC PNC V ++GI+T
Sbjct: 250 KHFYFMELQK-----DQY----IDATKKGGFGRFLNHSCNPNCYIGKWVVGRQLRMGIFT 300
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYL---NLTGEGAFEKVLKELH 2057
R + GEE+TF+YN + +EA C CG C G +L T GA +++ +
Sbjct: 301 KRAVRGGEELTFNYNV---DRYGHEAQECFCGEANCVG-FLGGKTQTDLGAMDELYIDAL 356
Query: 2058 GLLD 2061
G++D
Sbjct: 357 GIVD 360
>gi|395532129|ref|XP_003768124.1| PREDICTED: histone-lysine N-methyltransferase ASH1L isoform 1
[Sarcophilus harrisii]
Length = 2969
Score = 75.9 bits (185), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 81/160 (50%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2149 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2185
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2186 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCNPNCEMQKWSVNGVYRIGL 2245
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG CRG
Sbjct: 2246 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFDKCRG 2283
>gi|395532131|ref|XP_003768125.1| PREDICTED: histone-lysine N-methyltransferase ASH1L isoform 2
[Sarcophilus harrisii]
Length = 2974
Score = 75.9 bits (185), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 81/160 (50%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2154 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2190
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2191 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCNPNCEMQKWSVNGVYRIGL 2250
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG CRG
Sbjct: 2251 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFDKCRG 2288
>gi|126307634|ref|XP_001366993.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L
[Monodelphis domestica]
Length = 2968
Score = 75.9 bits (185), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 81/160 (50%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2148 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2184
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2185 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCNPNCEMQKWSVNGVYRIGL 2244
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG CRG
Sbjct: 2245 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFDKCRG 2282
>gi|24641786|ref|NP_572888.2| Set2, isoform A [Drosophila melanogaster]
gi|22832197|gb|AAF48273.2| Set2, isoform A [Drosophila melanogaster]
Length = 2362
Score = 75.9 bits (185), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 77/149 (51%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ GE F++E++GEV + FE++ + S +N +Y + L
Sbjct: 1420 KKGCGITAELLIPPGE--FIMEYVGEVIDSEE-FERRQHLYSKDRNRH-----YYFMAL- 1470
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+G+A V+DA K N + I HSC PN E + V+G +IG ++V+ I GEE
Sbjct: 1471 --RGEA------VIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQPGEE 1522
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
ITFDY + + +A C C + CRG
Sbjct: 1523 ITFDYQYLRYGR---DAQRCYCEAANCRG 1548
>gi|281360813|ref|NP_001162740.1| Set2, isoform B [Drosophila melanogaster]
gi|118582047|sp|Q9VYD1.2|C1716_DROME RecName: Full=Probable histone-lysine N-methyltransferase CG1716
gi|92109778|gb|ABE73213.1| LD27386p [Drosophila melanogaster]
gi|272506087|gb|ACZ95275.1| Set2, isoform B [Drosophila melanogaster]
Length = 2313
Score = 75.9 bits (185), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 77/149 (51%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ GE F++E++GEV + FE++ + S +N +Y + L
Sbjct: 1371 KKGCGITAELLIPPGE--FIMEYVGEVIDSEE-FERRQHLYSKDRNRH-----YYFMAL- 1421
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+G+A V+DA K N + I HSC PN E + V+G +IG ++V+ I GEE
Sbjct: 1422 --RGEA------VIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQPGEE 1473
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
ITFDY + + +A C C + CRG
Sbjct: 1474 ITFDYQYLRYGR---DAQRCYCEAANCRG 1499
>gi|348530060|ref|XP_003452529.1| PREDICTED: hypothetical protein LOC100707110 [Oreochromis niloticus]
Length = 2876
Score = 75.5 bits (184), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 51/153 (33%), Positives = 78/153 (50%), Gaps = 30/153 (19%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+ + G+ F++E+LGEV S Q EF + +E+
Sbjct: 2027 KGWGIRTKESLRSGQ--FIIEYLGEVV-------------SEQ--------EFRSRMMEQ 2063
Query: 1951 PKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
+ Y L +V+D+ N A I HSC PNCE + +V+G Y+IG++ ++ I
Sbjct: 2064 YFSHSGHYCLNLDSGMVIDSYRMGNEARFINHSCEPNCEMQKWSVNGVYRIGLFALKDIS 2123
Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
G E+T+DYN + + EE + VC CGS+ CRG
Sbjct: 2124 SGTELTYDYNFHSFNTEEQQ--VCKCGSESCRG 2154
>gi|432881031|ref|XP_004073771.1| PREDICTED: histone-lysine N-methyltransferase ASH1L-like [Oryzias
latipes]
Length = 2798
Score = 75.5 bits (184), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 1957 ERFRAEGKGWGIRTKEPLRAGQ--FIIEYLGEVV-------------SEQ--------EF 1993
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
+ +E+ + Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 1994 RSRMMEQYFSHSGHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2053
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ ++ + G E+T+DYN + + EE +A C CGS+ CRG
Sbjct: 2054 FALKDVSSGTELTYDYNFHSFNTEEQQA--CKCGSESCRG 2091
>gi|358058803|dbj|GAA95766.1| hypothetical protein E5Q_02423 [Mixia osmundae IAM 14324]
Length = 2083
Score = 75.5 bits (184), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 47/130 (36%), Positives = 64/130 (49%), Gaps = 23/130 (17%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVV 1963
D ++E++GE+ IR + + A E I YL R D +VV
Sbjct: 1412 DMIIEYVGEL------------IRQQVADKREKAYEKMGIGSSYLFRVDDD------LVV 1453
Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
DA K YA I H C PNC A++ + GH +I IY + I G+EIT+DY+ TES +
Sbjct: 1454 DATKKGTYARLINHCCAPNCTARIITIGGHKKIVIYALTDIEPGDEITYDYHFATESDDL 1513
Query: 2024 YEASVCLCGS 2033
CLCGS
Sbjct: 1514 KIP--CLCGS 1521
>gi|320168697|gb|EFW45596.1| Setd1a protein [Capsaspora owczarzaki ATCC 30864]
Length = 1312
Score = 75.5 bits (184), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 40/83 (48%), Positives = 50/83 (60%), Gaps = 9/83 (10%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
VVDA +K N A + H C PNC AK+ VDGH +I IY+ R I GEEIT+DY K
Sbjct: 1237 VVDATYKGNLARFMNHCCEPNCYAKIIMVDGHQRIVIYSKRDIKKGEEITYDY------K 1290
Query: 2022 EEYEAS--VCLCGSQVCRGSYLN 2042
YE + CLCG+ C+ +LN
Sbjct: 1291 FPYEENKIPCLCGAVNCK-KFLN 1312
>gi|395329295|gb|EJF61682.1| hypothetical protein DICSQDRAFT_85722 [Dichomitus squalens LYAD-421
SS1]
Length = 1095
Score = 75.5 bits (184), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 50/136 (36%), Positives = 65/136 (47%), Gaps = 25/136 (18%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVV 1963
D V+E++GEV IR+ + + A E I YL R D +VV
Sbjct: 980 DLVIEYVGEV------------IRAQVADKREKAYERQGIGSSYLFRIDED------LVV 1021
Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
DA K N I HSC PNC AK+ + G +I IY + I G EIT+DY+ E
Sbjct: 1022 DATKKGNLGRLINHSCDPNCTAKIITISGEKKIVIYAKQDIELGSEITYDYHFPIEQ--- 1078
Query: 2024 YEASVCLCGSQVCRGS 2039
+ CLCGS CRG+
Sbjct: 1079 -DKIPCLCGSAKCRGT 1093
>gi|312072804|ref|XP_003139232.1| hypothetical protein LOAG_03647 [Loa loa]
gi|307765598|gb|EFO24832.1| hypothetical protein LOAG_03647 [Loa loa]
Length = 1422
Score = 75.5 bits (184), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 69/133 (51%), Gaps = 21/133 (15%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
F++E++GEV + ++ IR ++ +DP + + YL K A V+DA
Sbjct: 627 FIIEYIGEV------IDAEEMIRRGRRYGKDP--KHVHHYLMALKNGA------VIDATA 672
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY--E 2025
K N + I HSC PNCE++ VD ++G + ++ I GEEI FDY E Y +
Sbjct: 673 KGNVSRFINHSCDPNCESQKWTVDRQLRVGFFVIKPIALGEEIVFDYQL-----ERYGRK 727
Query: 2026 ASVCLCGSQVCRG 2038
A C CG+ CRG
Sbjct: 728 AQRCFCGAANCRG 740
>gi|402081815|gb|EJT76960.1| histone-lysine N-methyltransferase [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 1319
Score = 75.5 bits (184), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 48/138 (34%), Positives = 70/138 (50%), Gaps = 17/138 (12%)
Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
+DD ++E++GE V K R L+ + YL R +A V+D
Sbjct: 1199 KDDMIIEYVGEE--VRPSVAKVREARYLKSG-------IGSTYLFRIDDEA------VID 1243
Query: 1965 AMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY 2024
A K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E +++
Sbjct: 1244 ATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIGQNEELTYDYKFEPE-EDQK 1302
Query: 2025 EASVCLCGSQVCRGSYLN 2042
+ CLCG+ C+G +LN
Sbjct: 1303 DRVPCLCGTTACKG-FLN 1319
>gi|195392836|ref|XP_002055060.1| GJ19006 [Drosophila virilis]
gi|194149570|gb|EDW65261.1| GJ19006 [Drosophila virilis]
Length = 2101
Score = 75.5 bits (184), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 77/149 (51%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ + GE F++E++GEV + FE++ I S +N +Y + L
Sbjct: 1071 KKGCGITAELQIPPGE--FIMEYVGEVIDSEE-FERRQHIYSRDRNRH-----YYFMAL- 1121
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+G+A ++DA K N + I HSC PN E + V+G +IG ++V+ I GEE
Sbjct: 1122 --RGEA------IIDATAKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKTIMPGEE 1173
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
ITFDY + +A C C + CRG
Sbjct: 1174 ITFDYQYQRYGR---DAQRCYCEASNCRG 1199
>gi|15150415|gb|AAK84931.1| SD01656p [Drosophila melanogaster]
Length = 1443
Score = 75.1 bits (183), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 77/149 (51%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ GE F++E++GEV + FE++ + S +N +Y + L
Sbjct: 501 KKGCGITAELLIPPGE--FIMEYVGEVIDSEE-FERRQHLYSKDRNRH-----YYFMAL- 551
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+G+A V+DA K N + I HSC PN E + V+G +IG ++V+ I GEE
Sbjct: 552 --RGEA------VIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQPGEE 603
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
ITFDY + + +A C C + CRG
Sbjct: 604 ITFDYQYLRYGR---DAQRCYCEAANCRG 629
>gi|301614673|ref|XP_002936809.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Xenopus
(Silurana) tropicalis]
Length = 1298
Score = 75.1 bits (183), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 86/158 (54%), Gaps = 21/158 (13%)
Query: 1882 PDDKYVAYR-KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
P+ K + KG G++ ++ GE FV E++GE+ ++++ + ++ E+
Sbjct: 1007 PETKIIKTEGKGWGLIATRDIKKGE--FVNEYIGEL------IDEEECMYRIRHAQENDI 1058
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + +++ + ++DA K N++ + HSC+PNCE + +V+G ++G++
Sbjct: 1059 THFYMLTIDKDR---------IIDAGPKGNFSRFMNHSCQPNCETQKWSVNGDTRVGLFA 1109
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
VR I GEE+TF+YN E+ ++C CG+ C G
Sbjct: 1110 VRDIPAGEELTFNYNLDCLGNEK---TICRCGAPNCSG 1144
>gi|427794953|gb|JAA62928.1| hypothetical protein, partial [Rhipicephalus pulchellus]
Length = 1557
Score = 75.1 bits (183), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 50/145 (34%), Positives = 70/145 (48%), Gaps = 30/145 (20%)
Query: 1903 FGEDDFVVEFLGE-VYPVW----KWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADG 1957
D+ V+E++G+ V P+ + F Q GI S + + +E
Sbjct: 1438 IAADEMVIEYVGQMVRPIMADRREQFYTQIGIGSSY---------LFRVDVE-------- 1480
Query: 1958 YDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSV 2017
++DA N A I HSC PNC AKV V+G +I IY+ + I+ EEIT+DY
Sbjct: 1481 ---TIIDATKCGNLARFINHSCNPNCYAKVITVEGQKKIVIYSKQPINVNEEITYDYKFP 1537
Query: 2018 TESKEEYEASVCLCGSQVCRGSYLN 2042
E E VCLCG+ CRG +LN
Sbjct: 1538 LED----EKIVCLCGAPQCRG-FLN 1557
>gi|219111565|ref|XP_002177534.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217412069|gb|EEC51997.1| predicted protein, partial [Phaeodactylum tricornutum CCAP 1055/1]
Length = 144
Score = 75.1 bits (183), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 51/150 (34%), Positives = 81/150 (54%), Gaps = 20/150 (13%)
Query: 1891 KGLGVV-CNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
KG G+V C+K G+ D V+E++G V EK+D + ++++ + P FY + L
Sbjct: 13 KGWGLVPCDK---IGKGDLVLEYVGNVIDA---KEKEDRLSEWERDHPND-PNFYIMSLR 65
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
D +DA HKAN + I HSC PNC V+G+ + GI+ R I GE
Sbjct: 66 ---------DQWYIDARHKANLSRFINHSCAPNCFLTQINVNGYARNGIFAKRDIQAGEF 116
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGS 2039
+++DY+ T+ + + VC CG++ CRG+
Sbjct: 117 LSYDYHFDTKQGDRF---VCRCGAKSCRGT 143
>gi|115446669|ref|NP_001047114.1| Os02g0554000 [Oryza sativa Japonica Group]
gi|50725771|dbj|BAD33302.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|113536645|dbj|BAF09028.1| Os02g0554000 [Oryza sativa Japonica Group]
Length = 637
Score = 75.1 bits (183), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 50/154 (32%), Positives = 74/154 (48%), Gaps = 19/154 (12%)
Query: 1885 KYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFY 1944
K+ +KG G+ ++ E F++E++GEV + + +Q S + + FY
Sbjct: 199 KFHTGKKGYGLQLKED--VSEGRFLIEYVGEVLDITAYESRQRYYASKGQKH------FY 250
Query: 1945 NIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGI 2004
+ L + V+DA K N I HSC PNC + V+G IGI+ +R I
Sbjct: 251 FMALNGGE---------VIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFAMRNI 301
Query: 2005 HYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
GEE+TFDYN V S + C CG+ CRG
Sbjct: 302 KKGEELTFDYNYVRVSGAAPQK--CFCGTAKCRG 333
>gi|346974289|gb|EGY17741.1| histone-lysine N-methyltransferase [Verticillium dahliae VdLs.17]
Length = 1148
Score = 75.1 bits (183), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 73/146 (50%), Gaps = 23/146 (15%)
Query: 1900 EGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYD 1959
E +DD ++E++GE + ++ IR ++ YL++ G + +
Sbjct: 1023 EENINKDDMIIEYVGE-----QVRQQISEIREVR-------------YLKQGMGSSYLFR 1064
Query: 1960 L---VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNS 2016
+ V+DA K A I HSC PNC AK+ VDG +I IY +R I EE+T+DY
Sbjct: 1065 IDENTVIDATKKGGIARFINHSCMPNCTAKIIKVDGSKRIVIYALRDIARTEELTYDYKF 1124
Query: 2017 VTESKEEYEASVCLCGSQVCRGSYLN 2042
E + CLCG+ +C+G +LN
Sbjct: 1125 EREIG-SLDRIPCLCGTALCKG-FLN 1148
>gi|357604624|gb|EHJ64265.1| mixed-lineage leukemia protein, mll [Danaus plexippus]
Length = 4387
Score = 74.7 bits (182), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 55/164 (33%), Positives = 76/164 (46%), Gaps = 32/164 (19%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVW------KWFEKQDGIRSLQKNNEDP 1939
Y ++ G G+ C ++ E D V+E+ GEV K +E G R +
Sbjct: 4249 YRSHIHGRGLFCKRD--IEEGDMVIEYAGEVIRAVLADQREKKYEAMSGRRGVG------ 4300
Query: 1940 APEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIY 1999
Y+ R D +VVDA K N A I HSC PNC ++V + GH I I+
Sbjct: 4301 -----GCYMFRID------DNLVVDATLKGNAARFINHSCDPNCYSRVVDIHGHKHILIF 4349
Query: 2000 TVRGIHYGEEITFDYNSVTESKEEYEASV-CLCGSQVCRGSYLN 2042
+R I GEE+T+DY E E + C CG++ CR YLN
Sbjct: 4350 ALRRITIGEELTYDYKFPFE-----EVKIPCTCGAKKCR-KYLN 4387
>gi|358401203|gb|EHK50509.1| hypothetical protein TRIATDRAFT_171650, partial [Trichoderma
atroviride IMI 206040]
Length = 1241
Score = 74.7 bits (182), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 47/143 (32%), Positives = 70/143 (48%), Gaps = 23/143 (16%)
Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
+DD ++E++GE E + I +++N YL+ G + + D
Sbjct: 1119 INKDDMIIEYVGE--------EVRQQIAEIRENR----------YLKSGIGSSYLFRIDD 1160
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E
Sbjct: 1161 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAMNEELTYDYKFERE 1220
Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
+ CLCG+ C+G +LN
Sbjct: 1221 IG-SLDRIPCLCGTAACKG-FLN 1241
>gi|358389897|gb|EHK27489.1| hypothetical protein TRIVIDRAFT_34353 [Trichoderma virens Gv29-8]
Length = 1221
Score = 74.7 bits (182), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 47/143 (32%), Positives = 70/143 (48%), Gaps = 23/143 (16%)
Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
+DD ++E++GE E + I +++N YL+ G + + D
Sbjct: 1099 INKDDMIIEYVGE--------EVRQQIAEIRENR----------YLKSGIGSSYLFRIDD 1140
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E
Sbjct: 1141 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAMNEELTYDYKFERE 1200
Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
+ CLCG+ C+G +LN
Sbjct: 1201 IG-SLDRIPCLCGTAACKG-FLN 1221
>gi|301629157|ref|XP_002943714.1| PREDICTED: hypothetical protein LOC100496979 [Xenopus (Silurana)
tropicalis]
Length = 1666
Score = 74.7 bits (182), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 49/160 (30%), Positives = 78/160 (48%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + F++E+LGEV EF
Sbjct: 54 ERFRAEGKGWGIRTKEP--LKASQFIIEYLGEVVSET---------------------EF 90
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 91 RNRTIEQYHNHSDHYCLSLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 150
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + VC CG + CRG
Sbjct: 151 YALKDMPAGTELTYDYNFHSFNTEKQQ--VCKCGVEKCRG 188
>gi|431892339|gb|ELK02779.1| Putative histone-lysine N-methyltransferase ASH1L [Pteropus alecto]
Length = 1291
Score = 74.7 bits (182), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 52/160 (32%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 474 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 510
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 511 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 570
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y +R + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 571 YALRDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 608
>gi|340514680|gb|EGR44940.1| predicted protein [Trichoderma reesei QM6a]
Length = 1236
Score = 74.7 bits (182), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 47/143 (32%), Positives = 70/143 (48%), Gaps = 23/143 (16%)
Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
+DD ++E++GE E + I +++N YL+ G + + D
Sbjct: 1114 INKDDMIIEYVGE--------EVRQQIAEIRENR----------YLKSGIGSSYLFRIDD 1155
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E
Sbjct: 1156 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAMNEELTYDYKFERE 1215
Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
+ CLCG+ C+G +LN
Sbjct: 1216 IG-SLDRIPCLCGTAACKG-FLN 1236
>gi|302416827|ref|XP_003006245.1| histone-lysine N-methyltransferase [Verticillium albo-atrum VaMs.102]
gi|261355661|gb|EEY18089.1| histone-lysine N-methyltransferase [Verticillium albo-atrum VaMs.102]
Length = 1135
Score = 74.7 bits (182), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 73/146 (50%), Gaps = 23/146 (15%)
Query: 1900 EGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYD 1959
E +DD ++E++GE + ++ IR ++ YL++ G + +
Sbjct: 1010 EENINKDDMIIEYVGE-----QVRQQISEIREVR-------------YLKQGMGSSYLFR 1051
Query: 1960 L---VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNS 2016
+ V+DA K A I HSC PNC AK+ VDG +I IY +R I EE+T+DY
Sbjct: 1052 IDENTVIDATKKGGIARFINHSCMPNCTAKIIKVDGSKRIVIYALRDIARTEELTYDYKF 1111
Query: 2017 VTESKEEYEASVCLCGSQVCRGSYLN 2042
E + CLCG+ +C+G +LN
Sbjct: 1112 EREIG-SLDRIPCLCGTALCKG-FLN 1135
>gi|260800140|ref|XP_002594994.1| hypothetical protein BRAFLDRAFT_99284 [Branchiostoma floridae]
gi|229280233|gb|EEN51005.1| hypothetical protein BRAFLDRAFT_99284 [Branchiostoma floridae]
Length = 1541
Score = 74.7 bits (182), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 41/132 (31%), Positives = 73/132 (55%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
DFV E++GE+ ++++ R ++K +ED FY + L++ + ++DA
Sbjct: 1161 DFVYEYVGEL------IDEEEVQRRIKKAHEDNVTNFYMLTLDKNR---------IIDAG 1205
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
KAN + + HSC+PNCE + V+G ++G++ + I G E+TF+YN E+
Sbjct: 1206 PKANMSRFMNHSCQPNCETQKWMVNGDIRVGLFAMDDIPTGSELTFNYNLDCLGNEK--- 1262
Query: 2027 SVCLCGSQVCRG 2038
+ C CG+ +C G
Sbjct: 1263 TPCNCGAPICSG 1274
>gi|66828443|ref|XP_647576.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|60475584|gb|EAL73519.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 898
Score = 74.7 bits (182), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 77/152 (50%), Gaps = 23/152 (15%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G++ N++ E F++E+ GEV KQ +R +++ + FY + L+
Sbjct: 626 KKGWGLIANED--IEEKQFIMEYCGEV------ISKQTCLRRMKEAENEKF--FYFLTLD 675
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+ +DA + N A + HSC PNCE + V G +IGI+ ++ I G E
Sbjct: 676 SKE---------CLDASKRGNLARFMNHSCDPNCETQKWTVGGEVKIGIFAIKPIPKGTE 726
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDYN ++ E C CGS CRG YL
Sbjct: 727 LTFDYNYERFGAQKQE---CYCGSVNCRG-YL 754
>gi|281206847|gb|EFA81031.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 1363
Score = 74.7 bits (182), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 51/154 (33%), Positives = 73/154 (47%), Gaps = 26/154 (16%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
+ A +KG G+ ++ FV+E+ GEV + ++ D FY
Sbjct: 988 FNAKKKGWGLKAKEK--ISAHQFVIEYCGEVITRAQSMDRM--------READGEKYFYF 1037
Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
+ L+ + V+DA K N A I HSC PNCE + +VDG +IGI+ ++ I
Sbjct: 1038 LTLDSKE---------VLDASRKGNLARFINHSCDPNCETQKWSVDGETRIGIFALKDIE 1088
Query: 2006 YGEEITFDYN--SVTESKEEYEASVCLCGSQVCR 2037
G E+TFDYN V SK+ C CGS CR
Sbjct: 1089 AGTELTFDYNYERVGSSKQS-----CYCGSVNCR 1117
>gi|194766778|ref|XP_001965501.1| GF22528 [Drosophila ananassae]
gi|190619492|gb|EDV35016.1| GF22528 [Drosophila ananassae]
Length = 2414
Score = 74.3 bits (181), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 48/149 (32%), Positives = 76/149 (51%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ + GE F++E++GEV + FE++ + S + +Y + L
Sbjct: 1433 KKGCGITAELQIPPGE--FIMEYVGEVI-DSEEFERRQHLYSRDRKRH-----YYFMAL- 1483
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+G+A ++DA K N + I HSC PN E + V+G +IG ++V+ I GEE
Sbjct: 1484 --RGEA------IIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKTIQPGEE 1535
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
ITFDY + +A C C + CRG
Sbjct: 1536 ITFDYQYQRYGR---DAQRCYCEATNCRG 1561
>gi|427779581|gb|JAA55242.1| Putative histone-lysine n-methyltransferase setd2 [Rhipicephalus
pulchellus]
Length = 2038
Score = 74.3 bits (181), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 52/155 (33%), Positives = 78/155 (50%), Gaps = 20/155 (12%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+K++ +KG G+ + G FV+E++GEV + F K+ ++ ++N +
Sbjct: 872 EKFMTEKKGWGLRTLETVSSG--TFVMEYVGEVL-TPEDFRKR--VKQYARDNNQ---HY 923
Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
Y + L AD ++DA K N + I HSC PNCE + V+G +IG +T R
Sbjct: 924 YFMALR-----ADE----IIDATQKGNVSRFINHSCDPNCETQKWTVNGELRIGFFTRRP 974
Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ GEE+TFDY K EA C C S CRG
Sbjct: 975 LRAGEELTFDYQFQRYGK---EAQRCHCESSNCRG 1006
>gi|336372757|gb|EGO01096.1| hypothetical protein SERLA73DRAFT_50848 [Serpula lacrymans var.
lacrymans S7.3]
Length = 260
Score = 74.3 bits (181), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 52/139 (37%), Positives = 68/139 (48%), Gaps = 26/139 (18%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVV 1963
+ V+E++GEV IR+ + + E I YL R D +VV
Sbjct: 145 EMVIEYVGEV------------IRAQVADKREKVYERQGIGSSYLFRIDED------LVV 186
Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
DA K N I HSC PNC AK+ ++G +I IY + I GEEIT+DY+ E
Sbjct: 187 DATKKGNLGRLINHSCDPNCTAKIITINGEKKIVIYAKQDIELGEEITYDYHFPIEQ--- 243
Query: 2024 YEASVCLCGSQVCRGSYLN 2042
+ CLCGS CRG YLN
Sbjct: 244 -DKIPCLCGSAKCRG-YLN 260
>gi|195012609|ref|XP_001983710.1| GH16034 [Drosophila grimshawi]
gi|193897192|gb|EDV96058.1| GH16034 [Drosophila grimshawi]
Length = 2059
Score = 74.3 bits (181), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 48/149 (32%), Positives = 76/149 (51%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ + GE F++E++GEV + FE++ + S +N +Y + L
Sbjct: 1167 KKGCGITAELQMPSGE--FIMEYVGEVIDSEE-FERRQHLYSEDRNRH-----YYFMAL- 1217
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+ D+ ++DA K N + I HSC PN E + V+G +IG ++++ I GEE
Sbjct: 1218 --RSDS------IIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSLKTIMPGEE 1269
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
ITFDY + +A C C S CRG
Sbjct: 1270 ITFDYQYQRYGR---DAQRCYCESANCRG 1295
>gi|390355933|ref|XP_784903.3| PREDICTED: uncharacterized protein LOC579712 isoform 3
[Strongylocentrotus purpuratus]
Length = 3326
Score = 74.3 bits (181), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 69/131 (52%), Gaps = 18/131 (13%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
F++E+LGEV V + +++ QK++ Y + L DG +V+D
Sbjct: 2556 FIIEYLGEVISVKELWKRALDDYQYQKHH-------YCLNL-------DGG--MVIDGYR 2599
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
N + HSC PNCE + V+G Y+IG++ +R I GEE+T+DYN + + E +
Sbjct: 2600 YGNEGRFVNHSCNPNCEMQKWMVNGLYRIGMFALRDIQPGEELTYDYNFHSFNMETQQE- 2658
Query: 2028 VCLCGSQVCRG 2038
C CG + CRG
Sbjct: 2659 -CNCGHETCRG 2668
>gi|390355935|ref|XP_003728661.1| PREDICTED: uncharacterized protein LOC579712 isoform 1
[Strongylocentrotus purpuratus]
Length = 3164
Score = 74.3 bits (181), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 69/131 (52%), Gaps = 18/131 (13%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
F++E+LGEV V + +++ QK++ Y + L DG +V+D
Sbjct: 2377 FIIEYLGEVISVKELWKRALDDYQYQKHH-------YCLNL-------DGG--MVIDGYR 2420
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
N + HSC PNCE + V+G Y+IG++ +R I GEE+T+DYN + + E +
Sbjct: 2421 YGNEGRFVNHSCNPNCEMQKWMVNGLYRIGMFALRDIQPGEELTYDYNFHSFNMETQQE- 2479
Query: 2028 VCLCGSQVCRG 2038
C CG + CRG
Sbjct: 2480 -CNCGHETCRG 2489
>gi|402856517|ref|XP_003892835.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Papio anubis]
Length = 1277
Score = 74.3 bits (181), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 456 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 492
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 493 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 552
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 553 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 590
>gi|74140676|dbj|BAC28183.2| unnamed protein product [Mus musculus]
Length = 418
Score = 74.3 bits (181), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 49/160 (30%), Positives = 80/160 (50%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV EF
Sbjct: 91 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVVS---------------------EQEF 127
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 128 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 187
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 188 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 225
>gi|390355937|ref|XP_003728662.1| PREDICTED: uncharacterized protein LOC579712 isoform 2
[Strongylocentrotus purpuratus]
Length = 3111
Score = 74.3 bits (181), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 69/131 (52%), Gaps = 18/131 (13%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
F++E+LGEV V + +++ QK++ Y + L DG +V+D
Sbjct: 2324 FIIEYLGEVISVKELWKRALDDYQYQKHH-------YCLNL-------DGG--MVIDGYR 2367
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
N + HSC PNCE + V+G Y+IG++ +R I GEE+T+DYN + + E +
Sbjct: 2368 YGNEGRFVNHSCNPNCEMQKWMVNGLYRIGMFALRDIQPGEELTYDYNFHSFNMETQQE- 2426
Query: 2028 VCLCGSQVCRG 2038
C CG + CRG
Sbjct: 2427 -CNCGHETCRG 2436
>gi|302910631|ref|XP_003050330.1| histone H3 methyltransferase complex protein [Nectria haematococca
mpVI 77-13-4]
gi|256731267|gb|EEU44617.1| histone H3 methyltransferase complex protein [Nectria haematococca
mpVI 77-13-4]
Length = 1281
Score = 74.3 bits (181), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 70/143 (48%), Gaps = 23/143 (16%)
Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL-- 1960
+DD ++E++GE E + I +++N YL+ G + + +
Sbjct: 1159 IAKDDMIIEYVGE--------EVRQQIAEIRENR----------YLKSGIGSSYLFRIDE 1200
Query: 1961 -VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E
Sbjct: 1201 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAMNEELTYDYKFERE 1260
Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
+ CLCG+ C+G +LN
Sbjct: 1261 IG-SLDRIPCLCGTAACKG-FLN 1281
>gi|320593249|gb|EFX05658.1| set domain containing protein [Grosmannia clavigera kw1407]
Length = 1450
Score = 73.9 bits (180), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 47/84 (55%), Gaps = 2/84 (2%)
Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
D V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY
Sbjct: 1369 DGTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIGQNEELTYDYKFEP 1428
Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
E E CLCG+ C+G +LN
Sbjct: 1429 EDNPEDRVP-CLCGTTACKG-FLN 1450
>gi|195448204|ref|XP_002071555.1| GK25076 [Drosophila willistoni]
gi|194167640|gb|EDW82541.1| GK25076 [Drosophila willistoni]
Length = 2217
Score = 73.9 bits (180), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 77/149 (51%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ + GE F++E++GEV + FE++ + S +N +Y + L
Sbjct: 1167 KKGCGITAELQIPPGE--FIMEYVGEVIDAEE-FERRQHLYSKDRNRH-----YYFMAL- 1217
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+G+A ++DA K N + I HSC PN E + V+G +IG ++V+ I GEE
Sbjct: 1218 --RGEA------IIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKTILPGEE 1269
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
ITFDY + +A C C + CRG
Sbjct: 1270 ITFDYQY---QRYGRDAQRCYCEAINCRG 1295
>gi|296422581|ref|XP_002840838.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295637063|emb|CAZ85029.1| unnamed protein product [Tuber melanosporum]
Length = 1200
Score = 73.9 bits (180), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 38/84 (45%), Positives = 48/84 (57%), Gaps = 2/84 (2%)
Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
D V+DA A I HSC PNC AK+ V+G +I IY +R I EE+T+DY
Sbjct: 1119 DTTVIDATKAGGIARFINHSCTPNCTAKIIKVEGSKRIVIYALRDIRENEELTYDYKFER 1178
Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
E + E E CLCGS C+G +LN
Sbjct: 1179 ELESE-ERIPCLCGSSGCKG-FLN 1200
>gi|408391029|gb|EKJ70413.1| hypothetical protein FPSE_09407 [Fusarium pseudograminearum CS3096]
Length = 1263
Score = 73.9 bits (180), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 70/143 (48%), Gaps = 23/143 (16%)
Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
+DD ++E++GE + + I +++N YL+ G + + D
Sbjct: 1141 IAKDDMIIEYVGE--------QVRQQISEIRENR----------YLKSGIGSSYLFRIDD 1182
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E
Sbjct: 1183 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIALNEELTYDYKFERE 1242
Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
+ CLCG+ C+G +LN
Sbjct: 1243 IG-STDRIPCLCGTAACKG-FLN 1263
>gi|168009924|ref|XP_001757655.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691349|gb|EDQ77712.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1715
Score = 73.9 bits (180), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 69/133 (51%), Gaps = 21/133 (15%)
Query: 1908 FVVEFLGEVY--PVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDA 1965
F++E++GEV P ++ +K+ + S QK+ FY + L + ++DA
Sbjct: 766 FIIEYVGEVLDMPSFEARQKEYSMNS-QKH-------FYFMTLSANE---------IIDA 808
Query: 1966 MHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE 2025
+K N I HSC PNC+ + VDG IG++ +R I EE+TFDYN V +
Sbjct: 809 CNKGNLGRFINHSCEPNCQTEKWMVDGEVCIGLFAIRDIKEREEVTFDYNFVRVGG--AD 866
Query: 2026 ASVCLCGSQVCRG 2038
A C CG+ CRG
Sbjct: 867 AKKCECGASKCRG 879
>gi|260791327|ref|XP_002590691.1| hypothetical protein BRAFLDRAFT_125552 [Branchiostoma floridae]
gi|229275887|gb|EEN46702.1| hypothetical protein BRAFLDRAFT_125552 [Branchiostoma floridae]
Length = 2482
Score = 73.9 bits (180), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
+++DA N A I H C PNC AK+ V+G+ +I IY+ R I EEIT+DY E
Sbjct: 2406 MIIDATKNGNLARFINHCCNPNCYAKIITVEGYKKIVIYSRRDIAVNEEITYDYKFPIED 2465
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
E CLCG++ CRG+
Sbjct: 2466 ----EKIPCLCGAENCRGT 2480
>gi|410516926|sp|Q4I5R3.2|SET1_GIBZE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
Length = 1263
Score = 73.9 bits (180), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 70/143 (48%), Gaps = 23/143 (16%)
Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
+DD ++E++GE + + I +++N YL+ G + + D
Sbjct: 1141 IAKDDMIIEYVGE--------QVRQQISEIRENR----------YLKSGIGSSYLFRIDD 1182
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E
Sbjct: 1183 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIALNEELTYDYKFERE 1242
Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
+ CLCG+ C+G +LN
Sbjct: 1243 IG-STDRIPCLCGTAACKG-FLN 1263
>gi|170591502|ref|XP_001900509.1| SET domain containing protein [Brugia malayi]
gi|158592121|gb|EDP30723.1| SET domain containing protein [Brugia malayi]
Length = 1056
Score = 73.9 bits (180), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 37/79 (46%), Positives = 48/79 (60%), Gaps = 6/79 (7%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYN-SVTES 2020
V+DA N A I HSC+PNC AK+ VDG +I IY+ I+ G+EIT+DY + E
Sbjct: 981 VIDATQMGNLARFINHSCQPNCYAKIVVVDGEKRIVIYSKLAINKGDEITYDYKFPIEED 1040
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
K + CLCG+ CRGS
Sbjct: 1041 KID-----CLCGAPGCRGS 1054
>gi|154422490|ref|XP_001584257.1| SET domain containing protein [Trichomonas vaginalis G3]
gi|121918503|gb|EAY23271.1| SET domain containing protein [Trichomonas vaginalis G3]
Length = 259
Score = 73.9 bits (180), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/86 (47%), Positives = 50/86 (58%), Gaps = 3/86 (3%)
Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
D + +DA HK A + HSC PNC+ V G I I+ + I EE+T+DYN
Sbjct: 158 DDLYIDATHKGGIARFLNHSCDPNCKTCVVEAGGQRHIVIFAKKKIEPFEELTYDYNLPY 217
Query: 2019 ESKEEYEASVCLCGSQVCRGSYLNLT 2044
ESKE +A VCLCGS CRG YLN T
Sbjct: 218 ESKE--KAIVCLCGSPKCRG-YLNYT 240
>gi|158301050|ref|XP_001238385.2| AGAP011688-PA [Anopheles gambiae str. PEST]
gi|157013454|gb|EAU75883.2| AGAP011688-PA [Anopheles gambiae str. PEST]
Length = 2404
Score = 73.9 bits (180), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 80/159 (50%), Gaps = 21/159 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ + GE F++E++GEV ++ E+ + S +KN +Y + L
Sbjct: 1289 KKGFGIQASSAIAPGE--FIMEYVGEVLNSAQFDERAEAY-SREKNKH-----YYFMALR 1340
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+DG ++DA K N + I HSC PN E + V+G +IG ++ + I GEE
Sbjct: 1341 -----SDG----IIDATTKGNISRFINHSCDPNAETQKWTVNGELRIGFFSTKYILPGEE 1391
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSY-LNLTGEG 2047
ITFDY + +A C C ++ CRG TGEG
Sbjct: 1392 ITFDYQFQRYGR---KAQKCYCEAESCRGWIGAKPTGEG 1427
>gi|391325531|ref|XP_003737286.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like
[Metaseiulus occidentalis]
Length = 976
Score = 73.9 bits (180), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 38/82 (46%), Positives = 49/82 (59%), Gaps = 5/82 (6%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N A I HSC PNC A+V V+G +I IY+ R I EEIT+DY
Sbjct: 900 TIIDATKCGNLARFINHSCNPNCYARVITVEGQKKIVIYSKRDISVNEEITYDYKF---P 956
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+EE + + CLCG+ CRG YLN
Sbjct: 957 REEVKIT-CLCGTPQCRG-YLN 976
>gi|430813239|emb|CCJ29409.1| unnamed protein product [Pneumocystis jirovecii]
Length = 375
Score = 73.9 bits (180), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 49/133 (36%), Positives = 64/133 (48%), Gaps = 19/133 (14%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
D V+E++GE+ + IR Q + YL R D VVDA
Sbjct: 260 DMVIEYVGEIVR-----QTVADIRERQYERQGIGSS----YLFRIDDDT------VVDAT 304
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K N A I HSC P+C AK+ V+G +I IY R I GEEIT+DY E +
Sbjct: 305 KKGNIARFINHSCDPSCTAKIIRVEGEKKIVIYAHRDIEKGEEITYDYKFPIEDVK---- 360
Query: 2027 SVCLCGSQVCRGS 2039
CLCG++ CRG+
Sbjct: 361 IPCLCGAKACRGT 373
>gi|295913201|gb|ADG57859.1| transcription factor [Lycoris longituba]
Length = 164
Score = 73.9 bits (180), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 51/134 (38%), Positives = 67/134 (50%), Gaps = 19/134 (14%)
Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDA 1965
+DFV+E++GE+ + IR Q YL R DGY VVDA
Sbjct: 48 EDFVIEYVGELVR-----RQISDIRECQYEKMGIGSS----YLFRLD---DGY---VVDA 92
Query: 1966 MHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE 2025
+ A I HSC PNC KV V+G +I IY R IH GEE+T++Y E ++
Sbjct: 93 TKRGGIARFINHSCEPNCYTKVITVEGQKKIFIYAKRHIHAGEELTYNYKFPLEEQK--- 149
Query: 2026 ASVCLCGSQVCRGS 2039
+C CGS+ CRGS
Sbjct: 150 -ILCNCGSKRCRGS 162
>gi|346322948|gb|EGX92546.1| histone-lysine N-methyltransferase [Cordyceps militaris CM01]
Length = 1151
Score = 73.9 bits (180), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/141 (32%), Positives = 70/141 (49%), Gaps = 23/141 (16%)
Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL---V 1961
+DD ++E++GE E + I +++N YL+ G + + +
Sbjct: 1031 KDDMIIEYVGE--------EVRQQISEIRENR----------YLKSGIGSSYLFRIDENT 1072
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E
Sbjct: 1073 VIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDITTNEELTYDYKFEREIG 1132
Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
+ CLCG+ C+G +LN
Sbjct: 1133 -SLDRIPCLCGTAACKG-FLN 1151
>gi|413937237|gb|AFW71788.1| hypothetical protein ZEAMMB73_686749 [Zea mays]
Length = 1815
Score = 73.6 bits (179), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 73/152 (48%), Gaps = 20/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ ++ E F++E++GEV + + +Q + + FY + L
Sbjct: 1077 KKGYGLQLQED--VTEGRFLIEYVGEVLDITSYESRQRYYACKGQKH------FYFMALN 1128
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+ V+DA K N I HSC PNC + V+G IGI+ +R I GEE
Sbjct: 1129 GGE---------VIDACTKGNLGRFINHSCSPNCCTEKWMVNGEVCIGIFALRSIKKGEE 1179
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDYN V S + C CG+ CRG YL
Sbjct: 1180 LTFDYNYVRVSGAAPQK--CFCGTAKCRG-YL 1208
>gi|353243391|emb|CCA74938.1| related to regulatory protein SET1 [Piriformospora indica DSM 11827]
Length = 1224
Score = 73.6 bits (179), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 48/84 (57%), Gaps = 5/84 (5%)
Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
D +VVDA N I HSC PNC AK+ + G +I IY IH G+E+T+DY+
Sbjct: 1146 DDLVVDATKIGNLGRLINHSCDPNCTAKIITIGGQKKIVIYAKVDIHPGDEVTYDYHFPI 1205
Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
E+ E CLCG+ CRG +LN
Sbjct: 1206 EN----EKIPCLCGAAKCRG-FLN 1224
>gi|413937236|gb|AFW71787.1| hypothetical protein ZEAMMB73_686749 [Zea mays]
Length = 1756
Score = 73.6 bits (179), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 73/152 (48%), Gaps = 20/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ ++ E F++E++GEV + + +Q + + FY + L
Sbjct: 1018 KKGYGLQLQED--VTEGRFLIEYVGEVLDITSYESRQRYYACKGQKH------FYFMALN 1069
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+ V+DA K N I HSC PNC + V+G IGI+ +R I GEE
Sbjct: 1070 GGE---------VIDACTKGNLGRFINHSCSPNCCTEKWMVNGEVCIGIFALRSIKKGEE 1120
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDYN V S + C CG+ CRG YL
Sbjct: 1121 LTFDYNYVRVSGAAPQK--CFCGTAKCRG-YL 1149
>gi|400596097|gb|EJP63881.1| histone H3 methyltransferase complex protein [Beauveria bassiana
ARSEF 2860]
Length = 1220
Score = 73.6 bits (179), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/141 (32%), Positives = 70/141 (49%), Gaps = 23/141 (16%)
Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL---V 1961
+DD ++E++GE E + I +++N YL+ G + + +
Sbjct: 1100 KDDMIIEYVGE--------EVRQQISEIRENR----------YLKSGIGSSYLFRIDENT 1141
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E
Sbjct: 1142 VIDATKKGGIARFINHSCLPNCTAKIIKVEGSKRIVIYALREIAMNEELTYDYKFEREIG 1201
Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
+ CLCG+ C+G +LN
Sbjct: 1202 -SLDRIPCLCGTAACKG-FLN 1220
>gi|407921620|gb|EKG14761.1| hypothetical protein MPH_08036 [Macrophomina phaseolina MS6]
Length = 1167
Score = 73.6 bits (179), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/137 (36%), Positives = 71/137 (51%), Gaps = 17/137 (12%)
Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDA 1965
+D ++E++GE K +K IR ++ + + + YL R D+ VVDA
Sbjct: 1048 NDMIIEYVGE-----KVRQKVADIREIKYDKQG----VGSSYLFRIDEDS------VVDA 1092
Query: 1966 MHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE 2025
K A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E + +
Sbjct: 1093 TKKGGIARFINHSCSPNCTAKIIRVDGTKRIVIYALRDIKTNEELTYDYKFEREIGSD-D 1151
Query: 2026 ASVCLCGSQVCRGSYLN 2042
CLCGS C+G +LN
Sbjct: 1152 RIPCLCGSVNCKG-FLN 1167
>gi|47219458|emb|CAG10822.1| unnamed protein product [Tetraodon nigroviridis]
Length = 2598
Score = 73.6 bits (179), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 51/153 (33%), Positives = 77/153 (50%), Gaps = 30/153 (19%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+ + G+ F++E+LGEV S Q EF + +E+
Sbjct: 1755 KGWGIRTKQPLRAGQ--FIIEYLGEVV-------------SEQ--------EFRSRMMEQ 1791
Query: 1951 PKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
+ Y L +V+D+ N A I HSC PNCE + +V+G Y+IG++ + I
Sbjct: 1792 YFSHSGNYCLNLDSGMVIDSYRMGNEARFINHSCEPNCEMQKWSVNGVYRIGLFALGEIP 1851
Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
G E+T+DYN + + EE +A C CGS+ CRG
Sbjct: 1852 SGTELTYDYNFHSFNTEEQQA--CKCGSESCRG 1882
>gi|301122693|ref|XP_002909073.1| histone-lysine N-methyltransferase, putative [Phytophthora infestans
T30-4]
gi|262099835|gb|EEY57887.1| histone-lysine N-methyltransferase, putative [Phytophthora infestans
T30-4]
Length = 751
Score = 73.6 bits (179), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 57/176 (32%), Positives = 86/176 (48%), Gaps = 27/176 (15%)
Query: 1863 CDVRTMKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKW 1922
C R +K R LK+M +Y+ G G++ N++ GE FV+E++GEV
Sbjct: 190 CSNRAIK--RRQLKSMRV----EYIPGGPGFGLITNEDINAGE--FVIEYVGEV------ 235
Query: 1923 FEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPN 1982
+ ++ R + ++ FY + LE+ +V+DA +++N + I HSC PN
Sbjct: 236 IDDKECERRMITYRDNGEVNFYMMELEKN---------IVIDAKYRSNDSRFINHSCDPN 286
Query: 1983 CEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ VDG +IGI+ R I EEIT DYN EA+ C CGS C G
Sbjct: 287 SVTQKWNVDGMQRIGIFARRNIAPNEEITIDYN----FSHFGEAADCRCGSTACTG 338
>gi|390605099|gb|EIN14490.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
SS5]
Length = 164
Score = 73.6 bits (179), Expect = 1e-09, Method: Composition-based stats.
Identities = 52/139 (37%), Positives = 69/139 (49%), Gaps = 26/139 (18%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVV 1963
+ V+E++GEV IR+ + + A E I YL R D +VV
Sbjct: 49 EMVIEYVGEV------------IRAQVADKREKAYERQGIGSSYLFRIDED------LVV 90
Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
DA K N I HSC PNC AK+ ++G +I IY + I G+EIT+DY+ E
Sbjct: 91 DATKKGNLGRLINHSCDPNCTAKIITINGEKKIVIYAKQDIELGDEITYDYHFPIEQ--- 147
Query: 2024 YEASVCLCGSQVCRGSYLN 2042
+ CLCGS CRG YLN
Sbjct: 148 -DKIPCLCGSAKCRG-YLN 164
>gi|116199091|ref|XP_001225357.1| hypothetical protein CHGG_07701 [Chaetomium globosum CBS 148.51]
gi|121922631|sp|Q2GWF3.1|SET1_CHAGB RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|88178980|gb|EAQ86448.1| hypothetical protein CHGG_07701 [Chaetomium globosum CBS 148.51]
Length = 1076
Score = 73.6 bits (179), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 48/141 (34%), Positives = 70/141 (49%), Gaps = 23/141 (16%)
Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---DLV 1961
+DD ++E++GE E + I L++N YL+ G + + D
Sbjct: 956 KDDMIIEYVGE--------EVRQQIAELRENR----------YLKSGIGSSYLFRIDDNT 997
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E
Sbjct: 998 VIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERELG 1057
Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
+ CLCG+ C+G +LN
Sbjct: 1058 ST-DRIPCLCGTAACKG-FLN 1076
>gi|358338843|dbj|GAA57433.1| histone-lysine N-methyltransferase trithorax, partial [Clonorchis
sinensis]
Length = 328
Score = 73.6 bits (179), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 51/141 (36%), Positives = 69/141 (48%), Gaps = 21/141 (14%)
Query: 1902 GFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLV 1961
GF ED+ V+E++GE+ + ++ RS + Y+ R D +
Sbjct: 209 GFREDEMVIEYMGELIRNFVCETREIRYRSAG----------VDCYMFRIDSD------L 252
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
V+DA + N A I HSC PNC AKV VD I I R I+ GEE+T+DY ES
Sbjct: 253 VIDATYAGNAARFINHSCDPNCYAKVVTVDDKKHIVILAQRRIYPGEELTYDYRFPKES- 311
Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
+ +C CGS CR YLN
Sbjct: 312 ---DKLLCNCGSYNCR-KYLN 328
>gi|170587756|ref|XP_001898640.1| SET domain containing protein [Brugia malayi]
gi|158593910|gb|EDP32504.1| SET domain containing protein [Brugia malayi]
Length = 1449
Score = 73.2 bits (178), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 69/133 (51%), Gaps = 21/133 (15%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
F++E++GEV + ++ IR ++ +DP + + YL K A V+DA
Sbjct: 657 FIIEYVGEV------IDAEEMIRRGRRYGKDP--KHVHHYLMALKNGA------VIDATA 702
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY--E 2025
K N + I HSC PNCE++ V+ ++G + ++ I GEEI FDY E Y +
Sbjct: 703 KGNVSRFINHSCDPNCESQKWTVNRQLRVGFFVIKPIALGEEIVFDYQL-----ERYGRK 757
Query: 2026 ASVCLCGSQVCRG 2038
A C CG+ CRG
Sbjct: 758 AQRCFCGAANCRG 770
>gi|429862241|gb|ELA36898.1| histone-lysine n-methyltransferase [Colletotrichum gloeosporioides
Nara gc5]
Length = 1270
Score = 73.2 bits (178), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 70/146 (47%), Gaps = 23/146 (15%)
Query: 1900 EGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY- 1958
E +DD ++E++GE + + I +++ YL+ G + +
Sbjct: 1145 EENINKDDMIIEYVGE--------QVRQSISEIREKR----------YLKSGMGSSYLFR 1186
Query: 1959 --DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNS 2016
D V+DA K A I HSC PNC AK+ VDG +I IY +R I EE+T+DY
Sbjct: 1187 IDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVDGSKRIVIYALRDIAQHEELTYDYKF 1246
Query: 2017 VTESKEEYEASVCLCGSQVCRGSYLN 2042
E + CLCG+ C+G +LN
Sbjct: 1247 EREIG-SLDRIPCLCGTAACKG-FLN 1270
>gi|357161607|ref|XP_003579145.1| PREDICTED: uncharacterized protein LOC100843412 [Brachypodium
distachyon]
Length = 1194
Score = 73.2 bits (178), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 54/139 (38%), Positives = 67/139 (48%), Gaps = 29/139 (20%)
Query: 1906 DDFVVEFLGEVY--PVWKWFEKQ---DGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL 1960
+DFV+E++GE+ PV E Q GI S YL R D
Sbjct: 1078 EDFVIEYVGELIRRPVSDIREAQYEKSGIGS--------------SYLFRLDDD------ 1117
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
VVDA + A I HSC PNC KV VDG +I IY+ R I+ GEE+T++Y E
Sbjct: 1118 YVVDATKRGGLARFINHSCEPNCYTKVITVDGQKKIFIYSKRRIYAGEELTYNYKFPLEE 1177
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
K+ C CGS CRGS
Sbjct: 1178 KK----IPCHCGSLRCRGS 1192
>gi|336385606|gb|EGO26753.1| hypothetical protein SERLADRAFT_385814 [Serpula lacrymans var.
lacrymans S7.9]
Length = 115
Score = 73.2 bits (178), Expect = 2e-09, Method: Composition-based stats.
Identities = 39/82 (47%), Positives = 48/82 (58%), Gaps = 5/82 (6%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
+VVDA K N I HSC PNC AK+ ++G +I IY + I GEEIT+DY+ E
Sbjct: 39 LVVDATKKGNLGRLINHSCDPNCTAKIITINGEKKIVIYAKQDIELGEEITYDYHFPIEQ 98
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ CLCGS CRG YLN
Sbjct: 99 ----DKIPCLCGSAKCRG-YLN 115
>gi|328768995|gb|EGF79040.1| hypothetical protein BATDEDRAFT_35515 [Batrachochytrium dendrobatidis
JAM81]
Length = 1367
Score = 73.2 bits (178), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 48/137 (35%), Positives = 65/137 (47%), Gaps = 25/137 (18%)
Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVV 1962
+D V+E++GE+ IR ++ + E I YL R D +
Sbjct: 1251 NDMVIEYIGEI------------IRQKVADHREKLYEASGIGSSYLFRVDED------TI 1292
Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
+DA N A I H C PNC AKV +VDG +I IY R I GEE+T+DY E
Sbjct: 1293 IDATKTGNLARFINHCCEPNCNAKVISVDGTKRIVIYANRDIKEGEELTYDYKFPIEE-- 1350
Query: 2023 EYEASVCLCGSQVCRGS 2039
+ CLCG+ CRG+
Sbjct: 1351 --DKIPCLCGAVNCRGT 1365
>gi|19115892|ref|NP_594980.1| histone lysine methyltransferase Set2 [Schizosaccharomyces pombe
972h-]
gi|74626626|sp|O14026.1|SET2_SCHPO RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=Lysine N-methyltransferase 3;
AltName: Full=SET domain-containing protein 2
gi|2408044|emb|CAB16247.1| histone lysine methyltransferase Set2 [Schizosaccharomyces pombe]
Length = 798
Score = 73.2 bits (178), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 47/155 (30%), Positives = 75/155 (48%), Gaps = 20/155 (12%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
D ++ +KG G+ + +D FV E++GEV P K+ ++ +++ + + F
Sbjct: 183 DVFLTEKKGFGL--RADANLPKDTFVYEYIGEVIPEQKFRKR------MRQYDSEGIKHF 234
Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
Y + L+ KG+ +DA + + A HSCRPNC V ++GI+ R
Sbjct: 235 YFMMLQ--KGE-------YIDATKRGSLARFCNHSCRPNCYVDKWMVGDKLRMGIFCKRD 285
Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
I GEE+TFDYN + +A C CG C G
Sbjct: 286 IIRGEELTFDYNV---DRYGAQAQPCYCGEPCCVG 317
>gi|409047697|gb|EKM57176.1| hypothetical protein PHACADRAFT_142398 [Phanerochaete carnosa
HHB-10118-sp]
Length = 1389
Score = 73.2 bits (178), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 50/139 (35%), Positives = 67/139 (48%), Gaps = 26/139 (18%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVV 1963
+ V+E++GE+ IR+ + + A E I YL R D +VV
Sbjct: 1274 EMVIEYVGEI------------IRAQVADKREKAYERQGIGSSYLFRIDED------LVV 1315
Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
DA K N I HSC PNC AK+ ++ +I IY + I G EIT+DY+ E
Sbjct: 1316 DATKKGNLGRLINHSCDPNCTAKIITINSEKKIVIYAKQDIELGSEITYDYHFPIEQ--- 1372
Query: 2024 YEASVCLCGSQVCRGSYLN 2042
+ CLCGS CRG YLN
Sbjct: 1373 -DKIPCLCGSAKCRG-YLN 1389
>gi|302141761|emb|CBI18964.3| unnamed protein product [Vitis vinifera]
Length = 1958
Score = 73.2 bits (178), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 72/149 (48%), Gaps = 19/149 (12%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ ++ G+ F++E++GEV + + +Q S + FY + L
Sbjct: 1274 KKGYGLQLQQDISQGQ--FLIEYVGEVLDLQTYEARQKEYASRGHKH------FYFMTLN 1325
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+ V+DA K N I HSC PNC + V+G IG++ +R I GEE
Sbjct: 1326 GSE---------VIDACAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKKGEE 1376
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+TFDYN V A C+CGS CRG
Sbjct: 1377 VTFDYNYVRVFG--AAAKKCVCGSPQCRG 1403
>gi|402594990|gb|EJW88916.1| SET domain-containing protein [Wuchereria bancrofti]
Length = 1425
Score = 73.2 bits (178), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 69/133 (51%), Gaps = 21/133 (15%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
F++E++GEV + ++ IR ++ +DP + + YL K A V+DA
Sbjct: 630 FIIEYVGEV------IDAEEMIRRGRRYGKDP--KHVHHYLMALKNGA------VIDATA 675
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY--E 2025
K N + I HSC PNCE++ V+ ++G + ++ I GEEI FDY E Y +
Sbjct: 676 KGNVSRFINHSCDPNCESQKWTVNRQLRVGFFVIKPIALGEEIVFDYQL-----ERYGRK 730
Query: 2026 ASVCLCGSQVCRG 2038
A C CG+ CRG
Sbjct: 731 AQRCFCGAANCRG 743
>gi|380494835|emb|CCF32851.1| SET domain-containing protein [Colletotrichum higginsianum]
Length = 1257
Score = 73.2 bits (178), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 70/146 (47%), Gaps = 23/146 (15%)
Query: 1900 EGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY- 1958
E +DD ++E++GE + + I +++ YL+ G + +
Sbjct: 1132 EENINKDDMIIEYVGE--------QVRQSISEIREKR----------YLKSGMGSSYLFR 1173
Query: 1959 --DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNS 2016
D V+DA K A I HSC PNC AK+ VDG +I IY +R I EE+T+DY
Sbjct: 1174 IDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVDGSKRIVIYALRDIGQHEELTYDYKF 1233
Query: 2017 VTESKEEYEASVCLCGSQVCRGSYLN 2042
E + CLCG+ C+G +LN
Sbjct: 1234 EREIG-SLDRIPCLCGTAACKG-FLN 1257
>gi|392560212|gb|EIW53395.1| hypothetical protein TRAVEDRAFT_154887 [Trametes versicolor FP-101664
SS1]
Length = 1014
Score = 73.2 bits (178), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 49/136 (36%), Positives = 65/136 (47%), Gaps = 25/136 (18%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVV 1963
+ V+E++GEV IR+ + + A E I YL R D +VV
Sbjct: 899 EMVIEYVGEV------------IRAQVADKREKAYERQGIGSSYLFRIDED------LVV 940
Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
DA K N I HSC PNC AK+ + G +I IY + I G EIT+DY+ E
Sbjct: 941 DATKKGNLGRLINHSCDPNCTAKIITISGEKKIVIYAKQDIELGSEITYDYHFPIEQ--- 997
Query: 2024 YEASVCLCGSQVCRGS 2039
+ CLCGS CRG+
Sbjct: 998 -DKIPCLCGSAKCRGT 1012
>gi|30704948|gb|AAH52194.1| Ash1l protein, partial [Mus musculus]
Length = 963
Score = 73.2 bits (178), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 49/160 (30%), Positives = 80/160 (50%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV EF
Sbjct: 143 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVVS---------------------EQEF 179
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 180 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 239
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 240 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 277
>gi|224063022|ref|XP_002300966.1| SET domain protein [Populus trichocarpa]
gi|222842692|gb|EEE80239.1| SET domain protein [Populus trichocarpa]
Length = 605
Score = 73.2 bits (178), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 51/149 (34%), Positives = 73/149 (48%), Gaps = 19/149 (12%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ ++ G+ F++E++GEV V + +Q S + FY + L
Sbjct: 136 KKGFGLRLEEDITRGQ--FLIEYVGEVLDVHAYEARQKEYASKGHKH------FYFMTL- 186
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
DG + V+DA K N I HSC PNC + V+G IG++ +R I GEE
Sbjct: 187 ------DGSE--VIDACVKGNLGRFINHSCDPNCRTEKWVVNGEICIGLFALRDIKKGEE 238
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+TFDYN V A C CGS C+G
Sbjct: 239 VTFDYNYVRVVGA--AAKRCYCGSPQCQG 265
>gi|242019388|ref|XP_002430143.1| histone-lysine N-methyltransferase SUVR5, putative [Pediculus humanus
corporis]
gi|212515234|gb|EEB17405.1| histone-lysine N-methyltransferase SUVR5, putative [Pediculus humanus
corporis]
Length = 1448
Score = 72.8 bits (177), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 76/149 (51%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ E + F++E++GEV +K+ G R ++ ++ FY + L
Sbjct: 571 KKGFGL--RAEEDLSGNTFIMEYVGEVVN-----QKEFG-RRVKMYAKENNKHFYFMAL- 621
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
KGDA V+DA +K N + I HSC PN E + ++G ++G +T R + GEE
Sbjct: 622 --KGDA------VIDATNKGNISRFINHSCDPNAETQKWTINGELRVGFFTRRFVAAGEE 673
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
ITFDY K +A C C + CRG
Sbjct: 674 ITFDYQFQRYGK---QAQKCYCEASNCRG 699
>gi|384484496|gb|EIE76676.1| hypothetical protein RO3G_01380 [Rhizopus delemar RA 99-880]
Length = 565
Score = 72.8 bits (177), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 48/133 (36%), Positives = 64/133 (48%), Gaps = 19/133 (14%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
D V+E++GEV +Q +K+ E + YL R D +V+DA
Sbjct: 450 DIVIEYIGEVI-------RQQVAEIREKHYERIG--IGSSYLFRVDDD------MVIDAT 494
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K A I H C PNC AK+ VD ++ IY R I GEEIT+DY E+ E
Sbjct: 495 KKGGMARFINHCCTPNCSAKIITVDKQKKVVIYANRDIEPGEEITYDYKFPIEA----EK 550
Query: 2027 SVCLCGSQVCRGS 2039
C CGS+ C+GS
Sbjct: 551 IPCFCGSKFCKGS 563
>gi|345480373|ref|XP_001606723.2| PREDICTED: hypothetical protein LOC100123115 [Nasonia vitripennis]
Length = 1746
Score = 72.8 bits (177), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 72/147 (48%), Gaps = 22/147 (14%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
GL N E G DF++E++GEV + +D + ++ ++D +Y + L
Sbjct: 858 GLRATTNLEAG----DFIMEYVGEV------LDPKDFRKRAKEYSKDKNRHYYFMAL--- 904
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
K D ++DA K N + I HSC PN E + V+G +IG + + + GEEIT
Sbjct: 905 KSDQ------IIDATMKGNISRFINHSCDPNAETQKWTVNGELRIGFFNKKFVAAGEEIT 958
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRG 2038
FDY+ K EA C C + CRG
Sbjct: 959 FDYHFQRYGK---EAQKCFCEATNCRG 982
>gi|448106516|ref|XP_004200765.1| Piso0_003363 [Millerozyma farinosa CBS 7064]
gi|448109616|ref|XP_004201396.1| Piso0_003363 [Millerozyma farinosa CBS 7064]
gi|359382187|emb|CCE81024.1| Piso0_003363 [Millerozyma farinosa CBS 7064]
gi|359382952|emb|CCE80259.1| Piso0_003363 [Millerozyma farinosa CBS 7064]
Length = 1062
Score = 72.8 bits (177), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 48/82 (58%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C P+C AK+ VDG +I IY +R I EE+T+DY E+
Sbjct: 983 TVIDATKKGGIARFINHCCSPSCTAKIIKVDGKKRIVIYALRDIDKNEELTYDYKFERET 1042
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+E E CLCG+ C+G YLN
Sbjct: 1043 NDE-ERIRCLCGAPGCKG-YLN 1062
>gi|378725927|gb|EHY52386.1| histone-lysine N-methyltransferase SETD1 [Exophiala dermatitidis
NIH/UT8656]
Length = 1277
Score = 72.8 bits (177), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/82 (46%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
VVDA K A I HSC PNC AK+ V G +I IY +R I EE+T+DY E
Sbjct: 1198 TVVDATKKGGIARFINHSCSPNCTAKIIRVGGTKRIVIYALRDIEKDEELTYDYKFEREI 1257
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS VC+G +LN
Sbjct: 1258 DSD-DRIPCLCGSAVCKG-FLN 1277
>gi|330792328|ref|XP_003284241.1| hypothetical protein DICPUDRAFT_27300 [Dictyostelium purpureum]
gi|325085814|gb|EGC39214.1| hypothetical protein DICPUDRAFT_27300 [Dictyostelium purpureum]
Length = 151
Score = 72.8 bits (177), Expect = 2e-09, Method: Composition-based stats.
Identities = 51/150 (34%), Positives = 74/150 (49%), Gaps = 26/150 (17%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G++ + + DFV+E+ GEV K + +Q+N + FY + L
Sbjct: 1 KGWGLISCEN--INKGDFVMEYCGEV------ISKTTCLNRMQENENEKF--FYFLTLNS 50
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ +DA + N A I HSC PNCE + V G +IGI++++ I G E+
Sbjct: 51 KE---------CLDASRRGNLARFINHSCDPNCETQKWIVGGEVKIGIFSIKPIEKGTEL 101
Query: 2011 TFDYN--SVTESKEEYEASVCLCGSQVCRG 2038
TFDYN SK+E C CGS+ CRG
Sbjct: 102 TFDYNYERFGASKQE-----CYCGSKNCRG 126
>gi|115489550|ref|NP_001067262.1| Os12g0613200 [Oryza sativa Japonica Group]
gi|108862955|gb|ABA99391.2| SET domain containing protein, expressed [Oryza sativa Japonica
Group]
gi|113649769|dbj|BAF30281.1| Os12g0613200 [Oryza sativa Japonica Group]
Length = 1212
Score = 72.8 bits (177), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 57/170 (33%), Positives = 80/170 (47%), Gaps = 25/170 (14%)
Query: 1874 ILKAMDSRPDDKYVAYRKG----LGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGI 1929
+LK M S+ K + +++ G+V + +DFV+E++GE+ + I
Sbjct: 1062 LLKIMQSKSRKKRLRFQRSKIHEWGLVALE--SIDAEDFVIEYVGELI-----RRQVSDI 1114
Query: 1930 RSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTA 1989
R Q + YL R D VVDA + A I HSC PNC KV
Sbjct: 1115 REDQYEKSG----IGSSYLFRLDDD------YVVDATKRGGLARFINHSCDPNCYTKVIT 1164
Query: 1990 VDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGS 2039
V+G +I IY R I+ GEE+T++Y E K+ C CGSQ CRGS
Sbjct: 1165 VEGQKKIVIYAKRRIYAGEELTYNYKFPLEEKK----IPCHCGSQRCRGS 1210
>gi|330797279|ref|XP_003286689.1| hypothetical protein DICPUDRAFT_150684 [Dictyostelium purpureum]
gi|325083363|gb|EGC36818.1| hypothetical protein DICPUDRAFT_150684 [Dictyostelium purpureum]
Length = 1340
Score = 72.8 bits (177), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/136 (35%), Positives = 65/136 (47%), Gaps = 25/136 (18%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---DLVVV 1963
D V+E++GEV IR + A E Y+++ G + + D ++
Sbjct: 1225 DMVIEYIGEV------------IR------QKVADEREKRYIKKGIGSSYLFRVDDDTII 1266
Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
DA K N A I H C PNC AKV ++ +I IY R I+ GEEIT+DY E
Sbjct: 1267 DATLKGNLARFINHCCDPNCIAKVLTINNQKKIIIYAKRDINIGEEITYDYKFPIED--- 1323
Query: 2024 YEASVCLCGSQVCRGS 2039
E CLC S CRG+
Sbjct: 1324 -EKIPCLCKSPKCRGT 1338
>gi|310792530|gb|EFQ28057.1| SET domain-containing protein [Glomerella graminicola M1.001]
Length = 1262
Score = 72.8 bits (177), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 70/146 (47%), Gaps = 23/146 (15%)
Query: 1900 EGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY- 1958
E +DD ++E++GE + + I +++ YL+ G + +
Sbjct: 1137 EENINKDDMIIEYVGE--------QVRQSISEIREKR----------YLKSGMGSSYLFR 1178
Query: 1959 --DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNS 2016
D V+DA K A I HSC PNC AK+ VDG +I IY +R I EE+T+DY
Sbjct: 1179 IDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVDGSKRIVIYALRDIGQHEELTYDYKF 1238
Query: 2017 VTESKEEYEASVCLCGSQVCRGSYLN 2042
E + CLCG+ C+G +LN
Sbjct: 1239 EREIG-SLDRIPCLCGTAACKG-FLN 1262
>gi|147837037|emb|CAN63644.1| hypothetical protein VITISV_006299 [Vitis vinifera]
Length = 258
Score = 72.8 bits (177), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 36/67 (53%), Positives = 51/67 (76%), Gaps = 5/67 (7%)
Query: 2016 SVTESKEEYEAS----VCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACE 2071
++T+S+ +++ VC+ + R SYLNLTGEG+F+KVLKE HG+LDR+QLM EACE
Sbjct: 148 TITQSRRVRKSTKFLPVCVVVKFIER-SYLNLTGEGSFQKVLKECHGILDRYQLMFEACE 206
Query: 2072 LNSVSEE 2078
LN +SE+
Sbjct: 207 LNMLSEK 213
>gi|342887802|gb|EGU87231.1| hypothetical protein FOXB_02213 [Fusarium oxysporum Fo5176]
Length = 1258
Score = 72.8 bits (177), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 45/143 (31%), Positives = 70/143 (48%), Gaps = 23/143 (16%)
Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
+DD ++E++GE + + I +++N YL+ G + + D
Sbjct: 1136 IAKDDMIIEYVGE--------QVRQQIAEIRENR----------YLKSGIGSSYLFRIDD 1177
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
V+DA K A I HSC PNC AK+ V+G +I IY ++ I EE+T+DY E
Sbjct: 1178 NTVIDATKKGGIARFINHSCEPNCTAKIIKVEGSKRIVIYALQDIAMSEELTYDYKFERE 1237
Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
+ CLCG+ C+G +LN
Sbjct: 1238 IG-SLDRIPCLCGTAACKG-FLN 1258
>gi|348520760|ref|XP_003447895.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Oreochromis niloticus]
Length = 1167
Score = 72.8 bits (177), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 81/148 (54%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G++C ++ GE FV E++GE+ ++++ ++ +E+ +FY + +++
Sbjct: 865 KGWGLICLRDIKKGE--FVNEYIGEL------IDEEECRARIKYAHENNITDFYMLTIDK 916
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE + V+G ++G++ V I G E+
Sbjct: 917 DR---------IIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPAGTEL 967
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 968 TFNYNLDCLGNEK---TVCRCGAPNCSG 992
>gi|301754075|ref|XP_002912890.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Ailuropoda
melanoleuca]
Length = 2549
Score = 72.8 bits (177), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 57/199 (28%), Positives = 87/199 (43%), Gaps = 43/199 (21%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1544 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1592
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1593 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1646
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNL----------------------TGEG 2047
+TFDY K EA C CGS CRG YL T +G
Sbjct: 1647 LTFDYQFQRYGK---EAQKCFCGSANCRG-YLGGENRVSIRAAGGKMKKERSRKKDTVDG 1702
Query: 2048 AFEKVLKELHGLLDRHQLM 2066
E +++ GL D++Q++
Sbjct: 1703 ELEALMENGEGLSDKNQVL 1721
>gi|123703948|ref|NP_001038599.2| histone-lysine N-methyltransferase SETD1B-A [Danio rerio]
gi|166977691|sp|Q1LY77.2|SE1BA_DANRE RecName: Full=Histone-lysine N-methyltransferase SETD1B-A; AltName:
Full=SET domain-containing protein 1B-A
gi|123293815|emb|CAK10781.2| novel protein [Danio rerio]
Length = 1844
Score = 72.4 bits (176), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1768 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSRQPINVNEEITYDYKFPIED 1827
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
E CLCG++ CRG+
Sbjct: 1828 ----EKIPCLCGAENCRGT 1842
>gi|281343603|gb|EFB19187.1| hypothetical protein PANDA_000629 [Ailuropoda melanoleuca]
Length = 2535
Score = 72.4 bits (176), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 57/199 (28%), Positives = 87/199 (43%), Gaps = 43/199 (21%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1530 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1578
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1579 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1632
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNL----------------------TGEG 2047
+TFDY K EA C CGS CRG YL T +G
Sbjct: 1633 LTFDYQFQRYGK---EAQKCFCGSANCRG-YLGGENRVSIRAAGGKMKKERSRKKDTVDG 1688
Query: 2048 AFEKVLKELHGLLDRHQLM 2066
E +++ GL D++Q++
Sbjct: 1689 ELEALMENGEGLSDKNQVL 1707
>gi|402593200|gb|EJW87127.1| SET domain-containing protein, partial [Wuchereria bancrofti]
Length = 602
Score = 72.4 bits (176), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 37/79 (46%), Positives = 48/79 (60%), Gaps = 6/79 (7%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYN-SVTES 2020
V+DA N A I HSC+PNC AK+ VDG +I IY+ I+ G+EIT+DY + E
Sbjct: 527 VIDATQMGNLARFINHSCQPNCYAKIVVVDGEKRIVIYSKLAINKGDEITYDYKFPIEED 586
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
K + CLCG+ CRGS
Sbjct: 587 KID-----CLCGAPGCRGS 600
>gi|68473736|ref|XP_718971.1| potential COMPASS histone methyltransferase subunit Set1p [Candida
albicans SC5314]
gi|68473945|ref|XP_718869.1| potential COMPASS histone methyltransferase subunit Set1p [Candida
albicans SC5314]
gi|74586641|sp|Q5ABG1.1|SET1_CANAL RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|46440662|gb|EAK99965.1| potential COMPASS histone methyltransferase subunit Set1p [Candida
albicans SC5314]
gi|46440768|gb|EAL00070.1| potential COMPASS histone methyltransferase subunit Set1p [Candida
albicans SC5314]
Length = 1040
Score = 72.4 bits (176), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 49/84 (58%), Gaps = 2/84 (2%)
Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
D V+DA K A I H C P+C AK+ V+G +I IY +R I EE+T+DY
Sbjct: 959 DNTVIDATKKGGIARFINHCCSPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFER 1018
Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
E+ +E E CLCG+ C+G YLN
Sbjct: 1019 ETNDE-ERIRCLCGAPGCKG-YLN 1040
>gi|432887915|ref|XP_004074975.1| PREDICTED: uncharacterized protein LOC101162384 [Oryzias latipes]
Length = 1787
Score = 72.4 bits (176), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1711 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSRQPINVNEEITYDYKFPIED 1770
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
E CLCG++ CRG+
Sbjct: 1771 ----EKIPCLCGAENCRGT 1785
>gi|238879404|gb|EEQ43042.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 1040
Score = 72.4 bits (176), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 49/84 (58%), Gaps = 2/84 (2%)
Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
D V+DA K A I H C P+C AK+ V+G +I IY +R I EE+T+DY
Sbjct: 959 DNTVIDATKKGGIARFINHCCSPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFER 1018
Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
E+ +E E CLCG+ C+G YLN
Sbjct: 1019 ETNDE-ERIRCLCGAPGCKG-YLN 1040
>gi|334333796|ref|XP_001375978.2| PREDICTED: histone-lysine N-methyltransferase SETD2 [Monodelphis
domestica]
Length = 2592
Score = 72.4 bits (176), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 87/201 (43%), Gaps = 47/201 (23%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1582 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1630
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1631 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1684
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGE----------------------- 2046
+TFDY K EA C CGS CRG YL GE
Sbjct: 1685 LTFDYQFQRYGK---EAQKCFCGSANCRG-YLG--GENRVSIRAAGGKMKKERSRKKDSV 1738
Query: 2047 -GAFEKVLKELHGLLDRHQLM 2066
G E +L+ GL D++Q++
Sbjct: 1739 DGELEALLENGEGLSDKNQVL 1759
>gi|449492020|ref|XP_004174653.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD2 [Taeniopygia guttata]
Length = 2489
Score = 72.4 bits (176), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 60/201 (29%), Positives = 89/201 (44%), Gaps = 47/201 (23%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN +Y + L
Sbjct: 1543 KKGWGLRAAKD--LPSNTFVLEYCGEVL-DHKEFKARVKEYARNKNIH-----YYFMAL- 1593
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1594 --KNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1645
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGE----------------------- 2046
+TFDY K EA C CGS CRG YL GE
Sbjct: 1646 LTFDYQFQRYGK---EAQKCFCGSSNCRG-YLG--GENRVSIRAAGGKMKKERSRKKDSV 1699
Query: 2047 -GAFEKVLKELHGLLDRHQLM 2066
G E +L+ GL D++Q++
Sbjct: 1700 DGELEALLENGEGLSDKNQVL 1720
>gi|356558250|ref|XP_003547420.1| PREDICTED: uncharacterized protein LOC100806034 [Glycine max]
Length = 1300
Score = 72.4 bits (176), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 53/137 (38%), Positives = 68/137 (49%), Gaps = 25/137 (18%)
Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVV 1962
+DFV+E++GE+ IR + + E I YL R DGY V
Sbjct: 1184 EDFVIEYIGEL------------IRPRISDIRERQYEKMGIGSSYLFRLD---DGY---V 1225
Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
VDA + A I HSC PNC KV +V+G +I IY R I GEEIT++Y E K+
Sbjct: 1226 VDATKRGGIARFINHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKK 1285
Query: 2023 EYEASVCLCGSQVCRGS 2039
C CGS+ CRGS
Sbjct: 1286 ----IPCNCGSRKCRGS 1298
>gi|340373417|ref|XP_003385238.1| PREDICTED: hypothetical protein LOC100636150 [Amphimedon
queenslandica]
Length = 1053
Score = 72.4 bits (176), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 35/78 (44%), Positives = 43/78 (55%), Gaps = 4/78 (5%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
V+DA N+A I H C PNC AK+ V +I IY+ R I GEEIT+DY E
Sbjct: 978 VIDATKSGNFARFINHCCDPNCYAKIITVGNQKKIVIYSKRDIRAGEEITYDYKFPIED- 1036
Query: 2022 EEYEASVCLCGSQVCRGS 2039
E CLCG+ CRG+
Sbjct: 1037 ---EKIPCLCGAPQCRGT 1051
>gi|405951732|gb|EKC19620.1| Histone-lysine N-methyltransferase MLL4 [Crassostrea gigas]
Length = 4493
Score = 72.4 bits (176), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 55/160 (34%), Positives = 75/160 (46%), Gaps = 29/160 (18%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
Y ++ G G+ C + E + V+E+ GEV IR + + E
Sbjct: 4360 YRSHIHGRGLYCKR--NIDEGEMVIEYSGEV------------IRGSLTDKREKYYEGKG 4405
Query: 1946 I--YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
I Y+ R D YD V+DA N A I HSC PNC +KV VDG I I+ ++
Sbjct: 4406 IGCYMFR----IDDYD--VIDATLHGNAARFINHSCEPNCYSKVINVDGKKHIVIFAMKS 4459
Query: 2004 IHYGEEITFDYNSVTESKEEYEASV-CLCGSQVCRGSYLN 2042
I GEE+T+DY E E + C CG++ CR YLN
Sbjct: 4460 IKRGEELTYDYKFPIE-----EVKIPCTCGAKKCR-RYLN 4493
>gi|432880997|ref|XP_004073754.1| PREDICTED: uncharacterized protein LOC101157226 [Oryzias latipes]
Length = 2812
Score = 72.4 bits (176), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 79/162 (48%), Gaps = 31/162 (19%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
Y + G G+ C + GE V+E+ G V IRS+ + + +FY+
Sbjct: 2677 YRSLIHGRGLFCKRNIEAGE--MVIEYAGTV------------IRSVLTDKRE---KFYD 2719
Query: 1946 -----IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
Y+ R D +D VVDA + N A I HSC PNC ++V VDG I I+
Sbjct: 2720 GKGIGCYMFR----IDDFD--VVDATMQGNAARFINHSCEPNCYSRVINVDGRKHIVIFA 2773
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+R I+ GEE+T+DY E +E C CG++ CR +LN
Sbjct: 2774 LRKIYRGEELTYDYKFPIE--DEDNKLHCNCGTRRCR-RFLN 2812
>gi|426370676|ref|XP_004052287.1| PREDICTED: histone-lysine N-methyltransferase MLL [Gorilla gorilla
gorilla]
Length = 3837
Score = 72.4 bits (176), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 52/156 (33%), Positives = 75/156 (48%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3708 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3750
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3751 YMFR----ID--DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3804
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3805 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3837
>gi|443722431|gb|ELU11300.1| hypothetical protein CAPTEDRAFT_160470, partial [Capitella teleta]
Length = 282
Score = 72.4 bits (176), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 68/135 (50%), Gaps = 26/135 (19%)
Query: 1908 FVVEFLGEV--YPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDA 1965
FV+E++GEV YP ++ KQ ED Y + L GD ++DA
Sbjct: 79 FVMEYVGEVLDYPNFRLRCKQYA--------EDNHTHHYFMAL---NGDE------IIDA 121
Query: 1966 MHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE 2025
K N + I HSC PNCE + V+G ++G +T+R I G E+TFDY E+Y
Sbjct: 122 TQKGNTSRFINHSCDPNCETQKWTVNGQLRVGFFTLRSIPAGTELTFDYQF-----EQYG 176
Query: 2026 ASV--CLCGSQVCRG 2038
+ + C CG+ CRG
Sbjct: 177 SEIQRCFCGADSCRG 191
>gi|327289513|ref|XP_003229469.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Anolis
carolinensis]
Length = 2579
Score = 72.4 bits (176), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 87/201 (43%), Gaps = 47/201 (23%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1577 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKTRVKEYARSKN--------IHYYFM 1625
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1626 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKMVPSGSE 1679
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGE----------------------- 2046
+TFDY K EA C CGS CRG YL GE
Sbjct: 1680 LTFDYQFQRYGK---EAQKCFCGSTNCRG-YLG--GENRVSIRAAGGKMKKERSRKKDSV 1733
Query: 2047 -GAFEKVLKELHGLLDRHQLM 2066
G E +L+ GL D++Q++
Sbjct: 1734 DGELEALLENGEGLSDKNQVL 1754
>gi|198467361|ref|XP_001354372.2| GA14357 [Drosophila pseudoobscura pseudoobscura]
gi|198149208|gb|EAL31425.2| GA14357 [Drosophila pseudoobscura pseudoobscura]
Length = 2918
Score = 72.4 bits (176), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 47/153 (30%), Positives = 77/153 (50%), Gaps = 20/153 (13%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
+ +KG G+ + GE F++E++GEV + FE++ S +N +Y
Sbjct: 1916 FRTEKKGCGITAELQIPAGE--FIMEYVGEVI-DSEEFERRQHRYSKDRNRH-----YYF 1967
Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
+ L +G+A ++DA + N + I HSC PN E + V+G +IG ++++ I
Sbjct: 1968 MAL---RGEA------IIDATMRGNISRYINHSCDPNAETQKWTVNGELRIGFFSLKNIL 2018
Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
GEEITFDY + +A C C + CRG
Sbjct: 2019 PGEEITFDYQYQRYGR---DAQRCYCEAANCRG 2048
>gi|312091131|ref|XP_003146871.1| histone methyltransferase [Loa loa]
gi|307757965|gb|EFO17199.1| histone methyltransferase [Loa loa]
Length = 278
Score = 72.0 bits (175), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 37/79 (46%), Positives = 48/79 (60%), Gaps = 6/79 (7%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYN-SVTES 2020
V+DA N A I HSC+PNC AK+ VDG +I IY+ I+ G+EIT+DY + E
Sbjct: 203 VIDATQMGNLARFINHSCQPNCYAKIVVVDGEKRIVIYSKLAINKGDEITYDYKFPIEED 262
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
K + CLCG+ CRGS
Sbjct: 263 KID-----CLCGAPGCRGS 276
>gi|384499027|gb|EIE89518.1| hypothetical protein RO3G_14229 [Rhizopus delemar RA 99-880]
Length = 1674
Score = 72.0 bits (175), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 36/81 (44%), Positives = 46/81 (56%), Gaps = 4/81 (4%)
Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
D V+DA + + A I H C PNC AK+ VD +I IY R I GEEIT+DY
Sbjct: 1596 DDTVIDATKRGSIARFINHCCSPNCSAKIITVDKQKKIVIYANRDIEPGEEITYDYKFPI 1655
Query: 2019 ESKEEYEASVCLCGSQVCRGS 2039
E+ E CLCGS+ C+G+
Sbjct: 1656 EA----EKIPCLCGSKFCKGT 1672
>gi|444725290|gb|ELW65863.1| Histone-lysine N-methyltransferase MLL [Tupaia chinensis]
Length = 3806
Score = 72.0 bits (175), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3677 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3719
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3720 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3773
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3774 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3806
>gi|351705860|gb|EHB08779.1| Histone-lysine N-methyltransferase HRX [Heterocephalus glaber]
Length = 3899
Score = 72.0 bits (175), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3770 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3812
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3813 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3866
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3867 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3899
>gi|348534024|ref|XP_003454503.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Oreochromis
niloticus]
Length = 2253
Score = 72.0 bits (175), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 71/148 (47%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+ K+ + FV+E+ GEV K F+ + + KN +Y + L+
Sbjct: 1065 KGWGLRAAKD--LAPNTFVLEYCGEVL-DHKEFKTRVKEYARNKNIH-----YYFMSLKN 1116
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K N + + HSC PNCE + V+G ++G +T + + G E+
Sbjct: 1117 NE---------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKAVTAGTEL 1167
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TFDY K EA C CG+ CRG
Sbjct: 1168 TFDYQFQRYGK---EAQKCFCGAPSCRG 1192
>gi|348532887|ref|XP_003453937.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like
[Oreochromis niloticus]
Length = 1846
Score = 72.0 bits (175), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1770 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSRQPINVNEEITYDYKFPIED 1829
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
E CLCG++ CRG+
Sbjct: 1830 ----EKIPCLCGAENCRGT 1844
>gi|367024877|ref|XP_003661723.1| SET1-like protein [Myceliophthora thermophila ATCC 42464]
gi|347008991|gb|AEO56478.1| SET1-like protein [Myceliophthora thermophila ATCC 42464]
Length = 1260
Score = 72.0 bits (175), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 47/141 (33%), Positives = 70/141 (49%), Gaps = 23/141 (16%)
Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---DLV 1961
+DD ++E++GE E + I L+++ YL+ G + + D
Sbjct: 1140 KDDMIIEYVGE--------EVRQQIAELREHR----------YLKSGIGSSYLFRIDDNT 1181
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E
Sbjct: 1182 VIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERELG 1241
Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
+ CLCG+ C+G +LN
Sbjct: 1242 -STDRIPCLCGTAACKG-FLN 1260
>gi|380812066|gb|AFE77908.1| histone-lysine N-methyltransferase SETD2 [Macaca mulatta]
Length = 2565
Score = 72.0 bits (175), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 61/207 (29%), Positives = 89/207 (42%), Gaps = 48/207 (23%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1560 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1608
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1609 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1662
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGE----------------------- 2046
+TFDY K EA C CGS CRG YL GE
Sbjct: 1663 LTFDYQFQRYGK---EAQKCFCGSANCRG-YLG--GENRVSIRAAGGKMKKERSRKKDSV 1716
Query: 2047 -GAFEKVLKELHGLLDRHQLMLEACEL 2072
G E +++ GL D++Q+ L C L
Sbjct: 1717 DGELEALMENGEGLSDKNQV-LSLCRL 1742
>gi|432909264|ref|XP_004078147.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Oryzias
latipes]
Length = 1665
Score = 72.0 bits (175), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 71/148 (47%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+ KE + FV+E+ GEV K F+ + + KN +Y + L+
Sbjct: 641 KGWGLRAAKE--MAPNTFVLEYCGEVLD-HKEFKTRVKEYARNKNIH-----YYFMSLKN 692
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K N + + HSC PNCE + V+G ++G +T + + G E+
Sbjct: 693 NE---------IIDATLKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKAVAAGTEL 743
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TFDY K EA C CG+ CRG
Sbjct: 744 TFDYQFQRYGK---EAQKCFCGAPSCRG 768
>gi|109040979|ref|XP_001113652.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like isoform 2
[Macaca mulatta]
Length = 2550
Score = 72.0 bits (175), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 61/207 (29%), Positives = 89/207 (42%), Gaps = 48/207 (23%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1545 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1593
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1594 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1647
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGE----------------------- 2046
+TFDY K EA C CGS CRG YL GE
Sbjct: 1648 LTFDYQFQRYGK---EAQKCFCGSANCRG-YLG--GENRVSIRAAGGKMKKERSRKKDSV 1701
Query: 2047 -GAFEKVLKELHGLLDRHQLMLEACEL 2072
G E +++ GL D++Q+ L C L
Sbjct: 1702 DGELEALMENGEGLSDKNQV-LSLCRL 1727
>gi|395520196|ref|XP_003764223.1| PREDICTED: histone-lysine N-methyltransferase MLL [Sarcophilus
harrisii]
Length = 3995
Score = 72.0 bits (175), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 52/153 (33%), Positives = 72/153 (47%), Gaps = 25/153 (16%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI--YLE 1949
G G+ C + GE V+E+ G V IRS+Q + + E I Y+
Sbjct: 3866 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKREKYYESKGIGCYMF 3911
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
R D VVDA N A I HSC PNC ++V +DG I I+ +R I+ GEE
Sbjct: 3912 RID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEE 3965
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+T+DY E + C CG++ CR +LN
Sbjct: 3966 LTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3995
>gi|388581385|gb|EIM21694.1| SET domain-containing protein [Wallemia sebi CBS 633.66]
Length = 681
Score = 72.0 bits (175), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 61/243 (25%), Positives = 106/243 (43%), Gaps = 34/243 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ N + D F++E++GEV ++ +R + +++ FY + L+
Sbjct: 95 KKGYGLRANVD--LDRDTFLIEYIGEVVTQTQF------LRRMNTYSKEGIKHFYFMMLQ 146
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+ +DA + N HSC PNC V + ++GI+T R I GEE
Sbjct: 147 NEE---------FIDATRRGNIGRFANHSCAPNCFVSKWVVGKYVKMGIFTKRKIEKGEE 197
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSY--LNLTGEGAFEKVLKELHGL----LDRH 2063
+TF+YN + ++A C CG C G T G + + + G+ + +H
Sbjct: 198 LTFNYNV---DRYGHDAQPCYCGEPNCVGFIGGKTQTDIGGMDDQILDALGITPEEIFQH 254
Query: 2064 QLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAY----SARLVRFINLERTKLP 2119
QL + + +EDY L +L +P + A + R + L+R ++
Sbjct: 255 QLKGSRKKKSKKLDEDY----ELTLKPMVLTDVPKVITAVRQSSTNRKILIKLLQRMRMT 310
Query: 2120 EEI 2122
EEI
Sbjct: 311 EEI 313
>gi|402860278|ref|XP_003894560.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Papio anubis]
Length = 2521
Score = 72.0 bits (175), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 61/207 (29%), Positives = 89/207 (42%), Gaps = 48/207 (23%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1516 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1564
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1565 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1618
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGE----------------------- 2046
+TFDY K EA C CGS CRG YL GE
Sbjct: 1619 LTFDYQFQRYGK---EAQKCFCGSANCRG-YLG--GENRVSIRAAGGKMKKERSRKKDSV 1672
Query: 2047 -GAFEKVLKELHGLLDRHQLMLEACEL 2072
G E +++ GL D++Q+ L C L
Sbjct: 1673 DGELEALMENGEGLSDKNQV-LSLCRL 1698
>gi|354496911|ref|XP_003510567.1| PREDICTED: histone-lysine N-methyltransferase MLL [Cricetulus
griseus]
Length = 3907
Score = 72.0 bits (175), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 52/156 (33%), Positives = 75/156 (48%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3778 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3820
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3821 YMFR----ID--DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3874
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3875 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3907
>gi|344304500|gb|EGW34732.1| hypothetical protein SPAPADRAFT_133304 [Spathaspora passalidarum NRRL
Y-27907]
Length = 1060
Score = 72.0 bits (175), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 48/82 (58%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C P+C AK+ V+G +I IY +R I EE+T+DY E+
Sbjct: 981 TVIDATKKGGIARFINHCCSPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERET 1040
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+E E CLCG+ C+G YLN
Sbjct: 1041 NDE-ERIRCLCGAPGCKG-YLN 1060
>gi|334330381|ref|XP_001380704.2| PREDICTED: histone-lysine N-methyltransferase MLL [Monodelphis
domestica]
Length = 3960
Score = 72.0 bits (175), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 52/153 (33%), Positives = 72/153 (47%), Gaps = 25/153 (16%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI--YLE 1949
G G+ C + GE V+E+ G V IRS+Q + + E I Y+
Sbjct: 3831 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKREKYYESKGIGCYMF 3876
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
R D VVDA N A I HSC PNC ++V +DG I I+ +R I+ GEE
Sbjct: 3877 RID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEE 3930
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+T+DY E + C CG++ CR +LN
Sbjct: 3931 LTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3960
>gi|355752689|gb|EHH56809.1| hypothetical protein EGM_06289 [Macaca fascicularis]
Length = 3844
Score = 72.0 bits (175), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3715 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3757
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3758 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3811
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3812 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3844
>gi|197927225|ref|NP_001074809.2| SET domain containing 2 [Mus musculus]
Length = 2537
Score = 72.0 bits (175), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1533 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1581
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1582 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1635
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1636 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1663
>gi|395516140|ref|XP_003762252.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Sarcophilus
harrisii]
Length = 2570
Score = 72.0 bits (175), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 87/201 (43%), Gaps = 47/201 (23%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1561 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1609
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1610 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1663
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGE----------------------- 2046
+TFDY K EA C CGS CRG YL GE
Sbjct: 1664 LTFDYQFQRYGK---EAQKCFCGSANCRG-YLG--GENRVSIRAAGGKMKKERSRKKDSV 1717
Query: 2047 -GAFEKVLKELHGLLDRHQLM 2066
G E +L+ GL D++Q++
Sbjct: 1718 DGELEALLENGEGLSDKNQVL 1738
>gi|392350034|ref|XP_003750554.1| PREDICTED: histone-lysine N-methyltransferase MLL, partial [Rattus
norvegicus]
Length = 3894
Score = 72.0 bits (175), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 52/156 (33%), Positives = 75/156 (48%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3765 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3807
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3808 YMFR----ID--DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3861
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3862 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3894
>gi|297482744|ref|XP_002693122.1| PREDICTED: histone-lysine N-methyltransferase MLL, partial [Bos
taurus]
gi|296480196|tpg|DAA22311.1| TPA: myeloid/lymphoid or mixed-lineage leukemia-like [Bos taurus]
Length = 3821
Score = 72.0 bits (175), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3692 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3734
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3735 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3788
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3789 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3821
>gi|390461098|ref|XP_003732596.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 2
[Callithrix jacchus]
Length = 1400
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GEV ++++ + ++ +E+ FY + +++
Sbjct: 1108 KGWGLVAKRDIRKGE--FVNEYVGEV------IDEEECMARIKHAHENDITHFYMLTIDK 1159
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1160 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1210
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1211 TFNYNLDCLGNEK---TVCRCGASNCSG 1235
>gi|114640631|ref|XP_508792.2| PREDICTED: histone-lysine N-methyltransferase MLL [Pan troglodytes]
Length = 3969
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3840 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3882
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3883 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3936
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3937 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3969
>gi|403287002|ref|XP_003934751.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Saimiri
boliviensis boliviensis]
Length = 1368
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GEV ++++ + ++ +E+ FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEV------IDEEECMARIKHAHENDITHFYMLTIDK 1124
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200
>gi|119587788|gb|EAW67384.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila), isoform CRA_e [Homo sapiens]
Length = 3972
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3843 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3885
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3886 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3939
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3940 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3972
>gi|119587784|gb|EAW67380.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila), isoform CRA_a [Homo sapiens]
Length = 3969
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3840 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3882
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3883 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3936
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3937 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3969
>gi|449666506|ref|XP_002161122.2| PREDICTED: uncharacterized protein LOC100198749 [Hydra
magnipapillata]
Length = 1403
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 44/78 (56%), Gaps = 4/78 (5%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
V+DA A I H C PNC AKV V+G +I IY+ R I GEEIT+DY E
Sbjct: 1328 VIDATKDGCNARFINHCCDPNCYAKVILVEGAKKIVIYSRRAIKLGEEITYDYKFPIED- 1386
Query: 2022 EEYEASVCLCGSQVCRGS 2039
E CLCG+ +CRG+
Sbjct: 1387 ---EKIPCLCGAALCRGT 1401
>gi|308199413|ref|NP_001184033.1| histone-lysine N-methyltransferase MLL isoform 1 precursor [Homo
sapiens]
Length = 3972
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3843 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3885
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3886 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3939
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3940 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3972
>gi|119587787|gb|EAW67383.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila), isoform CRA_d [Homo sapiens]
Length = 4002
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3873 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3915
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3916 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3969
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3970 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 4002
>gi|56550039|ref|NP_005924.2| histone-lysine N-methyltransferase MLL isoform 2 precursor [Homo
sapiens]
gi|146345435|sp|Q03164.5|MLL1_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL; AltName:
Full=ALL-1; AltName: Full=CXXC-type zinc finger protein
7; AltName: Full=Lysine N-methyltransferase 2A;
Short=KMT2A; AltName: Full=Trithorax-like protein;
AltName: Full=Zinc finger protein HRX; Contains: RecName:
Full=MLL cleavage product N320; AltName: Full=N-terminal
cleavage product of 320 kDa; Short=p320; Contains:
RecName: Full=MLL cleavage product C180; AltName:
Full=C-terminal cleavage product of 180 kDa; Short=p180
gi|34305635|gb|AAQ63624.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila) [Homo sapiens]
Length = 3969
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3840 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3882
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3883 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3936
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3937 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3969
>gi|392341954|ref|XP_003754471.1| PREDICTED: histone-lysine N-methyltransferase MLL [Rattus norvegicus]
Length = 3987
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 52/156 (33%), Positives = 75/156 (48%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3858 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3900
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3901 YMFR----ID--DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3954
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3955 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3987
>gi|395743560|ref|XP_002822597.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL [Pongo abelii]
Length = 4012
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3883 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3925
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3926 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3979
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3980 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 4012
>gi|355567103|gb|EHH23482.1| hypothetical protein EGK_06957, partial [Macaca mulatta]
Length = 3824
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3695 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3737
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3738 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3791
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3792 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3824
>gi|331214149|ref|XP_003319756.1| Setd1a protein [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
Length = 1014
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 35/81 (43%), Positives = 47/81 (58%), Gaps = 4/81 (4%)
Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
D +VVDA K N I H C PNC AK+ ++G +I IY I G+E+T+DY+
Sbjct: 936 DDLVVDATKKGNLGRLINHCCSPNCTAKIITINGEKKIVIYAKVTIELGDEVTYDYHF-- 993
Query: 2019 ESKEEYEASVCLCGSQVCRGS 2039
KEE + CLCGS C+G+
Sbjct: 994 -PKEEVKIP-CLCGSVKCKGT 1012
>gi|356532622|ref|XP_003534870.1| PREDICTED: uncharacterized protein LOC100805708 [Glycine max]
Length = 1213
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 52/137 (37%), Positives = 68/137 (49%), Gaps = 25/137 (18%)
Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVV 1962
+DFV+E++GE+ IR + + E I YL R DGY V
Sbjct: 1097 EDFVIEYIGEL------------IRPRISDIRERQYEKMGIGSSYLFRLD---DGY---V 1138
Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
VDA + A + HSC PNC KV +V+G +I IY R I GEEIT++Y E K+
Sbjct: 1139 VDATKRGGIARFVNHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKK 1198
Query: 2023 EYEASVCLCGSQVCRGS 2039
C CGS+ CRGS
Sbjct: 1199 ----IPCNCGSRKCRGS 1211
>gi|195171947|ref|XP_002026763.1| GL27000 [Drosophila persimilis]
gi|194111702|gb|EDW33745.1| GL27000 [Drosophila persimilis]
Length = 944
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 76/149 (51%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ + GE F++E++GEV + ++ R + ++D +Y + L
Sbjct: 62 KKGCGITAELQIPAGE--FIMEYVGEV------IDSEEFERRQHRYSKDRNRHYYFMAL- 112
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+G+A ++DA + N + I HSC PN E + V+G +IG ++++ I GEE
Sbjct: 113 --RGEA------IIDATMRGNISRYINHSCDPNAETQKWTVNGELRIGFFSLKNILPGEE 164
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
ITFDY + +A C C + CRG
Sbjct: 165 ITFDYQYQRYGR---DAQRCYCEAANCRG 190
>gi|332208875|ref|XP_003253537.1| PREDICTED: histone-lysine N-methyltransferase MLL [Nomascus
leucogenys]
Length = 3968
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3839 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3881
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3882 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3935
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3936 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3968
>gi|297458806|ref|XP_585092.4| PREDICTED: histone-lysine N-methyltransferase MLL [Bos taurus]
Length = 3826
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3697 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3739
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3740 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3793
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3794 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3826
>gi|440904942|gb|ELR55394.1| Histone-lysine N-methyltransferase MLL, partial [Bos grunniens mutus]
Length = 3846
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3717 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3759
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3760 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3813
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3814 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3846
>gi|1490271|emb|CAA93625.1| ALL-1 protein [Homo sapiens]
Length = 4005
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3876 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3918
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3919 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3972
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3973 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 4005
>gi|402895434|ref|XP_003910832.1| PREDICTED: histone-lysine N-methyltransferase MLL [Papio anubis]
Length = 3968
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3839 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3881
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3882 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3935
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3936 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3968
>gi|344293012|ref|XP_003418218.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL-like [Loxodonta africana]
Length = 3962
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3833 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3875
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3876 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3929
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3930 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3962
>gi|326921432|ref|XP_003206963.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD2-like [Meleagris gallopavo]
Length = 2147
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 59/204 (28%), Positives = 88/204 (43%), Gaps = 47/204 (23%)
Query: 1887 VAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI 1946
+ +KG G+ K+ + FV+E+ GEV K F+ + + KN +
Sbjct: 1361 LTEKKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHY 1409
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y K D ++DA K N + + HSC PNCE + V+G ++G +T + +
Sbjct: 1410 YFMALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPS 1463
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGE-------------------- 2046
G E+TFDY K EA C CGS CRG YL GE
Sbjct: 1464 GSELTFDYQFQRYGK---EAQKCFCGSANCRG-YLG--GENRVSIRAAGGKMKKERSRKK 1517
Query: 2047 ----GAFEKVLKELHGLLDRHQLM 2066
G E +L+ GL D++Q++
Sbjct: 1518 DSVDGELEALLENGEGLSDKNQVL 1541
>gi|390469747|ref|XP_002754504.2| PREDICTED: histone-lysine N-methyltransferase MLL [Callithrix
jacchus]
Length = 3994
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3865 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3907
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3908 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3961
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3962 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3994
>gi|355559685|gb|EHH16413.1| hypothetical protein EGK_11693 [Macaca mulatta]
Length = 2343
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 61/207 (29%), Positives = 89/207 (42%), Gaps = 48/207 (23%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1338 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1386
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1387 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1440
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGE----------------------- 2046
+TFDY K EA C CGS CRG YL GE
Sbjct: 1441 LTFDYQFQRYGK---EAQKCFCGSANCRG-YLG--GENRVSIRAAGGKMKKERSRKKDSV 1494
Query: 2047 -GAFEKVLKELHGLLDRHQLMLEACEL 2072
G E +++ GL D++Q+ L C L
Sbjct: 1495 DGELEALMENGEGLSDKNQV-LSLCRL 1520
>gi|328701191|ref|XP_003241521.1| PREDICTED: hypothetical protein LOC100573227 [Acyrthosiphon pisum]
Length = 1315
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 44/79 (55%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N A I HSC PNC AK+ +DG +I IY+ + I EEIT+DY E
Sbjct: 1239 TIIDATKCGNLARFINHSCNPNCYAKIIQIDGQKKIVIYSKQPIGVNEEITYDYKFPLED 1298
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCG+ CRG+
Sbjct: 1299 NK----IPCLCGTHCCRGT 1313
>gi|432873648|ref|XP_004072321.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Oryzias
latipes]
Length = 1597
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 74/148 (50%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G+ N+ + DFV E++GEV + ++ + +++ +E+ FY + L +
Sbjct: 1308 RGWGLQTNQ--ALRKGDFVAEYVGEV------IDSEECQQRIKRAHENHVTNFYMLTLTK 1359
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ V+DA K N A I HSC PNCE + V+G +IGI+ + I G E+
Sbjct: 1360 DR---------VIDAGPKGNSARFINHSCNPNCETQKWTVNGDVRIGIFALCDIEAGTEL 1410
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN + C CGS+ C G
Sbjct: 1411 TFNYNLHCVGNRR---TSCHCGSENCSG 1435
>gi|195130337|ref|XP_002009608.1| GI15146 [Drosophila mojavensis]
gi|193908058|gb|EDW06925.1| GI15146 [Drosophila mojavensis]
Length = 1885
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 47/153 (30%), Positives = 76/153 (49%), Gaps = 20/153 (13%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
+ +KG G+ GE F++E++GEV ++ +Q ++ + +Y
Sbjct: 987 FRTKKKGCGITAEMLIPPGE--FIMEYVGEVIDSEEFERRQHHYSQIRNRH------YYF 1038
Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
+ L +G+A ++DA K N + I HSC PN E + V+G +IG ++V+ I
Sbjct: 1039 MAL---RGEA------IIDATVKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKTIL 1089
Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
GEEITFDY + +A C C S+ CRG
Sbjct: 1090 PGEEITFDYQYQRYGR---DAQRCYCESENCRG 1119
>gi|426244626|ref|XP_004016122.1| PREDICTED: histone-lysine N-methyltransferase MLL [Ovis aries]
Length = 3710
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3581 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3623
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3624 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3677
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3678 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3710
>gi|241948091|ref|XP_002416768.1| COMPASS complex histone methyltransferase subunit, putative;
histone-lysine n-methyltransferase, h3 lysine-4 specific,
putative [Candida dubliniensis CD36]
gi|223640106|emb|CAX44352.1| COMPASS complex histone methyltransferase subunit, putative [Candida
dubliniensis CD36]
Length = 1032
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 48/82 (58%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C P+C AK+ V+G +I IY +R I EE+T+DY E+
Sbjct: 953 TVIDATKKGGIARFINHCCSPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERET 1012
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+E E CLCG+ C+G YLN
Sbjct: 1013 NDE-ERIRCLCGAPGCKG-YLN 1032
>gi|345799715|ref|XP_536554.3| PREDICTED: histone-lysine N-methyltransferase MLL [Canis lupus
familiaris]
Length = 3829
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3700 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3742
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3743 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3796
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3797 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3829
>gi|296197020|ref|XP_002746091.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Callithrix jacchus]
Length = 1365
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GEV ++++ + ++ +E+ FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEV------IDEEECMARIKHAHENDITHFYMLTIDK 1124
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200
>gi|432892259|ref|XP_004075732.1| PREDICTED: histone-lysine N-methyltransferase MLL-like [Oryzias
latipes]
Length = 4536
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 58/173 (33%), Positives = 79/173 (45%), Gaps = 31/173 (17%)
Query: 1875 LKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQK 1934
LKAM Y + G G+ C K GE V+E+ G V IRS+
Sbjct: 4390 LKAMSKETVGVYRSPIHGRGLFCKKTIEAGE--MVIEYSGNV------------IRSVLT 4435
Query: 1935 NNEDPAPEFYN-----IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTA 1989
+ + ++Y+ Y+ R D Y+ VVDA N A I HSC PNC ++V
Sbjct: 4436 DKRE---KYYDAKGIGCYMFR----IDDYE--VVDATVHGNAARFINHSCEPNCYSRVLT 4486
Query: 1990 VDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
VDG I I+ R I GEE+T+DY E + C CG++ CR +LN
Sbjct: 4487 VDGQKHIVIFASRRICCGEELTYDYKFPIE--DASNKLPCNCGTKKCR-KFLN 4536
>gi|417414196|gb|JAA53397.1| Putative histone-lysine n-methyltransferase mll, partial [Desmodus
rotundus]
Length = 3966
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3837 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3879
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3880 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3933
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3934 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3966
>gi|157824020|ref|NP_001101659.1| histone-lysine N-methyltransferase SETD2 [Rattus norvegicus]
gi|149018436|gb|EDL77077.1| kinesin family member 9 (predicted) [Rattus norvegicus]
Length = 2294
Score = 71.6 bits (174), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1290 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1338
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1339 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1392
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1393 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1420
>gi|403159096|ref|XP_003890756.1| histone-lysine N-methyltransferase SETD1 [Puccinia graminis f. sp.
tritici CRL 75-36-700-3]
gi|375166585|gb|EHS63201.1| histone-lysine N-methyltransferase SETD1 [Puccinia graminis f. sp.
tritici CRL 75-36-700-3]
Length = 1502
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 35/81 (43%), Positives = 47/81 (58%), Gaps = 4/81 (4%)
Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
D +VVDA K N I H C PNC AK+ ++G +I IY I G+E+T+DY+
Sbjct: 1424 DDLVVDATKKGNLGRLINHCCSPNCTAKIITINGEKKIVIYAKVTIELGDEVTYDYHF-- 1481
Query: 2019 ESKEEYEASVCLCGSQVCRGS 2039
KEE + CLCGS C+G+
Sbjct: 1482 -PKEEVKIP-CLCGSVKCKGT 1500
>gi|354484245|ref|XP_003504300.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Cricetulus
griseus]
gi|344236054|gb|EGV92157.1| Histone-lysine N-methyltransferase SETD2 [Cricetulus griseus]
Length = 2412
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1408 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1456
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1457 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1510
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1511 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1538
>gi|297269329|ref|XP_001093874.2| PREDICTED: histone-lysine N-methyltransferase MLL [Macaca mulatta]
Length = 3986
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3857 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3899
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3900 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3953
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3954 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3986
>gi|73985747|ref|XP_864158.1| PREDICTED: histone-lysine N-methyltransferase SETD2 isoform 11 [Canis
lupus familiaris]
Length = 2562
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1557 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1605
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1606 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1659
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1660 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1687
>gi|627837|pir||A48205 All-1 protein +GTE form - mouse (fragment)
Length = 3869
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3740 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3782
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3783 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3836
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3837 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3869
>gi|297817294|ref|XP_002876530.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297322368|gb|EFH52789.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 354
Score = 71.6 bits (174), Expect = 6e-09, Method: Composition-based stats.
Identities = 48/146 (32%), Positives = 73/146 (50%), Gaps = 20/146 (13%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+V +++ GE F++E++GEV K E++ L K N FY +
Sbjct: 123 GYGIVADEDINSGE--FIIEYVGEVVIDEKICEER-----LWKLNHKVEKNFYLCQINWN 175
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
+V+DA HK N + I HSC PN E + +DG +IGI+ R I+ GE++T
Sbjct: 176 ---------MVIDATHKGNKSRYINHSCNPNTEMQKWIIDGETRIGIFATRFINKGEQLT 226
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCR 2037
+DY V ++ C CG+ CR
Sbjct: 227 YDYQFVQFGADQD----CYCGAVCCR 248
>gi|328711160|ref|XP_001945277.2| PREDICTED: histone-lysine N-methyltransferase SETD1B-like
[Acyrthosiphon pisum]
Length = 1322
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 44/79 (55%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N A I HSC PNC AK+ +DG +I IY+ + I EEIT+DY E
Sbjct: 1246 TIIDATKCGNLARFINHSCNPNCYAKIIQIDGQKKIVIYSKQPIGVNEEITYDYKFPLED 1305
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCG+ CRG+
Sbjct: 1306 NK----IPCLCGTHCCRGT 1320
>gi|119585211|gb|EAW64807.1| SET domain containing 2, isoform CRA_c [Homo sapiens]
Length = 1819
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 73/155 (47%), Gaps = 21/155 (13%)
Query: 1887 VAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI 1946
+ +KG G+ K+ + FV+E+ GEV K F+ + + KN +
Sbjct: 1334 LTEKKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHY 1382
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y K D ++DA K N + + HSC PNCE + V+G ++G +T + +
Sbjct: 1383 YFMALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPS 1436
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
G E+TFDY K EA C CGS CRG YL
Sbjct: 1437 GSELTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1467
>gi|301785015|ref|XP_002927929.1| PREDICTED: histone-lysine N-methyltransferase MLL-like [Ailuropoda
melanoleuca]
Length = 3981
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3852 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3894
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3895 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3948
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3949 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3981
>gi|124486682|ref|NP_001074518.1| histone-lysine N-methyltransferase MLL [Mus musculus]
Length = 3963
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3834 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3876
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3877 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3930
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3931 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3963
>gi|688443|gb|AAA62593.1| All-1 protein, partial [Mus musculus]
Length = 3866
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3737 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3779
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3780 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3833
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3834 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3866
>gi|426232375|ref|XP_004010202.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Ovis aries]
Length = 1273
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 80/148 (54%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + +++ +E+ FY + +++
Sbjct: 1019 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKRAHENDITHFYMLTIDK 1070
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1071 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1121
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1122 TFNYNLDCLGNEK---TVCRCGASNCSG 1146
>gi|385305977|gb|EIF49918.1| putative compass histone methyltransferase subunit set1p [Dekkera
bruxellensis AWRI1499]
Length = 1104
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 38/84 (45%), Positives = 48/84 (57%), Gaps = 2/84 (2%)
Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
D V+DA K A I H C P+C AK+ VDG +I IY +R I EE+T+DY
Sbjct: 1023 DNTVIDASKKGGIARFINHCCDPSCTAKIIKVDGKKRIVIYALRDIAANEELTYDYKFEK 1082
Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
E+ E E CLCG+ C+G YLN
Sbjct: 1083 ETNPE-ERIPCLCGAPNCKG-YLN 1104
>gi|363729887|ref|XP_418510.3| PREDICTED: histone-lysine N-methyltransferase SETD2 [Gallus gallus]
Length = 2554
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 87/201 (43%), Gaps = 47/201 (23%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1557 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1605
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1606 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1659
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGE----------------------- 2046
+TFDY K EA C CGS CRG YL GE
Sbjct: 1660 LTFDYQFQRYGK---EAQKCFCGSANCRG-YLG--GENRVSIRAAGGKMKKERSRKKDSV 1713
Query: 2047 -GAFEKVLKELHGLLDRHQLM 2066
G E +L+ GL D++Q++
Sbjct: 1714 DGELEALLENGEGLSDKNQVL 1734
>gi|341940997|sp|P55200.3|MLL1_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL; AltName:
Full=ALL-1; AltName: Full=Zinc finger protein HRX;
Contains: RecName: Full=MLL cleavage product N320;
AltName: Full=N-terminal cleavage product of 320 kDa;
Short=p320; Contains: RecName: Full=MLL cleavage product
C180; AltName: Full=C-terminal cleavage product of 180
kDa; Short=p180
Length = 3966
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3837 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3879
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3880 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3933
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3934 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3966
>gi|76666643|ref|XP_613048.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Bos taurus]
gi|297476142|ref|XP_002688498.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Bos
taurus]
gi|296486298|tpg|DAA28411.1| TPA: Wolf-Hirschhorn syndrome candidate 1 [Bos taurus]
Length = 1365
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 80/148 (54%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + +++ +E+ FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKRAHENDITHFYMLTIDK 1124
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200
>gi|395848655|ref|XP_003796965.1| PREDICTED: histone-lysine N-methyltransferase MLL [Otolemur
garnettii]
Length = 4062
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 52/156 (33%), Positives = 75/156 (48%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3933 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3975
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3976 YMFR----ID--DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 4029
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 4030 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 4062
>gi|340378403|ref|XP_003387717.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Amphimedon
queenslandica]
Length = 862
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 72/149 (48%), Gaps = 26/149 (17%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
GL C+ FV+E+ GEV + + FE++ I + +Y + L
Sbjct: 136 GLKATCD----ISRYSFVMEYCGEVCSLEE-FERRRNIYEKESRRH-----YYFMSL--- 182
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
K D ++DA K N + I HSC PNCE + V+G ++G + +R I GEE+T
Sbjct: 183 KTDE------ILDATRKGNLSRFINHSCEPNCETQKWTVNGRLRVGFFALRHIPAGEELT 236
Query: 2012 FDYNSVTESKEEYEASV--CLCGSQVCRG 2038
FDY + + SV C CGS+ CRG
Sbjct: 237 FDYQF-----QRFGESVQKCYCGSETCRG 260
>gi|426340342|ref|XP_004034089.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Gorilla gorilla
gorilla]
Length = 2564
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1559 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1607
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1608 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1661
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1662 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1689
>gi|184394|gb|AAA58669.1| HRX [Homo sapiens]
Length = 3969
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3840 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3882
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3883 YMFRID------DSEVVDATMHGNRARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3936
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3937 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3969
>gi|448520177|ref|XP_003868242.1| Set1 Lysine histone methyltransferase [Candida orthopsilosis Co
90-125]
gi|380352581|emb|CCG22808.1| Set1 Lysine histone methyltransferase [Candida orthopsilosis]
Length = 1038
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 48/82 (58%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C P+C AK+ V+G +I IY +R I EE+T+DY E+
Sbjct: 959 TVIDATKKGGIARFINHCCNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERET 1018
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+E E CLCG+ C+G YLN
Sbjct: 1019 NDE-ERIRCLCGAPGCKG-YLN 1038
>gi|296474690|tpg|DAA16805.1| TPA: Wolf-Hirschhorn syndrome candidate 1 protein-like [Bos taurus]
Length = 2547
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1542 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1590
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1591 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1644
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1645 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1672
>gi|149041498|gb|EDL95339.1| myeloid/lymphoid or mixed-lineage leukemia (mapped) [Rattus
norvegicus]
Length = 3725
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3596 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3638
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3639 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3692
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3693 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3725
>gi|157112020|ref|XP_001657377.1| huntingtin interacting protein [Aedes aegypti]
gi|108878208|gb|EAT42433.1| AAEL006013-PA [Aedes aegypti]
Length = 2367
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 48/153 (31%), Positives = 76/153 (49%), Gaps = 20/153 (13%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
+ +KG G+ + E G DF++E++GEV + F+++ + S +KN +Y
Sbjct: 1277 FRTEKKGFGIQASTEIVPG--DFIMEYVGEVL-NSEQFDERAELYSKEKNQH-----YYF 1328
Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
+ L + DA ++DA K N + I HSC PN E + V+G +IG + + I
Sbjct: 1329 MAL---RSDA------IIDATTKGNISRFINHSCDPNAETQKWTVNGELRIGFFCTKYIM 1379
Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
GEEITFDY + A C C ++ C G
Sbjct: 1380 PGEEITFDYQFQRYGR---RAQKCYCEAENCTG 1409
>gi|432092361|gb|ELK24976.1| Histone-lysine N-methyltransferase SETD2 [Myotis davidii]
Length = 2865
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1862 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1910
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1911 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1964
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1965 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1992
>gi|403263194|ref|XP_003923935.1| PREDICTED: histone-lysine N-methyltransferase MLL [Saimiri
boliviensis boliviensis]
Length = 3985
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3856 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3898
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3899 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3952
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3953 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3985
>gi|336259450|ref|XP_003344526.1| hypothetical protein SMAC_07534 [Sordaria macrospora k-hell]
gi|380093240|emb|CCC08898.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 1314
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 47/143 (32%), Positives = 69/143 (48%), Gaps = 23/143 (16%)
Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
+DD ++E++GE E + I L++ YL+ G + + D
Sbjct: 1192 INKDDMIIEYVGE--------EVRQQIAELREAR----------YLKSGIGSSYLFRIDD 1233
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E
Sbjct: 1234 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERE 1293
Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
+ CLCG+ C+G +LN
Sbjct: 1294 IGST-DRIPCLCGTAACKG-FLN 1314
>gi|119914792|ref|XP_589886.3| PREDICTED: histone-lysine N-methyltransferase SETD2 [Bos taurus]
Length = 2547
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1542 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1590
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1591 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1644
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1645 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1672
>gi|344228738|gb|EGV60624.1| histone H3-K4 methyltransferase Set1 [Candida tenuis ATCC 10573]
Length = 1037
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C P+C AK+ VDG +I IY +R I EE+T+DY E+
Sbjct: 958 TVIDATKKGGIARFINHCCSPSCTAKIIKVDGKKRIVIYALRDIEANEELTYDYKFERET 1017
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ E CLCG+ C+G YLN
Sbjct: 1018 NDS-ERIRCLCGAPGCKG-YLN 1037
>gi|397498815|ref|XP_003820170.1| PREDICTED: histone-lysine N-methyltransferase MLL [Pan paniscus]
Length = 4202
Score = 71.6 bits (174), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 4073 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 4115
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 4116 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 4169
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 4170 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 4202
>gi|432105765|gb|ELK31956.1| Histone-lysine N-methyltransferase MLL, partial [Myotis davidii]
Length = 3463
Score = 71.2 bits (173), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3334 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3376
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3377 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3430
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3431 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3463
>gi|148693675|gb|EDL25622.1| mCG1547 [Mus musculus]
Length = 3706
Score = 71.2 bits (173), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3577 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3619
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3620 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3673
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3674 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3706
>gi|410220670|gb|JAA07554.1| SET domain containing 2 [Pan troglodytes]
gi|410261336|gb|JAA18634.1| SET domain containing 2 [Pan troglodytes]
gi|410295964|gb|JAA26582.1| SET domain containing 2 [Pan troglodytes]
gi|410339683|gb|JAA38788.1| SET domain containing 2 [Pan troglodytes]
Length = 2564
Score = 71.2 bits (173), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1559 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1607
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1608 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1661
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1662 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1689
>gi|348582642|ref|XP_003477085.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Cavia
porcellus]
Length = 2565
Score = 71.2 bits (173), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1560 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1608
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1609 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1662
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1663 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1690
>gi|197313748|ref|NP_054878.5| histone-lysine N-methyltransferase SETD2 [Homo sapiens]
gi|296452963|sp|Q9BYW2.3|SETD2_HUMAN RecName: Full=Histone-lysine N-methyltransferase SETD2; AltName:
Full=HIF-1; AltName: Full=Huntingtin yeast partner B;
AltName: Full=Huntingtin-interacting protein 1;
Short=HIP-1; AltName: Full=Huntingtin-interacting protein
B; AltName: Full=Lysine N-methyltransferase 3A; AltName:
Full=SET domain-containing protein 2; Short=hSET2;
AltName: Full=p231HBP
Length = 2564
Score = 71.2 bits (173), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1559 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1607
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1608 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1661
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1662 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1689
>gi|297671474|ref|XP_002813857.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Pongo abelii]
Length = 2563
Score = 71.2 bits (173), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1558 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1606
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1607 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1660
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1661 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1688
>gi|114586572|ref|XP_516423.2| PREDICTED: histone-lysine N-methyltransferase SETD2 isoform 3 [Pan
troglodytes]
Length = 2549
Score = 71.2 bits (173), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1544 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1592
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1593 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1646
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1647 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1674
>gi|440891718|gb|ELR45266.1| Histone-lysine N-methyltransferase SETD2, partial [Bos grunniens
mutus]
Length = 2533
Score = 71.2 bits (173), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1528 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1576
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1577 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1630
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1631 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1658
>gi|410911878|ref|XP_003969417.1| PREDICTED: uncharacterized protein LOC101064190 [Takifugu rubripes]
Length = 2720
Score = 71.2 bits (173), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 52/154 (33%), Positives = 74/154 (48%), Gaps = 30/154 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IR++ + ++Y+
Sbjct: 2591 GRGLFCKRNIEAGE--MVIEYAGTV------------IRAVLTDKRQ---KYYDGKGIGC 2633
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D +D VVDA + N A I HSC PNC ++V VDG I I+ +R I+
Sbjct: 2634 YMFR----IDDFD--VVDATMQGNAARFINHSCEPNCYSRVINVDGRKHIVIFALRKIYR 2687
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSY 2040
GEE+T+DY E E C CG++ CRGS
Sbjct: 2688 GEELTYDYKFPIEDDESKLH--CNCGTRRCRGSL 2719
>gi|397495290|ref|XP_003818492.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Pan paniscus]
Length = 2564
Score = 71.2 bits (173), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1559 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1607
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1608 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1661
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1662 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1689
>gi|242010887|ref|XP_002426189.1| mixed-lineage leukemia protein, mll, putative [Pediculus humanus
corporis]
gi|212510240|gb|EEB13451.1| mixed-lineage leukemia protein, mll, putative [Pediculus humanus
corporis]
Length = 574
Score = 71.2 bits (173), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 48/142 (33%), Positives = 68/142 (47%), Gaps = 24/142 (16%)
Query: 1903 FGEDDFVVEFLGE-VYPVWKWF-EKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL 1960
D+ V+E++G+ V P F EK+ R + + + I LE
Sbjct: 455 IAADEMVIEYVGQMVRPFLADFREKEYEKRGIG------SSYLFRIDLE----------- 497
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N A I HSC PNC AK+ ++G +I IY+ + I EEIT+DY E
Sbjct: 498 TIIDATKCGNLARFINHSCNPNCYAKIITIEGQKKIVIYSKKDIKVDEEITYDYKFPIEE 557
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
E CLCG+ C+G YLN
Sbjct: 558 ----EKIPCLCGAAQCKG-YLN 574
>gi|312378119|gb|EFR24776.1| hypothetical protein AND_10404 [Anopheles darlingi]
Length = 2632
Score = 71.2 bits (173), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 48/149 (32%), Positives = 74/149 (49%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ + GE F++E++GEV F+++ S KN +Y + L
Sbjct: 1470 KKGFGIQASAPIAPGE--FIMEYVGEVL-NGSQFDQRAEAYSRDKNKH-----YYFMALR 1521
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+DG ++DA K N + I HSC PN E + V+G +IG ++ + I GEE
Sbjct: 1522 -----SDG----IIDATTKGNISRFINHSCDPNAETQKWTVNGELRIGFFSTKYILPGEE 1572
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
ITFDY + +A C C ++ CRG
Sbjct: 1573 ITFDYQF---QRYGRKAQKCFCEAENCRG 1598
>gi|296225059|ref|XP_002758501.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Callithrix
jacchus]
Length = 2510
Score = 71.2 bits (173), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1505 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1553
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1554 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1607
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1608 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1635
>gi|354544237|emb|CCE40960.1| hypothetical protein CPAR2_109980 [Candida parapsilosis]
Length = 1042
Score = 71.2 bits (173), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 48/82 (58%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C P+C AK+ V+G +I IY +R I EE+T+DY E+
Sbjct: 963 TVIDATKKGGIARFINHCCNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERET 1022
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+E E CLCG+ C+G YLN
Sbjct: 1023 NDE-ERIRCLCGAPGCKG-YLN 1042
>gi|255539394|ref|XP_002510762.1| set domain protein, putative [Ricinus communis]
gi|223551463|gb|EEF52949.1| set domain protein, putative [Ricinus communis]
Length = 1258
Score = 71.2 bits (173), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 53/137 (38%), Positives = 68/137 (49%), Gaps = 25/137 (18%)
Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVV 1962
+DFV+E++GE+ IR + + E I YL R DGY V
Sbjct: 1142 EDFVIEYVGEL------------IRPRISDIRERLYEKMGIGSSYLFRLD---DGY---V 1183
Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
VDA + A I HSC PNC KV +V+G +I IY R I GEEIT++Y E K+
Sbjct: 1184 VDATKRGGVARFINHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKK 1243
Query: 2023 EYEASVCLCGSQVCRGS 2039
C CGS+ CRGS
Sbjct: 1244 ----IPCNCGSRKCRGS 1256
>gi|384493570|gb|EIE84061.1| hypothetical protein RO3G_08766 [Rhizopus delemar RA 99-880]
Length = 815
Score = 71.2 bits (173), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 36/79 (45%), Positives = 45/79 (56%), Gaps = 7/79 (8%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
++DA K A I HSC PNC + V + +IGI+T R I GEE+TFDY
Sbjct: 103 IIDATKKGCLARFINHSCNPNCVTQKWVVGKNMRIGIFTTRCIKAGEELTFDYKF----- 157
Query: 2022 EEY--EASVCLCGSQVCRG 2038
E Y +A VC CG QVC+G
Sbjct: 158 ERYGAQAQVCYCGEQVCKG 176
>gi|332216412|ref|XP_003257344.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Nomascus
leucogenys]
Length = 2499
Score = 71.2 bits (173), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1544 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1592
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1593 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1646
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1647 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1674
>gi|302682536|ref|XP_003030949.1| hypothetical protein SCHCODRAFT_77129 [Schizophyllum commune H4-8]
gi|300104641|gb|EFI96046.1| hypothetical protein SCHCODRAFT_77129 [Schizophyllum commune H4-8]
Length = 171
Score = 71.2 bits (173), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 48/82 (58%), Gaps = 5/82 (6%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
+VVDA K N I HSC PNC AK+ ++G +I IY R I G+EIT+DY+ E
Sbjct: 95 IVVDATKKGNLGRLINHSCDPNCTAKIITINGEKKIVIYAKRDIELGDEITYDYHFPFEQ 154
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ CLCG+ CRG +LN
Sbjct: 155 ----DKIPCLCGTAKCRG-FLN 171
>gi|431905124|gb|ELK10179.1| Histone-lysine N-methyltransferase SETD2 [Pteropus alecto]
Length = 2482
Score = 71.2 bits (173), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1477 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1525
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1526 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1579
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1580 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1607
>gi|410972021|ref|XP_003992459.1| PREDICTED: histone-lysine N-methyltransferase MLL [Felis catus]
Length = 3554
Score = 71.2 bits (173), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3425 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3467
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3468 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3521
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3522 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3554
>gi|392590566|gb|EIW79895.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 160
Score = 71.2 bits (173), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 50/139 (35%), Positives = 68/139 (48%), Gaps = 26/139 (18%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVV 1963
+ V+E++GEV +R+ + + A E I YL R D +VV
Sbjct: 45 EMVIEYVGEV------------VRAQVADKREKAYERQGIGSSYLFRIDED------LVV 86
Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
DA K N I HSC PNC A++ + G +I IY + I G+EIT+DY+ E
Sbjct: 87 DATKKGNLGRLINHSCDPNCTARIITISGEKKIVIYAKQDIELGDEITYDYHFPIEQ--- 143
Query: 2024 YEASVCLCGSQVCRGSYLN 2042
+ CLCGS CRG YLN
Sbjct: 144 -DKIPCLCGSAKCRG-YLN 160
>gi|42407424|dbj|BAD10031.1| SET domain protein-like [Oryza sativa Japonica Group]
Length = 437
Score = 71.2 bits (173), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 71/145 (48%), Gaps = 20/145 (13%)
Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
+DDFV+EF+GEV E+ + +R N Y+ + K D V+D
Sbjct: 311 KDDFVIEFVGEVIDDETCEERLEDMRRRGDKN---------FYMCKVKKD------FVID 355
Query: 1965 AMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY 2024
A K N HSC PNC+ + V+G ++G++ + I GE +T+DY E
Sbjct: 356 ATFKGNDCRFFNHSCEPNCQLQKWQVNGKTRLGVFASKAIEVGEPLTYDYRFEQHYGPEI 415
Query: 2025 EASVCLCGSQVCRGSYLNLTGEGAF 2049
E C CG+Q C+G+ +++ G G F
Sbjct: 416 E---CFCGAQNCQGN-MSIVG-GCF 435
>gi|119585214|gb|EAW64810.1| SET domain containing 2, isoform CRA_f [Homo sapiens]
Length = 2342
Score = 71.2 bits (173), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1337 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1385
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1386 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1439
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1440 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1467
>gi|350294046|gb|EGZ75131.1| histone-lysine N-methyltransferase, H3 lysine-4 specific [Neurospora
tetrasperma FGSC 2509]
Length = 1313
Score = 71.2 bits (173), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 47/143 (32%), Positives = 69/143 (48%), Gaps = 23/143 (16%)
Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
+DD ++E++GE E + I L++ YL+ G + + D
Sbjct: 1191 INKDDMIIEYVGE--------EVRQQIAELREAR----------YLKSGIGSSYLFRIDD 1232
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E
Sbjct: 1233 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERE 1292
Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
+ CLCG+ C+G +LN
Sbjct: 1293 IGST-DRIPCLCGTAACKG-FLN 1313
>gi|148677064|gb|EDL09011.1| mCG15806 [Mus musculus]
Length = 2034
Score = 71.2 bits (173), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1030 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1078
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1079 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1132
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1133 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1160
>gi|359078405|ref|XP_002697155.2| PREDICTED: histone-lysine N-methyltransferase SETD2 [Bos taurus]
Length = 1448
Score = 71.2 bits (173), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 482 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 530
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 531 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 584
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 585 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 612
>gi|432114829|gb|ELK36567.1| Putative histone-lysine N-methyltransferase NSD2 [Myotis davidii]
Length = 1037
Score = 71.2 bits (173), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 78/148 (52%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ E+ FY + +++
Sbjct: 687 KGWGLVATRDIRKGE--FVNEYVGEL------IDEEECMARIKHAQENDITHFYMLTIDK 738
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 739 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 789
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 790 TFNYNLDCLGNEK---TVCRCGAPNCSG 814
>gi|367037743|ref|XP_003649252.1| lysine methyltransferase enzyme-like protein [Thielavia terrestris
NRRL 8126]
gi|346996513|gb|AEO62916.1| lysine methyltransferase enzyme-like protein [Thielavia terrestris
NRRL 8126]
Length = 1286
Score = 71.2 bits (173), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 47/84 (55%), Gaps = 2/84 (2%)
Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
D V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY
Sbjct: 1205 DNTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFER 1264
Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
E + CLCG+ C+G +LN
Sbjct: 1265 ELGST-DRIPCLCGTAACKG-FLN 1286
>gi|119585209|gb|EAW64805.1| SET domain containing 2, isoform CRA_a [Homo sapiens]
Length = 1538
Score = 71.2 bits (173), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 73/155 (47%), Gaps = 21/155 (13%)
Query: 1887 VAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI 1946
+ +KG G+ K+ + FV+E+ GEV K F+ + + KN +
Sbjct: 1053 LTEKKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHY 1101
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y K D ++DA K N + + HSC PNCE + V+G ++G +T + +
Sbjct: 1102 YFMALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPS 1155
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
G E+TFDY K EA C CGS CRG YL
Sbjct: 1156 GSELTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1186
>gi|210032580|ref|NP_055863.1| histone-lysine N-methyltransferase SETD1B [Homo sapiens]
gi|166977692|sp|Q9UPS6.2|SET1B_HUMAN RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
Full=Lysine N-methyltransferase 2G; AltName: Full=SET
domain-containing protein 1B; Short=hSET1B
Length = 1923
Score = 71.2 bits (173), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1847 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1906
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 1907 VK----IPCLCGSENCRGT 1921
>gi|334192482|gb|AEG67286.1| histone-lysine N-methyltransferase [Homo sapiens]
Length = 1966
Score = 71.2 bits (173), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1890 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1949
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 1950 VK----IPCLCGSENCRGT 1964
>gi|443726566|gb|ELU13685.1| hypothetical protein CAPTEDRAFT_150651 [Capitella teleta]
Length = 292
Score = 71.2 bits (173), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 35/79 (44%), Positives = 45/79 (56%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
+++DA N A I HSC PNC AK+ V+ H +I IY+ R I EEIT+DY E
Sbjct: 216 LIIDATKCGNLARFINHSCNPNCVAKIITVESHKKIVIYSRRDIGVNEEITYDYKFPLED 275
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
E CLCG+ CRG+
Sbjct: 276 ----EKIPCLCGTSACRGT 290
>gi|431908264|gb|ELK11862.1| Histone-lysine N-methyltransferase HRX [Pteropus alecto]
Length = 3459
Score = 71.2 bits (173), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3330 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3372
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3373 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3426
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3427 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3459
>gi|74697791|sp|Q8X0S9.1|SET1_NEUCR RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|18376303|emb|CAD21415.1| related to regulatory protein SET1 [Neurospora crassa]
Length = 1313
Score = 71.2 bits (173), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 47/143 (32%), Positives = 69/143 (48%), Gaps = 23/143 (16%)
Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
+DD ++E++GE E + I L++ YL+ G + + D
Sbjct: 1191 INKDDMIIEYVGE--------EVRQQIAELREAR----------YLKSGIGSSYLFRIDD 1232
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E
Sbjct: 1233 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERE 1292
Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
+ CLCG+ C+G +LN
Sbjct: 1293 IGST-DRIPCLCGTAACKG-FLN 1313
>gi|410047437|ref|XP_003314036.2| PREDICTED: uncharacterized protein LOC473295, partial [Pan
troglodytes]
Length = 1955
Score = 71.2 bits (173), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1879 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1938
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 1939 VK----IPCLCGSENCRGT 1953
>gi|242221772|ref|XP_002476627.1| predicted protein [Postia placenta Mad-698-R]
gi|220724099|gb|EED78169.1| predicted protein [Postia placenta Mad-698-R]
Length = 115
Score = 71.2 bits (173), Expect = 8e-09, Method: Composition-based stats.
Identities = 50/137 (36%), Positives = 67/137 (48%), Gaps = 26/137 (18%)
Query: 1909 VVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVVDA 1965
V+E++GE+ IR+ + + A E I YL R D +VVDA
Sbjct: 2 VIEYVGEI------------IRAQVADKREKAYERQGIGSSYLFRIDED------LVVDA 43
Query: 1966 MHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE 2025
K N I HSC PNC AK+ ++G +I IY + I G EIT+DY+ E +
Sbjct: 44 TKKGNLGRLINHSCDPNCTAKIITINGEKKIVIYAKQDIELGSEITYDYHFPIEQ----D 99
Query: 2026 ASVCLCGSQVCRGSYLN 2042
CLCGS CRG +LN
Sbjct: 100 KIPCLCGSAKCRG-FLN 115
>gi|198435574|ref|XP_002121834.1| PREDICTED: absent, small, or homeotic discs 1 homolog [Ciona
intestinalis]
Length = 2850
Score = 71.2 bits (173), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 53/174 (30%), Positives = 83/174 (47%), Gaps = 22/174 (12%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G GV N + E F++E++GEV E++ R+++ N + + Y + LE
Sbjct: 2130 RGWGVRTNSD--IPEGQFLLEYVGEVVS-----EREFRRRTIE--NYNAHNDHYCVQLEA 2180
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
V+D AN + HSC+PNCE + V+G Y++G++ R I EE+
Sbjct: 2181 G---------TVIDGYRLANEGRFVNHSCQPNCEMQKWVVNGEYRVGLFAKRPIVSSEEL 2231
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFE--KVLKELHGLLDR 2062
T+DYN + + + C CGS CRG T GA + K LH +R
Sbjct: 2232 TYDYNFHAYNLDRQQP--CRCGSSECRGVIGGKTQRGAEQGGKTRSTLHPTKER 2283
>gi|242786320|ref|XP_002480782.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
gi|218720929|gb|EED20348.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
Length = 1155
Score = 71.2 bits (173), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1076 AVIDATKRGGIARFINHSCTPNCTAKIIRVDGSKRIVIYALRDISKDEELTYDYKFEREW 1135
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
E + CLCGS C+G +LN
Sbjct: 1136 DSE-DRIPCLCGSAGCKG-FLN 1155
>gi|384250559|gb|EIE24038.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
C-169]
Length = 295
Score = 70.9 bits (172), Expect = 8e-09, Method: Composition-based stats.
Identities = 54/170 (31%), Positives = 83/170 (48%), Gaps = 25/170 (14%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+K A KG G+ ++ G+ F+VE++GEV E+++ +R E +
Sbjct: 85 EKRRAGAKGFGLFATQDLVAGQ--FIVEYIGEV------LEEEEYLRRKDYYQESGQRHY 136
Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
Y ++ G+ V+DA K I HSC PNCE + V G IG+Y ++
Sbjct: 137 Y--FMNIGNGE-------VIDAARKGALGRFINHSCNPNCETQKWVVRGELAIGLYALKD 187
Query: 2004 IHYGEEITFDYNSVTESKEEY--EASVCLCGSQVCRGSYLNLTGEGAFEK 2051
I G E+TFDYN E Y + CLC ++VCRG ++ TGE ++
Sbjct: 188 IPAGVELTFDYNF-----ERYGDKPMRCLCEAKVCRG-FIGGTGEAVAQE 231
>gi|119618696|gb|EAW98290.1| hCG1812756 [Homo sapiens]
Length = 1048
Score = 70.9 bits (172), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 46/80 (57%), Gaps = 4/80 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 972 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1031
Query: 2021 KEEYEASVCLCGSQVCRGSY 2040
+ CLCGS+ CRG+
Sbjct: 1032 VK----IPCLCGSENCRGTL 1047
>gi|336472713|gb|EGO60873.1| hypothetical protein NEUTE1DRAFT_144212 [Neurospora tetrasperma FGSC
2508]
Length = 1282
Score = 70.9 bits (172), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 47/143 (32%), Positives = 69/143 (48%), Gaps = 23/143 (16%)
Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
+DD ++E++GE E + I L++ YL+ G + + D
Sbjct: 1160 INKDDMIIEYVGE--------EVRQQIAELREAR----------YLKSGIGSSYLFRIDD 1201
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E
Sbjct: 1202 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERE 1261
Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
+ CLCG+ C+G +LN
Sbjct: 1262 IGST-DRIPCLCGTAACKG-FLN 1282
>gi|440632035|gb|ELR01954.1| hypothetical protein GMDG_05127 [Geomyces destructans 20631-21]
Length = 1301
Score = 70.9 bits (172), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
VVDA + A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E
Sbjct: 1222 TVVDATKRGGIARFINHSCMPNCTAKIIKVEGTRRIVIYALRDIKLNEELTYDYKFEREI 1281
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCG+ C+G +LN
Sbjct: 1282 GSD-DRIPCLCGTVACKG-FLN 1301
>gi|417406999|gb|JAA50136.1| Putative clathrin coat binding protein/huntingtin [Desmodus rotundus]
Length = 2557
Score = 70.9 bits (172), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1552 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1600
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1601 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1654
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1655 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1682
>gi|212543321|ref|XP_002151815.1| SET domain protein [Talaromyces marneffei ATCC 18224]
gi|210066722|gb|EEA20815.1| SET domain protein [Talaromyces marneffei ATCC 18224]
Length = 1188
Score = 70.9 bits (172), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1109 AVIDATKRGGIARFINHSCTPNCTAKIIRVDGSKRIVIYALRDISKDEELTYDYKFEREW 1168
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
E + CLCGS C+G +LN
Sbjct: 1169 DSE-DRIPCLCGSAGCKG-FLN 1188
>gi|403281795|ref|XP_003932362.1| PREDICTED: histone-lysine N-methyltransferase SETD1B [Saimiri
boliviensis boliviensis]
Length = 1823
Score = 70.9 bits (172), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1747 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1806
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 1807 VK----IPCLCGSENCRGT 1821
>gi|211830050|gb|AAH38367.2| Setd1b protein [Mus musculus]
Length = 1103
Score = 70.9 bits (172), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 46/80 (57%), Gaps = 4/80 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1027 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1086
Query: 2021 KEEYEASVCLCGSQVCRGSY 2040
+ CLCGS+ CRG+
Sbjct: 1087 VK----IPCLCGSENCRGTL 1102
>gi|322788177|gb|EFZ13959.1| hypothetical protein SINV_06678 [Solenopsis invicta]
Length = 1093
Score = 70.9 bits (172), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 66/131 (50%), Gaps = 18/131 (13%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
FV+E++GE+ + + R L + E FY + ++ + +DA
Sbjct: 915 FVIEYVGEI------IDDAEYKRRLHRKKELKNENFYFLTIDNNR---------TIDAEP 959
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
K N + + HSC PNCE + V+G +IG++ +R I GEE+TF+YN ++ +
Sbjct: 960 KGNLSRFMNHSCAPNCETQKWTVNGDTRIGLFALRDIESGEELTFNYNLASDGETR---K 1016
Query: 2028 VCLCGSQVCRG 2038
CLCG+ C G
Sbjct: 1017 ACLCGASNCSG 1027
>gi|224071200|ref|XP_002193972.1| PREDICTED: histone-lysine N-methyltransferase SETD1B [Taeniopygia
guttata]
Length = 2004
Score = 70.9 bits (172), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1928 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1987
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 1988 VK----IPCLCGSENCRGT 2002
>gi|12697196|emb|CAC28349.1| huntingtin interacting protein 1 [Homo sapiens]
gi|50512435|gb|AAT77612.1| HSPC069 isoform a [Homo sapiens]
Length = 2061
Score = 70.9 bits (172), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1056 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1104
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1105 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1158
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1159 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1186
>gi|194764639|ref|XP_001964436.1| GF23177 [Drosophila ananassae]
gi|190614708|gb|EDV30232.1| GF23177 [Drosophila ananassae]
Length = 3708
Score = 70.9 bits (172), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 73/151 (48%), Gaps = 23/151 (15%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C K+ GE V+E+ GE+ IRS + + + I
Sbjct: 3581 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3626
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
K D D +VVDA + N A I HSC PNC +KV + GH I I+ +R I GEE+T
Sbjct: 3627 KID----DNLVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 3682
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY + E E C CGS+ CR YLN
Sbjct: 3683 YDY----KFPFEEEKIPCSCGSKRCR-KYLN 3708
>gi|109658484|gb|AAI17163.1| SET domain containing 2 [Homo sapiens]
gi|109658962|gb|AAI17165.1| SET domain containing 2 [Homo sapiens]
Length = 2061
Score = 70.9 bits (172), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1056 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1104
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1105 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1158
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1159 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1186
>gi|402887949|ref|XP_003907341.1| PREDICTED: uncharacterized protein LOC101023789 [Papio anubis]
Length = 1927
Score = 70.9 bits (172), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1851 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1910
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 1911 VK----IPCLCGSENCRGT 1925
>gi|160333334|ref|NP_001103749.1| histone-lysine N-methyltransferase MLL [Danio rerio]
gi|158714185|gb|ABW79914.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
Length = 4218
Score = 70.9 bits (172), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 76/162 (46%), Gaps = 31/162 (19%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
Y + G G+ C K GE V+E+ G V IRS+ + + ++Y+
Sbjct: 4083 YRSAIHGRGLFCRKNIEPGE--MVIEYSGNV------------IRSVLTDKRE---KYYD 4125
Query: 1946 -----IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
Y+ R D Y+ VVDA N A I HSC PNC ++V VDG I I+
Sbjct: 4126 DKGIGCYMFR----IDDYE--VVDATIHGNSARFINHSCEPNCYSRVVNVDGQKHIVIFA 4179
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
R I+ GEE+T+DY E E C CG++ CR +LN
Sbjct: 4180 TRKIYKGEELTYDYKFPIE--EPGNKLPCNCGAKKCR-KFLN 4218
>gi|302801428|ref|XP_002982470.1| hypothetical protein SELMODRAFT_421873 [Selaginella moellendorffii]
gi|300149569|gb|EFJ16223.1| hypothetical protein SELMODRAFT_421873 [Selaginella moellendorffii]
Length = 1285
Score = 70.9 bits (172), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 49/152 (32%), Positives = 70/152 (46%), Gaps = 26/152 (17%)
Query: 1895 VVCNKEG-------GFGEDDFVVEFLGEVYPVWKW-FEKQDGIRSLQKNNEDPAPEFYNI 1946
V C K+G + FV+E++GEV + +++ R QK+ FY +
Sbjct: 580 VRCGKKGFGLKALENIAKGSFVIEYVGEVLDSRSFELRQKEYARQRQKH-------FYFM 632
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
L + V+DA K N I HSC PNC+ + V+G IG++ +R +
Sbjct: 633 TLNSSE---------VIDACRKGNLGRFINHSCEPNCQTEKWCVNGEICIGLFAIRDVAK 683
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
EEITF+YN E A C CGS CRG
Sbjct: 684 NEEITFNYN--FERLYGAAAKKCHCGSAHCRG 713
>gi|225380774|gb|ACN88688.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
Length = 4219
Score = 70.9 bits (172), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 76/162 (46%), Gaps = 31/162 (19%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
Y + G G+ C K GE V+E+ G V IRS+ + + ++Y+
Sbjct: 4084 YRSAIHGRGLFCRKNIEPGE--MVIEYSGNV------------IRSVLTDKRE---KYYD 4126
Query: 1946 -----IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
Y+ R D Y+ VVDA N A I HSC PNC ++V VDG I I+
Sbjct: 4127 DKGIGCYMFR----IDDYE--VVDATIHGNSARFINHSCEPNCYSRVVNVDGQKHIVIFA 4180
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
R I+ GEE+T+DY E E C CG++ CR +LN
Sbjct: 4181 TRKIYKGEELTYDYKFPIE--EPGNKLPCNCGAKKCR-KFLN 4219
>gi|134084734|emb|CAK43391.1| unnamed protein product [Aspergillus niger]
Length = 1079
Score = 70.9 bits (172), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1000 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1059
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1060 DSD-DRIPCLCGSTGCKG-FLN 1079
>gi|409078063|gb|EKM78427.1| hypothetical protein AGABI1DRAFT_41599 [Agaricus bisporus var.
burnettii JB137-S8]
gi|426194069|gb|EKV44001.1| histone methyltransferase [Agaricus bisporus var. bisporus H97]
Length = 163
Score = 70.9 bits (172), Expect = 9e-09, Method: Composition-based stats.
Identities = 51/139 (36%), Positives = 68/139 (48%), Gaps = 26/139 (18%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVV 1963
+ V+E++GEV IR+ + + A E I YL R D +VV
Sbjct: 48 EMVIEYVGEV------------IRAAVADKREKAYERQGIGSSYLFRIDED------LVV 89
Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
DA K N I HSC PNC AK+ + G +I IY + I G+EIT+DY+ E
Sbjct: 90 DATKKGNLGRLINHSCDPNCTAKIITISGVKKIVIYAKQDIELGDEITYDYHFPFEQ--- 146
Query: 2024 YEASVCLCGSQVCRGSYLN 2042
+ CLCGS CRG +LN
Sbjct: 147 -DKIPCLCGSAKCRG-FLN 163
>gi|338714932|ref|XP_001495700.3| PREDICTED: histone-lysine N-methyltransferase SETD2 [Equus caballus]
Length = 2064
Score = 70.9 bits (172), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1059 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1107
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1108 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1161
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1162 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1189
>gi|317037780|ref|XP_001399137.2| histone-lysine N-methyltransferase, H3 lysine-4 specific [Aspergillus
niger CBS 513.88]
Length = 1239
Score = 70.9 bits (172), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1160 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1219
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1220 DSD-DRIPCLCGSTGCKG-FLN 1239
>gi|403268536|ref|XP_003926329.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Saimiri
boliviensis boliviensis]
Length = 2057
Score = 70.9 bits (172), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1052 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1100
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1101 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1154
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1155 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1182
>gi|396578140|ref|NP_001035488.2| histone-lysine N-methyltransferase SETD1B [Mus musculus]
gi|166977693|sp|Q8CFT2.2|SET1B_MOUSE RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
Full=SET domain-containing protein 1B
Length = 1985
Score = 70.9 bits (172), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1909 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1968
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 1969 VK----IPCLCGSENCRGT 1983
>gi|170050731|ref|XP_001861443.1| set domain protein [Culex quinquefasciatus]
gi|167872245|gb|EDS35628.1| set domain protein [Culex quinquefasciatus]
Length = 1181
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 74/148 (50%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ G+ FV+E++GEV ++ R +++ E +Y + ++
Sbjct: 961 KGWGLVAQEDIHQGQ--FVIEYVGEV------INGEELARRIKQKQEQKDENYYFLTVD- 1011
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ +DA K N A I HSC PNCE + V G +G++ ++ + GEE+
Sbjct: 1012 --------SELTIDAGPKGNLARFINHSCEPNCETLLWKVGGSQSVGLFALKDLKAGEEL 1063
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN T ++ +C CG+ C G
Sbjct: 1064 TFNYNFETFGDQK---KICHCGAAKCSG 1088
>gi|441630858|ref|XP_003280765.2| PREDICTED: uncharacterized protein LOC100584028 [Nomascus leucogenys]
Length = 1863
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1787 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1846
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 1847 VK----IPCLCGSENCRGT 1861
>gi|395543169|ref|XP_003773493.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Sarcophilus harrisii]
Length = 1464
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1074 KGWGLVAKRDIKKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1125
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1126 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1176
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1177 TFNYNLDCLGNEK---TVCRCGASNCSG 1201
>gi|71897211|ref|NP_001025832.1| histone-lysine N-methyltransferase SETD1B [Gallus gallus]
gi|82231199|sp|Q5F3P8.1|SET1B_CHICK RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
Full=SET domain-containing protein 1B
gi|60098811|emb|CAH65236.1| hypothetical protein RCJMB04_10j6 [Gallus gallus]
Length = 2008
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1932 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1991
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 1992 VK----IPCLCGSENCRGT 2006
>gi|301754587|ref|XP_002913168.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1B-like [Ailuropoda melanoleuca]
Length = 1805
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1729 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1788
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 1789 VK----IPCLCGSENCRGT 1803
>gi|149063329|gb|EDM13652.1| rCG21620 [Rattus norvegicus]
Length = 1091
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 46/80 (57%), Gaps = 4/80 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1015 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1074
Query: 2021 KEEYEASVCLCGSQVCRGSY 2040
+ CLCGS+ CRG+
Sbjct: 1075 VK----IPCLCGSENCRGTL 1090
>gi|157126101|ref|XP_001654536.1| set domain protein [Aedes aegypti]
gi|108873380|gb|EAT37605.1| AAEL010414-PA [Aedes aegypti]
Length = 1480
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 75/149 (50%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+V ++ G+ FV+E++GEV ++ R LQ +Y + +
Sbjct: 1229 QKGWGLVAQEDIRQGQ--FVIEYVGEV------ISNEELERRLQHKVAQKDENYYFLTV- 1279
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
D++ + +DA K N A I HSC PNCE + V G +G++ + I GEE
Sbjct: 1280 ----DSE----LTIDAGPKGNLARFINHSCEPNCETMLWTVGGAQSVGLFAIMDIKAGEE 1331
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+TF+YN ESK + E VC C + C G
Sbjct: 1332 LTFNYN--FESKSD-EKKVCHCNASKCSG 1357
>gi|392352531|ref|XP_003751234.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-like [Rattus
norvegicus]
Length = 1900
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1824 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1883
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 1884 VK----IPCLCGSENCRGT 1898
>gi|312083807|ref|XP_003144016.1| hypothetical protein LOAG_08436 [Loa loa]
Length = 761
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 59/176 (33%), Positives = 82/176 (46%), Gaps = 22/176 (12%)
Query: 1867 TMKMC-RGILKAMDSRPDDKYVAYR----KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWK 1921
+ +MC L+ D+ DD ++ + KG G + G D + E++G V +
Sbjct: 448 SQQMCANNFLRHHDTNDDDLFMEEKPTILKGFGAFAKCDINKGTD--LTEYVGHVMTKEE 505
Query: 1922 WFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRP 1981
+FEK R L N E ++ + L D Y VDA + N A HSC P
Sbjct: 506 YFEKLR-FRCLFNNLE---ASYFGMQLTN-----DFY----VDARNYGNIARSFNHSCEP 552
Query: 1982 NCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
N + VDG Y++ I T+R I GEE+TFDY+ TE E CLCGS CR
Sbjct: 553 NTKVDAVVVDGIYRLKISTIRDIKKGEELTFDYD--TEIIEGLVGMECLCGSTNCR 606
>gi|410951014|ref|XP_003982197.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Felis catus]
Length = 2064
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1059 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1107
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1108 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1161
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1162 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1189
>gi|344297409|ref|XP_003420391.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1B-like [Loxodonta africana]
Length = 1750
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1674 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1733
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 1734 VK----IPCLCGSENCRGT 1748
>gi|164426120|ref|XP_961572.2| hypothetical protein NCU01206 [Neurospora crassa OR74A]
gi|157071206|gb|EAA32336.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 1150
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 47/143 (32%), Positives = 69/143 (48%), Gaps = 23/143 (16%)
Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
+DD ++E++GE E + I L++ YL+ G + + D
Sbjct: 1028 INKDDMIIEYVGE--------EVRQQIAELREAR----------YLKSGIGSSYLFRIDD 1069
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY E
Sbjct: 1070 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERE 1129
Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
+ CLCG+ C+G +LN
Sbjct: 1130 IGST-DRIPCLCGTAACKG-FLN 1150
>gi|345321023|ref|XP_001506028.2| PREDICTED: hypothetical protein LOC100074411 [Ornithorhynchus
anatinus]
Length = 1258
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1182 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1241
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 1242 VK----IPCLCGSENCRGT 1256
>gi|255730355|ref|XP_002550102.1| hypothetical protein CTRG_04399 [Candida tropicalis MYA-3404]
gi|240132059|gb|EER31617.1| hypothetical protein CTRG_04399 [Candida tropicalis MYA-3404]
Length = 1056
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 48/82 (58%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C P+C AK+ V+G +I IY +R I EE+T+DY E+
Sbjct: 977 TVIDATKKGGIARFINHCCSPSCTAKIIKVEGIKRIVIYALRDIEANEELTYDYKFERET 1036
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+E E CLCG+ C+G YLN
Sbjct: 1037 NDE-ERIRCLCGAPGCKG-YLN 1056
>gi|118090799|ref|XP_420839.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Gallus
gallus]
Length = 1369
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1078 KGWGLVAKRDIKKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1129
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1130 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1180
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1181 TFNYNLDCLGNEK---TVCKCGAPNCSG 1205
>gi|154278862|ref|XP_001540244.1| hypothetical protein HCAG_04084 [Ajellomyces capsulatus NAm1]
gi|150412187|gb|EDN07574.1| hypothetical protein HCAG_04084 [Ajellomyces capsulatus NAm1]
Length = 1266
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1187 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1246
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1247 DSD-DRIPCLCGSTGCKG-FLN 1266
>gi|225554361|gb|EEH02660.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus G186AR]
Length = 1267
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1188 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1247
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1248 DSD-DRIPCLCGSTGCKG-FLN 1267
>gi|168275530|dbj|BAG10485.1| SET domain-containing protein 2 [synthetic construct]
Length = 2642
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 46/82 (56%), Gaps = 4/82 (4%)
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
L ++DA K N + + HSC PNCE + V+G ++G +T + + G E+TFDY
Sbjct: 1690 LQIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSELTFDYQFQRY 1749
Query: 2020 SKEEYEASVCLCGSQVCRGSYL 2041
K EA C CGS CRG YL
Sbjct: 1750 GK---EAQKCFCGSANCRG-YL 1767
>gi|115400872|ref|XP_001216024.1| hypothetical protein ATEG_07403 [Aspergillus terreus NIH2624]
gi|114189965|gb|EAU31665.1| hypothetical protein ATEG_07403 [Aspergillus terreus NIH2624]
Length = 1230
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1151 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1210
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1211 DSD-DRIPCLCGSTGCKG-FLN 1230
>gi|406607680|emb|CCH40952.1| Histone-lysine N-methyltransferase [Wickerhamomyces ciferrii]
Length = 1071
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 48/82 (58%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C+P+C AK+ V+G +I IY +R I EE+T+DY E+
Sbjct: 992 TVIDATKKGGIARFINHCCQPSCTAKIIKVEGQKRIVIYALRDIGANEELTYDYKFERET 1051
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ E CLCG+ C+G YLN
Sbjct: 1052 NDN-ERVRCLCGAPGCKG-YLN 1071
>gi|392332670|ref|XP_003752655.1| PREDICTED: uncharacterized protein LOC100359816 [Rattus norvegicus]
Length = 2265
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 2189 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 2248
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 2249 VK----IPCLCGSENCRGT 2263
>gi|340381930|ref|XP_003389474.1| PREDICTED: histone-lysine N-methyltransferase trithorax-like
[Amphimedon queenslandica]
Length = 192
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 56/147 (38%), Positives = 71/147 (48%), Gaps = 24/147 (16%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
GLG+ C +E G D V+E+ G V IRS + + E I
Sbjct: 65 GLGLFCLQEIDSG--DMVIEYAGTV------------IRSTLTDYRERFYESRGIGCYMF 110
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
+ D+D VVDA N A I HSC PNC +KV AVDG +I I+ +R I GEE+T
Sbjct: 111 RIDSDE----VVDATMSGNMARFINHSCEPNCYSKVVAVDGQKKIMIFALRRIVPGEELT 166
Query: 2012 FDYNSVTESKEEYEASV-CLCGSQVCR 2037
+DY E EA + C CGS CR
Sbjct: 167 YDYKFPIE-----EAKIPCKCGSARCR 188
>gi|307180358|gb|EFN68384.1| Histone-lysine N-methyltransferase trithorax [Camponotus floridanus]
Length = 3218
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 60/175 (34%), Positives = 81/175 (46%), Gaps = 23/175 (13%)
Query: 1868 MKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQD 1927
M M ILK + Y ++ G G+ C ++ GE V+E+ GEV
Sbjct: 3067 MAMRFRILKETSKKSVGVYHSHIHGRGLFCLRDIEAGE--MVIEYAGEV----------- 3113
Query: 1928 GIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKV 1987
IRS + + + NI K D D +VVDA K N A I HSC PNC ++V
Sbjct: 3114 -IRSSLTDKREKYYDSKNIGCYMFKID----DHLVVDATMKGNAARFINHSCEPNCYSRV 3168
Query: 1988 TAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+ G I I+ +R I GEE+T+DY E + C CGS+ CR YLN
Sbjct: 3169 VDILGKKHILIFALRRIIQGEELTYDYKFPFEDIK----IPCTCGSRRCR-KYLN 3218
>gi|426374487|ref|XP_004054104.1| PREDICTED: uncharacterized protein LOC101124677 [Gorilla gorilla
gorilla]
Length = 1922
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1846 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1905
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 1906 VK----IPCLCGSENCRGT 1920
>gi|410976579|ref|XP_003994695.1| PREDICTED: uncharacterized protein LOC101096419 [Felis catus]
Length = 1919
Score = 70.9 bits (172), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1843 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1902
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 1903 VK----IPCLCGSENCRGT 1917
>gi|350630881|gb|EHA19253.1| hypothetical protein ASPNIDRAFT_56859 [Aspergillus niger ATCC 1015]
Length = 1101
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1022 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1081
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1082 DSD-DRIPCLCGSTGCKG-FLN 1101
>gi|325089235|gb|EGC42545.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus H88]
Length = 1267
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1188 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1247
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1248 DSD-DRIPCLCGSTGCKG-FLN 1267
>gi|195109821|ref|XP_001999480.1| GI24532 [Drosophila mojavensis]
gi|193916074|gb|EDW14941.1| GI24532 [Drosophila mojavensis]
Length = 3756
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 72/151 (47%), Gaps = 23/151 (15%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C K+ GE V+E+ GE+ IRS + + + I
Sbjct: 3629 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3674
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
K D D +VVDA + N A I HSC PNC +KV + GH I I+ +R I GEE+T
Sbjct: 3675 KID----DNLVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 3730
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY E E C CGS+ CR YLN
Sbjct: 3731 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3756
>gi|20521978|dbj|BAB21823.2| KIAA1732 protein [Homo sapiens]
Length = 1915
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 910 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 958
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 959 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1012
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1013 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1040
>gi|390360513|ref|XP_785219.3| PREDICTED: histone-lysine N-methyltransferase NSD3-like
[Strongylocentrotus purpuratus]
Length = 1736
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 80/149 (53%), Gaps = 19/149 (12%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
DFV E++GE+ ++++ R +++ +E+ +FY + L++ + ++DA
Sbjct: 1257 DFVNEYVGEL------VDEEECRRRIKQAHEENITDFYFLTLDKDR---------IIDAG 1301
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K N + + HSC+PNCE + V+G ++G++ +R I G EI+F+YN E+
Sbjct: 1302 PKGNLSRFMNHSCQPNCETQKWTVNGDTRVGLFAIRNIAAGNEISFNYNLDCLGNEKKR- 1360
Query: 2027 SVCLCGSQVCRGSYLNLTGEGAFEKVLKE 2055
C CG+ C G ++ + + A ++E
Sbjct: 1361 --CECGAPNCSG-FIGVRPKTAAAAAMEE 1386
>gi|348554403|ref|XP_003463015.1| PREDICTED: hypothetical protein LOC100714908 [Cavia porcellus]
Length = 1931
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 45/79 (56%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I EEIT+DY E
Sbjct: 1855 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHISVNEEITYDYKFPIED 1914
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 1915 VK----IPCLCGSENCRGT 1929
>gi|340959767|gb|EGS20948.1| hypothetical protein CTHT_0027870 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 1295
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 47/84 (55%), Gaps = 2/84 (2%)
Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
D V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY
Sbjct: 1214 DNTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAKNEELTYDYKFER 1273
Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
E + CLCG+ C+G +LN
Sbjct: 1274 ELGSA-DRIPCLCGTAACKG-FLN 1295
>gi|195392284|ref|XP_002054789.1| trx [Drosophila virilis]
gi|194152875|gb|EDW68309.1| trx [Drosophila virilis]
Length = 3822
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 72/151 (47%), Gaps = 23/151 (15%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C K+ GE V+E+ GE+ IRS + + + I
Sbjct: 3695 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3740
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
K D D +VVDA + N A I HSC PNC +KV + GH I I+ +R I GEE+T
Sbjct: 3741 KID----DNLVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 3796
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY E E C CGS+ CR YLN
Sbjct: 3797 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3822
>gi|355746723|gb|EHH51337.1| hypothetical protein EGM_10693 [Macaca fascicularis]
Length = 2343
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 60/207 (28%), Positives = 89/207 (42%), Gaps = 48/207 (23%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ ++ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1338 KKGWGLRAARD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1386
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1387 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1440
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGE----------------------- 2046
+TFDY K EA C CGS CRG YL GE
Sbjct: 1441 LTFDYQFQRYGK---EAQKCFCGSANCRG-YLG--GENRVSIRAAGGKMKKERSRKKDSV 1494
Query: 2047 -GAFEKVLKELHGLLDRHQLMLEACEL 2072
G E +++ GL D++Q+ L C L
Sbjct: 1495 DGELEALMENGEGLSDKNQV-LSLCRL 1520
>gi|60688116|gb|AAH90954.1| SETD2 protein [Homo sapiens]
Length = 1845
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ K+ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 840 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 888
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 889 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 942
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 943 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 970
>gi|327348240|gb|EGE77097.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ATCC
18188]
Length = 1280
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1201 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1260
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1261 DSD-DRIPCLCGSTGCKG-FLN 1280
>gi|358373521|dbj|GAA90119.1| SET domain protein [Aspergillus kawachii IFO 4308]
Length = 1239
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1160 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1219
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1220 DSD-DRIPCLCGSTGCKG-FLN 1239
>gi|28972602|dbj|BAC65717.1| mKIAA1076 protein [Mus musculus]
Length = 855
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 46/80 (57%), Gaps = 4/80 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 779 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 838
Query: 2021 KEEYEASVCLCGSQVCRGSY 2040
+ CLCGS+ CRG+
Sbjct: 839 VK----IPCLCGSENCRGTL 854
>gi|297263735|ref|XP_002808043.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1B-like [Macaca mulatta]
Length = 2216
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 2140 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 2199
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 2200 VK----IPCLCGSENCRGT 2214
>gi|261201264|ref|XP_002627032.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
SLH14081]
gi|239592091|gb|EEQ74672.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
SLH14081]
gi|239611745|gb|EEQ88732.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ER-3]
Length = 1259
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1180 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1239
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1240 DSD-DRIPCLCGSTGCKG-FLN 1259
>gi|126332220|ref|XP_001374612.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2
[Monodelphis domestica]
Length = 1366
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1074 KGWGLVAKRDIKKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1125
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1126 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1176
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1177 TFNYNLDCLGNEK---TVCRCGASNCSG 1201
>gi|5689489|dbj|BAA83028.1| KIAA1076 protein [Homo sapiens]
Length = 804
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 728 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 787
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 788 VK----IPCLCGSENCRGT 802
>gi|297672976|ref|XP_002814554.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Pongo abelii]
Length = 1365
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 1124
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200
>gi|119587786|gb|EAW67382.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila), isoform CRA_c [Homo sapiens]
Length = 3130
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 3001 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3043
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3044 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3097
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 3098 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3130
>gi|27371314|gb|AAH41681.1| Setd1b protein, partial [Mus musculus]
Length = 917
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 46/80 (57%), Gaps = 4/80 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 841 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 900
Query: 2021 KEEYEASVCLCGSQVCRGSY 2040
+ CLCGS+ CRG+
Sbjct: 901 VK----IPCLCGSENCRGTL 916
>gi|26251880|gb|AAH40775.1| Setd1b protein, partial [Mus musculus]
Length = 911
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 46/80 (57%), Gaps = 4/80 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 835 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 894
Query: 2021 KEEYEASVCLCGSQVCRGSY 2040
+ CLCGS+ CRG+
Sbjct: 895 VK----IPCLCGSENCRGTL 910
>gi|351704076|gb|EHB06995.1| Putative histone-lysine N-methyltransferase NSD2 [Heterocephalus
glaber]
Length = 1372
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1079 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1130
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1131 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1181
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1182 TFNYNLDCLGNEK---TVCRCGASNCSG 1206
>gi|383860108|ref|XP_003705533.1| PREDICTED: uncharacterized protein LOC100883855 [Megachile rotundata]
Length = 1766
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 74/149 (49%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ + GE F++E++GEV + +D R ++ ++D +Y + L
Sbjct: 822 KKGFGLRAMADMLAGE--FIMEYVGEV------VDPKDFRRRAKEYSKDKNRHYYFMAL- 872
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + I HSC PN E + V+G +IG + + I GEE
Sbjct: 873 --KSDQ------IIDATMKGNVSRFINHSCDPNSETQKWTVNGELRIGFFNKKFIAAGEE 924
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
ITFDY+ K EA C C + CRG
Sbjct: 925 ITFDYHFQRYGK---EAQKCFCEAANCRG 950
>gi|195064789|ref|XP_001996640.1| GH19675 [Drosophila grimshawi]
gi|193892772|gb|EDV91638.1| GH19675 [Drosophila grimshawi]
Length = 3837
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 72/151 (47%), Gaps = 23/151 (15%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C K+ GE V+E+ GE+ IRS + + + I
Sbjct: 3710 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3755
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
K D D +VVDA + N A I HSC PNC +KV + GH I I+ +R I GEE+T
Sbjct: 3756 KID----DNLVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 3811
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY E E C CGS+ CR YLN
Sbjct: 3812 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3837
>gi|149243887|ref|XP_001526541.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146448935|gb|EDK43191.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 1156
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 48/82 (58%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C P+C AK+ VDG +I IY +R I EE+T+DY E+
Sbjct: 1077 TVIDATKKGGIARFINHCCSPSCTAKIIKVDGKKRIVIYALRDIEANEELTYDYKFERET 1136
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
++ E CLCG+ C+G +LN
Sbjct: 1137 NDD-ERIRCLCGAPGCKG-FLN 1156
>gi|355744804|gb|EHH49429.1| Putative histone-lysine N-methyltransferase NSD2 [Macaca
fascicularis]
Length = 1365
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 1124
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200
>gi|410905477|ref|XP_003966218.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Takifugu
rubripes]
Length = 1950
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 71/148 (47%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+ K+ + FV+E+ GEV K F+ + + KN +Y + L+
Sbjct: 932 KGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKTRVKEYARNKNIH-----YYFMALKN 983
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K N + + HSC PNCE + V+G ++G +T + + G E+
Sbjct: 984 NE---------IIDATLKGNLSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKAVTAGTEL 1034
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TFDY K EA C CG+ CRG
Sbjct: 1035 TFDYQFQRYGK---EAQKCFCGTLSCRG 1059
>gi|393910299|gb|EFO20057.2| hypothetical protein LOAG_08436 [Loa loa]
Length = 770
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 59/176 (33%), Positives = 81/176 (46%), Gaps = 22/176 (12%)
Query: 1867 TMKMC-RGILKAMDSRPDDKYV----AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWK 1921
+ +MC L+ D+ DD ++ KG G + G D + E++G V +
Sbjct: 457 SQQMCANNFLRHHDTNDDDLFMEEKPTILKGFGAFAKCDINKGTD--LTEYVGHVMTKEE 514
Query: 1922 WFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRP 1981
+FEK R L N E ++ + L D Y VDA + N A HSC P
Sbjct: 515 YFEKLR-FRCLFNNLE---ASYFGMQLTN-----DFY----VDARNYGNIARSFNHSCEP 561
Query: 1982 NCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
N + VDG Y++ I T+R I GEE+TFDY+ TE E CLCGS CR
Sbjct: 562 NTKVDAVVVDGIYRLKISTIRDIKKGEELTFDYD--TEIIEGLVGMECLCGSTNCR 615
>gi|383421363|gb|AFH33895.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
mulatta]
gi|384949270|gb|AFI38240.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
mulatta]
gi|387540940|gb|AFJ71097.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
mulatta]
Length = 1365
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 1124
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200
>gi|355557406|gb|EHH14186.1| Putative histone-lysine N-methyltransferase NSD2 [Macaca mulatta]
Length = 1365
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 1124
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200
>gi|348530102|ref|XP_003452550.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Oreochromis
niloticus]
Length = 399
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/84 (45%), Positives = 50/84 (59%), Gaps = 3/84 (3%)
Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
D VVDA + N A I HSC PNC ++V VDG I I+ +R I+ GEE+T+DY
Sbjct: 319 DFDVVDATMQGNAARFINHSCEPNCYSRVINVDGRKHIVIFALRKIYRGEELTYDYKFPI 378
Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
E +E +C CG++ CR YLN
Sbjct: 379 E--DESNKLLCNCGARRCR-RYLN 399
>gi|344276291|ref|XP_003409942.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Loxodonta
africana]
Length = 2551
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 49/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ ++ + FV+E+ GEV K F+ + + KN + Y
Sbjct: 1545 KKGWGLRAARD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1593
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + + HSC PNCE + V+G ++G +T + + G E
Sbjct: 1594 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1647
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY K EA C CGS CRG YL
Sbjct: 1648 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1675
>gi|19913348|ref|NP_579877.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|19913350|ref|NP_579878.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|19913358|ref|NP_579890.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|109633019|ref|NP_001035889.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|74706096|sp|O96028.1|NSD2_HUMAN RecName: Full=Histone-lysine N-methyltransferase NSD2; AltName:
Full=Multiple myeloma SET domain-containing protein;
Short=MMSET; AltName: Full=Nuclear SET domain-containing
protein 2; Short=NSD2; AltName: Full=Protein trithorax-5;
AltName: Full=Wolf-Hirschhorn syndrome candidate 1
protein; Short=WHSC1
gi|3249713|gb|AAC24150.1| MMSET type II [Homo sapiens]
gi|4378019|gb|AAD19343.1| putative WHSC1 protein [Homo sapiens]
gi|4521954|gb|AAD21770.1| putative WHSC1 protein [Homo sapiens]
gi|4521955|gb|AAD21771.1| putative WHSC1 protein [Homo sapiens]
gi|5123789|emb|CAB45386.1| TRX5 protein [Homo sapiens]
gi|6683809|gb|AAF23370.1| MMSET type II [Homo sapiens]
gi|119602958|gb|EAW82552.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
gi|119602959|gb|EAW82553.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
gi|119602962|gb|EAW82556.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
gi|168273154|dbj|BAG10416.1| histone-lysine N-methyltransferase NSD2 [synthetic construct]
gi|187252511|gb|AAI66668.1| Wolf-Hirschhorn syndrome candidate 1 [synthetic construct]
Length = 1365
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 1124
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200
>gi|357619110|gb|EHJ71815.1| putative huntingtin interacting protein [Danaus plexippus]
Length = 225
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 78/157 (49%), Gaps = 20/157 (12%)
Query: 1882 PDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAP 1941
P + A +KG GV + GE F++E++GEV +++++ Q ++D
Sbjct: 78 PLKVFYADKKGCGVEATTDITNGE--FLMEYVGEVLDYDQFYKRA------QAYSDDNNL 129
Query: 1942 EFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTV 2001
Y + L KGD V+DA K N + I HSC PN E + V+G +IG ++
Sbjct: 130 HHYFMSL---KGD------TVIDATLKGNISRFINHSCEPNAETQKWTVNGELRIGFFSK 180
Query: 2002 RGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
R I GEEITFDY K A C CG++ CRG
Sbjct: 181 REISAGEEITFDYQFQRFGK---VAQRCYCGAENCRG 214
>gi|241998002|ref|XP_002433644.1| set domain protein, putative [Ixodes scapularis]
gi|215495403|gb|EEC05044.1| set domain protein, putative [Ixodes scapularis]
Length = 729
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 43/137 (31%), Positives = 72/137 (52%), Gaps = 19/137 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
DFV+E++GE+ +Q+ R L + + + + FY + L+R + ++DA
Sbjct: 596 DFVMEYVGEI------INEQECERRLSRLHLEHSSNFYFLTLDRDR---------IIDAG 640
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
+ N + + HSC PNCE + V+G ++GI+ +R I G E+TF+YN E +
Sbjct: 641 PRGNLSRFMNHSCDPNCETQKWTVNGDTRVGIFAIRDIAPGTELTFNYNLDCRGNERIK- 699
Query: 2027 SVCLCGSQVCRGSYLNL 2043
C CG+ C G Y+ L
Sbjct: 700 --CACGASNCSG-YMGL 713
>gi|10720313|sp|Q24742.1|TRX_DROVI RecName: Full=Histone-lysine N-methyltransferase trithorax
gi|899254|emb|CAA90349.1| predicted trithorax protein [Drosophila virilis]
Length = 3828
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 72/151 (47%), Gaps = 23/151 (15%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C K+ GE V+E+ GE+ IRS + + + I
Sbjct: 3701 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3746
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
K D D +VVDA + N A I HSC PNC +KV + GH I I+ +R I GEE+T
Sbjct: 3747 KID----DNLVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 3802
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY E E C CGS+ CR YLN
Sbjct: 3803 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3828
>gi|345791349|ref|XP_543382.3| PREDICTED: histone-lysine N-methyltransferase SETD1B [Canis lupus
familiaris]
Length = 1920
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1844 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSNQHINVNEEITYDYKFPIED 1903
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 1904 VK----IPCLCGSENCRGT 1918
>gi|452822785|gb|EME29801.1| myeloid/lymphoid or mixed-lineage leukemia protein 3 [Galdieria
sulphuraria]
Length = 969
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 51/141 (36%), Positives = 71/141 (50%), Gaps = 33/141 (23%)
Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----IYLERPKGDADGYD 1959
+++FV+E+ GE+ IR + D +FY+ Y+ R D
Sbjct: 852 DEEFVIEYAGEL------------IRPVIA---DIREKFYDRRKIGCYMFRLNDD----- 891
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQ-IGIYTVRGIHYGEEITFDYNSVT 2018
+VDA K NYA I HSC PNC +K+ VDG Q IGI+ R I GEE+T+DY
Sbjct: 892 -FIVDATMKGNYARFINHSCEPNCRSKIITVDGDKQVIGIFAKRNIAAGEELTYDYQF-- 948
Query: 2019 ESKEEYEASV-CLCGSQVCRG 2038
EE+ ++ C CG+ CRG
Sbjct: 949 ---EEFGETIPCNCGAPNCRG 966
>gi|426343599|ref|XP_004038381.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Gorilla gorilla gorilla]
Length = 1365
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 1124
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200
>gi|391863483|gb|EIT72791.1| histone H3 (Lys4) methyltransferase complex, subunit SET1
[Aspergillus oryzae 3.042]
Length = 1223
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1144 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1203
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1204 DSD-DRIPCLCGSTGCKG-FLN 1223
>gi|351698529|gb|EHB01448.1| Histone-lysine N-methyltransferase SETD1B [Heterocephalus glaber]
Length = 1486
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1410 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1469
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 1470 VK----IPCLCGSENCRGT 1484
>gi|171692915|ref|XP_001911382.1| hypothetical protein [Podospora anserina S mat+]
gi|170946406|emb|CAP73207.1| unnamed protein product [Podospora anserina S mat+]
Length = 1083
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 47/84 (55%), Gaps = 2/84 (2%)
Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
D V+DA K A I HSC PNC AK+ V+G +I IY +R I EE+T+DY
Sbjct: 1002 DNTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFER 1061
Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
E + CLCG+ C+G +LN
Sbjct: 1062 EIGAT-DRIPCLCGTAACKG-FLN 1083
>gi|169769549|ref|XP_001819244.1| histone-lysine N-methyltransferase, H3 lysine-4 specific [Aspergillus
oryzae RIB40]
gi|121933328|sp|Q2UMH3.1|SET1_ASPOR RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|83767103|dbj|BAE57242.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 1229
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1150 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1209
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1210 DSD-DRIPCLCGSTGCKG-FLN 1229
>gi|294658913|ref|XP_461254.2| DEHA2F20834p [Debaryomyces hansenii CBS767]
gi|218511781|sp|Q6BKL7.2|SET1_DEBHA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|202953480|emb|CAG89643.2| DEHA2F20834p [Debaryomyces hansenii CBS767]
Length = 1088
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
VVDA K A I H C P+C AK+ V+G +I IY +R I EE+T+DY E+
Sbjct: 1009 TVVDATKKGGIARFINHCCNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFEKET 1068
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ E CLCG+ C+G YLN
Sbjct: 1069 NDA-ERIRCLCGAPGCKG-YLN 1088
>gi|114592860|ref|XP_001146084.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 6
[Pan troglodytes]
gi|114592864|ref|XP_001146248.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 7
[Pan troglodytes]
gi|114592866|ref|XP_001146323.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 8
[Pan troglodytes]
gi|114592870|ref|XP_001146473.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
10 [Pan troglodytes]
gi|397483594|ref|XP_003812984.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Pan
paniscus]
gi|410227780|gb|JAA11109.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
gi|410259494|gb|JAA17713.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
gi|410299310|gb|JAA28255.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
gi|410334709|gb|JAA36301.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
Length = 1365
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 1124
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200
>gi|390351134|ref|XP_003727587.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like
[Strongylocentrotus purpuratus]
Length = 282
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 66/140 (47%), Gaps = 25/140 (17%)
Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYD 1959
D+ V+E++GE +R ++ + A E I YL R
Sbjct: 163 IAADEMVIEYVGE------------SVRQSIADSREKAYERMGIGSSYLFRIDA------ 204
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
+ ++DA N A I HSC PNC AK+ V+ +I IY+ + I+ G+EIT+DY E
Sbjct: 205 VTIIDATKSGNLARFINHSCNPNCYAKIITVESEKKIVIYSKQTINVGDEITYDYKFPIE 264
Query: 2020 SKEEYEASVCLCGSQVCRGS 2039
E CLCG+ CRG+
Sbjct: 265 D----EKISCLCGAAQCRGT 280
>gi|334327124|ref|XP_003340832.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD1B-like, partial [Monodelphis domestica]
Length = 1723
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 45/79 (56%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I EEIT+DY E
Sbjct: 1647 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHISVNEEITYDYKFPIED 1706
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 1707 VK----IPCLCGSENCRGT 1721
>gi|300796853|ref|NP_001178481.1| probable histone-lysine N-methyltransferase NSD2 [Rattus norvegicus]
Length = 1346
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1054 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1105
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1106 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1156
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1157 TFNYNLDCLGNEK---TVCRCGASNCSG 1181
>gi|47223666|emb|CAF99275.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1830
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1754 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSRQPINVNEEITYDYKFPIED 1813
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCG++ CRG+
Sbjct: 1814 VK----IPCLCGAENCRGT 1828
>gi|162318272|gb|AAI56161.1| Wolf-Hirschhorn syndrome candidate 1 (human) [synthetic construct]
gi|162318442|gb|AAI56968.1| Wolf-Hirschhorn syndrome candidate 1 (human) [synthetic construct]
Length = 1346
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1054 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1105
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1106 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1156
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1157 TFNYNLDCLGNEK---TVCRCGASNCSG 1181
>gi|50512437|gb|AAT77613.1| HSPC069 isoform b [Homo sapiens]
Length = 1211
Score = 70.5 bits (171), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 73/155 (47%), Gaps = 21/155 (13%)
Query: 1887 VAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI 1946
+ +KG G+ K+ + FV+E+ GEV K F+ + + KN +
Sbjct: 1053 LTEKKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHY 1101
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y K D ++DA K N + + HSC PNCE + V+G ++G +T + +
Sbjct: 1102 YFMALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPS 1155
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
G E+TFDY K EA C CGS CRG YL
Sbjct: 1156 GSELTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1186
>gi|410958014|ref|XP_003985618.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Felis
catus]
Length = 1300
Score = 70.1 bits (170), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1008 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 1059
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1060 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1110
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1111 TFNYNLDCLGNEK---TVCRCGASNCSG 1135
>gi|158818|gb|AAA29025.1| zinc-binding protein [Drosophila melanogaster]
Length = 3759
Score = 70.1 bits (170), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 71/151 (47%), Gaps = 23/151 (15%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C K+ GE V+E+ GE+ IRS + + + I
Sbjct: 3632 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3677
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
K D D +VVDA + N A I H C PNC +KV + GH I I+ VR I GEE+T
Sbjct: 3678 KID----DNLVVDATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFAVRRIVQGEELT 3733
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY E E C CGS+ CR YLN
Sbjct: 3734 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3759
>gi|326919530|ref|XP_003206033.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Meleagris gallopavo]
Length = 1348
Score = 70.1 bits (170), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1057 KGWGLVAKRDIKKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1108
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1109 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1159
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1160 TFNYNLDCLGNEK---TVCKCGAPNCSG 1184
>gi|255938628|ref|XP_002560084.1| Pc14g00900 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211584705|emb|CAP74231.1| Pc14g00900 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 1202
Score = 70.1 bits (170), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1123 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1182
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1183 DSD-DRIPCLCGSTGCKG-FLN 1202
>gi|224050217|ref|XP_002195834.1| PREDICTED: histone-lysine N-methyltransferase NSD2 [Taeniopygia
guttata]
Length = 1339
Score = 70.1 bits (170), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1077 KGWGLVAKRDIKKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1128
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1129 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1179
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1180 TFNYNLDCLGNEK---TVCKCGAPNCSG 1204
>gi|332025910|gb|EGI66066.1| Histone-lysine N-methyltransferase trithorax [Acromyrmex echinatior]
Length = 3452
Score = 70.1 bits (170), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 60/175 (34%), Positives = 80/175 (45%), Gaps = 23/175 (13%)
Query: 1868 MKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQD 1927
M M ILK Y ++ G G+ C ++ GE V+E+ GEV
Sbjct: 3301 MAMRFRILKETSKASVGVYYSHIHGRGLFCLRDIEPGE--MVIEYAGEV----------- 3347
Query: 1928 GIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKV 1987
IRS + + + NI K D D +VVDA K N A I HSC PNC ++V
Sbjct: 3348 -IRSSLTDKREKYYDSKNIGCYMFKID----DHLVVDATMKGNAARFINHSCEPNCYSRV 3402
Query: 1988 TAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+ G I I+ +R I GEE+T+DY E + C CGS+ CR YLN
Sbjct: 3403 VDILGKKHILIFALRRIIQGEELTYDYKFPFEDIK----IPCTCGSRKCR-KYLN 3452
>gi|62088596|dbj|BAD92745.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila) variant [Homo sapiens]
Length = 2880
Score = 70.1 bits (170), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 2751 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 2793
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 2794 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 2847
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 2848 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 2880
>gi|469800|emb|CAA83516.1| predicted trithorax protein [Drosophila melanogaster]
gi|1052593|emb|CAA90513.1| trithorax protein trxII [Drosophila melanogaster]
gi|1311653|gb|AAB35873.1| large trx isoform=trithorax gene product large isoform {alternatively
spliced, exon II-containing isoform} [Drosophila,
embryos, Peptide, 3726 aa]
Length = 3726
Score = 70.1 bits (170), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 71/151 (47%), Gaps = 23/151 (15%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C K+ GE V+E+ GE+ IRS + + + I
Sbjct: 3599 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3644
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
K D D +VVDA + N A I H C PNC +KV + GH I I+ VR I GEE+T
Sbjct: 3645 KID----DNLVVDATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFAVRRIVQGEELT 3700
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY E E C CGS+ CR YLN
Sbjct: 3701 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3726
>gi|469801|emb|CAA83515.1| predicted trithorax protein [Drosophila melanogaster]
gi|1052594|emb|CAA90514.1| trithorax protein trxI [Drosophila melanogaster]
Length = 3358
Score = 70.1 bits (170), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 71/151 (47%), Gaps = 23/151 (15%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C K+ GE V+E+ GE+ IRS + + + I
Sbjct: 3231 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3276
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
K D D +VVDA + N A I H C PNC +KV + GH I I+ VR I GEE+T
Sbjct: 3277 KID----DNLVVDATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFAVRRIVQGEELT 3332
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY E E C CGS+ CR YLN
Sbjct: 3333 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3358
>gi|195156904|ref|XP_002019336.1| GL12290 [Drosophila persimilis]
gi|194115927|gb|EDW37970.1| GL12290 [Drosophila persimilis]
Length = 1548
Score = 70.1 bits (170), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N A I HSC PNC AKV ++ +I IY+ + I EEIT+DY E
Sbjct: 1472 TIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGVNEEITYDYKFPLED 1531
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
E CLCG+Q CRG+
Sbjct: 1532 ----EKIPCLCGAQGCRGT 1546
>gi|295424166|ref|NP_780440.2| histone-lysine N-methyltransferase NSD2 isoform 2 [Mus musculus]
gi|118572947|sp|Q8BVE8.2|NSD2_MOUSE RecName: Full=Histone-lysine N-methyltransferase NSD2; AltName:
Full=Multiple myeloma SET domain-containing protein;
Short=MMSET; AltName: Full=Nuclear SET domain-containing
protein 2; Short=NSD2; AltName: Full=Wolf-Hirschhorn
syndrome candidate 1 protein homolog; Short=WHSC1
Length = 1365
Score = 70.1 bits (170), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1124
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200
>gi|198452207|ref|XP_002137435.1| GA27210, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198131831|gb|EDY67993.1| GA27210, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 3779
Score = 70.1 bits (170), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 72/151 (47%), Gaps = 23/151 (15%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C K+ GE V+E+ GE+ IRS + + + I
Sbjct: 3652 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3697
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
K D D +VVDA + N A I HSC PNC +KV + GH I I+ +R I GEE+T
Sbjct: 3698 KID----DNLVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 3753
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY E E C CGS+ CR YLN
Sbjct: 3754 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3779
>gi|355786615|gb|EHH66798.1| hypothetical protein EGM_03852, partial [Macaca fascicularis]
Length = 673
Score = 70.1 bits (170), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 597 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 656
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 657 VK----IPCLCGSENCRGT 671
>gi|295424164|ref|NP_001074571.2| histone-lysine N-methyltransferase NSD2 isoform 1 [Mus musculus]
Length = 1366
Score = 70.1 bits (170), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1074 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1125
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1126 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1176
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1177 TFNYNLDCLGNEK---TVCRCGASNCSG 1201
>gi|354483938|ref|XP_003504149.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Cricetulus griseus]
Length = 1365
Score = 70.1 bits (170), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1124
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200
>gi|281339990|gb|EFB15574.1| hypothetical protein PANDA_004672 [Ailuropoda melanoleuca]
Length = 1363
Score = 70.1 bits (170), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1071 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1122
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1123 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1173
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1174 TFNYNLDCLGNEK---TVCRCGASNCSG 1198
>gi|147899914|ref|NP_001087630.1| histone-lysine N-methyltransferase SETD1B [Xenopus laevis]
gi|82234463|sp|Q66J90.1|SET1B_XENLA RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
Full=SET domain-containing protein 1B
gi|51703454|gb|AAH81016.1| MGC81602 protein [Xenopus laevis]
Length = 1938
Score = 70.1 bits (170), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1862 TIIDATKCGNFARFINHSCNPNCYAKVVTVESQKKIVIYSKQYINVNEEITYDYKFPIED 1921
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCG++ CRG+
Sbjct: 1922 VK----IPCLCGAENCRGT 1936
>gi|350588548|ref|XP_003357368.2| PREDICTED: histone-lysine N-methyltransferase MLL [Sus scrofa]
Length = 2525
Score = 70.1 bits (170), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 2396 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 2438
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 2439 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 2492
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 2493 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 2525
>gi|67539250|ref|XP_663399.1| hypothetical protein AN5795.2 [Aspergillus nidulans FGSC A4]
gi|74680884|sp|Q5B0Y5.1|SET1_EMENI RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|40743698|gb|EAA62888.1| hypothetical protein AN5795.2 [Aspergillus nidulans FGSC A4]
gi|259484715|tpe|CBF81174.1| TPA: Histone-lysine N-methyltransferase, H3 lysine-4 specific (EC
2.1.1.43)(COMPASS component SET1)(SET domain-containing
protein 1) [Source:UniProtKB/Swiss-Prot;Acc:Q5B0Y5]
[Aspergillus nidulans FGSC A4]
Length = 1220
Score = 70.1 bits (170), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1141 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1200
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1201 DSD-DRIPCLCGSAGCKG-FLN 1220
>gi|328778088|ref|XP_392252.4| PREDICTED: histone-lysine N-methyltransferase trithorax [Apis
mellifera]
Length = 3195
Score = 70.1 bits (170), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 59/175 (33%), Positives = 81/175 (46%), Gaps = 23/175 (13%)
Query: 1868 MKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQD 1927
M M ILK Y ++ G G+ C ++ GE V+E+ GEV
Sbjct: 3044 MAMRFRILKETSKESVGVYHSHIHGRGLFCLRDIEAGE--MVIEYAGEV----------- 3090
Query: 1928 GIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKV 1987
IR+ + + + NI K D D +VVDA K N A I HSC PNC ++V
Sbjct: 3091 -IRASLTDKREKYYDSKNIGCYMFKID----DHLVVDATMKGNAARFINHSCEPNCYSRV 3145
Query: 1988 TAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+ G I I+ +R I+ GEE+T+DY E + C CGS+ CR YLN
Sbjct: 3146 VDILGKKHILIFALRRINQGEELTYDYKFPFEDIK----IPCTCGSRRCR-KYLN 3195
>gi|395513793|ref|XP_003761107.1| PREDICTED: uncharacterized protein LOC100928096 [Sarcophilus
harrisii]
Length = 1224
Score = 70.1 bits (170), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 46/80 (57%), Gaps = 4/80 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1148 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1207
Query: 2021 KEEYEASVCLCGSQVCRGSY 2040
+ CLCGS+ CRG+
Sbjct: 1208 VK----IPCLCGSENCRGTL 1223
>gi|344244292|gb|EGW00396.1| putative histone-lysine N-methyltransferase NSD2 [Cricetulus griseus]
Length = 1344
Score = 70.1 bits (170), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1052 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1103
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1104 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1154
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1155 TFNYNLDCLGNEK---TVCRCGASNCSG 1179
>gi|157127309|ref|XP_001654916.1| hypothetical protein AaeL_AAEL010807 [Aedes aegypti]
gi|108872954|gb|EAT37179.1| AAEL010807-PA [Aedes aegypti]
Length = 1670
Score = 70.1 bits (170), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N A I HSC PNC AKV ++ +I IY+ + I EEIT+DY E
Sbjct: 1594 TIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQAIGINEEITYDYKFPLED 1653
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
E CLCG+Q CRG+
Sbjct: 1654 ----EKIPCLCGAQGCRGT 1668
>gi|260944792|ref|XP_002616694.1| hypothetical protein CLUG_03935 [Clavispora lusitaniae ATCC 42720]
gi|238850343|gb|EEQ39807.1| hypothetical protein CLUG_03935 [Clavispora lusitaniae ATCC 42720]
Length = 469
Score = 70.1 bits (170), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C P+C AK+ VDG +I IY +R I EE+T+DY E+
Sbjct: 390 TVIDATKKGGIARFINHCCNPSCTAKIIKVDGKKRIVIYALRDIEANEELTYDYKFERET 449
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ E CLCG+ C+G YLN
Sbjct: 450 NDA-ERIRCLCGAPGCKG-YLN 469
>gi|190344535|gb|EDK36223.2| hypothetical protein PGUG_00321 [Meyerozyma guilliermondii ATCC 6260]
Length = 1055
Score = 70.1 bits (170), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 35/81 (43%), Positives = 48/81 (59%), Gaps = 2/81 (2%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
V+DA K A I H C P+C AK+ V+G +I IY +R I EE+T+DY E+
Sbjct: 977 VIDATKKGGIARFINHCCNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERETN 1036
Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
++ E CLCG+ C+G YLN
Sbjct: 1037 DD-ERIRCLCGAPGCKG-YLN 1055
>gi|383861703|ref|XP_003706324.1| PREDICTED: uncharacterized protein LOC100882965 [Megachile rotundata]
Length = 3434
Score = 70.1 bits (170), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 82/177 (46%), Gaps = 27/177 (15%)
Query: 1868 MKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQD 1927
M M ILK Y ++ G G+ C ++ GE V+E+ GEV
Sbjct: 3283 MAMRFRILKETSKESVGVYHSHIHGRGLFCLRDIEAGE--MVIEYAGEV----------- 3329
Query: 1928 GIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKV 1987
IR+ + + + NI K D D +VVDA K N A I HSC PNC ++V
Sbjct: 3330 -IRASLTDKREKYYDSKNIGCYMFKID----DHLVVDATMKGNAARFINHSCEPNCYSRV 3384
Query: 1988 TAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE--ASVCLCGSQVCRGSYLN 2042
+ G I I+ +R I+ GEE+T+DY K +E C CGS+ CR YLN
Sbjct: 3385 VDILGKKHILIFALRRINQGEELTYDY------KFPFEDIKIPCTCGSRRCR-KYLN 3434
>gi|301606681|ref|XP_002932945.1| PREDICTED: histone-lysine N-methyltransferase MLL isoform 2 [Xenopus
(Silurana) tropicalis]
Length = 3840
Score = 70.1 bits (170), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 51/157 (32%), Positives = 72/157 (45%), Gaps = 33/157 (21%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C + GE V+E+ G V IRS+ + + ++Y+
Sbjct: 3711 GRGLFCRRNIDAGE--MVIEYSGNV------------IRSILTDKRE---KYYD------ 3747
Query: 1952 KGDADGY------DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
G G D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3748 -GKGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVIPIDGQKHIVIFAMRKIY 3806
Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E A C CG++ CR +LN
Sbjct: 3807 RGEELTYDYKFPIEDANNKLA--CNCGTKKCR-KFLN 3840
>gi|301606679|ref|XP_002932944.1| PREDICTED: histone-lysine N-methyltransferase MLL isoform 1 [Xenopus
(Silurana) tropicalis]
Length = 3855
Score = 70.1 bits (170), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 51/157 (32%), Positives = 72/157 (45%), Gaps = 33/157 (21%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C + GE V+E+ G V IRS+ + + ++Y+
Sbjct: 3726 GRGLFCRRNIDAGE--MVIEYSGNV------------IRSILTDKRE---KYYD------ 3762
Query: 1952 KGDADGY------DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
G G D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 3763 -GKGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVIPIDGQKHIVIFAMRKIY 3821
Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E A C CG++ CR +LN
Sbjct: 3822 RGEELTYDYKFPIEDANNKLA--CNCGTKKCR-KFLN 3855
>gi|198454568|ref|XP_002137902.1| GA26260 [Drosophila pseudoobscura pseudoobscura]
gi|198132853|gb|EDY68460.1| GA26260 [Drosophila pseudoobscura pseudoobscura]
Length = 1755
Score = 70.1 bits (170), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N A I HSC PNC AKV ++ +I IY+ + I EEIT+DY E
Sbjct: 1679 TIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGVNEEITYDYKFPLED 1738
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
E CLCG+Q CRG+
Sbjct: 1739 ----EKIPCLCGAQGCRGT 1753
>gi|146422003|ref|XP_001486944.1| hypothetical protein PGUG_00321 [Meyerozyma guilliermondii ATCC 6260]
Length = 1055
Score = 70.1 bits (170), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 35/81 (43%), Positives = 48/81 (59%), Gaps = 2/81 (2%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
V+DA K A I H C P+C AK+ V+G +I IY +R I EE+T+DY E+
Sbjct: 977 VIDATKKGGIARFINHCCNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERETN 1036
Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
++ E CLCG+ C+G YLN
Sbjct: 1037 DD-ERIRCLCGAPGCKG-YLN 1055
>gi|6841376|gb|AAF29041.1|AF161554_1 HSPC069 [Homo sapiens]
Length = 591
Score = 70.1 bits (170), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 73/155 (47%), Gaps = 21/155 (13%)
Query: 1887 VAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI 1946
+ +KG G+ K+ + FV+E+ GEV K F+ + + KN +
Sbjct: 106 LTEKKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHY 154
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y K D ++DA K N + + HSC PNCE + V+G ++G +T + +
Sbjct: 155 YFMALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPS 208
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
G E+TFDY K EA C CGS CRG YL
Sbjct: 209 GSELTFDYQFQRYGK---EAQKCFCGSANCRG-YL 239
>gi|390178053|ref|XP_003736554.1| GA27210, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859306|gb|EIM52627.1| GA27210, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 3474
Score = 70.1 bits (170), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 72/151 (47%), Gaps = 23/151 (15%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C K+ GE V+E+ GE+ IRS + + + I
Sbjct: 3347 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3392
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
K D D +VVDA + N A I HSC PNC +KV + GH I I+ +R I GEE+T
Sbjct: 3393 KID----DNLVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 3448
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY E E C CGS+ CR YLN
Sbjct: 3449 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3474
>gi|355564772|gb|EHH21272.1| hypothetical protein EGK_04290, partial [Macaca mulatta]
Length = 663
Score = 70.1 bits (170), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 587 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 646
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 647 VK----IPCLCGSENCRGT 661
>gi|348571627|ref|XP_003471597.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 2
[Cavia porcellus]
Length = 1367
Score = 70.1 bits (170), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1074 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1125
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1126 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1176
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1177 TFNYNLDCLGNEK---TVCRCGASNCSG 1201
>gi|348571625|ref|XP_003471596.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Cavia porcellus]
Length = 1366
Score = 70.1 bits (170), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1124
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200
>gi|332028801|gb|EGI68830.1| Putative histone-lysine N-methyltransferase NSD2 [Acromyrmex
echinatior]
Length = 1304
Score = 70.1 bits (170), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 66/131 (50%), Gaps = 18/131 (13%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
FV+E++GE+ + + R L + E FY + ++ + +DA
Sbjct: 973 FVIEYVGEI------IDDAEYKRRLHRKKELKNENFYFLTIDNNR---------TIDAEP 1017
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
K N + + HSC PNCE + V+G +IG++ +R I GEE+TF+YN ++ +
Sbjct: 1018 KGNLSRFMNHSCAPNCETQKWTVNGDTRIGLFALRDIESGEELTFNYNLASDGETR---K 1074
Query: 2028 VCLCGSQVCRG 2038
CLCG+ C G
Sbjct: 1075 ACLCGAPNCSG 1085
>gi|358056897|dbj|GAA97247.1| hypothetical protein E5Q_03924 [Mixia osmundae IAM 14324]
Length = 949
Score = 70.1 bits (170), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 51/152 (33%), Positives = 69/152 (45%), Gaps = 28/152 (18%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPE----FYNI 1946
KG GV ++ +D FV E++GEV G LQK +D E FY +
Sbjct: 279 KGFGVRAAED--MLKDAFVYEYIGEVV----------GAGQLQKRMKDYYEEGIEHFYFM 326
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
L+R + +DA K N + HSC PNC V ++GI+T R I
Sbjct: 327 ALQREE---------FIDATKKGNKGRFLNHSCSPNCYVSKWVVGEKMRMGIFTKRKIQA 377
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
GEE+TF+YN + +EA C CG C G
Sbjct: 378 GEELTFNYNV---DRYGHEAQPCYCGEANCVG 406
>gi|301762334|ref|XP_002916587.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2-like [Ailuropoda melanoleuca]
Length = 1364
Score = 70.1 bits (170), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1072 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1123
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1124 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1174
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1175 TFNYNLDCLGNEK---TVCRCGASNCSG 1199
>gi|195446231|ref|XP_002070688.1| GK10891 [Drosophila willistoni]
gi|194166773|gb|EDW81674.1| GK10891 [Drosophila willistoni]
Length = 447
Score = 70.1 bits (170), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 72/151 (47%), Gaps = 23/151 (15%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C K+ GE V+E+ GE+ IRS + + + I
Sbjct: 320 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 365
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
K D D +VVDA + N A I HSC PNC +KV + GH I I+ +R I GEE+T
Sbjct: 366 KID----DNLVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 421
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY E E C CGS+ CR YLN
Sbjct: 422 YDYKFPFEE----EKIPCSCGSKRCR-KYLN 447
>gi|47225482|emb|CAG11965.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1625
Score = 70.1 bits (170), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 47/155 (30%), Positives = 73/155 (47%), Gaps = 20/155 (12%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
D + KG G+ K+ + FV+E+ GEV K F+ + + KN +
Sbjct: 295 DVILTENKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKTRVKEYARNKNIH-----Y 346
Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
Y + L+ + ++DA K N + + HSC PNCE + V+G ++G +T +
Sbjct: 347 YFMSLKNNE---------IIDATLKGNLSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKA 397
Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ G E+TFDY K EA C CG+ CRG
Sbjct: 398 VTAGTELTFDYQFQRYGK---EAQKCFCGTPNCRG 429
>gi|18406465|ref|NP_566010.1| histone-lysine N-methyltransferase ASHH3 [Arabidopsis thaliana]
gi|94707125|sp|Q945S8.2|ASHH3_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHH3; AltName:
Full=ASH1 homolog 3; AltName: Full=Protein SET DOMAIN
GROUP 7
gi|15028059|gb|AAK76560.1| unknown protein [Arabidopsis thaliana]
gi|20197070|gb|AAC23419.2| expressed protein [Arabidopsis thaliana]
gi|20259301|gb|AAM14386.1| unknown protein [Arabidopsis thaliana]
gi|330255289|gb|AEC10383.1| histone-lysine N-methyltransferase ASHH3 [Arabidopsis thaliana]
Length = 363
Score = 70.1 bits (170), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 57/187 (30%), Positives = 84/187 (44%), Gaps = 36/187 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+V +E GE F++E++GEV + + L K FY + R
Sbjct: 127 GSGIVAEEEIEAGE--FIIEYVGEV------IDDKTCEERLWKMKHRGETNFYLCEITRD 178
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
+V+DA HK N + I HSC PN + + +DG +IGI+ RGI GE +T
Sbjct: 179 ---------MVIDATHKGNKSRYINHSCNPNTQMQKWIIDGETRIGIFATRGIKKGEHLT 229
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCR------GSYLNLTGEGAFEKVLKEL--------- 2056
+DY V ++ C CG+ CR S + + AF V EL
Sbjct: 230 YDYQFVQFGADQD----CHCGAVGCRRKLGVKPSKPKIASDEAFNLVAHELAQTLPKVHQ 285
Query: 2057 HGLLDRH 2063
+GL++RH
Sbjct: 286 NGLVNRH 292
>gi|307111585|gb|EFN59819.1| hypothetical protein CHLNCDRAFT_18588, partial [Chlorella variabilis]
Length = 380
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 52/160 (32%), Positives = 73/160 (45%), Gaps = 39/160 (24%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVY----------PVWKWFEKQDGIRSLQKNNEDPA 1940
KG G+ ++ G+ F++E+LGEV WK + + G R
Sbjct: 183 KGFGLFAAEDMKAGQ--FLIEYLGEVLEEEEYHRRQGAAWKEYFIETGQRHYY------- 233
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
F N+ G+ + V+DA + N I HSC PNCE + V G IG++T
Sbjct: 234 --FMNV------GNGE-----VIDASRRGNLGRFINHSCEPNCETQKWVVHGELAIGLFT 280
Query: 2001 VRGIHYGEEITFDYNSVTESKEEY--EASVCLCGSQVCRG 2038
+ I G E+TFDYN E Y + CLCGS+ CRG
Sbjct: 281 LEDISAGTELTFDYNF-----ERYGDKPMKCLCGSKNCRG 315
>gi|119185079|ref|XP_001243361.1| hypothetical protein CIMG_07257 [Coccidioides immitis RS]
gi|121936913|sp|Q1DR06.1|SET1_COCIM RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|392866240|gb|EAS28850.2| histone-lysine N-methyltransferase, H3 lysine-4 specific
[Coccidioides immitis RS]
Length = 1271
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1192 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREW 1251
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1252 DSD-DRIPCLCGSAGCKG-FLN 1271
>gi|444724926|gb|ELW65512.1| Histone-lysine N-methyltransferase SETD1B [Tupaia chinensis]
Length = 1554
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 45/79 (56%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I EEIT+DY E
Sbjct: 1478 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHISVNEEITYDYKFPIED 1537
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 1538 IK----IPCLCGSENCRGT 1552
>gi|348573849|ref|XP_003472703.1| PREDICTED: histone-lysine N-methyltransferase MLL-like, partial
[Cavia porcellus]
Length = 2799
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+Q + + ++Y+
Sbjct: 2670 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 2712
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D VVDA N A I HSC PNC ++V +DG I I+ +R I+
Sbjct: 2713 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 2766
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CG++ CR +LN
Sbjct: 2767 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 2799
>gi|320032561|gb|EFW14513.1| histone-lysine N-methyltransferase [Coccidioides posadasii str.
Silveira]
Length = 1271
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1192 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREW 1251
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1252 DSD-DRIPCLCGSAGCKG-FLN 1271
>gi|303313714|ref|XP_003066866.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
gi|240106533|gb|EER24721.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
Length = 1271
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1192 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREW 1251
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1252 DSD-DRIPCLCGSAGCKG-FLN 1271
>gi|149047443|gb|EDM00113.1| similar to Wolf-Hirschhorn syndrome candidate 1 protein isoform 3
(predicted) [Rattus norvegicus]
Length = 1298
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1006 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1057
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1058 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1108
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1109 TFNYNLDCLGNEK---TVCRCGASNCSG 1133
>gi|345798392|ref|XP_536224.3| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Canis lupus familiaris]
Length = 1364
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1072 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1123
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1124 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1174
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1175 TFNYNLDCLGNEK---TVCRCGASNCSG 1199
>gi|389746109|gb|EIM87289.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 191
Score = 69.7 bits (169), Expect = 2e-08, Method: Composition-based stats.
Identities = 52/139 (37%), Positives = 68/139 (48%), Gaps = 26/139 (18%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVV 1963
+ V+E++GEV IR+ + + A E I YL R D +VV
Sbjct: 76 EMVIEYVGEV------------IRAQIADKREKAYERQGIGSSYLFRIDED------LVV 117
Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
DA K N I HSC PNC AK+ + G +I IY + I G+EIT+DY+ E
Sbjct: 118 DATKKGNLGRLINHSCDPNCTAKIITILGEKKIVIYAKQDIELGDEITYDYHFPIEQ--- 174
Query: 2024 YEASVCLCGSQVCRGSYLN 2042
+ CLCGS CRG YLN
Sbjct: 175 -DKIPCLCGSARCRG-YLN 191
>gi|195356446|ref|XP_002044683.1| GM18767 [Drosophila sechellia]
gi|194133849|gb|EDW55365.1| GM18767 [Drosophila sechellia]
Length = 1637
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N A I HSC PNC AKV ++ +I IY+ + I EEIT+DY E
Sbjct: 1561 TIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEE 1620
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
E CLCG+Q CRG+
Sbjct: 1621 ----EKIPCLCGAQGCRGT 1635
>gi|350587283|ref|XP_003128857.3| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Sus
scrofa]
Length = 1338
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 78/148 (52%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + +++ E FY + +++
Sbjct: 1046 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIRRAQEHDITRFYMLTIDK 1097
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1098 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1148
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1149 TFNYNLDCLGNEK---TVCRCGASNCSG 1173
>gi|195453659|ref|XP_002073883.1| GK12911 [Drosophila willistoni]
gi|194169968|gb|EDW84869.1| GK12911 [Drosophila willistoni]
Length = 1765
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N A I HSC PNC AKV ++ +I IY+ + I EEIT+DY E
Sbjct: 1689 TIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGVNEEITYDYKFPLED 1748
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
E CLCG+Q CRG+
Sbjct: 1749 ----EKIPCLCGAQGCRGT 1763
>gi|15213542|gb|AAK92049.1|AF322907_1 NSD1 [Homo sapiens]
Length = 2596
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 71/132 (53%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ + ++ +E+ FY + +++ + ++DA
Sbjct: 1863 EFVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDKDR---------IIDAG 1907
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NY+ + HSC+PNCE V+G ++G++ V I G E+TF+YN E+
Sbjct: 1908 PKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK--- 1964
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1965 TVCRCGASNCSG 1976
>gi|242073096|ref|XP_002446484.1| hypothetical protein SORBIDRAFT_06g016720 [Sorghum bicolor]
gi|241937667|gb|EES10812.1| hypothetical protein SORBIDRAFT_06g016720 [Sorghum bicolor]
Length = 521
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 73/151 (48%), Gaps = 26/151 (17%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G+V ++ G+ FV+E+ GEV W E + R Q + Y IYL
Sbjct: 95 RGWGLVADENIMAGQ--FVIEYCGEVI---SWKESK---RRAQAYETQGLKDAYIIYLNA 146
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ +DA K N+A I HSC+PNCE + V G ++GI+ + I +G E+
Sbjct: 147 DES---------IDATRKGNFARFINHSCQPNCETRKWNVLGEVRVGIFAKQDIPFGTEL 197
Query: 2011 TFDYNSVTESKEEYEASV---CLCGSQVCRG 2038
++DYN E+ V CLCG+ C G
Sbjct: 198 SYDYNF------EWYGGVMVRCLCGAASCSG 222
>gi|149756942|ref|XP_001488967.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Equus caballus]
Length = 1365
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1124
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200
>gi|121709862|ref|XP_001272547.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119400697|gb|EAW11121.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 1232
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1153 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1212
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1213 DSD-DRIPCLCGSTGCKG-FLN 1232
>gi|323348281|gb|EGA82530.1| Set1p [Saccharomyces cerevisiae Lalvin QA23]
Length = 980
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C PNC AK+ V G +I IY +R I EE+T+DY E
Sbjct: 901 TVIDATKKGGIARFINHCCNPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFERE- 959
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
K++ E CLCG+ C+G +LN
Sbjct: 960 KDDEERLPCLCGAPNCKG-FLN 980
>gi|195453973|ref|XP_002074027.1| GK14418 [Drosophila willistoni]
gi|194170112|gb|EDW85013.1| GK14418 [Drosophila willistoni]
Length = 1420
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/169 (29%), Positives = 84/169 (49%), Gaps = 20/169 (11%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G+VC + E DFV+E++GEV + F+K R LQK D +Y + +E+
Sbjct: 1206 RGFGLVCRE--AIAEGDFVIEYVGEVINHAE-FQK----RMLQKQ-RDRDENYYFLGVEK 1257
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
++DA K N A + HSC PNCE + +V+ +++G++ ++ I E+
Sbjct: 1258 D---------FIIDAGPKGNLARFMNHSCEPNCETQKWSVNCIHRVGLFAIKDIPANTEL 1308
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRGSY-LNLTGEGAFEKVLKELHG 2058
TF+Y + + C CG++ C G L +G E +L+G
Sbjct: 1309 TFNY--LWDDLMNNGKKACYCGAERCSGQIGGKLKDQGLKETTSAQLNG 1355
>gi|170095481|ref|XP_001878961.1| histone methyltransferase [Laccaria bicolor S238N-H82]
gi|164646265|gb|EDR10511.1| histone methyltransferase [Laccaria bicolor S238N-H82]
Length = 144
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/139 (35%), Positives = 67/139 (48%), Gaps = 26/139 (18%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVV 1963
+ V+E++GEV IR+ + E I YL R D +VV
Sbjct: 29 EMVIEYVGEV------------IRAQVAEKREKTYERQGIGSSYLFRIDED------LVV 70
Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
DA K N I HSC PNC AK+ + G +I IY + I G+EIT+DY+ E
Sbjct: 71 DATKKGNLGRLINHSCDPNCTAKIITISGEKKIVIYAKQDIELGDEITYDYHFPFEQ--- 127
Query: 2024 YEASVCLCGSQVCRGSYLN 2042
+ +CLCGS CRG +LN
Sbjct: 128 -DKILCLCGSVKCRG-FLN 144
>gi|148705490|gb|EDL37437.1| mCG16344 [Mus musculus]
Length = 1298
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 1006 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1057
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 1058 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1108
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1109 TFNYNLDCLGNEK---TVCRCGASNCSG 1133
>gi|431897323|gb|ELK06585.1| Putative histone-lysine N-methyltransferase NSD2 [Pteropus alecto]
Length = 502
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++ + + +++ +E+ FY + +++
Sbjct: 143 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEDECMARIKRAHENDITHFYMLTIDK 194
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 195 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 245
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 246 TFNYNLDCLGNEK---TVCRCGASNCSG 270
>gi|326674803|ref|XP_003200208.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Danio
rerio]
Length = 1428
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 31/77 (40%), Positives = 43/77 (55%), Gaps = 3/77 (3%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
++DA K N + + HSC PNCE + V+G +IG +T + + G E+TFDY K
Sbjct: 645 IIDATLKGNCSRFMNHSCEPNCETQKWTVNGQLRIGFFTTKAVTAGTELTFDYQFQRYGK 704
Query: 2022 EEYEASVCLCGSQVCRG 2038
EA C CG+ CRG
Sbjct: 705 ---EAQKCFCGAPSCRG 718
>gi|195496958|ref|XP_002095897.1| GE25383 [Drosophila yakuba]
gi|194181998|gb|EDW95609.1| GE25383 [Drosophila yakuba]
Length = 1628
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N A I HSC PNC AKV ++ +I IY+ + I EEIT+DY E
Sbjct: 1552 TIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEE 1611
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
E CLCG+Q CRG+
Sbjct: 1612 ----EKIPCLCGAQGCRGT 1626
>gi|50312247|ref|XP_456155.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|74636430|sp|Q6CIT4.1|SET1_KLULA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|49645291|emb|CAG98863.1| KLLA0F24134p [Kluyveromyces lactis]
Length = 1000
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 48/82 (58%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I H C P+C AK+ VDG +I IY +R I EE+T+DY E+
Sbjct: 921 TVIDATKRGGIARFINHCCEPSCTAKIIKVDGRKRIVIYALRDIGTNEELTYDYKFERET 980
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+E E CLCG+ C+G +LN
Sbjct: 981 -DEGERLPCLCGAPSCKG-FLN 1000
>gi|259146872|emb|CAY80128.1| Set1p [Saccharomyces cerevisiae EC1118]
Length = 1080
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C PNC AK+ V G +I IY +R I EE+T+DY E
Sbjct: 1001 TVIDATKKGGIARFINHCCNPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREK 1060
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+E E CLCG+ C+G +LN
Sbjct: 1061 DDE-ERLPCLCGAPNCKG-FLN 1080
>gi|122937787|gb|ABM68621.1| AAEL000054-PA [Aedes aegypti]
Length = 3489
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 75/157 (47%), Gaps = 23/157 (14%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
Y ++ G G+ CN++ GE V+E+ GE+ IRS + + +
Sbjct: 3356 YRSHIHGRGLFCNRDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRG 3401
Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
I K D + VVDA + N A I HSC PNC +KV + GH I I+ +R I
Sbjct: 3402 IGCYMFKID----EHFVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIV 3457
Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CGS+ CR YLN
Sbjct: 3458 QGEELTYDYKFPFEDVK----IPCSCGSKKCR-KYLN 3489
>gi|328875054|gb|EGG23419.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 1359
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 35/81 (43%), Positives = 44/81 (54%), Gaps = 4/81 (4%)
Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
D ++DA K N A I H C PNC AKV + G +I IY R I+ GEE+T+DY
Sbjct: 1281 DDTIIDATFKGNQARFINHCCDPNCMAKVITMGGQKKIIIYAKRDINVGEELTYDYKFPI 1340
Query: 2019 ESKEEYEASVCLCGSQVCRGS 2039
E + CLC S CRG+
Sbjct: 1341 EDVK----IPCLCKSAKCRGT 1357
>gi|118404602|ref|NP_001072649.1| histone-lysine N-methyltransferase SETD1B [Xenopus (Silurana)
tropicalis]
gi|123884540|sp|Q08D57.1|SET1B_XENTR RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
Full=SET domain-containing protein 1B
gi|115312893|gb|AAI23933.1| hypothetical protein MGC145850 [Xenopus (Silurana) tropicalis]
Length = 1956
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1880 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQYINVNEEITYDYKFPIED 1939
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCG++ CRG+
Sbjct: 1940 VK----IPCLCGAENCRGT 1954
>gi|302666919|ref|XP_003025054.1| hypothetical protein TRV_00712 [Trichophyton verrucosum HKI 0517]
gi|291189136|gb|EFE44443.1| hypothetical protein TRV_00712 [Trichophyton verrucosum HKI 0517]
Length = 1376
Score = 69.7 bits (169), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1297 TVIDATKHGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1356
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1357 DSD-DRIPCLCGSTGCKG-FLN 1376
>gi|194898301|ref|XP_001978769.1| GG11901 [Drosophila erecta]
gi|190650472|gb|EDV47727.1| GG11901 [Drosophila erecta]
Length = 1626
Score = 69.3 bits (168), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N A I HSC PNC AKV ++ +I IY+ + I EEIT+DY E
Sbjct: 1550 TIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEE 1609
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
E CLCG+Q CRG+
Sbjct: 1610 ----EKIPCLCGAQGCRGT 1624
>gi|348527268|ref|XP_003451141.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Oreochromis
niloticus]
Length = 1605
Score = 69.3 bits (168), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 73/148 (49%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G+ N+ + DFV E++GEV + ++ + +++ +E+ FY + L +
Sbjct: 1315 RGWGLRTNQ--ALKKGDFVTEYVGEV------IDSEECQQRIKRAHENHVTNFYMLTLTK 1366
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ V+DA K N + I HSC PNCE + V+G +IGI+ + I G E+
Sbjct: 1367 DR---------VIDAGPKGNSSRFINHSCSPNCETQKWTVNGDVRIGIFALCDIEAGTEL 1417
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN + C CGS C G
Sbjct: 1418 TFNYNLHCVGNRR---TSCHCGSDNCSG 1442
>gi|195388606|ref|XP_002052970.1| GJ23622 [Drosophila virilis]
gi|194151056|gb|EDW66490.1| GJ23622 [Drosophila virilis]
Length = 1687
Score = 69.3 bits (168), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N A I HSC PNC AKV ++ +I IY+ + I EEIT+DY E
Sbjct: 1611 TIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEE 1670
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
E CLCG+Q CRG+
Sbjct: 1671 ----EKIPCLCGAQGCRGT 1685
>gi|302839691|ref|XP_002951402.1| histone H3 Lys 36 methyltransferase/ASH1 [Volvox carteri f.
nagariensis]
gi|300263377|gb|EFJ47578.1| histone H3 Lys 36 methyltransferase/ASH1 [Volvox carteri f.
nagariensis]
Length = 2345
Score = 69.3 bits (168), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 44/137 (32%), Positives = 69/137 (50%), Gaps = 19/137 (13%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
F++E+ GEV + ++ R ++ + P FY + L A G + +DA
Sbjct: 1595 FIIEYAGEV------IDDRELGRRMEHARMNGEPHFYIMEL------AAG---LYIDARR 1639
Query: 1968 KANYASRICHSCRPNCEAKV--TAVDGHYQIGIYTVRGIHYGEEITFDY--NSVTESKEE 2023
K N A I SC PNCE + A G ++GI+ R I GEE+ +DY ++ K+
Sbjct: 1640 KGNIARLINSSCDPNCETQKWHDASTGEIRVGIFASRDIPPGEELVYDYFFSTYGAIKQS 1699
Query: 2024 YEASVCLCGSQVCRGSY 2040
+ VC+CGS+ CRG+
Sbjct: 1700 AASFVCMCGSKNCRGTM 1716
>gi|367010698|ref|XP_003679850.1| hypothetical protein TDEL_0B05100 [Torulaspora delbrueckii]
gi|359747508|emb|CCE90639.1| hypothetical protein TDEL_0B05100 [Torulaspora delbrueckii]
Length = 1019
Score = 69.3 bits (168), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C P+C AK+ V G +I IY +R I EE+T+DY E+
Sbjct: 940 TVIDATKKGGIARFINHCCDPSCTAKIIKVGGKKRIVIYALRDIAANEELTYDYKFERET 999
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+E E CLCG+ C+G +LN
Sbjct: 1000 DDE-ERLPCLCGAPTCKG-FLN 1019
>gi|320580861|gb|EFW95083.1| histone-lysine n-methyltransferase, h3 lysine-4 specific, putative
[Ogataea parapolymorpha DL-1]
Length = 658
Score = 69.3 bits (168), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 48/82 (58%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C P+C AK+ V+G +I IY +R I EE+T+DY E+
Sbjct: 579 TVIDASKKGGIARFINHCCVPSCTAKIIKVEGKKRIVIYALRDIAANEELTYDYKFERET 638
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+E E CLCG+ C+G YLN
Sbjct: 639 NDE-ERIPCLCGAPGCKG-YLN 658
>gi|380019005|ref|XP_003693408.1| PREDICTED: uncharacterized protein LOC100869667 [Apis florea]
Length = 1392
Score = 69.3 bits (168), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 74/149 (49%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ + GE F++E++GEV + +D R ++ ++D +Y + L
Sbjct: 447 KKGFGLRAMVDLLAGE--FIMEYVGEV------VDPKDFRRRAKEYSKDKNKHYYFMAL- 497
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + I HSC PN E + V+G +IG + + I GEE
Sbjct: 498 --KSDQ------IIDATMKGNVSRFINHSCDPNSETQKWTVNGELRIGFFNKKFIAAGEE 549
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
ITFDY+ K EA C C + CRG
Sbjct: 550 ITFDYHFQRYGK---EAQKCFCEAPNCRG 575
>gi|328790605|ref|XP_003251435.1| PREDICTED: hypothetical protein LOC100578450 [Apis mellifera]
Length = 1394
Score = 69.3 bits (168), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 74/149 (49%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ + GE F++E++GEV + +D R ++ ++D +Y + L
Sbjct: 447 KKGFGLRAMVDLLAGE--FIMEYVGEV------VDPKDFRRRAKEYSKDKNKHYYFMAL- 497
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + I HSC PN E + V+G +IG + + I GEE
Sbjct: 498 --KSDQ------IIDATMKGNVSRFINHSCDPNSETQKWTVNGELRIGFFNKKFIAAGEE 549
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
ITFDY+ K EA C C + CRG
Sbjct: 550 ITFDYHFQRYGK---EAQKCFCEAPNCRG 575
>gi|327304525|ref|XP_003236954.1| histone-lysine N-methyltransferase [Trichophyton rubrum CBS 118892]
gi|326459952|gb|EGD85405.1| histone-lysine N-methyltransferase [Trichophyton rubrum CBS 118892]
Length = 1337
Score = 69.3 bits (168), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1258 TVIDATKHGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1317
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1318 DSD-DRIPCLCGSTGCKG-FLN 1337
>gi|405967140|gb|EKC32340.1| Histone-lysine N-methyltransferase SETD1B-A [Crassostrea gigas]
Length = 1401
Score = 69.3 bits (168), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 42/79 (53%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N A I H C PNC AK+ V+ +I IY+ R I EEIT+DY E
Sbjct: 1325 TIIDATKCGNLARFINHCCNPNCYAKIITVESQKKIVIYSKRDIDVNEEITYDYKFPIED 1384
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
E CLCG+ CRG+
Sbjct: 1385 ----EKIPCLCGAPNCRGT 1399
>gi|328858772|gb|EGG07883.1| hypothetical protein MELLADRAFT_74594 [Melampsora larici-populina
98AG31]
Length = 191
Score = 69.3 bits (168), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 47/136 (34%), Positives = 66/136 (48%), Gaps = 25/136 (18%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVV 1963
+ V+E++GEV IR + + A E I YL R D +VV
Sbjct: 76 EMVIEYVGEV------------IRQAVADRREKAYERMGIGSSYLFRVDDD------LVV 117
Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
DA K N I H C PNC AK+ ++G +I IY I G+E+T+DY+ KE+
Sbjct: 118 DATKKGNLGRLINHCCAPNCTAKIITINGEKKIVIYAKATIELGDEVTYDYHF---PKED 174
Query: 2024 YEASVCLCGSQVCRGS 2039
+ CLCGS C+G+
Sbjct: 175 VKIP-CLCGSSKCKGT 189
>gi|145548702|ref|XP_001460031.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124427859|emb|CAK92634.1| unnamed protein product [Paramecium tetraurelia]
Length = 672
Score = 69.3 bits (168), Expect = 3e-08, Method: Composition-based stats.
Identities = 47/156 (30%), Positives = 71/156 (45%), Gaps = 22/156 (14%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGI-RSLQKNNEDPAPEFYNIYLE 1949
KG G+VC + GF ++F+ + GEVY +WFEKQ + +Q N + + Y+E
Sbjct: 119 KGKGMVCCQGEGFATNEFICFYFGEVYSPQRWFEKQTIFNKRMQDGNRKTCSQ--SPYVE 176
Query: 1950 RPKGDADGYDLVV--------VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTV 2001
D DL+V +D N A I +SC PNC V+ + + T
Sbjct: 177 FFIND----DLLVMFKKYFQFIDPTRYGNMAQHISYSCDPNCRLVTVIVNQQNLLAVMTA 232
Query: 2002 RGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
+ I+Y EE+T + + CLCGS C+
Sbjct: 233 KKINYLEELTLPFPLTCMDQ-------CLCGSLHCK 261
Score = 59.7 bits (143), Expect = 2e-05, Method: Composition-based stats.
Identities = 62/283 (21%), Positives = 120/283 (42%), Gaps = 49/283 (17%)
Query: 2154 YNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLWKGEGSLVEELIQCM 2213
+ Q N+ +DKV++ ++ + K PP+ ++ WK GS +++
Sbjct: 314 HQQNYINIFSCVDKVKFALQHL----KTVQPPIFLVT--NIFDQFWKNYGSNTQKI---- 363
Query: 2214 APHVEEDVLND----LKSKIQAHDPSGSEDIQRELRKSL---------------LWLRDE 2254
+E ++N+ LK Q H S +I +++++ + L L +
Sbjct: 364 --QLESSIINEIVIFLKRHSQQHQCSIGLEIIKQMKQIIDQNSIYALELTRMLFLLLSEI 421
Query: 2255 VRNL-PCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKLGA 2313
+ N+ C++ + A A +++ ++T +F +Y+ F S P + + P+ +K
Sbjct: 422 ILNIESCSFN--NKAFATILYFMSFTHTYFSSTQYQGFDSKPFEENEFEYIPQPKNKSKL 479
Query: 2314 DLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHR 2373
L K Y + GQLI W+ QT +P ++A+ RG L P S +
Sbjct: 480 ALS---KQYTPQFIWGQLINWNKQTLQNPQSSMAQERRGVLCYP---SLLLSFDNKHKTF 533
Query: 2374 VYGPKT----VRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGS 2412
Y KT + + S+ E QP W++K+ I+G+
Sbjct: 534 PYQCKTREIYLEYFQSKKEIQPDL-----STWSYKNQHNIYGT 571
>gi|189238620|ref|XP_969339.2| PREDICTED: similar to CG40351 CG40351-PC [Tribolium castaneum]
gi|270009170|gb|EFA05618.1| hypothetical protein TcasGA2_TC015826 [Tribolium castaneum]
Length = 1268
Score = 69.3 bits (168), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/78 (42%), Positives = 43/78 (55%), Gaps = 4/78 (5%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
++DA N A I HSC PNC AKV ++ +I IY+ + I EEIT+DY E
Sbjct: 1193 IIDATKCGNLARFINHSCNPNCYAKVITIESQKKIVIYSKQSIGVNEEITYDYKFPIED- 1251
Query: 2022 EEYEASVCLCGSQVCRGS 2039
E CLCG+ CRG+
Sbjct: 1252 ---EKIPCLCGAATCRGT 1266
>gi|150866258|ref|XP_001385792.2| histone methyltransferase involved in gene regulation
[Scheffersomyces stipitis CBS 6054]
gi|149387514|gb|ABN67763.2| histone methyltransferase involved in gene regulation
[Scheffersomyces stipitis CBS 6054]
Length = 1055
Score = 69.3 bits (168), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C P+C AK+ VD +I IY +R I EE+T+DY E+
Sbjct: 976 TVIDATKKGGIARFINHCCSPSCTAKIIKVDNQKRIVIYALRDIDANEELTYDYKFERET 1035
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ E CLCG+ C+G YLN
Sbjct: 1036 NDA-ERIRCLCGAPGCKG-YLN 1055
>gi|350413847|ref|XP_003490133.1| PREDICTED: hypothetical protein LOC100748492 [Bombus impatiens]
Length = 3522
Score = 69.3 bits (168), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 88/198 (44%), Gaps = 28/198 (14%)
Query: 1847 PLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGED 1906
P I E E V ++ M M ILK Y ++ G G+ C ++ GE
Sbjct: 3351 PKMIAISEAESRRVASTNL-PMAMRFRILKETSKESVGVYHSHIHGRGLFCLRDIEAGE- 3408
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
V+E+ GEV IR+ + + + NI K D D +VVDA
Sbjct: 3409 -MVIEYAGEV------------IRASLTDKREKYYDSKNIGCYMFKID----DHLVVDAT 3451
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE- 2025
K N A I HSC PNC ++V + G I I+ +R I GEE+T+DY K +E
Sbjct: 3452 MKGNAARFINHSCEPNCYSRVVDILGKKHILIFALRRIIQGEELTYDY------KFPFED 3505
Query: 2026 -ASVCLCGSQVCRGSYLN 2042
C CGS+ CR YLN
Sbjct: 3506 IKIPCTCGSRRCR-KYLN 3522
>gi|358333784|dbj|GAA31138.2| histone-lysine N-methyltransferase SETD1B [Clonorchis sinensis]
Length = 1685
Score = 69.3 bits (168), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 46/82 (56%), Gaps = 4/82 (4%)
Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
D V+DA N I HSC+PNC AK+ V+G +I IY+ R I+ EEIT+DY
Sbjct: 1607 DDFVIDATMCGNNGRFINHSCQPNCYAKIITVEGKKKIVIYSKRDINVMEEITYDY---- 1662
Query: 2019 ESKEEYEASVCLCGSQVCRGSY 2040
+ E E C CG+ CRG+
Sbjct: 1663 KFPYEEEKIPCQCGASTCRGTL 1684
>gi|444722051|gb|ELW62755.1| putative histone-lysine N-methyltransferase NSD2 [Tupaia chinensis]
Length = 1421
Score = 69.3 bits (168), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 916 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 967
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ + I G E+
Sbjct: 968 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFALCDIPAGTEL 1018
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1019 TFNYNLDCLGNEK---TVCRCGASNCSG 1043
>gi|195062427|ref|XP_001996188.1| GH22347 [Drosophila grimshawi]
gi|193899683|gb|EDV98549.1| GH22347 [Drosophila grimshawi]
Length = 1714
Score = 69.3 bits (168), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N A I HSC PNC AKV ++ +I IY+ + I EEIT+DY E
Sbjct: 1638 TIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEE 1697
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
E CLCG+Q CRG+
Sbjct: 1698 ----EKIPCLCGAQGCRGT 1712
>gi|62862148|ref|NP_001015221.1| Set1, isoform A [Drosophila melanogaster]
gi|62862150|ref|NP_001015222.1| Set1, isoform B [Drosophila melanogaster]
gi|161076059|ref|NP_001104406.1| Set1, isoform C [Drosophila melanogaster]
gi|281366745|ref|NP_001163846.1| Set1, isoform D [Drosophila melanogaster]
gi|281366747|ref|NP_001163847.1| Set1, isoform E [Drosophila melanogaster]
gi|281366749|ref|NP_001163848.1| Set1, isoform F [Drosophila melanogaster]
gi|281366751|ref|NP_001163849.1| Set1, isoform G [Drosophila melanogaster]
gi|281366753|ref|NP_001163850.1| Set1, isoform H [Drosophila melanogaster]
gi|281366755|ref|NP_001163851.1| Set1, isoform I [Drosophila melanogaster]
gi|51951109|gb|EAL24598.1| Set1, isoform A [Drosophila melanogaster]
gi|51951110|gb|EAL24599.1| Set1, isoform B [Drosophila melanogaster]
gi|158529717|gb|EDP28071.1| Set1, isoform C [Drosophila melanogaster]
gi|281309231|gb|EFA98694.1| Set1, isoform D [Drosophila melanogaster]
gi|281309232|gb|EFA98695.1| Set1, isoform E [Drosophila melanogaster]
gi|281309233|gb|EFA98696.1| Set1, isoform F [Drosophila melanogaster]
gi|281309234|gb|EFA98697.1| Set1, isoform G [Drosophila melanogaster]
gi|281309235|gb|EFA98698.1| Set1, isoform H [Drosophila melanogaster]
gi|281309236|gb|EFA98699.1| Set1, isoform I [Drosophila melanogaster]
Length = 1641
Score = 69.3 bits (168), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N A I HSC PNC AKV ++ +I IY+ + I EEIT+DY E
Sbjct: 1565 TIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLED 1624
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
E CLCG+Q CRG+
Sbjct: 1625 ----EKIPCLCGAQGCRGT 1639
>gi|414587222|tpg|DAA37793.1| TPA: hypothetical protein ZEAMMB73_251567 [Zea mays]
Length = 489
Score = 69.3 bits (168), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 72/151 (47%), Gaps = 26/151 (17%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G+V ++ G+ FV+E+ GEV WK + R Q + Y IYL
Sbjct: 71 RGWGLVADENIMAGQ--FVIEYCGEVIS-WK-----EAKRRAQAYETQCLKDAYIIYLNA 122
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ +DA K N A I HSC+PNCE + V G ++GI+ + I +G E+
Sbjct: 123 DES---------IDATRKGNLARFINHSCQPNCETRKWNVLGEVRVGIFAKQNIPFGTEL 173
Query: 2011 TFDYNSVTESKEEYEASV---CLCGSQVCRG 2038
++DYN E+ V CLCG+ C G
Sbjct: 174 SYDYNF------EWYGGVMVRCLCGAASCSG 198
>gi|326472906|gb|EGD96915.1| histone-lysine N-methyltransferase [Trichophyton tonsurans CBS
112818]
Length = 1330
Score = 69.3 bits (168), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1251 TVIDATKHGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1310
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1311 DSD-DRIPCLCGSTGCKG-FLN 1330
>gi|119467882|ref|XP_001257747.1| SET domain protein [Neosartorya fischeri NRRL 181]
gi|119405899|gb|EAW15850.1| SET domain protein [Neosartorya fischeri NRRL 181]
Length = 1241
Score = 69.3 bits (168), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1162 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIGRDEELTYDYKFEREW 1221
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1222 DSD-DRIPCLCGSTGCKG-FLN 1241
>gi|359492362|ref|XP_002284621.2| PREDICTED: uncharacterized protein LOC100245350 [Vitis vinifera]
Length = 2184
Score = 69.3 bits (168), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 35/77 (45%), Positives = 43/77 (55%), Gaps = 2/77 (2%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
V+DA K N I HSC PNC + V+G IG++ +R I GEE+TFDYN V
Sbjct: 1313 VIDACAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFG 1372
Query: 2022 EEYEASVCLCGSQVCRG 2038
A C+CGS CRG
Sbjct: 1373 --AAAKKCVCGSPQCRG 1387
>gi|348527922|ref|XP_003451468.1| PREDICTED: hypothetical protein LOC100692734 [Oreochromis niloticus]
Length = 2421
Score = 69.3 bits (168), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 78/158 (49%), Gaps = 21/158 (13%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
+ +G G+ C + G+ FV E++GEV E + IR Q NN FY
Sbjct: 2005 FRTLSRGWGLRCVHDIKKGQ--FVSEYVGEVI---DEEECRSRIRHAQDNN---ICNFYM 2056
Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
+ L++ + ++DA K N A + HSC+PNCE + V+G ++G++ + I
Sbjct: 2057 LTLDKDR---------IIDAGPKGNEARFMNHSCQPNCETQKWTVNGDTRVGLFALIDIA 2107
Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNL 2043
G E+TF+YN + +VC CG+ C G +L L
Sbjct: 2108 AGTELTFNYNLECLGNRK---TVCKCGASNCSG-FLGL 2141
>gi|332026544|gb|EGI66662.1| Histone-lysine N-methyltransferase SETD2 [Acromyrmex echinatior]
Length = 1841
Score = 69.3 bits (168), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 74/149 (49%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ + GE F++E++GEV + +D R ++ ++D +Y + L
Sbjct: 881 KKGFGLRAVVDIMAGE--FIMEYVGEV------VDPKDFRRRAKEYSKDKNRHYYFMAL- 931
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + I HSC PN E + V+G +IG + + I GEE
Sbjct: 932 --KSDQ------IIDATMKGNISRFINHSCDPNAETQKWTVNGELRIGFFNKKFIAAGEE 983
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
ITFDY+ K EA C C + CRG
Sbjct: 984 ITFDYHFQRYGK---EAQKCYCEALNCRG 1009
>gi|157103255|ref|XP_001647894.1| mixed-lineage leukemia protein, mll [Aedes aegypti]
gi|108884726|gb|EAT48951.1| AAEL000054-PA, partial [Aedes aegypti]
Length = 3069
Score = 69.3 bits (168), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 75/157 (47%), Gaps = 23/157 (14%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
Y ++ G G+ CN++ GE V+E+ GE+ IRS + + +
Sbjct: 2936 YRSHIHGRGLFCNRDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRG 2981
Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
I K D + VVDA + N A I HSC PNC +KV + GH I I+ +R I
Sbjct: 2982 IGCYMFKID----EHFVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIV 3037
Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CGS+ CR YLN
Sbjct: 3038 QGEELTYDYKFPFEDVK----IPCSCGSKKCR-KYLN 3069
>gi|401842102|gb|EJT44375.1| SET1-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 1087
Score = 69.3 bits (168), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C PNC AK+ V G +I IY +R I EE+T+DY E
Sbjct: 1008 TVIDATKKGGIARFINHCCDPNCTAKIIKVGGRRRIVIYALRDIGANEELTYDYKFEREQ 1067
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+E E CLCG+ C+G +LN
Sbjct: 1068 DDE-ERLPCLCGASNCKG-FLN 1087
>gi|326477398|gb|EGE01408.1| histone-lysine N-methyltransferase [Trichophyton equinum CBS 127.97]
Length = 1331
Score = 69.3 bits (168), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1252 TVIDATKHGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1311
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1312 DSD-DRIPCLCGSTGCKG-FLN 1331
>gi|194900731|ref|XP_001979909.1| GG21380 [Drosophila erecta]
gi|190651612|gb|EDV48867.1| GG21380 [Drosophila erecta]
Length = 3741
Score = 69.3 bits (168), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 54/151 (35%), Positives = 72/151 (47%), Gaps = 23/151 (15%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C K+ GE V+E+ GE+ IRS + + + I
Sbjct: 3614 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3659
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
K D D +VVDA + N A I H C PNC +KV + GH I I+ +R I GEE+T
Sbjct: 3660 KID----DNLVVDATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 3715
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY + E E C CGS+ CR YLN
Sbjct: 3716 YDY----KFPFEEEKIPCSCGSKRCR-KYLN 3741
>gi|225685245|gb|EEH23529.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 756
Score = 68.9 bits (167), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 80/149 (53%), Gaps = 22/149 (14%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G GV N+ F + +VE+ GE+ K E++ +R++ KNNE +Y +Y ++
Sbjct: 375 RGYGVRSNRT--FAPNQIIVEYTGEII-TQKECERR--MRTVYKNNEC----YYLMYFDQ 425
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVR-GIHYGEE 2009
+++DA + + A + HSC PNCE + V G ++ ++ + GI GEE
Sbjct: 426 N---------MIIDAT-RGSIARFVNHSCEPNCEMEKWTVAGKPRMALFAGKNGITTGEE 475
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+T+DYN S++ + C CG++ CRG
Sbjct: 476 LTYDYNFDPYSQKNVQE--CRCGAETCRG 502
>gi|410898830|ref|XP_003962900.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Takifugu rubripes]
Length = 1329
Score = 68.9 bits (167), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 47/158 (29%), Positives = 81/158 (51%), Gaps = 21/158 (13%)
Query: 1882 PDDKYVAYR-KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD K + KG G++ ++ GE FV E++GE+ E + I+ Q+NN
Sbjct: 1023 PDTKIIKTPGKGWGLITLRDIKKGE--FVNEYIGELI---DEEECRARIKYAQENN---V 1074
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + +++ + ++DA K NY+ + HSC+PNCE + V+G ++G++
Sbjct: 1075 TNFYMLTIDKDR---------IIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFA 1125
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ + G E+TF+YN E+ + C CG+ C G
Sbjct: 1126 ICDVPAGTELTFNYNLDCLGNEK---TACCCGAPNCSG 1160
>gi|296805347|ref|XP_002843498.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
gi|238844800|gb|EEQ34462.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
Length = 1344
Score = 68.9 bits (167), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1265 TVIDATKHGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1324
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1325 DSD-DRIPCLCGSTGCKG-FLN 1344
>gi|414587221|tpg|DAA37792.1| TPA: hypothetical protein ZEAMMB73_251567 [Zea mays]
Length = 503
Score = 68.9 bits (167), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 72/151 (47%), Gaps = 26/151 (17%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G+V ++ G+ FV+E+ GEV WK + R Q + Y IYL
Sbjct: 85 RGWGLVADENIMAGQ--FVIEYCGEVIS-WK-----EAKRRAQAYETQCLKDAYIIYLNA 136
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ +DA K N A I HSC+PNCE + V G ++GI+ + I +G E+
Sbjct: 137 DES---------IDATRKGNLARFINHSCQPNCETRKWNVLGEVRVGIFAKQNIPFGTEL 187
Query: 2011 TFDYNSVTESKEEYEASV---CLCGSQVCRG 2038
++DYN E+ V CLCG+ C G
Sbjct: 188 SYDYNF------EWYGGVMVRCLCGAASCSG 212
>gi|340710026|ref|XP_003393599.1| PREDICTED: hypothetical protein LOC100646252 [Bombus terrestris]
Length = 3530
Score = 68.9 bits (167), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 81/177 (45%), Gaps = 27/177 (15%)
Query: 1868 MKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQD 1927
M M ILK Y ++ G G+ C ++ GE V+E+ GEV
Sbjct: 3379 MAMRFRILKETSKESVGVYHSHIHGRGLFCLRDIEAGE--MVIEYAGEV----------- 3425
Query: 1928 GIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKV 1987
IR+ + + + NI K D D +VVDA K N A I HSC PNC ++V
Sbjct: 3426 -IRASLTDKREKYYDSKNIGCYMFKID----DHLVVDATMKGNAARFINHSCEPNCYSRV 3480
Query: 1988 TAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE--ASVCLCGSQVCRGSYLN 2042
+ G I I+ +R I GEE+T+DY K +E C CGS+ CR YLN
Sbjct: 3481 VDILGKKHILIFALRRIIQGEELTYDY------KFPFEDIKIPCTCGSRRCR-KYLN 3530
>gi|347968475|ref|XP_563394.4| AGAP002741-PA [Anopheles gambiae str. PEST]
gi|333467986|gb|EAL40845.4| AGAP002741-PA [Anopheles gambiae str. PEST]
Length = 4925
Score = 68.9 bits (167), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 75/157 (47%), Gaps = 23/157 (14%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
Y ++ G G+ CN++ GE V+E+ GE+ IRS + + +
Sbjct: 4792 YRSHIHGRGLFCNRDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRG 4837
Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
I K D + VVDA + N A I HSC PNC +KV + GH I I+ +R I
Sbjct: 4838 IGCYMFKIDEN----FVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIV 4893
Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CGS+ CR YLN
Sbjct: 4894 QGEELTYDYKFPFEDVK----IPCSCGSKKCR-KYLN 4925
>gi|17136558|ref|NP_476770.1| trithorax, isoform B [Drosophila melanogaster]
gi|19550181|ref|NP_599108.1| trithorax, isoform C [Drosophila melanogaster]
gi|62472551|ref|NP_001014621.1| trithorax, isoform E [Drosophila melanogaster]
gi|23171245|gb|AAN13600.1| trithorax, isoform B [Drosophila melanogaster]
gi|23171246|gb|AAN13601.1| trithorax, isoform C [Drosophila melanogaster]
gi|61679333|gb|AAX52951.1| trithorax, isoform E [Drosophila melanogaster]
Length = 3358
Score = 68.9 bits (167), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 54/151 (35%), Positives = 71/151 (47%), Gaps = 23/151 (15%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C K+ GE V+E+ GE+ IRS + + + I
Sbjct: 3231 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3276
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
K D D +VVDA + N A I H C PNC +KV + GH I I+ +R I GEE+T
Sbjct: 3277 KID----DNLVVDATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 3332
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY E E C CGS+ CR YLN
Sbjct: 3333 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3358
>gi|402221447|gb|EJU01516.1| SET domain-containing protein [Dacryopinax sp. DJM-731 SS1]
Length = 164
Score = 68.9 bits (167), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 50/139 (35%), Positives = 66/139 (47%), Gaps = 26/139 (18%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVV 1963
D V+E++GEV +R + + E I YL R D +VV
Sbjct: 49 DMVIEYVGEV------------VRQQVADKREKVYERQGIGSSYLFRIDDD------LVV 90
Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
DA K N I HSC PNC A++ ++ +I IY I GEEIT+DY+ E
Sbjct: 91 DATMKGNIGRLINHSCSPNCTARIITINSSKKIVIYAKTPIEPGEEITYDYHFPIEQ--- 147
Query: 2024 YEASVCLCGSQVCRGSYLN 2042
E CLCGS+ CRG +LN
Sbjct: 148 -EKIPCLCGSEKCRG-FLN 164
>gi|350421470|ref|XP_003492853.1| PREDICTED: hypothetical protein LOC100746901 [Bombus impatiens]
Length = 1777
Score = 68.9 bits (167), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 74/149 (49%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ + GE F++E++GEV + +D R ++ ++D +Y + L
Sbjct: 830 KKGFGLRAMVDLLAGE--FIMEYVGEV------VDPKDFRRRAKEYSKDKNKHYYFMAL- 880
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + I HSC PN E + V+G +IG + + I GEE
Sbjct: 881 --KSDQ------IIDATLKGNVSRFINHSCDPNSETQKWTVNGELRIGFFNKKFIAAGEE 932
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
ITFDY+ K EA C C + CRG
Sbjct: 933 ITFDYHFQRYGK---EAQKCFCEAPNCRG 958
>gi|340726897|ref|XP_003401788.1| PREDICTED: hypothetical protein LOC100652142 [Bombus terrestris]
Length = 1777
Score = 68.9 bits (167), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 74/149 (49%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ + GE F++E++GEV + +D R ++ ++D +Y + L
Sbjct: 830 KKGFGLRAMVDLLAGE--FIMEYVGEV------VDPKDFRRRAKEYSKDKNKHYYFMAL- 880
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K D ++DA K N + I HSC PN E + V+G +IG + + I GEE
Sbjct: 881 --KSDQ------IIDATLKGNVSRFINHSCDPNSETQKWTVNGELRIGFFNKKFIAAGEE 932
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
ITFDY+ K EA C C + CRG
Sbjct: 933 ITFDYHFQRYGK---EAQKCFCEAPNCRG 958
>gi|70991351|ref|XP_750524.1| SET domain protein [Aspergillus fumigatus Af293]
gi|74671075|sp|Q4WNH8.1|SET1_ASPFU RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component set1; AltName:
Full=SET domain-containing protein 1
gi|66848157|gb|EAL88486.1| SET domain protein [Aspergillus fumigatus Af293]
Length = 1241
Score = 68.9 bits (167), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1162 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIGRDEELTYDYKFEREW 1221
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1222 DSD-DRIPCLCGSTGCKG-FLN 1241
>gi|159124080|gb|EDP49198.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 1241
Score = 68.9 bits (167), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1162 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIGRDEELTYDYKFEREW 1221
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1222 DSD-DRIPCLCGSTGCKG-FLN 1241
>gi|256271664|gb|EEU06704.1| Set1p [Saccharomyces cerevisiae JAY291]
Length = 1080
Score = 68.9 bits (167), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C PNC AK+ V G +I IY +R I EE+T+DY E
Sbjct: 1001 TVIDATKKGGIARFINHCCDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREK 1060
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+E E CLCG+ C+G +LN
Sbjct: 1061 DDE-ERLPCLCGAPNCKG-FLN 1080
>gi|162463380|ref|NP_001105665.1| SET domain-containing protein SET102 [Zea mays]
gi|22121720|gb|AAM89289.1| SET domain-containing protein SET102 [Zea mays]
gi|414587223|tpg|DAA37794.1| TPA: SET domain-containing protein SET102 [Zea mays]
Length = 513
Score = 68.9 bits (167), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 72/151 (47%), Gaps = 26/151 (17%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G+V ++ G+ FV+E+ GEV WK + R Q + Y IYL
Sbjct: 95 RGWGLVADENIMAGQ--FVIEYCGEVIS-WK-----EAKRRAQAYETQCLKDAYIIYLNA 146
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ +DA K N A I HSC+PNCE + V G ++GI+ + I +G E+
Sbjct: 147 DES---------IDATRKGNLARFINHSCQPNCETRKWNVLGEVRVGIFAKQNIPFGTEL 197
Query: 2011 TFDYNSVTESKEEYEASV---CLCGSQVCRG 2038
++DYN E+ V CLCG+ C G
Sbjct: 198 SYDYNF------EWYGGVMVRCLCGAASCSG 222
>gi|17136556|ref|NP_476769.1| trithorax, isoform D [Drosophila melanogaster]
gi|19550184|ref|NP_599109.1| trithorax, isoform A [Drosophila melanogaster]
gi|290457684|sp|P20659.4|TRX_DROME RecName: Full=Histone-lysine N-methyltransferase trithorax; AltName:
Full=Lysine N-methyltransferase 2A
gi|10726522|gb|AAF55041.2| trithorax, isoform A [Drosophila melanogaster]
gi|23171244|gb|AAN13599.1| trithorax, isoform D [Drosophila melanogaster]
Length = 3726
Score = 68.9 bits (167), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 54/151 (35%), Positives = 71/151 (47%), Gaps = 23/151 (15%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C K+ GE V+E+ GE+ IRS + + + I
Sbjct: 3599 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3644
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
K D D +VVDA + N A I H C PNC +KV + GH I I+ +R I GEE+T
Sbjct: 3645 KID----DNLVVDATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 3700
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY E E C CGS+ CR YLN
Sbjct: 3701 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3726
>gi|410923178|ref|XP_003975059.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Takifugu
rubripes]
Length = 1499
Score = 68.9 bits (167), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 72/148 (48%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G+ N+ GE FV E++GEV + ++ + +++ +E+ FY + L +
Sbjct: 1207 RGWGLKANQPLKKGE--FVTEYVGEV------IDAEECQQRIKRAHENHMTNFYMLTLTK 1258
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ V+DA K N + I HSC PNCE + V+G IG++ + I G E+
Sbjct: 1259 DR---------VIDAAQKGNLSRFINHSCSPNCETQKWTVNGDVHIGLFALCDIDAGTEL 1309
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN + C CGS C G
Sbjct: 1310 TFNYNLHCVGNRR---TTCNCGSDNCSG 1334
>gi|315045626|ref|XP_003172188.1| histone-lysine N-methyltransferase [Arthroderma gypseum CBS 118893]
gi|311342574|gb|EFR01777.1| histone-lysine N-methyltransferase [Arthroderma gypseum CBS 118893]
Length = 1334
Score = 68.9 bits (167), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA A I HSC PNC AK+ VDG +I IY +R I EE+T+DY E
Sbjct: 1255 TVIDATKHGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1314
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1315 DSD-DRIPCLCGSTGCKG-FLN 1334
>gi|151944065|gb|EDN62358.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
Length = 1080
Score = 68.9 bits (167), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C PNC AK+ V G +I IY +R I EE+T+DY E
Sbjct: 1001 TVIDATKKGGIARFINHCCDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREK 1060
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+E E CLCG+ C+G +LN
Sbjct: 1061 DDE-ERLPCLCGAPNCKG-FLN 1080
>gi|241612901|ref|XP_002407306.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
gi|215502770|gb|EEC12264.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
Length = 208
Score = 68.9 bits (167), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 37/81 (45%), Positives = 47/81 (58%), Gaps = 5/81 (6%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
++DA N A I HSC PNC AKV V+G +I IY+ + I+ EEIT+DY E
Sbjct: 133 IIDATKCGNLARFINHSCNPNCYAKVITVEGQKKIVIYSKQPINVNEEITYDYKFPLEE- 191
Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
E CLCG+ CRG +LN
Sbjct: 192 ---EKISCLCGAPQCRG-FLN 208
>gi|6321911|ref|NP_011987.1| Set1p [Saccharomyces cerevisiae S288c]
gi|731707|sp|P38827.1|SET1_YEAST RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=Lysine N-methyltransferase 2; AltName: Full=SET
domain-containing protein 1
gi|529135|gb|AAB68867.1| Set1p [Saccharomyces cerevisiae]
gi|190405898|gb|EDV09165.1| histone-lysine N-methyltransferase [Saccharomyces cerevisiae RM11-1a]
gi|285810026|tpg|DAA06813.1| TPA: Set1p [Saccharomyces cerevisiae S288c]
gi|392298926|gb|EIW10021.1| Set1p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 1080
Score = 68.9 bits (167), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C PNC AK+ V G +I IY +R I EE+T+DY E
Sbjct: 1001 TVIDATKKGGIARFINHCCDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREK 1060
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+E E CLCG+ C+G +LN
Sbjct: 1061 DDE-ERLPCLCGAPNCKG-FLN 1080
>gi|374370210|ref|ZP_09628219.1| methyltransferase [Cupriavidus basilensis OR16]
gi|373098212|gb|EHP39324.1| methyltransferase [Cupriavidus basilensis OR16]
Length = 188
Score = 68.9 bits (167), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 76/156 (48%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G GV N GE + ++E+ GE + WK +L+++ DPA + Y
Sbjct: 50 GKGVYAN--APIGEGERIIEYKGE-HISWK--------EALKRHPHDPADPNHTFYFSLE 98
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
GD V+DA N A I H+C PNCEA+ + ++ I+ +R I GEE+
Sbjct: 99 DGD-------VIDAKFGGNRARWINHACEPNCEAR----EKKGRVFIHALRDIASGEELF 147
Query: 2012 FDYNSVTES------KEEYEASVCLCGSQVCRGSYL 2041
+DY V ++ K+E+E C CGS CRG+ L
Sbjct: 148 YDYGLVIDARYTKKLKKEFE---CRCGSPKCRGTML 180
>gi|207344594|gb|EDZ71692.1| YHR119Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 1080
Score = 68.9 bits (167), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C PNC AK+ V G +I IY +R I EE+T+DY E
Sbjct: 1001 TVIDATKKGGIARFINHCCDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREK 1060
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+E E CLCG+ C+G +LN
Sbjct: 1061 DDE-ERLPCLCGAPNCKG-FLN 1080
>gi|449295340|gb|EMC91362.1| hypothetical protein BAUCODRAFT_80239 [Baudoinia compniacensis UAMH
10762]
Length = 1279
Score = 68.9 bits (167), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 47/140 (33%), Positives = 72/140 (51%), Gaps = 23/140 (16%)
Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDA 1965
+D ++E++GE K +K +R L+ + + YL R D +VDA
Sbjct: 1160 NDLIIEYVGE-----KVRQKVADLRELRYEKQG----VGSSYLFRMMDDE------IVDA 1204
Query: 1966 MHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE 2025
K A I HSC PNC AK+ V+G +I IY ++ I EE+T+DY + + EY
Sbjct: 1205 TKKGGIARFINHSCSPNCTAKIIKVEGTPRIVIYALKDIGKNEELTYDY----KFEREYG 1260
Query: 2026 AS---VCLCGSQVCRGSYLN 2042
++ CLCG+ C+G +LN
Sbjct: 1261 STDRIPCLCGTANCKG-FLN 1279
>gi|357627347|gb|EHJ77076.1| hypothetical protein KGM_14526 [Danaus plexippus]
Length = 1912
Score = 68.9 bits (167), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 68/131 (51%), Gaps = 18/131 (13%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
FV+E++GE+ ++++ R + + +E FY + L++ + ++DA
Sbjct: 1688 FVIEYVGEL------IDEEEFRRRMNRKHEVRDENFYFLTLDKER---------MIDAGP 1732
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
K N A + HSC PNCE + V G ++G++ +R I E+TF+YN T E
Sbjct: 1733 KGNLARFMNHSCEPNCETQKWTVLGDVRVGLFALRDIPANSELTFNYNLETSG---IEKK 1789
Query: 2028 VCLCGSQVCRG 2038
C+CG++ C G
Sbjct: 1790 RCMCGAKRCSG 1800
>gi|349578671|dbj|GAA23836.1| K7_Set1p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 1080
Score = 68.9 bits (167), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C PNC AK+ V G +I IY +R I EE+T+DY E
Sbjct: 1001 TVIDATKKGGIARFINHCCDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREK 1060
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+E E CLCG+ C+G +LN
Sbjct: 1061 DDE-ERLPCLCGAPNCKG-FLN 1080
>gi|47216786|emb|CAG03790.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1443
Score = 68.9 bits (167), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 71/134 (52%), Gaps = 18/134 (13%)
Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
+ +FV E++GE+ E + I+ Q+NN FY + +++ + ++D
Sbjct: 1117 QGEFVNEYIGELI---DEEECRARIKYAQENNIT---NFYMLTIDKDR---------IID 1161
Query: 1965 AMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY 2024
A K NY+ + HSC+PNCE + V+G ++G++ V I G E+TF+YN E+
Sbjct: 1162 AGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK- 1220
Query: 2025 EASVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1221 --TVCCCGAPNCSG 1232
>gi|322792358|gb|EFZ16342.1| hypothetical protein SINV_07789 [Solenopsis invicta]
Length = 3272
Score = 68.9 bits (167), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 60/175 (34%), Positives = 79/175 (45%), Gaps = 23/175 (13%)
Query: 1868 MKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQD 1927
M M ILK Y + G G+ C ++ GE V+E+ GEV
Sbjct: 3121 MAMRFRILKETSKASVGVYYSRIHGRGLFCLRDIEPGE--MVIEYAGEV----------- 3167
Query: 1928 GIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKV 1987
IRS + + + NI K D D +VVDA K N A I HSC PNC ++V
Sbjct: 3168 -IRSSLTDKREKYYDSKNIGCYMFKID----DHLVVDATMKGNAARFINHSCEPNCYSRV 3222
Query: 1988 TAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+ G I I+ +R I GEE+T+DY E + C CGS+ CR YLN
Sbjct: 3223 VDILGKKHILIFALRRIIQGEELTYDYKFPFEDIK----IPCTCGSRKCR-KYLN 3272
>gi|406694364|gb|EKC97692.1| hypothetical protein A1Q2_08004 [Trichosporon asahii var. asahii CBS
8904]
Length = 1218
Score = 68.9 bits (167), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 37/96 (38%), Positives = 50/96 (52%), Gaps = 8/96 (8%)
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
YL R GD +V DA K + + I HSC P AK+ ++GH +I IY R ++
Sbjct: 1131 YLFRIDGD------IVCDATFKGSVSRLINHSCNPTANAKIININGHNKIVIYAKRTLYP 1184
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
G+E+T+ YN E E CLCG C G +LN
Sbjct: 1185 GDEVTYSYNFPLEQDESLRVR-CLCGEPTCLG-FLN 1218
>gi|402852477|ref|XP_003890948.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Papio anubis]
Length = 1013
Score = 68.9 bits (167), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 721 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 772
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 773 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 823
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 824 TFNYNLDCLGNEK---TVCRCGASNCSG 848
>gi|213406581|ref|XP_002174062.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
yFS275]
gi|212002109|gb|EEB07769.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
yFS275]
Length = 779
Score = 68.9 bits (167), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 71/149 (47%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ N + FV E++GEV P ++ ++ +++ +E FY + L+
Sbjct: 167 KKGFGLRAN--SYLTKGTFVYEYIGEVIPEVRFRKR------MREYDERGIRHFYFMMLQ 218
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
KG+ +DA K + A HSCRPNC V ++GI+ R I GEE
Sbjct: 219 --KGE-------YIDATVKGSLARFCNHSCRPNCYVDKWVVGNKLRMGIFCKRDIQKGEE 269
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+TFDYN + +A C CG C G
Sbjct: 270 LTFDYNV---DRYGAQAQPCYCGEDCCLG 295
>gi|357631650|gb|EHJ79119.1| hypothetical protein KGM_15585 [Danaus plexippus]
Length = 1491
Score = 68.9 bits (167), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 45/82 (54%), Gaps = 5/82 (6%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N A I HSC PNC AK+ ++ +I IY+ + I EEIT+DY E
Sbjct: 1415 TIIDATKCGNLARFINHSCNPNCYAKIITIESQKKIVIYSKQPIGVDEEITYDYKFPLED 1474
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
E CLCG+ CRG YLN
Sbjct: 1475 ----EKIPCLCGAPQCRG-YLN 1491
>gi|255558564|ref|XP_002520307.1| huntingtin interacting protein, putative [Ricinus communis]
gi|223540526|gb|EEF42093.1| huntingtin interacting protein, putative [Ricinus communis]
Length = 1746
Score = 68.9 bits (167), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 35/77 (45%), Positives = 42/77 (54%), Gaps = 2/77 (2%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
V+DA K N I HSC PNC + V+G IG++ +R I GEE+TFDYN V
Sbjct: 897 VIDACAKGNLGRFINHSCDPNCRTEKWVVNGEICIGLFALRDIKKGEELTFDYNYVRVCG 956
Query: 2022 EEYEASVCLCGSQVCRG 2038
A C CGS CRG
Sbjct: 957 --AAAKRCYCGSPQCRG 971
>gi|355718741|gb|AES06369.1| SET domain containing 1B [Mustela putorius furo]
Length = 359
Score = 68.9 bits (167), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 284 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 343
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 344 VK----IPCLCGSENCRGT 358
>gi|328767162|gb|EGF77213.1| hypothetical protein BATDEDRAFT_91931 [Batrachochytrium dendrobatidis
JAM81]
Length = 779
Score = 68.6 bits (166), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 46/154 (29%), Positives = 68/154 (44%), Gaps = 24/154 (15%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
+ A +G G+ + G ++E+ GE+ K E+ D I S QKN+
Sbjct: 567 FYAPNRGFGLYTDVPIKAGV--LIIEYRGEIISTAKCIERNDTIYSGQKNH--------- 615
Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
+LE G +V+D K A HSC PNC + V +++GI+ I
Sbjct: 616 YFLEYGNG-------LVLDGCRKGTIARFANHSCDPNCHVEKWYVGTEFRVGIFATNNIS 668
Query: 2006 YGEEITFDYNSVTESKEEY-EASVCLCGSQVCRG 2038
G E+T+DY + Y + C CGSQ CRG
Sbjct: 669 VGSELTYDYRF-----DSYGQMQPCYCGSQNCRG 697
>gi|393244480|gb|EJD51992.1| histone methyltransferase [Auricularia delicata TFB-10046 SS5]
Length = 153
Score = 68.6 bits (166), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 38/83 (45%), Positives = 51/83 (61%), Gaps = 7/83 (8%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
+VVDA K N I HSC PNC AK+ +V+G +I IY + I G+E+T+DY+ E
Sbjct: 77 LVVDATKKGNLGRLINHSCDPNCTAKIISVNGVKKIVIYAKQDIELGDELTYDYHFPRE- 135
Query: 2021 KEEYEASV-CLCGSQVCRGSYLN 2042
EA + CLCG+ CRG +LN
Sbjct: 136 ----EAKIPCLCGAAKCRG-FLN 153
>gi|28204960|gb|AAH46473.1| Whsc1 protein, partial [Mus musculus]
Length = 851
Score = 68.6 bits (166), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 559 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 610
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 611 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 661
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 662 TFNYNLDCLGNEK---TVCRCGASNCSG 686
>gi|391331299|ref|XP_003740087.1| PREDICTED: uncharacterized protein LOC100899404 [Metaseiulus
occidentalis]
Length = 2686
Score = 68.6 bits (166), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 52/158 (32%), Positives = 72/158 (45%), Gaps = 35/158 (22%)
Query: 1888 AYRKGL---GVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFY 1944
YR G+ G+ C K+ GE ++E+ GEV ++ + D ++Y
Sbjct: 2552 VYRSGIHGRGLYCKKDIAKGE--MIIEYAGEV---------------IRASLCDRREKYY 2594
Query: 1945 -----NIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIY 1999
Y+ R D VVDA K N A I HSC PNC +K+ VD I IY
Sbjct: 2595 EGRGLGCYMFRMDNDE------VVDATVKGNAARFINHSCDPNCYSKMITVDNKKHIVIY 2648
Query: 2000 TVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
+R I GEE+T+DY E + + C CGS+ CR
Sbjct: 2649 ALREIRTGEELTYDYKFPIEDDKLH----CTCGSRRCR 2682
>gi|37360238|dbj|BAC98097.1| mKIAA1090 protein [Mus musculus]
Length = 857
Score = 68.6 bits (166), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 565 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 616
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 617 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 667
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 668 TFNYNLDCLGNEK---TVCRCGASNCSG 692
>gi|348675982|gb|EGZ15800.1| hypothetical protein PHYSODRAFT_263017 [Phytophthora sojae]
Length = 823
Score = 68.6 bits (166), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 73/147 (49%), Gaps = 21/147 (14%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+V N++ GE F++E++GEV + + R + + ++ FY + LE+
Sbjct: 246 GFGLVANEKINAGE--FIIEYVGEV------IDDIECERRMIQYRDNGEVNFYMMELEKN 297
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
+V+DA +++N + I H C PN + VDG +IGI+ R I EEIT
Sbjct: 298 ---------IVIDAKYRSNDSRFINHCCDPNSVTQKWNVDGMQRIGIFARRNIAPDEEIT 348
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRG 2038
DYN EA+ C CGS C G
Sbjct: 349 IDYN----FSHFGEAADCKCGSTACTG 371
>gi|195143973|ref|XP_002012971.1| GL23881 [Drosophila persimilis]
gi|194101914|gb|EDW23957.1| GL23881 [Drosophila persimilis]
Length = 1466
Score = 68.6 bits (166), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 41/150 (27%), Positives = 77/150 (51%), Gaps = 23/150 (15%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G+VC + E DF++E++GEV +++ R + + +D FY + +E+
Sbjct: 1279 RGFGLVCREP--IAEGDFIIEYVGEV------INQEEFQRRMLRKQKDRDENFYFLGVEK 1330
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
++DA K N A + HSC PNC ++ V+ +++G++ ++ I E+
Sbjct: 1331 E---------FIIDAGPKGNLARFMNHSCEPNCTSQKWTVNCTHRVGLFAIQDIPAETEL 1381
Query: 2011 TFDY--NSVTESKEEYEASVCLCGSQVCRG 2038
TF+Y + + K++ C CGS+ C G
Sbjct: 1382 TFNYLWDDLLNDKKK----ACHCGSERCSG 1407
>gi|328768890|gb|EGF78935.1| hypothetical protein BATDEDRAFT_90118 [Batrachochytrium dendrobatidis
JAM81]
Length = 1361
Score = 68.6 bits (166), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 46/150 (30%), Positives = 71/150 (47%), Gaps = 24/150 (16%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+ + G F++E+ GEV P F K+ + +++ + A FY + L++
Sbjct: 251 KGFGIYARENIAGGA--FIIEYCGEVIPA-SLFGKR-----ITEHSNNSAQHFYFMSLKK 302
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
D Y +DA K N + + HSC PNC + V +IG++ +R I E+
Sbjct: 303 -----DEY----IDASKKGNLSRYLNHSCDPNCSLQKWLVGDTIRIGLFALRAIPKNAEL 353
Query: 2011 TFDYNSVTESKEEY--EASVCLCGSQVCRG 2038
TFDY E Y +A C CG+ C G
Sbjct: 354 TFDYKF-----ERYGSKAQECYCGAAACTG 378
>gi|321472797|gb|EFX83766.1| hypothetical protein DAPPUDRAFT_301653 [Daphnia pulex]
Length = 303
Score = 68.6 bits (166), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 33/78 (42%), Positives = 45/78 (57%), Gaps = 4/78 (5%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
++DA N A I HSC PNC A+V ++ +I IY+ + I GEEIT+DY E
Sbjct: 228 IIDATKCGNLARFINHSCNPNCYARVITIESQKKIVIYSKQPIGVGEEITYDYKFPIEE- 286
Query: 2022 EEYEASVCLCGSQVCRGS 2039
+ +CLCGS CRG+
Sbjct: 287 ---DKIICLCGSSQCRGT 301
>gi|213624868|gb|AAI71696.1| Wolf-Hirschhorn syndrome candidate 1 [Danio rerio]
Length = 1461
Score = 68.6 bits (166), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 78/148 (52%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G++ ++ GE FV E++GE+ E + IR Q+N+ FY + +++
Sbjct: 1164 KGWGLISLRDIKKGE--FVNEYVGELI---DEEECRSRIRHAQEND---ITHFYMLTIDK 1215
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE + V+G ++G++ V I G E+
Sbjct: 1216 DR---------IIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPAGTEL 1266
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1267 TFNYNLDCLGNEK---TVCRCGAPNCSG 1291
>gi|128485462|ref|NP_001076020.1| probable histone-lysine N-methyltransferase NSD2 [Danio rerio]
Length = 1461
Score = 68.6 bits (166), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 78/148 (52%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G++ ++ GE FV E++GE+ ++++ ++ E+ FY + +++
Sbjct: 1164 KGWGLISLRDIKKGE--FVNEYVGEL------IDEEECRSRIRHAQENDITHFYMLTIDK 1215
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE + V+G ++G++ V I G E+
Sbjct: 1216 DR---------IIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPAGTEL 1266
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1267 TFNYNLDCLGNEK---TVCRCGAPNCSG 1291
>gi|443714650|gb|ELU06966.1| hypothetical protein CAPTEDRAFT_176480 [Capitella teleta]
Length = 936
Score = 68.6 bits (166), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 74/155 (47%), Gaps = 20/155 (12%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+K+V +G GV + ++ E+LGEV ++ R ++ AP
Sbjct: 224 EKFVTADRGHGV--RSKHPLVNGQYICEYLGEVV-------SEEEFRRRMADDYSAAPHH 274
Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
Y + L+ V+D + + I HSC PNCE + ++G Y+I +++++
Sbjct: 275 YCLNLD---------SGTVIDGYRMGSISRFINHSCEPNCEMQKWNINGVYRIALFSLKD 325
Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
I GEE+T+DYN +S + +C CGS CRG
Sbjct: 326 IPPGEELTYDYN--FQSYNVHSQQICKCGSANCRG 358
>gi|321468162|gb|EFX79148.1| hypothetical protein DAPPUDRAFT_319776 [Daphnia pulex]
Length = 1408
Score = 68.6 bits (166), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 75/149 (50%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG+G+ ++ G DF++E++GEV + F ++ + +KN +Y + L
Sbjct: 476 KKGVGLRALQDMDPG--DFIIEYVGEVIDP-REFHRRAKDYAREKNKH-----YYFMAL- 526
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
K DA ++DA + N + I HSC PN E + V+G ++G + + + G+E
Sbjct: 527 --KSDA------IIDATQQGNVSRFINHSCDPNAETQKWTVNGDLRVGFFARKSLKSGDE 578
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+TFDY K EA C C S CRG
Sbjct: 579 VTFDYQFQRYGK---EAQRCYCESSNCRG 604
>gi|195145308|ref|XP_002013638.1| GL23289 [Drosophila persimilis]
gi|194102581|gb|EDW24624.1| GL23289 [Drosophila persimilis]
Length = 293
Score = 68.6 bits (166), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 72/151 (47%), Gaps = 23/151 (15%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C K+ GE V+E+ GE+ IRS + + + I
Sbjct: 166 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 211
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
K D D +VVDA + N A I HSC PNC +KV + GH I I+ +R I GEE+T
Sbjct: 212 KID----DNLVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 267
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY E E C CGS+ CR YLN
Sbjct: 268 YDYKFPFED----EKIPCSCGSKRCR-KYLN 293
>gi|297842509|ref|XP_002889136.1| hypothetical protein ARALYDRAFT_476894 [Arabidopsis lyrata subsp.
lyrata]
gi|297334977|gb|EFH65395.1| hypothetical protein ARALYDRAFT_476894 [Arabidopsis lyrata subsp.
lyrata]
Length = 1766
Score = 68.2 bits (165), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 43/134 (32%), Positives = 66/134 (49%), Gaps = 17/134 (12%)
Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
E F++E++GEV + + +Q + + FY + L +G + V+D
Sbjct: 1048 EGQFLIEYVGEVLDMQSYDTRQKEYACKGQKH------FYFMTL-------NGNE--VID 1092
Query: 1965 AMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY 2024
A K N I HSC PNC + V+G +GI++++ + G+E+TFDYN V
Sbjct: 1093 AGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMKDLKKGQELTFDYNYVRVFG--A 1150
Query: 2025 EASVCLCGSQVCRG 2038
A C CGS CRG
Sbjct: 1151 AAKKCYCGSSHCRG 1164
>gi|15292119|gb|AAK93328.1| LD39445p [Drosophila melanogaster]
Length = 751
Score = 68.2 bits (165), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 54/151 (35%), Positives = 71/151 (47%), Gaps = 23/151 (15%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C K+ GE V+E+ GE+ IRS + + + I
Sbjct: 624 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 669
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
K D D +VVDA + N A I H C PNC +KV + GH I I+ +R I GEE+T
Sbjct: 670 KID----DNLVVDATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 725
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY E E C CGS+ CR YLN
Sbjct: 726 YDYKFPFED----EKIPCSCGSKRCR-KYLN 751
>gi|194746360|ref|XP_001955648.1| GF16138 [Drosophila ananassae]
gi|190628685|gb|EDV44209.1| GF16138 [Drosophila ananassae]
Length = 1460
Score = 68.2 bits (165), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 76/151 (50%), Gaps = 25/151 (16%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G+VC + E FV+E++GEV E Q+ + Q+N ++ +Y + +E+
Sbjct: 1253 RGFGLVCREP--IAEGTFVIEYVGEVI---NHAEFQERLIQKQRNRDE---NYYFLGVEK 1304
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
++DA K N A + HSC PNCE + V+ +++GI+ ++ I E+
Sbjct: 1305 D---------FIIDAGPKGNLARFMNHSCEPNCETQKWTVNCVHRVGIFAIKDIPANTEL 1355
Query: 2011 TFDY---NSVTESKEEYEASVCLCGSQVCRG 2038
TF+Y + + SK+ C CG+ C G
Sbjct: 1356 TFNYLWDDLMNNSKK-----ACFCGATRCSG 1381
>gi|86278478|gb|ABC88477.1| Wolf-Hirschhorn syndrome candidate 1 protein [Danio rerio]
Length = 1366
Score = 68.2 bits (165), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 78/148 (52%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G++ ++ GE FV E++GE+ ++++ ++ E+ FY + +++
Sbjct: 1069 KGWGLISLRDIKKGE--FVNEYVGEL------IDEEECRSRIRHAQENDITHFYMLTIDK 1120
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE + V+G ++G++ V I G E+
Sbjct: 1121 DR---------IIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPAGTEL 1171
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 1172 TFNYNLDCLGNEK---TVCRCGAPNCSG 1196
>gi|15232214|ref|NP_191555.1| putative histone-lysine N-methyltransferase ASHH4 [Arabidopsis
thaliana]
gi|75264575|sp|Q9M1X9.1|ASHH4_ARATH RecName: Full=Putative histone-lysine N-methyltransferase ASHH4;
AltName: Full=ASH1 homolog 4; AltName: Full=Protein SET
DOMAIN GROUP 24
gi|7019690|emb|CAB75815.1| putative protein [Arabidopsis thaliana]
gi|332646470|gb|AEE79991.1| putative histone-lysine N-methyltransferase ASHH4 [Arabidopsis
thaliana]
Length = 352
Score = 68.2 bits (165), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 71/146 (48%), Gaps = 21/146 (14%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+V +++ GE F++E++GEV + + L K N FY +
Sbjct: 122 GYGIVADEDINSGE--FIIEYVGEV------IDDKICEERLWKLNHKVETNFYLCQINWN 173
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
+V+DA HK N + I HSC PN E + +DG +IGI+ R I+ GE++T
Sbjct: 174 ---------MVIDATHKGNKSRYINHSCSPNTEMQKWIIDGETRIGIFATRFINKGEQLT 224
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCR 2037
+DY V ++ C CG+ CR
Sbjct: 225 YDYQFVQFGADQD----CYCGAVCCR 246
>gi|168037139|ref|XP_001771062.1| histone H3 methyltransferase complex, subunit SET1 [Physcomitrella
patens subsp. patens]
gi|162677595|gb|EDQ64063.1| histone H3 methyltransferase complex, subunit SET1 [Physcomitrella
patens subsp. patens]
Length = 2607
Score = 68.2 bits (165), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 45/130 (34%), Positives = 64/130 (49%), Gaps = 23/130 (17%)
Query: 1906 DDFVVEFLGEVY--PVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVV 1963
+DFV+E++GE+ V + E+Q I + + YL R D +VV
Sbjct: 2496 EDFVIEYVGEIIRRQVSNFRERQYEIMGIGSS-----------YLFRVD------DELVV 2538
Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
DA K A I HSC PNC K+ V+G ++ IY+ R I GEE+T+DY E K+
Sbjct: 2539 DATQKGGLARFINHSCNPNCYTKIITVEGRKKVVIYSKRAIGAGEELTYDYKFSLEDKK- 2597
Query: 2024 YEASVCLCGS 2033
C CG+
Sbjct: 2598 ---IPCYCGA 2604
>gi|240254387|ref|NP_177854.6| histone-lysine N-methyltransferase SETD2 [Arabidopsis thaliana]
gi|157734196|gb|ABV68921.1| SDG8 [Arabidopsis thaliana]
gi|332197839|gb|AEE35960.1| histone-lysine N-methyltransferase SETD2 [Arabidopsis thaliana]
Length = 1805
Score = 68.2 bits (165), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 77/155 (49%), Gaps = 19/155 (12%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ + +KG G+ ++ E F++E++GEV + + +Q + + F
Sbjct: 1029 ERFQSGKKGYGLRLLED--VREGQFLIEYVGEVLDMQSYETRQKEYAFKGQKH------F 1080
Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
Y + L +G + V+DA K N I HSC PNC + V+G +GI++++
Sbjct: 1081 YFMTL-------NGNE--VIDAGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQD 1131
Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ G+E+TFDYN V A C CGS CRG
Sbjct: 1132 LKKGQELTFDYNYVRVFG--AAAKKCYCGSSHCRG 1164
>gi|406606267|emb|CCH42258.1| Histone-lysine N-methyltransferase [Wickerhamomyces ciferrii]
Length = 1074
Score = 68.2 bits (165), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 72/149 (48%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G++ + F VVE+ GEV E + + ++ K ++ +Y + LE
Sbjct: 255 KKGCGLLSIR--SFNAGSLVVEYTGEVI---HLDEVEHRLNTIYKESDS----YYFLGLE 305
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+ V+DA K + A HSC PN E + V+G +IG++ R I GEE
Sbjct: 306 ---------EEYVIDAGQKGSVARFANHSCDPNAEMQKWYVNGEPRIGLFAKRSIEAGEE 356
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
IT+DYN E E E C CGS+ C G
Sbjct: 357 ITYDYN--FEWFENGEPQKCYCGSKNCHG 383
>gi|317455359|pdb|3OPE|A Chain A, Structural Basis Of Auto-Inhibitory Mechanism Of Histone
Methyltransferase
gi|317455360|pdb|3OPE|B Chain B, Structural Basis Of Auto-Inhibitory Mechanism Of Histone
Methyltransferase
Length = 222
Score = 68.2 bits (165), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 49/160 (30%), Positives = 80/160 (50%), Gaps = 30/160 (18%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV EF
Sbjct: 77 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVVS---------------------EQEF 113
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 114 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 173
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
Y ++ + G E+T+DYN + + E+ + +C CG + CRG
Sbjct: 174 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 211
>gi|3540208|gb|AAC34358.1| Hypothetical protein [Arabidopsis thaliana]
Length = 1767
Score = 68.2 bits (165), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 43/134 (32%), Positives = 66/134 (49%), Gaps = 17/134 (12%)
Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
E F++E++GEV + + +Q + + FY + L +G + V+D
Sbjct: 1048 EGQFLIEYVGEVLDMQSYETRQKEYAFKGQKH------FYFMTL-------NGNE--VID 1092
Query: 1965 AMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY 2024
A K N I HSC PNC + V+G +GI++++ + G+E+TFDYN V
Sbjct: 1093 AGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKKGQELTFDYNYVRVFG--A 1150
Query: 2025 EASVCLCGSQVCRG 2038
A C CGS CRG
Sbjct: 1151 AAKKCYCGSSHCRG 1164
>gi|198432159|ref|XP_002123225.1| PREDICTED: similar to Wolf-Hirschhorn syndrome candidate 1 protein,
partial [Ciona intestinalis]
Length = 752
Score = 68.2 bits (165), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 72/132 (54%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ + ++ +R ++ +++ FY + +++ + ++DA
Sbjct: 305 EFVSEYVGEL------VDSEECMRRIEDAHKNNVTNFYMLTIDKDR---------IIDAG 349
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NY+ + HSC PNCE + V+G ++G++ +R I GEE+ F+YN ++
Sbjct: 350 PKGNYSRFMNHSCDPNCETQKWMVNGDTRVGLFALREIQDGEELMFNYNLDCLGNDK--- 406
Query: 2027 SVCLCGSQVCRG 2038
+ C+CGS C G
Sbjct: 407 TPCMCGSANCSG 418
>gi|94707110|sp|Q2LAE1.1|ASHH2_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHH2; AltName:
Full=ASH1 homolog 2; AltName: Full=H3-K4-HMTase; AltName:
Full=Histone H3-K36 methyltransferase 8;
Short=H3-K36-HMTase 8; AltName: Full=Protein EARLY
FLOWERING IN SHORT DAYS; AltName: Full=Protein SET DOMAIN
GROUP 8
gi|85036158|gb|ABC69038.1| SDG8 [Arabidopsis thaliana]
Length = 1759
Score = 68.2 bits (165), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 43/134 (32%), Positives = 66/134 (49%), Gaps = 17/134 (12%)
Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
E F++E++GEV + + +Q + + FY + L +G + V+D
Sbjct: 1048 EGQFLIEYVGEVLDMQSYETRQKEYAFKGQKH------FYFMTL-------NGNE--VID 1092
Query: 1965 AMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY 2024
A K N I HSC PNC + V+G +GI++++ + G+E+TFDYN V
Sbjct: 1093 AGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKKGQELTFDYNYVRVFG--A 1150
Query: 2025 EASVCLCGSQVCRG 2038
A C CGS CRG
Sbjct: 1151 AAKKCYCGSSHCRG 1164
>gi|50293843|ref|XP_449333.1| hypothetical protein [Candida glabrata CBS 138]
gi|74637287|sp|Q6FKB1.1|SET1_CANGA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|49528646|emb|CAG62307.1| unnamed protein product [Candida glabrata]
Length = 1111
Score = 68.2 bits (165), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C P+C AK+ V G +I IY +R I EE+T+DY E+
Sbjct: 1032 TVIDATKKGGIARFINHCCEPSCTAKIIKVGGKRRIVIYALRDIAANEELTYDYKFERET 1091
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
E E CLCG+ C+G +LN
Sbjct: 1092 DAE-ERLPCLCGAPSCKG-FLN 1111
>gi|254569422|ref|XP_002491821.1| hypothetical protein [Komagataella pastoris GS115]
gi|238031618|emb|CAY69541.1| hypothetical protein PAS_chr2-2_0494 [Komagataella pastoris GS115]
gi|328351679|emb|CCA38078.1| histone-lysine N-methyltransferase SETD1 [Komagataella pastoris CBS
7435]
Length = 1020
Score = 68.2 bits (165), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C+P+C AK+ V+G +I IY ++ I EE+T+DY E
Sbjct: 941 TVIDATKKGGIARFINHCCQPSCTAKIIKVEGKKRIVIYALKDIAANEELTYDYKFERED 1000
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
E E CLCG C+G YLN
Sbjct: 1001 NNE-ERIPCLCGVPGCKG-YLN 1020
>gi|145327721|ref|NP_001077836.1| histone-lysine N-methyltransferase SETD2 [Arabidopsis thaliana]
gi|332197840|gb|AEE35961.1| histone-lysine N-methyltransferase SETD2 [Arabidopsis thaliana]
Length = 1501
Score = 68.2 bits (165), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 64/134 (47%), Gaps = 17/134 (12%)
Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
E F++E++GEV + + +Q ++ FY + L + V+D
Sbjct: 1048 EGQFLIEYVGEVLDMQSYETRQ------KEYAFKGQKHFYFMTLNGNE---------VID 1092
Query: 1965 AMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY 2024
A K N I HSC PNC + V+G +GI++++ + G+E+TFDYN V
Sbjct: 1093 AGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKKGQELTFDYNYVRVFG--A 1150
Query: 2025 EASVCLCGSQVCRG 2038
A C CGS CRG
Sbjct: 1151 AAKKCYCGSSHCRG 1164
>gi|295663144|ref|XP_002792125.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226279300|gb|EEH34866.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 816
Score = 68.2 bits (165), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 80/149 (53%), Gaps = 22/149 (14%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G GV N+ F + +VE+ GE+ K E++ +R++ KNNE +Y +Y ++
Sbjct: 436 RGYGVRSNRT--FEPNQIIVEYTGEIV-TQKECERR--MRTVYKNNEC----YYLMYFDQ 486
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVR-GIHYGEE 2009
+++DA + + A + HSC PNCE + V G ++ ++ + GI GEE
Sbjct: 487 N---------MIIDAT-RGSIARFVNHSCEPNCEMEKWTVAGKPRMALFAGKNGITTGEE 536
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+T+DYN S++ + C CG++ CRG
Sbjct: 537 LTYDYNFDPYSQKNVQE--CRCGAETCRG 563
>gi|401625463|gb|EJS43472.1| set1p [Saccharomyces arboricola H-6]
Length = 1089
Score = 67.8 bits (164), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C PNC AK+ V G +I IY +R I EE+T+DY E
Sbjct: 1010 TVIDATKKGGIARFINHCCDPNCTAKIIKVGGRRRIVIYALRDIGANEELTYDYKFEREQ 1069
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+E E CLCG+ C+G +LN
Sbjct: 1070 DDE-ERLPCLCGAPNCKG-FLN 1089
>gi|302757968|ref|XP_002962407.1| hypothetical protein SELMODRAFT_438147 [Selaginella moellendorffii]
gi|300169268|gb|EFJ35870.1| hypothetical protein SELMODRAFT_438147 [Selaginella moellendorffii]
Length = 1326
Score = 67.8 bits (164), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 54/181 (29%), Positives = 82/181 (45%), Gaps = 34/181 (18%)
Query: 1873 GILKAMDSRPDDKYVAYRKGLGVVCNKEGGFG--------EDDFVVEFLGEVYPVWKWFE 1924
G+ K SR +K A +K L +K +G +DF+VE++GEV
Sbjct: 1169 GVRKLGGSRAMEKMRARKKLLKFQRSKIHAWGVVAMEVIEPEDFIVEYVGEVL-----RP 1223
Query: 1925 KQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---DLVVVDAMHKANYASRICHSCRP 1981
K +R ++ YL + G + + D V+DA + I HSC P
Sbjct: 1224 KVADVREVR-------------YLRQGLGSSYFFRVGDGFVIDATQRGGLGRFINHSCEP 1270
Query: 1982 NCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
NC K+ V+G ++ IY I G E+T+DY E ++ CLCG++ CRG +L
Sbjct: 1271 NCYPKIITVEGQKRVFIYARTHIAPGTELTYDYKFPHEDQK----IPCLCGAERCRG-FL 1325
Query: 2042 N 2042
N
Sbjct: 1326 N 1326
>gi|326432726|gb|EGD78296.1| hypothetical protein PTSG_09362 [Salpingoeca sp. ATCC 50818]
Length = 1279
Score = 67.8 bits (164), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 49/163 (30%), Positives = 73/163 (44%), Gaps = 19/163 (11%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
++ KG G+ ++ E FV+E++GE+ D ++ A ++
Sbjct: 1068 FLTQSKGWGLKAGED--IAEGQFVIEYVGEII---------DATECRRRLAASQAANDHS 1116
Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
Y+ G + VDA +KAN A I HSC PNCE + V G ++GI+ I
Sbjct: 1117 FYILSLSGSS------FVDARNKANLARFINHSCGPNCETQKWNVLGETRVGIFAKEDIP 1170
Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGA 2048
G E+TFDY +S + C CG+ CRG L E A
Sbjct: 1171 KGTELTFDYQ--LDSLGSRGRTTCHCGASSCRGVIEKLGREAA 1211
>gi|270015132|gb|EFA11580.1| trithorax [Tribolium castaneum]
Length = 2343
Score = 67.8 bits (164), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 53/159 (33%), Positives = 77/159 (48%), Gaps = 35/159 (22%)
Query: 1889 YRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN--- 1945
+R+GL + + E G + V+E+ GEV IRS+ + + ++YN
Sbjct: 2215 HRRGLFCLRDFEAG----EMVIEYSGEV------------IRSVLTDKRE---KYYNSKG 2255
Query: 1946 --IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
Y+ R D +VVDA N A I HSC PNC +KV + GH I I+ +R
Sbjct: 2256 IGCYMFRID------DNLVVDATMTGNAARFINHSCDPNCYSKVVEILGHKHIIIFALRR 2309
Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
I GEE+T+DY E + C CG++ CR +LN
Sbjct: 2310 IICGEELTYDYKFPIEE----DKIPCTCGTRRCR-KFLN 2343
>gi|340718068|ref|XP_003397494.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Bombus
terrestris]
Length = 1238
Score = 67.8 bits (164), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 66/131 (50%), Gaps = 18/131 (13%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
FV+E++GEV ++ + R L + E FY + ++ + ++DA
Sbjct: 868 FVIEYVGEV------IDEAEYKRRLHRKKELKNENFYFLTIDNNR---------MIDAEP 912
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
K N + + HSC PNCE + V+G +IG++ + I GEE+TF+YN + +
Sbjct: 913 KGNLSRFMNHSCSPNCETQKWTVNGDTRIGLFALCDIECGEELTFNYNLACDGETR---K 969
Query: 2028 VCLCGSQVCRG 2038
CLCG+ C G
Sbjct: 970 PCLCGASNCSG 980
>gi|31418293|gb|AAH53454.1| Whsc1 protein, partial [Mus musculus]
Length = 558
Score = 67.8 bits (164), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 266 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 317
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 318 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 368
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 369 TFNYNLDCLGNEK---TVCRCGASNCSG 393
>gi|452837203|gb|EME39145.1| hypothetical protein DOTSEDRAFT_75034 [Dothistroma septosporum NZE10]
Length = 1275
Score = 67.8 bits (164), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 71/143 (49%), Gaps = 17/143 (11%)
Query: 1900 EGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYD 1959
E G ++ ++E++GE K +K +R ++ + + YL R D
Sbjct: 1150 EENIGINELIIEYVGE-----KVRQKVADMREIKYEKQG----VGSSYLFRMMDDE---- 1196
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
+VDA K A I HSC PNC AK+ V+G +I IY ++ I+ +E+T+DY E
Sbjct: 1197 --IVDATKKGGIARFINHSCDPNCTAKIIKVEGTPRIVIYALKDIYKNDELTYDYKFERE 1254
Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
+ CLCGS C+G +LN
Sbjct: 1255 IG-STDRIPCLCGSANCKG-FLN 1275
>gi|441664377|ref|XP_003279042.2| PREDICTED: histone-lysine N-methyltransferase NSD2-like [Nomascus
leucogenys]
Length = 780
Score = 67.8 bits (164), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 568 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 619
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 620 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 670
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 671 TFNYNLDCLGNEK---TVCRCGASNCSG 695
>gi|452846178|gb|EME48111.1| hypothetical protein DOTSEDRAFT_167709 [Dothistroma septosporum
NZE10]
Length = 963
Score = 67.8 bits (164), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 52/174 (29%), Positives = 80/174 (45%), Gaps = 27/174 (15%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVY--PVWKWFEKQDGIRSLQKNNEDPAPEFYNIY 1947
+KG G+ +KE G DFV E++GEV V++ R +Q+ +E+ FY +
Sbjct: 218 KKGYGLRADKELRPG--DFVYEYIGEVIGENVFR--------RRMQQYDEEGIKHFY--F 265
Query: 1948 LERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYG 2007
+ KG+ VDA K N HSC PNC V+ ++GI+ R I G
Sbjct: 266 MSLTKGE-------FVDATKKGNLGRFCNHSCNPNCYVDKWVVNDKLRMGIFVERNIQAG 318
Query: 2008 EEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLD 2061
EE+ F+YN + + C CG C G + G+ E+ K H +++
Sbjct: 319 EELVFNYNV---DRYGADPQPCYCGEPNCTGY---IGGKTQTERGTKLSHTIIE 366
>gi|120974668|gb|ABM46716.1| MLL [Gorilla gorilla]
Length = 338
Score = 67.8 bits (164), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 70/151 (46%), Gaps = 21/151 (13%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C + GE V+E+ G V IRS+Q + + + I
Sbjct: 209 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKREKYYDSKGIGCYMF 254
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
+ D D VVDA N A I HSC PNC ++V +DG I I+ +R I+ GEE+T
Sbjct: 255 RID----DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELT 310
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY E C CG++ CR +LN
Sbjct: 311 YDYKFPIEDAS--NKLPCNCGAKKCR-KFLN 338
>gi|198451130|ref|XP_001358254.2| GA18567 [Drosophila pseudoobscura pseudoobscura]
gi|198131348|gb|EAL27392.2| GA18567 [Drosophila pseudoobscura pseudoobscura]
Length = 1541
Score = 67.8 bits (164), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 80/160 (50%), Gaps = 24/160 (15%)
Query: 1881 RPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
R D Y+ R G G+VC + E DF++E++GEV +++ R + + +D
Sbjct: 1345 RMDVVYMNAR-GFGLVCREP--IAEGDFIIEYVGEV------INQEEFQRRMLRKQKDRD 1395
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + +E+ ++DA K N A + HSC PNC ++ V+ ++G++
Sbjct: 1396 ENFYFLGVEKE---------FIIDAGPKGNLARFMNHSCEPNCTSQKWTVNCTNRVGLFA 1446
Query: 2001 VRGIHYGEEITFDY--NSVTESKEEYEASVCLCGSQVCRG 2038
++ I E+TF+Y + + K++ C CGS+ C G
Sbjct: 1447 IQDIPAETELTFNYLWDDLLNDKKK----ACYCGSERCSG 1482
>gi|91076142|ref|XP_970289.1| PREDICTED: similar to mixed-lineage leukemia protein, mll [Tribolium
castaneum]
Length = 1824
Score = 67.8 bits (164), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 53/159 (33%), Positives = 77/159 (48%), Gaps = 35/159 (22%)
Query: 1889 YRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN--- 1945
+R+GL + + E G + V+E+ GEV IRS+ + + ++YN
Sbjct: 1696 HRRGLFCLRDFEAG----EMVIEYSGEV------------IRSVLTDKRE---KYYNSKG 1736
Query: 1946 --IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
Y+ R D +VVDA N A I HSC PNC +KV + GH I I+ +R
Sbjct: 1737 IGCYMFRID------DNLVVDATMTGNAARFINHSCDPNCYSKVVEILGHKHIIIFALRR 1790
Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
I GEE+T+DY E + C CG++ CR +LN
Sbjct: 1791 IICGEELTYDYKFPIEE----DKIPCTCGTRRCR-KFLN 1824
>gi|62531333|gb|AAH93421.1| Whsc1 protein [Danio rerio]
Length = 320
Score = 67.8 bits (164), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G++ ++ GE FV E++GE+ E + IR+ Q+N+ FY + +++
Sbjct: 23 KGWGLISLRDIKKGE--FVNEYVGELI---DEEECRSRIRNAQEND---ITHFYMLTIDK 74
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE + V+G ++G++ V I G E+
Sbjct: 75 DR---------IIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPAGTEL 125
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 126 TFNYNLDCLGNEK---TVCRCGAPNCSG 150
>gi|194380712|dbj|BAG58509.1| unnamed protein product [Homo sapiens]
Length = 323
Score = 67.8 bits (164), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 71/151 (47%), Gaps = 21/151 (13%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C + GE V+E+ G V IRS+Q + + + I
Sbjct: 194 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKREKYYDSKGIGCYMF 239
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
+ D D VVDA N A I HSC PNC ++V +DG I I+ +R I+ GEE+T
Sbjct: 240 RID----DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELT 295
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY E + C CG++ CR +LN
Sbjct: 296 YDYKFPIE--DASNKLPCNCGAKKCR-KFLN 323
>gi|124111218|gb|ABM91999.1| MLL [Pan troglodytes]
Length = 338
Score = 67.8 bits (164), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 71/151 (47%), Gaps = 21/151 (13%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C + GE V+E+ G V IRS+Q + + + I
Sbjct: 209 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKREKYYDSKGIGCYMF 254
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
+ D D VVDA N A I HSC PNC ++V +DG I I+ +R I+ GEE+T
Sbjct: 255 RID----DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELT 310
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY E + C CG++ CR +LN
Sbjct: 311 YDYKFPIE--DASNKLPCNCGAKKCR-KFLN 338
>gi|241554585|ref|XP_002399516.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
gi|215501703|gb|EEC11197.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
Length = 544
Score = 67.8 bits (164), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 39/81 (48%), Positives = 48/81 (59%), Gaps = 5/81 (6%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
VVDA N A I HSC PNC +KV AV G I IY +R I+ GEE+T+DY K
Sbjct: 469 VVDATTHGNAARFINHSCDPNCYSKVIAVFGQKHIIIYALRKIYKGEELTYDYKF---PK 525
Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
EE + C CG++ CR +LN
Sbjct: 526 EEVKIP-CSCGARRCR-KFLN 544
>gi|363756170|ref|XP_003648301.1| hypothetical protein Ecym_8199 [Eremothecium cymbalariae DBVPG#7215]
gi|356891501|gb|AET41484.1| Hypothetical protein Ecym_8199 [Eremothecium cymbalariae DBVPG#7215]
Length = 995
Score = 67.8 bits (164), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 35/84 (41%), Positives = 47/84 (55%), Gaps = 2/84 (2%)
Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
+ V+DA K A I H C P+C AK+ V G +I IY +R I EE+T+DY
Sbjct: 914 EYTVIDATKKGGIARFINHCCDPSCTAKIIKVGGRKRIVIYALRDIAANEELTYDYKFER 973
Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
E +E E CLCG+ C+G +LN
Sbjct: 974 EVDDE-ERLPCLCGAATCKG-FLN 995
>gi|26347387|dbj|BAC37342.1| unnamed protein product [Mus musculus]
Length = 601
Score = 67.8 bits (164), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 309 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 360
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 361 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 411
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 412 TFNYNLDCLGNEK---TVCRCGASNCSG 436
>gi|168057166|ref|XP_001780587.1| trithorax-like protein, histone-lysine N-methyltransferase
[Physcomitrella patens subsp. patens]
gi|162667953|gb|EDQ54570.1| trithorax-like protein, histone-lysine N-methyltransferase
[Physcomitrella patens subsp. patens]
Length = 902
Score = 67.8 bits (164), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 50/99 (50%), Gaps = 4/99 (4%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
VVDA H A I HSC PNC ++ G +I I+ R I GEE+T+DY +++
Sbjct: 797 VVDATHAGTIAHLINHSCEPNCYSRTVTASGEDRIIIFAKRNIEVGEELTYDYRFMSKD- 855
Query: 2022 EEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLL 2060
E C CG CRGS + G+G K+ L L+
Sbjct: 856 ---EVLTCYCGCAGCRGSVNVVDGDGDSTKLSVPLSELI 891
>gi|380797995|gb|AFE70873.1| putative histone-lysine N-methyltransferase NSD2 isoform 1, partial
[Macaca mulatta]
Length = 421
Score = 67.8 bits (164), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 129 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 180
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 181 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 231
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 232 TFNYNLDCLGNEK---TVCRCGASNCSG 256
>gi|302821061|ref|XP_002992195.1| hypothetical protein SELMODRAFT_430432 [Selaginella moellendorffii]
gi|300139962|gb|EFJ06692.1| hypothetical protein SELMODRAFT_430432 [Selaginella moellendorffii]
Length = 1052
Score = 67.8 bits (164), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 44/78 (56%), Gaps = 4/78 (5%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
VVDA H + A I HSC PNC +++ VD I I+ R IH EE+T+DY ++
Sbjct: 953 VVDATHVGSMAHLINHSCEPNCYSRIITVDAKDSIIIFAKRDIHPWEELTYDYRFASKGA 1012
Query: 2022 EEYEASVCLCGSQVCRGS 2039
E VC CG+ CRGS
Sbjct: 1013 E----LVCNCGALKCRGS 1026
>gi|302800676|ref|XP_002982095.1| hypothetical protein SELMODRAFT_445108 [Selaginella moellendorffii]
gi|300150111|gb|EFJ16763.1| hypothetical protein SELMODRAFT_445108 [Selaginella moellendorffii]
Length = 1045
Score = 67.8 bits (164), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 44/78 (56%), Gaps = 4/78 (5%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
VVDA H + A I HSC PNC +++ VD I I+ R IH EE+T+DY ++
Sbjct: 946 VVDATHVGSMAHLINHSCEPNCYSRIITVDAKDSIIIFAKRDIHPWEELTYDYRFASKGA 1005
Query: 2022 EEYEASVCLCGSQVCRGS 2039
E VC CG+ CRGS
Sbjct: 1006 E----LVCNCGALKCRGS 1019
>gi|270001477|gb|EEZ97924.1| hypothetical protein TcasGA2_TC000311 [Tribolium castaneum]
Length = 1647
Score = 67.8 bits (164), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 47/156 (30%), Positives = 79/156 (50%), Gaps = 22/156 (14%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+K++ KG GV GE F++E++GEV ++ E+ I ++
Sbjct: 916 EKFMTENKGWGVRTKLPIKSGE--FILEYVGEVVSDQEFKERMATIYVNDTHH------- 966
Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
Y ++L DG +V+D + HSC+PNCE + +V+G +++ ++ +R
Sbjct: 967 YCLHL-------DGG--LVIDGHRMGGDGRFVNHSCQPNCEMQKWSVNGQFRMALFALRD 1017
Query: 2004 IHYGEEITFDYN-SVTESKEEYEASVCLCGSQVCRG 2038
I EE+T+DYN S+ E E C CGS++CRG
Sbjct: 1018 IESSEELTYDYNFSLFNPAEGQE---CKCGSEMCRG 1050
>gi|384500869|gb|EIE91360.1| hypothetical protein RO3G_16071 [Rhizopus delemar RA 99-880]
Length = 883
Score = 67.8 bits (164), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 62/138 (44%), Gaps = 22/138 (15%)
Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVV 1962
+ F++E++GEV ++ L + E A F + Y K D +
Sbjct: 269 LSSNSFIMEYIGEVITQNEF---------LHRTREYDAQGFKHYYFMTLKNDE------I 313
Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
+DA K A + HSCRPNC + + +IGI+T R I GEE+TFDY E
Sbjct: 314 IDATRKGCLARFMNHSCRPNCVTQKWVIGKKMRIGIFTSRNIKAGEELTFDYKF-----E 368
Query: 2023 EYEASV--CLCGSQVCRG 2038
Y A C CG C+G
Sbjct: 369 RYGAVAQKCFCGEVNCKG 386
>gi|195570949|ref|XP_002103466.1| GD20433 [Drosophila simulans]
gi|194199393|gb|EDX12969.1| GD20433 [Drosophila simulans]
Length = 152
Score = 67.8 bits (164), Expect = 9e-08, Method: Composition-based stats.
Identities = 54/151 (35%), Positives = 71/151 (47%), Gaps = 23/151 (15%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C K+ GE V+E+ GE+ IRS + + + I
Sbjct: 25 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 70
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
K D D +VVDA + N A I H C PNC +KV + GH I I+ +R I GEE+T
Sbjct: 71 KID----DNLVVDATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 126
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY E E C CGS+ CR YLN
Sbjct: 127 YDYKFPFED----EKIPCSCGSKRCR-KYLN 152
>gi|255711468|ref|XP_002552017.1| KLTH0B05280p [Lachancea thermotolerans]
gi|238933395|emb|CAR21579.1| KLTH0B05280p [Lachancea thermotolerans CBS 6340]
Length = 986
Score = 67.8 bits (164), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C P+C AK+ V G +I IY +R I EE+T+DY E+
Sbjct: 907 TVIDATKKGGIARFINHCCDPSCTAKIIRVGGRKRIVIYALRDIAANEELTYDYKFERET 966
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+E E C CG+ C+G +LN
Sbjct: 967 DDE-ERLPCFCGAPTCKG-FLN 986
>gi|350420881|ref|XP_003492659.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
isoform 2 [Bombus impatiens]
Length = 1239
Score = 67.4 bits (163), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 66/131 (50%), Gaps = 18/131 (13%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
FV+E++GEV ++ + R L + E FY + ++ + ++DA
Sbjct: 869 FVIEYVGEV------IDEAEYKRRLHRKKELKNENFYFLTIDNNR---------MIDAEP 913
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
K N + + HSC PNCE + V+G +IG++ + I GEE+TF+YN + +
Sbjct: 914 KGNLSRFMNHSCSPNCETQKWTVNGDTRIGLFALCDIERGEELTFNYNLACDGETR---K 970
Query: 2028 VCLCGSQVCRG 2038
CLCG+ C G
Sbjct: 971 PCLCGAPNCSG 981
>gi|452820773|gb|EME27811.1| histone-lysine N-methyltransferase isoform 1 [Galdieria sulphuraria]
Length = 769
Score = 67.4 bits (163), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 70/142 (49%), Gaps = 28/142 (19%)
Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL----- 1960
++F++E++GE+ IR QK +++ ++ +G D Y
Sbjct: 651 NEFIIEYVGEI------------IR--QKISDEREKRYFR------QGIGDSYMFRLDED 690
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA K + A + HSC N AK+ +D +I Y+ R I GEEIT+DY TE
Sbjct: 691 QIIDATRKGSVARFVNHSCESNAVAKIITIDNSKKIVFYSKRLIRAGEEITYDYKFNTE- 749
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+E +CLCG+ CR +LN
Sbjct: 750 -DENNKILCLCGAPTCR-KFLN 769
>gi|157126650|ref|XP_001654691.1| mixed-lineage leukemia protein, mll [Aedes aegypti]
gi|108873214|gb|EAT37439.1| AAEL010578-PA [Aedes aegypti]
Length = 172
Score = 67.4 bits (163), Expect = 9e-08, Method: Composition-based stats.
Identities = 58/175 (33%), Positives = 79/175 (45%), Gaps = 23/175 (13%)
Query: 1868 MKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQD 1927
M M LK Y ++ G G+ CN++ GE V+E+ GE+
Sbjct: 21 MAMRYRTLKETSKESVGVYRSHIHGRGLFCNRDIEAGE--MVIEYAGEL----------- 67
Query: 1928 GIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKV 1987
IRS + + + I K D + VVDA + N A I HSC PNC +KV
Sbjct: 68 -IRSTLTDKRERYYDSRGIGCYMFKID----EHFVVDATMRGNAARFINHSCEPNCYSKV 122
Query: 1988 TAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+ GH I I+ +R I GEE+T+DY E + C CGS+ CR YLN
Sbjct: 123 VDILGHKHIIIFALRRIVQGEELTYDYKFPFEDVK----IPCSCGSKKCR-KYLN 172
>gi|432879768|ref|XP_004073538.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Oryzias latipes]
Length = 2321
Score = 67.4 bits (163), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 68/131 (51%), Gaps = 18/131 (13%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
FV E++GEV ++++ ++ E FY + L++ + V+DA
Sbjct: 1890 FVSEYVGEV------IDEEECRARIRHAQEHDICNFYMLTLDKDR---------VIDAGP 1934
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
K N A + HSC+PNCE + V+G ++G++ ++ I GEE+TF+YN + +
Sbjct: 1935 KGNQARFMNHSCQPNCETQKWTVNGDTRVGLFALQDIAKGEELTFNYNLECRGNGK---T 1991
Query: 2028 VCLCGSQVCRG 2038
VC CG+ C G
Sbjct: 1992 VCKCGAPNCSG 2002
>gi|350420879|ref|XP_003492658.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
isoform 1 [Bombus impatiens]
Length = 1230
Score = 67.4 bits (163), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 66/131 (50%), Gaps = 18/131 (13%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
FV+E++GEV ++ + R L + E FY + ++ + ++DA
Sbjct: 860 FVIEYVGEV------IDEAEYKRRLHRKKELKNENFYFLTIDNNR---------MIDAEP 904
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
K N + + HSC PNCE + V+G +IG++ + I GEE+TF+YN + +
Sbjct: 905 KGNLSRFMNHSCSPNCETQKWTVNGDTRIGLFALCDIERGEELTFNYNLACDGETR---K 961
Query: 2028 VCLCGSQVCRG 2038
CLCG+ C G
Sbjct: 962 PCLCGAPNCSG 972
>gi|452820772|gb|EME27810.1| histone-lysine N-methyltransferase isoform 2 [Galdieria sulphuraria]
Length = 797
Score = 67.4 bits (163), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 71/140 (50%), Gaps = 24/140 (17%)
Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL---VV 1962
++F++E++GE+ IR QK +++ Y + GD+ + L +
Sbjct: 679 NEFIIEYVGEI------------IR--QKISDEREKR----YFRQGIGDSYMFRLDEDQI 720
Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
+DA K + A + HSC N AK+ +D +I Y+ R I GEEIT+DY TE +
Sbjct: 721 IDATRKGSVARFVNHSCESNAVAKIITIDNSKKIVFYSKRLIRAGEEITYDYKFNTE--D 778
Query: 2023 EYEASVCLCGSQVCRGSYLN 2042
E +CLCG+ CR +LN
Sbjct: 779 ENNKILCLCGAPTCR-KFLN 797
>gi|12642795|gb|AAK00344.1|AF330040_1 IL-5 promoter REII-region-binding protein [Homo sapiens]
gi|119602961|gb|EAW82555.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_g [Homo sapiens]
gi|133777178|gb|AAH94825.2| Wolf-Hirschhorn syndrome candidate 1 [Homo sapiens]
Length = 584
Score = 67.4 bits (163), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 292 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 343
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 344 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 394
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 395 TFNYNLDCLGNEK---TVCRCGASNCSG 419
>gi|355729163|gb|AES09785.1| Wolf-Hirschhorn syndrome candidate 1 [Mustela putorius furo]
Length = 409
Score = 67.4 bits (163), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 118 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 169
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 170 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 220
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 221 TFNYNLDCLGNEK---TVCRCGASNCSG 245
>gi|432094921|gb|ELK26329.1| Histone-lysine N-methyltransferase SETD1B [Myotis davidii]
Length = 1462
Score = 67.4 bits (163), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 45/79 (56%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 1386 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1445
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLC S+ CRG+
Sbjct: 1446 VK----IPCLCNSENCRGT 1460
>gi|297824409|ref|XP_002880087.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297325926|gb|EFH56346.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 363
Score = 67.4 bits (163), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 69/146 (47%), Gaps = 21/146 (14%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+V +E GE F++E++GEV + + L K FY + R
Sbjct: 127 GSGIVAEEEIKPGE--FIIEYVGEV------IDDKTCEERLWKMKHRGETNFYLCEITRD 178
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
+V+DA HK N + I HSC PN + + +DG +IGI+ RGI GE +T
Sbjct: 179 ---------MVIDATHKGNKSRYINHSCNPNTQMQKWIIDGETRIGIFATRGIKKGEHLT 229
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCR 2037
+DY V ++ C CG+ CR
Sbjct: 230 YDYQFVQFGADQD----CHCGAVGCR 251
>gi|195503632|ref|XP_002098733.1| GE10528 [Drosophila yakuba]
gi|194184834|gb|EDW98445.1| GE10528 [Drosophila yakuba]
Length = 1441
Score = 67.4 bits (163), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 80/164 (48%), Gaps = 25/164 (15%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G+V N+E E DFV+E++GEV + R +++ D +Y + +E+
Sbjct: 1248 RGFGLV-NREP-IAEGDFVIEYVGEV------INHAEFQRRMEQKQRDRDENYYFLGVEK 1299
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
++DA K N A + HSC PNCE + V+ +++G++ ++ I E+
Sbjct: 1300 D---------FIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGLFAIKDIPVNTEL 1350
Query: 2011 TFDY---NSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEK 2051
TF+Y + + SK+ C CG+ C G +GA ++
Sbjct: 1351 TFNYLWDDLMNNSKK-----ACFCGATRCSGEIGGKLKDGAVKE 1389
>gi|452980621|gb|EME80382.1| hypothetical protein MYCFIDRAFT_204567 [Pseudocercospora fijiensis
CIRAD86]
Length = 1200
Score = 67.4 bits (163), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 74/146 (50%), Gaps = 23/146 (15%)
Query: 1900 EGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYD 1959
E +D ++E++GE K +K +R ++ + + + YL R D
Sbjct: 1075 EENIAVNDLIIEYVGE-----KVRQKVADMREIKYDKQG----VGSSYLFRMIDDE---- 1121
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
+VDA K A I HSC PNC AK+ V+G +I IY ++ I +E+T+DY +
Sbjct: 1122 --IVDATKKGGIARFINHSCDPNCTAKIIKVEGTPRIVIYALKDIGKNDELTYDY----K 1175
Query: 2020 SKEEYEAS---VCLCGSQVCRGSYLN 2042
+ EY ++ CLCGS C+G +LN
Sbjct: 1176 FEREYGSTDRIPCLCGSANCKG-FLN 1200
>gi|123454343|ref|XP_001314927.1| SET domain containing protein [Trichomonas vaginalis G3]
gi|121897588|gb|EAY02704.1| SET domain containing protein [Trichomonas vaginalis G3]
Length = 486
Score = 67.4 bits (163), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 62/122 (50%), Gaps = 7/122 (5%)
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
K D+D Y +DA + A I HSC PNCE+++ ++G + + + ++ I+ EE+T
Sbjct: 349 KADSDHY----LDATFRGGIARWINHSCDPNCESRIIKLNGRFAVVLVAIKDINPCEELT 404
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACE 2071
+DY E E +A CLCGS CRG +LN +K E+ ++L E
Sbjct: 405 YDYKLPYEP--EDKAIKCLCGSPNCRG-WLNRDKNTLDDKTFSEVKFKNISEDVLLRLVE 461
Query: 2072 LN 2073
N
Sbjct: 462 NN 463
>gi|397568484|gb|EJK46160.1| hypothetical protein THAOC_35187 [Thalassiosira oceanica]
Length = 473
Score = 67.4 bits (163), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 67/150 (44%), Gaps = 19/150 (12%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA-PEFYNIYLE 1949
KG G++ G D V+E+ GEV E R + P P FY + L
Sbjct: 299 KGWGLI--SVDGVKSGDLVIEYAGEVID-----ESTKESRLAAWTRDHPTDPNFYVMAL- 350
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
G A Y +DA H AN A + HSC PNC V GH ++ I VR + GE
Sbjct: 351 ---GQAGWY----IDARHVANQARFVNHSCDPNCRLVPLNVAGHMRVAIVAVRDVRPGEF 403
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGS 2039
+++DY T + + C CGS CRG+
Sbjct: 404 LSYDYQFDTRQGDRF---TCRCGSSNCRGT 430
>gi|398394325|ref|XP_003850621.1| histone methyltransferase, partial [Zymoseptoria tritici IPO323]
gi|339470500|gb|EGP85597.1| histone methyltransferase [Zymoseptoria tritici IPO323]
Length = 1163
Score = 67.4 bits (163), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 70/143 (48%), Gaps = 17/143 (11%)
Query: 1900 EGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYD 1959
E +D ++E++GE K +K +R ++ + + YL R D
Sbjct: 1038 EENIAVNDLIIEYVGE-----KVRQKIADLREIRYEKQG----VGSSYLFRMIDDE---- 1084
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
+VDA K A I HSC PNC AK+ V+G +I IY ++ I +E+T+DY E
Sbjct: 1085 --IVDATKKGGIARFINHSCSPNCTAKIIKVEGTPRIVIYALKDIGKNDELTYDYKFERE 1142
Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
+ + CLCGS C+G +LN
Sbjct: 1143 M-DSTDRIPCLCGSANCKG-FLN 1163
>gi|91077840|ref|XP_971447.1| PREDICTED: similar to set domain protein [Tribolium castaneum]
Length = 1549
Score = 67.4 bits (163), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/156 (30%), Positives = 79/156 (50%), Gaps = 22/156 (14%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+K++ KG GV GE F++E++GEV ++ E+ I ++
Sbjct: 818 EKFMTENKGWGVRTKLPIKSGE--FILEYVGEVVSDQEFKERMATIYVNDTHH------- 868
Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
Y ++L DG +V+D + HSC+PNCE + +V+G +++ ++ +R
Sbjct: 869 YCLHL-------DGG--LVIDGHRMGGDGRFVNHSCQPNCEMQKWSVNGQFRMALFALRD 919
Query: 2004 IHYGEEITFDYN-SVTESKEEYEASVCLCGSQVCRG 2038
I EE+T+DYN S+ E E C CGS++CRG
Sbjct: 920 IESSEELTYDYNFSLFNPAEGQE---CKCGSEMCRG 952
>gi|449458127|ref|XP_004146799.1| PREDICTED: uncharacterized protein LOC101220062 [Cucumis sativus]
Length = 1289
Score = 67.4 bits (163), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 51/134 (38%), Positives = 64/134 (47%), Gaps = 19/134 (14%)
Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDA 1965
+DFV+E++GE+ + IR Q YL R DGY VVDA
Sbjct: 1173 EDFVIEYVGELIR-----PRISDIRERQYEKMGIGSS----YLFRLD---DGY---VVDA 1217
Query: 1966 MHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE 2025
+ A I HSC PNC KV V+G +I IY R I GEEIT++Y E K+
Sbjct: 1218 TKRGGVARFINHSCEPNCYTKVITVEGQKKIFIYAKRHISAGEEITYNYKFPLEEKK--- 1274
Query: 2026 ASVCLCGSQVCRGS 2039
C C S+ CRGS
Sbjct: 1275 -IPCNCRSRRCRGS 1287
>gi|426331996|ref|XP_004026979.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Gorilla gorilla
gorilla]
Length = 2776
Score = 67.0 bits (162), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 44/137 (32%), Positives = 69/137 (50%), Gaps = 28/137 (20%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+++ A KG G+ + G+ F++E+LGEV S Q EF
Sbjct: 2143 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2179
Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+
Sbjct: 2180 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2239
Query: 1999 YTVRGIHYGEEITFDYN 2015
Y ++ + G E+T+DYN
Sbjct: 2240 YALKDMPAGTELTYDYN 2256
>gi|156230137|gb|AAI52413.1| WHSC1 protein [Homo sapiens]
Length = 713
Score = 67.0 bits (162), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 421 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 472
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 473 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 523
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 524 TFNYNLDCLGNEK---TVCRCGASNCSG 548
>gi|383864320|ref|XP_003707627.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Megachile rotundata]
Length = 1302
Score = 67.0 bits (162), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 66/131 (50%), Gaps = 18/131 (13%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
FV+E++GEV ++ + R L + E FY + ++ + ++DA
Sbjct: 933 FVIEYVGEV------IDEAEYKRRLHRKKELKNENFYFLTIDNNR---------MIDAEP 977
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
K N + + HSC PNCE + V+G +IG++ + I GEE+TF+YN + +
Sbjct: 978 KGNLSRFMNHSCSPNCETQKWTVNGDTRIGLFALCDIEPGEELTFNYNLACDGETR---K 1034
Query: 2028 VCLCGSQVCRG 2038
CLCG+ C G
Sbjct: 1035 PCLCGAPNCSG 1045
>gi|40789042|dbj|BAA83042.2| KIAA1090 protein [Homo sapiens]
Length = 715
Score = 67.0 bits (162), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 423 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 474
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 475 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 525
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 526 TFNYNLDCLGNEK---TVCRCGASNCSG 550
>gi|374106286|gb|AEY95196.1| FABR136Wp [Ashbya gossypii FDAG1]
Length = 975
Score = 67.0 bits (162), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C P+C AK+ V G +I IY +R I EE+T+DY E+
Sbjct: 896 TVIDATKKGGIARFINHCCDPSCTAKIIKVGGMKRIVIYALRDIAANEELTYDYKFERET 955
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+E E CLCG+ C+G +LN
Sbjct: 956 DDE-ERLPCLCGAPNCKG-FLN 975
>gi|302306708|ref|NP_983083.2| ABR136Wp [Ashbya gossypii ATCC 10895]
gi|442570023|sp|Q75D88.2|SET1_ASHGO RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|299788647|gb|AAS50907.2| ABR136Wp [Ashbya gossypii ATCC 10895]
Length = 975
Score = 67.0 bits (162), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C P+C AK+ V G +I IY +R I EE+T+DY E+
Sbjct: 896 TVIDATKKGGIARFINHCCDPSCTAKIIKVGGMKRIVIYALRDIAANEELTYDYKFERET 955
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
+E E CLCG+ C+G +LN
Sbjct: 956 DDE-ERLPCLCGAPNCKG-FLN 975
>gi|196001997|ref|XP_002110866.1| hypothetical protein TRIADDRAFT_54228 [Trichoplax adhaerens]
gi|190586817|gb|EDV26870.1| hypothetical protein TRIADDRAFT_54228 [Trichoplax adhaerens]
Length = 1004
Score = 67.0 bits (162), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/152 (30%), Positives = 76/152 (50%), Gaps = 21/152 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ ++ ++ FV+E+ GEV + + FE++ + +K +Y + L
Sbjct: 132 KKGFGLRTLED--LEDNQFVLEYCGEVIDL-REFERRKRDYAKKK-----IKHYYFMTLS 183
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+ ++DA K ++ I HSC PNC + V+G +IG +T+R I E
Sbjct: 184 PNE---------IIDASRKGTFSRFINHSCDPNCVTQKWTVNGMLRIGFFTLRKIPANTE 234
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
+TFDY +E E C CGS+ CRG YL
Sbjct: 235 LTFDYQFERYGREVQE---CYCGSEKCRG-YL 262
>gi|334311241|ref|XP_003339591.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Monodelphis
domestica]
Length = 2705
Score = 67.0 bits (162), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 51/193 (26%), Positives = 92/193 (47%), Gaps = 21/193 (10%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1968 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2012
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 2013 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2069
Query: 2027 SVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEA-CELNSVSEEDYLELGR 2085
+VC CG+ C G +L + + ++ L R Q+ + E+ E++ G
Sbjct: 2070 TVCKCGAPNCSG-FLGVRPKNHPNPTEEKSKKLKRRQQVKRRSQGEITKEREDECFSCGD 2128
Query: 2086 AG-LGSCLLGGLP 2097
AG L SC G P
Sbjct: 2129 AGQLVSCKKPGCP 2141
>gi|328781326|ref|XP_003249962.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Apis mellifera]
Length = 1218
Score = 67.0 bits (162), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 65/131 (49%), Gaps = 18/131 (13%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
FV+E++GEV ++ + R L + E FY + ++ + +DA
Sbjct: 852 FVIEYVGEV------IDEAEYKRRLHRKKELKNENFYFLTIDNNR---------TIDAEP 896
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
K N + + HSC PNCE + V+G +IG++ + I GEE+TF+YN + +
Sbjct: 897 KGNLSRFMNHSCSPNCETQKWTVNGDTRIGLFALCDIEPGEELTFNYNLACDGETR---K 953
Query: 2028 VCLCGSQVCRG 2038
CLCG+ C G
Sbjct: 954 PCLCGASNCSG 964
>gi|119602957|gb|EAW82551.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_d [Homo sapiens]
Length = 742
Score = 67.0 bits (162), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 450 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 501
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 502 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 552
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 553 TFNYNLDCLGNEK---TVCRCGASNCSG 577
>gi|297282129|ref|XP_002802212.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Macaca mulatta]
Length = 713
Score = 67.0 bits (162), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 421 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 472
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 473 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 523
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 524 TFNYNLDCLGNEK---TVCRCGASNCSG 548
>gi|405966542|gb|EKC31816.1| Putative histone-lysine N-methyltransferase ASH1L [Crassostrea gigas]
Length = 2162
Score = 67.0 bits (162), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 75/155 (48%), Gaps = 26/155 (16%)
Query: 1892 GLGVVCNKEGGFG--------EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
GL V+ K+ G+G F++E+LGEV ++ ++ E+ + E
Sbjct: 1401 GLEVIVTKDRGYGIRTSDSISNGQFILEYLGEVVSEAEF---------RRRMTEEYSQER 1451
Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
++ L G V+D N + HSC PNCE + V+G Y++G++ ++
Sbjct: 1452 HHYCLNLDSG-------AVIDGYRMGNIGRYVNHSCEPNCEMQKWNVNGVYRMGLFALKD 1504
Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
I E+T+DYN + + + + +C CGS+ CRG
Sbjct: 1505 ISPNMELTYDYNFHSFNVDAQQ--LCRCGSENCRG 1537
>gi|93003038|tpd|FAA00102.1| TPA: zinc finger protein [Ciona intestinalis]
Length = 883
Score = 67.0 bits (162), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 77/158 (48%), Gaps = 20/158 (12%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G GV N + E F++E++GEV E++ R+++ N + + Y + LE
Sbjct: 130 RGWGVRTNSD--IPEGQFLLEYVGEVVS-----EREFRRRTIE--NYNAHNDHYCVQLEA 180
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
V+D AN + HSC+PNCE + V+G Y++G++ R I EE+
Sbjct: 181 G---------TVIDGYRLANEGRFVNHSCQPNCEMQKWVVNGEYRVGLFAKRPIVSSEEL 231
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGA 2048
T+DYN + + + C CGS CRG T GA
Sbjct: 232 TYDYNFHAYNLDRQQP--CRCGSSECRGVIGGKTQRGA 267
>gi|260836403|ref|XP_002613195.1| hypothetical protein BRAFLDRAFT_278042 [Branchiostoma floridae]
gi|229298580|gb|EEN69204.1| hypothetical protein BRAFLDRAFT_278042 [Branchiostoma floridae]
Length = 313
Score = 67.0 bits (162), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 55/156 (35%), Positives = 73/156 (46%), Gaps = 31/156 (19%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
G G+ C + GE V+E+ G V IRS+ D +YN
Sbjct: 184 GRGLFCKRNIDSGE--MVIEYAGMV------------IRSVLT---DKRENYYNSKGIGC 226
Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
Y+ R D Y+ VVDA N A I HSC PNC ++V V+G I I+ +R I+
Sbjct: 227 YMFR----IDDYE--VVDATMHGNAARFINHSCDPNCYSRVIQVEGKKHIVIFAMRKIYK 280
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
GEE+T+DY E + C CGS+ CR YLN
Sbjct: 281 GEELTYDYKFPIEDQN--SKIDCTCGSKRCR-KYLN 313
>gi|157278865|gb|AAI15212.1| Whsc1 protein [Danio rerio]
Length = 486
Score = 67.0 bits (162), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 78/148 (52%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G++ ++ GE FV E++GE+ ++++ ++ E+ FY + +++
Sbjct: 189 KGWGLISLRDIKKGE--FVNEYVGEL------IDEEECRSRIRHAQENDITHFYMLTIDK 240
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE + V+G ++G++ V I G E+
Sbjct: 241 DR---------IIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPAGTEL 291
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 292 TFNYNLDCLGNEK---TVCRCGAPNCSG 316
>gi|453082196|gb|EMF10244.1| hypothetical protein SEPMUDRAFT_151237 [Mycosphaerella populorum
SO2202]
Length = 1254
Score = 67.0 bits (162), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 68/142 (47%), Gaps = 27/142 (19%)
Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDA 1965
+D ++E++GE K +K +R ++ + + + L D +VDA
Sbjct: 1135 NDLIIEYVGE-----KVRQKVADMREIKYDKQGVGSSYLFRML----------DDEIVDA 1179
Query: 1966 MHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE 2025
K A I HSC PNC AK+ V+G +I IY ++ I +E+T+DY K E E
Sbjct: 1180 TKKGGIARFINHSCSPNCTAKIIKVEGTPRIVIYALKDISKNDELTYDY------KFERE 1233
Query: 2026 ASV-----CLCGSQVCRGSYLN 2042
CLCGS C+G +LN
Sbjct: 1234 IGATDRIPCLCGSANCKG-FLN 1254
>gi|349604316|gb|AEP99904.1| Histone-lysine N-methyltransferase HRX-like protein, partial [Equus
caballus]
Length = 297
Score = 67.0 bits (162), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 70/151 (46%), Gaps = 21/151 (13%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C + GE V+E+ G V IRS+Q + + + I
Sbjct: 168 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKREKYYDSKGIGCYMF 213
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
+ D D VVDA N A I HSC PNC ++V +DG I I+ +R I+ GEE+T
Sbjct: 214 RID----DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELT 269
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY E C CG++ CR +LN
Sbjct: 270 YDYKFPIEDAS--NKLPCNCGAKKCR-KFLN 297
>gi|313221636|emb|CBY36121.1| unnamed protein product [Oikopleura dioica]
Length = 207
Score = 67.0 bits (162), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/131 (32%), Positives = 70/131 (53%), Gaps = 17/131 (12%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
F++E++GE+ + IR L+++ + +Y + L+ +L ++DA
Sbjct: 44 FIIEYIGEIIS-----HDESRIR-LEESAKIGVTNYYILELD---------NLRMIDAGP 88
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
+ N A I HSC PNC V G +IGI++ R I GEE+TF+Y + +S +E +
Sbjct: 89 RGNIARFINHSCDPNCGIDPWIVQGDTRIGIFSKRDIQEGEELTFNYQ-LQQSSDEGKTK 147
Query: 2028 VCLCGSQVCRG 2038
CLCGS+ C G
Sbjct: 148 -CLCGSKNCAG 157
>gi|414589296|tpg|DAA39867.1| TPA: putative histone-lysine N-methyltransferase family protein [Zea
mays]
Length = 343
Score = 67.0 bits (162), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 71/151 (47%), Gaps = 27/151 (17%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+V E GE FV+E++GEV +D E ++ +
Sbjct: 127 GHGLVAEDEIKKGE--FVIEYVGEVI-------------------DDRTCE-NRLWTMKR 164
Query: 1952 KGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
D D Y +V+DA +K N + I HSC PN + VDG ++GI+ +R I
Sbjct: 165 LLDTDFYLCEVSSNMVIDATNKGNRSRFINHSCEPNTAMQKWTVDGETRVGIFALRDIKI 224
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
GEE+T+DYN + + A VC CGS CR
Sbjct: 225 GEELTYDYNIMYRFVQFGAAQVCHCGSSNCR 255
>gi|242048842|ref|XP_002462165.1| hypothetical protein SORBIDRAFT_02g020844 [Sorghum bicolor]
gi|241925542|gb|EER98686.1| hypothetical protein SORBIDRAFT_02g020844 [Sorghum bicolor]
Length = 341
Score = 67.0 bits (162), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 51/151 (33%), Positives = 71/151 (47%), Gaps = 31/151 (20%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+V E GE FV+E++GEV +D A E ++ +
Sbjct: 128 GHGLVAEDEIKKGE--FVIEYVGEVI-------------------DDRACE-NRLWTMKR 165
Query: 1952 KGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
D D Y +V+DA +K N + I HSC PN + + VDG ++GI+ +R I
Sbjct: 166 LNDTDFYLCEVSSNMVIDATNKGNLSRFINHSCEPNTKMQKWTVDGETRVGIFALRDIKI 225
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
GEE+T+DY V A VC CGS CR
Sbjct: 226 GEELTYDYKFVQFGA----AQVCHCGSSKCR 252
>gi|449016155|dbj|BAM79557.1| unknown RNA binding protein [Cyanidioschyzon merolae strain 10D]
Length = 1151
Score = 66.6 bits (161), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 46/139 (33%), Positives = 68/139 (48%), Gaps = 23/139 (16%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL---VVV 1963
D+++E+ GE+ +RS + + A Y ++ GD+ + + VV
Sbjct: 1033 DYIIEYRGEL------------VRSAVADLRERA------YRQQGMGDSFMFRIDADTVV 1074
Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
DA H + A + HSC PN A++ + G I Y+ R I GEEIT+DYN E +
Sbjct: 1075 DATHIGSVARFVNHSCDPNAIARIVQLGGASHILFYSKRSICVGEEITYDYNFDIED-DA 1133
Query: 2024 YEASVCLCGSQVCRGSYLN 2042
E CLCG+ CR YLN
Sbjct: 1134 SEKVPCLCGAPNCR-QYLN 1151
>gi|418528271|ref|ZP_13094221.1| nuclear protein SET [Comamonas testosteroni ATCC 11996]
gi|371454647|gb|EHN67649.1| nuclear protein SET [Comamonas testosteroni ATCC 11996]
Length = 168
Score = 66.6 bits (161), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 74/155 (47%), Gaps = 29/155 (18%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G GV ++ E + ++E++GEV W E QD ++ DP+ + Y +
Sbjct: 23 GKGVFAAQD--IAEGETIIEYVGEVI---DWQEAQD------RHPHDPSQPNHTFYFQVD 71
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
D V+DA HK N + I HSC PNC +DG ++ I +R I GEE+
Sbjct: 72 -------DERVIDATHKGNSSRWINHSCAPNC--YTDEIDG--RVYIVALRNIAAGEELN 120
Query: 2012 FDYNSVTESKEEYEASV-----CLCGSQVCRGSYL 2041
+DY + E E Y A + C CG+ CRG+ L
Sbjct: 121 YDYGLMVE--ERYTAKLKAEYACYCGAANCRGTML 153
>gi|195501654|ref|XP_002097885.1| GE26460 [Drosophila yakuba]
gi|194183986|gb|EDW97597.1| GE26460 [Drosophila yakuba]
Length = 343
Score = 66.6 bits (161), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 54/151 (35%), Positives = 71/151 (47%), Gaps = 23/151 (15%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C K+ GE V+E+ GE+ IRS + + + I
Sbjct: 216 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 261
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
K D D +VVDA + N A I H C PNC +KV + GH I I+ +R I GEE+T
Sbjct: 262 KID----DNLVVDATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 317
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY E E C CGS+ CR YLN
Sbjct: 318 YDYKFPFEE----EKIPCSCGSKRCR-KYLN 343
>gi|356507632|ref|XP_003522568.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like [Glycine
max]
Length = 2081
Score = 66.6 bits (161), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 47/153 (30%), Positives = 74/153 (48%), Gaps = 19/153 (12%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
+ +KG G+ ++ G+ F++E++GEV + + +E + +L+ + FY
Sbjct: 1230 FKCGKKGYGLKAIEDVAQGQ--FLIEYVGEVLDM-QTYEARQREYALKGHRH-----FYF 1281
Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
+ L + V+DA K N I HSC PNC + V+G IG++ +R +
Sbjct: 1282 MTLNGSE---------VIDASAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRNVK 1332
Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
EE+TFDYN V A C CGS CRG
Sbjct: 1333 KDEELTFDYNYVRVFG--AAAKKCYCGSSNCRG 1363
>gi|449463442|ref|XP_004149443.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like [Cucumis
sativus]
Length = 1814
Score = 66.6 bits (161), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/77 (45%), Positives = 42/77 (54%), Gaps = 2/77 (2%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
V+DA K N I HSC PNC + V+G IG++ +R I GEE+TFDYN V
Sbjct: 1174 VIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFG 1233
Query: 2022 EEYEASVCLCGSQVCRG 2038
A C CGS CRG
Sbjct: 1234 --AAAKKCYCGSFHCRG 1248
>gi|326496078|dbj|BAJ90660.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 362
Score = 66.6 bits (161), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 72/153 (47%), Gaps = 35/153 (22%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYP-------VWKWFEKQDGIRSLQKNNEDPAPEFY 1944
G G++ E GE FV+E++GEV +WK ++Q +
Sbjct: 139 GFGLIAEDEIKKGE--FVIEYVGEVIDDRTCEERLWK-MKRQ---------------RYT 180
Query: 1945 NIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGI 2004
N YL + +V+DA +K N + I HSC PN E + VDG ++GI+ +R I
Sbjct: 181 NFYLCEVSSN------MVIDATNKGNKSRFINHSCEPNTEMQKWTVDGETRVGIFALRDI 234
Query: 2005 HYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
GEE+T+DY V ++ C CGS CR
Sbjct: 235 ERGEELTYDYKFVQFGADQ----DCHCGSSNCR 263
>gi|264680920|ref|YP_003280830.1| nuclear protein SET [Comamonas testosteroni CNB-2]
gi|299530912|ref|ZP_07044326.1| nuclear protein SET [Comamonas testosteroni S44]
gi|262211436|gb|ACY35534.1| nuclear protein SET [Comamonas testosteroni CNB-2]
gi|298721133|gb|EFI62076.1| nuclear protein SET [Comamonas testosteroni S44]
Length = 168
Score = 66.6 bits (161), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 74/155 (47%), Gaps = 29/155 (18%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G GV ++ E + ++E++GEV W E QD ++ DP+ + Y +
Sbjct: 23 GKGVFAAQD--IAEGETIIEYVGEVI---DWQEAQD------RHPHDPSQPNHTFYFQVD 71
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
D V+DA HK N + I HSC PNC +DG ++ I +R I GEE+
Sbjct: 72 -------DERVIDATHKGNSSRWINHSCAPNC--YTDEIDG--RVYIVALRNIAAGEELN 120
Query: 2012 FDYNSVTESKEEYEASV-----CLCGSQVCRGSYL 2041
+DY + E E Y A + C CG+ CRG+ L
Sbjct: 121 YDYGLMVE--ERYTAKLKAEYACYCGAANCRGTML 153
>gi|403223606|dbj|BAM41736.1| uncharacterized protein TOT_040000118 [Theileria orientalis strain
Shintoku]
Length = 944
Score = 66.6 bits (161), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 75/158 (47%), Gaps = 15/158 (9%)
Query: 1882 PDDKYVAYR-KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
P K V + KG+G V +E E++ V E++GEV F K S + ++D
Sbjct: 639 PKLKLVYFEGKGIGAVATEE--IRENELVCEYVGEVITQTD-FHKSLASSSFAEIDDDNQ 695
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
+Y + + + V +D+ H N A I HSC PNC + V G Y++G++
Sbjct: 696 CHWYVMKVHKE---------VYIDSTHLGNVARFINHSCDPNCSSIPINVRGSYRMGVFA 746
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
R I GEE+T++Y SK C C ++ CRG
Sbjct: 747 SRKILKGEEVTYNYGFT--SKGVGGGFRCKCNAKNCRG 782
>gi|190349638|gb|ACE75882.1| multiple-myeloma-related WHSC1/MMSET isoform RE-IIBP [Homo sapiens]
Length = 704
Score = 66.6 bits (161), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ GE FV E++GE+ ++++ + ++ +E+ FY + +++
Sbjct: 412 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 463
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE V+G ++G++ V I G E+
Sbjct: 464 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 514
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 515 TFNYNLDCLGNEK---TVCRCGASNCSG 539
>gi|395505173|ref|XP_003756919.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Sarcophilus harrisii]
Length = 2717
Score = 66.2 bits (160), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 59/220 (26%), Positives = 101/220 (45%), Gaps = 35/220 (15%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
+ ++G G+ + GE FV E++GE+ ++++ ++ E FY
Sbjct: 1950 FRTLQRGWGLRTKTDIKKGE--FVNEYVGEL------IDEEECRARIRYAQEHDITNFYM 2001
Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
+ L++ + ++DA K NYA + H C+PNCE + +V+G ++G++ + I
Sbjct: 2002 LTLDKDR---------IIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIK 2052
Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRG-------SYLNLTGEGAFEKVLKELHG 2058
G E+TF+YN + +VC CG+ C G ++ N T E + K LK
Sbjct: 2053 AGTELTFNYNLECLGNGK---TVCKCGAPNCSGFLGVRPKNHPNPTEEKS--KKLKRKQQ 2107
Query: 2059 LLDRHQLMLEACELNSVSEEDYLELGRAG-LGSCLLGGLP 2097
+ R Q E+ E++ G AG L SC G P
Sbjct: 2108 VKRRSQG-----EITKEREDECFSCGDAGQLVSCKKPGCP 2142
>gi|270014006|gb|EFA10454.1| hypothetical protein TcasGA2_TC012700 [Tribolium castaneum]
Length = 1740
Score = 66.2 bits (160), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 69/131 (52%), Gaps = 18/131 (13%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
FV+E++GE+ ++Q+ R +QK +E +Y + +++ + ++DA
Sbjct: 1385 FVIEYVGEM------IDEQEYQRRVQKMHEQKEENYYFLTIDKDR---------MLDAGP 1429
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
K N A + HSC PNCE + V+G ++G++ I G E+TF+YN KE+
Sbjct: 1430 KGNVARFMNHSCDPNCETQKWTVNGDTRVGLFANCDIPAGTELTFNYNLECIGKEK---K 1486
Query: 2028 VCLCGSQVCRG 2038
+C CG+ C G
Sbjct: 1487 ICHCGAPNCSG 1497
>gi|260833262|ref|XP_002611576.1| hypothetical protein BRAFLDRAFT_117164 [Branchiostoma floridae]
gi|229296947|gb|EEN67586.1| hypothetical protein BRAFLDRAFT_117164 [Branchiostoma floridae]
Length = 734
Score = 66.2 bits (160), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 76/155 (49%), Gaps = 26/155 (16%)
Query: 1892 GLGVVCNKEGGFG--------EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
GL + K+ G+G + +F++E++GEV ++ ++ + +N
Sbjct: 86 GLERIVTKDRGYGVRSKTPIPQGNFILEYVGEVVSEQEF--RRRTVEIYHDHNH-----H 138
Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
Y + L + V+D + HSC PNCE + +V+G Y+IG++ +R
Sbjct: 139 YCLNL---------HSGAVIDGYKYGCEGRFVNHSCEPNCEMQKWSVNGVYRIGLFALRD 189
Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
I GEE+T+DYN + E+ + +C CGS CRG
Sbjct: 190 IPAGEELTYDYNFHAFNMEKQQ--ICKCGSAKCRG 222
>gi|347972366|ref|XP_316738.5| AGAP004656-PA [Anopheles gambiae str. PEST]
gi|333469400|gb|EAA11974.5| AGAP004656-PA [Anopheles gambiae str. PEST]
Length = 1259
Score = 66.2 bits (160), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 47/150 (31%), Positives = 79/150 (52%), Gaps = 24/150 (16%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ G+ FV+E++GEV ++ + +++ ++ N +Y + +E
Sbjct: 1032 KGFGLVALEDLKSGQ--FVIEYVGEVINSEEFDRRVMMMQAAKETN------YYFLTVEP 1083
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
DL + DA K N + I HSC PNCE + + IG++ ++ I+ GEE+
Sbjct: 1084 --------DLTI-DAGPKGNVSRFINHSCEPNCETQKWTIGETRVIGLFAIKDINAGEEL 1134
Query: 2011 TFDYN--SVTESKEEYEASVCLCGSQVCRG 2038
TF+YN S+ +K VCLCG+ C G
Sbjct: 1135 TFNYNLESLGNNKR-----VCLCGAGKCSG 1159
>gi|442621474|ref|NP_001263029.1| Mes-4, isoform B [Drosophila melanogaster]
gi|440217972|gb|AGB96409.1| Mes-4, isoform B [Drosophila melanogaster]
Length = 1423
Score = 66.2 bits (160), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 74/151 (49%), Gaps = 25/151 (16%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G+V + G DFV+E++GEV + R +++ D +Y + +E+
Sbjct: 1240 RGFGLVNREPIAVG--DFVIEYVGEV------INHAEFQRRMEQKQRDRDENYYFLGVEK 1291
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
++DA K N A + HSC PNCE + V+ +++GI+ ++ I E+
Sbjct: 1292 D---------FIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIKDIPVNSEL 1342
Query: 2011 TFDY---NSVTESKEEYEASVCLCGSQVCRG 2038
TF+Y + + SK+ C CG++ C G
Sbjct: 1343 TFNYLWDDLMNNSKK-----ACFCGAKRCSG 1368
>gi|410080444|ref|XP_003957802.1| hypothetical protein KAFR_0F00700 [Kazachstania africana CBS 2517]
gi|372464389|emb|CCF58667.1| hypothetical protein KAFR_0F00700 [Kazachstania africana CBS 2517]
Length = 1133
Score = 66.2 bits (160), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/81 (43%), Positives = 46/81 (56%), Gaps = 2/81 (2%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
V+DA K A I H C P+C AK+ V G +I IY +R I EE+T+DY E
Sbjct: 1055 VIDATKKGGIARFINHCCDPSCTAKIIKVGGKRRIVIYALRDIAKNEELTYDYKFEREQD 1114
Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
+E E CLCG+ C+G +LN
Sbjct: 1115 DE-ERLPCLCGAPNCKG-FLN 1133
>gi|412991390|emb|CCO16235.1| unnamed protein product [Bathycoccus prasinos]
Length = 825
Score = 66.2 bits (160), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 63/137 (45%), Gaps = 17/137 (12%)
Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
E DF+VE++GE+ ++++ R L P FY + + + ++D
Sbjct: 518 EGDFIVEYMGEI------VDEEECTRRLLACKGKNEPNFYLMEITPSQ---------IID 562
Query: 1965 AMHKANYASRICHSCRPNCEAK--VTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
A N A I SC PNCE + V A ++GI+ I G E+T+DYN E
Sbjct: 563 ARFCGNNARFINSSCHPNCETQRWVDASTNETRVGIFATEDIKSGTELTYDYNFAHFGGE 622
Query: 2023 EYEASVCLCGSQVCRGS 2039
+ C CG +C+G+
Sbjct: 623 GTTSFTCFCGHPMCKGT 639
>gi|156393989|ref|XP_001636609.1| predicted protein [Nematostella vectensis]
gi|156223714|gb|EDO44546.1| predicted protein [Nematostella vectensis]
Length = 213
Score = 66.2 bits (160), Expect = 2e-07, Method: Composition-based stats.
Identities = 48/137 (35%), Positives = 65/137 (47%), Gaps = 25/137 (18%)
Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL---VV 1962
D+ V+E++GEV IR + + Y ER G + + L +
Sbjct: 97 DEMVIEYVGEV------------IRQAIADYRE------RCYEERGIGSSYMFRLDETTI 138
Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
+DA N+A I H C PNC AKV AV+ +I IY+ R I EEIT+DY E
Sbjct: 139 IDATTMGNFARFINHCCDPNCYAKVIAVENMKKIVIYSKRDIQVDEEITYDYKFPIED-- 196
Query: 2023 EYEASVCLCGSQVCRGS 2039
E CLCG+ CRG+
Sbjct: 197 --EKIPCLCGAPQCRGT 211
>gi|24650756|ref|NP_733239.1| Mes-4, isoform A [Drosophila melanogaster]
gi|29427833|sp|Q8MT36.2|MES4_DROME RecName: Full=Probable histone-lysine N-methyltransferase Mes-4;
AltName: Full=Maternal-effect sterile 4 homolog
gi|23172478|gb|AAF56762.2| Mes-4, isoform A [Drosophila melanogaster]
gi|94400569|gb|ABF17912.1| FI01019p [Drosophila melanogaster]
Length = 1427
Score = 66.2 bits (160), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 74/151 (49%), Gaps = 25/151 (16%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G+V + G DFV+E++GEV + R +++ D +Y + +E+
Sbjct: 1244 RGFGLVNREPIAVG--DFVIEYVGEV------INHAEFQRRMEQKQRDRDENYYFLGVEK 1295
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
++DA K N A + HSC PNCE + V+ +++GI+ ++ I E+
Sbjct: 1296 D---------FIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIKDIPVNSEL 1346
Query: 2011 TFDY---NSVTESKEEYEASVCLCGSQVCRG 2038
TF+Y + + SK+ C CG++ C G
Sbjct: 1347 TFNYLWDDLMNNSKK-----ACFCGAKRCSG 1372
>gi|348516272|ref|XP_003445663.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like
[Oreochromis niloticus]
Length = 595
Score = 66.2 bits (160), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 65/140 (46%), Gaps = 25/140 (17%)
Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYD 1959
D+ V+E++G++ IR + + + E I YL R D
Sbjct: 476 IAADEMVIEYVGQI------------IRQVIADMREQRYEEEGIGSSYLFRVDQDT---- 519
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
++DA N A I HSC PNC AK+ V+ +I IY+ + I+ EEIT+DY E
Sbjct: 520 --IIDATKCGNLARFINHSCNPNCYAKIITVESQKKIVIYSRQPININEEITYDYKFPIE 577
Query: 2020 SKEEYEASVCLCGSQVCRGS 2039
+ CLCG+ CRGS
Sbjct: 578 ETK----IPCLCGADGCRGS 593
>gi|302798461|ref|XP_002980990.1| hypothetical protein SELMODRAFT_3415 [Selaginella moellendorffii]
gi|300151044|gb|EFJ17691.1| hypothetical protein SELMODRAFT_3415 [Selaginella moellendorffii]
Length = 242
Score = 66.2 bits (160), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 64/132 (48%), Gaps = 19/132 (14%)
Query: 1908 FVVEFLGEVYPVWKW-FEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
FV+E++GEV + +++ R QK+ FY + L + V+DA
Sbjct: 97 FVIEYVGEVLDSRSFELRQKEYARQRQKH-------FYFMTLNSSE---------VIDAC 140
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K N I HSC PNC+ + V+G IG++ +R + EEITF+YN E A
Sbjct: 141 RKGNLGRFINHSCEPNCQTEKWCVNGEICIGLFAIRDVAKNEEITFNYN--FERLYGAAA 198
Query: 2027 SVCLCGSQVCRG 2038
C CGS CRG
Sbjct: 199 KKCHCGSAHCRG 210
>gi|195352984|ref|XP_002042990.1| GM16309 [Drosophila sechellia]
gi|194127055|gb|EDW49098.1| GM16309 [Drosophila sechellia]
Length = 1418
Score = 66.2 bits (160), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 43/151 (28%), Positives = 75/151 (49%), Gaps = 25/151 (16%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G+V N+E DFV+E++GEV + R +++ D +Y + +E+
Sbjct: 1235 RGFGLV-NREP-IAAGDFVIEYVGEV------INHAEFQRRMEQKQRDRDENYYFLGVEK 1286
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
++DA K N A + HSC PNCE + V+ +++GI+ ++ I E+
Sbjct: 1287 D---------FIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIKDIPVNTEL 1337
Query: 2011 TFDY---NSVTESKEEYEASVCLCGSQVCRG 2038
TF+Y + + SK+ C CG++ C G
Sbjct: 1338 TFNYLWDDLMNNSKK-----ACFCGAKRCSG 1363
>gi|91090902|ref|XP_973711.1| PREDICTED: similar to NSD1 [Tribolium castaneum]
Length = 1795
Score = 66.2 bits (160), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 69/131 (52%), Gaps = 18/131 (13%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
FV+E++GE+ ++Q+ R +QK +E +Y + +++ + ++DA
Sbjct: 1440 FVIEYVGEM------IDEQEYQRRVQKMHEQKEENYYFLTIDKDR---------MLDAGP 1484
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
K N A + HSC PNCE + V+G ++G++ I G E+TF+YN KE+
Sbjct: 1485 KGNVARFMNHSCDPNCETQKWTVNGDTRVGLFANCDIPAGTELTFNYNLECIGKEK---K 1541
Query: 2028 VCLCGSQVCRG 2038
+C CG+ C G
Sbjct: 1542 ICHCGAPNCSG 1552
>gi|157734198|gb|ABV68922.1| SDG25 [Arabidopsis thaliana]
Length = 1388
Score = 66.2 bits (160), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/137 (35%), Positives = 66/137 (48%), Gaps = 25/137 (18%)
Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVV 1962
+DFV+E++GE+ IRS + E I YL R DGY V
Sbjct: 1272 EDFVIEYVGEL------------IRSSISEIRERQYEKMGIGSSYLFRLD---DGY---V 1313
Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
+DA + A I HSC PNC K+ +V+G +I IY R I GEEI+++Y E
Sbjct: 1314 LDATKRGGIARFINHSCEPNCYTKIISVEGKKKIFIYAKRHIDAGEEISYNYKFPLED-- 1371
Query: 2023 EYEASVCLCGSQVCRGS 2039
+ C CG+ CRGS
Sbjct: 1372 --DKIPCNCGAPKCRGS 1386
>gi|449477606|ref|XP_002188016.2| PREDICTED: histone-lysine N-methyltransferase SETD1B-like
[Taeniopygia guttata]
Length = 228
Score = 66.2 bits (160), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 152 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 211
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 212 VK----IPCLCGSENCRGT 226
>gi|260830013|ref|XP_002609956.1| hypothetical protein BRAFLDRAFT_124382 [Branchiostoma floridae]
gi|229295318|gb|EEN65966.1| hypothetical protein BRAFLDRAFT_124382 [Branchiostoma floridae]
Length = 902
Score = 66.2 bits (160), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 77/155 (49%), Gaps = 26/155 (16%)
Query: 1892 GLGVVCNKEGGFG--------EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
GL + K+ G+G + +F++E++GEV E++ R+++ ++
Sbjct: 114 GLERIVTKDRGYGVRSKTPIPQGNFILEYVGEVVS-----EQEFRRRTVEIYHDHNHHYC 168
Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
N++ V+D + HSC PNCE + +V+G Y+IG++ +R
Sbjct: 169 LNLH-----------SGAVIDGYKYGCEGRFVNHSCEPNCEMQKWSVNGVYRIGLFALRD 217
Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
I GEE+T+DYN + E+ + +C CGS CRG
Sbjct: 218 IPAGEELTYDYNFHAFNMEKQQ--ICKCGSAKCRG 250
>gi|326436327|gb|EGD81897.1| hypothetical protein PTSG_11893 [Salpingoeca sp. ATCC 50818]
Length = 296
Score = 66.2 bits (160), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 69/141 (48%), Gaps = 29/141 (20%)
Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI-----YLERPKGDADGYD 1959
+D+ V+E++GE+ +R Q ED + I YL R D
Sbjct: 177 KDELVIEYVGEI------------VR--QTVAEDRERRYARIGIGSSYLFRIDED----- 217
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
V+DA + A I HSC NC A+V +VDG +IGIY+ R I EEIT+DY
Sbjct: 218 -YVIDATRMGSIARFINHSCDANCYAQVVSVDGKKRIGIYSKRPIAANEEITYDYKF--- 273
Query: 2020 SKEEYEASV-CLCGSQVCRGS 2039
+EE + C CG++ CRG+
Sbjct: 274 PREEGPNKIPCFCGARTCRGT 294
>gi|171462836|ref|YP_001796949.1| nuclear protein SET [Polynucleobacter necessarius subsp. necessarius
STIR1]
gi|171192374|gb|ACB43335.1| nuclear protein SET [Polynucleobacter necessarius subsp. necessarius
STIR1]
Length = 163
Score = 66.2 bits (160), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 53/158 (33%), Positives = 77/158 (48%), Gaps = 27/158 (17%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G GV K GE ++E+ GE WK EK+ + +DP FY LE
Sbjct: 25 GKGVFVAKPIKKGEA--IIEYKGERIS-WKLAEKRH-----PHDPKDPNHTFY-FSLE-- 73
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
D V+DA + N A I HSC+P+CE + + +G ++ IY R + GEE+
Sbjct: 74 -------DGRVIDAKYGGNAARWINHSCKPSCETREDSFNGEPRVFIYAKRALKVGEELF 126
Query: 2012 FDYNSVTES------KEEYEASVCLCGSQVCRGSYLNL 2043
+DY+ E K++YE C CG++ CRG+ L L
Sbjct: 127 YDYSLDIEGKITKQMKKDYE---CRCGAKKCRGTMLAL 161
>gi|428171302|gb|EKX40220.1| hypothetical protein GUITHDRAFT_75734 [Guillardia theta CCMP2712]
Length = 156
Score = 66.2 bits (160), Expect = 2e-07, Method: Composition-based stats.
Identities = 34/91 (37%), Positives = 49/91 (53%), Gaps = 9/91 (9%)
Query: 1949 ERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGE 2008
E P+G A +VDA + N I H C PNCEAK+ ++G +I I + + +GE
Sbjct: 73 EPPEGRA-----AIVDATIRHNIGHYINHCCDPNCEAKILKINGQRRIIISAIHDVQFGE 127
Query: 2009 EITFDYNSVTESKEEYEASVCLCGSQVCRGS 2039
E+T+DY E K+ C CG+ CRG+
Sbjct: 128 ELTYDYKLPFEDKK----IPCHCGAPTCRGT 154
>gi|221069761|ref|ZP_03545866.1| Histone-lysine N-methyltransferase [Comamonas testosteroni KF-1]
gi|220714784|gb|EED70152.1| Histone-lysine N-methyltransferase [Comamonas testosteroni KF-1]
Length = 168
Score = 66.2 bits (160), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 52/155 (33%), Positives = 74/155 (47%), Gaps = 29/155 (18%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G GV ++ GE ++E++GEV W E QD ++ DP+ + Y +
Sbjct: 23 GKGVFAAQDIAQGET--LIEYVGEVI---DWQEAQD------RHPHDPSQPNHTFYFQVD 71
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
D V+DA HK N + I HSC PNC +DG +I I +R I GEE+
Sbjct: 72 -------DERVIDATHKGNSSRWINHSCDPNC--YTDEIDG--RIYIIALRNIAAGEELN 120
Query: 2012 FDYNSVTESKEEYEASV-----CLCGSQVCRGSYL 2041
+DY + E E Y A + C CG+ CRG+ L
Sbjct: 121 YDYGLMVE--ERYTAKLKAEYACYCGAANCRGTML 153
>gi|47226564|emb|CAG08580.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1404
Score = 66.2 bits (160), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 72/148 (48%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G+ N+ GE FV+E++GEV + ++ + +++ +E+ FY + L +
Sbjct: 1145 RGWGLKANQPIKKGE--FVIEYVGEV------IDAEECQQRIKRAHENHMTNFYMLTLTK 1196
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ V+DA K N + I HSC PNCE + V+G IG++ + I E+
Sbjct: 1197 DR---------VIDAGQKGNLSRFINHSCSPNCETQKWTVNGDVHIGLFALCDIETDTEL 1247
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN + C CGS C G
Sbjct: 1248 TFNYNLHCVGNRR---ATCNCGSDNCSG 1272
>gi|28277052|gb|AAH44818.1| Mll1 protein, partial [Mus musculus]
Length = 142
Score = 66.2 bits (160), Expect = 3e-07, Method: Composition-based stats.
Identities = 51/153 (33%), Positives = 72/153 (47%), Gaps = 25/153 (16%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI--YLE 1949
G G+ C + GE V+E+ G V IRS+Q + + + I Y+
Sbjct: 13 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKREKYYDSKGIGCYMF 58
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
R D VVDA N A I HSC PNC ++V +DG I I+ +R I+ GEE
Sbjct: 59 RID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEE 112
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+T+DY E + C CG++ CR +LN
Sbjct: 113 LTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 142
>gi|431912177|gb|ELK14315.1| Histone-lysine N-methyltransferase SETD1B [Pteropus alecto]
Length = 245
Score = 66.2 bits (160), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 45/79 (56%), Gaps = 4/79 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
++DA N+A I HSC PNC AKV V+ +I IY+ + I EEIT+DY E
Sbjct: 169 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHISVNEEITYDYKFPIED 228
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
+ CLCGS+ CRG+
Sbjct: 229 VK----IPCLCGSENCRGT 243
>gi|357157974|ref|XP_003577976.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like
[Brachypodium distachyon]
Length = 338
Score = 66.2 bits (160), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 47/153 (30%), Positives = 74/153 (48%), Gaps = 35/153 (22%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYP-------VWKWFEKQDGIRSLQKNNEDPAPEFY 1944
G G+V + G + +F++E++GEV +WK ++Q +
Sbjct: 115 GFGLV--ADDGIQKGEFIIEYVGEVIDDRTCEERLWK-MKRQ---------------RYT 156
Query: 1945 NIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGI 2004
N YL + +V+DA +K N + I HSC+PN E + VDG ++GI+ +R I
Sbjct: 157 NFYLCEVSSN------MVIDATNKGNKSRFINHSCQPNTEMQKWTVDGETRVGIFALRDI 210
Query: 2005 HYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
GEE+T+DY V ++ C CGS CR
Sbjct: 211 KKGEELTYDYKFVQFGADQ----DCHCGSSKCR 239
>gi|432952957|ref|XP_004085262.1| PREDICTED: histone-lysine N-methyltransferase NSD2-like, partial
[Oryzias latipes]
Length = 1167
Score = 66.2 bits (160), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/148 (28%), Positives = 79/148 (53%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+V ++ G+ FV E++GE+ ++++ ++ +E+ FY + +++
Sbjct: 967 KGWGLVALRDIKKGK--FVNEYIGEL------IDEEECRARIKYAHENNITNFYMLTIDK 1018
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE + V+G ++G++ V I G E+
Sbjct: 1019 DR---------IIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCHIPAGTEL 1069
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ ++C CG+ C G
Sbjct: 1070 TFNYNLDCLGNEK---TICRCGAPNCSG 1094
>gi|327265653|ref|XP_003217622.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Anolis carolinensis]
Length = 2106
Score = 65.9 bits (159), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 40/148 (27%), Positives = 75/148 (50%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G+ ++ GE FV E++GE+ ++++ ++ E FY + L++
Sbjct: 1383 RGWGLQAKRDIKKGE--FVNEYVGEL------IDEEECRARIRHAQEHDITNFYMLTLDK 1434
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NYA + H C+PNCE + +V+G ++G++ + + G E+
Sbjct: 1435 DR---------IIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFAITNVKAGTEL 1485
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN + +VC CG+ C G
Sbjct: 1486 TFNYNLECLGNGK---TVCKCGAPNCSG 1510
>gi|367004711|ref|XP_003687088.1| hypothetical protein TPHA_0I01480 [Tetrapisispora phaffii CBS 4417]
gi|357525391|emb|CCE64654.1| hypothetical protein TPHA_0I01480 [Tetrapisispora phaffii CBS 4417]
Length = 1030
Score = 65.9 bits (159), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA K A I H C P+C AK+ V G +I IY +R I EE+T+DY E
Sbjct: 951 TVIDATKKGGIARFINHCCDPSCTAKIIKVGGKKRIVIYALRDIDVNEELTYDYKFEREE 1010
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
++ E CLCG+ C+G +LN
Sbjct: 1011 DDQ-ERLPCLCGAPNCKG-FLN 1030
>gi|449679772|ref|XP_002161520.2| PREDICTED: histone-lysine N-methyltransferase MLL-like [Hydra
magnipapillata]
Length = 281
Score = 65.9 bits (159), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 35/84 (41%), Positives = 48/84 (57%), Gaps = 5/84 (5%)
Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
D VVDA K N A I HSC PNC +++ ++DG +I IY + + GEE+T+DY
Sbjct: 203 DTDVVDATTKGNAARFINHSCEPNCFSRIISIDGCKKIIIYAQKRVTVGEELTYDYKFAI 262
Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
E + C CG++ CR YLN
Sbjct: 263 ED----DKLPCFCGAKKCR-KYLN 281
>gi|33305503|gb|AAQ02781.1|AF373874_1 Mll protein [Xenopus laevis]
Length = 84
Score = 65.9 bits (159), Expect = 3e-07, Method: Composition-based stats.
Identities = 36/84 (42%), Positives = 47/84 (55%), Gaps = 3/84 (3%)
Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
D VVDA N A I HSC PNC ++V +DG I I+ +R I+ GEE+T+DY
Sbjct: 4 DSEVVDATMHGNAARFINHSCEPNCYSRVIPIDGQKHIVIFAMRKIYRGEELTYDYKFPI 63
Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
E A C CG++ CR +LN
Sbjct: 64 EDANNKLA--CNCGTKKCR-KFLN 84
>gi|354472091|ref|XP_003498274.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Cricetulus griseus]
Length = 1436
Score = 65.9 bits (159), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 79/158 (50%), Gaps = 21/158 (13%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + RKG G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1145 PDAEIIKTERKGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1196
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1197 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1247
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ I G E+TF+YN +VC CGS C G
Sbjct: 1248 ICDIPAGMELTFNYNLDCLGNGR---TVCHCGSDNCSG 1282
>gi|396478086|ref|XP_003840449.1| hypothetical protein LEMA_P101010.1 [Leptosphaeria maculans JN3]
gi|312217021|emb|CBX96970.1| hypothetical protein LEMA_P101010.1 [Leptosphaeria maculans JN3]
Length = 962
Score = 65.9 bits (159), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 67/149 (44%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ NK+ G DFV E++GEV EK R LQ ++E FY ++
Sbjct: 243 KKGFGLRANKDMAPG--DFVFEYIGEVI-----DEKTFRRRMLQYDHEG-IKHFY--FMS 292
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
KG+ VDA K N HSC PNC V ++GI+ R + GEE
Sbjct: 293 LTKGE-------FVDATKKGNLGRFCNHSCNPNCFVDKWVVGDKLRMGIFVERRVQAGEE 345
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ F+YN + + C CG C G
Sbjct: 346 LVFNYNV---DRYGADPQPCYCGEPNCSG 371
>gi|356518575|ref|XP_003527954.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like [Glycine
max]
Length = 2037
Score = 65.9 bits (159), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 48/149 (32%), Positives = 72/149 (48%), Gaps = 19/149 (12%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ + G+ F++E++GEV + + +E + +L+ + FY + L
Sbjct: 1190 KKGYGLKAIENVAQGQ--FLIEYVGEVLDM-QAYEARQREYALKGHRH-----FYFMTLN 1241
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+ V+DA K N I HSC PNC + V+G IG++ +R I EE
Sbjct: 1242 GSE---------VIDASAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKKDEE 1292
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+TFDYN V A C CGS CRG
Sbjct: 1293 LTFDYNYVRVFG--AAAKKCYCGSPNCRG 1319
>gi|354472093|ref|XP_003498275.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
[Cricetulus griseus]
Length = 1387
Score = 65.9 bits (159), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 79/158 (50%), Gaps = 21/158 (13%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + RKG G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1096 PDAEIIKTERKGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1147
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1148 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1198
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ I G E+TF+YN +VC CGS C G
Sbjct: 1199 ICDIPAGMELTFNYNLDCLGNGR---TVCHCGSDNCSG 1233
>gi|198437220|ref|XP_002124518.1| PREDICTED: similar to Histone-lysine N-methyltransferase HRX (Zinc
finger protein HRX) (ALL-1) (Trithorax-like protein)
(Lysine N-methyltransferase 2A) (CXXC-type zinc finger
protein 7) [Ciona intestinalis]
Length = 3406
Score = 65.9 bits (159), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/152 (28%), Positives = 72/152 (47%), Gaps = 20/152 (13%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
Y + G G+ C ++ GE ++E+ G++ +Q+ +K E + Y
Sbjct: 3271 YRSTIHGRGLYCKRDFDSGE--MIMEYTGQII-------RQELTDKREKYYESKSIGCYM 3321
Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
++ D VVDA + A I HSC PNC +++ +G I I+ +R I+
Sbjct: 3322 FRMD---------DFYVVDATVLGSGARFINHSCDPNCYSRIVQFEGKKHIVIFALREIY 3372
Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
GEE+T+DY E +E C CG+++CR
Sbjct: 3373 KGEELTYDYKFPIE--DENHKIACTCGARLCR 3402
>gi|326928449|ref|XP_003210391.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like, partial [Meleagris gallopavo]
Length = 2336
Score = 65.9 bits (159), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 67/132 (50%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1669 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1713
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + V+G ++G++ + I G E+TF+YN +
Sbjct: 1714 PKGNYARFMNHCCQPNCETQKWCVNGDTRVGLFAIVNIKAGTELTFNYNLECLGNGK--- 1770
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1771 TVCKCGAPNCSG 1782
>gi|345493934|ref|XP_001600694.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Nasonia
vitripennis]
Length = 1382
Score = 65.9 bits (159), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 76/148 (51%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G+V + G+ F++E++GEV E + +R LQ+ E +Y + ++
Sbjct: 1016 RGWGLVSLEPIKHGQ--FIIEYVGEVID-----EAEYKLR-LQQKKERKNENYYFLTIDN 1067
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K N + + HSC+PNCE + V+G +IG++ +R I GEE+
Sbjct: 1068 SR---------MIDAEPKGNLSRFMNHSCQPNCETQKWKVNGDTRIGLFALRDIEPGEEL 1118
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN + + CLC + C G
Sbjct: 1119 TFNYNLACDGETR---KPCLCKAPNCSG 1143
>gi|403072167|pdb|4FMU|A Chain A, Crystal Structure Of Methyltransferase Domain Of Human Set
Domain- Containing Protein 2 Compound: Pr-Snf
gi|407944022|pdb|4H12|A Chain A, The Crystal Structure Of Methyltransferase Domain Of Human
Set Domain- Containing Protein 2 In Complex With
S-Adenosyl-L-Homocysteine
Length = 278
Score = 65.9 bits (159), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 31/77 (40%), Positives = 43/77 (55%), Gaps = 3/77 (3%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
++DA K N + + HSC PNCE + V+G ++G +T + + G E+TFDY K
Sbjct: 181 IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSELTFDYQFQRYGK 240
Query: 2022 EEYEASVCLCGSQVCRG 2038
EA C CGS CRG
Sbjct: 241 ---EAQKCFCGSANCRG 254
>gi|358253063|dbj|GAA51760.1| histone-lysine N-methyltransferase NSD1/2 [Clonorchis sinensis]
Length = 1596
Score = 65.9 bits (159), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 67/131 (51%), Gaps = 18/131 (13%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
FV E++G++ ++++ R L+ +E+ +Y + L+ + ++DA
Sbjct: 1074 FVNEYIGDL------IDEEEANRRLRFAHENNVTNYYMMKLDAQR---------IIDAGP 1118
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
K N + + H C PN + V+G +IG++ VR I GEE+TFDYN V +E
Sbjct: 1119 KGNLSRFMNHCCDPNLNTQKWTVNGDNRIGLFAVRDIAAGEELTFDYNFVALGQERLN-- 1176
Query: 2028 VCLCGSQVCRG 2038
C CG++ C G
Sbjct: 1177 -CRCGAENCTG 1186
>gi|223365716|pdb|2W5Y|A Chain A, Binary Complex Of The Mixed Lineage Leukaemia (Mll1) Set
Domain With The Cofactor Product S-Adenosylhomocysteine.
gi|223365717|pdb|2W5Z|A Chain A, Ternary Complex Of The Mixed Lineage Leukaemia (Mll1) Set
Domain With The Cofactor Product S-Adenosylhomocysteine
And Histone Peptide
Length = 192
Score = 65.5 bits (158), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 70/151 (46%), Gaps = 21/151 (13%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+ C + GE V+E+ G V IRS+Q + + + I
Sbjct: 63 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKREKYYDSKGIGCYMF 108
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
+ D D VVDA N A I HSC PNC ++V +DG I I+ +R I+ GEE+T
Sbjct: 109 RID----DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELT 164
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+DY E C CG++ CR +LN
Sbjct: 165 YDYKFPIEDAS--NKLPCNCGAKKCR-KFLN 192
>gi|363739108|ref|XP_414538.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Gallus gallus]
Length = 2412
Score = 65.5 bits (158), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 67/132 (50%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1681 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1725
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + V+G ++G++ + I G E+TF+YN +
Sbjct: 1726 PKGNYARFMNHCCQPNCETQKWCVNGDTRVGLFAIVNIKAGTELTFNYNLECLGNGK--- 1782
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1783 TVCKCGAPNCSG 1794
>gi|121483959|gb|ABM54292.1| MLL [Pan paniscus]
Length = 162
Score = 65.5 bits (158), Expect = 3e-07, Method: Composition-based stats.
Identities = 51/153 (33%), Positives = 72/153 (47%), Gaps = 25/153 (16%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI--YLE 1949
G G+ C + GE V+E+ G V IRS+Q + + + I Y+
Sbjct: 33 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKREKYYDSKGIGCYMF 78
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
R D VVDA N A I HSC PNC ++V +DG I I+ +R I+ GEE
Sbjct: 79 RID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEE 132
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
+T+DY E + C CG++ CR +LN
Sbjct: 133 LTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 162
>gi|324523879|gb|ADY48320.1| Histone-lysine N-methyltransferase ASH1L, partial [Ascaris suum]
Length = 287
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 73/148 (49%), Gaps = 17/148 (11%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG GV K G++ + E++G V P ++FE+ + I + NN + + ++ + +
Sbjct: 9 KGFGVFAKKYIPAGQE--LTEYVGRVMPRDEYFEQLNFIGTF--NNLEMS--YFGMQIT- 61
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ VDA + N + + HSC PNC+ VDG Y++ + ++ I G+E+
Sbjct: 62 --------NEFYVDARNCGNMSRSVNHSCEPNCKVNAVTVDGVYRLKVSALKDIAAGDEL 113
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
T+DY TE C CG+ CRG
Sbjct: 114 TYDYG--TELWSGMVGMRCRCGTAGCRG 139
>gi|451994892|gb|EMD87361.1| hypothetical protein COCHEDRAFT_1144880 [Cochliobolus heterostrophus
C5]
Length = 923
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 68/149 (45%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ NK+ GE FV E++GEV +++ R + + +E+ FY ++
Sbjct: 217 KKGFGLRANKDMAPGE--FVFEYIGEV------IDERTFRRRMGQYDEEGIKHFY--FMS 266
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
KG+ VDA K N HSC PNC V ++GI+ R + GEE
Sbjct: 267 LTKGE-------FVDATKKGNLGRFCNHSCNPNCFVDKWVVGDKLRMGIFVERQVKAGEE 319
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ F+YN + + C CG C G
Sbjct: 320 LVFNYNV---DRYGADPQPCYCGEPNCSG 345
>gi|348535504|ref|XP_003455240.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Oreochromis niloticus]
Length = 2122
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 68/131 (51%), Gaps = 18/131 (13%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
F+ E++GEV E + IR Q+N+ FY + L++ + ++DA
Sbjct: 1671 FISEYVGEVI---DEEECRARIRHAQEND---ICNFYMLTLDKDR---------IIDAGP 1715
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
K N A + HSC+PNCE + V+G ++G++ ++ + GEE+TF+YN + +
Sbjct: 1716 KGNQARFMNHSCQPNCETQKWTVNGDTRVGLFALQDVPKGEELTFNYNLECRGNGK---T 1772
Query: 2028 VCLCGSQVCRG 2038
C CG+ C G
Sbjct: 1773 ACKCGAPNCSG 1783
>gi|149726051|ref|XP_001502479.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Equus caballus]
Length = 2700
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1969 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2013
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 2014 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2070
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 2071 TVCKCGAPNCSG 2082
>gi|451846131|gb|EMD59442.1| hypothetical protein COCSADRAFT_258710 [Cochliobolus sativus ND90Pr]
Length = 923
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 68/149 (45%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ NK+ GE FV E++GEV +++ R + + +E+ FY ++
Sbjct: 217 KKGFGLRANKDMAPGE--FVFEYIGEV------IDERTFRRRMGQYDEEGIKHFY--FMS 266
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
KG+ VDA K N HSC PNC V ++GI+ R + GEE
Sbjct: 267 LTKGE-------FVDATKKGNLGRFCNHSCNPNCFVDKWVVGDKLRMGIFVERQVKAGEE 319
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ F+YN + + C CG C G
Sbjct: 320 LVFNYNV---DRYGADPQPCYCGEPNCSG 345
>gi|27477095|ref|NP_758859.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Homo sapiens]
gi|16755530|gb|AAL27991.1|AF380302_1 androgen receptor-associated coregulator 267-a [Homo sapiens]
gi|119605437|gb|EAW85031.1| nuclear receptor binding SET domain protein 1, isoform CRA_a [Homo
sapiens]
Length = 2427
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1697 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1741
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 1742 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1798
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1799 TVCKCGAPNCSG 1810
>gi|170058057|ref|XP_001864756.1| Mll1 protein [Culex quinquefasciatus]
gi|167877297|gb|EDS40680.1| Mll1 protein [Culex quinquefasciatus]
Length = 114
Score = 65.5 bits (158), Expect = 4e-07, Method: Composition-based stats.
Identities = 37/81 (45%), Positives = 46/81 (56%), Gaps = 5/81 (6%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
VVDA + N A I HSC PNC +KV + GH I I+ +R I GEE+T+DY E
Sbjct: 39 VVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDV 98
Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
+ C CGS+ CR YLN
Sbjct: 99 K----IPCSCGSKKCR-KYLN 114
>gi|380815578|gb|AFE79663.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Macaca mulatta]
gi|383420747|gb|AFH33587.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Macaca mulatta]
Length = 2426
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1696 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1740
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 1741 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1797
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1798 TVCKCGAPNCSG 1809
>gi|187956219|gb|AAI50629.1| Nuclear receptor binding SET domain protein 1 [Homo sapiens]
Length = 2427
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1697 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1741
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 1742 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1798
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1799 TVCKCGAPNCSG 1810
>gi|395861196|ref|XP_003802879.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Otolemur garnettii]
Length = 2410
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1682 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1726
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 1727 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1783
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1784 TVCKCGAPNCSG 1795
>gi|413916020|gb|AFW55952.1| putative SET-domain containing protein family [Zea mays]
Length = 710
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 46/129 (35%), Positives = 60/129 (46%), Gaps = 19/129 (14%)
Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDA 1965
+DFV+E++G++ + IR Q YL R D VVDA
Sbjct: 600 EDFVIEYVGQLI-----HRRVSDIRESQYEKSGIGSS----YLFRLDDD------FVVDA 644
Query: 1966 MHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE 2025
+ A I HSC PNC KV VDG +I IY R I+ GEEIT++Y E K+
Sbjct: 645 TKRGGLARFINHSCEPNCYTKVITVDGQKKIFIYAKRRIYAGEEITYNYKFPLEEKK--- 701
Query: 2026 ASVCLCGSQ 2034
C CGS+
Sbjct: 702 -IPCHCGSR 709
>gi|355750457|gb|EHH54795.1| hypothetical protein EGM_15701 [Macaca fascicularis]
Length = 2695
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1965 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2009
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 2010 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2066
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 2067 TVCKCGAPNCSG 2078
>gi|380815580|gb|AFE79664.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Macaca mulatta]
gi|383420749|gb|AFH33588.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Macaca mulatta]
Length = 2695
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1965 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2009
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 2010 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2066
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 2067 TVCKCGAPNCSG 2078
>gi|405952170|gb|EKC20012.1| Histone-lysine N-methyltransferase SETD2 [Crassostrea gigas]
Length = 1451
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 45/155 (29%), Positives = 74/155 (47%), Gaps = 20/155 (12%)
Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
+ +V KG+G+ DFV+E++GEV +K F+ + ++ K ++
Sbjct: 487 EAFVTDWKGMGLRAT--AALQPGDFVMEYVGEVLD-YKQFKSR--VKQQAKMGQE---HH 538
Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
Y + L + V+DA +K N + + HSC PNCE + V+G ++G + +
Sbjct: 539 YFMALNSDE---------VIDASYKGNVSRYMNHSCDPNCETQKWTVNGVLRVGFFVKKA 589
Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ E+ FDY K EA C CGS+ CRG
Sbjct: 590 VEPLTELNFDYQFERYGK---EAQKCFCGSENCRG 621
>gi|441595720|ref|XP_004087266.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific [Nomascus
leucogenys]
Length = 2697
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1967 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2011
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 2012 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2068
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 2069 TVCKCGAPNCSG 2080
>gi|345493936|ref|XP_003427184.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2 [Nasonia
vitripennis]
Length = 1317
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 76/148 (51%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G+V + G+ F++E++GEV E + +R LQ+ E +Y + ++
Sbjct: 951 RGWGLVSLEPIKHGQ--FIIEYVGEVID-----EAEYKLR-LQQKKERKNENYYFLTIDN 1002
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K N + + HSC+PNCE + V+G +IG++ +R I GEE+
Sbjct: 1003 SR---------MIDAEPKGNLSRFMNHSCQPNCETQKWKVNGDTRIGLFALRDIEPGEEL 1053
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN + + CLC + C G
Sbjct: 1054 TFNYNLACDGETR---KPCLCKAPNCSG 1078
>gi|118918400|ref|NP_032765.3| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Mus musculus]
Length = 2691
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1967 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2011
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 2012 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2068
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 2069 TVCKCGAPNCSG 2080
>gi|145588193|ref|YP_001154790.1| nuclear protein SET [Polynucleobacter necessarius subsp. asymbioticus
QLW-P1DMWA-1]
gi|145046599|gb|ABP33226.1| nuclear protein SET [Polynucleobacter necessarius subsp. asymbioticus
QLW-P1DMWA-1]
Length = 164
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 47/141 (33%), Positives = 71/141 (50%), Gaps = 25/141 (17%)
Query: 1909 VVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHK 1968
++E+ GE WK EK+ + +DP FY LE D V+DA +
Sbjct: 40 IIEYKGERIS-WKLAEKRH-----PHDPKDPNHTFY-FSLE---------DGRVIDAKYG 83
Query: 1969 ANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES------KE 2022
N A I HSC+P+CE + + +G ++ IY R + GEE+ +DY+ E K+
Sbjct: 84 GNAARWINHSCKPSCETREDSFEGKPRVFIYAKRSLKVGEELFYDYSLDVEGRISKQMKK 143
Query: 2023 EYEASVCLCGSQVCRGSYLNL 2043
+YE C CG++ CRG+ L L
Sbjct: 144 DYE---CRCGAKKCRGTMLAL 161
>gi|403290056|ref|XP_003936149.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Saimiri boliviensis boliviensis]
Length = 2697
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1967 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2011
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 2012 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2068
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 2069 TVCKCGAPNCSG 2080
>gi|417407050|gb|JAA50158.1| Putative histone-lysine n-methyltransferase h3 lysine-36 and h4
lysine-20 specific [Desmodus rotundus]
Length = 2699
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1969 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2013
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 2014 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2070
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 2071 TVCKCGAPNCSG 2082
>gi|410949106|ref|XP_003981265.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Felis catus]
Length = 2432
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1701 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1745
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 1746 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1802
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1803 TVCKCGAPNCSG 1814
>gi|440898362|gb|ELR49876.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Bos grunniens mutus]
Length = 2698
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1969 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2013
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 2014 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2070
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 2071 TVCKCGAPNCSG 2082
>gi|426229361|ref|XP_004008759.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Ovis aries]
Length = 2698
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1969 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2013
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 2014 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2070
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 2071 TVCKCGAPNCSG 2082
>gi|410216828|gb|JAA05633.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410260118|gb|JAA18025.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2428
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1698 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1742
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 1743 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1799
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1800 TVCKCGAPNCSG 1811
>gi|296193510|ref|XP_002806650.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific [Callithrix
jacchus]
Length = 2692
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1966 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2010
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 2011 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2067
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 2068 TVCKCGAPNCSG 2079
>gi|19923586|ref|NP_071900.2| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Homo sapiens]
gi|32469769|sp|Q96L73.1|NSD1_HUMAN RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific; AltName: Full=Androgen receptor
coactivator 267 kDa protein; AltName: Full=Androgen
receptor-associated protein of 267 kDa; AltName:
Full=H3-K36-HMTase; AltName: Full=H4-K20-HMTase; AltName:
Full=Lysine N-methyltransferase 3B; AltName: Full=Nuclear
receptor-binding SET domain-containing protein 1;
Short=NR-binding SET domain-containing protein
gi|17530097|gb|AAL40694.1|AF395588_1 putative nuclear protein NSD1 [Homo sapiens]
gi|16751269|gb|AAL06645.1| androgen receptor associated coregulator 267-b [Homo sapiens]
gi|119605438|gb|EAW85032.1| nuclear receptor binding SET domain protein 1, isoform CRA_b [Homo
sapiens]
Length = 2696
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1966 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2010
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 2011 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2067
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 2068 TVCKCGAPNCSG 2079
>gi|410303854|gb|JAA30527.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410341931|gb|JAA39912.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2428
Score = 65.5 bits (158), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1698 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1742
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 1743 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1799
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1800 TVCKCGAPNCSG 1811
>gi|410216830|gb|JAA05634.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410260120|gb|JAA18026.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2697
Score = 65.1 bits (157), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1967 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2011
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 2012 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2068
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 2069 TVCKCGAPNCSG 2080
>gi|402873563|ref|XP_003900641.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Papio anubis]
Length = 2343
Score = 65.1 bits (157), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1610 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1654
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 1655 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1711
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1712 TVCKCGAPNCSG 1723
>gi|355691890|gb|EHH27075.1| hypothetical protein EGK_17188 [Macaca mulatta]
Length = 2695
Score = 65.1 bits (157), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1965 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2009
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 2010 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2066
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 2067 TVCKCGAPNCSG 2078
>gi|351708443|gb|EHB11362.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Heterocephalus glaber]
Length = 2698
Score = 65.1 bits (157), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1966 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2010
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 2011 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2067
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 2068 TVCKCGAPNCSG 2079
>gi|119605439|gb|EAW85033.1| nuclear receptor binding SET domain protein 1, isoform CRA_c [Homo
sapiens]
Length = 2593
Score = 65.1 bits (157), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1863 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1907
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 1908 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1964
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1965 TVCKCGAPNCSG 1976
>gi|114603589|ref|XP_527132.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 8 [Pan troglodytes]
gi|397470588|ref|XP_003806901.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Pan paniscus]
gi|410303856|gb|JAA30528.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410341933|gb|JAA39913.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2697
Score = 65.1 bits (157), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1967 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2011
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 2012 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2068
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 2069 TVCKCGAPNCSG 2080
>gi|68565655|sp|O88491.1|NSD1_MOUSE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific; AltName: Full=H3-K36-HMTase; AltName:
Full=H4-K20-HMTase; AltName: Full=Nuclear
receptor-binding SET domain-containing protein 1;
Short=NR-binding SET domain-containing protein
gi|3329465|gb|AAC40182.1| NSD1 protein [Mus musculus]
Length = 2588
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1864 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1908
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 1909 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1965
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1966 TVCKCGAPNCSG 1977
>gi|301615056|ref|XP_002936997.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Xenopus (Silurana) tropicalis]
Length = 2440
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 57/225 (25%), Positives = 101/225 (44%), Gaps = 24/225 (10%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
+ +G G+ C + GE FV E++GE+ ++++ ++ E FY
Sbjct: 1766 FRTLSRGWGLRCRTDIKKGE--FVNEYVGEM------IDEEECRARIRYAQEQDITNFYM 1817
Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
+ L++ + V+DA K N+A + H C+PNCE + V+G ++G++ + I
Sbjct: 1818 LTLDKDR---------VIDAGPKGNFARFMNHCCQPNCETQKWTVNGDTRVGLFALCDIK 1868
Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQL 2065
E+TF+YN + +VC CG+ C G +L + + + V E G + +
Sbjct: 1869 AXVELTFNYNLECLGNGK---TVCKCGAPNCSG-FLGVRPKN--QPVSSEDKGKKRKQYV 1922
Query: 2066 MLEACELNSVSEEDYLELGRAG-LGSCLLGGLPNWVVAYSARLVR 2109
+ E+ E++ G G L SC G P A +L R
Sbjct: 1923 KRKKSEVVKEHEDECFSCGDGGQLVSCKKPGCPKVYHAECLKLTR 1967
>gi|70571511|dbj|BAE06763.1| zinc finger protein [Ciona intestinalis]
Length = 709
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 71/144 (49%), Gaps = 18/144 (12%)
Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
E F++E++GEV E++ R+++ N + + Y + LE V+D
Sbjct: 1 EGQFLLEYVGEVVS-----EREFRRRTIE--NYNAHNDHYCVQLEAG---------TVID 44
Query: 1965 AMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY 2024
AN + HSC+PNCE + V+G Y++G++ R I EE+T+DYN + +
Sbjct: 45 GYRLANEGRFVNHSCQPNCEMQKWVVNGEYRVGLFAKRPIVGSEELTYDYNFHAYNLDRQ 104
Query: 2025 EASVCLCGSQVCRGSYLNLTGEGA 2048
+ C CGS CRG T GA
Sbjct: 105 QP--CRCGSSECRGVIGGKTQRGA 126
>gi|410904194|ref|XP_003965577.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like, partial
[Takifugu rubripes]
Length = 109
Score = 65.1 bits (157), Expect = 5e-07, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 46/78 (58%), Gaps = 4/78 (5%)
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
++DA N+A I HSC PNC AKV V+ +I IY+ + I+ EEIT+DY E
Sbjct: 34 IIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSRQPINVNEEITYDYKFPIEDV 93
Query: 2022 EEYEASVCLCGSQVCRGS 2039
+ CLCG++ CRG+
Sbjct: 94 K----IPCLCGAENCRGT 107
>gi|301785552|ref|XP_002928188.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Ailuropoda melanoleuca]
gi|281342107|gb|EFB17691.1| hypothetical protein PANDA_018107 [Ailuropoda melanoleuca]
Length = 2699
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1970 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2014
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 2015 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2071
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 2072 TVCKCGAPNCSG 2083
>gi|148709230|gb|EDL41176.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_b [Mus
musculus]
Length = 2382
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1658 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1702
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 1703 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1759
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1760 TVCKCGAPNCSG 1771
>gi|73953273|ref|XP_865778.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 5 [Canis lupus familiaris]
Length = 2698
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1967 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2011
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 2012 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2068
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 2069 TVCKCGAPNCSG 2080
>gi|350580826|ref|XP_003123715.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Sus scrofa]
Length = 2392
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1661 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1705
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 1706 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1762
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1763 TVCKCGAPNCSG 1774
>gi|196013861|ref|XP_002116791.1| hypothetical protein TRIADDRAFT_31338 [Trichoplax adhaerens]
gi|190580769|gb|EDV20850.1| hypothetical protein TRIADDRAFT_31338 [Trichoplax adhaerens]
Length = 725
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 55/100 (55%), Gaps = 8/100 (8%)
Query: 1943 FYNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIG 1997
+ I + + KG+ + Y L V++DA K N A + HSC+PNCE V+G IG
Sbjct: 483 YQRIKMAQSKGEKNFYMLNIDKDVIIDAGQKGNLARFMNHSCQPNCETHKWTVNGLTCIG 542
Query: 1998 IYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
++ + I GEE+TFDY ++ E C CGS++CR
Sbjct: 543 LFAIDDIKQGEELTFDYRLHAVGNDQAE---CHCGSKLCR 579
>gi|444706655|gb|ELW47981.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Tupaia chinensis]
Length = 2687
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1958 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2002
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 2003 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2059
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 2060 TVCKCGAPNCSG 2071
>gi|148709229|gb|EDL41175.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_a [Mus
musculus]
Length = 2588
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1864 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1908
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 1909 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1965
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1966 TVCKCGAPNCSG 1977
>gi|354471955|ref|XP_003498206.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Cricetulus griseus]
Length = 2690
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1969 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2013
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 2014 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2070
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 2071 TVCKCGAPNCSG 2082
>gi|326671180|ref|XP_694414.5| PREDICTED: histone-lysine N-methyltransferase NSD3 [Danio rerio]
Length = 1562
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 67/132 (50%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
DFV+E++GE+ + ++ + ++ NE+ FY + L + + V+DA
Sbjct: 1282 DFVMEYVGEL------IDSEECKQRIRTANENHVTNFYMLTLTKDR---------VIDAG 1326
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K N + + HSC PNCE + V+G +IG++T+ I E+TF+YN
Sbjct: 1327 PKGNLSRFMNHSCSPNCETQKWTVNGDVRIGLFTLCDISADTELTFNYNLDCLGNGR--- 1383
Query: 2027 SVCLCGSQVCRG 2038
+ C CGS+ C G
Sbjct: 1384 TSCHCGSENCSG 1395
>gi|297676794|ref|XP_002816309.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 1 [Pongo abelii]
Length = 2697
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1967 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2011
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 2012 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2068
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 2069 TVCKCGAPNCSG 2080
>gi|119895257|ref|XP_592234.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Bos taurus]
Length = 2389
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1660 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1704
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 1705 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1761
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1762 TVCKCGAPNCSG 1773
>gi|395736540|ref|XP_003776772.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 2 [Pongo abelii]
Length = 2594
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1864 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1908
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 1909 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1965
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1966 TVCKCGAPNCSG 1977
>gi|291387890|ref|XP_002710469.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 2
[Oryctolagus cuniculus]
Length = 2431
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1700 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1744
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 1745 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1801
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1802 TVCKCGAPNCSG 1813
>gi|52545752|emb|CAH56331.1| hypothetical protein [Homo sapiens]
Length = 881
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 151 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 195
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 196 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 252
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 253 TVCKCGAPNCSG 264
>gi|291001085|ref|XP_002683109.1| predicted protein [Naegleria gruberi]
gi|284096738|gb|EFC50365.1| predicted protein [Naegleria gruberi]
Length = 147
Score = 65.1 bits (157), Expect = 5e-07, Method: Composition-based stats.
Identities = 34/78 (43%), Positives = 46/78 (58%), Gaps = 6/78 (7%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYN-SVTE 2019
+++DA + N A I HSC PNC A++ VD ++ IY +R I GEEIT+DY + E
Sbjct: 71 LIIDATKRGNLARFINHSCDPNCCARIIEVDKQKKVCIYALRKILVGEEITYDYKFPIEE 130
Query: 2020 SKEEYEASVCLCGSQVCR 2037
SK C CGSQ C+
Sbjct: 131 SKIP-----CKCGSQKCK 143
>gi|149039889|gb|EDL94005.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
CRA_b [Rattus norvegicus]
Length = 2586
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1861 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1905
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 1906 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1962
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1963 TVCKCGAPNCSG 1974
>gi|290985403|ref|XP_002675415.1| predicted protein [Naegleria gruberi]
gi|284089011|gb|EFC42671.1| predicted protein [Naegleria gruberi]
Length = 438
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 73/148 (49%), Gaps = 18/148 (12%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG+GV CN++ + F+ E++GEV V D + K + + Y + +
Sbjct: 266 KGIGVKCNQDV-IKKGTFITEYVGEVISV-------DKFETRTKRSYKKSLHHYCMNMNE 317
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA N A I HSC PN + V+G ++GI+ ++ I GEEI
Sbjct: 318 NE---------IIDATWMGNIARFINHSCAPNARTQTWDVNGQNRVGIFAIKDIVKGEEI 368
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
T++YN + + +E + C CG+ C+G
Sbjct: 369 TYNYNFLIYN-DETKQQECKCGAPNCQG 395
>gi|344265319|ref|XP_003404732.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Loxodonta africana]
Length = 2702
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1970 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2014
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 2015 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2071
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 2072 TVCKCGAPNCSG 2083
>gi|194907101|ref|XP_001981487.1| GG12082 [Drosophila erecta]
gi|190656125|gb|EDV53357.1| GG12082 [Drosophila erecta]
Length = 1441
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 52/186 (27%), Positives = 87/186 (46%), Gaps = 33/186 (17%)
Query: 1880 SRPDDKYVAYRKG--LGVVCNKEGGFG--------EDDFVVEFLGEVYPVWKWFEKQDGI 1929
SR +++ RK L VV E GFG E DFV+E++GEV E Q +
Sbjct: 1225 SRCENQMFEQRKSPRLEVVYMNERGFGLVNREPIAEGDFVIEYVGEVI---NHAEFQRRM 1281
Query: 1930 RSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTA 1989
Q+ ++ +Y + +E+ ++DA K N A + HSC PNCE +
Sbjct: 1282 EQKQRGRDE---NYYFLGVEKD---------FIIDAGPKGNLARFMNHSCEPNCETQKWT 1329
Query: 1990 VDGHYQIGIYTVRGIHYGEEITFDY---NSVTESKEEYEASVCLCGSQVCRGSYLNLTGE 2046
V+ +++G++ ++ I E+TF+Y + + SK+ C CG+ C G +
Sbjct: 1330 VNCIHRVGLFAIKDIPANTELTFNYLWDDLMNNSKK-----ACFCGATRCSGEIGGKLKD 1384
Query: 2047 GAFEKV 2052
GA ++
Sbjct: 1385 GAVKET 1390
>gi|168025972|ref|XP_001765507.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683357|gb|EDQ69768.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 993
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 45/138 (32%), Positives = 67/138 (48%), Gaps = 26/138 (18%)
Query: 1905 EDDFVVEFLGEVY---PVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLV 1961
+D+FV+E+ GEV K + G RS+ N Y+ D
Sbjct: 837 KDEFVIEYTGEVIDDAMCEKRLWEMKGRRSI-----------CNFYMCEIAKD------F 879
Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
++DA K N + + HSC+PNC + VDG ++G++ R I GEE+T+DY V
Sbjct: 880 IIDATRKGNASRYLNHSCQPNCRLEKWRVDGETRVGVFAGRNIIAGEELTYDYKYV---- 935
Query: 2022 EEYEASV-CLCGSQVCRG 2038
E+ +V C CG+ CRG
Sbjct: 936 -EFGPNVKCRCGAPNCRG 952
>gi|157822347|ref|NP_001100807.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Rattus norvegicus]
gi|149039888|gb|EDL94004.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
CRA_a [Rattus norvegicus]
Length = 2381
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1656 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1700
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 1701 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1757
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1758 TVCKCGAPNCSG 1769
>gi|357163489|ref|XP_003579748.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like
[Brachypodium distachyon]
Length = 517
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 47/150 (31%), Positives = 72/150 (48%), Gaps = 24/150 (16%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G++ + G+ FV+E+ GEV WK + R Q + E Y IYL
Sbjct: 93 RGWGLLAEENIMAGQ--FVIEYCGEVIS-WK-----EAKRRSQAYEDQGLMEAYIIYLNT 144
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ +DA K + A I HSC+PNCE + V G ++GI+ + I G E+
Sbjct: 145 AES---------IDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFAKQDIPIGMEL 195
Query: 2011 TFDYNSVTESKEEYEASV--CLCGSQVCRG 2038
++DYN E + ++ CLCG+ C G
Sbjct: 196 SYDYNF-----EWFGGAIVRCLCGAASCSG 220
>gi|348574862|ref|XP_003473209.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Cavia
porcellus]
Length = 2509
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1779 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1823
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 1824 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1880
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1881 TVCKCGAPNCSG 1892
>gi|456062341|ref|YP_007501311.1| Nuclear protein SET [beta proteobacterium CB]
gi|455439638|gb|AGG32576.1| Nuclear protein SET [beta proteobacterium CB]
Length = 164
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 47/141 (33%), Positives = 71/141 (50%), Gaps = 25/141 (17%)
Query: 1909 VVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHK 1968
++E+ GE WK EK+ + +DP FY LE D +DA +
Sbjct: 40 IIEYKGERIS-WKLAEKRH-----PHDPKDPNHTFY-FSLE---------DGRCIDAKYG 83
Query: 1969 ANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES------KE 2022
N A I HSC+P+CE + + DG ++ IY R + GEE+ +DY+ E K+
Sbjct: 84 GNAARWINHSCKPSCETREDSFDGEPRVFIYAKRNLKLGEELFYDYSLDVEGRITKQMKK 143
Query: 2023 EYEASVCLCGSQVCRGSYLNL 2043
+YE C CG++ CRG+ L+L
Sbjct: 144 DYE---CRCGAKKCRGTMLSL 161
>gi|291387888|ref|XP_002710468.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 1
[Oryctolagus cuniculus]
Length = 2700
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1969 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2013
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 2014 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2070
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 2071 TVCKCGAPNCSG 2082
>gi|344240382|gb|EGV96485.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Cricetulus griseus]
Length = 2318
Score = 65.1 bits (157), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 1597 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1641
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 1642 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1698
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1699 TVCKCGAPNCSG 1710
>gi|453087448|gb|EMF15489.1| hypothetical protein SEPMUDRAFT_161660 [Mycosphaerella populorum
SO2202]
Length = 966
Score = 65.1 bits (157), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 50/172 (29%), Positives = 76/172 (44%), Gaps = 23/172 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+KG G+ N E +DF+ E++GEV K F + L + +E+ FY ++
Sbjct: 215 KKGYGLRANTE--LQANDFIFEYIGEVIGE-KTFRNR-----LHQYDEEGIKHFY--FMS 264
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
KG+ VDA K N HSC PNC V ++GI+ R IH GEE
Sbjct: 265 LSKGE-------FVDATKKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFAERKIHAGEE 317
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLD 2061
+ F+YN + + C C C G + G+ E+ K H +++
Sbjct: 318 LVFNYNV---DRYGADPQPCYCDEPNCTGF---IGGKTQTERATKLSHTIIE 363
>gi|94732456|emb|CAK03662.1| novel protein similar to vertebrate Wolf-Hirschhorn syndrome
candidate 1 (WHSC1) [Danio rerio]
Length = 728
Score = 65.1 bits (157), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 78/148 (52%), Gaps = 20/148 (13%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G++ ++ GE FV E++GE+ ++++ ++ E+ FY + +++
Sbjct: 431 KGWGLISLRDIKKGE--FVNEYVGEL------IDEEECRSRIRHAQENDITHFYMLTIDK 482
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ ++DA K NY+ + HSC+PNCE + V+G ++G++ V I G E+
Sbjct: 483 DR---------IIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPAGTEL 533
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+YN E+ +VC CG+ C G
Sbjct: 534 TFNYNLDCLGNEK---TVCRCGAPNCSG 558
>gi|118101388|ref|XP_424390.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2 [Gallus
gallus]
Length = 1386
Score = 65.1 bits (157), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 85/169 (50%), Gaps = 22/169 (13%)
Query: 1882 PDDKYVAY-RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1094 PDAEIIKTDRRGWGLRTKRNIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1145
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1146 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1196
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF 2049
+ I G E+TF+YN E C CG++ C G +L + + AF
Sbjct: 1197 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGAENCSG-FLGVRPKTAF 1241
>gi|118101386|ref|XP_001232891.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Gallus
gallus]
Length = 1436
Score = 65.1 bits (157), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 85/169 (50%), Gaps = 22/169 (13%)
Query: 1882 PDDKYVAY-RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1144 PDAEIIKTDRRGWGLRTKRNIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1195
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1196 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1246
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF 2049
+ I G E+TF+YN E C CG++ C G +L + + AF
Sbjct: 1247 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGAENCSG-FLGVRPKTAF 1291
>gi|113470951|gb|ABI34877.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Danio rerio]
Length = 129
Score = 64.7 bits (156), Expect = 6e-07, Method: Composition-based stats.
Identities = 40/132 (30%), Positives = 67/132 (50%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
DFV+E++GE+ + ++ + ++ NE+ FY + L + + V+DA
Sbjct: 11 DFVMEYVGEL------IDSEECKQRIRTANENHVTNFYMLTLTKDR---------VIDAG 55
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K N + + HSC PNCE + V+G +IG++T+ I E+TF+YN
Sbjct: 56 PKGNLSRFMNHSCSPNCETQKWTVNGDVRIGLFTLCDISADTELTFNYNLDCLGNGR--- 112
Query: 2027 SVCLCGSQVCRG 2038
+ C CGS+ C G
Sbjct: 113 TSCHCGSENCSG 124
>gi|21392158|gb|AAM48433.1| RE61305p [Drosophila melanogaster]
Length = 1016
Score = 64.7 bits (156), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 74/151 (49%), Gaps = 25/151 (16%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G+V + G DFV+E++GEV + R +++ D +Y + +E+
Sbjct: 833 RGFGLVNREPIAVG--DFVIEYVGEV------INHAEFQRRMEQKQRDRDENYYFLGVEK 884
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
++DA K N A + HSC PNCE + V+ +++GI+ ++ I E+
Sbjct: 885 D---------FIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIKDIPVNSEL 935
Query: 2011 TFDY---NSVTESKEEYEASVCLCGSQVCRG 2038
TF+Y + + SK+ C CG++ C G
Sbjct: 936 TFNYLWDDLMNNSKK-----ACFCGAKRCSG 961
>gi|195037347|ref|XP_001990122.1| GH19166 [Drosophila grimshawi]
gi|193894318|gb|EDV93184.1| GH19166 [Drosophila grimshawi]
Length = 1434
Score = 64.7 bits (156), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 69/148 (46%), Gaps = 19/148 (12%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G+VC + E DF++E++GEV + R + + D FY + +E+
Sbjct: 1244 RGFGLVCRE--AIAEGDFIIEYVGEV------INHAEFQRRVAQKTNDRDENFYFLGVEK 1295
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
++DA K N A + HSC PNC + V+ ++G++ ++ I E+
Sbjct: 1296 D---------FIIDAGPKGNLARFMNHSCEPNCATQKWTVNCINRVGLFAIKDIPENTEL 1346
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+Y + + C CG++ C G
Sbjct: 1347 TFNY--LWDDLMNNGKKACFCGAKRCSG 1372
>gi|224080887|ref|XP_002197925.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Taeniopygia
guttata]
Length = 1435
Score = 64.7 bits (156), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 85/169 (50%), Gaps = 22/169 (13%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1143 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1194
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1195 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1245
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF 2049
+ I G E+TF+YN E C CG++ C G +L + + AF
Sbjct: 1246 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGAENCSG-FLGVRPKTAF 1290
>gi|326932813|ref|XP_003212507.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like isoform 1
[Meleagris gallopavo]
Length = 1436
Score = 64.7 bits (156), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 85/169 (50%), Gaps = 22/169 (13%)
Query: 1882 PDDKYVAY-RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1144 PDAEIIKTDRRGWGLRTKRNIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1195
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1196 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1246
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF 2049
+ I G E+TF+YN E C CG++ C G +L + + AF
Sbjct: 1247 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGAENCSG-FLGVRPKTAF 1291
>gi|224095256|ref|XP_002310367.1| SET domain protein [Populus trichocarpa]
gi|222853270|gb|EEE90817.1| SET domain protein [Populus trichocarpa]
Length = 281
Score = 64.7 bits (156), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 69/146 (47%), Gaps = 21/146 (14%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+V +++ GE FV+E++GEV + + L K FY + R
Sbjct: 38 GSGIVADEDIKQGE--FVIEYVGEV------IDDKTCEERLWKMKHCGETNFYLCEINRD 89
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
+V+DA +K N + I HSC PN E + +DG +IGI+ R I GE +T
Sbjct: 90 ---------MVIDATYKGNKSRYINHSCSPNTEMQKWIIDGETRIGIFATRDIRKGEHLT 140
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCR 2037
+DY V ++ C CGS CR
Sbjct: 141 YDYQFVQFGADQ----DCHCGSSGCR 162
>gi|326932815|ref|XP_003212508.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like isoform 2
[Meleagris gallopavo]
Length = 1386
Score = 64.7 bits (156), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 85/169 (50%), Gaps = 22/169 (13%)
Query: 1882 PDDKYVAY-RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1094 PDAEIIKTDRRGWGLRTKRNIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1145
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1146 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1196
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF 2049
+ I G E+TF+YN E C CG++ C G +L + + AF
Sbjct: 1197 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGAENCSG-FLGVRPKTAF 1241
>gi|256084142|ref|XP_002578291.1| SET domain protein [Schistosoma mansoni]
Length = 1746
Score = 64.7 bits (156), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 67/132 (50%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++G++ ++ + R L+ +E+ +Y + L+ + ++DA
Sbjct: 1051 EFVNEYIGDL------IDEDEANRRLRFAHENNITNYYMMKLDSQR---------IIDAG 1095
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K N + + HSC PN + V+G +IG++ VR I GEE+TF+YN V +E
Sbjct: 1096 PKGNLSRFMNHSCDPNLNTQKWTVNGDNRIGLFAVRDISVGEELTFNYNFVALGQERLN- 1154
Query: 2027 SVCLCGSQVCRG 2038
C CG+ C G
Sbjct: 1155 --CRCGASNCVG 1164
>gi|449270866|gb|EMC81514.1| Histone-lysine N-methyltransferase NSD3 [Columba livia]
Length = 1440
Score = 64.7 bits (156), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 85/169 (50%), Gaps = 22/169 (13%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1148 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1199
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1200 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1250
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF 2049
+ I G E+TF+YN E C CG++ C G +L + + AF
Sbjct: 1251 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGAENCSG-FLGVRPKSAF 1295
>gi|124513208|ref|XP_001349960.1| SET domain protein, putative [Plasmodium falciparum 3D7]
gi|23615377|emb|CAD52368.1| SET domain protein, putative [Plasmodium falciparum 3D7]
Length = 2548
Score = 64.3 bits (155), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 79/159 (49%), Gaps = 17/159 (10%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
+ G GV C ++ GE + E++GEV + FEK+ + Q+ E + YN Y+
Sbjct: 2128 KTGYGVFCKRDIKNGE--LICEYVGEVLG-KREFEKR--LEVYQE--ESKKTDMYNWYII 2180
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+ D V +D+ K + + I HSC PN ++ V G Y+IGI+ +R I GEE
Sbjct: 2181 QINKD------VYIDSGKKGSISRFINHSCSPNSVSQKWIVRGFYRIGIFALRDIPSGEE 2234
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGA 2048
IT++Y S +E CLC S C +L GE +
Sbjct: 2235 ITYNY-SYNFLFNNFE---CLCKSPNCMNYHLLKKGESS 2269
>gi|94312468|ref|YP_585678.1| putative histone-lysine N-methyltransferase [Cupriavidus
metallidurans CH34]
gi|430804722|ref|ZP_19431837.1| putative histone-lysine N-methyltransferase [Cupriavidus sp. HMR-1]
gi|93356320|gb|ABF10409.1| putative histone-lysine N-methyltransferase [Cupriavidus
metallidurans CH34]
gi|429503042|gb|ELA01344.1| putative histone-lysine N-methyltransferase [Cupriavidus sp. HMR-1]
Length = 170
Score = 64.3 bits (155), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 71/147 (48%), Gaps = 29/147 (19%)
Query: 1901 GGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL 1960
G E + V+E+ GE + WK +L+++ DP + Y GD
Sbjct: 41 GQIAEGERVIEYKGE-HISWK--------EALKRHPHDPNDPNHTFYFSLDDGD------ 85
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA N A I H+C PNCEA+ + ++ I+ +R I GEE+ +DY V ++
Sbjct: 86 -VIDAKFGGNRARWINHACDPNCEAR----EKKGRVFIHALRDIEPGEELFYDYGLVIDA 140
Query: 2021 ------KEEYEASVCLCGSQVCRGSYL 2041
K+EYE C CGS CRG+ L
Sbjct: 141 RYTKKLKQEYE---CRCGSPKCRGTML 164
>gi|51849607|dbj|BAD42330.1| hypothetical protein [Nannochloris bacillaris]
Length = 334
Score = 64.3 bits (155), Expect = 8e-07, Method: Composition-based stats.
Identities = 47/150 (31%), Positives = 70/150 (46%), Gaps = 24/150 (16%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+ ++ G+ F+VE++GEV E+++ R EFY +R
Sbjct: 145 KGFGLFAAEDVKAGQ--FIVEYVGEV------LEEEEYARR---------KEFYIATGQR 187
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
+ + V+DA + I HSC PNCE + V G IG++ + + G +
Sbjct: 188 HYYFMNVGNGEVIDAARRGGLGRFINHSCEPNCETQKWVVRGELAIGLFALEDVPAGSVL 247
Query: 2011 TFDYNSVTESKEEY--EASVCLCGSQVCRG 2038
TFDYN E Y + CLCGS+ CRG
Sbjct: 248 TFDYNF-----ERYGDKPMKCLCGSKACRG 272
>gi|302825340|ref|XP_002994293.1| hypothetical protein SELMODRAFT_432224 [Selaginella moellendorffii]
gi|300137824|gb|EFJ04637.1| hypothetical protein SELMODRAFT_432224 [Selaginella moellendorffii]
Length = 820
Score = 64.3 bits (155), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 43/132 (32%), Positives = 66/132 (50%), Gaps = 19/132 (14%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
DF++E++GEV E+ ++ +NN FY + G+D V+DA
Sbjct: 200 DFLIEYIGEVIDDKTCEERLWDLKERGENN------FYLCEV--------GHD-KVIDAT 244
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K N + I HSC PN + + DG +IG++ V I G+EIT+DY + E+
Sbjct: 245 FKGNMSRFINHSCNPNAQLRKWQCDGELRIGVFAVSRILKGQEITYDYKYIQFGTEQQ-- 302
Query: 2027 SVCLCGSQVCRG 2038
C CGS+ C+G
Sbjct: 303 --CHCGSKNCKG 312
>gi|10438794|dbj|BAB15346.1| unnamed protein product [Homo sapiens]
Length = 1069
Score = 64.3 bits (155), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 339 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 383
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 384 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 440
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 441 TVCKCGAPNCSG 452
>gi|195395005|ref|XP_002056127.1| GJ10771 [Drosophila virilis]
gi|194142836|gb|EDW59239.1| GJ10771 [Drosophila virilis]
Length = 1430
Score = 64.3 bits (155), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 69/148 (46%), Gaps = 19/148 (12%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G+VC + E DF++E++GEV + R + + D FY + +E+
Sbjct: 1236 RGFGLVCREP--IAEGDFIIEYVGEV------INHAEFQRRMAQKTRDRDENFYFLGVEK 1287
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
++DA K N A + HSC PNC + V+ ++G++ ++ I E+
Sbjct: 1288 D---------FIIDAGPKGNLARFMNHSCEPNCATQKWTVNCINRVGLFAIKDIPENTEL 1338
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+Y + + C CG++ C G
Sbjct: 1339 TFNY--LWDDLMNNGKKACFCGAKRCSG 1364
>gi|195108992|ref|XP_001999076.1| GI23270 [Drosophila mojavensis]
gi|193915670|gb|EDW14537.1| GI23270 [Drosophila mojavensis]
Length = 1433
Score = 64.3 bits (155), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 72/148 (48%), Gaps = 19/148 (12%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G+VC + E DF++E++GEV E Q + KN ++ FY + +E+
Sbjct: 1221 RGFGLVCREP--IKEGDFIIEYVGEVI---NHAEFQRRMAQKTKNRDE---NFYFLGVEK 1272
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
++DA K N A + HSC PNC + V+ + ++G++ ++ I E+
Sbjct: 1273 D---------FIIDAGPKGNLARFMNHSCEPNCATQKWTVNCNNRVGLFAIKDIPENTEL 1323
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
TF+Y + + C CG++ C G
Sbjct: 1324 TFNY--LWDDLMNNGKKACFCGAKRCSG 1349
>gi|353232109|emb|CCD79464.1| putative set domain protein [Schistosoma mansoni]
Length = 1503
Score = 64.3 bits (155), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 67/132 (50%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++G++ ++ + R L+ +E+ +Y + L+ + ++DA
Sbjct: 1051 EFVNEYIGDL------IDEDEANRRLRFAHENNITNYYMMKLDSQR---------IIDAG 1095
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K N + + HSC PN + V+G +IG++ VR I GEE+TF+YN V +E
Sbjct: 1096 PKGNLSRFMNHSCDPNLNTQKWTVNGDNRIGLFAVRDISVGEELTFNYNFVALGQERLN- 1154
Query: 2027 SVCLCGSQVCRG 2038
C CG+ C G
Sbjct: 1155 --CRCGASNCVG 1164
>gi|357116306|ref|XP_003559923.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like
[Brachypodium distachyon]
Length = 349
Score = 64.3 bits (155), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 46/153 (30%), Positives = 74/153 (48%), Gaps = 35/153 (22%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYP-------VWKWFEKQDGIRSLQKNNEDPAPEFY 1944
G G+V ++ G + +F++E++GEV +WK ++Q +
Sbjct: 126 GFGLVADE--GIQQGEFIIEYVGEVIDDRTCEERLWK-MKRQ---------------RYT 167
Query: 1945 NIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGI 2004
N YL + +V+DA +K N + I HSC+PN E + VDG ++GI+ + I
Sbjct: 168 NFYLCEVSSN------MVIDATNKGNKSRFINHSCQPNTEMQKWTVDGETRVGIFALHDI 221
Query: 2005 HYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
GEE+T+DY V ++ C CGS CR
Sbjct: 222 KKGEELTYDYKFVQFGADQ----DCHCGSSNCR 250
>gi|339327714|ref|YP_004687407.1| methyltransferase [Cupriavidus necator N-1]
gi|338167871|gb|AEI78926.1| methyltransferase [Cupriavidus necator N-1]
Length = 290
Score = 64.3 bits (155), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 72/147 (48%), Gaps = 29/147 (19%)
Query: 1901 GGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL 1960
G E + V+E+ GE + WK +L+++ DP+ + Y G
Sbjct: 157 GPIAEGERVIEYKGE-HISWK--------TALERHPHDPSDPNHTFYFSLDDGS------ 201
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + N A I H+C PNCEA+ + ++ I+ +R I GEE+ +DY V ++
Sbjct: 202 -VIDAKYGGNRARWINHACEPNCEAR----EKKGRVFIHALRDIAQGEELFYDYGLVIDA 256
Query: 2021 ------KEEYEASVCLCGSQVCRGSYL 2041
K+E+E C CGS CRG+ L
Sbjct: 257 RYTAKLKKEFE---CRCGSPQCRGTML 280
>gi|113869618|ref|YP_728107.1| methyltransferase [Ralstonia eutropha H16]
gi|113528394|emb|CAJ94739.1| putative methyltransferase [Ralstonia eutropha H16]
Length = 171
Score = 64.3 bits (155), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 73/147 (49%), Gaps = 29/147 (19%)
Query: 1901 GGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL 1960
G E + V+E+ GE + WK ++L+++ DP+ + Y G
Sbjct: 38 GQIAEGERVIEYKGE-HISWK--------KALERHPHDPSDPNHTFYFSLDDGS------ 82
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
V+DA + N A I H+C PNCEA+ + ++ I+ +R I GEE+ +DY V ++
Sbjct: 83 -VIDAKYGGNRARWINHACEPNCEAR----EKKGRVFIHALRDIAEGEELFYDYGLVIDA 137
Query: 2021 ------KEEYEASVCLCGSQVCRGSYL 2041
K+E+E C CGS CRG+ L
Sbjct: 138 RYTAKLKKEFE---CRCGSPQCRGTML 161
>gi|256074584|ref|XP_002573604.1| huntingtin interacting protein-related [Schistosoma mansoni]
Length = 1575
Score = 63.9 bits (154), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 86/203 (42%), Gaps = 33/203 (16%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
Y KG G++ G FV+E++GEV ++ + L
Sbjct: 458 YAGKDKGWGLMATDNVKKGS--FVIEYVGEVIDFSEFRRRIRRYERL------------- 502
Query: 1946 IYLERPKGDADGYDLVV-----VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
G A Y + V +DA K N+A + HSC PNC + +V+G +IG +
Sbjct: 503 -------GHAHHYFMAVESDRFIDAGSKGNWARFVNHSCEPNCVTQKWSVNGEIRIGFFA 555
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLL 2060
I G+E+T DY V E + C CG+ C G + T + EKV + ++
Sbjct: 556 KEDIPSGQEVTIDYQFVQYGVSEQK---CYCGASTCSG-IMGATSKYLQEKVRMKDTTMV 611
Query: 2061 DRHQLMLEACELNSVSEEDYLEL 2083
+R +L+ +L+S D + L
Sbjct: 612 ERR--ILQLLQLDSFRNADDITL 632
>gi|449510894|ref|XP_004186257.1| PREDICTED: LOW QUALITY PROTEIN: ash1 (absent, small, or
homeotic)-like (Drosophila), partial [Taeniopygia
guttata]
Length = 519
Score = 63.9 bits (154), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/99 (36%), Positives = 57/99 (57%), Gaps = 7/99 (7%)
Query: 1945 NIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIY 1999
N +E+ +D Y L +V+D+ N A I HSC PNCE + +V+G Y+IG+Y
Sbjct: 1 NRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCNPNCEMQKWSVNGVYRIGLY 60
Query: 2000 TVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
++ + G E+T+DYN + + E+ + +C CG CRG
Sbjct: 61 ALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFDKCRG 97
>gi|449474840|ref|XP_002193971.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Taeniopygia guttata]
Length = 1651
Score = 63.9 bits (154), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 67/132 (50%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 695 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 739
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + V+G ++G++ + I G E+TF+YN +
Sbjct: 740 PKGNYARFMNHCCQPNCETQKWCVNGDTRVGLFALVNIKAGTELTFNYNLECLGNGK--- 796
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 797 TVCKCGAPNCSG 808
>gi|444511191|gb|ELV09829.1| Histone-lysine N-methyltransferase NSD3 [Tupaia chinensis]
Length = 1235
Score = 63.9 bits (154), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/175 (26%), Positives = 86/175 (49%), Gaps = 22/175 (12%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +Q+ +E+
Sbjct: 943 PDAEVIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIQRAHENSV 994
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 995 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1045
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKE 2055
+ I G E+TF+YN E C CG++ C G +L + + A +E
Sbjct: 1046 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGAENCSG-FLGVRPKSACASTAEE 1096
>gi|149634094|ref|XP_001506476.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Ornithorhynchus anatinus]
Length = 1437
Score = 63.9 bits (154), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 84/169 (49%), Gaps = 22/169 (13%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1145 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1196
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1197 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1247
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF 2049
+ I G E+TF+YN E C CG+ C G +L + + AF
Sbjct: 1248 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGADNCSG-FLGVRPKTAF 1292
>gi|359067302|ref|XP_002689078.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Bos taurus]
Length = 1470
Score = 63.9 bits (154), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 741 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 785
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 786 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 842
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 843 TVCKCGAPNCSG 854
>gi|156391978|ref|XP_001635826.1| predicted protein [Nematostella vectensis]
gi|156222924|gb|EDO43763.1| predicted protein [Nematostella vectensis]
Length = 348
Score = 63.9 bits (154), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/134 (32%), Positives = 63/134 (47%), Gaps = 18/134 (13%)
Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
++ FV+E+ GEV +D Q+ + +Y + L AD ++D
Sbjct: 99 QNQFVIEYCGEV------MNYRDFQSRAQRYDRQKRRHYYFMTLR-----ADE----IID 143
Query: 1965 AMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY 2024
A K + + I HSC PNC + V+G +IG +T+R I GEE+TFDY K
Sbjct: 144 ATLKGSISRFINHSCEPNCVTQKWTVNGLLRIGFFTLRTIKAGEELTFDYQLQRYGK--- 200
Query: 2025 EASVCLCGSQVCRG 2038
A C C S CRG
Sbjct: 201 IAQTCYCESPSCRG 214
>gi|313227685|emb|CBY22833.1| unnamed protein product [Oikopleura dioica]
Length = 1179
Score = 63.9 bits (154), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/131 (32%), Positives = 67/131 (51%), Gaps = 17/131 (12%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
F++E++GE+ + IR L+++ + +Y + L+ +L ++DA
Sbjct: 1016 FIIEYIGEIIS-----HDESRIR-LEESAKIGVTNYYILELD---------NLRMIDAGP 1060
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
+ N A I HSC PNC V G +IGI++ R I GEE+TF+Y S E +
Sbjct: 1061 RGNIARFINHSCDPNCGIDPWIVQGDTRIGIFSKRDIQEGEELTFNYQLQQSSDE--GKT 1118
Query: 2028 VCLCGSQVCRG 2038
CLCGS+ C G
Sbjct: 1119 KCLCGSKNCAG 1129
>gi|392580378|gb|EIW73505.1| hypothetical protein TREMEDRAFT_24920 [Tremella mesenterica DSM 1558]
Length = 180
Score = 63.9 bits (154), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 47/82 (57%), Gaps = 2/82 (2%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
+V DA K + + I HSC P+ AK+ +++G +I IY R +H GEEI +DY ES
Sbjct: 101 LVCDATFKGSVSRLINHSCDPSASAKIISINGQSKIVIYAKRTLHPGEEILYDYKFPLES 160
Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
CLCG+ CRG +LN
Sbjct: 161 DPALRVP-CLCGAATCRG-WLN 180
>gi|296485540|tpg|DAA27655.1| TPA: nuclear receptor binding SET domain protein 1 [Bos taurus]
Length = 1275
Score = 63.9 bits (154), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 544 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 588
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 589 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 645
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 646 TVCKCGAPNCSG 657
>gi|225380776|gb|ACN88689.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
Length = 148
Score = 63.5 bits (153), Expect = 1e-06, Method: Composition-based stats.
Identities = 37/84 (44%), Positives = 46/84 (54%), Gaps = 3/84 (3%)
Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
D VVDA N A I HSC PNC ++V VDG I I+ R I+ GEE+T+DY
Sbjct: 68 DYEVVDATIHGNSARFINHSCEPNCYSRVINVDGRKHIVIFATRKIYKGEELTYDYKFPI 127
Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
E E C CG++ CR +LN
Sbjct: 128 E--EPGNKLPCNCGAKKCR-KFLN 148
>gi|344238567|gb|EGV94670.1| Histone-lysine N-methyltransferase NSD3 [Cricetulus griseus]
Length = 620
Score = 63.5 bits (153), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/149 (28%), Positives = 75/149 (50%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
RKG G+ + GE FV E++GE+ ++++ +++ +E+ FY + +
Sbjct: 338 RKGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSVTNFYMLTVT 389
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+ + ++DA K NY+ + HSC PNCE + V+G ++G++ + I G E
Sbjct: 390 KDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFAICDIPAGME 440
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+TF+YN +VC CGS C G
Sbjct: 441 LTFNYNLDCLGNGR---TVCHCGSDNCSG 466
>gi|309780384|ref|ZP_07675135.1| SET domain protein [Ralstonia sp. 5_7_47FAA]
gi|404394987|ref|ZP_10986790.1| hypothetical protein HMPREF0989_02076 [Ralstonia sp. 5_2_56FAA]
gi|308921087|gb|EFP66733.1| SET domain protein [Ralstonia sp. 5_7_47FAA]
gi|348615101|gb|EGY64632.1| hypothetical protein HMPREF0989_02076 [Ralstonia sp. 5_2_56FAA]
Length = 179
Score = 63.5 bits (153), Expect = 1e-06, Method: Composition-based stats.
Identities = 43/136 (31%), Positives = 69/136 (50%), Gaps = 23/136 (16%)
Query: 1909 VVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHK 1968
++E+ GE + WK +L+++ DP+ + Y G V+DA +
Sbjct: 55 IIEYKGE-HITWK--------EALRRHPHDPSDPNHTFYFSLEDGS-------VIDAKYG 98
Query: 1969 ANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE---YE 2025
N A I H+C+PNCEA+ DG ++ I+ +R I GEE+ +DY V E ++ E
Sbjct: 99 GNRARWINHACKPNCEAR--EADG--RVFIHALRDIEAGEELFYDYGLVIEGRQTKALKE 154
Query: 2026 ASVCLCGSQVCRGSYL 2041
C CG++ CRG+ L
Sbjct: 155 QFACRCGAKKCRGTML 170
>gi|355729169|gb|AES09787.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Mustela putorius furo]
Length = 596
Score = 63.5 bits (153), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 76/149 (51%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
R+G G+ + GE FV E++GE+ ++++ +++ +E+ FY + +
Sbjct: 314 RRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSVTNFYMLTVT 365
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+ + ++DA K NY+ + HSC PNCE + V+G ++G++ +R I G E
Sbjct: 366 KDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFALRDIPAGME 416
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+TF+YN E C CG++ C G
Sbjct: 417 LTFNYNLDCLGNGRTE---CHCGAENCSG 442
>gi|302846429|ref|XP_002954751.1| histone H3 methyltransferase [Volvox carteri f. nagariensis]
gi|300259934|gb|EFJ44157.1| histone H3 methyltransferase [Volvox carteri f. nagariensis]
Length = 261
Score = 63.5 bits (153), Expect = 1e-06, Method: Composition-based stats.
Identities = 48/149 (32%), Positives = 73/149 (48%), Gaps = 24/149 (16%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG G+ ++ G+ F++E++GEV ++ +++ S+ + + F NI
Sbjct: 90 KGFGLFALEDIKAGQ--FIIEYIGEVLEEDEYQRRKEYYMSVGQRHY----YFMNI---- 139
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
G+ + V+DA K N + I HSC PNCE + V G IG++ VR I E+
Sbjct: 140 --GNGE-----VIDACRKGNISRFINHSCEPNCETQKWLVRGELAIGLFAVRDIPKDTEL 192
Query: 2011 TFDYNSVTESKEEY--EASVCLCGSQVCR 2037
TFDYN E Y + C CGS CR
Sbjct: 193 TFDYNF-----ERYGDKPMRCYCGSTNCR 216
>gi|145353759|ref|XP_001421172.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|145357147|ref|XP_001422783.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581408|gb|ABO99465.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144583027|gb|ABP01142.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 503
Score = 63.5 bits (153), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/139 (30%), Positives = 62/139 (44%), Gaps = 27/139 (19%)
Query: 1908 FVVEFLGEVYPVWK-----WFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVV 1962
F+VE+ GE+ + W++KQ G N YL + V
Sbjct: 306 FIVEYAGEILDEHECAERLWYDKQSGEE--------------NFYLMEISAN------YV 345
Query: 1963 VDAMHKANYASRICHSCRPNCEAK--VTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
+DA K + A I SC PNCE + V A ++GI+ I G E+T+DYN
Sbjct: 346 IDAKFKGSIARFINSSCHPNCETQRWVDASTNETRVGIFATEDIASGTELTYDYNFAHFG 405
Query: 2021 KEEYEASVCLCGSQVCRGS 2039
E+ + VC+CG CRG+
Sbjct: 406 DEKGTSFVCMCGHPKCRGT 424
>gi|390569511|ref|ZP_10249796.1| nuclear protein SET [Burkholderia terrae BS001]
gi|389938371|gb|EIN00215.1| nuclear protein SET [Burkholderia terrae BS001]
Length = 209
Score = 63.5 bits (153), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 61/117 (52%), Gaps = 20/117 (17%)
Query: 1931 SLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAV 1990
+L+++ +P + Y GD V+D K N A I HSC PNCEA+ +
Sbjct: 92 ALRRHPHNPDEPNHTFYFALDSGD-------VIDGKVKGNSARWINHSCAPNCEAE--EI 142
Query: 1991 DGHYQIGIYTVRGIHYGEEITFDYNSVTES------KEEYEASVCLCGSQVCRGSYL 2041
DGH + I +R I GEE+ +DY V ++ K+EYE C CG++ CRG+ L
Sbjct: 143 DGH--VFIDALRDIGAGEELFYDYGLVIDARQTKKLKKEYE---CRCGARKCRGTML 194
>gi|126303359|ref|XP_001372863.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Monodelphis
domestica]
Length = 1435
Score = 63.5 bits (153), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 84/169 (49%), Gaps = 22/169 (13%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1143 PDAEVIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSI 1194
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1195 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1245
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF 2049
+ I G E+TF+YN E C CG+ C G +L + + AF
Sbjct: 1246 LCDIPAGVELTFNYNLDCLGNGRTE---CHCGADNCSG-FLGVRPKTAF 1290
>gi|324505555|gb|ADY42386.1| Histone-lysine N-methyltransferase Mes-4 [Ascaris suum]
Length = 743
Score = 63.5 bits (153), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/160 (30%), Positives = 75/160 (46%), Gaps = 21/160 (13%)
Query: 1883 DDKYVAYR----KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNED 1938
DD+++ R KG GV K G++ + E++G V P ++FE+ + I + NN
Sbjct: 453 DDEWMEERRTTNKGFGVFAKKYIPAGQE--LTEYVGRVMPRDEYFEQLNFIGTF--NN-- 506
Query: 1939 PAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
LE + VDA + N + + HSC PNC+ VDG Y++ +
Sbjct: 507 ---------LEMSYFGMQITNEFYVDARNCGNMSRSVNHSCEPNCKVNAVTVDGVYRLKV 557
Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
++ I G+E+T+DY TE C CG+ CRG
Sbjct: 558 SALKDIAAGDELTYDYG--TELWSGMVGMRCRCGTAGCRG 595
>gi|16549858|dbj|BAB70868.1| unnamed protein product [Homo sapiens]
Length = 1059
Score = 63.5 bits (153), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 929 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 973
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 974 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1030
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 1031 TVCKCGAPNCSG 1042
>gi|115478464|ref|NP_001062827.1| Os09g0307800 [Oryza sativa Japonica Group]
gi|51091678|dbj|BAD36461.1| putative SET domain protein 110 [Oryza sativa Japonica Group]
gi|51091893|dbj|BAD36704.1| putative SET domain protein 110 [Oryza sativa Japonica Group]
gi|113631060|dbj|BAF24741.1| Os09g0307800 [Oryza sativa Japonica Group]
Length = 340
Score = 63.5 bits (153), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 71/153 (46%), Gaps = 35/153 (22%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYP-------VWKWFEKQDGIRSLQKNNEDPAPEFY 1944
G GVV ++ GE FV+E++GEV +WK + D
Sbjct: 119 GNGVVAEEDIKKGE--FVIEYVGEVIDDRTCEQRLWKMKRQGDT---------------- 160
Query: 1945 NIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGI 2004
N YL + +V+DA +K N + I HSC PN E + V+G ++GI+ +R I
Sbjct: 161 NFYLCEVSSN------MVIDATNKGNMSRFINHSCEPNTEMQKWTVEGETRVGIFALRDI 214
Query: 2005 HYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
GEE+T+DY V ++ C CGS CR
Sbjct: 215 KTGEELTYDYKFVQFGADQD----CHCGSSNCR 243
>gi|432099958|gb|ELK28852.1| Histone-lysine N-methyltransferase NSD3 [Myotis davidii]
Length = 1641
Score = 63.2 bits (152), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 79/158 (50%), Gaps = 21/158 (13%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1349 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1400
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1401 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1451
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ I G E+TF+YN E C CG++ C G
Sbjct: 1452 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGAENCSG 1486
>gi|327284319|ref|XP_003226886.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Anolis
carolinensis]
Length = 1438
Score = 63.2 bits (152), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 74/149 (49%), Gaps = 20/149 (13%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
R+G G+ + GE FV E++GE+ ++++ +++ +E+ FY + +
Sbjct: 1155 RRGWGLRTKRNIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSVTNFYMLTVT 1206
Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
+ + ++DA K NY+ + HSC PNCE + V+G ++G++ V I G E
Sbjct: 1207 KDR---------IIDAGPKGNYSRFMNHSCHPNCETQKWTVNGDVRVGLFAVCDIPAGME 1257
Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+TF+YN E C CG+ C G
Sbjct: 1258 LTFNYNLDCLGNGRTE---CHCGADNCSG 1283
>gi|395507428|ref|XP_003758026.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Sarcophilus harrisii]
Length = 1437
Score = 63.2 bits (152), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 84/169 (49%), Gaps = 22/169 (13%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1145 PDAEVIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSI 1196
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1197 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1247
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF 2049
+ I G E+TF+YN E C CG+ C G +L + + AF
Sbjct: 1248 LCDIPAGVELTFNYNLDCLGNGRTE---CHCGADNCSG-FLGVRPKTAF 1292
>gi|194291212|ref|YP_002007119.1| hypothetical protein RALTA_A3139 [Cupriavidus taiwanensis LMG 19424]
gi|193225047|emb|CAQ71058.1| conserved hypothetical protein [Cupriavidus taiwanensis LMG 19424]
Length = 171
Score = 63.2 bits (152), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 72/145 (49%), Gaps = 29/145 (20%)
Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVV 1962
E + V+E+ GE + WK ++L+++ DP+ + Y G V
Sbjct: 40 IAEGERVIEYKGE-HISWK--------KALERHPHDPSDPNHTFYFSLDDGS-------V 83
Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES-- 2020
+DA + N A I H+C PNCEA+ + ++ I+ +R I GEE+ +DY V ++
Sbjct: 84 IDAKYGGNRARWINHACEPNCEAR----EKKGRVFIHALRDIAQGEELFYDYGLVIDARY 139
Query: 2021 ----KEEYEASVCLCGSQVCRGSYL 2041
K+E+E C CGS CRG+ L
Sbjct: 140 TAKLKKEFE---CRCGSPQCRGTML 161
>gi|162460550|ref|NP_001105653.1| LOC542662 [Zea mays]
gi|24021802|gb|AAN41254.1| SET domain protein 110 [Zea mays]
gi|195652527|gb|ACG45731.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20specific [Zea mays]
Length = 342
Score = 63.2 bits (152), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 69/151 (45%), Gaps = 31/151 (20%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+V E GE FV+E++GEV +D E ++ +
Sbjct: 130 GHGLVAEDEIKKGE--FVIEYVGEVI-------------------DDRTCE-NRLWTMKR 167
Query: 1952 KGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
D D Y +V+DA +K N + I HSC PN + VDG ++GI+ +R I
Sbjct: 168 LDDTDFYLCEVSSNMVIDATNKGNLSRFINHSCEPNTAMQKWTVDGETRVGIFALRDIKI 227
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
GEE+T+DY V A VC CGS CR
Sbjct: 228 GEELTYDYKFVQFGA----AQVCHCGSSKCR 254
>gi|186477803|ref|YP_001859273.1| nuclear protein SET [Burkholderia phymatum STM815]
gi|184194262|gb|ACC72227.1| nuclear protein SET [Burkholderia phymatum STM815]
Length = 160
Score = 63.2 bits (152), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/141 (34%), Positives = 70/141 (49%), Gaps = 29/141 (20%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
D ++E+ GE WK +L+++ +P + Y GD V+D
Sbjct: 28 DRLIEYKGERIS-WK--------EALRRHPHNPDEPNHTFYFALDSGD-------VIDGK 71
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES------ 2020
K N A I HSC PNCEA+ +DGH + I +R I GEE+ +DY V ++
Sbjct: 72 VKGNSARWINHSCAPNCEAE--EIDGH--VYIDALRDIEAGEELFYDYGLVIDARQTKKL 127
Query: 2021 KEEYEASVCLCGSQVCRGSYL 2041
K+EYE C CG++ CRG+ L
Sbjct: 128 KKEYE---CRCGARKCRGTML 145
>gi|395507430|ref|XP_003758027.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
[Sarcophilus harrisii]
Length = 1389
Score = 63.2 bits (152), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 84/169 (49%), Gaps = 22/169 (13%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1097 PDAEVIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSI 1148
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1149 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1199
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF 2049
+ I G E+TF+YN E C CG+ C G +L + + AF
Sbjct: 1200 LCDIPAGVELTFNYNLDCLGNGRTE---CHCGADNCSG-FLGVRPKTAF 1244
>gi|291225527|ref|XP_002732754.1| PREDICTED: Ash1l protein-like [Saccoglossus kowalevskii]
Length = 2643
Score = 63.2 bits (152), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 52/103 (50%), Gaps = 7/103 (6%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
+V+D + HSC PNCE + +V+G Y+IG++ ++ I G E+T+DYN +
Sbjct: 1940 MVIDGYRMGCEGRFVNHSCEPNCEMQKWSVNGVYRIGLFALKDIQPGSELTYDYNFHAFN 1999
Query: 2021 KEEYEASVCLCGSQVCRG-----SYLNLTGEGAFEKVLKELHG 2058
E + C CGS CRG S GAF+K K G
Sbjct: 2000 LETQQE--CCCGSDKCRGFIGGKSQAQQRVNGAFKKDKKTASG 2040
>gi|359489946|ref|XP_002268035.2| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Vitis
vinifera]
Length = 377
Score = 63.2 bits (152), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 53/172 (30%), Positives = 79/172 (45%), Gaps = 38/172 (22%)
Query: 1876 KAMDSRPDDKYVAY---RKGLGVVCNKEGGFGEDDFVVEFLGEVYP-------VWKWFEK 1925
K SRP K + G G+V +++ GE FV+E++GEV +WK
Sbjct: 114 KPFQSRPVKKMKMVETEKCGSGIVADEDIKQGE--FVIEYVGEVIDDKTCEDRLWK---- 167
Query: 1926 QDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEA 1985
++ L + N FY + R +V+DA +K N + I HSC PN E
Sbjct: 168 ---MKHLGETN------FYLCEINRD---------MVIDATYKGNKSRYINHSCDPNTEM 209
Query: 1986 KVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
+ +DG +IGI+ R I GE +T+DY V ++ C CG+ CR
Sbjct: 210 QKWRIDGETRIGIFATRDIKRGEHLTYDYQFVQFGADQD----CHCGAVGCR 257
>gi|118572948|sp|Q6P2L6.2|NSD3_MOUSE RecName: Full=Histone-lysine N-methyltransferase NSD3; AltName:
Full=Nuclear SET domain-containing protein 3; AltName:
Full=Wolf-Hirschhorn syndrome candidate 1-like protein 1
homolog; Short=WHSC1-like protein 1
Length = 1439
Score = 63.2 bits (152), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 86/175 (49%), Gaps = 22/175 (12%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1148 PDAEVIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1199
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1200 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1250
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKE 2055
+ I G E+TF+YN +VC CG+ C G +L + + A + E
Sbjct: 1251 LCDIPAGMELTFNYNLDCLGNGR---TVCHCGADNCSG-FLGVRPKSACTSAVDE 1301
>gi|170694289|ref|ZP_02885443.1| nuclear protein SET [Burkholderia graminis C4D1M]
gi|170140712|gb|EDT08886.1| nuclear protein SET [Burkholderia graminis C4D1M]
Length = 185
Score = 63.2 bits (152), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/142 (35%), Positives = 71/142 (50%), Gaps = 29/142 (20%)
Query: 1909 VVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHK 1968
++E+ GE WK + +R N ++P FY L+ K V+D
Sbjct: 30 LIEYKGERIS-WK-----EALRRHPHNPDEPNHTFY-FALDSGK---------VIDGKVS 73
Query: 1969 ANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES------KE 2022
N A I HSC PNCEA+ +DGH + ++ +R I GEEI +DY V ++ K+
Sbjct: 74 GNSARWINHSCAPNCEAE--EIDGH--VYVHALRDIAEGEEIFYDYGLVIDARQTKKLKK 129
Query: 2023 EYEASVCLCGSQVCRGSYLNLT 2044
EYE C CGS+ CRG+ L T
Sbjct: 130 EYE---CRCGSRKCRGTMLAPT 148
>gi|170574239|ref|XP_001892724.1| SET domain containing protein [Brugia malayi]
gi|158601534|gb|EDP38427.1| SET domain containing protein [Brugia malayi]
Length = 222
Score = 63.2 bits (152), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/75 (44%), Positives = 42/75 (56%), Gaps = 2/75 (2%)
Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
VDA + N A HSC PN + VDG Y++ I T++ I GEE+TFDY+ TE E
Sbjct: 66 VDARNYGNIARSFNHSCEPNTKVDAVVVDGIYRLKISTIKDIKKGEELTFDYD--TEIIE 123
Query: 2023 EYEASVCLCGSQVCR 2037
C CGS+ CR
Sbjct: 124 GLVGMECFCGSRNCR 138
>gi|124486903|ref|NP_001074738.1| histone-lysine N-methyltransferase NSD3 isoform 2 [Mus musculus]
gi|189442807|gb|AAI67226.1| Wolf-Hirschhorn syndrome candidate 1-like 1 (human) [synthetic
construct]
Length = 1446
Score = 63.2 bits (152), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 86/175 (49%), Gaps = 22/175 (12%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1155 PDAEVIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1206
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1207 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1257
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKE 2055
+ I G E+TF+YN +VC CG+ C G +L + + A + E
Sbjct: 1258 LCDIPAGMELTFNYNLDCLGNGR---TVCHCGADNCSG-FLGVRPKSACTSAVDE 1308
>gi|398807623|ref|ZP_10566499.1| SET domain-containing protein [Variovorax sp. CF313]
gi|398089158|gb|EJL79686.1| SET domain-containing protein [Variovorax sp. CF313]
Length = 206
Score = 63.2 bits (152), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/144 (33%), Positives = 68/144 (47%), Gaps = 27/144 (18%)
Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVV 1962
E + ++E+ GEV WK +L+++ DPA + Y G V
Sbjct: 31 LAEGETLIEYKGEVIS-WK--------EALRRHPHDPAQPNHTFYFHIDDGR-------V 74
Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
+D K N A I HSC PNCEA VDG ++ I +R I GEE+ +DY + + E
Sbjct: 75 IDGNVKGNDARWINHSCEPNCEAD--EVDG--RVYIKALRNISAGEELNYDYGLIID--E 128
Query: 2023 EYEASV-----CLCGSQVCRGSYL 2041
Y + C CGS+ CRG+ L
Sbjct: 129 PYTPKLLSEFPCWCGSEQCRGTLL 152
>gi|148700883|gb|EDL32830.1| mCG14519 [Mus musculus]
Length = 1381
Score = 63.2 bits (152), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 86/175 (49%), Gaps = 22/175 (12%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1090 PDAEVIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1141
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1142 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1192
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKE 2055
+ I G E+TF+YN +VC CG+ C G +L + + A + E
Sbjct: 1193 LCDIPAGMELTFNYNLDCLGNGR---TVCHCGADNCSG-FLGVRPKSACTSAVDE 1243
>gi|350593412|ref|XP_003483678.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
NSD3-like [Sus scrofa]
Length = 1438
Score = 63.2 bits (152), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 86/175 (49%), Gaps = 22/175 (12%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1146 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1197
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1198 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKE 2055
+ I G E+TF+YN E C CG++ C G +L + + A +E
Sbjct: 1249 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGAENCSG-FLGVRPKSACASTTEE 1299
>gi|297737225|emb|CBI26426.3| unnamed protein product [Vitis vinifera]
Length = 438
Score = 63.2 bits (152), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 53/172 (30%), Positives = 79/172 (45%), Gaps = 38/172 (22%)
Query: 1876 KAMDSRPDDKYVAY---RKGLGVVCNKEGGFGEDDFVVEFLGEVYP-------VWKWFEK 1925
K SRP K + G G+V +++ GE FV+E++GEV +WK
Sbjct: 191 KPFQSRPVKKMKMVETEKCGSGIVADEDIKQGE--FVIEYVGEVIDDKTCEDRLWK---- 244
Query: 1926 QDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEA 1985
++ L + N FY + R +V+DA +K N + I HSC PN E
Sbjct: 245 ---MKHLGETN------FYLCEINRD---------MVIDATYKGNKSRYINHSCDPNTEM 286
Query: 1986 KVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
+ +DG +IGI+ R I GE +T+DY V ++ C CG+ CR
Sbjct: 287 QKWRIDGETRIGIFATRDIKRGEHLTYDYQFVQFGADQD----CHCGAVGCR 334
>gi|296082099|emb|CBI21104.3| unnamed protein product [Vitis vinifera]
Length = 1111
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 44/135 (32%), Positives = 67/135 (49%), Gaps = 14/135 (10%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
VVE++GE+ + +++ +S +K A F+ I E ++DA
Sbjct: 991 MVVEYVGEIVGLRVADKRESDYQSGRKLQYKTACYFFRIDKEH-----------IIDATR 1039
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
K A + HSC PNC AKV +V ++ + R I+ GEEIT+DY+ E +E +
Sbjct: 1040 KGGIARFVNHSCLPNCVAKVISVRNEKKVVFFAERDINPGEEITYDYHFNHE--DEGKKI 1097
Query: 2028 VCLCGSQVCRGSYLN 2042
C C S+ CR YLN
Sbjct: 1098 PCFCNSRNCR-RYLN 1111
>gi|313226807|emb|CBY21952.1| unnamed protein product [Oikopleura dioica]
Length = 216
Score = 62.8 bits (151), Expect = 2e-06, Method: Composition-based stats.
Identities = 45/133 (33%), Positives = 66/133 (49%), Gaps = 23/133 (17%)
Query: 1908 FVVEFLGEVYPVWKWFEKQ--DGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDA 1965
F++E+LGEV K F+K+ + RS ++++ Y + L R +DA
Sbjct: 75 FIIEYLGEVVSA-KEFKKRSHEYARSGKQHH-------YFMELSRQ---------ATIDA 117
Query: 1966 MHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE 2025
HK + I HSC PN E + V+G +IG + +R I EEITFDY + +
Sbjct: 118 YHKGAISRFINHSCEPNSETQKWTVNGLLRIGFFAIRDIQPEEEITFDYQFIHFG----Q 173
Query: 2026 ASVCLCGSQVCRG 2038
CLCG+ CRG
Sbjct: 174 GQKCLCGAPSCRG 186
>gi|218201888|gb|EEC84315.1| hypothetical protein OsI_30811 [Oryza sativa Indica Group]
Length = 360
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 71/153 (46%), Gaps = 35/153 (22%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYP-------VWKWFEKQDGIRSLQKNNEDPAPEFY 1944
G GVV ++ GE FV+E++GEV +WK + D
Sbjct: 119 GNGVVAEEDIKKGE--FVIEYVGEVIDDRTCEQRLWKMKRQGDT---------------- 160
Query: 1945 NIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGI 2004
N YL + +V+DA +K N + I HSC PN E + V+G ++GI+ +R I
Sbjct: 161 NFYLCEVSSN------MVIDATNKGNMSRFINHSCEPNTEMQKWTVEGETRVGIFALRDI 214
Query: 2005 HYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
GEE+T+DY V ++ C CGS CR
Sbjct: 215 KTGEELTYDYKFVQFGADQD----CHCGSSNCR 243
>gi|47225089|emb|CAF97504.1| unnamed protein product [Tetraodon nigroviridis]
Length = 352
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 45/84 (53%), Gaps = 3/84 (3%)
Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
D VVDA N A I HSC PNC ++V VDG I I+ R I+ GEE+T+DY
Sbjct: 272 DYEVVDATVHGNAARFINHSCEPNCYSRVITVDGKKHIVIFASRRIYQGEELTYDYKFPI 331
Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
E E C C S+ CR +LN
Sbjct: 332 E--EASSKLPCNCNSKKCR-KFLN 352
>gi|291409090|ref|XP_002720827.1| PREDICTED: WHSC1L1 protein [Oryctolagus cuniculus]
Length = 1435
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 86/175 (49%), Gaps = 22/175 (12%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1143 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1194
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1195 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1245
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKE 2055
+ I G E+TF+YN E C CG++ C G +L + + A +E
Sbjct: 1246 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGAENCSG-FLGVRPKSACASTTEE 1296
>gi|298706866|emb|CBJ25830.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 810
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 65/132 (49%), Gaps = 16/132 (12%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
+ E++GEV + R L+ N+ EFY + L + + +DA
Sbjct: 563 LIGEYVGEVIDEAMVEHRMAEQRRLRPNDG----EFYIMELGQS---------LFIDAKE 609
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
K N I HSC PNC+ + + G+ ++GIY + + GE +++DY T K ++
Sbjct: 610 KGNLMRLINHSCNPNCDVQAWNIAGYTRLGIYAKKDLAKGESLSYDYKFSTNEKARFK-- 667
Query: 2028 VCLCGSQVCRGS 2039
C+CG++ CRG+
Sbjct: 668 -CMCGAENCRGT 678
>gi|414884958|tpg|DAA60972.1| TPA: putative histone-lysine N-methyltransferase family protein
isoform 1 [Zea mays]
gi|414884959|tpg|DAA60973.1| TPA: putative histone-lysine N-methyltransferase family protein
isoform 2 [Zea mays]
Length = 337
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 44/77 (57%), Gaps = 4/77 (5%)
Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
+V+DA +K N + I HSC PN + VDG ++GI+ +R I GEE+T+DY V
Sbjct: 182 MVIDATNKGNLSRFINHSCEPNTAMQKWTVDGETRVGIFALRDIKIGEELTYDYKFVQFG 241
Query: 2021 KEEYEASVCLCGSQVCR 2037
A VC CGS CR
Sbjct: 242 A----AQVCHCGSSKCR 254
>gi|222641285|gb|EEE69417.1| hypothetical protein OsJ_28789 [Oryza sativa Japonica Group]
Length = 360
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 71/153 (46%), Gaps = 35/153 (22%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYP-------VWKWFEKQDGIRSLQKNNEDPAPEFY 1944
G GVV ++ GE FV+E++GEV +WK + D
Sbjct: 119 GNGVVAEEDIKKGE--FVIEYVGEVIDDRTCEQRLWKMKRQGDT---------------- 160
Query: 1945 NIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGI 2004
N YL + +V+DA +K N + I HSC PN E + V+G ++GI+ +R I
Sbjct: 161 NFYLCEVSSN------MVIDATNKGNMSRFINHSCEPNTEMQKWTVEGETRVGIFALRDI 214
Query: 2005 HYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
GEE+T+DY V ++ C CGS CR
Sbjct: 215 KTGEELTYDYKFVQFGADQD----CHCGSSNCR 243
>gi|91785767|ref|YP_560973.1| hypothetical protein Bxe_A0006 [Burkholderia xenovorans LB400]
gi|91689721|gb|ABE32921.1| conserved hypothetical protein [Burkholderia xenovorans LB400]
Length = 174
Score = 62.8 bits (151), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 49/155 (31%), Positives = 76/155 (49%), Gaps = 23/155 (14%)
Query: 1895 VVCNKEGGFGEDDFVVEFL--GEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPK 1952
+ + G G+ F VE + GE +K E+ +L+++ +PA + Y
Sbjct: 6 IAVRRSGVHGKGVFAVEPIAAGERLIEYKG-ERISWKEALRRHPHNPAEPNHTFYFALDS 64
Query: 1953 GDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITF 2012
G V+D N A I HSC PNCEA+ +DGH + ++ +R I GEE+ +
Sbjct: 65 GK-------VIDGKVNGNSARWINHSCAPNCEAE--EIDGH--VYVHALRDIAEGEEVFY 113
Query: 2013 DYNSVTES------KEEYEASVCLCGSQVCRGSYL 2041
DY V ++ K+EYE C CG++ CRG+ L
Sbjct: 114 DYGLVIDARQTNKLKKEYE---CRCGARKCRGTML 145
>gi|357498513|ref|XP_003619545.1| Histone-lysine N-methyltransferase ASHH3 [Medicago truncatula]
gi|355494560|gb|AES75763.1| Histone-lysine N-methyltransferase ASHH3 [Medicago truncatula]
Length = 348
Score = 62.8 bits (151), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 50/165 (30%), Positives = 76/165 (46%), Gaps = 24/165 (14%)
Query: 1876 KAMDSRPDDKYVAYRK---GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSL 1932
KA RP K + G G+V +++ GE FV+E++GEV + + + L
Sbjct: 103 KAFQHRPVKKMKLVKTEKCGSGIVADEDIKLGE--FVIEYVGEV------IDDKTCEQRL 154
Query: 1933 QKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDG 1992
+ FY + R +V+DA +K N + I HSC PN E + +DG
Sbjct: 155 WNMKDRGETNFYLCEINRD---------MVIDATNKGNKSRYINHSCCPNTEMQKWIIDG 205
Query: 1993 HYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
+IGI+ R I GE +T+DY V ++ C CG+ CR
Sbjct: 206 ETRIGIFASRDIKKGEHLTYDYQFVQFGADQD----CHCGAVQCR 246
>gi|71029610|ref|XP_764448.1| hypothetical protein [Theileria parva strain Muguga]
gi|68351402|gb|EAN32165.1| hypothetical protein TP04_0811 [Theileria parva]
Length = 995
Score = 62.8 bits (151), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 72/148 (48%), Gaps = 14/148 (9%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
KG+G V +E GE + V E++GEV F++ S + ++ +Y + + R
Sbjct: 716 KGVGAVATEE--IGEGELVCEYVGEVISQAD-FQRCLASASFAEIDDGNQSHWYVMKIHR 772
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
D Y +D+ H N A I HSC PNC + V G Y++G++ +R I EE+
Sbjct: 773 -----DTY----IDSTHLGNVARFINHSCDPNCASVPINVKGTYRMGVFALRKIKQDEEV 823
Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
T++Y SK C C ++ CRG
Sbjct: 824 TYNYGFT--SKGVGGGFRCRCRAKNCRG 849
>gi|157821603|ref|NP_001099560.1| histone-lysine N-methyltransferase NSD3 [Rattus norvegicus]
gi|149057818|gb|EDM09061.1| Wolf-Hirschhorn syndrome candidate 1-like 1 (predicted) [Rattus
norvegicus]
Length = 1396
Score = 62.8 bits (151), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 79/158 (50%), Gaps = 21/158 (13%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1105 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1156
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1157 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1207
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ I G E+TF+YN +VC CG+ C G
Sbjct: 1208 LCDIPAGMELTFNYNLDCLGNGR---TVCHCGADNCSG 1242
>gi|195574451|ref|XP_002105202.1| GD18047 [Drosophila simulans]
gi|194201129|gb|EDX14705.1| GD18047 [Drosophila simulans]
Length = 567
Score = 62.8 bits (151), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/151 (28%), Positives = 75/151 (49%), Gaps = 25/151 (16%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G+V N+E DFV+E++GEV + R +++ D +Y + +E+
Sbjct: 384 RGFGLV-NREP-IAAGDFVIEYVGEV------INHAEFQRRMEQKQRDRDENYYFLGVEK 435
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
++DA K N A + HSC PNCE + V+ +++GI+ ++ I E+
Sbjct: 436 D---------FIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIKDIPVNTEL 486
Query: 2011 TFDY---NSVTESKEEYEASVCLCGSQVCRG 2038
TF+Y + + SK+ C CG++ C G
Sbjct: 487 TFNYLWDDLMNNSKK-----ACFCGAKRCSG 512
>gi|417406466|gb|JAA49891.1| Putative histone-lysine n-methyltransferase nsd3-like isoform 3
[Desmodus rotundus]
Length = 1438
Score = 62.8 bits (151), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 79/158 (50%), Gaps = 21/158 (13%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1146 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1197
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1198 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ I G E+TF+YN E C CG++ C G
Sbjct: 1249 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGAENCSG 1283
>gi|226493201|ref|NP_001149253.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20specific [Zea mays]
gi|194704072|gb|ACF86120.1| unknown [Zea mays]
gi|195625808|gb|ACG34734.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20specific [Zea mays]
gi|238014446|gb|ACR38258.1| unknown [Zea mays]
gi|414589294|tpg|DAA39865.1| TPA: putative histone-lysine N-methyltransferase family protein
isoform 1 [Zea mays]
gi|414589295|tpg|DAA39866.1| TPA: putative histone-lysine N-methyltransferase family protein
isoform 2 [Zea mays]
Length = 339
Score = 62.8 bits (151), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 69/151 (45%), Gaps = 31/151 (20%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+V E GE FV+E++GEV +D E ++ +
Sbjct: 127 GHGLVAEDEIKKGE--FVIEYVGEVI-------------------DDRTCE-NRLWTMKR 164
Query: 1952 KGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
D D Y +V+DA +K N + I HSC PN + VDG ++GI+ +R I
Sbjct: 165 LLDTDFYLCEVSSNMVIDATNKGNRSRFINHSCEPNTAMQKWTVDGETRVGIFALRDIKI 224
Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
GEE+T+DY V A VC CGS CR
Sbjct: 225 GEELTYDYKFVQFGA----AQVCHCGSSNCR 251
>gi|356559949|ref|XP_003548258.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Glycine
max]
Length = 349
Score = 62.4 bits (150), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 71/146 (48%), Gaps = 21/146 (14%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+V +++ GE FV+E++GEV E+ ++ + N FY + R
Sbjct: 126 GSGIVADEDIKLGE--FVIEYVGEVIDDKTCEERLWNMKHSGETN------FYLCEINRD 177
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
+V+DA +K N + I HSC PN E + +DG +IGI+ R I GE +T
Sbjct: 178 ---------MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIQKGEHLT 228
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCR 2037
+DY V ++ C CG+ CR
Sbjct: 229 YDYQFVQFGADQD----CHCGAAECR 250
>gi|319796552|ref|YP_004158192.1| nuclear protein set [Variovorax paradoxus EPS]
gi|315599015|gb|ADU40081.1| nuclear protein SET [Variovorax paradoxus EPS]
Length = 205
Score = 62.4 bits (150), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 51/151 (33%), Positives = 70/151 (46%), Gaps = 32/151 (21%)
Query: 1901 GGFGEDDF-----VVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDA 1955
G F DD ++E+ GEV WK +L+++ DPA + Y G
Sbjct: 24 GVFAVDDLAEGETLIEYKGEVIN-WK--------EALRRHPHDPAQPNHTFYFHIDDGR- 73
Query: 1956 DGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYN 2015
V+D K N A I HSC PNCEA VDG ++ I +R I GEE+ +DY
Sbjct: 74 ------VIDGNVKGNDARWINHSCEPNCEAD--EVDG--RVYIKALRNIAAGEELNYDYG 123
Query: 2016 SVTESKEEYEASV-----CLCGSQVCRGSYL 2041
+ + E Y + C CGS+ CRG+ L
Sbjct: 124 LIID--EPYTPKLLSEFPCWCGSEQCRGTLL 152
>gi|385207702|ref|ZP_10034570.1| SET domain-containing protein [Burkholderia sp. Ch1-1]
gi|385180040|gb|EIF29316.1| SET domain-containing protein [Burkholderia sp. Ch1-1]
Length = 174
Score = 62.4 bits (150), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 61/117 (52%), Gaps = 20/117 (17%)
Query: 1931 SLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAV 1990
+L+++ +PA + Y G V+D N A I HSC PNCEA+ +
Sbjct: 43 ALRRHPHNPAEPNHTFYFALDSGK-------VIDGKVNGNSARWINHSCAPNCEAE--EI 93
Query: 1991 DGHYQIGIYTVRGIHYGEEITFDYNSVTES------KEEYEASVCLCGSQVCRGSYL 2041
DGH + ++ +R I GEE+ +DY V ++ K+EYE C CG++ CRG+ L
Sbjct: 94 DGH--VYVHALRDIAEGEEVFYDYGLVIDARQTKKLKKEYE---CRCGARKCRGTML 145
>gi|397521373|ref|XP_003830771.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Pan
paniscus]
Length = 1437
Score = 62.4 bits (150), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 78/158 (49%), Gaps = 21/158 (13%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1145 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1196
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1197 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1247
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ I G E+TF+YN E C CG+ C G
Sbjct: 1248 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGADNCSG 1282
>gi|410307858|gb|JAA32529.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
Length = 1437
Score = 62.4 bits (150), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 78/158 (49%), Gaps = 21/158 (13%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1145 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1196
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1197 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1247
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ I G E+TF+YN E C CG+ C G
Sbjct: 1248 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGADNCSG 1282
>gi|402878017|ref|XP_003902703.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Papio anubis]
Length = 1438
Score = 62.4 bits (150), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 85/175 (48%), Gaps = 22/175 (12%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1146 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1197
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1198 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKE 2055
+ I G E+TF+YN E C CG+ C G +L + + A +E
Sbjct: 1249 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGADNCSG-FLGVRPKSACASTTEE 1299
>gi|420253143|ref|ZP_14756206.1| SET domain-containing protein [Burkholderia sp. BT03]
gi|398052652|gb|EJL44901.1| SET domain-containing protein [Burkholderia sp. BT03]
Length = 160
Score = 62.4 bits (150), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 61/117 (52%), Gaps = 20/117 (17%)
Query: 1931 SLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAV 1990
+L+++ +P + Y GD V+D K N A I HSC PNCEA+ +
Sbjct: 43 ALRRHPHNPDEPNHTFYFALDSGD-------VIDGKVKGNSARWINHSCAPNCEAE--EI 93
Query: 1991 DGHYQIGIYTVRGIHYGEEITFDYNSVTES------KEEYEASVCLCGSQVCRGSYL 2041
DGH + I +R I GEE+ +DY V ++ K+EYE C CG++ CRG+ L
Sbjct: 94 DGH--VFIDALRDIGAGEELFYDYGLVIDARQTKKLKKEYE---CRCGARKCRGTML 145
>gi|350855153|emb|CCD58126.1| huntingtin interacting protein-related [Schistosoma mansoni]
Length = 887
Score = 62.4 bits (150), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 86/203 (42%), Gaps = 33/203 (16%)
Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
Y KG G++ G FV+E++GEV ++ + L
Sbjct: 286 YAGKDKGWGLMATDNVKKGS--FVIEYVGEVIDFSEFRRRIRRYERL------------- 330
Query: 1946 IYLERPKGDADGYDLVV-----VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
G A Y + V +DA K N+A + HSC PNC + +V+G +IG +
Sbjct: 331 -------GHAHHYFMAVESDRFIDAGSKGNWARFVNHSCEPNCVTQKWSVNGEIRIGFFA 383
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLL 2060
I G+E+T DY V E + C CG+ C G + T + EKV + ++
Sbjct: 384 KEDIPSGQEVTIDYQFVQYGVSEQK---CYCGASTCSG-IMGATSKYLQEKVRMKDTTMV 439
Query: 2061 DRHQLMLEACELNSVSEEDYLEL 2083
+R +L+ +L+S D + L
Sbjct: 440 ERR--ILQLLQLDSFRNADDITL 460
>gi|414590165|tpg|DAA40736.1| TPA: putative trithorax-like family protein [Zea mays]
Length = 1566
Score = 62.0 bits (149), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 67/135 (49%), Gaps = 14/135 (10%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
VVE++GE+ ++ +S ++ A F+ I E ++DA
Sbjct: 1446 MVVEYVGEIVGQRVADRREIEYQSGKRQQYKSACYFFKIDREH-----------IIDATR 1494
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
K A + HSC+PNC AK+ +V ++ + R I+ GEEIT+DY+ E +E +
Sbjct: 1495 KGGIARFVNHSCQPNCVAKIISVRNEKKVMFFAERHINPGEEITYDYHFNRE--DEGQRI 1552
Query: 2028 VCLCGSQVCRGSYLN 2042
+C C S+ CR YLN
Sbjct: 1553 LCFCRSRYCR-RYLN 1566
>gi|397521377|ref|XP_003830773.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 3 [Pan
paniscus]
Length = 1388
Score = 62.0 bits (149), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 78/158 (49%), Gaps = 21/158 (13%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1096 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1147
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1148 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1198
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ I G E+TF+YN E C CG+ C G
Sbjct: 1199 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGADNCSG 1233
>gi|395847337|ref|XP_003796335.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
[Otolemur garnettii]
Length = 1389
Score = 62.0 bits (149), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 78/158 (49%), Gaps = 21/158 (13%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1097 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1148
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1149 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1199
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ I G E+TF+YN E C CG+ C G
Sbjct: 1200 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGADNCSG 1234
>gi|426256406|ref|XP_004021831.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2 [Ovis
aries]
Length = 1439
Score = 62.0 bits (149), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 85/175 (48%), Gaps = 22/175 (12%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1147 PDAEVIRTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1198
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1199 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1249
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKE 2055
+ I G E+TF+YN E C CG+ C G +L + + A +E
Sbjct: 1250 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGADNCSG-FLGVRPKSACASTAEE 1300
>gi|426359420|ref|XP_004046973.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Gorilla gorilla
gorilla]
Length = 1397
Score = 62.0 bits (149), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 78/158 (49%), Gaps = 21/158 (13%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1105 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1156
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1157 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1207
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ I G E+TF+YN E C CG+ C G
Sbjct: 1208 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGADNCSG 1242
>gi|440907576|gb|ELR57709.1| Histone-lysine N-methyltransferase NSD3 [Bos grunniens mutus]
Length = 1446
Score = 62.0 bits (149), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 85/175 (48%), Gaps = 22/175 (12%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1156 PDAEVIRTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1207
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1208 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1258
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKE 2055
+ I G E+TF+YN E C CG+ C G +L + + A +E
Sbjct: 1259 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGADNCSG-FLGVRPKSACASTAEE 1309
>gi|395847335|ref|XP_003796334.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Otolemur garnettii]
Length = 1438
Score = 62.0 bits (149), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 78/158 (49%), Gaps = 21/158 (13%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1146 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1197
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1198 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ I G E+TF+YN E C CG+ C G
Sbjct: 1249 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGADNCSG 1283
>gi|255078218|ref|XP_002502689.1| set domain protein [Micromonas sp. RCC299]
gi|226517954|gb|ACO63947.1| set domain protein [Micromonas sp. RCC299]
Length = 1065
Score = 62.0 bits (149), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 67/150 (44%), Gaps = 22/150 (14%)
Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAP-EFYNIYL 1948
RKG G+ + + F++E++GEV +D RS + +D +Y + L
Sbjct: 181 RKGHGLFTKQA--LKKGQFIIEYIGEVL-------HEDEYRSRKARYDDEGRRHYYFMTL 231
Query: 1949 ERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGE 2008
+ +DA + N + HSC PNCE + V+G IGIY + I G+
Sbjct: 232 SSSE---------TIDAAERGNAGRFLNHSCDPNCETQKWMVNGELCIGIYALTDIDAGD 282
Query: 2009 EITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
E+TFDYN + C CG+ C G
Sbjct: 283 ELTFDYNFERYGDNPIK---CFCGTSRCGG 309
>gi|414590164|tpg|DAA40735.1| TPA: putative trithorax-like family protein [Zea mays]
Length = 1591
Score = 62.0 bits (149), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 67/135 (49%), Gaps = 14/135 (10%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
VVE++GE+ ++ +S ++ A F+ I E ++DA
Sbjct: 1471 MVVEYVGEIVGQRVADRREIEYQSGKRQQYKSACYFFKIDREH-----------IIDATR 1519
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
K A + HSC+PNC AK+ +V ++ + R I+ GEEIT+DY+ E +E +
Sbjct: 1520 KGGIARFVNHSCQPNCVAKIISVRNEKKVMFFAERHINPGEEITYDYHFNRE--DEGQRI 1577
Query: 2028 VCLCGSQVCRGSYLN 2042
+C C S+ CR YLN
Sbjct: 1578 LCFCRSRYCR-RYLN 1591
>gi|315364634|pdb|3OOI|A Chain A, Crystal Structure Of Human Histone-Lysine
N-Methyltransferase Nsd1 Set Domain In Complex With
S-Adenosyl-L-Methionine
Length = 232
Score = 62.0 bits (149), Expect = 4e-06, Method: Composition-based stats.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 116 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 160
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 161 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 217
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 218 TVCKCGAPNCSG 229
>gi|386331753|ref|YP_006027922.1| set domain protein [Ralstonia solanacearum Po82]
gi|334194201|gb|AEG67386.1| set domain protein [Ralstonia solanacearum Po82]
Length = 188
Score = 62.0 bits (149), Expect = 4e-06, Method: Composition-based stats.
Identities = 42/136 (30%), Positives = 68/136 (50%), Gaps = 23/136 (16%)
Query: 1909 VVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHK 1968
++E+ GE + WK +L+++ DP+ + Y G V+DA +
Sbjct: 64 IIEYKGE-HISWK--------EALRRHPHDPSDPNHTFYFSLEDGS-------VIDAKYG 107
Query: 1969 ANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS- 2027
N A I H+C+PNCEA+ DG ++ I+ +R I GEE+ +DY V E ++
Sbjct: 108 GNRARWINHACKPNCEAR--EKDG--RVFIHALRDIEAGEELFYDYGLVIEGRQTKALKA 163
Query: 2028 --VCLCGSQVCRGSYL 2041
C CG++ CRG+ L
Sbjct: 164 QFACHCGAKTCRGTML 179
>gi|355708046|gb|AES03147.1| nuclear receptor binding SET domain protein 1 [Mustela putorius furo]
Length = 261
Score = 62.0 bits (149), Expect = 5e-06, Method: Composition-based stats.
Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
+FV E++GE+ ++++ ++ E FY + L++ + ++DA
Sbjct: 142 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 186
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K NYA + H C+PNCE + +V+G ++G++ + I G E+TF+YN +
Sbjct: 187 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 243
Query: 2027 SVCLCGSQVCRG 2038
+VC CG+ C G
Sbjct: 244 TVCKCGAPNCSG 255
>gi|431902251|gb|ELK08752.1| Histone-lysine N-methyltransferase NSD3 [Pteropus alecto]
Length = 1322
Score = 62.0 bits (149), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 78/158 (49%), Gaps = 21/158 (13%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 1105 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1156
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 1157 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1207
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ I G E+TF+YN E C CG+ C G
Sbjct: 1208 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGADNCSG 1242
>gi|239818159|ref|YP_002947069.1| histone-lysine N-methyltransferase [Variovorax paradoxus S110]
gi|239804736|gb|ACS21803.1| Histone-lysine N-methyltransferase [Variovorax paradoxus S110]
Length = 210
Score = 61.6 bits (148), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 47/144 (32%), Positives = 68/144 (47%), Gaps = 27/144 (18%)
Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVV 1962
E + ++E+ GEV WK +L+++ DPA + Y G V
Sbjct: 31 LAEGETLIEYKGEVIS-WK--------EALRRHPHDPAQPNHTFYFHIDDGR-------V 74
Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
+D K N A I HSC PNCEA +DG ++ I +R I GEE+ +DY + + E
Sbjct: 75 IDGNVKGNDARWINHSCEPNCEAD--EIDG--RVYIKALRNIAAGEELNYDYGLIID--E 128
Query: 2023 EYEASV-----CLCGSQVCRGSYL 2041
Y + C CGS+ CRG+ L
Sbjct: 129 PYTPKLLSEFPCWCGSENCRGTLL 152
>gi|302795285|ref|XP_002979406.1| hypothetical protein SELMODRAFT_110353 [Selaginella moellendorffii]
gi|300153174|gb|EFJ19814.1| hypothetical protein SELMODRAFT_110353 [Selaginella moellendorffii]
Length = 274
Score = 61.6 bits (148), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 42/132 (31%), Positives = 64/132 (48%), Gaps = 19/132 (14%)
Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
DF++E++GEV E+ ++ +NN FY + K V+DA
Sbjct: 109 DFLIEYIGEVIDDKTCEERLWDLKERGENN------FYLCEVGHDK---------VIDAT 153
Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
K N + I HSC PN + + DG +IG++ V I G+EIT+DY + E+
Sbjct: 154 FKGNMSRFINHSCDPNAQLRKWQCDGELRIGVFAVSRILKGQEITYDYKYIQFGTEQQ-- 211
Query: 2027 SVCLCGSQVCRG 2038
C CGS+ C+G
Sbjct: 212 --CHCGSKNCKG 221
>gi|449665927|ref|XP_002164851.2| PREDICTED: histone-lysine N-methyltransferase NSD2-like [Hydra
magnipapillata]
Length = 1214
Score = 61.6 bits (148), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 40/150 (26%), Positives = 78/150 (52%), Gaps = 24/150 (16%)
Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
+G G++ + + GE FV+E++GE+ +++ R +++ +E ++Y + +++
Sbjct: 860 RGWGLMADTDIKQGE--FVIEYVGEL------IDEETCHRRVREYHEKDIFDYYFLTIDK 911
Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
++DA K N + + HSC PNCE + V+G ++ ++ R I GEE+
Sbjct: 912 DN---------IIDAYPKGNMSRFMNHSCNPNCETQKWTVNGEIRVALFATRDIKMGEEL 962
Query: 2011 TFDYN--SVTESKEEYEASVCLCGSQVCRG 2038
F+YN S+ K++ C CG+ C G
Sbjct: 963 CFNYNLDSLGNDKKQ-----CKCGAVNCSG 987
>gi|402588522|gb|EJW82455.1| SET domain-containing protein [Wuchereria bancrofti]
Length = 626
Score = 61.6 bits (148), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 33/75 (44%), Positives = 42/75 (56%), Gaps = 2/75 (2%)
Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
VDA + N A HSC PN + VDG Y++ I T++ I GEE+TFDY+ TE E
Sbjct: 495 VDARNYGNIARSFNHSCEPNTKVDAVVVDGIYRLKISTIKDIKKGEELTFDYD--TEIIE 552
Query: 2023 EYEASVCLCGSQVCR 2037
C CGS+ CR
Sbjct: 553 GLVGMECFCGSKNCR 567
>gi|405966105|gb|EKC31425.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Crassostrea gigas]
Length = 1079
Score = 61.6 bits (148), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 57/251 (22%), Positives = 111/251 (44%), Gaps = 33/251 (13%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
FV E++GE+ ++++ R + +++E+ +Y + L++ + V+DA
Sbjct: 736 FVHEYVGEL------IDEEEVKRRIDESHENNISNYYMLTLDKNR---------VIDAGP 780
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
K N + + HSC PNCE + +G ++G++ + I G E+TF+YN ++ +
Sbjct: 781 KGNLSRFMNHSCAPNCETQKWTANGDVRVGLFAIYDIPAGTELTFNYNLECLGNDK---T 837
Query: 2028 VCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEED--YLELGR 2085
C CG+++C G +L + + A + + ++ + +++ E D G
Sbjct: 838 KCNCGAELCSG-FLGVRPKSAVAASVAKGKKKDEKKKRKRNKKKIDGKKEHDDECFRCGE 896
Query: 2086 AG-LGSCLLGGLPNWVVAYSARLVRFINLERTKLPE---EILRHNLEEKRKYFSDICLEV 2141
G L C GG P ++ L+ +K P + H+ +E K +C E
Sbjct: 897 GGELVMCDRGGCP--------KVYHLHCLKLSKPPHGKWDCPWHHCDECGKPAITMCTEC 948
Query: 2142 EKSDAEVQAEG 2152
S EG
Sbjct: 949 PNSFCATHTEG 959
>gi|295678165|ref|YP_003606689.1| nuclear protein SET [Burkholderia sp. CCGE1002]
gi|295438008|gb|ADG17178.1| nuclear protein SET [Burkholderia sp. CCGE1002]
Length = 177
Score = 61.6 bits (148), Expect = 6e-06, Method: Composition-based stats.
Identities = 45/139 (32%), Positives = 69/139 (49%), Gaps = 29/139 (20%)
Query: 1909 VVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHK 1968
++E+ GE WK +L+++ +PA + Y G V+D
Sbjct: 30 LIEYKGERI-TWK--------EALRRHPHNPAEPNHTFYFALDNGK-------VIDGKVN 73
Query: 1969 ANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES------KE 2022
N A I HSC PNCEA+ +DGH + ++ +R I GEE+ +DY V ++ K+
Sbjct: 74 GNSARWINHSCAPNCEAE--EIDGH--VYVHALRDIAEGEEVFYDYGLVIDARQTKKLKK 129
Query: 2023 EYEASVCLCGSQVCRGSYL 2041
EYE C CG++ CRG+ L
Sbjct: 130 EYE---CRCGARKCRGTML 145
>gi|209515808|ref|ZP_03264671.1| nuclear protein SET [Burkholderia sp. H160]
gi|209503835|gb|EEA03828.1| nuclear protein SET [Burkholderia sp. H160]
Length = 170
Score = 61.2 bits (147), Expect = 6e-06, Method: Composition-based stats.
Identities = 40/117 (34%), Positives = 61/117 (52%), Gaps = 20/117 (17%)
Query: 1931 SLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAV 1990
+L+++ +PA + Y G V+D N A I HSC PNCEA+ +
Sbjct: 43 ALRRHPHNPAEPNHTFYFALDNGK-------VIDGKVNGNSARWINHSCAPNCEAE--EI 93
Query: 1991 DGHYQIGIYTVRGIHYGEEITFDYNSVTES------KEEYEASVCLCGSQVCRGSYL 2041
DGH + ++ +R I GEE+ +DY V ++ K+EYE C CG++ CRG+ L
Sbjct: 94 DGH--VYVHALRDIAEGEEVFYDYGLVIDARQTKKLKKEYE---CRCGARKCRGTML 145
>gi|187930618|ref|YP_001901105.1| nuclear protein SET [Ralstonia pickettii 12J]
gi|187727508|gb|ACD28673.1| nuclear protein SET [Ralstonia pickettii 12J]
Length = 179
Score = 60.8 bits (146), Expect = 8e-06, Method: Composition-based stats.
Identities = 43/136 (31%), Positives = 68/136 (50%), Gaps = 23/136 (16%)
Query: 1909 VVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHK 1968
++E+ GE + WK +L+++ DP+ + Y G V+DA
Sbjct: 55 IIEYKGE-HITWK--------EALRRHPHDPSDPNHTFYFSLEDGS-------VIDAKFG 98
Query: 1969 ANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE---YE 2025
N A I H+C+PNCEA+ DG ++ I+ +R I GEE+ +DY V E ++ E
Sbjct: 99 GNRARWINHACKPNCEAR--EEDG--RVFIHALRDIEPGEELFYDYGLVIEGRQTKALKE 154
Query: 2026 ASVCLCGSQVCRGSYL 2041
C CG++ CRG+ L
Sbjct: 155 QFACRCGAKRCRGTML 170
>gi|241664808|ref|YP_002983168.1| nuclear protein SET [Ralstonia pickettii 12D]
gi|240866835|gb|ACS64496.1| nuclear protein SET [Ralstonia pickettii 12D]
Length = 179
Score = 60.8 bits (146), Expect = 9e-06, Method: Composition-based stats.
Identities = 43/136 (31%), Positives = 68/136 (50%), Gaps = 23/136 (16%)
Query: 1909 VVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHK 1968
++E+ GE + WK +L+++ DP+ + Y G V+DA
Sbjct: 55 IIEYKGE-HITWK--------EALRRHPHDPSDPNHTFYFSLEDGS-------VIDAKFG 98
Query: 1969 ANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE---YE 2025
N A I H+C+PNCEA+ DG ++ I+ +R I GEE+ +DY V E ++ E
Sbjct: 99 GNRARWINHACKPNCEAR--EEDG--RVFIHALRDIEPGEELFYDYGLVIEGRQTKALKE 154
Query: 2026 ASVCLCGSQVCRGSYL 2041
C CG++ CRG+ L
Sbjct: 155 QFACRCGAKKCRGTML 170
>gi|281346901|gb|EFB22485.1| hypothetical protein PANDA_005493 [Ailuropoda melanoleuca]
Length = 926
Score = 60.8 bits (146), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 79/158 (50%), Gaps = 21/158 (13%)
Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
PD + + R+G G+ + GE FV E++GE+ ++++ +++ +E+
Sbjct: 634 PDAEVIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECKLRIKRAHENSV 685
Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
FY + + + + ++DA K NY+ + HSC PNCE + V+G ++G++
Sbjct: 686 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 736
Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
+ I G E+TF+YN E C CG++ C G
Sbjct: 737 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGAENCSG 771
>gi|340506525|gb|EGR32648.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 978
Score = 60.8 bits (146), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 70/146 (47%), Gaps = 22/146 (15%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
F+++++GEV+ + ++GI+ ++ + YL + + V+D
Sbjct: 73 FIIQYIGEVFDI----NSEEGIKRVKDYSRSTCT-----YLMKIDKNE------VIDPTF 117
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
K N A I HSC PNC + V G IGI+ ++ I +E+TFDY + Y+
Sbjct: 118 KGNLARFINHSCDPNCITQKWHVLGEICIGIFAIKNIKEDDELTFDYQF-----DSYKTP 172
Query: 2028 V--CLCGSQVCRGSYLNLTGEGAFEK 2051
+ CLCG+ C+G + + FE+
Sbjct: 173 LTKCLCGNVKCKGYLGYIPTDYTFEE 198
>gi|76161881|gb|AAX30110.2| KIAA1076 protein [Schistosoma japonicum]
Length = 123
Score = 60.5 bits (145), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/81 (44%), Positives = 45/81 (55%), Gaps = 4/81 (4%)
Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
D V+DA N A I HSC+PNC AK+ V+ +I IY+ R I+ EEIT+DY
Sbjct: 45 DDFVIDATMCGNNARFINHSCQPNCYAKIIMVESKKKIVIYSKRDINVMEEITYDYKFPY 104
Query: 2019 ESKEEYEASVCLCGSQVCRGS 2039
E E C CGS CRG+
Sbjct: 105 EE----EKIPCQCGSSSCRGT 121
>gi|296004740|ref|XP_966279.2| SET domain protein, putative [Plasmodium falciparum 3D7]
gi|263429753|sp|C6KTD2.1|HKNMT_PLAF7 RecName: Full=Putative histone-lysine N-methyltransferase PFF1440w
gi|225631776|emb|CAG25109.2| SET domain protein, putative [Plasmodium falciparum 3D7]
Length = 6753
Score = 60.5 bits (145), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/143 (31%), Positives = 70/143 (48%), Gaps = 21/143 (14%)
Query: 1902 GFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----IYLERPKGDAD 1956
G+G + EF+ E PV ++ + IR++ D ++Y+ Y+ R +
Sbjct: 6623 GYGL--YTCEFINEGEPVIEYI--GEYIRNII---SDKREKYYDKIESSCYMFRLNEN-- 6673
Query: 1957 GYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQ-IGIYTVRGIHYGEEITFDYN 2015
+++DA N + I HSC PNC K+ + D + + I I+ R I EEIT+DY
Sbjct: 6674 ----IIIDATKWGNVSRFINHSCEPNCFCKIVSCDQNLKHIVIFAKRDIAAHEEITYDYQ 6729
Query: 2016 SVTESKEEYEASVCLCGSQVCRG 2038
ES E + +CLCGS C G
Sbjct: 6730 FGVES--EGKKLICLCGSSTCLG 6750
>gi|225430418|ref|XP_002283013.1| PREDICTED: histone-lysine N-methyltransferase ATX1-like [Vitis
vinifera]
Length = 496
Score = 60.5 bits (145), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/135 (32%), Positives = 67/135 (49%), Gaps = 14/135 (10%)
Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
VVE++GE+ + +++ +S +K A F+ I E ++DA
Sbjct: 376 MVVEYVGEIVGLRVADKRESDYQSGRKLQYKTACYFFRIDKEH-----------IIDATR 424
Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
K A + HSC PNC AKV +V ++ + R I+ GEEIT+DY+ E +E +
Sbjct: 425 KGGIARFVNHSCLPNCVAKVISVRNEKKVVFFAERDINPGEEITYDYHFNHE--DEGKKI 482
Query: 2028 VCLCGSQVCRGSYLN 2042
C C S+ CR YLN
Sbjct: 483 PCFCNSRNCR-RYLN 496
>gi|213404666|ref|XP_002173105.1| carboxypeptidase Y [Schizosaccharomyces japonicus yFS275]
gi|212001152|gb|EEB06812.1| carboxypeptidase Y [Schizosaccharomyces japonicus yFS275]
Length = 1055
Score = 60.5 bits (145), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/194 (25%), Positives = 70/194 (36%), Gaps = 24/194 (12%)
Query: 409 RSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPY 468
R P H + P D +HRDR P + P DR P FD P +R P + + P
Sbjct: 253 RKPEHHGKPPMDFEHEPEHRDRPPMDFEHGPEHHDRPPMDFDHEPEHHDRPPMDFEHGPE 312
Query: 469 AREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNY----------------- 511
+ P D R +H R P E P+ +R P +
Sbjct: 313 RHGEPPRDFERKPEHHGRPPKDFEHGPEHHGEPPRDFERKPEHHGKPPKHFEPEREHHGE 372
Query: 512 ----LERSPLHRSRPNNHREASSKTGASEKRNARYDSKGHEDKLGPKDSNARCSRSSAKE 567
ER P H +P H E + ++ +D HE PK+S KE
Sbjct: 373 PPRDFERKPEHHGKPPKHFEPEREHRDRPPKDFEHDRAHHEKP--PKESEPEQHEKQPKE 430
Query: 568 SQDKSNVQDLNVSD 581
S+ + + DL + D
Sbjct: 431 SKPEQEI-DLQIVD 443
Score = 57.0 bits (136), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/142 (29%), Positives = 53/142 (37%), Gaps = 20/142 (14%)
Query: 394 HEPSLSSRVIYD------RHGRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDR--- 444
HEP R D RHG P +R P G+ H + P HDR P +
Sbjct: 169 HEPEHHDRPPMDFEHGPKRHGEPPEDFERKPEHHGKPPKHFEPGPDHHDRPPKDFEHGPE 228
Query: 445 ----SPYTFDRSPYSRERSPYNRDRSPYAREKSPYDRSRHYDHRNRSPFSAERSPQDRAR 500
P F+R P R P + +R P K P D +HR+R P E P+
Sbjct: 229 HHGEPPRDFERKPEHRGEPPRDFERKPEHHGKPPMDFEHEPEHRDRPPMDFEHGPE---- 284
Query: 501 FHDRSDRTPNYLERSPLHRSRP 522
DR P + P H RP
Sbjct: 285 ---HHDRPPMDFDHEPEHHDRP 303
Score = 51.2 bits (121), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 35/138 (25%), Positives = 50/138 (36%), Gaps = 7/138 (5%)
Query: 409 RSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPY 468
R P H P D R +H + P + P RDR P F+ P +R P + D P
Sbjct: 239 RKPEHRGEPPRDFERKPEHHGKPPMDFEHEPEHRDRPPMDFEHGPEHHDRPPMDFDHEPE 298
Query: 469 AREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNY-------LERSPLHRSR 521
++ P D + P ER P+ R + P + ER P H +
Sbjct: 299 HHDRPPMDFEHGPERHGEPPRDFERKPEHHGRPPKDFEHGPEHHGEPPRDFERKPEHHGK 358
Query: 522 PNNHREASSKTGASEKRN 539
P H E + R+
Sbjct: 359 PPKHFEPEREHHGEPPRD 376
Score = 49.7 bits (117), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 46/121 (38%), Gaps = 7/121 (5%)
Query: 409 RSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPY 468
R P H DR P D +H DR P + P P F+R P + P + + P
Sbjct: 155 REPEHHDRPPMDFEHEPEHHDRPPMDFEHGPKRHGEPPEDFERKPEHHGKPPKHFEPGPD 214
Query: 469 AREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNY-------LERSPLHRSR 521
++ P D +H P ER P+ R +R P + E P HR R
Sbjct: 215 HHDRPPKDFEHGPEHHGEPPRDFERKPEHRGEPPRDFERKPEHHGKPPMDFEHEPEHRDR 274
Query: 522 P 522
P
Sbjct: 275 P 275
>gi|242051571|ref|XP_002454931.1| hypothetical protein SORBIDRAFT_03g001640 [Sorghum bicolor]
gi|241926906|gb|EES00051.1| hypothetical protein SORBIDRAFT_03g001640 [Sorghum bicolor]
Length = 993
Score = 60.5 bits (145), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 55/185 (29%), Positives = 82/185 (44%), Gaps = 28/185 (15%)
Query: 1873 GILKAMDSRPDDKYVAYRKGLG---------VVCNKEGGFGEDDFVVEFLGEVYPVWKWF 1923
G+ MD + D + +++ LG V C + G G F + E V ++
Sbjct: 822 GLNACMDRKDDQSFSTFKERLGYLQKTENLRVSCGRSGIHGWGLFAARNIQEGQMVIEYR 881
Query: 1924 EKQ-----DGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHS 1978
+Q +R Q + E + YL + D VV+DA K N A I HS
Sbjct: 882 GEQVRRCVADLREAQYHREKK-----DCYLFKISED------VVIDATDKGNIARLINHS 930
Query: 1979 CRPNCEAKVTAVDG-HYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
C PNC A++ V G QI + R + GEE+T+DY + E+ + CLC + CR
Sbjct: 931 CMPNCYARIMTVSGDRNQIILIAKRDVSAGEELTYDYLFDPDESEDCKVP-CLCKAPNCR 989
Query: 2038 GSYLN 2042
G Y+N
Sbjct: 990 G-YMN 993
>gi|345781638|ref|XP_003432154.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Canis lupus
familiaris]
Length = 742
Score = 60.5 bits (145), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 72/141 (51%), Gaps = 23/141 (16%)
Query: 1898 NKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADG 1957
NK+G +FV E++GE+ ++++ +++ +E+ FY + + + +
Sbjct: 535 NKQG-----EFVNEYVGEL------IDEEECRLRIKRAHENSVTNFYMLTVTKDR----- 578
Query: 1958 YDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSV 2017
++DA K NY+ + HSC PNCE + V+G ++G++ + I G E+TF+YN
Sbjct: 579 ----IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFALCDIPAGMELTFNYNLD 634
Query: 2018 TESKEEYEASVCLCGSQVCRG 2038
E C CG++ C G
Sbjct: 635 CLGNGRTE---CHCGAENCSG 652
>gi|356530969|ref|XP_003534051.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Glycine
max]
Length = 349
Score = 60.5 bits (145), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 70/146 (47%), Gaps = 21/146 (14%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+V +++ GE FV+E++GEV E+ ++ + N FY + R
Sbjct: 126 GSGIVADEDIKLGE--FVIEYVGEVIDDKTCEERLWNMKHRGETN------FYLCEINRD 177
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
+V+DA +K N + I HSC PN E + +DG +IGI+ I GE +T
Sbjct: 178 ---------MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATSDIQKGEHLT 228
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCR 2037
+DY V ++ C CG+ CR
Sbjct: 229 YDYQFVQFGADQ----DCHCGAAECR 250
>gi|47222897|emb|CAF99053.1| unnamed protein product [Tetraodon nigroviridis]
Length = 768
Score = 60.1 bits (144), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/145 (26%), Positives = 72/145 (49%), Gaps = 18/145 (12%)
Query: 1894 GVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKG 1953
GV+ + FV+E++GEV ++++ ++ E+ FY + L++ +
Sbjct: 480 GVLMTSSDATSQGAFVIEYVGEV------IDEEECRARIKHAQENDIFNFYMLTLDKDR- 532
Query: 1954 DADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFD 2013
++DA K N A + H C+PNCE + V+G ++G++ ++ I G+E+ F+
Sbjct: 533 --------IIDAGPKGNQARFMNHCCQPNCETQKWTVNGDTRVGLFALQDIPKGKELNFN 584
Query: 2014 YNSVTESKEEYEASVCLCGSQVCRG 2038
YN + +VC CG+ C G
Sbjct: 585 YNLECLGNGK---TVCKCGAPNCSG 606
>gi|2980780|emb|CAA18207.1| putative protein [Arabidopsis thaliana]
gi|7269987|emb|CAB79804.1| putative protein [Arabidopsis thaliana]
Length = 477
Score = 60.1 bits (144), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/139 (28%), Positives = 65/139 (46%), Gaps = 29/139 (20%)
Query: 1905 EDDFVVEFLGEVYPVWK-----WFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYD 1959
++DF+VE++GEV + W K G++ +FY +++
Sbjct: 328 KEDFIVEYIGEVISDAQCEQRLWDMKHKGMK-----------DFYMCEIQKD-------- 368
Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
+DA K N + + HSC PNC + V+G ++G++ R I GE +T+DY V
Sbjct: 369 -FTIDATFKGNASRFLNHSCNPNCVLEKWQVEGETRVGVFAARQIEAGEPLTYDYRFVQF 427
Query: 2020 SKEEYEASVCLCGSQVCRG 2038
E C CGS+ C+G
Sbjct: 428 GPE----VKCNCGSENCQG 442
>gi|407715215|ref|YP_006835780.1| nuclear protein SET [Burkholderia phenoliruptrix BR3459a]
gi|407237399|gb|AFT87598.1| nuclear protein SET [Burkholderia phenoliruptrix BR3459a]
Length = 173
Score = 60.1 bits (144), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 48/139 (34%), Positives = 70/139 (50%), Gaps = 29/139 (20%)
Query: 1909 VVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHK 1968
++E+ GE WK + +R N ++P FY L+ K V+D
Sbjct: 18 LIEYKGERIS-WK-----EALRRHPHNPDEPNHTFY-FALDSGK---------VIDGKVN 61
Query: 1969 ANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES------KE 2022
N A I HSC PNCEA+ +DGH + ++ +R I GEE+ +DY V ++ K+
Sbjct: 62 GNSARWINHSCAPNCEAE--EIDGH--VYVHALRDIAEGEELFYDYGLVIDARQTKKLKK 117
Query: 2023 EYEASVCLCGSQVCRGSYL 2041
EYE C CGS+ CRG+ L
Sbjct: 118 EYE---CRCGSRKCRGTML 133
>gi|224117806|ref|XP_002331636.1| SET domain protein [Populus trichocarpa]
gi|222874032|gb|EEF11163.1| SET domain protein [Populus trichocarpa]
Length = 351
Score = 59.7 bits (143), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 67/146 (45%), Gaps = 21/146 (14%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+V +++ GE FV+E++GEV + L K FY + R
Sbjct: 124 GSGIVADEDIKQGE--FVIEYVGEV------IDDNTCEERLWKMKHRGETNFYLCEINRN 175
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
+V+DA +K N + I HSC PN E + +DG +IGI+ I GE +T
Sbjct: 176 ---------MVIDATYKGNKSRYINHSCSPNTEMQKWIIDGETRIGIFATHDIRKGEHLT 226
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCR 2037
+DY V ++ C CG+ CR
Sbjct: 227 YDYQFVQFGADQ----DCHCGASGCR 248
>gi|449505027|ref|XP_004162355.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Cucumis
sativus]
Length = 373
Score = 59.7 bits (143), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 71/146 (48%), Gaps = 21/146 (14%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+V +++ GE FV+E++GEV E+ ++ + N FY + R
Sbjct: 126 GSGIVADEDIKQGE--FVIEYVGEVIDDKTCEERLWNMKHRGETN------FYLCEINRD 177
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
+V+DA +K N + I HSC PN E + +DG +IGI+ R I GE +T
Sbjct: 178 ---------MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIPKGEHLT 228
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCR 2037
+DY V ++ C CG+ CR
Sbjct: 229 YDYQFVQFGADQD----CHCGAVDCR 250
>gi|297802948|ref|XP_002869358.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297315194|gb|EFH45617.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 497
Score = 59.7 bits (143), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 67/144 (46%), Gaps = 30/144 (20%)
Query: 1903 FGEDDFVVEFLGEVYPVWK-----WFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADG 1957
++DF+VE++GEV + W K G++ +FY +++
Sbjct: 346 INKEDFIVEYIGEVISDAQCEQRLWDMKHKGMK-----------DFYMCEIQKD------ 388
Query: 1958 YDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSV 2017
+DA K N + + HSC PNC + V+G ++G++ R I GE +T+DY V
Sbjct: 389 ---FTIDATFKGNASRFLNHSCSPNCVLEKWQVEGETRVGVFAARQIEAGEPLTYDYRFV 445
Query: 2018 TESKEEYEASVCLCGSQVCRGSYL 2041
E C CGS+ C+G YL
Sbjct: 446 QFGPEVK----CNCGSESCQG-YL 464
>gi|449442399|ref|XP_004138969.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Cucumis
sativus]
Length = 373
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 71/146 (48%), Gaps = 21/146 (14%)
Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
G G+V +++ GE FV+E++GEV E+ ++ + N FY + R
Sbjct: 126 GSGIVADEDIKQGE--FVIEYVGEVIDDKTCEERLWNMKHRGETN------FYLCEINRD 177
Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
+V+DA +K N + I HSC PN E + +DG +IGI+ R I GE +T
Sbjct: 178 ---------MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIPKGEHLT 228
Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCR 2037
+DY V ++ C CG+ CR
Sbjct: 229 YDYQFVQFGADQD----CHCGAVDCR 250
>gi|18417683|ref|NP_567859.1| histone-lysine N-methyltransferase ASHR3 [Arabidopsis thaliana]
gi|75164864|sp|Q949T8.1|ASHR3_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR3; AltName:
Full=ASH1-related protein 3; AltName: Full=Protein SET
DOMAIN GROUP 4; AltName: Full=Protein stamen loss
gi|15292921|gb|AAK92831.1| unknown protein [Arabidopsis thaliana]
gi|20465681|gb|AAM20309.1| unknown protein [Arabidopsis thaliana]
gi|56201422|dbj|BAD72877.1| stamen loss [Arabidopsis thaliana]
gi|332660421|gb|AEE85821.1| histone-lysine N-methyltransferase ASHR3 [Arabidopsis thaliana]
Length = 497
Score = 59.3 bits (142), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 67/144 (46%), Gaps = 30/144 (20%)
Query: 1903 FGEDDFVVEFLGEVYPVWK-----WFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADG 1957
++DF+VE++GEV + W K G++ +FY +++
Sbjct: 346 INKEDFIVEYIGEVISDAQCEQRLWDMKHKGMK-----------DFYMCEIQKD------ 388
Query: 1958 YDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSV 2017
+DA K N + + HSC PNC + V+G ++G++ R I GE +T+DY V
Sbjct: 389 ---FTIDATFKGNASRFLNHSCNPNCVLEKWQVEGETRVGVFAARQIEAGEPLTYDYRFV 445
Query: 2018 TESKEEYEASVCLCGSQVCRGSYL 2041
E C CGS+ C+G YL
Sbjct: 446 QFGPE----VKCNCGSENCQG-YL 464
>gi|254358922|ref|ZP_04975195.1| putative membrane protein [Burkholderia mallei 2002721280]
gi|148028049|gb|EDK86070.1| putative membrane protein [Burkholderia mallei 2002721280]
Length = 956
Score = 56.6 bits (135), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 48/202 (23%), Positives = 82/202 (40%), Gaps = 9/202 (4%)
Query: 378 SSSSRISSLDKYSSRHHEP---------SLSSRVIYDRHGRSPSHSDRSPHDRGRYYDHR 428
S++ RI+S RHH P + + R+P+ R+P+ R +
Sbjct: 3 SANGRIASFGSLRERHHAPQSRGPLAALNRRRARRSESERRTPNAERRTPNAERRTPNAE 62
Query: 429 DRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPYAREKSPYDRSRHYDHRNRSP 488
R+P+ R+P R+P R+P + R+P R+P A ++P R + R+P
Sbjct: 63 RRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTP 122
Query: 489 FSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNNHREASSKTGASEKRNARYDSKGHE 548
+ R+P R + RTPN R+P R N + G + + R ++
Sbjct: 123 NAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAGPETRLSRRGNTIDSS 182
Query: 549 DKLGPKDSNARCSRSSAKESQD 570
P R +R S + S +
Sbjct: 183 PIRTPSAPRRRLTRPSMRYSVE 204
>gi|254200343|ref|ZP_04906709.1| putative membrane protein [Burkholderia mallei FMH]
gi|147749939|gb|EDK57013.1| putative membrane protein [Burkholderia mallei FMH]
Length = 1012
Score = 56.2 bits (134), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 78/182 (42%), Gaps = 20/182 (10%)
Query: 378 SSSSRISSLDKYSSRHHEP----------------SLSSRVIYDRHGRSPSHSDRSPHDR 421
S++ RI+S RHH P S S R + R+P+ R+P+
Sbjct: 3 SANGRIASFGSLRERHHAPQSRGPLAALNRRRARRSESERRTPNAERRTPNAERRTPNAE 62
Query: 422 GRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPYAREKSPYDRSRHY 481
R + R+P+ R+P R+P R+P + R+P R+P A ++P R
Sbjct: 63 RRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTP 122
Query: 482 DHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSP-LHRSRPNNHRE---ASSKTGASEK 537
+ R+P + R+P R + RTPN R+P R PN R A +T +E+
Sbjct: 123 NAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAER 182
Query: 538 RN 539
R
Sbjct: 183 RT 184
Score = 54.3 bits (129), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 71/160 (44%), Gaps = 6/160 (3%)
Query: 409 RSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPY 468
R+P+ R+P+ R + R+P+ R+P R+P R+P + R+P R+P
Sbjct: 71 RTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPN 130
Query: 469 AREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSP-LHRSRPNNHRE 527
A ++P R + R+P + R+P R + RTPN R+P R PN R
Sbjct: 131 AERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERR 190
Query: 528 ---ASSKTGASEKR--NARYDSKGHEDKLGPKDSNARCSR 562
A +T +E+R NA + E + R SR
Sbjct: 191 TPNAERRTPNAERRTPNAERRTPNAERRTPNAGPETRLSR 230
Score = 53.9 bits (128), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/162 (24%), Positives = 69/162 (42%)
Query: 409 RSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPY 468
R+P+ R+P+ R + R+P+ R+P R+P R+P + R+P R+P
Sbjct: 99 RTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPN 158
Query: 469 AREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNNHREA 528
A ++P R + R+P + R+P R + RTPN R+P R N
Sbjct: 159 AERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERR 218
Query: 529 SSKTGASEKRNARYDSKGHEDKLGPKDSNARCSRSSAKESQD 570
+ G + + R ++ P R +R S + S +
Sbjct: 219 TPNAGPETRLSRRGNTIDSSPIRTPSAPRRRLTRPSMRYSVE 260
>gi|407262482|ref|XP_003946424.1| PREDICTED: periphilin-1-like [Mus musculus]
Length = 588
Score = 54.7 bits (130), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 60/137 (43%), Positives = 70/137 (51%), Gaps = 33/137 (24%)
Query: 391 SRHHEPSLSSRVIYDRHGRSPSHS-DRSPH-----------DRGRYYDHRDRSP--SRHD 436
S HH SS DR SP ++ DRSPH DR Y RDRSP +R
Sbjct: 232 SSHHPRDRSSHYARDR---SPHYARDRSPHYARDRPPQYARDRSSQYA-RDRSPQYARDR 287
Query: 437 RSPYTRDRSP-YTFDRSP-YSRERSP---------YNRDRSP-YAREKS---PYDRSRHY 481
S Y RDRSP Y DRSP Y+R+RSP Y RDRSP YAR++S DRS HY
Sbjct: 288 SSHYARDRSPHYARDRSPHYARDRSPQYARDRSSHYARDRSPQYARDRSSQYARDRSSHY 347
Query: 482 DHRNRSPFSAERSPQDR 498
S ++ +RSP R
Sbjct: 348 ARDRSSHYARDRSPHKR 364
>gi|21220509|ref|NP_626288.1| hypothetical protein SCO2028 [Streptomyces coelicolor A3(2)]
gi|5738516|emb|CAB52863.1| putative membrane protein [Streptomyces coelicolor A3(2)]
Length = 509
Score = 53.1 bits (126), Expect = 0.002, Method: Composition-based stats.
Identities = 38/105 (36%), Positives = 51/105 (48%)
Query: 410 SPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPYA 469
+P S+R R+ + DRSP DRSP DRS DRSP + +P + DRS A
Sbjct: 335 APPASERVRRATKRFPRNPDRSPPNPDRSPPDPDRSSPNPDRSPSDPDGTPSDPDRSSPA 394
Query: 470 REKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLER 514
++SP + H +RSP + P D R SDR P +R
Sbjct: 395 SDRSPSAPAAHAPAPDRSPPDPDGRPSDPDRSSPASDRLPPASDR 439
Score = 52.0 bits (123), Expect = 0.005, Method: Composition-based stats.
Identities = 44/130 (33%), Positives = 58/130 (44%), Gaps = 3/130 (2%)
Query: 409 RSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPY 468
RSP + DRSP D R + DRSPS D +P DRS DRSP + DRSP
Sbjct: 355 RSPPNPDRSPPDPDRSSPNPDRSPSDPDGTPSDPDRSSPASDRSPSAPAAHAPAPDRSPP 414
Query: 469 AREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNNHREA 528
+ P D R +R P +++RSP A D +P + P S P+ A
Sbjct: 415 DPDGRPSDPDRSSPASDRLPPASDRSPSAPAAHAPAPDWSPPDPDGRP---SDPDRSSPA 471
Query: 529 SSKTGASEKR 538
S + + R
Sbjct: 472 SDRLPPAPDR 481
>gi|148708748|gb|EDL40695.1| mCG51743 [Mus musculus]
Length = 527
Score = 53.1 bits (126), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 60/137 (43%), Positives = 70/137 (51%), Gaps = 33/137 (24%)
Query: 391 SRHHEPSLSSRVIYDRHGRSPSHS-DRSPH-----------DRGRYYDHRDRSP--SRHD 436
S HH SS DR SP ++ DRSPH DR Y RDRSP +R
Sbjct: 171 SSHHPRDRSSHYARDR---SPHYARDRSPHYARDRPPQYARDRSSQYA-RDRSPQYARDR 226
Query: 437 RSPYTRDRSP-YTFDRSP-YSRERSP---------YNRDRSP-YAREKS---PYDRSRHY 481
S Y RDRSP Y DRSP Y+R+RSP Y RDRSP YAR++S DRS HY
Sbjct: 227 SSHYARDRSPHYARDRSPHYARDRSPQYARDRSSHYARDRSPQYARDRSSQYARDRSSHY 286
Query: 482 DHRNRSPFSAERSPQDR 498
S ++ +RSP R
Sbjct: 287 ARDRSSHYARDRSPHKR 303
>gi|18605943|gb|AAH22960.1| BC022960 protein, partial [Mus musculus]
Length = 368
Score = 51.6 bits (122), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 54/115 (46%), Positives = 68/115 (59%), Gaps = 20/115 (17%)
Query: 428 RDRSP--SRHDRSPYTRDRSP-YTFDRSP-YSRERSP-YNRDRSP-YAREKSPY---DRS 478
RDRS +R S Y RDRS Y DRSP Y+R+RS Y RDRSP YAR++SP+ DRS
Sbjct: 33 RDRSSQYARDRSSQYARDRSSQYARDRSPQYARDRSSHYARDRSPHYARDRSPHYARDRS 92
Query: 479 RHYDHRNRSPFSAERSPQ---DRARFH--DRS-----DRTPNYL-ERSPLHRSRP 522
HY S ++ +RSPQ DR+ + DRS DR+ +Y +RSP R P
Sbjct: 93 PHYARDRSSHYARDRSPQYARDRSSQYARDRSSHYARDRSSHYARDRSPHKRDAP 147
>gi|340382809|ref|XP_003389910.1| PREDICTED: hypothetical protein LOC100636721 [Amphimedon
queenslandica]
Length = 841
Score = 48.5 bits (114), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 30/51 (58%)
Query: 427 HRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPYAREKSPYDR 477
+RD P D PY RD PY D PY+R+ PYNRD PY R+ PY+R
Sbjct: 439 YRDDRPYNRDDRPYNRDNRPYNRDDRPYNRDDRPYNRDDRPYNRDDRPYNR 489
Score = 44.3 bits (103), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 34/65 (52%)
Query: 404 YDRHGRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNR 463
Y+R G + D P++R +RD P D PY RD PY D PY+R+ PYNR
Sbjct: 430 YNRDGEDRPYRDDRPYNRDDRPYNRDNRPYNRDDRPYNRDDRPYNRDDRPYNRDDRPYNR 489
Query: 464 DRSPY 468
D PY
Sbjct: 490 DDRPY 494
>gi|407264369|ref|XP_003945664.1| PREDICTED: periphilin-1-like [Mus musculus]
Length = 668
Score = 48.1 bits (113), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 88/267 (32%), Positives = 117/267 (43%), Gaps = 45/267 (16%)
Query: 300 EGLYKGEHNNGKNHGREYFHGNRFKRHGTDSDSGDRKYYGDYG-----DFAGLKSRRLSD 354
+G Y + + G+ +F R R S S Y G F+ R S
Sbjct: 182 DGYYSHDAFRVCDEGQSFFRDQRRSRRNYHSASWQPNYRNRRGGLRRKTFSSHHPRDRSS 241
Query: 355 DY-NSRSVH-----SEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPSLS-SRVIYDRH 407
Y RS H S HY+R ++ R+ SS +R P + R +
Sbjct: 242 HYARDRSPHYARDRSPHYARDRPPQYARDRSSQYARDRSPQYARDRSPQYARDRSPHYAR 301
Query: 408 GRSPSHS-DRSPH---DRGRYYDH-------RDRSP--SRHDRSPYTRDRSP-YTFDRSP 453
RSP ++ DRS DR Y RDRS +R S Y RDRS Y DRSP
Sbjct: 302 DRSPQYARDRSSQYARDRSSQYARDRSSQYARDRSSQYARDRSSQYARDRSSQYARDRSP 361
Query: 454 -YSRERSP-YNRDRSP-YAREKSPY---DRSRHYDHRNRSPFSAERSPQ----------- 496
Y+R+RS Y RDRSP YAR++SP+ DRS HY S ++ +RSPQ
Sbjct: 362 QYARDRSSHYARDRSPHYARDRSPHYARDRSPHYARDRSSHYARDRSPQYARDRSSQYAR 421
Query: 497 DRARFHDRSDRTPNYL-ERSPLHRSRP 522
DR+ + R DR+ +Y +RSP R P
Sbjct: 422 DRSSHYAR-DRSSHYARDRSPHKRDAP 447
>gi|357622727|gb|EHJ74138.1| hypothetical protein KGM_12956 [Danaus plexippus]
Length = 1922
Score = 48.1 bits (113), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 64/231 (27%), Positives = 97/231 (41%), Gaps = 32/231 (13%)
Query: 413 HSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRS-PYSRERSPYNRDRSPYARE 471
H D S +DR R + RSP R+P + PY D+ Y + SPY++ S Y R
Sbjct: 418 HRDAS-YDRSRGGSYEPRSP----RAPSYERKPPY--DKGGAYEKRLSPYDKRSSSYERR 470
Query: 472 KSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSD-----RTPNYLERSPLHRSRPNNHR 526
+ YD+ YD R SP+S R +R R D RTP P+ RP + R
Sbjct: 471 AASYDKQTPYDRRRHSPYSRMRGSSYGSRSPSRDDPRKRPRTP------PVETRRPLSPR 524
Query: 527 EASSKTGASEKRN---ARYDSKGHEDKLGPK------DSNARCSRSSAKESQDKSNVQDL 577
E + + + R+ A YD K P+ + S S + D + V+
Sbjct: 525 EGETTSPMNSVRSEEGAEYDRGDRSGKQIPRIDFYHQSYRHKSSIRSPSQEVDNNYVELQ 584
Query: 578 NVSDEKTANCESHKEEQP-QSSSVDCKEPPQVDGPPLEELVSMEEDMDICD 627
+ S ++ +P +S + + E +D P E ++S D DICD
Sbjct: 585 HSSLVTVPIVDTTVAPKPIESPNRNPDEEKSMDAEPFEPILS---DEDICD 632
>gi|294678028|ref|YP_003578643.1| ribosomal large subunit pseudouridine synthase B [Rhodobacter
capsulatus SB 1003]
gi|294476848|gb|ADE86236.1| ribosomal large subunit pseudouridine synthase B [Rhodobacter
capsulatus SB 1003]
Length = 594
Score = 46.2 bits (108), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 61/221 (27%), Positives = 84/221 (38%), Gaps = 29/221 (13%)
Query: 331 DSGDRKYYGDYGDFAGLKSRRLSDD---YNSRSVHSEHYSRHSVEKFHRNSSSSRISSLD 387
+ GDRK Y +RR D Y R + YSR E R + R
Sbjct: 337 EDGDRKPYAPRDGEKKPYARREDGDRKPYAPRDGEKKPYSRR--EDGDRKPYAPRDGERK 394
Query: 388 KYSSRHHEPSLSSRVIYDRHGRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTR----D 443
Y+ R DR +P ++ P+ R D + +P +R PY R D
Sbjct: 395 PYARREDG---------DRKPYAPRDGEKKPYARREDGDRKPYAPRDGERKPYARREDGD 445
Query: 444 RSPYTFDRSPYSRERSPYNR----DRSPYAR---EKSPYDRSRHYDHRNRSPFSAERSPQ 496
R PY +P E+ PY R DR PYA EK PY R D + +P E+ P
Sbjct: 446 RKPY----APRDGEKKPYARREDGDRKPYAPRDGEKKPYARREDGDRKPYAPRDGEKKPY 501
Query: 497 DRARFHDRSDRTPNYLERSPLHRSRPNNHREASSKTGASEK 537
R DR P E+ P R + + + + G + K
Sbjct: 502 ARREDGDRKPYAPRDGEKKPYARREDGDRKPYAPRDGEAGK 542
Score = 44.3 bits (103), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 80/213 (37%), Gaps = 26/213 (12%)
Query: 356 YNSRSVHSEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRVIYDRHGRSPSHSD 415
Y R + Y+R E R + R YS R DR +P +
Sbjct: 344 YAPRDGEKKPYARR--EDGDRKPYAPRDGEKKPYSRREDG---------DRKPYAPRDGE 392
Query: 416 RSPHDRGRYYDHRDRSPSRHDRSPYTR----DRSPYT---FDRSPYSRERSPYNRDRSPY 468
R P+ R D + +P ++ PY R DR PY +R PY+R + DR PY
Sbjct: 393 RKPYARREDGDRKPYAPRDGEKKPYARREDGDRKPYAPRDGERKPYARRE---DGDRKPY 449
Query: 469 AR---EKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNNH 525
A EK PY R D + +P E+ P R DR P E+ P R +
Sbjct: 450 APRDGEKKPYARREDGDRKPYAPRDGEKKPYARREDGDRKPYAPRDGEKKPYARREDGDR 509
Query: 526 REASSKTGASEKRNARYDSKGHEDKLGPKDSNA 558
+ + + G + R D G P+D A
Sbjct: 510 KPYAPRDGEKKPYARRED--GDRKPYAPRDGEA 540
Score = 43.9 bits (102), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 80/206 (38%), Gaps = 30/206 (14%)
Query: 338 YGDYGDFAGLKSRRLSDDYNSRSVHSEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPS 397
+G DFAG + R + +S H E R F R R Y+ R
Sbjct: 290 FGRQRDFAGAEGDRKPKSFGMKS-HREEGERKP---FARREDGERKP----YARREDG-- 339
Query: 398 LSSRVIYDRHGRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTR----DRSPYT---FD 450
DR +P ++ P+ R D + +P ++ PY+R DR PY +
Sbjct: 340 -------DRKPYAPRDGEKKPYARREDGDRKPYAPRDGEKKPYSRREDGDRKPYAPRDGE 392
Query: 451 RSPYSRERSPYNRDRSPYAR---EKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDR 507
R PY+R + DR PYA EK PY R D + +P ER P R DR
Sbjct: 393 RKPYARRE---DGDRKPYAPRDGEKKPYARREDGDRKPYAPRDGERKPYARREDGDRKPY 449
Query: 508 TPNYLERSPLHRSRPNNHREASSKTG 533
P E+ P R + + + + G
Sbjct: 450 APRDGEKKPYARREDGDRKPYAPRDG 475
>gi|395732000|ref|XP_003775997.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein C2orf16
homolog [Pongo abelii]
Length = 1984
Score = 45.8 bits (107), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 70/184 (38%), Positives = 86/184 (46%), Gaps = 28/184 (15%)
Query: 363 SEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPS-LSSRVIYDRHGRSPSHSDR-SPHD 420
SE R E+ HR S S RH PS S R +R RSPS R SP +
Sbjct: 1687 SERSHRSPSERRHRRPSER--SHRSPSERRHRRPSERSHRSPSERRHRSPSQRSRPSPSE 1744
Query: 421 RGRYYDHRDRSPS-RHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPYAREKSPYDRSR 479
R R RSPS R RSP R R P +R R RSP R + R +SP +RS
Sbjct: 1745 R------RHRSPSERRHRSPSQRSR-PSPSER----RHRSPSQRSQ---RRHRSPSERSH 1790
Query: 480 HYDHRNRSPFSAE---RSPQDRAR--FHDRSDRTPNYLERSPLHRSRPNNHREASSKT-G 533
H R S+E RSP +R+R +RS R+P+ RS HRS +HR S ++
Sbjct: 1791 HSPSERRHLSSSERRHRSPLERSRHSLSERSHRSPSE-RRS--HRSFERSHRRISERSHS 1847
Query: 534 ASEK 537
SEK
Sbjct: 1848 PSEK 1851
>gi|356515246|ref|XP_003526312.1| PREDICTED: probable histone-lysine N-methyltransferase ATXR3-like
[Glycine max]
Length = 2325
Score = 41.6 bits (96), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 23/36 (63%), Gaps = 3/36 (8%)
Query: 1 MGDGGVACMPLQQQQQHNSIMERFPISDKTTICVGN 36
MGDGGVAC+PLQQQQ ++ER P + GN
Sbjct: 1 MGDGGVACIPLQQQQH---VIERLPNAAAEKALSGN 33
>gi|390344371|ref|XP_798120.3| PREDICTED: uncharacterized protein LOC593558 [Strongylocentrotus
purpuratus]
Length = 1785
Score = 41.2 bits (95), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 41/99 (41%)
Query: 421 RGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPYAREKSPYDRSRH 480
RGR DR P +H R P R P ++R P+ RSP RSP +SP + SR
Sbjct: 560 RGREPPMHDRVPPQHGRMPPQHGRVPSDYERVPHGHVRSPSEYSRSPSEYSRSPSEYSRS 619
Query: 481 YDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHR 519
P + Q R + R P+ ++P R
Sbjct: 620 PSEHRGPPGRGPQPQQQYGRVPSQHGRAPHEAGKAPHGR 658
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.314 0.131 0.389
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 40,404,072,461
Number of Sequences: 23463169
Number of extensions: 1839635905
Number of successful extensions: 7974976
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 10864
Number of HSP's successfully gapped in prelim test: 20165
Number of HSP's that attempted gapping in prelim test: 6426794
Number of HSP's gapped (non-prelim): 743596
length of query: 2445
length of database: 8,064,228,071
effective HSP length: 160
effective length of query: 2285
effective length of database: 8,605,088,327
effective search space: 19662626827195
effective search space used: 19662626827195
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 86 (37.7 bits)