BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 000067
         (2445 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224132582|ref|XP_002327831.1| SET domain protein [Populus trichocarpa]
 gi|222837240|gb|EEE75619.1| SET domain protein [Populus trichocarpa]
          Length = 2476

 Score = 3212 bits (8327), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1684/2558 (65%), Positives = 1958/2558 (76%), Gaps = 195/2558 (7%)

Query: 1    MGDGGVACMPLQQQQQHNSIMERFPISDK------------TTICVGNSSNNSNKT---- 44
            MG GGVACMPLQ    +  + ERFP+ ++            TT C G  + NSN      
Sbjct: 1    MGSGGVACMPLQHGSNNIIMEERFPVQEQPTAAAAAMTTTATTACGGGKTVNSNSNISSA 60

Query: 45   -----NNNSISNNNDNKTNNDSSNN----------------------------------- 64
                 NN S  +  DN   N SSN                                    
Sbjct: 61   DNDNNNNGSSGDKKDNGKVNASSNGVTGKLKRVKRIIKVKKVVRRVVLGEKKGVGLDKAV 120

Query: 65   NGSSSSKNNETNKSNVKKNGVSTKTVRKKIVKIKKVIAVKKKEVQKN-SGSSKSNNNGEN 123
             G+  S + E      K++G+ T+   K++   KK   +KK++  K  +   K +    +
Sbjct: 121  KGAGGSGSKEVAVLEKKESGLKTEEKSKEVAAEKKESGLKKEDKSKEVAAEKKESGLKSS 180

Query: 124  IDNKNVENGGAVGE---VVTVDKENLKNEEVEEGELGTLKW------ENGEFVQPEKSQP 174
              +K VENG  +G     V     N+K EEVEEGELGTL+W      ENGEFV P   +P
Sbjct: 181  SGSKTVENGDGLGSGDSKVQSGSNNIK-EEVEEGELGTLRWPSKGEIENGEFV-PTPEKP 238

Query: 175  QSQLQSQSKQIEKGEIIVFSSKCRRGETEKGE---SGLWRGN---KDDIEKGEFIPDRWH 228
            +        +IE+GEI   S K ++G+ EKGE      WR     +D+IEKGEFIPDRW+
Sbjct: 239  RRS------EIERGEI--GSGKWKKGDIEKGEIVSGNKWRKGEAVRDEIEKGEFIPDRWN 290

Query: 229  KEVVKDEYGYSKSR-RYDYKLERTPPSGKYSGEDVYRRKEFDRSGSQHSKSSSRWESGQE 287
               +KDEYGY+KSR R+D   ERTPPSGKYS EDVYRRKE  RSG        RWESGQE
Sbjct: 291  ---IKDEYGYNKSRGRHDMSSERTPPSGKYSSEDVYRRKELSRSGGM------RWESGQE 341

Query: 288  RNVRISSKIVDDEGLYKGEHNNGKNHGREYFHGNRFKRHGTDSDSGDRKYYGDYGDFAGL 347
            R+ RISSKIVD+EG YK E++NGK+H RE+  GNR KRH TDSD+ +RKYYGDY   A  
Sbjct: 342  RSTRISSKIVDEEGSYKSEYSNGKSHEREHASGNRLKRHVTDSDNTERKYYGDY---AIS 398

Query: 348  KSRRLSDDYNSRSVHSEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRVIYDRH 407
            KSRRLS+D  SR  +SEHYSRHSVE+F+++SS SR+SS DKYSSRHHEP+LSS+V+YDRH
Sbjct: 399  KSRRLSED-GSRYAYSEHYSRHSVERFYKSSSYSRVSSSDKYSSRHHEPTLSSKVVYDRH 457

Query: 408  GRSPSHSDRSPHDRGRYYDHRDRSPSR--------------HDRSPYTRDRSPYT----- 448
                SHSDRSPHDR RYYDHRDRSP R              H+RSPY R+RSPY      
Sbjct: 458  ----SHSDRSPHDRPRYYDHRDRSPIRYEKSPYGREKTPFGHERSPYGRERSPYGRERSP 513

Query: 449  ---------FDRSPYSRERSPYNRDRSPYAREKSPYDRSRHYDHRNRSPFSAERSPQDRA 499
                      DRSPY RE+SPY R+RSPY  EKSPYDRS + +HR RSP   ERSPQDR 
Sbjct: 514  YWRDRSPDGHDRSPYGREKSPYGRERSPYVLEKSPYDRSSYNEHRKRSPAYFERSPQDRT 573

Query: 500  RFHDRSDRTPNYLERSPLHRSRPNNHREASSKTGASEKRNARYDSKGHEDKLGPKDSNAR 559
            R HDRSDRTP+YLERSP  R+RP NHREAS K  A EKR+++Y +K  +DK+  KD   +
Sbjct: 574  RHHDRSDRTPSYLERSPHDRARPTNHREASRKGAAHEKRSSQYGNKKQDDKISQKDPAVK 633

Query: 560  CSRSSAKESQDKSNVQDLNVSDEKTANCESHKEEQPQSSSVDCKEPPQVDGPPLEELVSM 619
             +  SAKESQDKS+V +L+  DEK  + E+  EE+ +S  ++ KE P+VDGPP EEL SM
Sbjct: 634  DTELSAKESQDKSSVHNLDGLDEKNTSSETRLEEKSESPVINAKESPKVDGPPPEELQSM 693

Query: 620  EEDMDICDTPPHVPAVTDSSVGKWFYLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKHL 679
            EEDMDICDTPPHVP V D+S G+WFYLDH G+ECGPS+LC+LK LV+EG+L+SDHFIKHL
Sbjct: 694  EEDMDICDTPPHVPVVADTSTGRWFYLDHFGVECGPSKLCELKALVDEGILMSDHFIKHL 753

Query: 680  DSNRWETVENAVSPLVTVNFPSITSDSVTQLVSPPEASGNLLADTGDTAQS---TGEEFP 736
            DS+RW T+ENAVSPLVTVNFPS+  D +TQLVSPPEA GNLLADTGD  QS    GE  P
Sbjct: 754  DSDRWLTIENAVSPLVTVNFPSVVPDVITQLVSPPEAPGNLLADTGDIVQSCSQIGEGVP 813

Query: 737  VTL-QSQCCPDGSAAAAESSEDLHIDVRVGALLDGFTVIPGKEIETLGEILQTTFERVDW 795
              L Q   CP+ SA A+E  EDL ID RVGALL+GF+V+PG EIET+G            
Sbjct: 814  GNLLQPLVCPNHSAVASEPLEDLQIDERVGALLEGFSVVPGSEIETVG------------ 861

Query: 796  QNNGGPTWHGACVGEQKPGDQKVDELY-ISDTKMKEAAELKSG---DKDH-WVVCFDSDE 850
                G  W+ A   EQ+  DQ  +EL   SD   KEA E   G   DKD  +    DS +
Sbjct: 862  ----GFAWYLASTAEQQ--DQNSNELLGHSDLITKEAVEAWPGSLADKDDGFASSVDSAD 915

Query: 851  WFSGRWSCKGGDWKRNDEAAQDRCSRKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYPS 910
            WFSGRWSCKGGDWKRNDE+ QDR +R+K VLNDGFPLC M KSG EDPRW +KDDLY+PS
Sbjct: 916  WFSGRWSCKGGDWKRNDESVQDRFTRRKVVLNDGFPLCHMTKSGCEDPRWQRKDDLYFPS 975

Query: 911  HSRRLDLPPWAYACPDERNDGSGGSRSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSFV 970
             SR+LDLPPWA++  DERND  G S+ST +K    RGVKGT+LPVVRINACVV DH   V
Sbjct: 976  QSRKLDLPPWAFSSTDERNDTGGVSKSTLNKPPITRGVKGTVLPVVRINACVVQDH---V 1032

Query: 971  SEPRSKVRAKERHSSRSARSYSSANDVRRSSAESDSHSKARNNQDSQGSWKSIACINTPK 1030
            SE R+KVR K+R+ SR+AR++S+ NDV+RSS ESDS SK  N+ DS G WKS A +NTPK
Sbjct: 1033 SETRTKVRGKDRYHSRAARTHSATNDVKRSSVESDSQSKVVNDPDSHGCWKSTAPLNTPK 1092

Query: 1031 DRLCTVDDLQLQLGEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPL 1090
            D LCT DDLQL LGEWYYLDGAGHE+GPSSFSELQ L D G IQK++SVFRKFD+VWVP+
Sbjct: 1093 DCLCTADDLQLNLGEWYYLDGAGHEQGPSSFSELQNLADIGTIQKYSSVFRKFDRVWVPI 1152

Query: 1091 TFATETSASTVRNHGEKIMPSGDSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYT 1150
            T ATET  ++V+     + P   SSG   T S+        ++ +S++FH++HPQFIG+T
Sbjct: 1153 TSATETFGASVKIQQSNVEPVIGSSG---TLSKSQTASNVESDRSSSSFHSLHPQFIGFT 1209

Query: 1151 RGKLHELVMKSYKNREFAAAINEVLDPWINAKQPKKETE-HVYRKS--EGDTRAGKRARL 1207
            RGKLHELVMKSYKNREFAAAINE LDPWI AK+P KE + H+Y KS  E D RAGKRAR+
Sbjct: 1210 RGKLHELVMKSYKNREFAAAINEALDPWIVAKRPPKEIDKHMYLKSGMEIDARAGKRARM 1269

Query: 1208 LVRESDGDEETEEELQTIQDESTFEDLCGDASFPGEESASSAIESGGWGLLDGHTLAHVF 1267
               ++D D E EE     +DE+TFE LCGD +F  EES  S IE+G WGLLDGH LA VF
Sbjct: 1270 QPAQNDEDYEMEEGTLH-KDETTFEQLCGDTNFHREESMCSEIEAGSWGLLDGHMLARVF 1328

Query: 1268 HFLRSDMKSLAFASLTCRHWRAAVRFYKGISRQVDLSSVGPNCTDSLIRKTLNAFDKEKL 1327
            HFLRSDMKSL FASLTC+ WR AV FYKGIS QVDLSS  PNCTD ++R  +N ++KEK+
Sbjct: 1329 HFLRSDMKSLVFASLTCKKWRCAVSFYKGISIQVDLSSGAPNCTDIMVRSIMNGYNKEKI 1388

Query: 1328 NSILLVGCTNITSGMLEEILQSFPHLSSIDIRGCGQFGELALKFPNINWVKSQKSRGAKF 1387
            N+++L GC NITSGMLEEIL+SFP LSSIDIRGC QF ELAL+FPNI+W+KS+     + 
Sbjct: 1389 NAMVLAGCKNITSGMLEEILRSFPCLSSIDIRGCTQFMELALRFPNISWLKSRTRISVES 1448

Query: 1388 NDSRSKIRSLKQITEKSSSAPKSKGLGDDMDDFGDLKDYFESVDKRDSANQSFRRSLYQR 1447
            N   SK+RSLKQI+E+              DDFG+LK+YF+SV+KRDSANQ FRRSLY+R
Sbjct: 1449 N---SKLRSLKQISER--------------DDFGELKEYFDSVNKRDSANQLFRRSLYKR 1491

Query: 1448 SKVFDARKSSSILSRDARMRRWSIKKSENGYKRMEEFLASSLKEIMRVNTFEFFVPKVAE 1507
            SKVFDARKSSSIL RDARMRRW++KKSEN Y+RME FLAS LK+IM+ NTF+FFVPK+ E
Sbjct: 1492 SKVFDARKSSSILPRDARMRRWAVKKSENSYRRMEGFLASGLKDIMKENTFDFFVPKLTE 1551

Query: 1508 IEGRMKKGYYISHGLGSVKDDISRMCRDAIKAKNRGSAGDMNRITTLFIQLATRLEQGAK 1567
            IE RMK GYY+ HGL +VK+DISRMCRDAIK KNRG AGDMN I TLF+QLA+RLE+ +K
Sbjct: 1552 IEDRMKSGYYVGHGLRAVKEDISRMCRDAIKVKNRG-AGDMNHIITLFLQLASRLEESSK 1610

Query: 1568 SSYYEREEMMKSWKDESPAGLYSATSKYKKKLSKMVSERKYMNRSNGTSLANGDFDYGEY 1627
             SY ER+E+MKSWKD+    L SA  K+KKK        KYMNRSNGT LANG FD+GEY
Sbjct: 1611 FSY-ERDELMKSWKDDVSTALDSAPIKHKKKAIDK----KYMNRSNGTILANGSFDFGEY 1665

Query: 1628 ASDREIRKRLSKLNRKSLDSGSETSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARE 1687
            ASD+EI+KR+SKLNRKS+DSGSETSDD   SSEDG+S   ST SDT+SD+DFRS+GR  +
Sbjct: 1666 ASDQEIKKRISKLNRKSMDSGSETSDDR--SSEDGRSGGGSTASDTESDLDFRSEGRPGD 1723

Query: 1688 SRGAGDFTTDEGLDFSDDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVS 1747
            SRG   F TDE     D+REWGARMT ASLVPPVTRKYEVIDQYVIVADEEDV+RKM VS
Sbjct: 1724 SRGDEYFMTDE-----DEREWGARMTNASLVPPVTRKYEVIDQYVIVADEEDVQRKMSVS 1778

Query: 1748 LPEDYAEKLNAQKNGSEELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMP 1807
            LP+DYAEKL+AQKNG+EELDMELPEVKDYKPRKQLGD+V EQEVYGIDPYTHNLLLDSMP
Sbjct: 1779 LPDDYAEKLDAQKNGTEELDMELPEVKDYKPRKQLGDEVIEQEVYGIDPYTHNLLLDSMP 1838

Query: 1808 DELDWNLLEKHLFIEDVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRT 1867
            +E+DW L +KH+FIEDVLL TLNKQVRH+TG GNTPM YPLQPV+EE+E+ A++DCD RT
Sbjct: 1839 EEVDWPLSQKHMFIEDVLLCTLNKQVRHYTGAGNTPMTYPLQPVVEELEQAAMEDCDTRT 1898

Query: 1868 MKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQD 1927
            MK+CRGIL+A+DSRPDDKYVAYRKGLGVVCNKE GF +DDFVVEFLGEVYP WKWFEKQD
Sbjct: 1899 MKICRGILRAIDSRPDDKYVAYRKGLGVVCNKEAGFRDDDFVVEFLGEVYPAWKWFEKQD 1958

Query: 1928 GIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKV 1987
            GIR LQK++++PAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSC+PNCEAKV
Sbjct: 1959 GIRLLQKDSKEPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCKPNCEAKV 2018

Query: 1988 TAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEG 2047
            TAV G YQIGIY+VR I +GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEG
Sbjct: 2019 TAVGGQYQIGIYSVRKIQHGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEG 2078

Query: 2048 AFEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARL 2107
            AF+KVLKE HGLLDRH LML ACELNSVSEEDYL+LGRAGLGSCLLGGLP+WVVAYSARL
Sbjct: 2079 AFQKVLKECHGLLDRHYLMLGACELNSVSEEDYLDLGRAGLGSCLLGGLPDWVVAYSARL 2138

Query: 2108 VRFINLERTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDK 2167
            VRFINLERTKLPEEILRHNLEEK+KYF+DIC+EVE+SDAEVQAEGVYNQRLQNLAVTLDK
Sbjct: 2139 VRFINLERTKLPEEILRHNLEEKKKYFADICIEVERSDAEVQAEGVYNQRLQNLAVTLDK 2198

Query: 2168 VRYVMRCVFGDPKKAPPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKS 2227
            VRYVMRC+FGDPK APPP+E+L+PEETVSFLWK EGSLVEEL+QCM+PH++ ++LNDLKS
Sbjct: 2199 VRYVMRCIFGDPKLAPPPLEKLTPEETVSFLWKEEGSLVEELLQCMSPHMDGEMLNDLKS 2258

Query: 2228 KIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQE 2287
            KI AHDPS S+DI + ++KSLLWLRDEVR+LPCTYKCRHDAAADLIH+YAYTK FFRV+E
Sbjct: 2259 KIYAHDPSDSDDIPKAIQKSLLWLRDEVRSLPCTYKCRHDAAADLIHVYAYTKSFFRVRE 2318

Query: 2288 YKAFTSPPVYISPLDLGPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLA 2347
            Y AFTSPPVYISPLDLGPK ADKLG     Y+KTYGENYC+GQLIFWHIQTN +PD TLA
Sbjct: 2319 YDAFTSPPVYISPLDLGPKCADKLGGLPHKYQKTYGENYCMGQLIFWHIQTNTEPDSTLA 2378

Query: 2348 RASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSP 2407
            +AS+GCLSLPDIGSFY+KVQKPS+ R+YGPKTV+ ML RMEK PQ+PWPKD+IW+FKSSP
Sbjct: 2379 KASKGCLSLPDIGSFYSKVQKPSQQRIYGPKTVKMMLGRMEKYPQKPWPKDQIWSFKSSP 2438

Query: 2408 RIFGSPMLDSSLTGCPLDREMVHWLKHRPAIFQAMWDR 2445
            ++FGSPMLD+ L   PLDREMVHWLKHRP ++QAMWDR
Sbjct: 2439 KVFGSPMLDAVLNKSPLDREMVHWLKHRPTVYQAMWDR 2476


>gi|255549293|ref|XP_002515700.1| huntingtin interacting protein, putative [Ricinus communis]
 gi|223545137|gb|EEF46647.1| huntingtin interacting protein, putative [Ricinus communis]
          Length = 2430

 Score = 3181 bits (8248), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1628/2346 (69%), Positives = 1886/2346 (80%), Gaps = 124/2346 (5%)

Query: 144  ENLKNEEVEEGELGTLKW-------ENGEFVQPEKSQPQSQLQSQSKQIEKGEIIVFSSK 196
            +N   EEVEEGELGTLKW       ENGEFV PEK+       ++  +I+KGEI++ + K
Sbjct: 165  QNNNKEEVEEGELGTLKWPPKAAEVENGEFVPPEKT-------TRRTEIDKGEIVI-ADK 216

Query: 197  CRRGETEKGE----SGLWRG---NKDDIEKGEFIPDRWHKEVVKDEYGYSKSR-RYDYKL 248
             R+ + EKGE    SG WR    ++D+IEKGEFIPDRWH    K+E GY+KSR +YD   
Sbjct: 217  WRKRDIEKGEGTAVSGRWRKGDFSRDEIEKGEFIPDRWHN---KEELGYNKSRTKYDISR 273

Query: 249  ERTPPSGKYSGEDVYRRKEFDRSGSQHSKSSS-RWESGQERNVRISSKIVDDEGLYKGEH 307
            ERTPPSGKYS ED+YRRKEF RSGS     SS RWESG ERN+RISSKI+D+E +YK E+
Sbjct: 274  ERTPPSGKYSNEDIYRRKEFSRSGSSQHSKSSSRWESGLERNIRISSKILDEESMYKSEY 333

Query: 308  NNGKNHGREYFHGNRFKRHGTDSDSGDRKYYGDYGDFAGLKSRRLSDDYNSRSVHSEHYS 367
            +NGKNHGR+Y  GNR KR+G DSDS +RK+YGDYGD+A  KSRRLS+D  +R +HSEHYS
Sbjct: 334  SNGKNHGRDYTSGNRLKRYGADSDSSERKHYGDYGDYACSKSRRLSED-TARPIHSEHYS 392

Query: 368  RHSVEKFHRNSSSSRISS--LDKYSSRHHEPSLSSRVIYDRHGRSPSHSDRSPHDRGRYY 425
            RHSVE+F+RNSS++      LDKYSSRHHEP+LSS+V+YDRH RSP HS+RSP DR R+Y
Sbjct: 393  RHSVERFYRNSSTTSSRISSLDKYSSRHHEPTLSSKVVYDRHERSPGHSERSPRDRARHY 452

Query: 426  DHRDRSPSRHDRSPY--------------TRDRSPYTFDRSPYSRERSPYNRDRSPYARE 471
            DHRDRSP R +RSPY               R+RSPY  +RSPY  ERSPY R+RSPYAR+
Sbjct: 453  DHRDRSPVRRERSPYRLERSPFGRERSPYVRERSPYVRERSPYVHERSPYVRERSPYARD 512

Query: 472  KSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNNHREASSK 531
            KSPYDRSRHYD+R RSP  +ERS QDR  +HDR DRTPN+LERSPL R RPNNHREAS K
Sbjct: 513  KSPYDRSRHYDYR-RSPAHSERSSQDR--YHDRRDRTPNFLERSPLDRGRPNNHREASRK 569

Query: 532  TGASEKRNARYDSKGHEDKLGPKDSNARCSRSSAKESQDKSNVQDLNVSDEKTANCESHK 591
             G SEKRN++  +KG EDKL  KD + R S+   KESQD+++V ++   +EK A+ +S K
Sbjct: 570  GGVSEKRNSQNANKGKEDKLNQKDCSERDSQFIVKESQDRNDVHNITGLEEKNASSDSLK 629

Query: 592  EEQPQSSSVDCKEPPQVDGPPLEELVSMEEDMDICDTPPHVPAVTDSSVGKWFYLDHCGM 651
            E Q QS  +D KE   VDGPP EEL+SMEEDMDICDTPPHVPAVTDSS GKWFYLD+ G+
Sbjct: 630  EAQTQSPVMDVKESLPVDGPPPEELLSMEEDMDICDTPPHVPAVTDSSTGKWFYLDYFGL 689

Query: 652  ECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSITSDSVTQLV 711
            ECGPS+LCDLK LV+ GVLV+DH +KHLDS+RW T+ENAVSPLV  NFPSI SD+VT+LV
Sbjct: 690  ECGPSKLCDLKALVDGGVLVADHLVKHLDSDRWVTIENAVSPLVASNFPSIVSDTVTRLV 749

Query: 712  SPPEASGNLLADTGDTAQS---TGEEFPVTL-QSQCCPDGSAAAAESSEDLHIDVRVGAL 767
            SPPEA GNLLADTGD  QS    GEE  + L Q   C + +AA +E  EDLHID RVGAL
Sbjct: 750  SPPEAPGNLLADTGDMGQSGYKNGEEASMALPQPLGCLNDNAALSEPLEDLHIDQRVGAL 809

Query: 768  LDGFTVIPGKEIETLGEILQTTFERVDWQNNGGPTWHGACVGEQKPGDQKVDELYISDTK 827
            L+G+T++PG+E+ET+GE+L TTFE V W+  G          E++ G    +    SD K
Sbjct: 810  LEGYTIVPGRELETIGEVLLTTFELVPWERCGQ--------SEEQFGQSNDEPSRYSDLK 861

Query: 828  MKEAAELKS---GDKDHWVVCF-DSDEWFSGRWSCKGGDWKRNDEAAQDRCSRKKQVLND 883
              +A E+ S    D+D    CF DS +WFSGRWSCKGGDWKRNDE  QDR SR+K VL+D
Sbjct: 862  PNDAVEVSSSATSDRDQSCACFADSADWFSGRWSCKGGDWKRNDENVQDRFSRRKFVLSD 921

Query: 884  GFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYACPDERNDGSGGSRSTQSKLA 943
            G+PLCQMPKSG EDPRW++KDDLYYPS SRRLDLPPWA++C DERN+    SR+T +K +
Sbjct: 922  GYPLCQMPKSGTEDPRWHRKDDLYYPSQSRRLDLPPWAFSCTDERNECGSASRTTLAKPS 981

Query: 944  AVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSSRSARSYSSANDVRRSSAE 1003
             VRGVKGTMLPVVRINACVV DHGSFVSEPR KVR KER+ SRS+R YS+ANDV+R +AE
Sbjct: 982  VVRGVKGTMLPVVRINACVVKDHGSFVSEPRIKVRGKERYPSRSSRMYSAANDVKRLTAE 1041

Query: 1004 SDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGEWYYLDGAGHERGPSSFSE 1063
             DS SK   +QDS  SWKSI+ +NTPKDRLCTVDDLQL LGEWYYLDG+GHE+GPSSFSE
Sbjct: 1042 GDSQSKI--DQDSHSSWKSISFVNTPKDRLCTVDDLQLHLGEWYYLDGSGHEQGPSSFSE 1099

Query: 1064 LQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRNHGEKIMPSGDSSGLPPTQSQ 1123
            LQVL  QG I+K +SVFRKFD+VWVP+T  T +S +T +   E +   GDSS    T S+
Sbjct: 1100 LQVLASQGAIKKWSSVFRKFDRVWVPVTPVTGSSEATFKTQEETVALPGDSST---TLSK 1156

Query: 1124 DAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNREFAAAINEVLDPWINAKQ 1183
                  S NN NS  FH  HPQFIGYTRGKLHELVMKS+K+REFAAAIN+VLDPWINAKQ
Sbjct: 1157 SQGAANSENNANSVPFHCQHPQFIGYTRGKLHELVMKSFKSREFAAAINDVLDPWINAKQ 1216

Query: 1184 PKKETE-HVYRKSEGDTRAGKRARLLVRESDGDEETEEELQTIQ-DESTFEDLCGDASFP 1241
            PKKE + H+YRKSE D R+ KRARL V  SD D   +E++++IQ DE+TFE+LCGD+ F 
Sbjct: 1217 PKKEVDSHIYRKSEIDGRSSKRARLQVDGSDDDYFIDEDVESIQKDETTFEELCGDSIFH 1276

Query: 1242 GEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRFYKGISRQV 1301
            GE S  S  E G WGLLDGH LA VFH++RSDM+SL FASLTC+HWRAAV FYK ISRQV
Sbjct: 1277 GENSECSDSELGSWGLLDGHMLARVFHYMRSDMRSLVFASLTCKHWRAAVSFYKDISRQV 1336

Query: 1302 DLSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGMLEEILQSFPHLSSIDIRGC 1361
            D S +G NCTDS+I   LN ++KE++NS+ L+            +   +P L+   +   
Sbjct: 1337 DFSHLGSNCTDSMIWNILNGYNKERINSMALIYFA---------LSLVYPLLT---LEVA 1384

Query: 1362 GQFGELALKFPNINWVKSQKSRG-AKFNDSRSKIRSLKQITEKSSSAPKSKGLGDDMDDF 1420
                   LKFP++ W+K+Q SRG     +S SKIRSLK I+E++ +  K+KGLG D DDF
Sbjct: 1385 ANSRNWPLKFPDVRWIKTQSSRGIGIIEESSSKIRSLKHISERTPTFYKTKGLGSDADDF 1444

Query: 1421 GDLKDYFESVDKRDSANQSFRRSLYQRSKVFDARKSSSILSRDARMRRWSIKKSENGYKR 1480
            GDLK+YF+SV+KRDSANQ FRRSLY+RSK+FDAR+SSSI+SRDAR+RRW+IKKSE+GYKR
Sbjct: 1445 GDLKEYFDSVNKRDSANQLFRRSLYKRSKLFDARRSSSIVSRDARVRRWAIKKSESGYKR 1504

Query: 1481 MEEFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGLGSVKDDISRMCRDAIKAK 1540
            ME FLAS LK+IM+ NTF+FFVPKVAEIE RMK GYY+ HGL SVK+DISRMCRDAIK  
Sbjct: 1505 MEGFLASGLKDIMKENTFDFFVPKVAEIEDRMKSGYYLGHGLRSVKEDISRMCRDAIK-- 1562

Query: 1541 NRGSAGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKDESPAGLYSATSKYKKKLS 1600
                                             +E+MKSWKD+  AGL  A+ K KKKL 
Sbjct: 1563 ---------------------------------DELMKSWKDDLSAGLGCASMKSKKKL- 1588

Query: 1601 KMVSERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRKSLDSGSETSDDLDGSSE 1660
              + ++K  NR+NG++ +NG FDYGEYASDREIR+RLSKLNRKS++SGSETSD LD SSE
Sbjct: 1589 --LIDKKNANRNNGSTFSNGGFDYGEYASDREIRRRLSKLNRKSMESGSETSDGLDKSSE 1646

Query: 1661 DGKSDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLD-FSDDREWGARMTKASLVP 1719
            DG+S+S+ST SDT+SD+D R +GR  ESRG G F  DE LD   D+REWGARMTKASLVP
Sbjct: 1647 DGRSESDSTSSDTESDLDIRLEGRIGESRGGGFFMEDEALDSMIDEREWGARMTKASLVP 1706

Query: 1720 PVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEELDMELPEVKDYKPR 1779
            PVTRKYEVIDQYVIVADEEDV+RKM V+LP+DYAEKL+AQKNG+E  DMELPEVK+YKPR
Sbjct: 1707 PVTRKYEVIDQYVIVADEEDVQRKMCVALPDDYAEKLDAQKNGTE--DMELPEVKEYKPR 1764

Query: 1780 KQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNKQVRHFTGT 1839
            KQ GD+V EQEVYGIDPYTHNLLLDSMP+ELDW L +KH+FIED+LLRTLNKQVR FTGT
Sbjct: 1765 KQPGDEVLEQEVYGIDPYTHNLLLDSMPEELDWTLSDKHMFIEDMLLRTLNKQVRRFTGT 1824

Query: 1840 GNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAYRKGLGVVCNK 1899
            GNTPM YPL+P+IEEIE  A +DCDVRTMK+C+GILKA+DSR DD YVAYRKGLGVVCNK
Sbjct: 1825 GNTPMKYPLKPIIEEIEAAAEEDCDVRTMKICQGILKAIDSRRDDNYVAYRKGLGVVCNK 1884

Query: 1900 EGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYD 1959
            EGGF EDDFVVEFLGEVYP WKWFEKQDGIRSLQK+++DPAPEFYNIYLERPKGDADGYD
Sbjct: 1885 EGGFAEDDFVVEFLGEVYPAWKWFEKQDGIRSLQKDSKDPAPEFYNIYLERPKGDADGYD 1944

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
            LVVVDAMHKANYASRICHSCRPNCEAKVTAV G YQIGIYTVR I YGEEITFDYNSVTE
Sbjct: 1945 LVVVDAMHKANYASRICHSCRPNCEAKVTAVHGQYQIGIYTVREIQYGEEITFDYNSVTE 2004

Query: 2020 SKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEED 2079
            SKEEYEASVCLCGSQVCRGSYLNLTGEGAF+KVLKE H +LDRH LMLEACELNSVSEED
Sbjct: 2005 SKEEYEASVCLCGSQVCRGSYLNLTGEGAFQKVLKEWHAMLDRHHLMLEACELNSVSEED 2064

Query: 2080 YLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICL 2139
            YL+LGRAGLGSCLLGGLP+WVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICL
Sbjct: 2065 YLDLGRAGLGSCLLGGLPDWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICL 2124

Query: 2140 EVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLW 2199
            EVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMR +FGDPKKAPPP+ERLSPEETVSF+W
Sbjct: 2125 EVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRSLFGDPKKAPPPLERLSPEETVSFIW 2184

Query: 2200 KGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLP 2259
            K EGSLV+EL+QCMAPHVE DVLNDLKSKI A DP  S++I++EL+KSLLWLRDEVR+LP
Sbjct: 2185 KEEGSLVDELLQCMAPHVEVDVLNDLKSKICARDPLNSDNIRKELQKSLLWLRDEVRSLP 2244

Query: 2260 CTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKLGADLQVYR 2319
            CTYKCRHDAAADLIH+YAYT+CF+RV+EY  FTSPPV+ISPLDLGPKYADKLGA +  YR
Sbjct: 2245 CTYKCRHDAAADLIHVYAYTRCFYRVREYDTFTSPPVHISPLDLGPKYADKLGAGIHEYR 2304

Query: 2320 KTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKT 2379
            KTYGENYC+GQLIFWHIQTNA+PDC+LA+ASRGCLSLPDIGSFYAKVQKPS+ RVYGP+T
Sbjct: 2305 KTYGENYCMGQLIFWHIQTNAEPDCSLAKASRGCLSLPDIGSFYAKVQKPSQQRVYGPRT 2364

Query: 2380 VRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSLTGCPLDREMVHWLKHRPAIF 2439
            V+ ML RMEK PQ+PWPKD+IW+FKSSP++ GSPMLD+ L+   LDREMVHWLKHRP ++
Sbjct: 2365 VKLMLERMEKYPQKPWPKDQIWSFKSSPKVIGSPMLDAVLSNSSLDREMVHWLKHRPTVY 2424

Query: 2440 QAMWDR 2445
            QAMWDR
Sbjct: 2425 QAMWDR 2430


>gi|359485692|ref|XP_002275342.2| PREDICTED: probable histone-lysine N-methyltransferase ATXR3-like
            [Vitis vinifera]
          Length = 2367

 Score = 3131 bits (8117), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1633/2366 (69%), Positives = 1888/2366 (79%), Gaps = 137/2366 (5%)

Query: 129  VENGGAVGEVVTVDKENLKNEEVEEGELGTLKW-----ENGEFVQPEKSQPQSQLQSQSK 183
            +ENG        +  + +  EEVEEGELGTLKW     ENGEF +PEK +          
Sbjct: 90   IENG-------EICNDKIVKEEVEEGELGTLKWPKGEVENGEF-EPEKPR--------RS 133

Query: 184  QIEKGEIIVFSSKCRRGETEKGESGLWR-----GNKDDIEKGEFIPDRWHKEVVKDEYGY 238
             IEKGE +  S K R+G+ EKGE  L R     G+KD++EKGEFIPDRW ++V +D YG 
Sbjct: 134  DIEKGEFV--SGKWRKGDIEKGELVLERFRKGDGSKDELEKGEFIPDRWQRDVGRDGYGC 191

Query: 239  SKSRR------------YDYKLERTPPSGKYSGEDVYRRKEFDRSGSQHSKSSSR--WES 284
            SK RR            YD++ ERTPPSGKYSG+DV +RKEF RSGSQ +K SSR  WE+
Sbjct: 192  SKMRRHELAKDKGWKFEYDHERERTPPSGKYSGDDVSQRKEFSRSGSQFAKRSSRSRWEA 251

Query: 285  GQERNVRISSKIVDDEGLYKGEHNNGKNHGREYFHGNRFKRHGTDSDSGDRKYYGDYGDF 344
              ERNVRISSKIVDDEG YK EHN+ KNHGRE     R KR+GTDSD  +RK++G+YGD 
Sbjct: 252  VPERNVRISSKIVDDEGTYKTEHNSSKNHGRELVSRTRMKRYGTDSDGSERKHHGEYGDH 311

Query: 345  AGLKSRRLSDDYNSRSVHSEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRVIY 404
             G K R+LSDD N R+VH EHYSR S+E+ +RNSSSSRISS D++SSRH+E S SS+V++
Sbjct: 312  MGSKIRKLSDDSN-RTVHLEHYSRRSMERSYRNSSSSRISSSDRFSSRHYESSFSSKVVH 370

Query: 405  DRHGRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRD 464
            DRHGRSP HS+RSP DR RY+DH              RDRSP                  
Sbjct: 371  DRHGRSPVHSERSPRDRARYHDH--------------RDRSPAY---------------- 400

Query: 465  RSPYAREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNN 524
            RS   R++SPYDRSRHYDHRNRSP   ERSPQDR R+H+R DRTP YLERSPL  SRPNN
Sbjct: 401  RSSPRRDRSPYDRSRHYDHRNRSPAPTERSPQDRPRYHERRDRTPTYLERSPLDHSRPNN 460

Query: 525  HREASSKTGASEKRNARYDSKGHEDKLGPKDSNARCSRSSAKESQDKSNVQDLNV--SDE 582
            +REAS K GA EKR+ +Y +K  E+KL  +D+N R    SAKESQD+S++  +N   SDE
Sbjct: 461  YREASCKGGAGEKRHGQYGNKVQEEKLNQRDANGRDPHFSAKESQDRSSLHTVNGHGSDE 520

Query: 583  KTANCESHKEEQPQSSSVDCKEPPQVDGPPLEELVSMEEDMDICDTPPHVPAVTDSSVGK 642
            K+AN + HKEE+PQS  V+ +EPPQ+   P EEL SMEEDMDICDTPPHVP V DS+ GK
Sbjct: 521  KSANHQPHKEEKPQSPCVNLEEPPQITVAP-EELASMEEDMDICDTPPHVPLVADSTTGK 579

Query: 643  WFYLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSI 702
            WFYLDH GME GPS+LCDLK LVEEGVLVSDH IKH+DS+RW T+ENA SPLV VNFPSI
Sbjct: 580  WFYLDHFGMERGPSKLCDLKKLVEEGVLVSDHLIKHVDSDRWLTIENAASPLVPVNFPSI 639

Query: 703  TSDSVTQLVSPPEASGNLLADTGDTAQST---GEEFPVTL-QSQCCPDGSAAAAESSEDL 758
             SD+VTQLVSPPEA GNLLA+ GD  +S+    EE P TL QS  C + S+ A+E  EDL
Sbjct: 640  VSDTVTQLVSPPEAPGNLLAEAGDATESSKLLDEETPATLLQSMSCNNDSSTASEPLEDL 699

Query: 759  HIDVRVGALLDGFTVIPGKEIETLGEILQTTFERVDWQNNGGPTWHGACVGEQKPGDQKV 818
             ID RV ALL GFTVIPG+E+ETLG                G +WH   +GEQ   DQ+ 
Sbjct: 700  QIDERVRALLKGFTVIPGRELETLG----------------GLSWHQPRIGEQ--FDQRT 741

Query: 819  DEL-YISDTKMKEAAELKSGDKDHWVVCF---DSDEWFSGRWSCKGGDWKRNDEAAQDRC 874
            DE     +   KEA++ +S         F   D  +WFS RW+ KGGDWKRNDE+AQDR 
Sbjct: 742  DEFSRYPEITSKEASDSRSSTSSDKDYAFAFGDFSDWFSARWASKGGDWKRNDESAQDRL 801

Query: 875  SRKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYACPDERNDGSGG 934
            SRKK VLNDG+PLCQMPKSGYEDPRW++KD+LYYPSH R+LDLP WA++ PDER+D +  
Sbjct: 802  SRKKLVLNDGYPLCQMPKSGYEDPRWHRKDELYYPSHGRKLDLPIWAFSWPDERSDSNSA 861

Query: 935  SRSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSSRSARSYSSA 994
            SR++Q K   VRGVKG+MLPVVRINACV        SEP +KVR K+R+SSRSAR+YSS 
Sbjct: 862  SRASQIK-PVVRGVKGSMLPVVRINACV--------SEPPAKVRGKDRYSSRSARAYSST 912

Query: 995  NDVRRSSAESDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGEWYYLDGAGH 1054
             DV+RSSAES SHSK+ +  DSQGSWK I  INTPKDRLCT +DLQL LG+WYYLDGAGH
Sbjct: 913  TDVKRSSAESASHSKSVSENDSQGSWKCITSINTPKDRLCTAEDLQLHLGDWYYLDGAGH 972

Query: 1055 ERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRNHGEKIMPSGDS 1114
            E+GPSSFSELQ LVDQG IQKH+SVFRK DK+WVP+T A +   + V+   +  + S D 
Sbjct: 973  EQGPSSFSELQALVDQGSIQKHSSVFRKNDKIWVPITSAADVPDAAVKIQPQNNVTSTDC 1032

Query: 1115 SGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNREFAAAINEV 1174
            SG    QS    +G   NN  S + H++HPQFIGYT GKLHELVMKSYK+REFAAAINEV
Sbjct: 1033 SGPSLAQSLAGAIG--GNNTISRSLHSLHPQFIGYTCGKLHELVMKSYKSREFAAAINEV 1090

Query: 1175 LDPWINAKQPKKETEH--VYRKSEGDTR-----------AGKRARLLVRESDGDEETEEE 1221
            LDPWIN+KQPKKE  +  V   S  D             AG R R LV  S+ D E EE+
Sbjct: 1091 LDPWINSKQPKKEMANSAVSNSSLHDLNKFRTSGMSHICAGIRGRWLVDGSEDDYEMEED 1150

Query: 1222 LQTIQ-DESTFEDLCGDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFA 1280
            +  +Q DESTFEDLC DA+F  E+ A + + S  WGLLDG+ LA VFHFLR+D+KSLAFA
Sbjct: 1151 VLLVQKDESTFEDLCSDATFYQEDIALAEMGSENWGLLDGNVLARVFHFLRTDVKSLAFA 1210

Query: 1281 SLTCRHWRAAVRFYKGISRQVDLSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITS 1340
            +LTC+HWRAAVRFYKG+SRQVDLSSVG  CTDS I   +N ++KE++ S++L+GCTNIT 
Sbjct: 1211 ALTCKHWRAAVRFYKGVSRQVDLSSVGSLCTDSTIWSMINGYNKERITSMILIGCTNITP 1270

Query: 1341 GMLEEILQSFPHLSSIDIRGCGQFGELALKFPNINWVKSQKSRGAKFNDSRSKIRSLKQI 1400
            GMLE++L SFP LSSIDIRGC QF ELA KF N+NW+KS+      F +S SKI++LKQI
Sbjct: 1271 GMLEDVLGSFPSLSSIDIRGCSQFWELADKFSNLNWIKSRIRVMKVFEESYSKIKALKQI 1330

Query: 1401 TEKSSSAPKSKGLGDDMDDFGDLKDYFESVDKRDSANQSFRRSLYQRSKVFDARKSSSIL 1460
            TE+ S +   KG+G  +DD  +LK+YF+SVD+R+SA+QSFRRS Y+RSK+FDAR+SSSIL
Sbjct: 1331 TERPSVSKPLKGMGSHVDDSSELKEYFDSVDRRESASQSFRRSYYKRSKLFDARRSSSIL 1390

Query: 1461 SRDARMRRWSIKKSENGYKRMEEFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISH 1520
            SRDARMRRWSIK SENGYKRMEEFLASSL++IM+ NTF+FFVPKVAEIE RMK GYY  H
Sbjct: 1391 SRDARMRRWSIKNSENGYKRMEEFLASSLRDIMKENTFDFFVPKVAEIEDRMKNGYYAGH 1450

Query: 1521 GLGSVKDDISRMCRDAIKAKNRGSAGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSW 1580
            GL SVK+DISRMCRDAIKAKNRG +G+MNRI TLFI+LAT LE+G+KSS   REEM++ W
Sbjct: 1451 GLSSVKEDISRMCRDAIKAKNRGDSGNMNRIITLFIRLATCLEEGSKSSN-GREEMVRRW 1509

Query: 1581 KDESPAGLYSATSKYKKKLSKMVSERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKL 1640
            KDESP+GL S+ SKYKKKL+K+V+ERK+  RSNG S      DYGEYASDREIR+RLSKL
Sbjct: 1510 KDESPSGLCSSGSKYKKKLNKIVTERKH--RSNGGS------DYGEYASDREIRRRLSKL 1561

Query: 1641 NRKSLDSGSETSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGL 1700
            N+KS+DSGS+TSDDLD SSE G S SEST SDT+SD+DFRS+G   ESR  G FT DEGL
Sbjct: 1562 NKKSMDSGSDTSDDLDRSSEGGSSGSESTASDTESDLDFRSEGGVAESRVDGYFTADEGL 1621

Query: 1701 -DFSDDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQ 1759
               +DDREWGARMTK SLVPPVTRKYEVI+QYVIVADE++V+RKM+VSLPE Y EKL AQ
Sbjct: 1622 YSMTDDREWGARMTKVSLVPPVTRKYEVIEQYVIVADEDEVQRKMKVSLPEHYNEKLTAQ 1681

Query: 1760 KNGSEELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHL 1819
            KNG+EE DME+PEVKDYKPRKQLGD+V EQEVYGIDPYTHNLLLDSMP+ELDW LLEKHL
Sbjct: 1682 KNGTEESDMEIPEVKDYKPRKQLGDEVIEQEVYGIDPYTHNLLLDSMPEELDWPLLEKHL 1741

Query: 1820 FIEDVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMD 1879
            FIE+VLL TLNKQVRHFTGTGNTPMMY LQPV+E+I+K A ++ D+RT+KMC+GILKAM+
Sbjct: 1742 FIEEVLLCTLNKQVRHFTGTGNTPMMYHLQPVVEDIQKTAEEELDLRTLKMCQGILKAMN 1801

Query: 1880 SRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDP 1939
            SRPDD YVAYRKGLGVVCNKEGGF ++DFVVEFLGEVYP WKWFEKQDGIRSLQKN++DP
Sbjct: 1802 SRPDDNYVAYRKGLGVVCNKEGGFSQEDFVVEFLGEVYPAWKWFEKQDGIRSLQKNSKDP 1861

Query: 1940 APEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIY 1999
            APEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAV+G YQIGIY
Sbjct: 1862 APEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVEGQYQIGIY 1921

Query: 2000 TVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGL 2059
            TVR I YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF+KVLKE HG+
Sbjct: 1922 TVRQIQYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFQKVLKECHGI 1981

Query: 2060 LDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLP 2119
            LDR+Q+M EACELN VSEEDY++LGRAGLGSCLLGGLP+W++AY+ARLVRFIN ERTKLP
Sbjct: 1982 LDRYQMMFEACELNMVSEEDYIDLGRAGLGSCLLGGLPDWLIAYAARLVRFINFERTKLP 2041

Query: 2120 EEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDP 2179
            EEILRH+L+EKRKYF+DI LEVEKSDAE+QAEGVYNQRLQNLA+TLDKVRYVMRCVFGDP
Sbjct: 2042 EEILRHSLDEKRKYFADISLEVEKSDAELQAEGVYNQRLQNLALTLDKVRYVMRCVFGDP 2101

Query: 2180 KKAPPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSED 2239
            KKAPPP+ERLS EE VSFLW GEGSLVEEL+QCMAPH+E+ +L++LK KI+AHDPSGS+D
Sbjct: 2102 KKAPPPLERLSAEEVVSFLWNGEGSLVEELLQCMAPHMEDGMLSELKPKIRAHDPSGSDD 2161

Query: 2240 IQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYIS 2299
            I +EL+KSLLWLRDEVRNLPC YKCRHDAAADLIHIYAYTKCFFRV+EYK+ TSPPVYIS
Sbjct: 2162 IHKELQKSLLWLRDEVRNLPCNYKCRHDAAADLIHIYAYTKCFFRVREYKSVTSPPVYIS 2221

Query: 2300 PLDLGPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDI 2359
            PLDLGPKY+DKLG+ +Q Y KTYGENYCLGQLI+WH QTNADPDC LARASRGCLSLPDI
Sbjct: 2222 PLDLGPKYSDKLGSGIQEYCKTYGENYCLGQLIYWHNQTNADPDCNLARASRGCLSLPDI 2281

Query: 2360 GSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSL 2419
            GSFYAKVQKPSR RVYGP+T+RFML+RMEKQPQR WPKDRIW+FKS P+IFGSPMLD+ L
Sbjct: 2282 GSFYAKVQKPSRQRVYGPRTLRFMLARMEKQPQRQWPKDRIWSFKSCPKIFGSPMLDAVL 2341

Query: 2420 TGCPLDREMVHWLKHRPAIFQAMWDR 2445
               PLDREM+HWLK+RPA FQAMWDR
Sbjct: 2342 HNSPLDREMLHWLKNRPATFQAMWDR 2367


>gi|449453666|ref|XP_004144577.1| PREDICTED: probable histone-lysine N-methyltransferase ATXR3-like
            [Cucumis sativus]
          Length = 2336

 Score = 2979 bits (7722), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1573/2491 (63%), Positives = 1859/2491 (74%), Gaps = 201/2491 (8%)

Query: 1    MGDGGVACMPLQQQQQHNSIMERFPISDKTTICVGNSSNNSNKTNNNSISNNNDNKTNND 60
            MGDGGVAC+PLQQQQQH  IME FPI  +  +C G                         
Sbjct: 1    MGDGGVACIPLQQQQQH--IMETFPIPSEKMLCAGK------------------------ 34

Query: 61   SSNNNGSSSSKNNETNKSNVKKNGVSTKTVRKKIVKIKK--VIAVKKKEVQKNSGSSKSN 118
               NNG +S       KS VK     ++  RK+ +K+KK  V+A   +  +  SG  K  
Sbjct: 35   ---NNGFNS-------KSTVK----FSEAERKQKMKLKKEEVVAKDVELGRTESGLDKPG 80

Query: 119  NNGENIDNKNVENGGAVGEVVTVDKENLKNEEVEEGELGTLKW-----ENGEFVQPEKSQ 173
             +   + +   ENG    E           +EVEEGE GTLKW     ENGEFV PEKS+
Sbjct: 81   KSSREVGH--AENGVDSAE----------KDEVEEGEFGTLKWSRVEVENGEFV-PEKSR 127

Query: 174  PQSQLQSQSKQIEKGEIIVFSSKCRRGETEKGE------------SGLWRGNKDDIEKGE 221
                      +I+KGE +    K RRG+ EKGE            +   R  KD+IE+GE
Sbjct: 128  --------RTEIDKGENV--RGKWRRGDIEKGEIVPEKSRKGEVDNRSRRLAKDEIERGE 177

Query: 222  FIPDRWHK-EVVKDEYGYSKSRRYDYKLER--------TPPSGKYSGEDVYRRKEFDRSG 272
            FIPDRW K +++KD++ YS++RRY+ + +R        TPP  KYS +D  RRKE +RSG
Sbjct: 178  FIPDRWEKGDILKDDFRYSRTRRYEPEKDRAWKNVREPTPPLVKYSTDDT-RRKELNRSG 236

Query: 273  SQHSKSSSRWESGQERNVRISSKIVDDEGLYKGEHNNGKNHGREYFHGNRFKRHGTDSDS 332
            +QH K++ RWE+GQ+R  R  SK+++DE  ++ ++N+GKN G++Y   NR KR+  +SD+
Sbjct: 237  NQHGKTTPRWETGQDRGSRYGSKLMNDEVTHRNDYNDGKNFGKDYSSCNRLKRYSLESDN 296

Query: 333  GDRKYYGDYGDFAGLKSRRLSDDYNSRSVHSEHYSRHSVEKFHRNSSSSRISSL-DKYSS 391
             +RK+YGDYGD+AG KSRRLS+D +SR+ HS+HYS   +E+  +NSSSS   S  DK+S+
Sbjct: 297  FERKHYGDYGDYAGSKSRRLSED-SSRTAHSDHYSIRPMERSCKNSSSSSRISSSDKFST 355

Query: 392  RHHEPS-LSSRVIYDRHGRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFD 450
            RH+E S  SSR  Y RH  SP HSDRSP ++GRY+DHRDRSP                  
Sbjct: 356  RHYESSSTSSREAYSRHVHSPGHSDRSPREKGRYHDHRDRSPGH---------------- 399

Query: 451  RSPYSRERSPYNRDRSPYAREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPN 510
                 R+RSP+  +RSPY R+KSPYDRSRHYDHR RSP + ERSPQDRAR H R DRTPN
Sbjct: 400  -----RDRSPFIGERSPYGRDKSPYDRSRHYDHRYRSPLT-ERSPQDRARCHSRRDRTPN 453

Query: 511  YLERSPLHRSRPNNHREASSKTGASEKRNARYDSKGHEDKLGPKDSNARCSRSSAKESQD 570
            YL+RSPL RSR +NHRE S ++   +  N    S+  EDK  PKD + R   S AKES D
Sbjct: 454  YLDRSPLDRSRTSNHRETSRRSKGEKHNNG---SRAREDKTTPKDPDGR--ESVAKESYD 508

Query: 571  KSNVQDLNVSDEKTANCESHK-EEQPQSSSVDCKEPPQVDGPPLEELVSMEEDMDICDTP 629
            + N Q+ N S E   +C S++ EE+ QS +    E   VDG P EEL SMEEDMDICDTP
Sbjct: 509  EINEQNTNGSIETVGDCRSYEGEEKSQSPNQTSIELSHVDGVP-EELPSMEEDMDICDTP 567

Query: 630  PHVPAVTDSSVGKWFYLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVEN 689
            PH P VTD+S GKWFYLD+ G+E GP+RL DLK LVEEG L+SDHFIKHLDS+RW TVEN
Sbjct: 568  PHAPLVTDTSTGKWFYLDYYGLERGPTRLYDLKALVEEGSLMSDHFIKHLDSDRWVTVEN 627

Query: 690  AVSPLVTVNFPSITSDSVTQLVSPPEASGNLLADTGDTAQ---STGEEFPVTLQSQCCPD 746
            AVSPLVT+NFPSI  DSVTQLVSPPEA+GN+L D  DT +     G   P  + S     
Sbjct: 628  AVSPLVTINFPSIVPDSVTQLVSPPEATGNVLVDITDTGKLDIQGGHFEPNQIPSGGSIL 687

Query: 747  GSAAAAESSE---DLHIDVRVGALLDGFTVIPGKEIETLGEILQTTFERVDWQNNGGPTW 803
             S    E+SE   DLHID R+GALL+  TVIPGKE+ET+ E+LQ T +   W+       
Sbjct: 688  PSDEGVEASEPLGDLHIDERIGALLEDITVIPGKELETIAEVLQMTLDGEQWERLAISEG 747

Query: 804  HGACVGEQKPGDQKVDELY-ISD--TKMKEAAELK-SGDKDHWVVCFDSDEWFSGRWSCK 859
                VGEQ   DQ  D++   SD  T +   ++   S DKD  V   D  +W SG WSCK
Sbjct: 748  FSDHVGEQL--DQSTDDVVEFSDFVTSVDSGSQKNVSSDKDFAV---DDGDWTSGPWSCK 802

Query: 860  GGDWKRNDEAAQDRCSRKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPP 919
            GGDW+RNDE+AQ+R  RKK VLNDGFPLCQM KSGYEDPRW+QKD+LYYPS S+RLDLPP
Sbjct: 803  GGDWRRNDESAQERNGRKKLVLNDGFPLCQMSKSGYEDPRWHQKDELYYPSQSKRLDLPP 862

Query: 920  WAYACPDERNDGSGGSRSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRA 979
            WA+ C D+R+               +RG KGTMLPV+RINACVV DHGSFVSEPR KVR 
Sbjct: 863  WAFTCLDDRS------------TLTIRGTKGTMLPVIRINACVVKDHGSFVSEPRMKVRG 910

Query: 980  KERHSSRSARSYSSANDVRRSSAESDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDL 1039
            K    SRS R +SS  D +RS A+ DS SK   +  S+ S K+ A ++ PKDRLC+ DDL
Sbjct: 911  KGH--SRS-RLFSSNTDGKRS-ADGDSLSKIARDVSSERSLKATAFVSIPKDRLCSYDDL 966

Query: 1040 QLQLGEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSAS 1099
            QL  G+WYYLDGAGHE GPSSFSELQ+LVD G IQK++SVFRKFD+VWVP+T   E S S
Sbjct: 967  QLHFGDWYYLDGAGHECGPSSFSELQLLVDHGIIQKNSSVFRKFDRVWVPVTSFAECSES 1026

Query: 1100 TVRNHGEKIMPSGDSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVM 1159
            T R   EKI   G+++  P + S D   G       SN FH +HPQF+GYTRGKLHELVM
Sbjct: 1027 TRRIQREKIPLLGETTKNPVSVSGDNSFG--GLATTSNMFHELHPQFVGYTRGKLHELVM 1084

Query: 1160 KSYKNREFAAAINEVLDPWINAKQPKKETEH-VYRKSEGDTRAGKRARLLVRESDGDEET 1218
            K YK+REFAAAIN+VLDPWINAKQPKKE E  ++ KS+G  RA KRAR+LV ESD D E 
Sbjct: 1085 KFYKSREFAAAINDVLDPWINAKQPKKEMEKTMHWKSDGSARAAKRARVLVDESDDDYEV 1144

Query: 1219 EEEL--QTIQDESTFEDLCGDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKS 1276
            +E+L     +DE  FEDLCGDA+FPGEES S  +ES  WG LDGH LA +FHFL+SD+KS
Sbjct: 1145 DEDLLHHRQKDEIAFEDLCGDATFPGEESTSLEVES--WGFLDGHILARIFHFLQSDLKS 1202

Query: 1277 LAFASLTCRHWRAAVRFYKGISRQVDLSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCT 1336
            L+FAS+TC+HWRAAVRFYK IS+QVDLSS+GPNCT+S     ++ +++EK+N I+LVGCT
Sbjct: 1203 LSFASVTCKHWRAAVRFYKDISKQVDLSSLGPNCTNSTFMNVMSTYNEEKVNFIVLVGCT 1262

Query: 1337 NITSGMLEEILQSFPHLSSIDIRGCGQFGELALKFPNINWVKSQKSRGAKFNDSRSKIRS 1396
            NIT  +LEEIL  FP L+SID+RGC QF +L  K+PNINWVK   +      ++ SK+RS
Sbjct: 1263 NITPVVLEEILGMFPQLASIDVRGCSQFNDLPSKYPNINWVKRSLNATKNNEETHSKMRS 1322

Query: 1397 LKQITEKSSSAPKSKGLGDDMDDFGDLKDYFESVDKRDSANQSFRRSLYQRSKVFDARKS 1456
            LK +T+KS S  K KGL  ++DDFG+LK YFESVDKR+SANQ FRRSLY+RSKVFDARKS
Sbjct: 1323 LKHLTDKSYSLSKIKGLSSNVDDFGELKQYFESVDKRESANQLFRRSLYKRSKVFDARKS 1382

Query: 1457 SSILSRDARMRRWSIKKSENGYKRMEEFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGY 1516
            SSI+SRDARMR+WSIKKSE GYKRM EFLASSLKEIMR NTFEFFVPKVAEI+ R++ GY
Sbjct: 1383 SSIVSRDARMRQWSIKKSEVGYKRMVEFLASSLKEIMRDNTFEFFVPKVAEIQDRIRNGY 1442

Query: 1517 YISHGLGSVKDDISRMCRDAIKAKNRGSAGDMNRITTLFIQLATRLEQGAKSSYYEREEM 1576
            YI  GLGSVK+DISRMCRDAIK                                    + 
Sbjct: 1443 YIKRGLGSVKEDISRMCRDAIKY-----------------------------------DE 1467

Query: 1577 MKSWKDESPAGL-YSATSKYKKKLSKMVSERKYMNRSNGTSLANGDFDYGEYASDREIRK 1635
            + SW+D+S   L  SA SKYK++L K+ +ERKY NRSNG+   NG  D+GEYASDREIR+
Sbjct: 1468 VSSWEDDSSLRLGSSAASKYKRRLGKVGTERKYTNRSNGSIFGNGALDHGEYASDREIRR 1527

Query: 1636 RLSKLNRKSLDSGSETSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAGDFT 1695
            RLS+LN+K + S SETSD+ D SS DGKS SE++ SDT+SD++F S GR  E+RG   F 
Sbjct: 1528 RLSRLNKKPIGSESETSDEFDRSSGDGKSGSENSASDTESDLEF-SSGRI-ETRGDKCFI 1585

Query: 1696 TDEGLDFS-DDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAE 1754
             DE  D + DDREWGARMTKASLVPPVTRKYE+ID+YV++ADEE+VRRKMRVSLP+DY E
Sbjct: 1586 LDEAFDSTMDDREWGARMTKASLVPPVTRKYELIDEYVVIADEEEVRRKMRVSLPDDYVE 1645

Query: 1755 KLNAQKNGSEELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNL 1814
            KLNAQKNG+EELDMELPEVKDYKPRK++GD+V EQEVYGIDPYTHNLLLDS+P+ELDW+L
Sbjct: 1646 KLNAQKNGAEELDMELPEVKDYKPRKKIGDEVLEQEVYGIDPYTHNLLLDSVPEELDWSL 1705

Query: 1815 LEKHLFIEDVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGI 1874
            ++KH+FIEDVLLRTLNKQ  HFTGTGNTPM YPL PVIEEIEK A  +CD+R M++C+GI
Sbjct: 1706 MDKHMFIEDVLLRTLNKQAIHFTGTGNTPMKYPLLPVIEEIEKVAAAECDIRIMRLCQGI 1765

Query: 1875 LKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQK 1934
            LKA+ SRP+DKYVAYRKGLGVVCNK+ GFGEDDFVVEFLGEVYPVWKW+EKQDGIRSLQK
Sbjct: 1766 LKAIHSRPEDKYVAYRKGLGVVCNKQEGFGEDDFVVEFLGEVYPVWKWYEKQDGIRSLQK 1825

Query: 1935 NNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHY 1994
            N++DPAPEFYNIYLERPKGD DGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHY
Sbjct: 1826 NDKDPAPEFYNIYLERPKGDGDGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHY 1885

Query: 1995 QIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLK 2054
            QIGIYT+R I YGEEITFDYNSVTESKEEYEASVCLCGS VCRGSYLNLTG+GAF KVL+
Sbjct: 1886 QIGIYTLRKIQYGEEITFDYNSVTESKEEYEASVCLCGSHVCRGSYLNLTGDGAFLKVLE 1945

Query: 2055 ELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLE 2114
            E HG+LD HQLMLEACELNSVSE+DYL+LGRAGLGSCLLGGLP+W+VAYSAR+VRFIN E
Sbjct: 1946 EWHGVLDCHQLMLEACELNSVSEDDYLDLGRAGLGSCLLGGLPDWLVAYSARVVRFINFE 2005

Query: 2115 RTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRC 2174
            RTKLP+EIL HNLEEKRKYFSDICL+VEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRC
Sbjct: 2006 RTKLPQEILAHNLEEKRKYFSDICLDVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRC 2065

Query: 2175 VFGDPKKAPPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDP 2234
            +FGDPK APPP++RLSPEE+VS++W GEGSLVEEL+  M PHVEED+++DLK KI+AHDP
Sbjct: 2066 IFGDPKNAPPPLKRLSPEESVSYIWNGEGSLVEELLLSMVPHVEEDLISDLKLKIRAHDP 2125

Query: 2235 SGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSP 2294
              S+DIQ+EL++SLLWLRDEVRN+PCTYK R+DAAADLIHIYAYTK FFR+QEYKA TSP
Sbjct: 2126 LCSDDIQKELQQSLLWLRDEVRNIPCTYKSRNDAAADLIHIYAYTKNFFRIQEYKAVTSP 2185

Query: 2295 PVYISPLDLGPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCL 2354
            PVYIS LDLGPKY DKLG   Q Y KTYG NYCLGQLIFWH Q N DPDC+LA ASRGCL
Sbjct: 2186 PVYISSLDLGPKYVDKLGTGFQEYCKTYGPNYCLGQLIFWHNQQNIDPDCSLALASRGCL 2245

Query: 2355 SLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPM 2414
            SLP+I SFYA+VQKPSR RVYGPKTV+FMLSRMEKQPQRPWPKDRIW+FK+SP++ GSPM
Sbjct: 2246 SLPEISSFYARVQKPSRQRVYGPKTVKFMLSRMEKQPQRPWPKDRIWSFKNSPKVIGSPM 2305

Query: 2415 LDSSLTGCPLDREMVHWLKHRPAIFQAMWDR 2445
            LD  L+  PL++++VHWLKHR  IFQAMWDR
Sbjct: 2306 LDVVLSNSPLEKDLVHWLKHRTPIFQAMWDR 2336


>gi|449493199|ref|XP_004159219.1| PREDICTED: probable histone-lysine N-methyltransferase ATXR3-like
            [Cucumis sativus]
          Length = 2336

 Score = 2978 bits (7720), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1573/2491 (63%), Positives = 1858/2491 (74%), Gaps = 201/2491 (8%)

Query: 1    MGDGGVACMPLQQQQQHNSIMERFPISDKTTICVGNSSNNSNKTNNNSISNNNDNKTNND 60
            MGDGGVAC+PLQQQQQH  IME FPI  +  +C G                         
Sbjct: 1    MGDGGVACIPLQQQQQH--IMETFPIPSEKMLCAGK------------------------ 34

Query: 61   SSNNNGSSSSKNNETNKSNVKKNGVSTKTVRKKIVKIKK--VIAVKKKEVQKNSGSSKSN 118
               NNG +S       KS VK     ++  RK+ +K+KK  V+A   +  +  SG  K  
Sbjct: 35   ---NNGFNS-------KSTVK----FSEAERKQKMKLKKEEVVAKDVELGRTESGLDKPG 80

Query: 119  NNGENIDNKNVENGGAVGEVVTVDKENLKNEEVEEGELGTLKW-----ENGEFVQPEKSQ 173
             +   + +   ENG    E           +EVEEGE GTLKW     ENGEFV PEKS+
Sbjct: 81   KSSREVGH--AENGVDSAE----------KDEVEEGEFGTLKWSRVEVENGEFV-PEKSR 127

Query: 174  PQSQLQSQSKQIEKGEIIVFSSKCRRGETEKGE------------SGLWRGNKDDIEKGE 221
                      +I+KGE +    K RRG+ EKGE            +   R  KD+IE+GE
Sbjct: 128  --------RTEIDKGENV--RGKWRRGDIEKGEIVPEKSRKGEVDNRSRRLAKDEIERGE 177

Query: 222  FIPDRWHK-EVVKDEYGYSKSRRYDYKLER--------TPPSGKYSGEDVYRRKEFDRSG 272
            FIPDRW K +++KD++ YS++RRY+ + +R        TPP  KYS +D  RRKE +RSG
Sbjct: 178  FIPDRWEKGDILKDDFRYSRTRRYEPEKDRAWKNVREPTPPLVKYSTDDT-RRKELNRSG 236

Query: 273  SQHSKSSSRWESGQERNVRISSKIVDDEGLYKGEHNNGKNHGREYFHGNRFKRHGTDSDS 332
            +QH K++ RWE+GQ+R  R  SK+++DE  ++ ++N+GKN G++Y   NR KR+  +SD+
Sbjct: 237  NQHGKTTPRWETGQDRGSRYGSKLMNDEVSHRNDYNDGKNFGKDYSSCNRLKRYSLESDN 296

Query: 333  GDRKYYGDYGDFAGLKSRRLSDDYNSRSVHSEHYSRHSVEKFHRNSSSSRISSL-DKYSS 391
             +RK+YGDYGD+AG KSRRLS+D +SR+ HS+HYS   +E+  +NSSSS   S  DK+S+
Sbjct: 297  FERKHYGDYGDYAGSKSRRLSED-SSRTAHSDHYSIRPMERSCKNSSSSSRISSSDKFST 355

Query: 392  RHHEPS-LSSRVIYDRHGRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFD 450
            RH+E S  SSR  Y RH  SP HSDRSP ++GRY+DHRDRSP   DRS            
Sbjct: 356  RHYESSSTSSREAYSRHVHSPGHSDRSPREKGRYHDHRDRSPGHQDRS------------ 403

Query: 451  RSPYSRERSPYNRDRSPYAREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPN 510
                     P+  +RSPY R+KSPYDRSRHYDHR RSP + ERSPQDRAR H R DRTPN
Sbjct: 404  ---------PFIGERSPYGRDKSPYDRSRHYDHRYRSPLT-ERSPQDRARCHSRRDRTPN 453

Query: 511  YLERSPLHRSRPNNHREASSKTGASEKRNARYDSKGHEDKLGPKDSNARCSRSSAKESQD 570
            YL+RSPL RSR +NHRE S ++   +  N    S+  EDK  PKD + R   S AKES D
Sbjct: 454  YLDRSPLDRSRTSNHRETSRRSKGEKHNNG---SRAREDKTTPKDPDGR--ESVAKESYD 508

Query: 571  KSNVQDLNVSDEKTANCESHK-EEQPQSSSVDCKEPPQVDGPPLEELVSMEEDMDICDTP 629
            + N Q+ N S E   +C S++ EE+ QS +    E   VDG P EEL SMEEDMDICDTP
Sbjct: 509  EINEQNTNGSIETVGDCRSYEGEEKSQSPNQTSIELSHVDGVP-EELPSMEEDMDICDTP 567

Query: 630  PHVPAVTDSSVGKWFYLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVEN 689
            PH P VTD+S GKWFYLD+ G+E GP+RL DLK LVEEG L+SDHFIKHLDS+RW TVEN
Sbjct: 568  PHAPLVTDTSTGKWFYLDYYGLERGPTRLYDLKALVEEGSLMSDHFIKHLDSDRWVTVEN 627

Query: 690  AVSPLVTVNFPSITSDSVTQLVSPPEASGNLLADTGDTAQ---STGEEFPVTLQSQCCPD 746
            AVSPLVT+NFPSI  DSVTQLVSPPEA+GN+L D  DT +     G   P  + S     
Sbjct: 628  AVSPLVTINFPSIVPDSVTQLVSPPEATGNVLVDITDTGKLDIQGGHFEPNQIPSGGSIL 687

Query: 747  GSAAAAESSE---DLHIDVRVGALLDGFTVIPGKEIETLGEILQTTFERVDWQNNGGPTW 803
             S    E+SE   DLHID R+GALL+  TVIPGKE+ET+ E+LQ T +   W+       
Sbjct: 688  PSDEGVEASEPLGDLHIDERIGALLEDITVIPGKELETIAEVLQMTLDGEQWERLAISEG 747

Query: 804  HGACVGEQKPGDQKVDELY-ISD--TKMKEAAELK-SGDKDHWVVCFDSDEWFSGRWSCK 859
                VGEQ   DQ  D++   SD  T +   ++   S DKD  V   D  +W SG WSCK
Sbjct: 748  FSDHVGEQL--DQSTDDVVEFSDFVTSVDSGSQKNVSSDKDFAV---DDGDWTSGPWSCK 802

Query: 860  GGDWKRNDEAAQDRCSRKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPP 919
            GGDW+RNDE+AQ+R  RKK VLNDGFPLCQM KSGYEDPRW+QKD+LYYPS S+RLDLPP
Sbjct: 803  GGDWRRNDESAQERNGRKKLVLNDGFPLCQMSKSGYEDPRWHQKDELYYPSQSKRLDLPP 862

Query: 920  WAYACPDERNDGSGGSRSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRA 979
            WA+ C D+R+               +RG KGTMLPV+RINACVV DHGSFVSEPR KVR 
Sbjct: 863  WAFTCLDDRS------------TLTIRGTKGTMLPVIRINACVVKDHGSFVSEPRMKVRG 910

Query: 980  KERHSSRSARSYSSANDVRRSSAESDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDL 1039
            K    SRS R +SS  D +RS A+ DS SK   +  S+ S K+ A ++ PKDRLC+ DDL
Sbjct: 911  KGH--SRS-RLFSSNTDGKRS-ADGDSLSKIARDVSSERSLKATAFVSIPKDRLCSYDDL 966

Query: 1040 QLQLGEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSAS 1099
            QL  G+WYYLDGAGHE GPSSFSELQ+LVD G IQK++SVFRKFD+VWVP+T   E S S
Sbjct: 967  QLHFGDWYYLDGAGHECGPSSFSELQLLVDHGIIQKNSSVFRKFDRVWVPVTSFAECSES 1026

Query: 1100 TVRNHGEKIMPSGDSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVM 1159
            T R   EKI   G+++  P + S D   G       SN FH +HPQF+GYTRGKLHELVM
Sbjct: 1027 TRRIQREKIPLLGETTKNPVSVSGDNSFG--GLATTSNMFHELHPQFVGYTRGKLHELVM 1084

Query: 1160 KSYKNREFAAAINEVLDPWINAKQPKKETEH-VYRKSEGDTRAGKRARLLVRESDGDEET 1218
            K YK+REFAAAIN+VLDPWINAKQPKKE E  ++ KS+G  RA KRAR+LV ESD D E 
Sbjct: 1085 KFYKSREFAAAINDVLDPWINAKQPKKEMEKTMHWKSDGSARAAKRARVLVDESDDDYEV 1144

Query: 1219 EEEL--QTIQDESTFEDLCGDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKS 1276
            +E+L     +DE  FEDLCGDA+FPGEES S  +ES  WG LDGH LA +FHFL+SD+KS
Sbjct: 1145 DEDLLHHRQKDEIAFEDLCGDATFPGEESTSLEVES--WGFLDGHILARIFHFLQSDLKS 1202

Query: 1277 LAFASLTCRHWRAAVRFYKGISRQVDLSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCT 1336
            L+FAS+TC+HWRAAVRFYK IS+QVDLSS+GPNCT+S     ++ +++EK+N I+LVGCT
Sbjct: 1203 LSFASVTCKHWRAAVRFYKDISKQVDLSSLGPNCTNSTFMNVMSTYNEEKVNFIVLVGCT 1262

Query: 1337 NITSGMLEEILQSFPHLSSIDIRGCGQFGELALKFPNINWVKSQKSRGAKFNDSRSKIRS 1396
            NIT  +LEEIL  FP L+SID+RGC QF +L  K+PNINWVK   +      ++ SK+RS
Sbjct: 1263 NITPVVLEEILGMFPQLASIDVRGCSQFNDLPSKYPNINWVKRSLNATKNNEETHSKMRS 1322

Query: 1397 LKQITEKSSSAPKSKGLGDDMDDFGDLKDYFESVDKRDSANQSFRRSLYQRSKVFDARKS 1456
            LK +T+KS S  K KGL  ++DDFG+LK YFESVDKR+SANQ FRRSLY+RSKVFDARKS
Sbjct: 1323 LKHLTDKSYSLSKIKGLSSNVDDFGELKQYFESVDKRESANQLFRRSLYKRSKVFDARKS 1382

Query: 1457 SSILSRDARMRRWSIKKSENGYKRMEEFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGY 1516
            SSI+SRDARMR+WSIKKSE GYKRM EFLASSLKEIMR NTFEFFVPKVAEI+ R++ GY
Sbjct: 1383 SSIVSRDARMRQWSIKKSEVGYKRMVEFLASSLKEIMRDNTFEFFVPKVAEIQDRIRNGY 1442

Query: 1517 YISHGLGSVKDDISRMCRDAIKAKNRGSAGDMNRITTLFIQLATRLEQGAKSSYYEREEM 1576
            YI  GLGSVK+DISRMCRDAIK                                    + 
Sbjct: 1443 YIKRGLGSVKEDISRMCRDAIKY-----------------------------------DE 1467

Query: 1577 MKSWKDESPAGL-YSATSKYKKKLSKMVSERKYMNRSNGTSLANGDFDYGEYASDREIRK 1635
            + SW+D+S   L  SA SKYK++L K+ +ERKY NRSNG+   NG  D+GEYASDREIR+
Sbjct: 1468 VSSWEDDSSLRLGSSAASKYKRRLGKVGTERKYTNRSNGSIFGNGALDHGEYASDREIRR 1527

Query: 1636 RLSKLNRKSLDSGSETSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAGDFT 1695
            RLS+LN+K + S SETSD+ D SS DGKS SE++ SDT+SD++F S GR  E+RG   F 
Sbjct: 1528 RLSRLNKKPIGSESETSDEFDRSSGDGKSGSENSASDTESDLEF-SSGRI-ETRGDKCFI 1585

Query: 1696 TDEGLDFS-DDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAE 1754
             DE  D + DDREWGARMTKASLVPPVTRKYE+ID+YV++ADEE+VRRKMRVSLP+DY E
Sbjct: 1586 LDEAFDSTMDDREWGARMTKASLVPPVTRKYELIDEYVVIADEEEVRRKMRVSLPDDYVE 1645

Query: 1755 KLNAQKNGSEELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNL 1814
            KLNAQKNG+EELDMELPEVKDYKPRK++GD+V EQEVYGIDPYTHNLLLDS+P+ELDW+L
Sbjct: 1646 KLNAQKNGAEELDMELPEVKDYKPRKKIGDEVLEQEVYGIDPYTHNLLLDSVPEELDWSL 1705

Query: 1815 LEKHLFIEDVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGI 1874
            ++KH+FIEDVLLRTLNKQ  HFTGTGNTPM YPL PVIEEIEK A  +CD+R M++C+GI
Sbjct: 1706 MDKHMFIEDVLLRTLNKQAIHFTGTGNTPMKYPLLPVIEEIEKVAAAECDIRIMRLCQGI 1765

Query: 1875 LKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQK 1934
            LKA+ SRP+DKYVAYRKGLGVVCNK+ GFGEDDFVVEFLGEVYPVWKW+EKQDGIRSLQK
Sbjct: 1766 LKAIHSRPEDKYVAYRKGLGVVCNKQEGFGEDDFVVEFLGEVYPVWKWYEKQDGIRSLQK 1825

Query: 1935 NNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHY 1994
            N++DPAPEFYNIYLERPKGD DGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHY
Sbjct: 1826 NDKDPAPEFYNIYLERPKGDGDGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHY 1885

Query: 1995 QIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLK 2054
            QIGIYT+R I YGEEITFDYNSVTESKEEYEASVCLCGS VCRGSYLNLTG+GAF KVL+
Sbjct: 1886 QIGIYTLRKIQYGEEITFDYNSVTESKEEYEASVCLCGSHVCRGSYLNLTGDGAFLKVLE 1945

Query: 2055 ELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLE 2114
            E HG+LD HQLMLEACELNSVSE+DYL+LGRAGLGSCLLGGLP+W+VAYSAR+VRFIN E
Sbjct: 1946 EWHGVLDCHQLMLEACELNSVSEDDYLDLGRAGLGSCLLGGLPDWLVAYSARVVRFINFE 2005

Query: 2115 RTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRC 2174
            RTKLP+EIL HNLEEKRKYFSDICL+VEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRC
Sbjct: 2006 RTKLPQEILAHNLEEKRKYFSDICLDVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRC 2065

Query: 2175 VFGDPKKAPPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDP 2234
            +FGDPK APPP++RLSPEE+VS++W GEGSLVEEL+  M PHVEED+++DLK KI+AHDP
Sbjct: 2066 IFGDPKNAPPPLKRLSPEESVSYIWNGEGSLVEELLLSMVPHVEEDLISDLKLKIRAHDP 2125

Query: 2235 SGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSP 2294
              S+DIQ+EL++SLLWLRDEVRN+PCTYK R+DAAADLIHIYAYTK FFR+QEYKA TSP
Sbjct: 2126 LCSDDIQKELQQSLLWLRDEVRNIPCTYKSRNDAAADLIHIYAYTKNFFRIQEYKAVTSP 2185

Query: 2295 PVYISPLDLGPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCL 2354
            PVYIS LDLGPKY DKLG   Q Y KTYG NYCLGQLIFWH Q N DPDC+LA ASRGCL
Sbjct: 2186 PVYISSLDLGPKYVDKLGTGFQEYCKTYGPNYCLGQLIFWHNQQNIDPDCSLALASRGCL 2245

Query: 2355 SLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPM 2414
            SLP+I SFYA+VQKPSR RVYGPKTV+FMLSRMEKQPQRPWPKDRIW+FK+SP++ GSPM
Sbjct: 2246 SLPEISSFYARVQKPSRQRVYGPKTVKFMLSRMEKQPQRPWPKDRIWSFKNSPKVIGSPM 2305

Query: 2415 LDSSLTGCPLDREMVHWLKHRPAIFQAMWDR 2445
            LD  L+  PL++++VHWLKHR  IFQAMWDR
Sbjct: 2306 LDVVLSNSPLEKDLVHWLKHRTPIFQAMWDR 2336


>gi|356544844|ref|XP_003540857.1| PREDICTED: probable histone-lysine N-methyltransferase ATXR3-like
            [Glycine max]
          Length = 2331

 Score = 2936 bits (7611), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1567/2476 (63%), Positives = 1857/2476 (75%), Gaps = 176/2476 (7%)

Query: 1    MGDGGVACMPLQQQQQHNSIMERFP-ISDKTTICVGNSSNNSNKTNNNSISNNNDNKTNN 59
            MGDGGVACMPLQQQ     ++ER P  + +  +C G S N  +                 
Sbjct: 1    MGDGGVACMPLQQQH----VIERLPNAAAEKALCGGKSGNGFD----------------- 39

Query: 60   DSSNNNGSSSSKNNETNKSNVKKNGVSTKTVRKKIVKIKKVIAVKKKEVQKNSGSSKSNN 119
                    S        +    K         KK+V         K E+  +   S+  N
Sbjct: 40   --------SGLLKVAGKRKKKVKVKKKVSPAAKKVV---------KSELTVDGVGSRGGN 82

Query: 120  NGENIDNKNVENGGAVGEVVTVDKENLKNEEVEEGELGTL--KWENGEFVQPEKSQPQSQ 177
                    +VE+G   GE+          +EVEEGELGTL  + ENGEFV PEK      
Sbjct: 83   --------DVESGEVCGEM----------DEVEEGELGTLGCELENGEFV-PEK----PV 119

Query: 178  LQSQSKQIEKGEIIVFSSKCRRGETEKGE--SGLWRGNK-DDIEKGEFIPDRWHK-EVVK 233
            +  +  +IE GEI+  S + ++GE E+GE  SG WR  + DDIEKGEFIPDRWH+ ++ +
Sbjct: 120  MLMRRSEIENGEIV--SERWKKGEVERGEFVSGKWRKEEDDDIEKGEFIPDRWHRGDMGR 177

Query: 234  DEYGYSKSRRYD------YKLER--TPPSGK-YSGEDVYRRKEFDRSGSQHSKSSSRWES 284
            D+YGY++ RRY       +K ER  TPPSG+ Y+G++ +R+KE +RSGSQH+KS+ RWES
Sbjct: 178  DDYGYARIRRYQPGRDKGWKNEREHTPPSGRYYTGDEHFRKKELNRSGSQHAKSAPRWES 237

Query: 285  GQERNVRISSKIVDDEGLYKGEHNNGKNHGREYFHGNRFKRHGTDSDSGDRKYYGDYGDF 344
            GQERN+RISSKIVD+E   K EH+N + H R+Y  GNR KRHG +S+  +RK   +YGD+
Sbjct: 238  GQERNIRISSKIVDEE---KNEHSNSRTHMRDYSSGNRLKRHGNESEGCERK---NYGDY 291

Query: 345  AGLKSRRLSDDYNSRSVHSEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRVIY 404
            AG KSRRLSDD + R  +SEHYSR SVE+ +RNSSS   +  DKYSSRHHE SL +R +Y
Sbjct: 292  AGSKSRRLSDD-SPRLAYSEHYSRLSVERSYRNSSSKSSA--DKYSSRHHE-SLPTRSVY 347

Query: 405  DRHGRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRD 464
            D+HGRSP +S+RSPHDR RYYDH+DR+P R   SPY+ DRSPY+ ++SP+ RERSPYNR+
Sbjct: 348  DKHGRSPGNSERSPHDRARYYDHKDRTPVR--PSPYSCDRSPYSSEKSPHGRERSPYNRN 405

Query: 465  RSPYAREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNN 524
                      +DRSRH+DH+ RSP  AERSPQDR R HDR D TPN +E+SP  R+R N 
Sbjct: 406  ----------WDRSRHHDHKMRSPTHAERSPQDRGRHHDRRDPTPNLIEQSPHDRTRSNM 455

Query: 525  HREASSKTGASEKRNARYDSKGHEDKLGPKDSNARCSRSSAKESQDKSNVQDLNVSDEKT 584
            HRE +SK  +SEK N+++  K +EDK   K++N      S  ESQ + NV + + S E  
Sbjct: 456  HREINSKISSSEKHNSQHSCKDYEDKHVQKEANL-----SDVESQGERNVHNASKSFEID 510

Query: 585  ANCESHKEEQPQSSSVDCKEPPQVDGPPLEELVSMEEDMDICDTPPHVPAVTDSSVGKWF 644
               E  KE+Q  + +V CK  P ++  P EEL SMEEDMDICDTPPHVP V DSS GKWF
Sbjct: 511  VCSEPEKEQQSSNPTVSCKGSPCLEPLP-EELASMEEDMDICDTPPHVPVVVDSSSGKWF 569

Query: 645  YLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSITS 704
            YLD+ G+E GPS+L D+K LV++GVL+SDHFIKH+DS+RW TVENAVSP+   +F S+ S
Sbjct: 570  YLDYNGVEHGPSKLSDIKVLVDDGVLMSDHFIKHIDSDRWLTVENAVSPVTAQSFLSVVS 629

Query: 705  DSVTQLVSPPEASGNLLADTGDTAQSTGEEF-----PVTLQSQCCPDGSAAAAESSEDLH 759
            +++TQLV+PPEA GNLLADTGD  QS  E +     P+ LQ   C + S  A+   EDLH
Sbjct: 630  ETITQLVNPPEAPGNLLADTGDILQSGPENYLGIPTPI-LQPMLCSEDSGIASVLLEDLH 688

Query: 760  IDVRVGALLDGFTVIPGKEIETLGEILQTTFERVDWQN----NGGPTWHGACVGEQKPGD 815
            ID RVG LL+G+ VIPG+E E + E LQ  FE   W+      G P  H  C+  +   D
Sbjct: 689  IDERVGVLLEGYDVIPGREFEAIKESLQMNFEYAKWEGLEECEGFPG-HDTCLRMEH--D 745

Query: 816  QKVDELYISDTKMKEAAELKSGDKDHWVVCFDSDEWFSGRWSCKGGDWKRNDEAAQDRCS 875
             ++D    S  + +    + SG ++ + +    D WFS +WSCKGGDWKRND+ AQDR  
Sbjct: 746  SRID----SSREYESQVSIPSGKENGFTLGVPGD-WFSAQWSCKGGDWKRNDD-AQDRYC 799

Query: 876  RKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYACPDERNDGSGGS 935
             KK VLNDGF LCQMPKSG EDPRW +KDDLYYPSHSRRLDLP WA+ C DER D S  S
Sbjct: 800  NKKLVLNDGFSLCQMPKSGCEDPRWTRKDDLYYPSHSRRLDLPVWAF-CTDERGDCSTLS 858

Query: 936  RSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSSRSARSYSSAN 995
            +  Q+KLA+VRGVKG +L VVRINACVV D GS VSE   K R+K+R+ SRS  S+SS +
Sbjct: 859  KPVQTKLASVRGVKGNILSVVRINACVVKDQGSLVSESCHKTRSKDRYPSRSTWSFSSTS 918

Query: 996  DVRRSSAESDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGEWYYLDGAGHE 1055
              +RSS E DS SKA N+Q S GS +S+  IN PKD   TV DLQL  G WYYLDG+G E
Sbjct: 919  YSKRSSTEEDSQSKASNDQGSLGSCRSMEFINIPKDYCRTVHDLQLHSGNWYYLDGSGRE 978

Query: 1056 RGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETS--ASTVRNHGEKIMPSGD 1113
            RGPSSFSELQ LVDQG ++K++SVFRK DK+WVP+T + ET     ++R+H E    SG+
Sbjct: 979  RGPSSFSELQRLVDQGIVKKYSSVFRKCDKLWVPVTSSAETYDFDVSLRSHQESSTLSGE 1038

Query: 1114 SSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNREFAAAINE 1173
             SGLP  Q   A +GE ++   SN F+++ PQF+GYTRGKLHELVM+SYK+REFAA INE
Sbjct: 1039 CSGLPSKQIHGASVGEHDS--KSNLFNSLQPQFVGYTRGKLHELVMRSYKSREFAAVINE 1096

Query: 1174 VLDPWINAKQPKKETE-HVYRKSEGDTRAGKRARLLVRESDGDEETEE-ELQTIQDESTF 1231
            VLDPWIN +QPKKETE   Y KSEGD  A KRAR+LV  S+ D + E+  L   +DESTF
Sbjct: 1097 VLDPWINTRQPKKETEKQTYWKSEGDGHASKRARMLVDYSEEDSDFEDGSLPNWKDESTF 1156

Query: 1232 EDLCGDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAV 1291
            E LCGDA+F GE S  +    G  GLLDG  L+ VFH LRSD+KSLAFAS+TC+HWRA V
Sbjct: 1157 EALCGDATFSGEGSDITDPNVGSLGLLDGCMLSRVFHCLRSDLKSLAFASMTCKHWRATV 1216

Query: 1292 RFYKGISRQVDLSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGMLEEILQSFP 1351
            RFYK +SR V+LSS+G +CTDS++   LNA++K+K+ SI+L+GCTNIT+GMLE+IL  FP
Sbjct: 1217 RFYKKVSRHVNLSSLGHSCTDSIMWNILNAYEKDKIESIVLIGCTNITAGMLEKILLLFP 1276

Query: 1352 HLSSIDIRGCGQFGELALKFPNINWVKSQKSRGAKFNDSRSKIRSLKQITEKSSSAPKSK 1411
             LS++DIRGC QFGEL LKF N+ W+KS  S   K      KIRS+KQ  E++SS  K  
Sbjct: 1277 GLSTVDIRGCSQFGELTLKFTNVKWIKSHSSHITKIASESHKIRSVKQFAEQTSSVSKVS 1336

Query: 1412 GLGDDMDDFGDLKDYFESVDKRDSANQSFRRSLYQRSKVFDARKSSSILSRDARMRRWSI 1471
             LG   DDFG+LKDYF+SVDKRD+A Q FR++LY+RSK++DAR SSSILSRDAR RRW I
Sbjct: 1337 ILG-IRDDFGELKDYFDSVDKRDTAKQLFRQNLYKRSKLYDARNSSSILSRDARTRRWPI 1395

Query: 1472 KKSENGYKRMEEFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGLGSVKDDISR 1531
            KKSE+GYKRME+FLAS L+EIM+ N+ +FF+PKVAEIE +MK GYY  HGL  VK+DISR
Sbjct: 1396 KKSESGYKRMEQFLASRLREIMKANSCDFFMPKVAEIEAKMKNGYYSGHGLSYVKEDISR 1455

Query: 1532 MCRDAIKAKNRGSAGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKDESPAGLYSA 1591
            MCRDAIK                                   + +MK W ++ P+ L S 
Sbjct: 1456 MCRDAIK-----------------------------------DALMKLWGNDPPSSLCST 1480

Query: 1592 TSKYKKKL-SKMVSERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRKSLDSGSE 1650
            +SKYKK   ++++SERK+ N        +G  D GEYASDREIR+RLSKLN+K  +S SE
Sbjct: 1481 SSKYKKSKENRLLSERKHRNNE-----THGGLDNGEYASDREIRRRLSKLNKKYFNSESE 1535

Query: 1651 TSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLDF-SDDREWG 1709
            TSDD D SSEDGKSDS++T +DT+SD D  S+ R  +SRG G FT D+GL F +D+REWG
Sbjct: 1536 TSDDFDRSSEDGKSDSDTTTTDTESDQDVHSESRIGDSRGDGYFTPDDGLHFITDEREWG 1595

Query: 1710 ARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEELDME 1769
            ARMTKASLVPPVTRKY+VIDQY+IVADEEDVRRKMRVSLP+DYAEKL+AQKNG EE DME
Sbjct: 1596 ARMTKASLVPPVTRKYDVIDQYIIVADEEDVRRKMRVSLPDDYAEKLSAQKNGIEESDME 1655

Query: 1770 LPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTL 1829
            LPEVKDYKPRKQL ++V EQEVYGIDPYTHNLLLDSMP ELDW+L EKHLFIED LLR L
Sbjct: 1656 LPEVKDYKPRKQLENEVVEQEVYGIDPYTHNLLLDSMPKELDWSLQEKHLFIEDKLLRML 1715

Query: 1830 NKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAY 1889
            NKQV+HFTGTGNTPM YPLQP IEEIE+ A + CD RT++MC+GILKA+ SR DDKYVAY
Sbjct: 1716 NKQVKHFTGTGNTPMSYPLQPAIEEIERYAEEHCDARTVRMCQGILKAIKSRSDDKYVAY 1775

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            RKGLGVVCNKE GFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKN++DPAPEFYNIYLE
Sbjct: 1776 RKGLGVVCNKEEGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNSDDPAPEFYNIYLE 1835

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
            RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIY+VR I +GEE
Sbjct: 1836 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYSVREIQHGEE 1895

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEA 2069
            ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKE HG+LDRH LMLEA
Sbjct: 1896 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKEWHGILDRHYLMLEA 1955

Query: 2070 CELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEE 2129
            CELNSVSEEDY +LGRAGLGSCLLGGLP+W+V+Y+ARLVRFIN ERTKLPEEIL+HNLEE
Sbjct: 1956 CELNSVSEEDYNDLGRAGLGSCLLGGLPDWLVSYAARLVRFINFERTKLPEEILKHNLEE 2015

Query: 2130 KRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERL 2189
            KRKYFSDICLEVE+SDAEVQAEGVYNQRLQNLAVTLDKVRYVMRC+FGDP KAPPP+E+L
Sbjct: 2016 KRKYFSDICLEVERSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCIFGDPLKAPPPLEKL 2075

Query: 2190 SPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLL 2249
            SPE  VSFLWKGE S VEEL+QC+AP+VEE  LNDLKSKI AHDPS S DIQ+ ++KSLL
Sbjct: 2076 SPEAVVSFLWKGEDSFVEELLQCLAPYVEESTLNDLKSKIHAHDPSSSGDIQKAVQKSLL 2135

Query: 2250 WLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYAD 2309
            WLRDEVRNLPCTYKCRHDAAADLIHIYAYTK FFR+Q+Y+  TSPPVYISPLDLGPKYAD
Sbjct: 2136 WLRDEVRNLPCTYKCRHDAAADLIHIYAYTKYFFRIQDYQTITSPPVYISPLDLGPKYAD 2195

Query: 2310 KLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKP 2369
            KLGA  Q YRK YGENYCLGQLIFWH Q+NA+PDCTLAR SRGCLSLPDI SFYAK QKP
Sbjct: 2196 KLGAGFQEYRKIYGENYCLGQLIFWHNQSNAEPDCTLARISRGCLSLPDISSFYAKAQKP 2255

Query: 2370 SRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSLTGCPLDREMV 2429
            SRHRVYGP+TVR ML+RMEKQPQ+PWPKDRIW+FK+SP+ FGSPMLD+ +   PLDREMV
Sbjct: 2256 SRHRVYGPRTVRSMLARMEKQPQKPWPKDRIWSFKNSPKYFGSPMLDAVINNSPLDREMV 2315

Query: 2430 HWLKHRPAIFQAMWDR 2445
            HWLKHRPAIFQA+WD+
Sbjct: 2316 HWLKHRPAIFQALWDQ 2331


>gi|356547055|ref|XP_003541933.1| PREDICTED: probable histone-lysine N-methyltransferase ATXR3-like
            [Glycine max]
          Length = 2351

 Score = 2898 bits (7513), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1568/2474 (63%), Positives = 1852/2474 (74%), Gaps = 152/2474 (6%)

Query: 1    MGDGGVACMPLQQQQQHNSIMERFPISDKTTICVGNSSNNSNKTNNNSISNNNDNKTNND 60
            MGDGGVACMPLQ       IMER P ++K T+C G S N  N +     +     K    
Sbjct: 1    MGDGGVACMPLQY------IMERLPSAEK-TVCRGKSGNGFN-SKLLKFAGKERRKMKPR 52

Query: 61   SSNNNGSSSSKNNETNKSNVKKNGVSTKTVRKKIVKIKKVIAVKKKEVQKNSGSSKSNNN 120
             S       SK N +  SN  +NG                  V+KK+ Q      +    
Sbjct: 53   KSELGLDRVSKRNSS--SNDVENGGE----------------VEKKQ-QHEKVQKEEVEE 93

Query: 121  GE----NIDNKNVENGGAVGEVVTVDKENLKNEEVEEGELGTLKWENGEFVQPEKSQPQS 176
            GE         ++ENG  V E++ +     +  EVE GE+ + KW+  E  + E      
Sbjct: 94   GELGTLKWPRADLENGEFVPEMLPLPPP--RRGEVENGEIVSEKWKARELEKGEVGFG-- 149

Query: 177  QLQSQSKQIEKGEIIVFSSKCRRGETEKGESGLWRGNKDDIEKGEFIPDRWHKEVVKDEY 236
              + + +++E+ E IV     R+GE E+GE G WRG KD+IEKGEFIPDRW+    K +Y
Sbjct: 150  --KWRKEEVERRE-IVSEKGGRKGEAERGEYGSWRGGKDEIEKGEFIPDRWY----KGDY 202

Query: 237  GYSKSRRYD------YKLER----TPPSGKYSGEDVYRRKEFDRSGSQHSKSSSRWE-SG 285
              S++RR+       +K ER    TP SG+Y+G+D +R+KE +RSGSQH KSS RWE  G
Sbjct: 203  DNSRNRRHHSGRDKGWKAEREHESTPSSGRYTGDDFFRKKELNRSGSQHVKSSPRWEGGG 262

Query: 286  QERNVRISSKIVDDEGLYKGEHNNGKNHGREYFHGNRFKRHGTDSDSGDRKYYGDYGDFA 345
            Q+RNVRISSKIV DE   K  H+NGK+H R+Y  G+R KR G D+DS +RK   DY   A
Sbjct: 263  QQRNVRISSKIVHDE---KNVHSNGKDHTRDYSSGSRLKRLGNDTDSYERKQSADY---A 316

Query: 346  GLKSRRLSDDYNSRSVHSEHYSRH---SVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRV 402
            GLKSRRLSDD + R V+SE+YS H   SVE+ +RN++ +++S+ DKYS R+HE SLS+R 
Sbjct: 317  GLKSRRLSDD-SCRQVYSENYSCHSPRSVERSYRNNNGTKLSA-DKYSCRNHESSLSTRP 374

Query: 403  IYDRHGRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYN 462
             YDRHGRSP HS+RSP DRGRYYDHR+R+P R  RSP  RDRSPY +++SPY RE+SPY 
Sbjct: 375  AYDRHGRSPGHSERSPRDRGRYYDHRERTPVR--RSPCGRDRSPYNWEKSPYGREKSPYM 432

Query: 463  RDRSPYAREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRP 522
            R+          +DRSR +DH+ RSP  AE+SP DR+R HDR D TPN  E SPL R+R 
Sbjct: 433  RN----------WDRSRQHDHKLRSPTHAEQSPPDRSRRHDRRDCTPNLAEASPLDRARK 482

Query: 523  NNHREASSKTGASEKRNARYDSKGHEDKLGPKDSNARCSRSSAKESQDKSNVQDLNVSDE 582
            N+  E+SSKT +SEK +++   K  EDK   ++SN      S+ ESQ + +VQ    S E
Sbjct: 483  NSRHESSSKTLSSEKHDSQNSCKDREDKQIQRESNC-----SSTESQSEKSVQVTIKSVE 537

Query: 583  KTANCESHKEEQPQSSSVDCKEPPQVDGPPLEELVSMEEDMDICDTPPHVPAVTDSSVGK 642
            K    E  KE+Q  S +V  KE P  + PP EEL SMEEDMDICDTPPHVP VTD S GK
Sbjct: 538  KDICSEPVKEQQSCSPTVSHKESPHSEPPP-EELPSMEEDMDICDTPPHVPVVTDLSSGK 596

Query: 643  WFYLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSI 702
            W+YLD+ G+E GP++LCD+K LV+EGVL+SDHFIKHLDS+RW TVENA SPLV  +F SI
Sbjct: 597  WYYLDYGGVENGPAKLCDIKVLVDEGVLMSDHFIKHLDSDRWLTVENAASPLVRQSFASI 656

Query: 703  TSDSVTQLVSPPEASGNLLADTGDTAQSTGEEFPVTL----QSQCCPDGSAAAAESSEDL 758
             SD++TQLV+PPEA GN+L+D  D   S  +     L    Q + CP+ S    E  EDL
Sbjct: 657  ASDTITQLVNPPEAPGNILSDAADILHSAPDNHQEMLTPLRQPRVCPNDSVFTFELLEDL 716

Query: 759  HIDVRVGALLDGFTVIPGKEIETLGEILQTTFERVD---WQNNGGPTWHGACVGEQKPGD 815
            HI+ RV  LL+G+ V PG E+E + E LQ  FE       ++  G  W  +CVGE    D
Sbjct: 717  HIEERVRNLLEGYDVTPGMELEAIKEALQMNFENAKGEGLEDYEGFLWSVSCVGED--WD 774

Query: 816  QKVDELYISDTKMKEAAELKSGDKDHWVVCFDSDEWFSGRWSCKGGDWKRNDEAAQDRCS 875
               D L   D+   E+    S DKD+      S +WFS RWSCKGGDWKRND+ AQDR S
Sbjct: 775  SSTD-LASRDS---ESQSSMSCDKDNGHAFGVSSDWFSTRWSCKGGDWKRNDD-AQDRYS 829

Query: 876  RKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYACPDERNDGSGGS 935
            RKK VLN+GFPLCQMPKSG EDPRW QKDDLY+PS SR+LDLP WA+ C DER+D S  S
Sbjct: 830  RKKLVLNNGFPLCQMPKSGCEDPRWPQKDDLYFPSQSRKLDLPLWAF-CADERDDCSVAS 888

Query: 936  RSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSSRSARSYSSAN 995
            +S QSK A+VRGVKG +L VVRINACVV D GS VSE R K R KERH SR AR +SS +
Sbjct: 889  KSVQSKPASVRGVKGNVLSVVRINACVVKDQGSLVSESRHKTRVKERHHSRPARPFSSIS 948

Query: 996  DVRRSSAESDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGEWYYLDGAGHE 1055
            D +RSS E D  SKA ++   Q S++ +  INTPKD  CT+ +LQL LG+WYYLDG+G E
Sbjct: 949  DSKRSSTEQD-QSKAVSD---QVSYQILEFINTPKDHRCTIRELQLHLGDWYYLDGSGRE 1004

Query: 1056 RGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRNHGEKIMPSGDSS 1115
            RGPSSFSELQ  VDQG I+KH+SVFRK DK+WVP+T ATETS  ++ +  E    SG  S
Sbjct: 1005 RGPSSFSELQYFVDQGIIKKHSSVFRKSDKLWVPITSATETSDGSLMDQQESSSISGACS 1064

Query: 1116 GLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNREFAAAINEVL 1175
            G P  Q+Q    GE     NS+ F+++HPQF+GYTRGKLHELVMKSYK+REFAAAINEVL
Sbjct: 1065 GFPSKQTQVVSCGEP--YTNSSLFNSLHPQFVGYTRGKLHELVMKSYKSREFAAAINEVL 1122

Query: 1176 DPWINAKQPKKETE-HVYRKSEGDTRAGKRARLLVRESDGDEETEE-ELQTIQDESTFED 1233
            DPWINA+QPKKE E  +Y KSEGD  A KRAR+LV +S+ D + E+ ++   +DESTFED
Sbjct: 1123 DPWINARQPKKEIEKQIYWKSEGDAHAAKRARMLVDDSEDDIDLEDGDVNIEKDESTFED 1182

Query: 1234 LCGDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRF 1293
            LCGDA+FP EE   +  + G W  LDGH LA VFHFL+SD+KSL FAS+TC+HWRAAVRF
Sbjct: 1183 LCGDATFPEEEIGITDTDLGSWSNLDGHVLARVFHFLKSDLKSLVFASMTCKHWRAAVRF 1242

Query: 1294 YKGISRQVDLSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGMLEEILQSFPHL 1353
            YK +S QV+LSS+G +CTD+++   LNA++K+K+NS++L GC NIT+ MLE+IL SFP L
Sbjct: 1243 YKEVSIQVNLSSLGHSCTDTMLWNILNAYEKDKINSVILRGCVNITADMLEKILFSFPGL 1302

Query: 1354 SSIDIRGCGQFGELALKFPNINWVKSQKSRGAKFNDSRSKIRSLKQITEKSSSAPKSKGL 1413
             +IDIRGC QFGEL LKF N+ W+KS+ S   K  +   KIRSLK ITE +SS  KS  L
Sbjct: 1303 FTIDIRGCNQFGELTLKFANVKWIKSRSSHLTKIAEESHKIRSLKHITELTSSVSKSISL 1362

Query: 1414 GDDMDDFGDLKDYFESVDKRDSANQSFRRSLYQRSKVFDARKSSSILSRDARMRRWSIKK 1473
            G  +DDFG LKDYF+SVDKRD+  Q FR++LY+RSK++DARKSSSILSRDAR RRW+IKK
Sbjct: 1363 G--IDDFGQLKDYFDSVDKRDN-KQLFRQNLYKRSKLYDARKSSSILSRDARTRRWAIKK 1419

Query: 1474 SENGYKRMEEFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGLGSVKDDISRMC 1533
            SE+GYKRMEEFLA  L+EIM+ N+ +FFV KVAEIE +MK GYY S GL SVK+DISRMC
Sbjct: 1420 SESGYKRMEEFLALRLREIMKTNSCDFFVLKVAEIEAKMKSGYYSSRGLNSVKEDISRMC 1479

Query: 1534 RDAIKAKNRGSAGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKDESPAGLYSATS 1593
            RDAIK                                     ++KSW ++ PAG  S  S
Sbjct: 1480 RDAIK-----------------------------------NALLKSWDNDLPAGSCSTFS 1504

Query: 1594 KYKKKLSKMVSERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRKSLDSGSETS- 1652
            KYKK  +++V+ERKY  RSNGT   +G  D  EY SDREIR+RLSKLN+KS+DS SETS 
Sbjct: 1505 KYKK--NRLVNERKY--RSNGT---HGGLDNVEYTSDREIRRRLSKLNKKSMDSESETSD 1557

Query: 1653 DDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLDF-SDDREWGAR 1711
            DDLD S E+GKSD+++T SD++SD +   +  +RESRG G FT++E L F +DDREWGAR
Sbjct: 1558 DDLDKSYEEGKSDTDTTTSDSESDREVHPESLSRESRGDGYFTSEEELGFITDDREWGAR 1617

Query: 1712 MTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEELDMELP 1771
            MTKASLVPPVTRKYEVIDQY IVADEEDVRRKMRVSLP+DYAEKL+AQKNG+EE DMELP
Sbjct: 1618 MTKASLVPPVTRKYEVIDQYCIVADEEDVRRKMRVSLPDDYAEKLSAQKNGTEESDMELP 1677

Query: 1772 EVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNK 1831
            EVKDYKPRKQLG++V EQEVYGIDPYTHNLLLDSMP+ELDW+L EKHLFIED LLRTLNK
Sbjct: 1678 EVKDYKPRKQLGNEVIEQEVYGIDPYTHNLLLDSMPEELDWSLQEKHLFIEDTLLRTLNK 1737

Query: 1832 QVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAYRK 1891
            QVR+FTG G+TPM Y L+ VIE+I+K A +DCD R +KMC+GILKA+DSRPDDKYVAYRK
Sbjct: 1738 QVRNFTGNGSTPMSYSLRSVIEDIKKFAEEDCDARMVKMCQGILKAIDSRPDDKYVAYRK 1797

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            GLGVVCNKE GF EDDFVVEFLGEVYPVWKWFEKQDGIRSLQK+++DPAPEFYNIYLERP
Sbjct: 1798 GLGVVCNKEEGFAEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKDSKDPAPEFYNIYLERP 1857

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            KGDADGYDLVVVDAMH ANYASRICHSCRPNCEAKVTAVDG YQIGIY++R I +GEEIT
Sbjct: 1858 KGDADGYDLVVVDAMHMANYASRICHSCRPNCEAKVTAVDGQYQIGIYSLREIQHGEEIT 1917

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACE 2071
            FDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF+KVLK+ HG+LDRH LMLEACE
Sbjct: 1918 FDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFQKVLKDSHGILDRHCLMLEACE 1977

Query: 2072 LNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKR 2131
            LNSVSEEDY +LGRAGLGSCLLGGLP+W+VAY+ARLVRFIN ERTKLPEEIL+HNLEEKR
Sbjct: 1978 LNSVSEEDYNDLGRAGLGSCLLGGLPDWLVAYAARLVRFINFERTKLPEEILKHNLEEKR 2037

Query: 2132 KYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSP 2191
            KYFSDI LEVE+SDAEVQAEGVYNQRLQNLAVTLDKVRYVMRC+FGDP+KAPPP+E+LSP
Sbjct: 2038 KYFSDIILEVERSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCIFGDPRKAPPPLEKLSP 2097

Query: 2192 EETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLWL 2251
            E TVSFLWKGEGS VEEL+QC+ PHVEE +LNDLK KI AHDPS S DIQ+ELRKSLLWL
Sbjct: 2098 EATVSFLWKGEGSFVEELVQCITPHVEEGILNDLKFKIHAHDPSNSGDIQKELRKSLLWL 2157

Query: 2252 RDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKL 2311
            RDEVRNLPCTYKCRHDAAADLIHIYAYTK FFR++ Y+  TSPPVYISPLDLGPKY +KL
Sbjct: 2158 RDEVRNLPCTYKCRHDAAADLIHIYAYTKYFFRIRNYQTITSPPVYISPLDLGPKYTNKL 2217

Query: 2312 GADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSR 2371
            GA+ Q YRK YGENYCLGQLIFWH Q+NADPD +LARASRGCLSLPD  SFYAK QKPSR
Sbjct: 2218 GAEFQEYRKIYGENYCLGQLIFWHNQSNADPDRSLARASRGCLSLPDTNSFYAKAQKPSR 2277

Query: 2372 HRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSLTGCPLDREMVHW 2431
            H VYGP+TVR ML+RMEK PQR WPKDRIW+FKSSP+ FGSPMLD+ +   PLDREMVHW
Sbjct: 2278 HCVYGPRTVRSMLARMEKLPQRSWPKDRIWSFKSSPKFFGSPMLDAVVNNSPLDREMVHW 2337

Query: 2432 LKHRPAIFQAMWDR 2445
             KHRPAIFQAMWDR
Sbjct: 2338 FKHRPAIFQAMWDR 2351


>gi|297739332|emb|CBI28983.3| unnamed protein product [Vitis vinifera]
          Length = 2199

 Score = 2776 bits (7197), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1493/2354 (63%), Positives = 1747/2354 (74%), Gaps = 281/2354 (11%)

Query: 129  VENGGAVGEVVTVDKENLKNEEVEEGELGTLKWENGEFVQPEKSQPQSQLQSQSKQIEKG 188
            +ENG    + +         EEVEEGELGTLKW  GE V+  + +P+   +S S   EKG
Sbjct: 90   IENGEICNDKIV-------KEEVEEGELGTLKWPKGE-VENGEFEPEKPRRSDS---EKG 138

Query: 189  EIIVFSSKCRRGETEKGES------------GLWRGNKDDIEKGEFIPDRWHKEVVKDEY 236
            EI+  + K R+GE EKGE             G WRG+KD++EKGEFIPDRW ++V +D Y
Sbjct: 139  EIV--AEKSRKGEVEKGEFRFRKGDGEKADFGSWRGSKDELEKGEFIPDRWQRDVGRDGY 196

Query: 237  GYSKSRR------------YDYKLERTPPSGKYSGEDVYRRKEFDRSGSQHSKSSSRWES 284
            G SK RR            YD++ ERTPPSGK                            
Sbjct: 197  GCSKMRRHELAKDKGWKFEYDHERERTPPSGK---------------------------- 228

Query: 285  GQERNVRISSKIVDDEGLYKGEHNNGKNHGREYFHGNRFKRHGTDSDSGDRKYYGDYGDF 344
                NVRISSKIVDDEG YK EHN+ KNHGRE     R KR+GTDSD  +RK++G+YGD 
Sbjct: 229  ----NVRISSKIVDDEGTYKTEHNSSKNHGRELVSRTRMKRYGTDSDGSERKHHGEYGDH 284

Query: 345  AGLKSRRLSDDYNSRSVHSEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRVIY 404
             G K R+LSDD N R+VH EHYSR S+E+ +RNSSSSRISS D++SSRH+E S SS+V++
Sbjct: 285  MGSKIRKLSDDSN-RTVHLEHYSRRSMERSYRNSSSSRISSSDRFSSRHYESSFSSKVVH 343

Query: 405  DRHGRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRD 464
            DRHGRSP HS+RSP DR RY+DH              RDRSP                  
Sbjct: 344  DRHGRSPVHSERSPRDRARYHDH--------------RDRSPAY---------------- 373

Query: 465  RSPYAREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNN 524
            RS   R++SPYDRSRHYDHRNRSP   ERSPQDR R+H+R DRTP YLERSPL  SRPNN
Sbjct: 374  RSSPRRDRSPYDRSRHYDHRNRSPAPTERSPQDRPRYHERRDRTPTYLERSPLDHSRPNN 433

Query: 525  HREASSKTGASEKRNARYDSKGHEDKLGPKDSNARCSRSSAKESQDKSNVQDLNV--SDE 582
            +REAS K GA EKR+ +Y +K  E+KL  +D+N R    SAKESQD+S++  +N   SDE
Sbjct: 434  YREASCKGGAGEKRHGQYGNKVQEEKLNQRDANGRDPHFSAKESQDRSSLHTVNGHGSDE 493

Query: 583  KTANCESHKEEQPQSSSVDCKEPPQVDGPPLEELVSMEEDMDICDTPPHVPAVTDSSVGK 642
            K+AN + HKEE+PQS  V+ +EPPQ+   P EEL SMEEDMDI                 
Sbjct: 494  KSANHQPHKEEKPQSPCVNLEEPPQITVAP-EELASMEEDMDI----------------- 535

Query: 643  WFYLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSI 702
                                       LVSDH IKH+D                      
Sbjct: 536  --------------------------FLVSDHLIKHVD---------------------- 547

Query: 703  TSDSVTQLVSPPEASGNLLADTGDTAQST---GEEFPVTL-QSQCCPDGSAAAAESSEDL 758
                        +A GNLLA+ GD  +S+    EE P TL QS  C + S+ A+E  EDL
Sbjct: 548  ------------KAPGNLLAEAGDATESSKLLDEETPATLLQSMSCNNDSSTASEPLEDL 595

Query: 759  HIDVRVGALLDGFTVIPGKEIETLGEILQTTFERVDWQNNG--GPTWHGACVGEQKPGDQ 816
             ID RV ALL GFTVIPG+E+ETLGE+LQ +FE   W+  G  G +WH   +GEQ   DQ
Sbjct: 596  QIDERVRALLKGFTVIPGRELETLGEVLQVSFEHAQWEKLGAEGLSWHQPRIGEQ--FDQ 653

Query: 817  KVDEL-YISDTKMKEAAELKSGDKDHWVVCF---DSDEWFSGRWSCKGGDWKRNDEAAQD 872
            + DE     +   KEA++ +S         F   D  +WFS RW+ KGGDWKRNDE+AQD
Sbjct: 654  RTDEFSRYPEITSKEASDSRSSTSSDKDYAFAFGDFSDWFSARWASKGGDWKRNDESAQD 713

Query: 873  RCSRKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYACPDERNDGS 932
            R SRKK VLNDG+PLCQMPKSGYEDPRW++KD+LYYPSH R+LDLP WA++ PDER+D +
Sbjct: 714  RLSRKKLVLNDGYPLCQMPKSGYEDPRWHRKDELYYPSHGRKLDLPIWAFSWPDERSDSN 773

Query: 933  GGSRSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSSRSARSYS 992
              SR++Q K   VRGVKG+MLPVVRINACV        SEP +KVR K+R+SSRSAR+YS
Sbjct: 774  SASRASQIK-PVVRGVKGSMLPVVRINACV--------SEPPAKVRGKDRYSSRSARAYS 824

Query: 993  SANDVRRSSAESDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGEWYYLDGA 1052
            S  DV+RSSAES SHSK+ +  DSQGSWK I  INTPKDRLCT +DLQL LG+WYYLDGA
Sbjct: 825  STTDVKRSSAESASHSKSVSENDSQGSWKCITSINTPKDRLCTAEDLQLHLGDWYYLDGA 884

Query: 1053 GHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRNHGEKIMPSG 1112
            GHE+GPSSFSELQ LVDQG IQKH+SVFRK DK+W  +T       + + N         
Sbjct: 885  GHEQGPSSFSELQALVDQGSIQKHSSVFRKNDKIWNNVTSTDYHCTAYILN--------- 935

Query: 1113 DSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNREFAAAIN 1172
              S + P +        +N+ V++++ H ++       RG+   LV  S  + E    + 
Sbjct: 936  --SLVIPKEM-------ANSAVSNSSLHDLNKFRTSGIRGRW--LVDGSEDDYEMEEDV- 983

Query: 1173 EVLDPWINAKQPKKETEHVYRKSEGDTRAGKRARLLVRESDGDEETEEELQTIQDESTFE 1232
                                              LLV++   DE T E+L          
Sbjct: 984  ----------------------------------LLVQK---DESTFEDL---------- 996

Query: 1233 DLCGDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVR 1292
              C DA+F  E+ A + + S  WGLLDG+ LA VFHFLR+D+KSLAFA+LTC+HWRAAVR
Sbjct: 997  --CSDATFYQEDIALAEMGSENWGLLDGNVLARVFHFLRTDVKSLAFAALTCKHWRAAVR 1054

Query: 1293 FYKGISRQVDLSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGMLEEILQSFPH 1352
            FYKG+SRQVDLSSVG  CTDS I   +N ++KE++ S++L+GCTNIT GMLE++L SFP 
Sbjct: 1055 FYKGVSRQVDLSSVGSLCTDSTIWSMINGYNKERITSMILIGCTNITPGMLEDVLGSFPS 1114

Query: 1353 LSSIDIRGCGQFGELALKFPNINWVKSQKSRGAKFNDSRSKIRSLKQITEKSSSAPKSKG 1412
            LSSIDIRGC QF ELA KF N+NW+KS+      F +S SKI++LKQITE+ S +   KG
Sbjct: 1115 LSSIDIRGCSQFWELADKFSNLNWIKSRIRVMKVFEESYSKIKALKQITERPSVSKPLKG 1174

Query: 1413 LGDDMDDFGDLKDYFESVDKRDSANQSFRRSLYQRSKVFDARKSSSILSRDARMRRWSIK 1472
            +G  +DD  +LK+YF+SVD+R+SA+QSFRRS Y+RSK+FDAR+SSSILSRDARMRRWSIK
Sbjct: 1175 MGSHVDDSSELKEYFDSVDRRESASQSFRRSYYKRSKLFDARRSSSILSRDARMRRWSIK 1234

Query: 1473 KSENGYKRMEEFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGLGSVKDDISRM 1532
             SENGYKRMEEFLASSL++IM+ NTF+FFVPKVAEIE RMK GYY  HGL SVK+DISRM
Sbjct: 1235 NSENGYKRMEEFLASSLRDIMKENTFDFFVPKVAEIEDRMKNGYYAGHGLSSVKEDISRM 1294

Query: 1533 CRDAIKAKNRGSAGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKDESPAGLYSAT 1592
            CRDAIKAKNRG +G+MNRI TLFI+LAT LE+G+KSS   REEM++ WKDESP+GL S+ 
Sbjct: 1295 CRDAIKAKNRGDSGNMNRIITLFIRLATCLEEGSKSS-NGREEMVRRWKDESPSGLCSSG 1353

Query: 1593 SKYKKKLSKMVSERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRKSLDSGSETS 1652
            SKYKKKL+K+V+ERK+  RSNG S      DYGEYASDREIR+RLSKLN+KS+DSGS+TS
Sbjct: 1354 SKYKKKLNKIVTERKH--RSNGGS------DYGEYASDREIRRRLSKLNKKSMDSGSDTS 1405

Query: 1653 DDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGL-DFSDDREWGAR 1711
            DDLD SSE G S SEST SDT+SD+DFRS+G   ESR  G FT DEGL   +DDREWGAR
Sbjct: 1406 DDLDRSSEGGSSGSESTASDTESDLDFRSEGGVAESRVDGYFTADEGLYSMTDDREWGAR 1465

Query: 1712 MTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEELDMELP 1771
            MTK SLVPPVTRKYEVI+QYVIVADE++V+RKM+VSLPE Y EKL AQKNG+EE DME+P
Sbjct: 1466 MTKVSLVPPVTRKYEVIEQYVIVADEDEVQRKMKVSLPEHYNEKLTAQKNGTEESDMEIP 1525

Query: 1772 EVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNK 1831
            EVKDYKPRKQLGD+V EQEVYGIDPYTHNLLLDSMP+ELDW LLEKHLFIE+VLL TLNK
Sbjct: 1526 EVKDYKPRKQLGDEVIEQEVYGIDPYTHNLLLDSMPEELDWPLLEKHLFIEEVLLCTLNK 1585

Query: 1832 QVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAYRK 1891
            QVRHFTGTGNTPMMY LQPV+E+I+K A ++ D+RT+KMC+GILKAM+SRPDD YVAYRK
Sbjct: 1586 QVRHFTGTGNTPMMYHLQPVVEDIQKTAEEELDLRTLKMCQGILKAMNSRPDDNYVAYRK 1645

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            GLGVVCNKEGGF ++DFVVEFLGEVYP WKWFEKQDGIRSLQKN++DPAPEFYNIYLERP
Sbjct: 1646 GLGVVCNKEGGFSQEDFVVEFLGEVYPAWKWFEKQDGIRSLQKNSKDPAPEFYNIYLERP 1705

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAV+G YQIGIYTVR I YGEEIT
Sbjct: 1706 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVEGQYQIGIYTVRQIQYGEEIT 1765

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACE 2071
            FDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF+KVLKE HG+LDR+Q+M EACE
Sbjct: 1766 FDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFQKVLKECHGILDRYQMMFEACE 1825

Query: 2072 LNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKR 2131
            LN VSEEDY++LGRAGLGSCLLGGLP+W++AY+ARLVRFIN ERTKLPEEILRH+L+EKR
Sbjct: 1826 LNMVSEEDYIDLGRAGLGSCLLGGLPDWLIAYAARLVRFINFERTKLPEEILRHSLDEKR 1885

Query: 2132 KYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSP 2191
            KYF+DI LEVEKSDAE+QAEGVYNQRLQNLA+TLDKVRYVMRCVFGDPKKAPPP+ERLS 
Sbjct: 1886 KYFADISLEVEKSDAELQAEGVYNQRLQNLALTLDKVRYVMRCVFGDPKKAPPPLERLSA 1945

Query: 2192 EETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLWL 2251
            EE VSFLW GEGSLVEEL+QCMAPH+E+ +L++LK KI+AHDPSGS+DI +EL+KSLLWL
Sbjct: 1946 EEVVSFLWNGEGSLVEELLQCMAPHMEDGMLSELKPKIRAHDPSGSDDIHKELQKSLLWL 2005

Query: 2252 RDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKL 2311
            RDEVRNLPC YKCRHDAAADLIHIYAYTKCFFRV+EYK+ TSPPVYISPLDLGPKY+DKL
Sbjct: 2006 RDEVRNLPCNYKCRHDAAADLIHIYAYTKCFFRVREYKSVTSPPVYISPLDLGPKYSDKL 2065

Query: 2312 GADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSR 2371
            G+ +Q Y KTYGENYCLGQLI+WH QTNADPDC LARASRGCLSLPDIGSFYAKVQKPSR
Sbjct: 2066 GSGIQEYCKTYGENYCLGQLIYWHNQTNADPDCNLARASRGCLSLPDIGSFYAKVQKPSR 2125

Query: 2372 HRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSLTGCPLDREMVHW 2431
             RVYGP+T+RFML+RMEKQPQR WPKDRIW+FKS P+IFGSPMLD+ L   PLDREM+HW
Sbjct: 2126 QRVYGPRTLRFMLARMEKQPQRQWPKDRIWSFKSCPKIFGSPMLDAVLHNSPLDREMLHW 2185

Query: 2432 LKHRPAIFQAMWDR 2445
            LK+RPA FQAMWDR
Sbjct: 2186 LKNRPATFQAMWDR 2199


>gi|297804746|ref|XP_002870257.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316093|gb|EFH46516.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 2364

 Score = 2711 bits (7026), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1450/2543 (57%), Positives = 1794/2543 (70%), Gaps = 279/2543 (10%)

Query: 1    MGDGGVACMPLQQQQQHNSIMERFPISDKTTICVGNSSNNSNKTNNNSISNNNDNKTNND 60
            M DGGVACMPL       +IME+ PI +KTT+C GN S +   T+N              
Sbjct: 1    MSDGGVACMPLL------NIMEKLPIVEKTTLCGGNESKSVGTTDNG------------- 41

Query: 61   SSNNNGSSSSKNNETNKSNVKKNGVSTKTVRKKIVK-IKKVIAVKKKEVQKNS------- 112
                + S SSK  E+  ++  K   S    +K+IVK I+KV+  + K+ QK +       
Sbjct: 42   ----HTSISSKVPESQPAD-NKPSASQPVKKKRIVKVIRKVVKRRPKQPQKQAEEQLKDQ 96

Query: 113  ---------------------GSSKSNNNGENIDNKNVENGGAVGEVVTVDKENLKNEEV 151
                                  + KS   G    +K VENGG  G            +EV
Sbjct: 97   PPSQVVQLPAESQLQLKEQEEQNKKSEVKGGTSGDKEVENGGDSG----------FKDEV 146

Query: 152  EEGELGTLK----WENGEFVQPEKSQPQSQLQSQSKQIEKGEIIV--------------- 192
            EEGELGTLK     ENGE + P KS        Q  +IEKGEI+                
Sbjct: 147  EEGELGTLKPPGDLENGE-ISPVKSL-------QKSEIEKGEIVGESWKKDEPTKGEFSY 198

Query: 193  ------------FSS-KCRRGETEKGESGLWRGNKDDIEKGEFIPDRWHK-EVVKDEYGY 238
                        FS+ K  +G  E  E   WR + D+IEKGEFIPDRW K + VKD++ Y
Sbjct: 199  LKYHKGNVERRDFSADKNWKGGKEDREFRSWRDSGDEIEKGEFIPDRWQKMDAVKDDHSY 258

Query: 239  SKSRR----------YDYKLERTPPSGKYSGEDVYRRKEFDRSGSQHSKSSSRWESGQER 288
             +SRR          Y+Y+ ERTPP G+++ ED+Y ++EF               SG +R
Sbjct: 259  IRSRRNGVDREKTWKYEYEYERTPPGGRFANEDIYHQREF--------------RSGHDR 304

Query: 289  NVRISSKIVDDEGLYKGEHNNGKNHGREYFHG-NRFKRHGTDSDSGDRKY-YGDYGDFAG 346
              RISSKIV +E L+K E+NN  N  +EY    NR KRHG + DS +RK+ Y DYGD+  
Sbjct: 305  TTRISSKIVIEENLHKNEYNNPSNFVKEYSSTVNRLKRHGAEPDSVERKHSYADYGDYGS 364

Query: 347  LKSRRLSDDYNSRSVHSEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRVIYDR 406
             K R+LSDD  SRS+HS+HYS+HS E+ +R+S SS+ SSL+KY  +H + S  ++   D+
Sbjct: 365  SKCRKLSDDC-SRSLHSDHYSQHSAERLYRDSYSSKNSSLEKYHRKHQDASFPAKAFSDK 423

Query: 407  HGRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRS 466
            HG SP+ SD SPHDR RY+++R                     DRSPY+RERSPY  ++S
Sbjct: 424  HGHSPARSDWSPHDRSRYHENR---------------------DRSPYARERSPYIFEKS 462

Query: 467  PYAREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNNHR 526
             +AR++SP DRSRH+D+R RSP  +E SP DR+R  DR D  PNY+E +   R+R N HR
Sbjct: 463  SHARKRSPRDRSRHHDYR-RSPSYSEWSPHDRSRPSDRRDSIPNYMEDTQNDRNRRNGHR 521

Query: 527  EASSKTGASEKRNARYDSKGHEDKLGPKDSNARCSRSSAKESQDKSNVQDLNVSDEKTAN 586
            E S K+G  E+R+++  ++  E+K   KDSN + S SS+KE Q K+ + + N+  EK + 
Sbjct: 522  EISRKSGVRERRDSQTGTE-LENKHRYKDSNGKESTSSSKELQGKNILYNNNLVVEKNSV 580

Query: 587  CESHKEEQPQSSSVDCKEPPQVDGPPLEELVSMEEDMDICDTPPHVPAVTDSSVGKWFYL 646
            C+S K   P ++    KEP QV   P EEL SME DMDICDTPPH P   DSS+GKWFYL
Sbjct: 581  CDSSKIPIPCATG---KEPVQVGEAPPEELPSMEVDMDICDTPPHEPMAADSSLGKWFYL 637

Query: 647  DHCGMECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSITSDS 706
            D+ G E GP+RL +LK L+E+G+L SDH IKH D+NRW                      
Sbjct: 638  DYYGTEHGPARLSELKALMEQGILFSDHMIKHSDNNRW---------------------- 675

Query: 707  VTQLVSPPEASGNLLADTGDTA------QSTGEEFPVTLQSQCCPDGSAAAAESSEDLHI 760
               L +PPEA GNLL D  DT       Q  G+  P ++     PD +    E  ED  I
Sbjct: 676  ---LANPPEAPGNLLEDITDTTEAVCIEQEAGDSLPESVSVMTIPDANEFLVEHLEDFQI 732

Query: 761  DVRVGALLDGFTVIPGKEIETLGEILQTT--------------FERVDWQNNGGPTWHGA 806
            D R+  LL+G+T+ PG+E E+LGE L  T              FE V     G  +  G 
Sbjct: 733  DKRIANLLEGYTIAPGREFESLGEALNVTVEFKETRRCVTSEVFEVVQIWAFGMKSI-GK 791

Query: 807  CVGEQKPGDQKVDELYISDTKMKEAAELKSGDKDHWVVCFDSDE---WFSGRWSCKGGDW 863
            C+   K  ++    L  S+   +   E KS D    V   +SDE   WFSGRWSCKGGDW
Sbjct: 792  CLMFVKDDEEL---LGCSEPIKRAIEEFKSDD----VYGSESDEIGSWFSGRWSCKGGDW 844

Query: 864  KRNDEAAQDRCSRKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYA 923
             R DEA+QDR  +KK VLNDGFPLC M KSGYEDPRW+ KDD+YYP  S RL+LP WA++
Sbjct: 845  IRQDEASQDRYYKKKIVLNDGFPLCLMQKSGYEDPRWHHKDDMYYPLSSSRLELPLWAFS 904

Query: 924  CPDERNDGSGGSRSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERH 983
              DERN                RGVK  +L VVR+N+ VVND    V +PR+KVR KER 
Sbjct: 905  GVDERNQA--------------RGVKANLLSVVRLNSLVVNDQVPPVPDPRAKVRGKERC 950

Query: 984  SSRSARSYSSANDVRRSSAESDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQL 1043
             SR AR   +++D +R S ES S S A N QDS G  ++ A +NTP+DRLCTVDDLQL +
Sbjct: 951  PSRPARPSPASSDSKRESVESHSQSTASNGQDSHGLLRTDASVNTPRDRLCTVDDLQLHI 1010

Query: 1044 GEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRN 1103
            G+W+Y DGAG E+GP  FSELQ+LV++G I+ H+SVFRK DK+WVP+T  T +  +  + 
Sbjct: 1011 GDWFYTDGAGQEQGPLPFSELQILVEKGFIKSHSSVFRKSDKIWVPVTSITNSPETIAKL 1070

Query: 1104 HGEKIMPSGDSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYK 1163
             G+      D   L  +++QD  L  S  + + N+FH +HPQF+GY RGKLH+LVMK++K
Sbjct: 1071 RGKNPALPSDCQDLVVSETQD--LKRSEMDTSLNSFHGVHPQFLGYFRGKLHQLVMKTFK 1128

Query: 1164 NREFAAAINEVLDPWINAKQPKKETE-HVYRKSEGDTRAGKRARLLVRESDGDEETEEEL 1222
            +R+F+AAIN+VLD WI+A+QPKKE+E ++Y+ SE D+   KRARL+  ES  D E E+  
Sbjct: 1129 SRDFSAAINDVLDSWIHARQPKKESEKYMYQSSELDSCFTKRARLMAGESGEDSEMEDTQ 1188

Query: 1223 QTIQDESTFEDLCGDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASL 1282
               +DE TFEDLCGDA+F  E S S+      WGLLDGH LA VFH LR D+KSLAFAS+
Sbjct: 1189 MFQKDELTFEDLCGDATFQIEGSGSAGTVGIYWGLLDGHALARVFHLLRYDVKSLAFASM 1248

Query: 1283 TCRHWRAAVRFYKGISRQVDLSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGM 1342
            TCRHW+A +  YK ISRQVDLSS+GPNCTDS +R  +N ++KEK++SI+LVGCTN+T+ M
Sbjct: 1249 TCRHWKATINSYKEISRQVDLSSLGPNCTDSRLRSIMNTYNKEKIDSIILVGCTNVTASM 1308

Query: 1343 LEEILQSFPHLSSIDIRGCGQFGELALKFPNINWVKSQKSRGAKFNDSRSKIRSLKQITE 1402
            LEEIL  FP +SS+DI GC QFG+L++ + N++W++ Q +R  + +   S+IRSLKQ T+
Sbjct: 1309 LEEILHIFPRISSVDITGCSQFGDLSVNYKNVSWLRCQNTRSGELH---SRIRSLKQATD 1365

Query: 1403 KSSSAPKSKGLGDDMDDFGDLKDYFESVDKRDSANQSFRRSLYQRSKVFDARKSSSILSR 1462
             S    KSKG+G D DDFG+LKDYF+ V+KRDSANQ FRRSLY+RSK++DARKSS+ILSR
Sbjct: 1366 GS----KSKGVGGDTDDFGNLKDYFDRVEKRDSANQLFRRSLYKRSKLYDARKSSAILSR 1421

Query: 1463 DARMRRWSIKKSENGYKRMEEFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGL 1522
            DAR+RRW+IKKSE+GYKR+EEFLA SL+ IM+ NTF+FF  KV++IE +MK GYY+SHGL
Sbjct: 1422 DARIRRWAIKKSEHGYKRVEEFLALSLRGIMKQNTFDFFALKVSQIEEKMKNGYYVSHGL 1481

Query: 1523 GSVKDDISRMCRDAIKAKNRGSAGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKD 1582
             SVK+DISRMCR+AIK                                   +E+MKSW+D
Sbjct: 1482 RSVKEDISRMCREAIK-----------------------------------DELMKSWQD 1506

Query: 1583 ESPAGLYSATSKYKKKLSKMVSERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNR 1642
             S  GL SA SKY KKLSK V+E+KYM+R++ T   NG  DYGEYASDREI++RLSKLNR
Sbjct: 1507 GS--GLSSA-SKYNKKLSKTVTEKKYMSRTSDTFGVNGASDYGEYASDREIKRRLSKLNR 1563

Query: 1643 KSLDSGSETSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLD- 1701
            KS  SGSETS +    S++GKSD+ S+ S ++S+ D RS+GR+++ R    FT DE  D 
Sbjct: 1564 KSFSSGSETSSE---LSDNGKSDNYSSASASESESDIRSEGRSQDLRTERYFTADESFDS 1620

Query: 1702 FSDDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKN 1761
             +++REWGARMTKASLVPPVTRKYEVI++Y IVADEE+V+RKMRVSLPEDY EKLNAQ+N
Sbjct: 1621 VTEEREWGARMTKASLVPPVTRKYEVIEKYAIVADEEEVQRKMRVSLPEDYGEKLNAQRN 1680

Query: 1762 GSEELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFI 1821
            G EELDMELPEVK++KPRK LGD+V EQEVYGIDPYTHNLLLDSMP ELDW+L +KH FI
Sbjct: 1681 GIEELDMELPEVKEFKPRKLLGDEVLEQEVYGIDPYTHNLLLDSMPGELDWSLQDKHSFI 1740

Query: 1822 EDVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSR 1881
            EDV+LRTLN+QVR FTG+GNTPM++PL+PVIEE+++ A ++CD+RT+KMC+ +LK ++SR
Sbjct: 1741 EDVVLRTLNRQVRLFTGSGNTPMVFPLRPVIEELKESAREECDIRTLKMCQVVLKEIESR 1800

Query: 1882 PDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAP 1941
             DDKYV+YRKGLGVVCNKEGGFGE+DFVVEFLGEVYPVWKWFEKQDGIRSLQ+N  DPAP
Sbjct: 1801 SDDKYVSYRKGLGVVCNKEGGFGEEDFVVEFLGEVYPVWKWFEKQDGIRSLQENKTDPAP 1860

Query: 1942 EFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTV 2001
            EFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIY+V
Sbjct: 1861 EFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYSV 1920

Query: 2002 RGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLD 2061
            R I YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF+KVLK+ HGLL+
Sbjct: 1921 RAIEYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFQKVLKDWHGLLE 1980

Query: 2062 RHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEE 2121
            RH+LMLEAC LNSVSEEDYLELGRAGLGSCLLGGLP+WV+AYSARLVRFIN ERTKLPEE
Sbjct: 1981 RHRLMLEACILNSVSEEDYLELGRAGLGSCLLGGLPDWVIAYSARLVRFINFERTKLPEE 2040

Query: 2122 ILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKK 2181
            IL+HNLEEKRKYFSDI L+VEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMR VFGDPK 
Sbjct: 2041 ILKHNLEEKRKYFSDIHLDVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRHVFGDPKN 2100

Query: 2182 APPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQ 2241
            APPP+ERL+PEETVSF+W G+GSLV+EL+Q ++PH+EE +LN+L+SKI +HDPSGS D+ 
Sbjct: 2101 APPPLERLTPEETVSFVWNGDGSLVDELVQSLSPHLEEGILNELRSKIHSHDPSGSADVL 2160

Query: 2242 RELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPL 2301
            +EL++SLLWLRDE+R+LPCTYKCR+DAAADLIHIYAYTKCFF+V+EY++F S PV+ISPL
Sbjct: 2161 KELQRSLLWLRDEIRDLPCTYKCRNDAAADLIHIYAYTKCFFKVREYQSFISSPVHISPL 2220

Query: 2302 DLGPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGS 2361
            DLG KYADKLG  ++ YRKTYGENYCLGQLI+W+ QTN DPD TL +A+RGCLSLPD+ S
Sbjct: 2221 DLGAKYADKLGESIKEYRKTYGENYCLGQLIYWYNQTNTDPDLTLVKATRGCLSLPDVAS 2280

Query: 2362 FYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSLTG 2421
            FYAK QKPS+HRVYGPKTV+ M+S+M KQPQRPWPKD+IW FKS+PR+FGSPM D+ L  
Sbjct: 2281 FYAKAQKPSKHRVYGPKTVKTMVSQMSKQPQRPWPKDKIWTFKSTPRVFGSPMFDAVLNN 2340

Query: 2422 CPLDREMVHWLKHRPAIFQAMWD 2444
              LDRE++ WL++R  +FQA WD
Sbjct: 2341 SSLDRELLQWLRNRRHVFQATWD 2363


>gi|186511821|ref|NP_193253.4| putative histone-lysine N-methyltransferase ATXR3 [Arabidopsis
            thaliana]
 gi|229488102|sp|O23372.2|ATXR3_ARATH RecName: Full=Probable histone-lysine N-methyltransferase ATXR3;
            AltName: Full=Protein SET DOMAIN GROUP 2; AltName:
            Full=Trithorax-related protein 3; Short=TRX-related
            protein 3
 gi|332658165|gb|AEE83565.1| putative histone-lysine N-methyltransferase ATXR3 [Arabidopsis
            thaliana]
          Length = 2335

 Score = 2681 bits (6950), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1432/2524 (56%), Positives = 1777/2524 (70%), Gaps = 270/2524 (10%)

Query: 1    MGDGGVACMPLQQQQQHNSIMERFPISDKTTICVGNSSNNSNKTNNNSISNNNDNKTNND 60
            M DGGVACMPL       +IME+ PI +KTT+C GN S                 KT   
Sbjct: 1    MSDGGVACMPLL------NIMEKLPIVEKTTLCGGNES-----------------KTAAT 37

Query: 61   SSNNNGSSSSKNNETNKSNVKKNGVSTKTVRKKIVK-IKKVIAVKKKEVQKNSGSS---- 115
            + N + S ++K  E+  +N K +  S    +K+IVK I+KV+  + K+ QK +       
Sbjct: 38   TENGHTSIATKVPESQPAN-KPSASSQPVKKKRIVKVIRKVVKRRPKQPQKQADEQLKDQ 96

Query: 116  ---------------------KSNNNGENIDNKNVENGGAVGEVVTVDKENLKNEEVEEG 154
                                 KS   G     K VENGG  G            +EVEEG
Sbjct: 97   PPSQVVQLPAESQLQIKEQDKKSEFKGGTSGVKEVENGGDSG----------FKDEVEEG 146

Query: 155  ELGTLKW----ENGEFVQPEKSQPQSQLQSQSKQIEKGEIIV------------------ 192
            ELGTLK     ENGE + P KS        Q  +IEKGEI+                   
Sbjct: 147  ELGTLKLHEDLENGE-ISPVKSL-------QKSEIEKGEIVGESWKKDEPTKGEFSHLKY 198

Query: 193  ---------FSS-KCRRGETEKGESGLWRGNKDDIEKGEFIPDRWHK-EVVKDEYGYSKS 241
                     FS+ K  +G  E+ E   WR   D+IEKGEFIPDRW K +  KD++ Y +S
Sbjct: 199  HKGYVERRDFSADKNWKGGKEEREFRSWRDPSDEIEKGEFIPDRWQKMDTGKDDHSYIRS 258

Query: 242  RR----------YDYKLERTPPSGKYSGEDVYRRKEFDRSGSQHSKSSSRWESGQERNVR 291
            RR          Y+Y+ ERTPP G++  ED+Y ++EF               SG +R  R
Sbjct: 259  RRNGVDREKTWKYEYEYERTPPGGRFVNEDIYHQREF--------------RSGLDRTTR 304

Query: 292  ISSKIVDDEGLYKGEHNNGKNHGREYFH-GNRFKRHGTDSDSGDRKY-YGDYGDFAGLKS 349
            ISSKIV +E L+K E+NN  N  +EY   GNR KRHG + DS +RK+ Y DYGD+   K 
Sbjct: 305  ISSKIVIEENLHKNEYNNSSNFVKEYSSTGNRLKRHGAEPDSIERKHSYADYGDYGSSKC 364

Query: 350  RRLSDDYNSRSVHSEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRVIYDRHGR 409
            R+LSDD  SRS+HS+HYS+HS E+ +R+S  S+ SSL+KY  +H + S  ++   D+HG 
Sbjct: 365  RKLSDDC-SRSLHSDHYSQHSAERLYRDSYPSKNSSLEKYPRKHQDASFPAKAFSDKHGH 423

Query: 410  SPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPYA 469
            SPS SD SPHDR RY+++R                     DRSPY+RERSPY  ++S +A
Sbjct: 424  SPSRSDWSPHDRSRYHENR---------------------DRSPYARERSPYIFEKSSHA 462

Query: 470  REKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNNHREAS 529
            R++SP DR  H     RSP  +E SP DR+R  DR D  PN++E +   R+R N HRE S
Sbjct: 463  RKRSPRDRRHH--DYRRSPSYSEWSPHDRSRPSDRRDYIPNFMEDTQSDRNRRNGHREIS 520

Query: 530  SKTGASEKRNARYDSKGHEDKLGPKDSNARCSRSSAKESQDKSNVQDLNVSDEKTANCES 589
             K+G  E+R+ +  ++  E K   K+SN + S SS+KE Q K+ + + ++  EK + C+S
Sbjct: 521  RKSGVRERRDCQTGTE-LEIKHKYKESNGKESTSSSKELQGKNILYNNSLLVEKNSVCDS 579

Query: 590  HKEEQPQSSSVDCKEPPQVDGPPLEELVSMEEDMDICDTPPHVPAVTDSSVGKWFYLDHC 649
             K   P ++    KEP QV   P EEL SME DMDICDTPPH P  +DSS+GKWFYLD+ 
Sbjct: 580  SKIPVPCATG---KEPVQVGEAPTEELPSMEVDMDICDTPPHEPMASDSSLGKWFYLDYY 636

Query: 650  GMECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSITSDSVTQ 709
            G E GP+RL DLK L+E+G+L SDH IKH D+NRW                         
Sbjct: 637  GTEHGPARLSDLKALMEQGILFSDHMIKHSDNNRW------------------------- 671

Query: 710  LVSPPEASGNLLADTGDTA------QSTGEEFPVTLQSQCCPDGSAAAAESSEDLHIDVR 763
            LV+PPEA GNLL D  DT       Q  G+  P  +  +  PDG     E+ ED  ID+R
Sbjct: 672  LVNPPEAPGNLLEDIADTTEAVCIEQGAGDSLPELVSVRTLPDGKEIFVENREDFQIDMR 731

Query: 764  VGALLDGFTVIPGKEIETLGEILQTTFERVDWQNNGGPTWHGACVGEQKPGDQKVDELYI 823
            V  LLDG T+ PG+E ETLGE L+     V+++           VG  +P  + ++E   
Sbjct: 732  VENLLDGRTITPGREFETLGEALKVN---VEFEETRRCVTSEGVVGMFRPMKRAIEEFKS 788

Query: 824  SDTKMKEAAELKSGDKDHWVVCFDSDEWFSGRWSCKGGDWKRNDEAAQDRCSRKKQVLND 883
             D    E+ E+ S              WFSGRWSCKGGDW R DEA+QDR  +KK VLND
Sbjct: 789  DDAYGSESDEIGS--------------WFSGRWSCKGGDWIRQDEASQDRYYKKKIVLND 834

Query: 884  GFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYACPDERNDGSGGSRSTQSKLA 943
            GFPLC M KSG+EDPRW+ KDDLYYP  S RL+LP WA++  DERN              
Sbjct: 835  GFPLCLMQKSGHEDPRWHHKDDLYYPLSSSRLELPLWAFSVVDERNQ------------- 881

Query: 944  AVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSSRSARSYSSANDVRRSSAE 1003
              RGVK ++L VVR+N+ VVND    + +PR+KVR+KER  SR AR   +++D +R S E
Sbjct: 882  -TRGVKASLLSVVRLNSLVVNDQVPPIPDPRAKVRSKERCPSRPARPSPASSDSKRESVE 940

Query: 1004 SDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGEWYYLDGAGHERGPSSFSE 1063
            S S S A   QDSQG WK+   +NTP+DRLCTVDDLQL +G+W+Y DGAG E+GP SFSE
Sbjct: 941  SHSQSTASTGQDSQGLWKTDTSVNTPRDRLCTVDDLQLHIGDWFYTDGAGQEQGPLSFSE 1000

Query: 1064 LQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRNHGEKIMPSGDSSGLPPTQSQ 1123
            LQ LV++G I+ H+SVFRK DK+WVP+T  T++  +     G+         GL  +++Q
Sbjct: 1001 LQKLVEKGFIKSHSSVFRKSDKIWVPVTSITKSPETIAMLRGKTPALPSACQGLVVSETQ 1060

Query: 1124 DAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNREFAAAINEVLDPWINAKQ 1183
            D    E + ++NS  FH +HPQF+GY RGKLH+LVMK++K+R+F+AAIN+V+D WI+A+Q
Sbjct: 1061 DFKYSEMDTSLNS--FHGVHPQFLGYFRGKLHQLVMKTFKSRDFSAAINDVVDSWIHARQ 1118

Query: 1184 PKKETE-HVYRKSEGDTRAGKRARLLVRESDGDEETEEELQTIQDESTFEDLCGDASFPG 1242
            PKKE+E ++Y+ SE ++   KRARL+  ES  D E E+     +DE TFEDLCGD +F  
Sbjct: 1119 PKKESEKYMYQSSELNSCYTKRARLMAGESGEDSEMEDTQMFQKDELTFEDLCGDLTFNI 1178

Query: 1243 EESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRFYKGISRQVD 1302
            E + S+      WGLLDGH LA VFH LR D+KSLAFAS+TCRHW+A +  YK ISRQVD
Sbjct: 1179 EGNRSAGTVGIYWGLLDGHALARVFHMLRYDVKSLAFASMTCRHWKATINSYKDISRQVD 1238

Query: 1303 LSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGMLEEILQSFPHLSSIDIRGCG 1362
            LSS+GP+CTDS +R  +N ++KEK++SI+LVGCTN+T+ MLEEIL+  P +SS+DI GC 
Sbjct: 1239 LSSLGPSCTDSRLRSIMNTYNKEKIDSIILVGCTNVTASMLEEILRLHPRISSVDITGCS 1298

Query: 1363 QFGELALKFPNINWVKSQKSRGAKFNDSRSKIRSLKQITEKSSSAPKSKGLGDDMDDFGD 1422
            QFG+L + + N++W++ Q +R  + +   S+IRSLKQ T+      KSKGLG D DDFG+
Sbjct: 1299 QFGDLTVNYKNVSWLRCQNTRSGELH---SRIRSLKQTTD----VAKSKGLGGDTDDFGN 1351

Query: 1423 LKDYFESVDKRDSANQSFRRSLYQRSKVFDARKSSSILSRDARMRRWSIKKSENGYKRME 1482
            LKDYF+ V+KRDSANQ FRRSLY+RSK++DAR+SS+ILSRDAR+RRW+IKKSE+GYKR+E
Sbjct: 1352 LKDYFDRVEKRDSANQLFRRSLYKRSKLYDARRSSAILSRDARIRRWAIKKSEHGYKRVE 1411

Query: 1483 EFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGLGSVKDDISRMCRDAIKAKNR 1542
            EFLASSL+ IM+ NTF+FF  KV++IE +MK GYY+SHGL SVK+DISRMCR+AIK    
Sbjct: 1412 EFLASSLRGIMKQNTFDFFALKVSQIEEKMKNGYYVSHGLRSVKEDISRMCREAIK---- 1467

Query: 1543 GSAGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKDESPAGLYSATSKYKKKLSKM 1602
                                           +E+MKSW+D S  GL SAT KY KKLSK 
Sbjct: 1468 -------------------------------DELMKSWQDGS--GLSSAT-KYNKKLSKT 1493

Query: 1603 VSERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRKSLDSGSETSDDLDGSSEDG 1662
            V+E+KYM+R++ T   NG  DYGEYASDREI++RLSKLNRKS  S S+TS +    S++G
Sbjct: 1494 VAEKKYMSRTSDTFGVNGASDYGEYASDREIKRRLSKLNRKSFSSESDTSSE---LSDNG 1550

Query: 1663 KSDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLD-FSDDREWGARMTKASLVPPV 1721
            KSD+ S+ S ++S+ D RS+GR+++ R    FT D+  D  +++REWGARMTKASLVPPV
Sbjct: 1551 KSDNYSSASASESESDIRSEGRSQDLRIEKYFTADDSFDSVTEEREWGARMTKASLVPPV 1610

Query: 1722 TRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEELDMELPEVKDYKPRKQ 1781
            TRKYEVI++Y IVADEE+V+RKMRVSLPEDY EKLNAQ+NG EELDMELPEVK+YKPRK 
Sbjct: 1611 TRKYEVIEKYAIVADEEEVQRKMRVSLPEDYGEKLNAQRNGIEELDMELPEVKEYKPRKL 1670

Query: 1782 LGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNKQVRHFTGTGN 1841
            LGD+V EQEVYGIDPYTHNLLLDSMP ELDW+L +KH FIEDV+LRTLN+QVR FTG+G+
Sbjct: 1671 LGDEVLEQEVYGIDPYTHNLLLDSMPGELDWSLQDKHSFIEDVVLRTLNRQVRLFTGSGS 1730

Query: 1842 TPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEG 1901
            TPM++PL+PVIEE+++ A ++CD+RTMKMC+G+LK ++SR DDKYV+YRKGLGVVCNKEG
Sbjct: 1731 TPMVFPLRPVIEELKESAREECDIRTMKMCQGVLKEIESRSDDKYVSYRKGLGVVCNKEG 1790

Query: 1902 GFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLV 1961
            GFGE+DFVVEFLGEVYPVWKWFEKQDGIRSLQ+N  DPAPEFYNIYLERPKGDADGYDLV
Sbjct: 1791 GFGEEDFVVEFLGEVYPVWKWFEKQDGIRSLQENKTDPAPEFYNIYLERPKGDADGYDLV 1850

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            VVDAMH ANYASRICHSCRPNCEAKVTAVDGHYQIGIY+VR I YGEEITFDYNSVTESK
Sbjct: 1851 VVDAMHMANYASRICHSCRPNCEAKVTAVDGHYQIGIYSVRAIEYGEEITFDYNSVTESK 1910

Query: 2022 EEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYL 2081
            EEYEASVCLCGSQVCRGSYLNLTGEGAF+KVLK+ HGLL+RH+LMLEAC LNSVSEEDYL
Sbjct: 1911 EEYEASVCLCGSQVCRGSYLNLTGEGAFQKVLKDWHGLLERHRLMLEACVLNSVSEEDYL 1970

Query: 2082 ELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICLEV 2141
            ELGRAGLGSCLLGGLP+W++AYSARLVRFIN ERTKLPEEIL+HNLEEKRKYFSDI L+V
Sbjct: 1971 ELGRAGLGSCLLGGLPDWMIAYSARLVRFINFERTKLPEEILKHNLEEKRKYFSDIHLDV 2030

Query: 2142 EKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLWKG 2201
            EKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMR VFGDPK APPP+ERL+PEETVSF+W G
Sbjct: 2031 EKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRHVFGDPKNAPPPLERLTPEETVSFVWNG 2090

Query: 2202 EGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCT 2261
            +GSLV+EL+Q ++PH+EE  LN+L+SKI  HDPSGS D+ +EL++SLLWLRDE+R+LPCT
Sbjct: 2091 DGSLVDELLQSLSPHLEEGPLNELRSKIHGHDPSGSADVLKELQRSLLWLRDEIRDLPCT 2150

Query: 2262 YKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKLGADLQVYRKT 2321
            YKCR+DAAADLIHIYAYTKCFF+V+EY++F S PV+ISPLDLG KYADKLG  ++ YRKT
Sbjct: 2151 YKCRNDAAADLIHIYAYTKCFFKVREYQSFISSPVHISPLDLGAKYADKLGESIKEYRKT 2210

Query: 2322 YGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVR 2381
            YGENYCLGQLI+W+ QTN DPD TL +A+RGCLSLPD+ SFYAK QKPS+HRVYGPKTV+
Sbjct: 2211 YGENYCLGQLIYWYNQTNTDPDLTLVKATRGCLSLPDVASFYAKAQKPSKHRVYGPKTVK 2270

Query: 2382 FMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSL-TGCPLDREMVHWLKHRPAIFQ 2440
             M+S+M KQPQRPWPKD+IW FKS+PR+FGSPM D+ L     LDRE++ WL++R  +FQ
Sbjct: 2271 TMVSQMSKQPQRPWPKDKIWTFKSTPRVFGSPMFDAVLNNSSSLDRELLQWLRNRRHVFQ 2330

Query: 2441 AMWD 2444
            A WD
Sbjct: 2331 ATWD 2334


>gi|357453545|ref|XP_003597050.1| Histone-lysine N-methyltransferase E(z) [Medicago truncatula]
 gi|355486098|gb|AES67301.1| Histone-lysine N-methyltransferase E(z) [Medicago truncatula]
          Length = 2512

 Score = 2550 bits (6608), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1325/2090 (63%), Positives = 1583/2090 (75%), Gaps = 72/2090 (3%)

Query: 394  HEPSLSSRVIYDRHGRSPSHSDRSPHDRG-RYYDHRDRSPSRHDRSPYTRD--------R 444
            H P   S   +D   RSP+ +++SP D+G R YD + RSP+R ++SP  +         R
Sbjct: 457  HSPQDPSWRQHDHKLRSPARAEQSPQDQGWRQYDPKLRSPARTEQSPRNQGWRHNDHKLR 516

Query: 445  SPYTFDRSPYSRE-RSPYNRDRSPYAREKSPYDRS-RHYDHRNRSPFSAERSPQDRARFH 502
            SP   ++SP  +  R   ++ RSP   E+SP  +  +  DH+ RSP   E+SP D+ R  
Sbjct: 517  SPARTEQSPRGQGWRHNDHKLRSPACTEQSPRGQGWQQNDHKLRSPARTEQSPHDQGRRR 576

Query: 503  DRSDRTPNYLERSPLHRSRPNNHREASSKTGASEKRNARYDSKGHEDKLGPKDSNARCSR 562
               D TPN  E SP  R+  + H E S K  +SE  N     K  EDK  P++S      
Sbjct: 577  GLRDCTPNLGEESPHVRTTKDVHEETSCKNSSSENLNFPNSCKSDEDKHIPRESAC---- 632

Query: 563  SSAKESQDKSNVQDLNVSDEK-TANCESHKEEQPQSSSVDCKEPPQVDG-PPLEELVSME 620
             S  ES+ + NVQ  N S EK  ++ +    +Q  S +VD KE PQ +  PP +EL+SME
Sbjct: 633  -SVTESEGERNVQKTNESIEKDISSSQPVDTQQSCSPTVDHKESPQCEAQPPPDELLSME 691

Query: 621  EDMDICDTPPHVPAVTDSSVGKWFYLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKHLD 680
            EDMDICDTPPHVP VTD S GKWFYLD+ G+E GP++LCD+K LV+EGVL+SDHFIKHLD
Sbjct: 692  EDMDICDTPPHVPVVTDLSSGKWFYLDYGGVENGPTKLCDIKALVDEGVLMSDHFIKHLD 751

Query: 681  SNRWETVENAVSPLVTVNFPSITSDSVTQLVSPPEASGNLLADTGDTAQSTGEEFPVTLQ 740
            SNRW TVENAVSPLV   FPS+ SD++TQLV+PPEASGNLLADT D  QS     P  L 
Sbjct: 752  SNRWLTVENAVSPLVAQIFPSVVSDTITQLVNPPEASGNLLADTADI-QSAPANNPEML- 809

Query: 741  SQCCPDG----SAAAAESSEDLHIDVRVGALLDGFTVIPGKEIETLGEILQTTFERVDWQ 796
            +   P G    +   +E  ++ +ID RV  LL+G+ VIPG E+E + E LQ  FE     
Sbjct: 810  APSPPRGHLNDNVLTSELLDNFYIDERVQKLLEGYDVIPGMELEAIKEALQMKFEYPKED 869

Query: 797  NNG---GPTWHGACVGEQKPGDQKVDELYISDTKMKEAAELKSGDKDHWVVCFDSDEWFS 853
              G   G  WH +C+ E    D   D   ++    +    +   +KD         +WFS
Sbjct: 870  GLGDYEGFPWHVSCLRED--CDSSTD---LASRDSESQLSMSCDNKDDGFGYGIPKDWFS 924

Query: 854  GRWSCKGGDWKRNDEAAQDRCSRKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYPSHSR 913
              WSCKGGDWKRND+  QDR  RKK VLN+GFPLCQ+PKSG EDPRW + DDLY PS SR
Sbjct: 925  TLWSCKGGDWKRNDDT-QDRFFRKKVVLNNGFPLCQLPKSGCEDPRWPEIDDLYCPSQSR 983

Query: 914  RLDLPPWAYACPDERNDGSGGSRSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSFVSEP 973
             LDLP WA    DE  D +  SRS QSK  +++GVKG +L VVRINACVVND G  +SE 
Sbjct: 984  -LDLPLWAVGA-DELVDCNAASRSVQSKPPSIKGVKGNVLSVVRINACVVNDQGLLLSES 1041

Query: 974  RSKVRAKERHSSRSARSYSSANDVRRSSAESDSHSKARNNQDSQGSWKSIACINTPKDRL 1033
            R + R K+R   RS R ++S +D +RSS E  S SKA ++Q   GS++S+  I  PKD L
Sbjct: 1042 RHQTRGKDRQHPRSTRPFTSTSDSKRSSTEESSQSKAVSDQ---GSYQSMEFIGVPKDHL 1098

Query: 1034 CTVDDLQLQLGEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFA 1093
            CT+ +LQL LG+WYY+D +G E+GPSSFSELQ LVDQG I++H+SVFRK DK+WVP+  A
Sbjct: 1099 CTIQELQLHLGDWYYIDASGREKGPSSFSELQSLVDQGVIKRHSSVFRKRDKLWVPIASA 1158

Query: 1094 TETSASTVRNHGEKIMPSGDSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGK 1153
             ET      +H +     G  S  P  Q+Q    GES    +S+ F+ +HPQF+G+TRGK
Sbjct: 1159 AETLDVCPTSHQKSSSTLGACSDHPSQQTQGVSYGESC--TSSSLFNKIHPQFVGFTRGK 1216

Query: 1154 LHELVMKSYKNREFAAAINEVLDPWINAKQPKKETE-HVYRKSEGDTRAGKRARLLVRES 1212
            LHELVMKSYK+RE AAAINEVLDPWINA+QPKK+ E  +Y KSEGDTRA KRAR+LV +S
Sbjct: 1217 LHELVMKSYKSRELAAAINEVLDPWINARQPKKDIEKQIYWKSEGDTRAAKRARMLVDDS 1276

Query: 1213 DGDEETEEELQTIQDESTFEDLCGDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRS 1272
            + D   E+ +   ++E TFEDL GDA+FP +E   +  E G WGLLDG  LA +FHFLRS
Sbjct: 1277 EEDSGLEDGVTIGKNEPTFEDLRGDATFPEKEIGITDSEVGSWGLLDGPVLARIFHFLRS 1336

Query: 1273 DMKSLAFASLTCRHWRAAVRFYKGISRQVDLSSVGPNCTDSLIRKTLNAFDKEKLNSILL 1332
            D KSL FAS+TC+HW AAVRFYK IS Q++LSS+G +CTDS++   +NA++K+K+NSI+L
Sbjct: 1337 DFKSLVFASMTCKHWSAAVRFYKEISMQLNLSSLGHSCTDSVLWNIMNAYEKDKINSIIL 1396

Query: 1333 VGCTNITSGMLEEILQSFPHLSSIDIRGCGQFGELALKFPNINWVKSQKSRGAKFNDSRS 1392
            +GC NIT+ MLE+IL SFP L +IDIRGC QFGEL  KF N+ W+KS+ SR     +   
Sbjct: 1397 IGCNNITADMLEKILLSFPGLCTIDIRGCSQFGELTPKFTNVKWIKSRSSRMDGIAEEPH 1456

Query: 1393 KIRSLKQITEKSSSAPKSKGLGDDMDDFGDLKDYFESVDKRDSANQSFRRSLYQRSKVFD 1452
            KIRSLK IT ++ SA KS  LG  +DDFG LK+YF+SVDKRDSA Q FR++LY+RSK++D
Sbjct: 1457 KIRSLKHITGQTLSASKSSNLG--IDDFGQLKEYFDSVDKRDSAKQLFRQNLYKRSKLYD 1514

Query: 1453 ARKSSSILSRDARMRRWSIKKSENGYKRMEEFLASSLKEIMRVNTFEFFVPKVAEIEGRM 1512
            ARKSSSILSRDAR RRW+IKKSE+G+KRMEEFLAS LKEIM+ N+ +FFVPKVAEIE +M
Sbjct: 1515 ARKSSSILSRDARTRRWAIKKSESGFKRMEEFLASRLKEIMKTNSCDFFVPKVAEIEAKM 1574

Query: 1513 KKGYYISHGLGSVKDDISRMCRDAIKAKNRGSAGDMNRITTLFIQLATRLEQGAKSSYYE 1572
            K GYY S GL SVK+DISRMCRDAIKAK+RG A DMN I TLFIQLA+RLE  +K+    
Sbjct: 1575 KSGYYSSRGLSSVKEDISRMCRDAIKAKSRGDASDMNHIVTLFIQLASRLEASSKN-VQG 1633

Query: 1573 REEMMKSWKDESPAGLYSATSKYKKKLSKMVSERKYMNRSNGTSLANGDFDYGEYASDRE 1632
            R+ ++KSW ++SPA   S +SKYKK  +++V+ERKY  RSNG    +   D  +Y SD+E
Sbjct: 1634 RDVLLKSWDNDSPAMFSSTSSKYKK--NRLVNERKY--RSNG---KHNILDNLDYTSDKE 1686

Query: 1633 IRKRLSKLNRKSLDSGSETSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAG 1692
            IR+RLSKLN+KS+ S SETSDDLD S ED KSDS+ST +++ SD + RS    R+ R  G
Sbjct: 1687 IRRRLSKLNKKSMGSESETSDDLDRSFEDDKSDSDSTTAESGSDHEVRSKITTRDPRD-G 1745

Query: 1693 DFTTDEGLDF-SDDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPED 1751
             F+ +  LDF +DDREWGARMTKASLVPPVTRKYEVID Y IVADEE+VRRKM+VSLP+D
Sbjct: 1746 CFSPEGELDFITDDREWGARMTKASLVPPVTRKYEVIDHYCIVADEEEVRRKMQVSLPDD 1805

Query: 1752 YAEKLNAQKNGSEELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELD 1811
            YAEKL+AQKNG+EE DMELPEVK +KPRK+LG++V EQEVYGIDPYTHNLLLDSMP+ELD
Sbjct: 1806 YAEKLSAQKNGTEESDMELPEVKSFKPRKELGNEVIEQEVYGIDPYTHNLLLDSMPEELD 1865

Query: 1812 WNLLEKHLFIEDVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMC 1871
            W+L EKHLFIED LL+TLNK VR  TGTGNTPM YPLQP+I++I++ A + CD R ++MC
Sbjct: 1866 WSLQEKHLFIEDTLLQTLNKHVRSSTGTGNTPMSYPLQPIIDDIKRCAEEGCDARMLRMC 1925

Query: 1872 RGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEV--------------- 1916
            +GILKAM+SRPDDKYVAYRKGLGVVCNKE GF +DDFVVEFLGEV               
Sbjct: 1926 QGILKAMNSRPDDKYVAYRKGLGVVCNKEEGFSQDDFVVEFLGEVRHHICTVLIFNIFLQ 1985

Query: 1917 -YPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRI 1975
             YPVWKWFEKQDGIRSLQK++ DPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRI
Sbjct: 1986 VYPVWKWFEKQDGIRSLQKDSTDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRI 2045

Query: 1976 CHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQV 2035
            CHSCRPNCEAKVTAVDG YQIGIY+VR I +GEEITFDYNSVTESKEEYEASVCLCGSQV
Sbjct: 2046 CHSCRPNCEAKVTAVDGQYQIGIYSVRKIQHGEEITFDYNSVTESKEEYEASVCLCGSQV 2105

Query: 2036 CRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGG 2095
            CRGSYLNLTGEGAF+KVLK+ HG+LDRH LMLEACE N VSEEDY +LGRAGLGSCLLGG
Sbjct: 2106 CRGSYLNLTGEGAFQKVLKDSHGILDRHYLMLEACESNIVSEEDYNDLGRAGLGSCLLGG 2165

Query: 2096 LPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYN 2155
            LP+W+VAY+ARLVRFIN ERTKLPEEIL+HNL+EKRKYFSD+ LEVE+SDAEVQAEGVYN
Sbjct: 2166 LPDWLVAYAARLVRFINFERTKLPEEILKHNLDEKRKYFSDVHLEVERSDAEVQAEGVYN 2225

Query: 2156 QRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLWKGEGSLVEELIQCMAP 2215
            QRLQNLAVTLDKVRYVMRC+FGDP+KAPPP+E+LSPEE VS LWKGEGS VEEL+Q +A 
Sbjct: 2226 QRLQNLAVTLDKVRYVMRCIFGDPRKAPPPLEKLSPEEVVSSLWKGEGSFVEELLQGIAA 2285

Query: 2216 HVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHI 2275
            HVEED+LNDLKSKI A DPS S DI +ELRKSLLWLRDE+R+L CTYKCRHDAAADL+HI
Sbjct: 2286 HVEEDILNDLKSKIHARDPSSSADILKELRKSLLWLRDEIRSLSCTYKCRHDAAADLLHI 2345

Query: 2276 YAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKLGADLQVYRKTYGENYCLGQLIFWH 2335
            YAYTK FFR+QEY+  TSPPV+ISPLDLGPKY +KLGA++Q YRK YGENYCLGQLIFWH
Sbjct: 2346 YAYTKHFFRIQEYQTVTSPPVHISPLDLGPKYTNKLGAEIQEYRKVYGENYCLGQLIFWH 2405

Query: 2336 IQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPW 2395
             Q+N DPD +L RASRGCLSLPDI SFYAK Q PS++RVYGP+TVR ML+RMEKQPQR W
Sbjct: 2406 NQSNTDPDRSLVRASRGCLSLPDINSFYAKAQNPSQNRVYGPRTVRSMLARMEKQPQRSW 2465

Query: 2396 PKDRIWAFKSSPRIFGSPMLDSSLTGCPLDREMVHWLKHRPAIFQAMWDR 2445
            PKD+IW F+SSP+ FGSPMLD+ +    LDREMVHWLKHRP +   MWDR
Sbjct: 2466 PKDQIWLFRSSPKFFGSPMLDAVINNSTLDREMVHWLKHRPDV---MWDR 2512



 Score =  340 bits (871), Expect = 8e-90,   Method: Compositional matrix adjust.
 Identities = 232/530 (43%), Positives = 313/530 (59%), Gaps = 85/530 (16%)

Query: 1   MGDGGVACMPLQQQQQHNSIMERFPISDKTTICVGNSSNNSNKTNNNSISNNNDNKTNND 60
           MGDGGV CMPLQ       IME+   S+K+  C G+   N ++ N  S      ++   D
Sbjct: 1   MGDGGVTCMPLQY------IMEKISSSEKSH-CGGSKFVNGDRKNMKS----RKSELGFD 49

Query: 61  SSNNNGSSSSKNNETNKSNVKKNGVSTKTVRKKIVKIKKVIAVKKKEVQKNSGSSKSNNN 120
             N +GS     ++  K  V++  + T                 + EV+           
Sbjct: 50  RVNKSGSDVENGDKVLKEEVEEGELVTN------------FKWPRSEVE----------- 86

Query: 121 GENIDNKNVENGGAVGEVVTVDKENLKNEEVEEGELGTLKWENGEFVQPEKSQPQSQLQS 180
                   +ENG  V E V       +  E+E GE+   +W+  EF + EK + +S    
Sbjct: 87  --------IENGEIVPENVMS-----RRSEIENGEIVGERWKTREFEKFEKGEFRSG-NW 132

Query: 181 QSKQIEKGEIIVFSSKCRRGETEKGESGLWRGNKDDIEKGEFIPDRWHKEVV--KDEYGY 238
           +   +E+GEI+  S K RRGE E G  G WRG KDD EKGEF+PDRW+K  +  K++YG 
Sbjct: 133 RRDDVERGEIV--SEKGRRGENEYG-PGSWRGGKDDYEKGEFVPDRWYKGEMGGKNDYGN 189

Query: 239 SKSRR--------YDYKLERT-PPSGKYSGEDVYRRKEF-DRSGSQHSKSSSRWESGQER 288
             +RR        + ++ ERT PPS +Y+GED +R+KEF +RSG+QH+K+SSRWE+ Q R
Sbjct: 190 ISNRRNYPGKDKGWKFQRERTPPPSWRYTGEDSFRKKEFINRSGNQHAKNSSRWENAQPR 249

Query: 289 NVRISSKIVDDEGLYKGEHNNGKNHGREYF-HGNRFKRHGTDSDSGDRKYYGDYGDFAGL 347
           NVR SSKIVDDE   K  ++NGK+H R+Y   G+R KR G D D  +RK+   Y DF  L
Sbjct: 250 NVRTSSKIVDDE---KNAYSNGKDHTRDYTSSGSRLKRPGNDFDGYERKH---YADFTNL 303

Query: 348 KSRRLSDDYNSRSVHSEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRVIYDRH 407
           KSRRLSDD N R  +SE+YSR  VE+ +RN++S+R+S+ +KYSSR+HE SLS+R  YDRH
Sbjct: 304 KSRRLSDD-NYRCAYSENYSRRPVEQSYRNNNSTRLSA-EKYSSRNHESSLSTRPAYDRH 361

Query: 408 GRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSP 467
            RSP HS+ SP DR RYYD R+R+P R  RSP+ R+RSPY+ D+SP++RERSPY R    
Sbjct: 362 ERSPVHSEWSPRDRSRYYDQRERTPVR--RSPFGRERSPYSRDKSPHARERSPYMRS--- 416

Query: 468 YAREKSPYDRSRHYDHRNRSPFSAERSPQDRA-RFHDRSDRTPNYLERSP 516
                  +DRSR +DH+ RSP   E+SPQD+  R HD   R+P   E SP
Sbjct: 417 -------WDRSRQHDHKLRSPVRTEQSPQDQGWRQHDHKLRSPARTEHSP 459


>gi|224095776|ref|XP_002310475.1| SET domain protein [Populus trichocarpa]
 gi|222853378|gb|EEE90925.1| SET domain protein [Populus trichocarpa]
          Length = 2350

 Score = 2293 bits (5943), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1129/1522 (74%), Positives = 1295/1522 (85%), Gaps = 39/1522 (2%)

Query: 926  DERNDGSGGSRSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSS 985
            D+RND  G SRST +K    RGVKGT+LPVVRINACVV DH   VSE R+KVR K+R+ S
Sbjct: 866  DDRNDTGGVSRSTLNKPPITRGVKGTVLPVVRINACVVQDH--VVSETRTKVRGKDRYHS 923

Query: 986  RSARSYSSANDVRRSSAESDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGE 1045
            RSAR++S+ NDV+ SS E DS S+  N+QDS G WKS A +NTPKDRLCT DDLQL LG+
Sbjct: 924  RSARTHSATNDVKSSSVECDSQSRVVNDQDSHGCWKSTASLNTPKDRLCTADDLQLNLGD 983

Query: 1046 WYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRNHG 1105
            WYYLDG+GHERGP SFSELQ L D+G IQK++SVFRKFD+VWVP+  ATETS + VR   
Sbjct: 984  WYYLDGSGHERGPLSFSELQNLADKGTIQKYSSVFRKFDRVWVPVASATETSEAAVRIQQ 1043

Query: 1106 EKIMPSGDSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNR 1165
              +  S  SSG    +SQ A   ESN +  S++FH++HPQFIG+TRGKLHELVMKSYKNR
Sbjct: 1044 SNVELSVGSSGTL-LKSQTAANIESNKD--SSSFHSLHPQFIGFTRGKLHELVMKSYKNR 1100

Query: 1166 EFAAAINEVLDPWINAKQPKKETE-HVYRKSEGDTRAGKRARLLVRESDGDEETEEE-LQ 1223
            EFA AINE LDPWI AKQP+KE + H+Y KSE D R GKRA +   +   D E EE+ L 
Sbjct: 1101 EFAVAINEALDPWIVAKQPQKELDKHMYLKSEIDVRVGKRAWMQPDQIVKDNEMEEDTLH 1160

Query: 1224 TIQDESTFEDLCGDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLT 1283
             +  E+TFE LCGD +F  EES  S IE+G WGLLDGH LA +FHFLRSD+KSL FASLT
Sbjct: 1161 KV--ETTFEQLCGDTNFHREESMCSEIEAGSWGLLDGHMLARIFHFLRSDLKSLVFASLT 1218

Query: 1284 CRHWRAAVRFYKGISRQVDLSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGML 1343
            C+HWRAAV FYKGIS QVDLSSVG NCTD ++R  +N ++KEK+N+++L GCTN+TSGML
Sbjct: 1219 CKHWRAAVSFYKGISIQVDLSSVGLNCTDLMVRSIMNGYNKEKINAMVLTGCTNVTSGML 1278

Query: 1344 EEILQSFPHLSSIDIRGCGQFGELALKFPNINWVKSQKSRGAKFNDSRSKIRSLKQITEK 1403
            EEIL S P LSSIDIRGC QF EL  +FP ++W+KS   R     +S SK+RSLKQI+ +
Sbjct: 1279 EEILCSLPCLSSIDIRGCTQFMELVHQFPRVSWLKS---RTRIPEESNSKLRSLKQISGR 1335

Query: 1404 SSSAPKSKGLGDDMDDFGDLKDYFESVDKRDSANQSFRRSLYQRSKVFDARKSSSILSRD 1463
                          DDFG+LK+YF+SV+KRDSANQ FRRSLY+RSKVFDARKSSSILSRD
Sbjct: 1336 --------------DDFGELKEYFDSVNKRDSANQLFRRSLYKRSKVFDARKSSSILSRD 1381

Query: 1464 ARMRRWSIKKSENGYKRMEEFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGLG 1523
            ARMRRW++KKSEN Y RME FLA+ LK+IM+ N F+FFVPKVAEIE RMK GYY+ HGL 
Sbjct: 1382 ARMRRWAVKKSENSYTRMEGFLAAGLKDIMKENIFDFFVPKVAEIEDRMKNGYYVGHGLR 1441

Query: 1524 SVKDDISRMCRDAIKAKNRGSAGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKDE 1583
            SVK+DISRMCRDAIK KNRG AGDMN I TLF QLA+RLE+ +K SY ER+E+MKSWKD+
Sbjct: 1442 SVKEDISRMCRDAIKVKNRG-AGDMNHIITLFFQLASRLEESSKFSY-ERDELMKSWKDD 1499

Query: 1584 SPAGLYSATSKYKKKLSKMVSERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRK 1643
              A L SA      K  K  + +KYMNRSNGT  ANG FDYGEYASD+EI+KR+SKLNRK
Sbjct: 1500 LSAALDSAP----MKHKKKATGKKYMNRSNGTIPANGSFDYGEYASDQEIKKRISKLNRK 1555

Query: 1644 SLDSGSETSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLDFS 1703
            S+DSGSETSDD   SSEDG+S S+ST SDT+SD+DFRS+GR  ESRG     TDE     
Sbjct: 1556 SMDSGSETSDDR--SSEDGRSGSDSTASDTESDLDFRSEGRTGESRGDRYCMTDE----- 1608

Query: 1704 DDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGS 1763
            D+REWGARMTK SLVPPVTRKYEVIDQY+IVADEEDV+RKM VSLP+DYAEKL+AQKNG+
Sbjct: 1609 DEREWGARMTKVSLVPPVTRKYEVIDQYLIVADEEDVQRKMSVSLPDDYAEKLDAQKNGT 1668

Query: 1764 EELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIED 1823
            EELDMELPEVKDYKPRKQLGD+V EQEVYGIDPYTHNLLLDSMP+E+DW LL+KH+FIED
Sbjct: 1669 EELDMELPEVKDYKPRKQLGDEVIEQEVYGIDPYTHNLLLDSMPEEVDWPLLQKHMFIED 1728

Query: 1824 VLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPD 1883
            VLL TLNKQVRHFTG GNTPM Y +QPV+EEIE+ A++DCD+R MK+CRGIL+A+DSRPD
Sbjct: 1729 VLLCTLNKQVRHFTGAGNTPMTYAIQPVVEEIEQAAMEDCDIRKMKICRGILRAIDSRPD 1788

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            DKYVAYRKGLGVVCNKEGGFG+DDFVVEFLGEVYP WKWFEKQDGIR LQK++++PAPEF
Sbjct: 1789 DKYVAYRKGLGVVCNKEGGFGDDDFVVEFLGEVYPAWKWFEKQDGIRLLQKDSKEPAPEF 1848

Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
            YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSC+PNCEAKVTAVDG YQIGIYTVR 
Sbjct: 1849 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCKPNCEAKVTAVDGQYQIGIYTVRE 1908

Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRH 2063
            I +GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF+KVLKE HGLLDRH
Sbjct: 1909 IQHGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFQKVLKEWHGLLDRH 1968

Query: 2064 QLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEIL 2123
             LML ACELNSVSEEDYL+LGRAGLGSCLLGGLP+WVVAYSARLVRFINLERTKLPEEIL
Sbjct: 1969 YLMLGACELNSVSEEDYLDLGRAGLGSCLLGGLPDWVVAYSARLVRFINLERTKLPEEIL 2028

Query: 2124 RHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAP 2183
            RHNL+EKRKYF+D CLEVE+SDAEVQAEGVYNQRLQNLAVTLDKVRYVMRC+FGDPK+AP
Sbjct: 2029 RHNLKEKRKYFADTCLEVERSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCIFGDPKQAP 2088

Query: 2184 PPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRE 2243
            PP+E+L+PEETVSFLWKG+GSLV+EL+QCM+P+++ED+LNDLKSK+ AHDPS  +DIQ+ 
Sbjct: 2089 PPLEKLTPEETVSFLWKGDGSLVDELLQCMSPYMDEDMLNDLKSKVCAHDPSDCDDIQKA 2148

Query: 2244 LRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDL 2303
            L+KSLLWLRDEVR+LPCTYKCRHDAAADLIH+YAYTK FFRV++Y AFTSPPV+ISPLDL
Sbjct: 2149 LQKSLLWLRDEVRSLPCTYKCRHDAAADLIHVYAYTKSFFRVRDYDAFTSPPVHISPLDL 2208

Query: 2304 GPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFY 2363
            GPK ADKLG     Y+KTYG +YC+GQLIFWH+QTN +PD TLA+AS+GCLSLP+IGSFY
Sbjct: 2209 GPKCADKLGGLPHKYQKTYGGSYCMGQLIFWHVQTNTEPDFTLAKASKGCLSLPEIGSFY 2268

Query: 2364 AKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSLTGCP 2423
            AKVQKPS+ R+YGPKTV+ ML RMEK PQ+PWPKD+IW+FK+SP++FGSPMLD+ L   P
Sbjct: 2269 AKVQKPSQQRIYGPKTVKMMLERMEKYPQKPWPKDQIWSFKNSPKVFGSPMLDAVLNNAP 2328

Query: 2424 LDREMVHWLKHRPAIFQAMWDR 2445
            LDREMVHWLKHRP ++QA+WDR
Sbjct: 2329 LDREMVHWLKHRPTVYQAVWDR 2350



 Score =  795 bits (2053), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 489/890 (54%), Positives = 579/890 (65%), Gaps = 130/890 (14%)

Query: 1   MGDGGVACMPLQQQQQHNSIMERFPISDK-------------TTICVGNSSNNSNKTNNN 47
           MGDGGVACMPLQ    +  + ERFP+ +K             TT  V   +NNSN  +  
Sbjct: 1   MGDGGVACMPLQHSSNNIIMEERFPVQEKNTTTTVTAAVPSTTTTKVETVNNNSNSGSGG 60

Query: 48  SISNNNDNKTNNDSSNNNGSSSSKNNETNKSNVKKN------------------------ 83
             SNNN+N  ++     NG S+S NN       K                          
Sbjct: 61  GSSNNNNNNVSSGDKKGNGKSNSSNNGVTGKVKKVKRIVKVKKVVRKVVVGEKKGVGLVR 120

Query: 84  ------GVSTKTV---------RKKIVKIKKVIAVKKKEVQKNSGSSKSNNNGENID--N 126
                 G  +K V          KK  K K+V A KK+   K   +++   +G  I   +
Sbjct: 121 EVKSACGSGSKEVVVLEKKESGLKKEEKSKEVTAEKKESGWKKELAAEKKESGLKISSGS 180

Query: 127 KNVENGGAVGEVVTVDKENLKN--EEVEEGELGTLKW------ENGEFVQ-PEKSQPQSQ 177
           K VENG  +G   T  +    N  EEVEEGELGTLKW      ENGEFV  PEK      
Sbjct: 181 KTVENGDGLGSGDTKLQSGSNNIKEEVEEGELGTLKWPTKGEIENGEFVPIPEK------ 234

Query: 178 LQSQSKQIEKGEIIVFSSKCRRGETEKGESGLWRGNK--------DDIEKGEFIPDRWHK 229
              +  +IE+GEI   S K ++G+ EKGE  +  GNK        D+IEKGEFIPDRW+ 
Sbjct: 235 --PRRSEIERGEI--GSEKWKKGDIEKGE--IVSGNKWQRGEVVRDEIEKGEFIPDRWNG 288

Query: 230 EVVKDEYGYSKSR-RYDYKLERTPPSGKYSGEDVYRRKEFDRS-GSQHSKSSSRWESGQE 287
              KDEYGY +SR RYD   ERTPPSGKYS EDV RRKE  RS GS HSKSS RWESGQE
Sbjct: 289 ---KDEYGYIRSRGRYDMSRERTPPSGKYSCEDVNRRKELTRSGGSLHSKSSMRWESGQE 345

Query: 288 RNVRISSKIVDDEGLYKGEHNNGKNHGREYFHGNRFKRHGTDSDSGDRKYYGDYGDFAGL 347
           R+ RISSKIVD+EG YK E++NGKN GREY  GNR KRHGTDSDS +RK+YGDY   +  
Sbjct: 346 RSTRISSKIVDEEGSYKSEYSNGKNPGREYSSGNRLKRHGTDSDSTERKHYGDY---SSS 402

Query: 348 KSRRLSDDYNSRSVHSEHYSRHSVEKFHRN-SSSSRISSLDKYSSRHHEPSLSSRVIYDR 406
           KSRRLS+D  SR  +SEHYSRHSVE+F++N SSSSR+S  DKYSSRHHE +L S+V+YDR
Sbjct: 403 KSRRLSED-GSRYAYSEHYSRHSVERFYKNSSSSSRVSLSDKYSSRHHESTLPSKVVYDR 461

Query: 407 HGRSPSHSDRSPHDRGRYYDHRDRSPSRH----------------------------DRS 438
           H     HSD SPH+R RY DHRDRSP RH                            +RS
Sbjct: 462 H----VHSDWSPHERPRYNDHRDRSPIRHEKSPYGRERTPYGLERSPYGRERSPYGRERS 517

Query: 439 PYTRDRSPYTFDRSPYSRERSPYNRDRSPYAREKSPYDRSRHYDHRNRSPFSAERSPQDR 498
           PY RDRSPY  DRSPY RE+SPY R+RSPY  EKSPYDRSRHY+HR RSP   ERSPQDR
Sbjct: 518 PYWRDRSPYGHDRSPYGREKSPYGRERSPYGLEKSPYDRSRHYEHRKRSPSYVERSPQDR 577

Query: 499 ARFHDRSDRTPNYLERSPLHRSRPNNHREASSKTGASEKRNARYDSKGHEDKLGPKDSNA 558
           AR HDRSDRTPNYLERSP  R++PNN+REA  K GA+EKRN++Y +K  EDK+  KD +A
Sbjct: 578 ARHHDRSDRTPNYLERSPHDRAKPNNYREA-RKGGATEKRNSQYGNKQQEDKISQKDPDA 636

Query: 559 RCSRSSAKESQDKSNVQDLNVSDEKTANCESHKEEQPQSSSVDCKEPPQVDGPPLEELVS 618
           R +  SAKESQDKS+V +L+  DEK A+ E+  EE+ +S  ++ KEPPQVDGPP EEL S
Sbjct: 637 RDTEPSAKESQDKSSVLNLDGLDEKNASSETRIEEKSESPRINVKEPPQVDGPPPEELQS 696

Query: 619 MEEDMDICDTPPHVPAVTDSSVGKWFYLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKH 678
           MEEDMDICDTPPHVPAV D+S GKWFYLDH G+ECGPS+LC+LK LV+EG L+SDHFIKH
Sbjct: 697 MEEDMDICDTPPHVPAVADTSTGKWFYLDHFGVECGPSKLCELKALVDEGSLMSDHFIKH 756

Query: 679 LDSNRWETVENAVSPLVTVNFPSITSDSVTQLVSPPEASGNLLADTGDTAQS---TGEEF 735
           L S+RW T+ENA+SP V VNFPS+  D++TQLVSPPEA GNLLADTGD  QS    GE  
Sbjct: 757 LHSDRWLTIENALSPFVPVNFPSVVPDAITQLVSPPEAPGNLLADTGDIGQSCAQIGEGV 816

Query: 736 PVT-LQSQCCPDGSAAAAESSEDLHIDVRVGALLDGFTVIPGKEIETLGE 784
               L+   CPD S  A+ES EDL ID RVGALL+GF+V+PG E+ET+G+
Sbjct: 817 SGNFLKPPVCPDHSEIASESLEDLQIDERVGALLEGFSVVPGSELETVGD 866


>gi|222640020|gb|EEE68152.1| hypothetical protein OsJ_26262 [Oryza sativa Japonica Group]
          Length = 2255

 Score = 2122 bits (5499), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1179/2335 (50%), Positives = 1501/2335 (64%), Gaps = 187/2335 (8%)

Query: 150  EVEEGELGTLKWENGEFVQPEKSQPQSQLQ-----SQSKQIEKGEIIVFSSKCRRGETEK 204
            E+EEGEL   + +N      E+S P  + +     S + ++E GEI++ S K R+     
Sbjct: 61   ELEEGELLNGEADNSSSRDLERSMPPKKWRKVLAASSAAEVEPGEIVMPSKKARKN---- 116

Query: 205  GESGLWRGNKDDIEKGEFIPDRWHKEVVKDEYGYSKSRRYDYKLERTPPSGKYSGEDVYR 264
            GE          +EKGE  P+R  K+         KS R   K E  P      GE    
Sbjct: 117  GE----------LEKGEIAPERQRKD------KSDKSGRKSNKDEVEP------GEVAPP 154

Query: 265  RKEFDRSGSQHSKSSSRWESGQERNVRISSKIVDDEGLYKGEHNN-----GKNHGREYFH 319
             K+ DR  ++   SS++               V D+G  KG   +     G+        
Sbjct: 155  DKKQDRDHNKKLGSSAQ---------------VRDDGSKKGSSRDSDEEPGEIRPESSST 199

Query: 320  GNRFKRHGTDSDSGDRKYYGDYGDFAGLKSRRLSDDYNSRSVHSEHYSRHSVEKFHRNSS 379
            G+  K   T+ ++ + K+  D  D  G KSRR  +  +S         RH          
Sbjct: 200  GSARKSRATEPENSNHKHQADTCDQTGSKSRRKGEAKSS--------GRH---------- 241

Query: 380  SSRISSLDKYSSRHHEPSLSSRVIYDRHGRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSP 439
                      S R+ + S  +R   DRH RSP    R PHDR R+    DRSPSR + SP
Sbjct: 242  ---------LSGRNRDISPMTR---DRHERSPGILGRFPHDRLRH----DRSPSRLEPSP 285

Query: 440  YTRDRSPYTFDRSPYSRERSPYNRDRSPYAREKSPYDRSRHYDHRNRSPFSAERSPQDRA 499
              R R     DRSPY    SP +R R  + R+ +P  R   + HR+ +P   + SP+ R+
Sbjct: 286  RDRGRHYDNRDRSPYI---SPRHRMRPSHYRDNTP-SRGEMHHHRDNTPSRVDSSPR-RS 340

Query: 500  RFHDRSDRTPNYLERSPLHRSRPNNHREASSKTGASEKRNARYDSKGHEDKLGPKD---- 555
            +  D  DR+P+  ++SP  R R     EA  K+  ++  N   +   H+ K   +     
Sbjct: 341  QHEDFRDRSPSRRDKSPSERGRTTESHEAGKKSRGAKLENNSLEKAQHKSKSTKQSTKSK 400

Query: 556  -----SNARCSRSSAKESQDKSNVQDLNVSDEKTANCESHKEEQPQSSSVDCKEPPQV-- 608
                 SN + S+  A E+   + +                    P +       PP+   
Sbjct: 401  SSSNGSNEKISKEKATETIQYTELPPPPPLPPPPPPPPPPPPPLPPNMPPPLPPPPEPEL 460

Query: 609  DGPPLEELVSMEEDMDICDTPPHV----PAVTD---SSVGKWFYLDHCGMECGPSRLCDL 661
            +G P E+ VSMEEDMDICDTPPH     P  T+   S VGKWFYLDH G+E GPS+L DL
Sbjct: 461  NGAPAED-VSMEEDMDICDTPPHTTSSAPGPTEPPASDVGKWFYLDHYGIEQGPSKLADL 519

Query: 662  KTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSITSDSVTQLVSPPEASGNLL 721
            K LVE+G L+SDH IKH DSNRW TVENA SPLV   FPS+ SD  TQLVSPPEA GNLL
Sbjct: 520  KKLVEDGYLLSDHLIKHADSNRWVTVENAASPLVPSEFPSVYSDVSTQLVSPPEAPGNLL 579

Query: 722  ADTGDTAQSTGEEFPVTLQSQCCPDGSAAAAESSEDLHIDVRVGALLDGFTVIPGKEIET 781
             +  + A  T  E               A+AE  ED +ID RV AL+DG  ++ G+E+E 
Sbjct: 580  DEAREEASGTDHE-----------QMKEASAEEQEDFYIDDRVDALMDGSIMVDGQELEI 628

Query: 782  LGEILQTTFERVDWQNNGGPTWHGACVGEQKPGDQKVDELYISDTKMKEAAELKSGDKDH 841
            LGE+L   FE V+W++     +      E+  G ++  E    D++      +   ++D 
Sbjct: 629  LGELLNAHFEPVNWESEDLSRFQVKL--ERDDGTKRSTEF--PDSRTAHIYGVVPAERDT 684

Query: 842  WVVCFDSDEWFSGRWSCKGGDWKRNDEAAQDRCSRKKQVLNDGFPLCQMPKSGYEDPRWN 901
            +    +S EW+SGRWSCKGGDWKRND+ +QD+  RKK VLN+G+PLCQMPK  +EDPRW 
Sbjct: 685  YQPHIESSEWYSGRWSCKGGDWKRNDDFSQDKPYRKKLVLNEGYPLCQMPKGNHEDPRWG 744

Query: 902  QKDDLYYPSHSRRLDLPPWAYACPDERND--------GSGGSRSTQSKLAAVRGVKGTML 953
             KDDLYYP  +++LDLP WA++  +E +D        G    RS Q+K    +GVKGT L
Sbjct: 745  CKDDLYYPLRAKKLDLPLWAFSSTEENDDTVDDASKSGVMPGRSGQTKQPP-KGVKGTTL 803

Query: 954  PVVRINACVVNDHGSFVSEPRSKVRAKERHSSRSARSYSSANDVRRSSAESDSHSKARNN 1013
            PVV+INA VV D  S  SE R K +  +R  SRS+RS+S   D R S+ E  SHSK  + 
Sbjct: 804  PVVKINARVVKDQSS--SELRIKPKVADRPPSRSSRSHSIGTD-RSSTHEGSSHSKKHHE 860

Query: 1014 QDSQGSWKSIACINTPKDRLCTVDDLQLQLGEWYYLDGAGHERGPSSFSELQVLVDQGCI 1073
             DSQ   KS +  N PKD +CTV++L +++G+WYYLDG GHERGP S+SELQ L  +G I
Sbjct: 861  HDSQSLHKSKSVPNIPKDHVCTVEELSVKVGDWYYLDGTGHERGPFSYSELQELAKKGTI 920

Query: 1074 QKHTSVFRKFDKVWVPLTFATETSASTVRNHGEKIMPSGDSSGLPPTQSQDAVLGESNNN 1133
             + +SVFRK D  W+P+                K + SG S+      S  + L  SN +
Sbjct: 921  LEGSSVFRKIDNTWLPVL---------------KDLKSGCSARNGEAGSSTSALTHSNQS 965

Query: 1134 VNSNAFHTMHPQFIGYTRGKLHELVMKSYKNREFAAAINEVLDPWINAKQPKKETEHVYR 1193
                 FH MHPQF+GYTRGKLHELVMK +K+RE   AINEVL+PWI  KQP+KE E  + 
Sbjct: 966  ----NFHEMHPQFVGYTRGKLHELVMKYFKSRELTLAINEVLEPWIATKQPRKELETFFS 1021

Query: 1194 KSEG-------DTRAGKRARLLVRESDG-DEETEEELQTIQDESTFEDLCGDASFPGEES 1245
             S         D  + KRARLL  +SD   + +E+ L + +D+  FEDL   A+   E  
Sbjct: 1022 HSSASKNFVQEDGGSTKRARLLPDQSDEYTDMSEDILASQKDDCCFEDLFEGAAHVKESP 1081

Query: 1246 ASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRFYKGISRQVDLSS 1305
             +S  ES  WGLL+ H LA +FHFLR+D+KSL  ++ TC  W  A ++Y+ + R +DLSS
Sbjct: 1082 LNSRTESESWGLLNEHVLARIFHFLRADVKSLISSAATCSWWNTAAKYYRSVCRFIDLSS 1141

Query: 1306 VGPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGMLEEILQSFPHLSSIDIRGCGQFG 1365
            +GP CTD++    +  +D + + +++L GC+N++S  L E+L+ FPH+S + I+GC Q G
Sbjct: 1142 LGPQCTDNVFHDIMAGYDMQNIRTLVLTGCSNLSSLALAEVLKRFPHISYVHIQGCSQLG 1201

Query: 1366 ELALKFPNINWVKSQKSRGAKFNDSRSKIRSLKQITEKSSSAPKS-KGLGDDMDDFGDLK 1424
            +L  KF ++ W+KS  +  A +     KIRSLKQI + S+S  K+ + L   M    +L 
Sbjct: 1202 DLKNKFQHVKWIKSSLNPDASYQ----KIRSLKQIDDGSNSTSKAGRILTSQMGGSDELD 1257

Query: 1425 DYFESVDKRDSANQSFRRSLYQRSKVFDARKSSSILSRDARMRRWSIKKSENGYKRMEEF 1484
             YF  +  R+S+  SF +  Y+RSK  D RKSS++LSRDA+MRR   +K+EN Y++MEEF
Sbjct: 1258 GYFADISNRESSTLSFGQGFYKRSKWLDIRKSSAVLSRDAQMRRLMQRKAENSYRKMEEF 1317

Query: 1485 LASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGLGSVKDDISRMCRDAIKAKNRGS 1544
            + + LKEIM+ + F+FFVPKVA+IE R+K GYY  HG   +K+DI  MCRDA++ K R  
Sbjct: 1318 VINKLKEIMKSSRFDFFVPKVAKIEVRLKNGYYARHGFSYIKNDIRSMCRDALRYKGRSD 1377

Query: 1545 AGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKDESPAGLYSATSKYKKKLSKMVS 1604
             GDM +I   FIQLA +LE     S  +   + K   D S    YS+  K KKK SK +S
Sbjct: 1378 LGDMKQIVVAFIQLAKKLENPRLISDRDGTAVQK---DSSDMSQYSSDLKLKKKQSKTMS 1434

Query: 1605 ERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRKSLDSGSETSDDLDGSSEDGKS 1664
            ER+      G +      D    A DREI++ LSKL ++ +DSGSETSDD DG SE  ++
Sbjct: 1435 ERR------GANWTTAGADPSSRAFDREIKRSLSKLKKRDIDSGSETSDDDDGYSEGDET 1488

Query: 1665 DSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLDFSDDREWGARMTKASLVPPVTRK 1724
            +SE+TVSDT+SD+D  S     +  G   F + E L  +DDR WGARMTKASLVPPVTRK
Sbjct: 1489 ESETTVSDTESDLDVNSGAWDLKGNGMKLFESSESL--TDDRGWGARMTKASLVPPVTRK 1546

Query: 1725 YEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEELDMELPEVKDYKPRKQLGD 1784
            YEVI++Y+IVADEE+V RKMRV+LP+DY+EKL +QKNG+E L  ELPEVKDY+PRK  GD
Sbjct: 1547 YEVIEKYLIVADEEEVLRKMRVALPDDYSEKLLSQKNGTENL--ELPEVKDYQPRKVPGD 1604

Query: 1785 QVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNKQVRHFTGTGNTPM 1844
            +V EQEVYGIDPYTHNLLL+ MP ELDW   +KH F+E++LL TLNKQVR FTG+GNTPM
Sbjct: 1605 EVLEQEVYGIDPYTHNLLLEMMPTELDWPSSDKHTFVEELLLNTLNKQVRQFTGSGNTPM 1664

Query: 1845 MYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFG 1904
            +YPL+PVIEEI+K A +  D RT KMC G+LKAM + P+     Y  GLGVVCNK GGFG
Sbjct: 1665 VYPLKPVIEEIQKSAEESGDRRTSKMCLGMLKAMRNHPE-----YNYGLGVVCNKTGGFG 1719

Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
             DDFV+EF GEVYP W+W+EKQDGI+ +Q N++D APEFYNI LERPKGD DGYDLV VD
Sbjct: 1720 VDDFVIEFFGEVYPSWRWYEKQDGIKHIQNNSDDQAPEFYNIMLERPKGDRDGYDLVFVD 1779

Query: 1965 AMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY 2024
            AMHKANYASRICHSC PNCEAKVTAVDGHYQIGIYTVR I  GEEITFDYNSVTESKEE+
Sbjct: 1780 AMHKANYASRICHSCNPNCEAKVTAVDGHYQIGIYTVRPIAEGEEITFDYNSVTESKEEH 1839

Query: 2025 EASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELG 2084
            EASVCLCGSQ+CRGSYLN +GEGAFEKVL E HG+LDRH L+L+ACE NSVS++D ++LG
Sbjct: 1840 EASVCLCGSQICRGSYLNFSGEGAFEKVLMEFHGVLDRHSLLLQACEANSVSQQDLIDLG 1899

Query: 2085 RAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICLEVEKS 2144
            RAGLG+CLL GLP W+VAY+A LVRFI  ER KLP EI +HN++EKR++F+DI ++ EK+
Sbjct: 1900 RAGLGTCLLAGLPGWLVAYTAHLVRFIFFERQKLPHEIFKHNVDEKRQFFTDINMDSEKN 1959

Query: 2145 DAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLWKGEGS 2204
            DAEVQAEGV N RLQNL  TLDKVRYVMRC+FGDPK APPP+ RL+    VS +WKGEGS
Sbjct: 1960 DAEVQAEGVLNSRLQNLTHTLDKVRYVMRCIFGDPKNAPPPLVRLTGRSLVSAIWKGEGS 2019

Query: 2205 LVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKC 2264
            LV+EL++ M PHVEEDVL DLK+KI+AHDPSGSEDI+ E+R SLLWLRDE+R L CTYKC
Sbjct: 2020 LVDELLESMEPHVEEDVLTDLKAKIRAHDPSGSEDIEGEIRSSLLWLRDELRTLSCTYKC 2079

Query: 2265 RHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKLGADLQVYRKTYGE 2324
            RHDAAADLIH+YAYTKCFFRV++YK   SPPV ISPLDLGPKYADKLG   Q Y KTY E
Sbjct: 2080 RHDAAADLIHMYAYTKCFFRVRDYKTVKSPPVLISPLDLGPKYADKLGPGFQEYCKTYPE 2139

Query: 2325 NYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFML 2384
            NYCLGQLI+W+ Q NA+P+  L RA +GC+SLPD+ SFY K  KP++ RVYG +TVRFML
Sbjct: 2140 NYCLGQLIYWYSQ-NAEPESRLTRARKGCMSLPDVSSFYVKSVKPTQERVYGSRTVRFML 2198

Query: 2385 SRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSLTGCPLDREMVHWLKHRPAIF 2439
            +RME Q QRPWPKDRIW FKS PR FG+PM+D+ L   PLD+EMVHWLK R  +F
Sbjct: 2199 ARMENQAQRPWPKDRIWVFKSDPRFFGTPMMDAVLNNSPLDKEMVHWLKTRSNVF 2253


>gi|2244876|emb|CAB10297.1| hypothetical protein [Arabidopsis thaliana]
 gi|7268264|emb|CAB78560.1| hypothetical protein [Arabidopsis thaliana]
          Length = 2351

 Score = 2066 bits (5352), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1071/1765 (60%), Positives = 1298/1765 (73%), Gaps = 161/1765 (9%)

Query: 709  QLVSPPEASGNLLADTGDTA------QSTGEEFPVTLQSQCCPDGSAAAAESSEDLHIDV 762
            +LV+PPEA GNLL D  DT       Q  G+  P  +  +  PDG     E+ ED  ID+
Sbjct: 570  ELVNPPEAPGNLLEDIADTTEAVCIEQGAGDSLPELVSVRTLPDGKEIFVENREDFQIDM 629

Query: 763  RVGALLDGFTVIPGKEIETLGEILQTTFE----------RVDWQNNG--GPTWHG---AC 807
            RV  LLDG T+ PG+E ETLGE L+   E           V   NN    P         
Sbjct: 630  RVENLLDGRTITPGREFETLGEALKVNVEFEETRRCVTSEVFAPNNTKFSPKQKAEPNKF 689

Query: 808  VGEQKPGDQKVDELYISDTKMKEAAELKSGDKDHWVVCFDSDEWFSGRWSCKGGDWKRND 867
            VG  +P  + ++E    D    E+ E+ S              WFSGRWSCKGGDW R D
Sbjct: 690  VGMFRPMKRAIEEFKSDDAYGSESDEIGS--------------WFSGRWSCKGGDWIRQD 735

Query: 868  EAAQDRCSRKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYACPDE 927
            EA+QDR  +KK VLNDGFPLC M KSG+EDPRW+ KDDLYYP  S RL+LP WA++  DE
Sbjct: 736  EASQDRYYKKKIVLNDGFPLCLMQKSGHEDPRWHHKDDLYYPLSSSRLELPLWAFSVVDE 795

Query: 928  RNDGSGGSRSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSSRS 987
            RN                RGVK ++L VVR+N+ VVND    + +PR+KVR+KER  SR 
Sbjct: 796  RNQ--------------TRGVKASLLSVVRLNSLVVNDQVPPIPDPRAKVRSKERCPSRP 841

Query: 988  ARSYSSANDVRRSSAESDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGEWY 1047
            AR   +++D +R S ES S S A   QDSQG WK+   +NTP+DRLCTVDDLQL +G+W+
Sbjct: 842  ARPSPASSDSKRESVESHSQSTASTGQDSQGLWKTDTSVNTPRDRLCTVDDLQLHIGDWF 901

Query: 1048 YLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRNHGEK 1107
            Y DGAG E+GP SFSELQ LV++G I+ H+SVFRK DK+WVP+T  T++  +     G+ 
Sbjct: 902  YTDGAGQEQGPLSFSELQKLVEKGFIKSHSSVFRKSDKIWVPVTSITKSPETIAMLRGKT 961

Query: 1108 IMPSGDSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNREF 1167
                    GL  +++QD    E + ++NS  FH +HPQF+GY RGKLH+LVMK++K+R+F
Sbjct: 962  PALPSACQGLVVSETQDFKYSEMDTSLNS--FHGVHPQFLGYFRGKLHQLVMKTFKSRDF 1019

Query: 1168 AAAINEVLDPWINAKQPKKETEHVYRKSEGDTR-----------------------AGKR 1204
            +AAIN+V+D WI+A+QPKKE+E    +S G                          +  R
Sbjct: 1020 SAAINDVVDSWIHARQPKKESEKYMYQSSGMHNYQNLNFPLTYWFLNLGGCLLLFFSPSR 1079

Query: 1205 ARLLVRESDGDEETEEELQTIQDESTFEDLCGDASFPGEESASSAIESGGWGLLDGHTLA 1264
            ARL+  ES  D E E+     +DE TFEDLCGD +F  E + S+      WGLLDGH LA
Sbjct: 1080 ARLMAGESGEDSEMEDTQMFQKDELTFEDLCGDLTFNIEGNRSAGTVGIYWGLLDGHALA 1139

Query: 1265 HVFHFLRSDMKSLAFASLTCRHWRAAVRFYKGISRQVDLSSVGPNCTDSLIRKTLNAFDK 1324
             VFH LR D+KSLAFAS+TCRHW+A +  YK ISRQVDLSS+GP+CTDS +R  +N ++K
Sbjct: 1140 RVFHMLRYDVKSLAFASMTCRHWKATINSYKDISRQVDLSSLGPSCTDSRLRSIMNTYNK 1199

Query: 1325 EKLNSILLVGCTNITSGMLEEILQSFPHLSSIDIRGCGQFGELALKFPNINWVKSQKSRG 1384
            EK++SI+LVGCTN+T+ MLEEIL+  P +SS+DI GC QFG+L + + N++W++ Q +R 
Sbjct: 1200 EKIDSIILVGCTNVTASMLEEILRLHPRISSVDITGCSQFGDLTVNYKNVSWLRCQNTR- 1258

Query: 1385 AKFNDSRSKIRSLKQITEKSSSAPKSKGLGDDMDDFGDLKDYFESVDKRDSANQSFRRSL 1444
                                S   KSKGLG D DDFG+LKDYF+ V+KRDSANQ FRRSL
Sbjct: 1259 --------------------SDVAKSKGLGGDTDDFGNLKDYFDRVEKRDSANQLFRRSL 1298

Query: 1445 YQRSKVFDARKSSSILSRDARMRRWSIKKSENGYKRMEEFLASSLKEIMRVNTFEFFVPK 1504
            Y+RSK++DAR+SS+ILSRDAR+RRW+IKKSE+GYKR+EEFLASSL+ IM+ NTF+FF  K
Sbjct: 1299 YKRSKLYDARRSSAILSRDARIRRWAIKKSEHGYKRVEEFLASSLRGIMKQNTFDFFALK 1358

Query: 1505 V------AEIEGRMKKGYYISHGLGSVKDDISRMCRDAIK-------------AKNRGSA 1545
            V      ++IE +MK GYY+SHGL SVK+DISRMCR+AI                  G +
Sbjct: 1359 VLSGTCVSQIEEKMKNGYYVSHGLRSVKEDISRMCREAINFVIFLLTLLCIQGGGIEGGS 1418

Query: 1546 GDMNRITTLFIQLATRLEQGAK-SSYYEREEMMKSWKDESPAGLYSATSKYKKKLSKMVS 1604
             DMNRI  LFIQLATRLE+ +  +S Y R+E+MKSW+D S  GL SAT KY KKLSK V+
Sbjct: 1419 KDMNRIIALFIQLATRLEEVSMITSSYGRDELMKSWQDGS--GLSSAT-KYNKKLSKTVA 1475

Query: 1605 ERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRKSLDSGSETSDDLDGSSEDGKS 1664
            E+KYM+R++ T   NG  DYGEYASDREI++RLSKLNRKS  S S+TS +    S++GKS
Sbjct: 1476 EKKYMSRTSDTFGVNGASDYGEYASDREIKRRLSKLNRKSFSSESDTSSE---LSDNGKS 1532

Query: 1665 DSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLD-FSDDREWGARMTKASLVPPVTR 1723
            D+ S+ S ++S+ D RS+GR+++ R    FT D+  D  +++REWGARMTKASLVPPVTR
Sbjct: 1533 DNYSSASASESESDIRSEGRSQDLRIEKYFTADDSFDSVTEEREWGARMTKASLVPPVTR 1592

Query: 1724 KYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEELDMELPEVKDYKPRKQLG 1783
            KYEVI++Y IVADEE+V+RKMRVSLPEDY EKLNAQ+NG EELDMELPEVK+YKPRK LG
Sbjct: 1593 KYEVIEKYAIVADEEEVQRKMRVSLPEDYGEKLNAQRNGIEELDMELPEVKEYKPRKLLG 1652

Query: 1784 DQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNKQVRHFTGTGNTP 1843
            D+V EQEVYGIDPYTHNLLLDSMP ELDW                   QVR FTG+G+TP
Sbjct: 1653 DEVLEQEVYGIDPYTHNLLLDSMPGELDW-------------------QVRLFTGSGSTP 1693

Query: 1844 MMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGF 1903
            M++PL+PVIEE+++ A ++CD+RTMKMC+G+LK ++SR DDKYV+YRKGLGVVCNKEGGF
Sbjct: 1694 MVFPLRPVIEELKESAREECDIRTMKMCQGVLKEIESRSDDKYVSYRKGLGVVCNKEGGF 1753

Query: 1904 GEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPK------GDADG 1957
            GE+DFVVEFLGEVYPVWKWFEKQDGIRSLQ+N  DPAPEFYNIYLERPK      GDADG
Sbjct: 1754 GEEDFVVEFLGEVYPVWKWFEKQDGIRSLQENKTDPAPEFYNIYLERPKVWRKYDGDADG 1813

Query: 1958 YDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSV 2017
            YDLVVVDAMH ANYASRICHSCRPNCEAKVTAVDGHYQIGIY+VR I YGEEITFDYNSV
Sbjct: 1814 YDLVVVDAMHMANYASRICHSCRPNCEAKVTAVDGHYQIGIYSVRAIEYGEEITFDYNSV 1873

Query: 2018 TESKEEYEASVCLCG-------SQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEAC 2070
            TE        +C           QVCRGSYLNLTGEGAF+KVLK+ HGLL+RH+LMLEAC
Sbjct: 1874 TEVCSLLSLLLCSSTVGKYYFVGQVCRGSYLNLTGEGAFQKVLKDWHGLLERHRLMLEAC 1933

Query: 2071 ELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEK 2130
             LNSVSEEDYLELGRAGLGSCLLGGLP+W++AYSARLVRFIN ERTKLPEEIL+HNLEEK
Sbjct: 1934 VLNSVSEEDYLELGRAGLGSCLLGGLPDWMIAYSARLVRFINFERTKLPEEILKHNLEEK 1993

Query: 2131 RKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLS 2190
            RKYFSDI L+VEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMR VFGDPK APPP+ERL+
Sbjct: 1994 RKYFSDIHLDVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRHVFGDPKNAPPPLERLT 2053

Query: 2191 PEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLW 2250
            PEETVSF+W G+GSLV+EL+Q ++PH+EE  LN+L+SKI  HDPSGS D+ +EL++SLLW
Sbjct: 2054 PEETVSFVWNGDGSLVDELLQSLSPHLEEGPLNELRSKIHGHDPSGSADVLKELQRSLLW 2113

Query: 2251 LRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRV-------QEYKAFTSPPVYISPLDL 2303
            LRDE+R+LPCTYKCR+DAAADLIHIYAYTKCFF+V       QEY++F S PV+ISPLDL
Sbjct: 2114 LRDEIRDLPCTYKCRNDAAADLIHIYAYTKCFFKVRMGLDMLQEYQSFISSPVHISPLDL 2173

Query: 2304 GPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFY 2363
            G KYADKLG  ++ YRKTYGENYCLGQLI+W+ QTN DPD TL +A+RGCLSLPD+ SFY
Sbjct: 2174 GAKYADKLGESIKEYRKTYGENYCLGQLIYWYNQTNTDPDLTLVKATRGCLSLPDVASFY 2233

Query: 2364 AKVQKPSRHRVYGPKTVRFMLSRME 2388
            AK QKPS+HRVYGPKTV+ M+S+M+
Sbjct: 2234 AKAQKPSKHRVYGPKTVKTMVSQMQ 2258



 Score =  337 bits (863), Expect = 7e-89,   Method: Compositional matrix adjust.
 Identities = 261/686 (38%), Positives = 349/686 (50%), Gaps = 159/686 (23%)

Query: 1   MGDGGVACMPLQQQQQHNSIMERFPISDKTTICVGNSSNNSNKTNNNSISNNNDNKTNND 60
           M DGGVACMPL       +IME+ PI +KTT+C GN S                 KT   
Sbjct: 1   MSDGGVACMPLL------NIMEKLPIVEKTTLCGGNES-----------------KTAAT 37

Query: 61  SSNNNGSSSSKNNETNKSNVKKNGVSTKTVRKKIVK-IKKVIAVKKKEVQKNSGSS---- 115
           + N + S ++K  E+  +N K +  S    +K+IVK I+KV+  + K+ QK +       
Sbjct: 38  TENGHTSIATKVPESQPAN-KPSASSQPVKKKRIVKVIRKVVKRRPKQPQKQADEQLKDQ 96

Query: 116 ---------------------KSNNNGENIDNKNVENGGAVGEVVTVDKENLKNEEVEEG 154
                                KS   G     K VENGG  G            +EVEEG
Sbjct: 97  PPSQVVQLPAESQLQIKEQDKKSEFKGGTSGVKEVENGGDSG----------FKDEVEEG 146

Query: 155 ELGTLKW----ENGEFVQPEKSQPQSQLQSQSKQIEKGEII---------VFSSKCRRGE 201
           ELGTLK     ENGE + P KS        Q  +IEKGEI+           + K  +G 
Sbjct: 147 ELGTLKLHEDLENGE-ISPVKSL-------QKSEIEKGEIVGESWKKDEPTKADKNWKGG 198

Query: 202 TEKGESGLWRGNKDDIEKGEFIPDRWHK-EVVKDEYGYSKSRR----------YDYKLER 250
            E+ E   WR   D+IEKGEFIPDRW K +  KD++ Y +SRR          Y+Y+ ER
Sbjct: 199 KEEREFRSWRDPSDEIEKGEFIPDRWQKMDTGKDDHSYIRSRRNGVDREKTWKYEYEYER 258

Query: 251 TPPSGKYSGEDVYRRKEFDRSGSQHSKSSSRWESGQERNVRISSKIVDDEGLYKGEHNNG 310
           TPP G                                R  RISSKIV +E L+K E+NN 
Sbjct: 259 TPPGG--------------------------------RTTRISSKIVIEENLHKNEYNNS 286

Query: 311 KNHGREYFH-GNRFKRHGTDSDSGDRKY-YGDYGDFAGLKSRRLSDDYNSRSVHSEHYSR 368
            N  +EY   GNR KRHG + DS +RK+ Y DYGD+   K R+LSDD  SRS+HS+HYS+
Sbjct: 287 SNFVKEYSSTGNRLKRHGAEPDSIERKHSYADYGDYGSSKCRKLSDDC-SRSLHSDHYSQ 345

Query: 369 HSVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRVIYDRHGRSPSHSDRSPHDRGRYYDHR 428
           HS E+ +R+S  S+ SSL+KY  +H + S  ++   D+HG SPS SD SPHDR RY+++R
Sbjct: 346 HSAERLYRDSYPSKNSSLEKYPRKHQDASFPAKAFSDKHGHSPSRSDWSPHDRSRYHENR 405

Query: 429 DRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPYAREKSPYDRSRHYDHRNRSP 488
                  DRSPY R+RSPY F++S ++R+RSP +R                      RSP
Sbjct: 406 -------DRSPYARERSPYIFEKSSHARKRSPRDRRHH----------------DYRRSP 442

Query: 489 FSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNNHREASSKTGASEKRNARYDSKGHE 548
             +E SP DR+R  DR D  PN++E +   R+R N HRE S K+G  E+R+ +  ++  E
Sbjct: 443 SYSEWSPHDRSRPSDRRDYIPNFMEDTQSDRNRRNGHREISRKSGVRERRDCQTGTE-LE 501

Query: 549 DKLGPKDSNARCSRSSAKESQDKSNVQDLNVSDEKTANCESHKEEQPQSSSVDCKEPPQV 608
            K   K+SN + S SS+KE Q K+ + + ++  EK + C+S K   P ++    KEP QV
Sbjct: 502 IKHKYKESNGKESTSSSKELQGKNILYNNSLLVEKNSVCDSSKIPVPCATG---KEPVQV 558

Query: 609 DGPPLEELVSMEEDMDICDTPPHVPA 634
              P EEL SME        PP  P 
Sbjct: 559 GEAPTEELPSME-----LVNPPEAPG 579


>gi|357139674|ref|XP_003571404.1| PREDICTED: probable histone-lysine N-methyltransferase ATXR3-like
            [Brachypodium distachyon]
          Length = 2214

 Score = 2041 bits (5287), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1111/2110 (52%), Positives = 1403/2110 (66%), Gaps = 140/2110 (6%)

Query: 360  SVHSEHYSR---HSVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRVIYDRHGRSPSHSDR 416
            S H +H++     S  K  R   +   S+    S R+HE S   R  +DR  RSP    R
Sbjct: 213  SNHRKHHAETCDQSGSKSRRKGEAKSTSAGRHLSGRNHEISTPIRDRHDRLERSPGILGR 272

Query: 417  SPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPYAREKSPYD 476
             PHDR R+  H +RSPSR +RSP  R R    +D           NRDRSPY    SP  
Sbjct: 273  FPHDRVRHEKH-ERSPSRLERSPRDRGRH---YD-----------NRDRSPYI---SPRH 314

Query: 477  RSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNNHREASSKTGASE 536
            + R   HR+ +P   + SP+ R +  D  DRTP   +RSP  R R  +  EAS K+    
Sbjct: 315  KVRQPHHRDSTPSRIDNSPRGRIQHEDIRDRTPLRTDRSPSERGRTTDSHEASKKS---- 370

Query: 537  KRNARYDSKGHEDKLGPKDSNARCSRSSAKESQDKSNVQDLNVSDEKTANCESHKEEQPQ 596
             R A+ +SK  E        NA+    S K+S                        EQP 
Sbjct: 371  -RGAKLESKNLE--------NAQHKNKSMKQSL----------------------PEQPN 399

Query: 597  SSSVDCKEPPQVDGPPLEELVSMEEDMDICDTPPHV-----PAVTDSSV-GKWFYLDHCG 650
               V              E VSMEEDMDICDTPPH      P+V  S+V GKWFYLD  G
Sbjct: 400  DVVV--------------EDVSMEEDMDICDTPPHTSEAPKPSVEPSTVMGKWFYLDQFG 445

Query: 651  MECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSITSDSVTQL 710
            +E GPS+L DLK LV++G L+SDH IKH D NRW TVENA +PLV  +   + SD  TQL
Sbjct: 446  VEQGPSKLADLKKLVDDGYLLSDHLIKHADCNRWVTVENAATPLVPSDISLVYSDGTTQL 505

Query: 711  VSPPEASGNLLADTGDTAQSTGEEFPVTLQSQCCPDGSAAAAESSEDLHIDVRVGALLDG 770
            VSPPEA GNLL    D A+   EE      S        A+ E  EDL+ID RVGAL+ G
Sbjct: 506  VSPPEAPGNLL----DEAR---EEASALASSADNEQMEEASEEPKEDLYIDNRVGALMYG 558

Query: 771  FTVIPGKEIETLGEILQTTFERVDWQNNGGPTWHGACVGEQKPGDQKVDELYISDTKMKE 830
              ++ G E+E LG+ L T F RVD +    P        +    D     +  +D    +
Sbjct: 559  SVLVEGHELEILGDALATHFNRVDLERWDQPEDFPRFQAQPAREDVINGGIEFADNSATD 618

Query: 831  AAELKSGDKDHWVVCFDSDEWFSGRWSCKGGDWKRNDEAAQDRCSRKKQVLNDGFPLCQM 890
               +   ++D +    +S EWFSGRWSCKGGDWKRNDE +QD+  RKK VLN+G+ LCQM
Sbjct: 619  IYGVGPIERDTFYHNVESSEWFSGRWSCKGGDWKRNDEFSQDKPYRKKLVLNEGYALCQM 678

Query: 891  PKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYACPDERND-----GSGG---SRSTQSKL 942
            PK  +EDPRW+ KDDLYY   +++LDLP WA++  +E  D       GG    RS Q + 
Sbjct: 679  PKGSHEDPRWHCKDDLYYHVPAKKLDLPLWAFSSTEESTDTVDDTSKGGIMPGRSGQVR- 737

Query: 943  AAVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSSRSARSYSSANDVRRSSA 1002
             + +GVKG  LPVVRINA VV D  S   EP  K R  +R  SRS+RS+S   D R S+ 
Sbjct: 738  QSTKGVKGMTLPVVRINARVVKDQSSV--EPCIKPRGADRSLSRSSRSHSIGAD-RSSAH 794

Query: 1003 ESDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGEWYYLDGAGHERGPSSFS 1062
            E  S+SK  +  D Q   KS + +N P+D +CTV++L ++LG+WYYLDG  HE GP S+S
Sbjct: 795  EGLSYSKKHHEHDLQSFHKSKSVLNIPEDHVCTVEELSVKLGDWYYLDGTAHEHGPFSYS 854

Query: 1063 ELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRNHGEKIMPSGDSSGLPPTQS 1122
            ELQ LV +G I++ +SVFRK D  W+P+    +  +++          S  +S L  +  
Sbjct: 855  ELQKLVRRGTIRERSSVFRKIDNTWLPVVKDMKFDSASRNGGSGS---SNSTSALVHSDQ 911

Query: 1123 QDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNREFAAAINEVLDPWINAK 1182
             + V+     N  S +FH +HPQF+GYTRGKLHELVMK +K+RE   AINEVLDPWI AK
Sbjct: 912  SNVVV-----NHGSGSFHELHPQFVGYTRGKLHELVMKYFKSRELTLAINEVLDPWIAAK 966

Query: 1183 QPKKETEHVYRKSEGDTR--------AGKRARLLVRESDGDEETEEELQTI-QDESTFED 1233
            QPKKE E  Y  +   TR        + KRAR L   SD D +  E++ T  +D+  FED
Sbjct: 967  QPKKEIE-TYVANNSATRNLLPEDAGSAKRARFLPDRSDEDIDMYEDILTSHKDDCCFED 1025

Query: 1234 LCGDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRF 1293
            L  +A+       +S  ES  W LL+GH LA +FHFLR+DMKSL  ++ TCR W  A + 
Sbjct: 1026 LFQEAAL-----TNSIAESESWDLLNGHVLARIFHFLRADMKSLISSAATCRRWNTAAKC 1080

Query: 1294 YKGISRQVDLSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGMLEEILQSFPHL 1353
            Y+   R VDLSSVGP CTDS+ R  +  ++K+ + +++LVGC++++   LE++L   PH+
Sbjct: 1081 YRNTCRFVDLSSVGPRCTDSVFRGIMAGYEKQNIKTLVLVGCSSLSPLALEKVLVQLPHI 1140

Query: 1354 SSIDIRGCGQFGELALKFPNINWVKSQKSRGAKFNDSRSKIRSLKQITEKSSSAPK-SKG 1412
            S + I+GC Q  ++  +F +I W+ S  +      +S  KI+SLKQI + S    K ++ 
Sbjct: 1141 SYVHIQGCSQLEDMKSRFQHIKWITSSLNP----EESLQKIKSLKQIDDGSGHPSKVARN 1196

Query: 1413 LGDDMDDFGDLKDYFESVDKRDSANQSFRRSLYQRSKVFDARKSSSILSRDARMRRWSIK 1472
            +   +    +L  YF  +  R++AN SF +  Y+RSK  DARKSS++LS+DA++RR   +
Sbjct: 1197 MTSQLGGSDELDGYFADISNRENANLSFGQGFYKRSKWLDARKSSAVLSKDAQLRRLMQR 1256

Query: 1473 KSENGYKRMEEFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGLGSVKDDISRM 1532
             +EN Y++MEEF+ S L+EIM+ + F+FF PKV +IE R++ GYY  HG  S+KDDI  M
Sbjct: 1257 NAENSYRKMEEFVISRLREIMKSSRFDFFDPKVEKIEARLRSGYYARHGFSSLKDDIRSM 1316

Query: 1533 CRDAIKAKNRGSAGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKDESPAGLYSAT 1592
            CRDA+++K R    DM +I   FIQLA RL  G      ER   +   KD S    Y++ 
Sbjct: 1317 CRDALRSKGRSE--DMKQIVVSFIQLAKRL--GNPRVISERNGAVIQ-KDNSDMVQYTSD 1371

Query: 1593 SKYKKKLSKMVSERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRKSLDSGSETS 1652
            +K KKK +K   ER+  N +  T+ A    D    A DREI++ LSKL ++ +DSGSETS
Sbjct: 1372 TKLKKKQNKTTGERRGANWTAATAGA----DTSSRAFDREIKRSLSKLKKRDVDSGSETS 1427

Query: 1653 DDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLDFSDDREWGARM 1712
            DD DG SE  +++SE+TVSDT+SD+D  S   A + +G G    + G   +DDR WGARM
Sbjct: 1428 DDDDGYSEGDETESETTVSDTESDLDLNS--VAWDLKGNGMKLFESGDSVTDDRGWGARM 1485

Query: 1713 TKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEELDMELPE 1772
            TKASLVPPVTRKYEVI++Y+IVADEE+V+RKMRV+LP+DY+EKL +QKNG+E L  E+PE
Sbjct: 1486 TKASLVPPVTRKYEVIEKYLIVADEEEVQRKMRVALPDDYSEKLLSQKNGTENL--EIPE 1543

Query: 1773 VKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNKQ 1832
            VK+Y+ RK  GD++ EQEVYGIDP+THNLL D MP +L W+  ++H FIE++LL TLNKQ
Sbjct: 1544 VKEYQRRKVPGDEILEQEVYGIDPFTHNLLRDIMPADLGWSAADQHTFIEELLLNTLNKQ 1603

Query: 1833 VRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRP--DDK-YVAY 1889
            V+ FTG+GNTPM+Y L+PVIEEI+K A +  D RT+KMC G+LKAM SRP  D K YVAY
Sbjct: 1604 VKDFTGSGNTPMVYHLKPVIEEIQKSAEESGDRRTVKMCLGMLKAMRSRPGPDHKHYVAY 1663

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            RKGLGVVCNK+GGFG DDFV+EF GEVYP W+W+EKQDGI+ +Q N+ED APEFYNI LE
Sbjct: 1664 RKGLGVVCNKKGGFGVDDFVIEFFGEVYPSWRWYEKQDGIKHIQNNSEDQAPEFYNIMLE 1723

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
            RPKGD DGYDLV VDAMHKANYASRICHSC PNCEAKVTAVDG YQIG+YTVR I  GEE
Sbjct: 1724 RPKGDRDGYDLVFVDAMHKANYASRICHSCNPNCEAKVTAVDGQYQIGVYTVRPIAEGEE 1783

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEA 2069
            ITFDYNSVTESKEE+EASVCLCGSQVCRGSYLN +GEGAFEKVL E HG+LDRH L+L+A
Sbjct: 1784 ITFDYNSVTESKEEHEASVCLCGSQVCRGSYLNFSGEGAFEKVLMEFHGVLDRHSLLLQA 1843

Query: 2070 CELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEE 2129
            CE NSVS++D ++LGRAGLG+CLL GLP W+VAY+A LVRFI  ER KLP EI +HN++E
Sbjct: 1844 CEANSVSQQDLIDLGRAGLGTCLLAGLPGWLVAYTAHLVRFIFFERQKLPNEIFKHNVDE 1903

Query: 2130 KRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERL 2189
            KR++F+DI ++ E++DAEVQAEGV N RLQNL  TLDKVRYVMRCVFGDPK APPP+ RL
Sbjct: 1904 KRQFFTDINMDSERNDAEVQAEGVLNSRLQNLTHTLDKVRYVMRCVFGDPKNAPPPLVRL 1963

Query: 2190 SPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLL 2249
            +    VS +WKGEGSLVEEL+Q M PHVEEDVL DLK KI+ HDPS SEDI+ ++R SLL
Sbjct: 1964 TGRSLVSAIWKGEGSLVEELLQSMEPHVEEDVLADLKDKIRDHDPSDSEDIEGDIRNSLL 2023

Query: 2250 WLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYAD 2309
            WLRDE+R+L CTYKCRHDAAADLIH+YAYTKCFFR ++YK   SPPV+ISPLDLGPKYAD
Sbjct: 2024 WLRDELRSLSCTYKCRHDAAADLIHMYAYTKCFFRARDYKTVKSPPVHISPLDLGPKYAD 2083

Query: 2310 KLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKP 2369
            KLG   Q YRKTY ENYCL QLI+W+ Q NA+P+  L RA +GC+SLPD+ SFY    K 
Sbjct: 2084 KLGPGFQEYRKTYPENYCLAQLIYWYSQ-NAEPESRLTRARKGCMSLPDVSSFYVTSVKQ 2142

Query: 2370 SRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSLTGCPLDREMV 2429
            ++ RVYG +TVRFML+RMEKQ QR WPKDRIW FK+ PR FG+PM+D+ L    LD+EMV
Sbjct: 2143 TQERVYGTRTVRFMLTRMEKQAQRQWPKDRIWVFKNHPRFFGTPMMDAVLNNSSLDKEMV 2202

Query: 2430 HWLKHRPAIF 2439
            HWLK R  +F
Sbjct: 2203 HWLKTRSNVF 2212


>gi|413921170|gb|AFW61102.1| hypothetical protein ZEAMMB73_524379 [Zea mays]
          Length = 2278

 Score = 1967 bits (5097), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1081/2140 (50%), Positives = 1384/2140 (64%), Gaps = 143/2140 (6%)

Query: 362  HSEHYSRHSVEKFHRNS--SSSRISSLDKYSSRHHEPSLSSRVIYDRHGRSPSHSDRSPH 419
            H    S  S  K HR       + S+    S R+ E S  +R   DRH RSP    R PH
Sbjct: 218  HHTDTSDQSGSKSHRKGPKGEGKRSAARHLSGRNREISSPTRDRRDRHERSPGILGRFPH 277

Query: 420  DRGRYYDHRDRSPSRHDRSPY--------TRDRSPYTFDRSPYSRERSPYNRDRSPYARE 471
            +R R+ D  DRSPSR +RSP+        +RD SPY    SP  R R P+ RD +P   +
Sbjct: 278  ERSRH-DRYDRSPSRLERSPHRERARHYESRDHSPYV---SPRHRARQPHFRDNTPSRVD 333

Query: 472  KSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNNHREAS-- 529
              P  R +  D R+RSPF            HDRS          P  R RP +  EAS  
Sbjct: 334  NFPRGRVQREDVRDRSPF-----------LHDRS----------PSERFRPTDTHEASKK 372

Query: 530  SKTGASEKRNARYDSKGHEDKL---GPKDSNARCSRSSAKESQDKSNVQDLNVSDEKTAN 586
            S++G + +++        +      G  + N + S     ES   + +            
Sbjct: 373  SRSGNNSEKSQHKSKSAKQSSKTKSGSNEKNEKISNEKPTESSKYTELPPPPPLPLPPPP 432

Query: 587  CESHKEEQPQSSSVDCKEPPQVDGPPLEELVSMEEDMDICDTPPHVPAVTDSS-----VG 641
                       +      PP    P       M EDMDICDTPPH  A  + +     +G
Sbjct: 433  PPPPPPPPLPPAVPPPLPPPPEPEPTGVLAEDMIEDMDICDTPPHTSAAPEPTDPIYDIG 492

Query: 642  KWFYLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPS 701
            +WFYLDH G+E GPS+L  LK LVE+G L+SDH IKH DSNRW TVENA SPLV  +FPS
Sbjct: 493  RWFYLDHFGIEQGPSKLAVLKKLVEDGYLLSDHLIKHADSNRWVTVENAASPLVPSDFPS 552

Query: 702  ITSDSVTQLVSPPEASGNLLADTGDTAQ--STGEEFPVTLQSQCCPDGSAAAAESSEDLH 759
              SD+ TQ+V+PPEA GNLL +  + A   ++G E    ++         A+AE SE+ +
Sbjct: 553  FYSDTSTQMVNPPEAPGNLLDEALEEASNLASGSEDKQMVE---------ASAEDSEEFY 603

Query: 760  IDVRVGALLDGFTVIPGKEIETLGEILQTTFERVDWQNNGGPT----WHGACVGEQKPGD 815
            I+ RV AL+DG  ++ G+E+E +GE+L   F+  DWQ    P     +H    G+    D
Sbjct: 604  INDRVEALMDGSILVHGQELEIIGELLGADFQPADWQRLSHPEDFTRFHVHIEGD----D 659

Query: 816  QKVDELYISDTKMKEAAELKSGDKDHWVVCFDSDEWFSGRWSCKGGDWKRNDEAAQDRCS 875
            + +      + +  +A  L S D  H  V  +S EWFSGRWSCKGGDW+RNDE  QD   
Sbjct: 660  EIIGGTEFLENRTTDAYGLVSVDNFHHYV--ESSEWFSGRWSCKGGDWRRNDELGQDTPF 717

Query: 876  RKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYACPDERNDGSGGS 935
            RKK VLN+G+PLCQ+PK  YEDPR   KD+LYYP   ++ DLP WA++  +E  DG   +
Sbjct: 718  RKKLVLNEGYPLCQIPKGSYEDPRRPCKDELYYPVRGKKHDLPLWAFSSTEEDIDGVNDT 777

Query: 936  RSTQSKLAAVRGVKG----------TMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSS 985
                +K   V G  G           ML VV IN  V+ D  S   EPR+K R  +R  S
Sbjct: 778  ----TKNTVVPGRPGQTRQPPSEVKVMLQVVSINYHVIKDQSSV--EPRTKPRGTDRPPS 831

Query: 986  RSARSYSSANDVRRSSAESDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGE 1045
            RS+RS+S   + R S  +  SH +  ++ DSQ   KS +  N PKD +CTVD+L +  G+
Sbjct: 832  RSSRSHSIGAE-RSSIHDGSSHFRKHHDHDSQSFHKSKSVPNIPKDHVCTVDELSVNRGD 890

Query: 1046 WYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRNHG 1105
            WYYLDG GH++GP S+SELQ LV +  I + +SVFRK D  W P+    +  +S      
Sbjct: 891  WYYLDGTGHDQGPFSYSELQELVKKDTIIEQSSVFRKIDNTWFPVLKDLKPGSSVPSAAP 950

Query: 1106 EKIMPSGDSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNR 1165
               + +  +    P Q    V      N  S++FH +HPQF GYTRGKLHELVMK +K+R
Sbjct: 951  SSNLIAAFTH---PDQYNFGV------NQGSSSFHELHPQFAGYTRGKLHELVMKYFKSR 1001

Query: 1166 EFAAAINEVLDPWINAKQPKKETEHVYRKSEG-------DTRAGKRARLLVRESDGDEET 1218
            E   AINEVLDPWI+AKQPKKE E  +  +         D  + KRARLL  +SD +   
Sbjct: 1002 ELTLAINEVLDPWISAKQPKKEFEAYFSHNSASRNFLPEDGGSAKRARLLPDQSDENIHL 1061

Query: 1219 EEELQTIQDEST-FEDLCGDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSL 1277
             E++   + E   FE+LC  AS    +S +    +  WGLL+ H LA +FHF+R+D+KSL
Sbjct: 1062 SEDIIASRKEDICFEELCDGASSVDNDSVNPRAGNASWGLLNDHLLARIFHFMRADLKSL 1121

Query: 1278 AFASLTCRHWRAAVRFYKGISRQVDLSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCTN 1337
              ++ TC+ W AA ++Y+ + R +DLSSVGP CTDS+    +  F+K+ + +++L GC+N
Sbjct: 1122 ISSAATCKSWNAAAKYYRNMCRFIDLSSVGPLCTDSVFCDIMAGFEKQNIRTLILAGCSN 1181

Query: 1338 ITSGMLEEILQSFPHLSSIDIRGCGQFGELALKFPNINWVKSQKSRGAKFNDSRSKIRSL 1397
            ++S  L  +L+  P +S + I+GC   G+L  KF ++ W++S  +    +     K+++L
Sbjct: 1182 LSSHALGRVLEHLPQISYVHIQGCSHLGDLKNKFQHVKWIRSSLNPEGSYR----KMKTL 1237

Query: 1398 KQITEKSSSAPKSKGLGDDMDDFGDLKDYFESVDKRDSANQ-SFRRSLYQRSKVFDARKS 1456
            KQI + ++ A K     D +DD  +L  YF  + K + A+  SF +  Y+RSK+ DARKS
Sbjct: 1238 KQIGDGNNYASKVARNFDQLDDSDELDGYFADISKIEGASLFSFGQGFYKRSKLLDARKS 1297

Query: 1457 SSILSRDARMRRWSIKKSENGYKRMEEFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGY 1516
            S++LSRDA MRR   +++EN Y++MEEF+ + L+EIMR N F+FF+PKV++IEGR+K GY
Sbjct: 1298 SAVLSRDAEMRRLMQRQAENSYRKMEEFVINRLREIMRCNRFDFFIPKVSKIEGRLKNGY 1357

Query: 1517 YISHGLGSVKDDISRMCRDAIKAKNRGSAGDMNRITTLFIQLATRLEQGAKSSYYEREEM 1576
            Y  HG  ++K DI  MC+DA++ K+     D+ +I   FIQLA RL     +  Y  E  
Sbjct: 1358 YARHGFRTIKHDIRTMCQDALRYKDGNDLDDVKQIVVSFIQLAKRL----GNPRYISERN 1413

Query: 1577 MKSWKDESPAGLYSATSKYKKKLSKMVSERKYMNRSNGTSLANGDFDYGEYASDREIRKR 1636
              + +D      YS  +K KKK +K         R+N   L     D    A D EI++ 
Sbjct: 1414 GAAAQDSLDISQYSFDTKLKKKQNK--------TRAN---LVAAGADNSSRAFDLEIKRS 1462

Query: 1637 LSKLNRKSLDSGSETSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAGDFTT 1696
            LSKL +K + SGSETSDD DG SE  +++SE+TVSDT+SD D  S   A + +G     T
Sbjct: 1463 LSKLKKKDVCSGSETSDD-DGYSEGDETESETTVSDTESDFDVNSG--AWDLKGNCLKLT 1519

Query: 1697 DEGLDFSDDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKL 1756
            + G    DDR  GARMTKASLVPPVTRKYEVI++Y+IVAD E+V+RKMRVSLP+DY+EKL
Sbjct: 1520 EHGESVIDDRILGARMTKASLVPPVTRKYEVIEEYLIVADVEEVQRKMRVSLPDDYSEKL 1579

Query: 1757 NAQKNGSEELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLE 1816
             +QKNG+E L  ELPEVKDY+PRK  GD++ EQEVYGIDPYTHNLL D MP +L+ +  +
Sbjct: 1580 LSQKNGTENL--ELPEVKDYQPRKVAGDEILEQEVYGIDPYTHNLLSDIMPSDLELSPTD 1637

Query: 1817 KHLFIEDVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILK 1876
            KH+FIE++LL  LNKQVRHFTG GNTPM Y ++PVIEEI++ A D  D RT+KMC G+LK
Sbjct: 1638 KHIFIEELLLNALNKQVRHFTGLGNTPMTYNIRPVIEEIQRSAEDSGDRRTLKMCLGMLK 1697

Query: 1877 AMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNN 1936
            +M +R D  +VAYRKGLGVVCNK+GGFG DDFVVEF GEVYP W+W+EKQDGI+ +Q N+
Sbjct: 1698 SMRNRSDQNFVAYRKGLGVVCNKKGGFGVDDFVVEFFGEVYPSWRWYEKQDGIKHIQNNS 1757

Query: 1937 EDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAK---------- 1986
            ED APEFYNI LERPKGD  GYDLV VDAMHKANYASRICHSC PNCEAK          
Sbjct: 1758 EDQAPEFYNIMLERPKGDRHGYDLVFVDAMHKANYASRICHSCNPNCEAKKKRIYTYDYA 1817

Query: 1987 -------VTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGS 2039
                   VTAVDG YQIG+YT+R I  GEEITFDYNSVTESKEE+EASVCLCGSQVCRGS
Sbjct: 1818 KADMRAIVTAVDGKYQIGVYTLRPIAEGEEITFDYNSVTESKEEHEASVCLCGSQVCRGS 1877

Query: 2040 YLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNW 2099
            YLN +GEGAFEKVL E HG+LDRH L+L+ACE +SVS++D ++LGRAGLG+CLL GLP W
Sbjct: 1878 YLNFSGEGAFEKVLMEFHGVLDRHSLLLQACETDSVSQQDLIDLGRAGLGTCLLAGLPVW 1937

Query: 2100 VVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQ 2159
            +VAY+A LVRFI LER KLP+EILRHN++EKR++  +I ++ EK+DAEVQAEGV N RLQ
Sbjct: 1938 LVAYTAHLVRFIYLERQKLPDEILRHNVDEKRQFLIEINMDSEKNDAEVQAEGVLNSRLQ 1997

Query: 2160 NLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEE 2219
             +  TLDKVRYVMRC+FGDPK APPP+ RLS +  VS +WKG+ S+V EL+Q M PHVEE
Sbjct: 1998 QIVHTLDKVRYVMRCIFGDPKNAPPPMVRLSGKSLVSAIWKGDSSIVAELLQSMEPHVEE 2057

Query: 2220 DVLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYT 2279
            +VL+DLK+KI AHDPS SEDI+  +R SLLWLRDE+R LPCTYKCRHDAAADLIH+YAYT
Sbjct: 2058 EVLSDLKAKICAHDPSDSEDIEGGIRNSLLWLRDELRTLPCTYKCRHDAAADLIHLYAYT 2117

Query: 2280 KCFFRVQEYKAFTSPPVYISPLDLGPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTN 2339
            KCFFRV++YK   SPPV+ISPLDLGPKYADKLG   Q Y KTY ENYCL QLI+W+ Q N
Sbjct: 2118 KCFFRVRDYKTVKSPPVHISPLDLGPKYADKLGPGFQEYCKTYPENYCLAQLIYWYSQ-N 2176

Query: 2340 ADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPWPKDR 2399
            ++P+  L RA +GC+SLPD+ SFY K  KP + RVYG +TVRFMLSRMEKQ QRPWPKDR
Sbjct: 2177 SEPESRLTRARKGCMSLPDVSSFYVKSLKPLQERVYGNRTVRFMLSRMEKQAQRPWPKDR 2236

Query: 2400 IWAFKSSPRIFGSPMLDSSLTGCPLDREMVHWLKHRPAIF 2439
            IW FKS PR FGSPM+D+ L   PLD+EMVHWLK RP +F
Sbjct: 2237 IWVFKSDPRYFGSPMMDAVLNNSPLDKEMVHWLKTRPNVF 2276


>gi|218200574|gb|EEC83001.1| hypothetical protein OsI_28046 [Oryza sativa Indica Group]
          Length = 2000

 Score = 1368 bits (3540), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 657/994 (66%), Positives = 783/994 (78%), Gaps = 19/994 (1%)

Query: 1446 QRSKVFDARKSSSILSRDARMRRWSIKKSENGYKRMEEFLASSLKEIMRVNTFEFFVPKV 1505
            +RSK  D RKSS++LSRDA+MRR   +K+EN Y++MEEF+ + LKEIM+ + F+FFVPKV
Sbjct: 1024 ERSKWLDIRKSSAVLSRDAQMRRLMQRKAENSYRKMEEFVINKLKEIMKSSRFDFFVPKV 1083

Query: 1506 AEIEGRMKKGYYISHGLGSVKDDISRMCRDAIKAKNRGSAGDMNRITTLFIQLATRLEQG 1565
            A+IE R+K GYY  HG   +K+DI  MCRDA++ K R   GDM +I   FIQLA +LE  
Sbjct: 1084 AKIEVRLKNGYYARHGFSYIKNDIRSMCRDALRYKGRSDLGDMKQIVVAFIQLAKKLENP 1143

Query: 1566 AKSSYYEREEMMKSWKDESPAGLYSATSKYKKKLSKMVSERKYMNRSNGTSLANGDFDYG 1625
               S  +   + K   D S    YS+  K KKK SK +SER+      G +      D  
Sbjct: 1144 RLISDRDGTAVQK---DSSDMSQYSSDLKLKKKQSKTMSERR------GANWTTAGADPS 1194

Query: 1626 EYASDREIRKRLSKLNRKSLDSGSETSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRA 1685
              A DREI++ LSKL ++ +DSGSETSDD DG SE  +++SE+TVSDT+SD+D  S    
Sbjct: 1195 SRAFDREIKRSLSKLKKRDIDSGSETSDDDDGYSEGDETESETTVSDTESDLDVNSGAWD 1254

Query: 1686 RESRGAGDFTTDEGLDFSDDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMR 1745
             +  G   F + E L  +DDR WGARMTKASLVPPVTRKYEVI++Y+IVADEE+V RKMR
Sbjct: 1255 LKGNGMKLFESSESL--TDDRGWGARMTKASLVPPVTRKYEVIEKYLIVADEEEVLRKMR 1312

Query: 1746 VSLPEDYAEKLNAQKNGSEELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDS 1805
            V+LP+DY+EKL +QKNG+E L  ELPEVKDY+PRK  GD+V EQEVYGIDPYTHNLLL+ 
Sbjct: 1313 VALPDDYSEKLLSQKNGTENL--ELPEVKDYQPRKVPGDEVLEQEVYGIDPYTHNLLLEM 1370

Query: 1806 MPDELDWNLLEKHLFIEDVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDV 1865
            MP ELDW   +KH F+E++LL TLNKQVR FTG+GNTPM+YPL+PVIEEI+K A +  D 
Sbjct: 1371 MPTELDWPSSDKHTFVEELLLNTLNKQVRQFTGSGNTPMVYPLKPVIEEIQKSAEESGDR 1430

Query: 1866 RTMKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEK 1925
            RT KMC G+LKAM + P+     Y  GLGVVCNK GGFG DDFV+EF GEVYP W+W+EK
Sbjct: 1431 RTSKMCLGMLKAMRNHPE-----YNYGLGVVCNKTGGFGVDDFVIEFFGEVYPSWRWYEK 1485

Query: 1926 QDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEA 1985
            QDGI+ +Q N++D APEFYNI LERPKGD DGYDLV VDAMHKANYASRICHSC PNCEA
Sbjct: 1486 QDGIKHIQNNSDDQAPEFYNIMLERPKGDRDGYDLVFVDAMHKANYASRICHSCNPNCEA 1545

Query: 1986 KVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTG 2045
            KVTAVDGHYQIGIYTVR I  GEEITFDYNSVTESKEE+EASVCLCGSQ+CRGSYLN +G
Sbjct: 1546 KVTAVDGHYQIGIYTVRPIAEGEEITFDYNSVTESKEEHEASVCLCGSQICRGSYLNFSG 1605

Query: 2046 EGAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSA 2105
            EGAFEKVL E HG+LDRH L+L+ACE NSVS++D ++LGRAGLG+CLL GLP W+VAY+A
Sbjct: 1606 EGAFEKVLMEFHGVLDRHSLLLQACEANSVSQQDLIDLGRAGLGTCLLAGLPGWLVAYTA 1665

Query: 2106 RLVRFINLERTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTL 2165
             LVRFI  ER KLP EI +HN++EKR++F+DI ++ EK+DAEVQAEGV N RLQNL  TL
Sbjct: 1666 HLVRFIFFERQKLPHEIFKHNVDEKRQFFTDINMDSEKNDAEVQAEGVLNSRLQNLTHTL 1725

Query: 2166 DKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDL 2225
            DKVRYVMRC+FGDPK APPP+ RL+    VS +WKGEGSLV+EL++ M PHVEEDVL DL
Sbjct: 1726 DKVRYVMRCIFGDPKNAPPPLVRLTGRSLVSAIWKGEGSLVDELLESMEPHVEEDVLTDL 1785

Query: 2226 KSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRV 2285
            K+KI+AHDPSGSEDI+ E+R SLLWLRDE+R L CTYKCRHDAAADLIH+YAYTKCFFRV
Sbjct: 1786 KAKIRAHDPSGSEDIEGEIRSSLLWLRDELRTLSCTYKCRHDAAADLIHMYAYTKCFFRV 1845

Query: 2286 QEYKAFTSPPVYISPLDLGPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCT 2345
            ++YK   SPPV ISPLDLGPKYADKLG   Q Y KTY ENYCLGQLI+W+ Q NA+P+  
Sbjct: 1846 RDYKTVKSPPVLISPLDLGPKYADKLGPGFQEYCKTYPENYCLGQLIYWYSQ-NAEPESR 1904

Query: 2346 LARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKS 2405
            L RA +GC+SLPD+ SFY K  KP++ RVYG +TVRFML+RME Q QRPWPKDRIW FKS
Sbjct: 1905 LTRARKGCMSLPDVSSFYVKSVKPTQERVYGSRTVRFMLARMENQAQRPWPKDRIWVFKS 1964

Query: 2406 SPRIFGSPMLDSSLTGCPLDREMVHWLKHRPAIF 2439
             PR FG+PM+D+ L   PLD+EM+HWLK R  +F
Sbjct: 1965 DPRFFGTPMMDAVLNNSPLDKEMMHWLKTRSNVF 1998



 Score =  610 bits (1573), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 349/741 (47%), Positives = 456/741 (61%), Gaps = 62/741 (8%)

Query: 609  DGPPLEELVSMEEDMDICDTPPHV----PAVTD---SSVGKWFYLDHCGMECGPSRLCDL 661
            +G P E+ VSMEEDMDICDTPPH     P  T+   S VGKWFYLDH G+E GPS+L DL
Sbjct: 326  NGAPAED-VSMEEDMDICDTPPHTTSSAPEPTEPPASDVGKWFYLDHYGIEQGPSKLADL 384

Query: 662  KTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSITSDSVTQLVSPPEASGNLL 721
            K LVE+G L+SDH IKH DSNRW TVENA SPLV   FPS+ SD  TQLVSPPEA GNLL
Sbjct: 385  KKLVEDGYLLSDHLIKHADSNRWVTVENAASPLVPSEFPSVYSDVSTQLVSPPEAPGNLL 444

Query: 722  ADTGDTAQSTGEEFPVTLQSQCCPDGSAAAAESSEDLHIDVRVGALLDGFTVIPGKEIET 781
             +  + A  T  E               A+AE  ED +ID RV AL+DG  ++ G+E+E 
Sbjct: 445  DEAREEASGTDHE-----------QMKEASAEEQEDFYIDDRVDALMDGSIMVDGQELEI 493

Query: 782  LGEILQTTFERVDWQNNGGPTWHGACVGEQKPGDQKVDELYISDTKMKEAAELKSGDKDH 841
            LGE+L   FE V+W++     +      E+  G ++  E    D++      +   ++D 
Sbjct: 494  LGELLNAHFEPVNWESEDLSRFQVKL--ERDDGTKRSTEF--PDSRTAHIYGVVPAERDT 549

Query: 842  WVVCFDSDEWFSGRWSCKGGDWKRNDEAAQDRCSRKKQVLNDGFPLCQMPKSGYEDPRWN 901
            +    +S EW+SGRWSCKGGDWKRND+ +QD+  RKK VLN+G+PLCQMPK  +EDPRW 
Sbjct: 550  YQPHIESSEWYSGRWSCKGGDWKRNDDFSQDKPYRKKLVLNEGYPLCQMPKGNHEDPRWV 609

Query: 902  QKDDLYYPSHSRRLDLPPWAYACPDERND--------GSGGSRSTQSKLAAVRGVKGTML 953
             KDDLYYP  +++LDLP WA++  +E +D        G    RS Q+K    +GVKGT L
Sbjct: 610  CKDDLYYPLRAKKLDLPLWAFSSTEENDDTVDDASKSGVIPGRSGQTKQPP-KGVKGTTL 668

Query: 954  PVVRINACVVNDHGSFVSEPRSKVRAKERHSSRSARSYSSANDVRRSSAESDSHSKARNN 1013
            PVV+INA VV D  S  SE R K +  +R  SRS+RS+S   D R S+ E  SHSK  + 
Sbjct: 669  PVVKINARVVKDQSS--SEHRIKPKVADRPPSRSSRSHSIGTD-RSSTHEGSSHSKKHHE 725

Query: 1014 QDSQGSWKSIACINTPKDRLCTVDDLQLQLGEWYYLDGAGHERGPSSFSELQVLVDQGCI 1073
             DSQ   KS +  N PKD +CTV++L +++G+WYYLDG GHER P S+SELQ L  +G I
Sbjct: 726  HDSQSLHKSKSVPNIPKDHVCTVEELSVKVGDWYYLDGTGHERVPFSYSELQELAKKGTI 785

Query: 1074 QKHTSVFRKFDKVWVPLTFATETSASTVRNHGEKIMPSGDSSGLPPTQSQDAVLGESNNN 1133
             + +SVFRK D  W+P+                K + SG S+      S  + L  SN  
Sbjct: 786  LEGSSVFRKIDNTWLPVL---------------KDLKSGCSARNGEAGSSTSALTHSNQ- 829

Query: 1134 VNSNAFHTMHPQFIGYTRGKLHELVMKSYKNREFAAAINEVLDPWINAKQPKKETEHVYR 1193
               + FH MHPQF+GYTRGKLHELVMK +K+RE   AINEVL+PWI  KQP+KE E  + 
Sbjct: 830  ---SNFHEMHPQFVGYTRGKLHELVMKYFKSRELTLAINEVLEPWIATKQPRKELETFFS 886

Query: 1194 KSEG-------DTRAGKRARLLVRESDG-DEETEEELQTIQDESTFEDLCGDASFPGEES 1245
             S         D  + KRARLL  +SD   + +E+ L + +D+  FEDL   A+   E  
Sbjct: 887  HSSASKNFVQEDGGSTKRARLLPDQSDEYTDMSEDILASQKDDCCFEDLFEGAAHVKESP 946

Query: 1246 ASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRFYKGISRQVDLSS 1305
             +S  ES  WGLL+ H LA +FHFLR+D+KSL  ++ TC  W  A ++Y+ + R +DLSS
Sbjct: 947  LNSRTESESWGLLNEHVLARIFHFLRADVKSLISSAATCSWWNTAAKYYRSVCRFIDLSS 1006

Query: 1306 VGPNCTDSLIRKTLNAFDKEK 1326
            +GP CTD++    + + ++ K
Sbjct: 1007 LGPQCTDNVFHDIMGSIERSK 1027


>gi|242078371|ref|XP_002443954.1| hypothetical protein SORBIDRAFT_07g005020 [Sorghum bicolor]
 gi|241940304|gb|EES13449.1| hypothetical protein SORBIDRAFT_07g005020 [Sorghum bicolor]
          Length = 2166

 Score = 1021 bits (2639), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 579/1247 (46%), Positives = 791/1247 (63%), Gaps = 86/1247 (6%)

Query: 619  MEEDMDICDTPPHVPAVTDSS-----VGKWFYLDHCGMECGPSRLCDLKTLVEEGVLVSD 673
            M EDMDICDTPPH     + +     +G+WFYLDH G+E GPS+L +LK LVE+G L+SD
Sbjct: 433  MIEDMDICDTPPHTSGAPEPTEPICDIGRWFYLDHFGIEQGPSKLAELKKLVEDGYLLSD 492

Query: 674  HFIKHLDSNRWETVENAVSPLVTVNFPSITSDSVTQLVSPPEASGNLLADTGDTAQ--ST 731
            H IKH DSNRW TVENA SPLV  +FPS+ SD+ TQ+V+PPEA GNLL +  + A   ++
Sbjct: 493  HLIKHADSNRWVTVENAASPLVPSDFPSLYSDTSTQMVNPPEAPGNLLDEALEEASNLAS 552

Query: 732  GEEFPVTLQSQCCPDGSAAAAESSEDLHIDVRVGALLDGFTVIPGKEIETLGEILQTTFE 791
            G E       Q       A+AE SE+ +ID RV AL+DG  ++ G+E+E +GE+L   F+
Sbjct: 553  GAE-----DKQM----DEASAEDSEEFYIDDRVEALMDGSILVHGQELEIIGELLGADFQ 603

Query: 792  RVDWQNNGGPT----WHGACVGEQKPGDQKVDELYISDTKMKEAAELKSGDKDHWVVCFD 847
              DWQ+   P     +H    G+   G     E    + +  +A  L S +K+++    +
Sbjct: 604  PADWQSWSHPEDFTRFHVHTEGDD--GINGGTEFL--ENRATDAYGLVSVEKNNFHHYVE 659

Query: 848  SDEWFSGRWSCKGGDWKRNDEAAQDRCSRKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLY 907
            S EWFSGRWSCKGGDW RNDE +QD   RKK VLN+G+PLCQMPK  YEDPR   KD+LY
Sbjct: 660  SSEWFSGRWSCKGGDWMRNDELSQDTPFRKKLVLNEGYPLCQMPKGSYEDPRRPCKDELY 719

Query: 908  YPSHSRRLDLPPWAYACPDERND--------GSGGSRSTQSKLAAVRGVKGTMLPVVRIN 959
            YP  +++ DLP WA++  +E  D        G    R  Q++    RGVKG MLPVVRIN
Sbjct: 720  YPVRAKKHDLPLWAFSSTEEDTDSVNDTTKSGVVPGRPGQTRQPP-RGVKGMMLPVVRIN 778

Query: 960  ACVVNDHGSFVSEPRSKVRAKERHSSRSARSYSSANDVRRSSAESDSHSKARNNQDSQGS 1019
            + VV D  S   EPR+K R  +R  SRS+RS+S      RSS    S  +  ++ DSQ  
Sbjct: 779  SRVVKDQSSV--EPRTKPRGTDRPLSRSSRSHSIG--AERSSVHEGSTHRKHHDHDSQSL 834

Query: 1020 WKSIACINTPKDRLCTVDDLQLQLGEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSV 1079
             KS +  N PKDR+CTVD+L +  G+WYYLDG GHE GP S+SELQ LV +G I + +SV
Sbjct: 835  HKSKSVPNIPKDRVCTVDELSVNRGDWYYLDGTGHEHGPFSYSELQELVKKGTIIEQSSV 894

Query: 1080 FRKFDKVWVPLTFATETSASTVRNHGEKIMPSGDSSGLPPTQSQDAVL--GESNNNVN-- 1135
            FRK D  W P+    +  +S         +PS   S    + S  A++   + N  VN  
Sbjct: 895  FRKIDNTWFPVLKDLKPGSS---------VPSAARS----SNSTAALMHPDQYNFGVNQG 941

Query: 1136 SNAFHTMHPQFIGYTRGKLHELVMKSYKNREFAAAINEVLDPWINAKQPKKETEHVYRKS 1195
            S +FH +HPQF+GYTRGKLHELVMK +K+RE   AINEVLDPWI+AKQPKKE E  +  +
Sbjct: 942  SGSFHELHPQFVGYTRGKLHELVMKYFKSRELTLAINEVLDPWISAKQPKKEFEAYFSHN 1001

Query: 1196 EG-------DTRAGKRARLLVRESDGDEETEEE-LQTIQDESTFEDLC-GDASFPGEESA 1246
                     D  + KRA+LL  +SD D    E+ L + +++  FE+LC G +S    +S 
Sbjct: 1002 SASRNFLPEDGGSAKRAKLLPDQSDEDIHLSEDILASRKEDICFEELCDGASSSVDNDSV 1061

Query: 1247 SSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRFYKGISRQVDLSSV 1306
            +    +  WGLL+GH LA +FHF+R+D+KSL  ++ TCR W AA ++Y+ + R +DLSSV
Sbjct: 1062 NPRAGNESWGLLNGHVLARIFHFMRADVKSLISSAATCRSWNAAAKYYRNMCRFIDLSSV 1121

Query: 1307 GPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGMLEEILQSFPHLSSIDIRGCGQFGE 1366
            GP CTDS+    +  ++K+ + +++L GC+N++S  L  +L+  P +S + I+GCG  G+
Sbjct: 1122 GPLCTDSVFCDIMAGYEKQNIRTLILAGCSNLSSHALGRVLEQLPQISYVHIQGCGHLGD 1181

Query: 1367 LALKFPNINWVKSQKSRGAKFNDSRSKIRSLKQITEKSSSAPK-SKGLGDDMDDFGDLKD 1425
            L  KF ++ W++S  +      +S  K+++LKQI + ++   K ++     +D   +L  
Sbjct: 1182 LKSKFQHVKWIRSSLNP----EESYQKMKTLKQIGDGNNYTSKVARNFTSQLDGSDELDG 1237

Query: 1426 YFESVDKRDSANQSFRRSLYQRSKVFDARKSSSILSRDARMRRWSIKKSENGYKRMEEFL 1485
            YF  +  R++AN SF +  Y+RSK+ DARKSS++LSRDA MRR   +++EN Y++MEEF+
Sbjct: 1238 YFADISNRENANLSFGQGFYKRSKLLDARKSSAVLSRDAEMRRLMQRQAENSYRKMEEFV 1297

Query: 1486 ASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGLGSVKDDISRMCRDAIKAKNRGSA 1545
             + L+EIMR N F+FF+PKVA+IEGR+K GYY  HG  ++K DI  MC+DA++ K+   +
Sbjct: 1298 INRLREIMRSNRFDFFIPKVAKIEGRLKNGYYARHGFRTIKHDIRTMCQDALRYKDGNDS 1357

Query: 1546 GDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKDESPAGLYSATSKYKKKLSKMVSE 1605
            GD+ +I   FIQLA RL  G      ER     +  D      YS  +K KKK       
Sbjct: 1358 GDIKQIVVSFIQLAKRL--GNPRHISERNGA--AAHDSLDISQYSFDTKLKKK------- 1406

Query: 1606 RKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRKSLDSGSETSDDLDGSSEDGKSD 1665
                N++ G +L     D    A D EI++ LSKL +K + SGSETSDD D  SE  +++
Sbjct: 1407 ---QNKTRGANLVAAGADNSSRAFDLEIKRSLSKLKKKDVYSGSETSDDDDVYSEGDETE 1463

Query: 1666 SESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLDFSDDREWGARMTKASLVPPVTRKY 1725
            SE+TVSDT+SD+D  S   A + +G G    + G   +DDR  GARMTKASLVPPVTRKY
Sbjct: 1464 SETTVSDTESDLDVNSG--AWDLKGNGLKLIEPGESVTDDRILGARMTKASLVPPVTRKY 1521

Query: 1726 EVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEELDMELPEVKDYKPRKQLGDQ 1785
            EVI++Y+IVAD E+V+RKMRV+LP+DY+EKL +QKNG+E L  ELPEVKDY+PRK  GD+
Sbjct: 1522 EVIEEYLIVADVEEVQRKMRVALPDDYSEKLLSQKNGTENL--ELPEVKDYQPRKVAGDE 1579

Query: 1786 VFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNKQ 1832
            + EQEVYGIDPYTHNLL D MP +L+ +  +KH+FIE+ L    NK+
Sbjct: 1580 ILEQEVYGIDPYTHNLLSDIMPADLELSPTDKHIFIEEGLGVVCNKK 1626



 Score =  901 bits (2329), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 426/645 (66%), Positives = 507/645 (78%), Gaps = 18/645 (2%)

Query: 1812 WNLLEKHLFIEDV--LLRTL---------NKQVRHFTGTGNT--PMMYPLQP---VIEEI 1855
            + ++E++L + DV  + R +          K +    GT N   P +   QP     +EI
Sbjct: 1521 YEVIEEYLIVADVEEVQRKMRVALPDDYSEKLLSQKNGTENLELPEVKDYQPRKVAGDEI 1580

Query: 1856 EKEAVDDCDVRTMKMCRGILKA-MDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLG 1914
             ++ V   D  T  +   I+ A ++  P DK++   +GLGVVCNK+GGFG DDFVVEF G
Sbjct: 1581 LEQEVYGIDPYTHNLLSDIMPADLELSPTDKHIFIEEGLGVVCNKKGGFGVDDFVVEFFG 1640

Query: 1915 EVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASR 1974
            EVYP W+W+EKQDGI+ +Q N+ED APEFYNI LERPKGD DGYDLV VDAMHKANYASR
Sbjct: 1641 EVYPSWRWYEKQDGIKHIQNNSEDQAPEFYNIMLERPKGDRDGYDLVFVDAMHKANYASR 1700

Query: 1975 ICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQ 2034
            ICHSC PNCEAKVTAVDG YQIG+YT+R I  GEEITFDYNSVTESKEE+EASVCLCGSQ
Sbjct: 1701 ICHSCNPNCEAKVTAVDGKYQIGVYTLRPIAEGEEITFDYNSVTESKEEHEASVCLCGSQ 1760

Query: 2035 VCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLG 2094
            VCRGSYLN +GEGAFEKVL E HG+LDRH L+L+ACE +SVS++D ++LGRAGLG+CLL 
Sbjct: 1761 VCRGSYLNFSGEGAFEKVLMEFHGVLDRHSLLLQACETDSVSQQDLIDLGRAGLGTCLLA 1820

Query: 2095 GLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVY 2154
            GLP W+VAY+A LVRFI LER KLP+EILRHN++EKR++  +I ++ EK+DAEVQAEGV 
Sbjct: 1821 GLPGWLVAYTANLVRFIYLERQKLPDEILRHNVDEKRQFLIEINMDSEKNDAEVQAEGVL 1880

Query: 2155 NQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLWKGEGSLVEELIQCMA 2214
            N RLQ +  TLDKVRYVMRCVFGDPK APPP+ RLS +  VS +WKG+ S+V EL+Q M 
Sbjct: 1881 NSRLQQIVHTLDKVRYVMRCVFGDPKNAPPPLVRLSGKSLVSAIWKGDSSIVAELLQSME 1940

Query: 2215 PHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIH 2274
            PHVEE+VL+DLK KI+AHDP  SEDI+  +R SLLWLRDE+R LPCTYKCRHDAAADLIH
Sbjct: 1941 PHVEEEVLSDLKVKIRAHDPPDSEDIEGGIRNSLLWLRDELRTLPCTYKCRHDAAADLIH 2000

Query: 2275 IYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKLGADLQVYRKTYGENYCLGQLIFW 2334
            +YAYTKCFFRV++YK   SPPV+ISPLDLGPKYADKLG   Q Y KTY ENYCL QLI+W
Sbjct: 2001 LYAYTKCFFRVRDYKTVKSPPVHISPLDLGPKYADKLGPGFQEYCKTYPENYCLAQLIYW 2060

Query: 2335 HIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRP 2394
            + Q N++P+  L RA +GC+SLPD+ SFY K  KPS+ RVYG +TVRFMLSRMEKQ QRP
Sbjct: 2061 YSQ-NSEPESRLTRARKGCMSLPDVSSFYVKSAKPSQERVYGNRTVRFMLSRMEKQAQRP 2119

Query: 2395 WPKDRIWAFKSSPRIFGSPMLDSSLTGCPLDREMVHWLKHRPAIF 2439
            WPKDRIW FKS PR FGSPM+D+ L   PLD+EMVHWLK RP +F
Sbjct: 2120 WPKDRIWVFKSDPRFFGSPMMDAVLNNSPLDKEMVHWLKTRPNVF 2164


>gi|168059519|ref|XP_001781749.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666751|gb|EDQ53397.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 2661

 Score =  846 bits (2186), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 543/1419 (38%), Positives = 762/1419 (53%), Gaps = 214/1419 (15%)

Query: 1199 TRAGKRARLLVRESDGDEETEEELQTIQDESTFEDLCGDASFPGEESASSAIESGGWGLL 1258
            T  GK AR    E   D+ET E     ++ S       DA       +SS I   GW  L
Sbjct: 1282 TSTGKNAR----EVSSDQETSETGAVHENLSNL-----DAG-----GSSSQI---GWAWL 1324

Query: 1259 DGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRFYKGISRQVDLSSVGPNCTDSLIRKT 1318
                L  V   LR D KSL  A  TC+ W+   +  K  ++ VDLS +G  CTD+ +   
Sbjct: 1325 PTRILTKVLRLLRGDPKSLVAAMGTCQSWKDCAQDIKVSTKHVDLSGLGLRCTDA-VLGG 1383

Query: 1319 LNAFDKEKLNSILLVGCTNITSGMLEEILQSFPHLSSIDIRGCGQFGELALKFPNINWVK 1378
            L  F   +L  I L  C N++S  LE +L+S+P +  + I GC +  EL   +P + WV 
Sbjct: 1384 LLGFGGGQLKHITLDHCLNVSSKGLERLLKSYPSIREVGICGCARLIELVELYPQVRWVG 1443

Query: 1379 ---------------------SQKSRGAK--FNDSRSKIRSLKQI------TEKSSSAPK 1409
                                 S KS G K  + D  +   SL +       TEKS S   
Sbjct: 1444 NPFAVAHGIDTQRHGLRYNKLSSKSSGGKREYGDDVNSGGSLDETRYRDKSTEKSESPGT 1503

Query: 1410 SKGLGDDMD----------------DFGDLKD---------YFESVDKRDSANQSF--RR 1442
             + + + +D                DF  LK+         +  S  K   + +    + 
Sbjct: 1504 IRRVVNSLDPLGKDVHASQMGYPCRDFKRLKENSMSGTRNGFCTSAGKHGISKRKLNSKS 1563

Query: 1443 SLYQRSKVFDARKSSSILSRDA----RMRRWSIKKSENGYKRMEEFLASSLKEIMRVNTF 1498
            ++  +S V  + K + + S  A    +   W++K +E   K  E+ +A +L+ +M  ++ 
Sbjct: 1564 TIKSQSSVRGSLKGTPVSSEKADNVAKEVSWNVKDAE---KNPEKAMARALRVVMEADSE 1620

Query: 1499 EFFVPKVAE-------------------IEGRMKKGYYISH-GLGSVKDDISRMCRDAIK 1538
              F     E                   ++ ++K G+Y    G+   K+D+ +  R+A +
Sbjct: 1621 HLFHRMANEQVSEGRGAPTKSGQVDFCTVQKKLKLGHYGGRDGVKLFKEDLLQPLRNAFR 1680

Query: 1539 AKNRG----SAGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKDESPAGLYSATSK 1594
             ++      +AG + ++     Q    L      S  +  E+         + + SA + 
Sbjct: 1681 LEHDSVIYKTAGRLFKVAHQVGQHLFNL-----LSKPQIRELADGQSKALSSCVTSARTL 1735

Query: 1595 YKKKLSKMVS-------ERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRKSL-- 1645
             K K SK ++       +R + + +   S+     +Y    SD ++  R S L       
Sbjct: 1736 LKSKESKDLATEKQRGPKRSWDSETGARSVKRKVLNYMNSRSDGDVLSRDSDLQGSIDRD 1795

Query: 1646 -----------------DSGSETSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARES 1688
                             ++GS   +D+D +  D   D+E++ SD  S     +D    + 
Sbjct: 1796 ERESRRDRMRRSQWAESEAGSSDEEDMDEALYD--EDTETSGSDVASKSGI-ADELVYDY 1852

Query: 1689 RGAGDF-TTDEGLDFSDDREW-GARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMR- 1745
            R A D      G      R+W GARMTKA++VPP+TRKYEVI++Y IV D E V  KM+ 
Sbjct: 1853 RDASDSDNIYSGFGEGSSRDWWGARMTKAAMVPPLTRKYEVIEEYRIVDDFERVASKMKR 1912

Query: 1746 ------------VSLPEDYAEKL-NAQKNGSEELDMELPEVKDYKPRKQLGDQVFEQEVY 1792
                        V LP+DY EKL  A+K G     +++PE+K+ +PRK+LG +V EQEVY
Sbjct: 1913 HCEVTSVNFWRKVILPDDYEEKLWAAKKVGDRYAHLDVPELKECRPRKRLGKEVLEQEVY 1972

Query: 1793 GIDPYTHNLLLDSMPDELD-WNLLEKHLFIEDVLLRTLNKQVRHFTGTGNTPMMYPLQPV 1851
            GIDPYT+NLLL++MP + + +   +K LFIE+ LLR LN++V  FTG+G  PM Y L+ V
Sbjct: 1973 GIDPYTYNLLLNTMPADTELFTEKQKQLFIEEKLLRALNREVSSFTGSGKAPMEYSLEKV 2032

Query: 1852 IEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVE 1911
            I  I  +A  D  ++    CR +LK M    +DKYVAYRKGLGVVCNK  GF + DFVVE
Sbjct: 2033 IAHICGDAHADQPLQVF--CRSLLKNMQRHLNDKYVAYRKGLGVVCNKPEGFDDGDFVVE 2090

Query: 1912 FLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANY 1971
            F GEVYP W+W+EKQDGIR+LQK  ++PAPEFYNI  ERPKGD+ GYD++VVDAMHKAN+
Sbjct: 2091 FFGEVYPPWRWYEKQDGIRALQKKEKEPAPEFYNIVFERPKGDSWGYDVLVVDAMHKANF 2150

Query: 1972 ASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLC 2031
            ASR+CHSCRPNCEAKVTAV+G Y IG+YT+R I +GEE+TFDY  VTESKEE+++SVCLC
Sbjct: 2151 ASRLCHSCRPNCEAKVTAVNGKYMIGVYTLRKIEFGEELTFDYCCVTESKEEHDSSVCLC 2210

Query: 2032 GSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSC 2091
            GSQ C+GSYL  TG GA+++VLKE HG+LDRH L+L+AC   +V+  +  +L +AGLG C
Sbjct: 2211 GSQGCKGSYLCYTGPGAYDEVLKECHGILDRHNLLLQACTSGAVTFREQEDLKQAGLGPC 2270

Query: 2092 LLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQA- 2150
            LL GLP WV+ Y+A +V ++N ER +LP+E++      K +      +++++ D EVQA 
Sbjct: 2271 LLDGLPQWVIKYAAGIVSYLNFERQRLPDELM------KAEMLKHTGIDLDRQDVEVQAH 2324

Query: 2151 ---------------EGVYNQRLQNLAVTLDK--------------------------VR 2169
                           EGVYNQRLQNLA+TLDK                          VR
Sbjct: 2325 AALLSEGIPLETWMTEGVYNQRLQNLAITLDKVMPPPTISLCTGIVGVAEILSPLFLQVR 2384

Query: 2170 YVMRCVFG-DPKKAPPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSK 2228
            +V+  ++G +  KA PP+  L P E V ++W G+ S+V EL+QCMA H  E  L DL  +
Sbjct: 2385 HVLTKLYGEEASKASPPLRMLEPHELVDYIWTGKDSVVGELLQCMAVHSPEG-LADLTRQ 2443

Query: 2229 IQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEY 2288
            IQ H+P    DI+  LR+SLLWLRD +R +P T   RHDAAADLIH+YAYTK FF   +Y
Sbjct: 2444 IQDHNPPPGGDIEENLRRSLLWLRDTLRKVPATCMGRHDAAADLIHLYAYTKHFFTNNDY 2503

Query: 2289 KAFTSPPVYISPLDLGPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLAR 2348
                SPP+ I   DLGPK++   GA   ++RK+Y +NY  GQLI W  QT+ DP  +L +
Sbjct: 2504 GLVDSPPILIYACDLGPKHS---GAGPYMWRKSYSKNYVWGQLISWFRQTSVDPGASLVQ 2560

Query: 2349 ASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPWPKD---RIWAFKS 2405
              RGCL LPDI S YA+  +      Y  K  + M+  ME  PQ+ W +     +W FKS
Sbjct: 2561 DRRGCLMLPDISSCYARTIQHDFRCGYSDKDRKKMIMHMETYPQKKWTRKLTPELWNFKS 2620

Query: 2406 SPRIFGSPMLDSSLTGCPLDREMVHWLKHRPAIFQAMWD 2444
               +FGSPMLD+++    L++E + WLK R  +F   WD
Sbjct: 2621 DRGLFGSPMLDAAVAKTKLNKECMQWLKTRDTVFHGPWD 2659



 Score =  107 bits (266), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/165 (36%), Positives = 89/165 (53%), Gaps = 23/165 (13%)

Query: 1038 DLQLQLGEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFAT--- 1094
            +LQL+ G W+YLD AGHERGP + S L+ +V +G +    SV RK D +WVP++      
Sbjct: 1039 ELQLESGVWHYLDAAGHERGPFTLSALKGIVAEGGLPAGASVLRKRDNLWVPVSHLVQYY 1098

Query: 1095 ----------------ETSASTVRNHGEKIMPSGDSSGLPPTQSQDAVLGESNNNVNSNA 1138
                            E SA+ VR+    +  +      P   +  A+ G   ++V+S+ 
Sbjct: 1099 DAHSPAFLSKLQPDYLERSANLVRSAASTVASTVQDPAHP---ANLALKGLDVHSVSSST 1155

Query: 1139 FHTMHPQFIGYTRGKLHELVMKSYKNREFAAAINEVLDPWINAKQ 1183
            FH   PQF+GYT GKLHE VMKS++   FA   N+ LD W ++K+
Sbjct: 1156 FHNELPQFLGYTNGKLHEYVMKSFRG-SFAGFFNDALDVWSSSKR 1199


>gi|168035499|ref|XP_001770247.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162678464|gb|EDQ64922.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 2852

 Score =  835 bits (2158), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 417/800 (52%), Positives = 543/800 (67%), Gaps = 36/800 (4%)

Query: 1664 SDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLDFSDDREWGARMTKASLVPPVTR 1723
            SD  S     D  MD  +DG   +     D  +  G + S    WGARMTKA++VPP+TR
Sbjct: 2068 SDVASKSGIADEAMDDYNDGSESD-----DAYSGYGGEGSSRDWWGARMTKAAMVPPLTR 2122

Query: 1724 KYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEEL-DMELPEVKDYKPRKQL 1782
            KYEVI++Y IV D E V  KM+V LP+DY EKL   K G +    +++PE+K++KPRK+L
Sbjct: 2123 KYEVIEEYRIVDDYERVVAKMKVELPDDYEEKLRVAKKGGDRFAHLDVPELKEFKPRKRL 2182

Query: 1783 GDQVFEQEVYGIDPYTHNLLLDSMP-DELDWNLLEKHLFIEDV-------------LLRT 1828
            G++V EQEVYGIDPYT+NLLL++MP D   +   +K LFIE+              LLR 
Sbjct: 2183 GEEVLEQEVYGIDPYTYNLLLNTMPADTESFTDKQKQLFIEEAFSDEDCSLDMLQKLLRA 2242

Query: 1829 LNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVA 1888
            LN++V  FTG+G  PM Y L+ VI  I  +   D  ++    CR +LK M S  +DKYVA
Sbjct: 2243 LNREVSSFTGSGKAPMEYSLEKVISHICSDVHADQPLQVF--CRSLLKNMKSHLNDKYVA 2300

Query: 1889 YRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYL 1948
            YRKGLGVVCNK  GF + DFVVEF GEVYP W+W+EKQDGIR+LQK  ++PAPEFYNI  
Sbjct: 2301 YRKGLGVVCNKPEGFDDGDFVVEFFGEVYPPWRWYEKQDGIRALQKKEKEPAPEFYNIVF 2360

Query: 1949 ERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGE 2008
            ERPKGD+ GYD++VVDAMHKAN+ASR+CHSCRPNCEAKVTAV+G Y IG+YT+R I +GE
Sbjct: 2361 ERPKGDSLGYDVLVVDAMHKANFASRLCHSCRPNCEAKVTAVNGKYMIGVYTLRKIEFGE 2420

Query: 2009 EITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLE 2068
            E+TFDY  VTESKEE+++SVCLCGSQ C+GSYL  TG GA+++VLKE HG+LDRH L+L+
Sbjct: 2421 ELTFDYCCVTESKEEHDSSVCLCGSQGCKGSYLCYTGPGAYDEVLKEYHGILDRHNLLLQ 2480

Query: 2069 ACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLE 2128
            AC   +V+  +  +L +AGLG CLL GLP WV+ Y+A +V ++N ER +LP+E++     
Sbjct: 2481 ACTSGAVTFREQEDLKQAGLGPCLLDGLPQWVIKYAAGIVSYLNFERQRLPDELM----- 2535

Query: 2129 EKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFG-DPKKAPPPVE 2187
             K +      +++++ D EVQ EGVYNQRLQNLA+TLDKVR+++  ++G +  KAPPP+ 
Sbjct: 2536 -KAEMLKQTGIDIDRQDIEVQTEGVYNQRLQNLAITLDKVRHILVKLYGEEASKAPPPLR 2594

Query: 2188 RLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKS 2247
             L P E V  +W G+ S+V EL+QCMA H  E  L DL  +IQ H+P    DI+  LRKS
Sbjct: 2595 MLEPHELVKCIWTGKDSIVGELLQCMALHSPEG-LADLTKQIQEHNPPPGGDIEENLRKS 2653

Query: 2248 LLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKY 2307
            LLWLRD +R +P T   RHDAAADLIH+YAYTK FF   +Y    SPP+ I   DLGPK+
Sbjct: 2654 LLWLRDTLRKVPATCMGRHDAAADLIHLYAYTKHFFTNYDYGMVDSPPILIYACDLGPKH 2713

Query: 2308 ADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQ 2367
            +   GA   ++RK+Y +NY  GQLI W  QT+ADP  +L +  RGCL LPDI S YA+  
Sbjct: 2714 S---GAGPYMWRKSYSKNYIWGQLISWFRQTSADPGASLVQDRRGCLMLPDISSCYARTI 2770

Query: 2368 KPSRHRVYGPKTVRFMLSRMEKQPQRPWPKD---RIWAFKSSPRIFGSPMLDSSLTGCPL 2424
            +      Y  K  + M+  ME  PQ+ W +     +W FKS   +FGSPMLD+++    L
Sbjct: 2771 QHDFRCGYSDKDRKRMILHMETHPQKKWTRKFTPELWNFKSDRGLFGSPMLDAAVAKTKL 2830

Query: 2425 DREMVHWLKHRPAIFQAMWD 2444
            ++E +HWLK R  +F   WD
Sbjct: 2831 NKECIHWLKTRETVFHGPWD 2850



 Score =  113 bits (282), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 67/181 (37%), Positives = 91/181 (50%), Gaps = 20/181 (11%)

Query: 1022 SIACINTPKDRLCTVDDLQLQLGEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFR 1081
            +  C   P   +    +LQL+ G W+YLD AGHERGP   S LQ  V +G +    SVFR
Sbjct: 1174 AFKCTKNPASVVLRKHELQLESGVWHYLDAAGHERGPFLMSALQGFVAEGGLPAGASVFR 1233

Query: 1082 KFDKVWVPLTFATETSASTVRNHGEKIMP-SGDSSGLP--PTQSQDAVLGESN------- 1131
            K D +WVP++   +   +       K +P S + SG P  P     A+  E         
Sbjct: 1234 KRDNLWVPVSHLIQLHNAHAPAFASKPLPHSLERSGYPVRPAVPSGAISCEDPAHTAHPA 1293

Query: 1132 ---------NNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNREFAAAINEVLDPWINAK 1182
                      +V+S+ FH  HPQF+GYT GKLHE  MKS++   FA   N+ LD W N+K
Sbjct: 1294 HPDLMELDVRSVSSSIFHDDHPQFLGYTNGKLHEHAMKSFRG-SFAGFFNDALDVWSNSK 1352

Query: 1183 Q 1183
            +
Sbjct: 1353 R 1353



 Score = 79.3 bits (194), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 52/169 (30%), Positives = 81/169 (47%), Gaps = 12/169 (7%)

Query: 1254 GWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRFYKGISRQVDLSSVGPNCTDS 1313
            G   L    L  V   LR D KSL  A  TC+ W+   +  +  ++ VDLS +G +C D+
Sbjct: 1547 GRAWLPPRILTKVLRLLRGDPKSLVAAMATCQSWKNCAQSIRMSTKHVDLSGLGSHCNDA 1606

Query: 1314 LIRKTLNAFDKEKLNSILLVGCTNITSGMLEEILQSFPHLSSIDIRGCGQFGELALKFPN 1373
            +I   L      KL  I L  C N++S  L  +L+S+P +  + I GC Q  EL   +P 
Sbjct: 1607 IIGGLLGF-GGGKLRRITLDYCLNVSSKALGRLLKSYPSIREVSISGCVQLSELVELYPQ 1665

Query: 1374 INWVKSQKSRGAKFNDSRSKIRSLKQITEKSSSAPKSKGL----GDDMD 1418
            ++WV +  +        R  +++ K       + PKS G+    GDD++
Sbjct: 1666 VSWVGNPFAIPHGLESQRHNLKNNK-------TNPKSSGIKREFGDDVN 1707


>gi|115475081|ref|NP_001061137.1| Os08g0180100 [Oryza sativa Japonica Group]
 gi|46805056|dbj|BAD17037.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
 gi|113623106|dbj|BAF23051.1| Os08g0180100 [Oryza sativa Japonica Group]
          Length = 494

 Score =  791 bits (2042), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 365/492 (74%), Positives = 418/492 (84%), Gaps = 1/492 (0%)

Query: 1948 LERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYG 2007
            LERPKGD DGYDLV VDAMHKANYASRICHSC PNCEAKVTAVDGHYQIGIYTVR I  G
Sbjct: 2    LERPKGDRDGYDLVFVDAMHKANYASRICHSCNPNCEAKVTAVDGHYQIGIYTVRPIAEG 61

Query: 2008 EEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLML 2067
            EEITFDYNSVTESKEE+EASVCLCGSQ+CRGSYLN +GEGAFEKVL E HG+LDRH L+L
Sbjct: 62   EEITFDYNSVTESKEEHEASVCLCGSQICRGSYLNFSGEGAFEKVLMEFHGVLDRHSLLL 121

Query: 2068 EACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNL 2127
            +ACE NSVS++D ++LGRAGLG+CLL GLP W+VAY+A LVRFI  ER KLP EI +HN+
Sbjct: 122  QACEANSVSQQDLIDLGRAGLGTCLLAGLPGWLVAYTAHLVRFIFFERQKLPHEIFKHNV 181

Query: 2128 EEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVE 2187
            +EKR++F+DI ++ EK+DAEVQAEGV N RLQNL  TLDKVRYVMRC+FGDPK APPP+ 
Sbjct: 182  DEKRQFFTDINMDSEKNDAEVQAEGVLNSRLQNLTHTLDKVRYVMRCIFGDPKNAPPPLV 241

Query: 2188 RLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKS 2247
            RL+    VS +WKGEGSLV+EL++ M PHVEEDVL DLK+KI+AHDPSGSEDI+ E+R S
Sbjct: 242  RLTGRSLVSAIWKGEGSLVDELLESMEPHVEEDVLTDLKAKIRAHDPSGSEDIEGEIRSS 301

Query: 2248 LLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKY 2307
            LLWLRDE+R L CTYKCRHDAAADLIH+YAYTKCFFRV++YK   SPPV ISPLDLGPKY
Sbjct: 302  LLWLRDELRTLSCTYKCRHDAAADLIHMYAYTKCFFRVRDYKTVKSPPVLISPLDLGPKY 361

Query: 2308 ADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQ 2367
            ADKLG   Q Y KTY ENYCLGQLI+W+ Q NA+P+  L RA +GC+SLPD+ SFY K  
Sbjct: 362  ADKLGPGFQEYCKTYPENYCLGQLIYWYSQ-NAEPESRLTRARKGCMSLPDVSSFYVKSV 420

Query: 2368 KPSRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSLTGCPLDRE 2427
            KP++ RVYG +TVRFML+RME Q QRPWPKDRIW FKS PR FG+PM+D+ L   PLD+E
Sbjct: 421  KPTQERVYGSRTVRFMLARMENQAQRPWPKDRIWVFKSDPRFFGTPMMDAVLNNSPLDKE 480

Query: 2428 MVHWLKHRPAIF 2439
            MVHWLK R  +F
Sbjct: 481  MVHWLKTRSNVF 492


>gi|302821685|ref|XP_002992504.1| hypothetical protein SELMODRAFT_448778 [Selaginella moellendorffii]
 gi|300139706|gb|EFJ06442.1| hypothetical protein SELMODRAFT_448778 [Selaginella moellendorffii]
          Length = 1806

 Score =  672 bits (1733), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 322/525 (61%), Positives = 406/525 (77%), Gaps = 8/525 (1%)

Query: 1704 DDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQK-NG 1762
            D REWGARMTKA++VPPVTRKYE+I+ Y IV D++ V RKM+V +P+DY EKL A K   
Sbjct: 1260 DVREWGARMTKAAMVPPVTRKYEIIEDYWIVIDKDLVERKMKVEVPDDYEEKLRASKLKR 1319

Query: 1763 SEELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIE 1822
             E   +++P++K+Y PR++LG +V EQEVYGIDPYTHNLLLD+MP      L EK  F+E
Sbjct: 1320 GEYSHLDIPDIKEYHPRRELGLEVMEQEVYGIDPYTHNLLLDTMPKIPAMTLQEKLQFME 1379

Query: 1823 DVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDC-DVRTMKMCRGILKAMDSR 1881
            + LL+ +NK+V+ FTGTG  P+ + L+PVI+ I    VDD  D    +  +G+L  M +R
Sbjct: 1380 ETLLQAINKEVKQFTGTGKAPIDFSLEPVIQRI----VDDAQDTSMQQFAQGLLSNMRNR 1435

Query: 1882 PDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAP 1941
              +KY+AYRKGLGVVCNK+GGF EDDFVVEF GEVYP W+W+EKQDG R LQK +++P P
Sbjct: 1436 TKEKYLAYRKGLGVVCNKDGGFKEDDFVVEFFGEVYPAWRWYEKQDGCRYLQKKDKEPLP 1495

Query: 1942 EFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTV 2001
            EFYNI LERPKGDA GYDLVVVDAMHKAN+ASRICHSCRPNCEAKVTAV G Y IG+Y +
Sbjct: 1496 EFYNILLERPKGDAAGYDLVVVDAMHKANFASRICHSCRPNCEAKVTAVKGRYIIGVYAL 1555

Query: 2002 RGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLD 2061
            R I  GEE+TFDYNSVTESKEEY  S+CLCGSQ CRGSYLNL   GA + V+KE HGLLD
Sbjct: 1556 RPIQNGEELTFDYNSVTESKEEYNNSICLCGSQCCRGSYLNLANAGASQDVIKERHGLLD 1615

Query: 2062 RHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEE 2121
            RH L+LEAC    V+  +  E+ +AG+GSCLL GLP+W++ Y+ARLV F+NLER  LP+E
Sbjct: 1616 RHVLLLEACCEGPVTRLELEEMRQAGVGSCLLDGLPDWLLKYTARLVEFMNLERQLLPDE 1675

Query: 2122 ILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKK 2181
            ++R ++++KRK  +D+  E+ + DAE QAEGVYNQRLQN+A+TLDKVRYV+R +F DP++
Sbjct: 1676 LMR-SVKKKRKD-ADLSYELGRVDAENQAEGVYNQRLQNIAITLDKVRYVLRQLFTDPRE 1733

Query: 2182 APPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLK 2226
            APP +++L  +E VS LW  E S+V EL+ CM PH+  D L +LK
Sbjct: 1734 APPLLKKLDQKELVSRLWSAENSIVNELLSCMMPHIPADRLAELK 1778



 Score =  208 bits (529), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 219/824 (26%), Positives = 330/824 (40%), Gaps = 143/824 (17%)

Query: 643  WFYLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSI 702
            W Y++  G   GP  L  LK L     +  DH +    +N W T+E+A SP         
Sbjct: 420  WVYINKNGDTQGPMELAALKLLASREFMPDDHLVMRCGTNAWITLEHAQSP-------DN 472

Query: 703  TSDSVTQLVSPPEASGNLLADTGDTAQSTGEEFPVTLQSQCCPDGSAA-AAESSEDLHID 761
             S ++ +LV+P  A+ N  A      +ST              DG+A    ES E L ID
Sbjct: 473  GSSALQKLVNPVAATRN--AKDATLVEST-------------IDGNAQFQNESFEFLDID 517

Query: 762  VRVGALLDGFTVIPGKEIETLGEILQTTFERVDWQNNGGPTWHGACVGEQKPGDQKVDEL 821
             RV  +L              G  L++    +D Q    P    A      P     +  
Sbjct: 518  GRVERILATCRA------SDQGNELKSVLSALDAQRQHSPDEGSAPPELSNPWKDSYNLG 571

Query: 822  YISDTKMK--EAAELKSGDKDHWVVCFDSDE-------------WFSGRWSCKGGDWKRN 866
            +  D  ++  +A       +        ++E             W  GRW+ KGGDWK  
Sbjct: 572  FGVDADLECLDAVPRVVPPEPVPAPPPSTEEDLHHHDHNPSPLKWKPGRWTSKGGDWKLL 631

Query: 867  DEAAQDRCSRKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYACPD 926
                Q+     K VLN+G  LC+ P  G  DPR   +      S   + +LP WA     
Sbjct: 632  HPDGQNYV---KVVLNEGSLLCERPHYGV-DPRRQVQ-----VSERPKFELPQWAL---- 678

Query: 927  ERNDGSGGSRSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSSR 986
            +RN  +  S  T    +A R  K          A    DHG  ++  R+ +   ER SS 
Sbjct: 679  DRNQKADSSAETTKSASATRPAK---------TATRAFDHGKEIAPERASM---ERPSSF 726

Query: 987  SARSYSSANDVRRSSAESDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGEW 1046
              +  +S +D R  +   + H+ +     S+    S +C    K + C  D   L  G+W
Sbjct: 727  LTKK-ASFSDTRPKTLPPERHTPSARTFASKAQRPS-SCSADIKAK-CNHD---LGRGDW 780

Query: 1047 YYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRNHGE 1106
            +Y DG G ERGP SF+ELQ ++ +  +   TS +RK D +WVPL                
Sbjct: 781  FYKDGGGRERGPYSFAELQAMIGRELLIPGTSAYRKSDDLWVPLP--------------- 825

Query: 1107 KIMPSGDSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNRE 1166
               P  D      T+   +   +    V       +    + +T G+LHE VMK YKN+ 
Sbjct: 826  --RPEMDDGNFNVTEVTTSSF-DGARRVRKVVTTNIQQTIMAFTSGQLHEHVMKHYKNQV 882

Query: 1167 FAAAINEVLDP-------------------------WI----NAKQPKKETEHVYRKSEG 1197
             +A + E LD                          WI    +   P    +    + +G
Sbjct: 883  MSAILFEGLDARAKLIESNRRLSTCSTVALLSGEDSWIGYGRSHPSPSSSNDTSDEREDG 942

Query: 1198 DTRAGKRARLLVRE----------------SDGDEETEEELQTIQDESTFEDLCGDASFP 1241
            D       R L R                 +D  E +E+EL T +       +  D    
Sbjct: 943  DQDHRPNRRPLFRSNGLSQEQTSRKRRLVYNDDVESSEDELPTGRRTRQRCLIRNDVFDS 1002

Query: 1242 GEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRFYKGISRQV 1301
             +E      ++  W  L    L  ++H L+ D+KSLA  S+TC+ WRAAV  +K   + +
Sbjct: 1003 SDEHLYEEGQNNSWETLGQLMLMRIYHHLKGDLKSLALISMTCKSWRAAVEKFKPKVKCL 1062

Query: 1302 DLSSVGPNCTDSLIRKTLNA-FDKEKLNSILLVGCTNITSGMLEEILQSFPHLSSIDIRG 1360
            D +S+G +CTD+++       +    L  ILL  CT ++   L + L++ P +  +DI G
Sbjct: 1063 DFTSIGLHCTDAVLSSVQQQNYGGGNLKQILLKDCTVLSPDALGKFLEACPTIQDVDING 1122

Query: 1361 CGQFGELALKFPNINWV--KSQKSRGAK--FNDSRSKIRSLKQI 1400
            C QFG+L+  FP +NWV   S+ S  A    +DS  K++SL  I
Sbjct: 1123 CDQFGDLSHSFPQVNWVYDDSEVSDSATQGSDDSHRKMKSLNSI 1166


>gi|302817012|ref|XP_002990183.1| hypothetical protein SELMODRAFT_447947 [Selaginella moellendorffii]
 gi|300142038|gb|EFJ08743.1| hypothetical protein SELMODRAFT_447947 [Selaginella moellendorffii]
          Length = 1749

 Score =  669 bits (1727), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 321/525 (61%), Positives = 405/525 (77%), Gaps = 8/525 (1%)

Query: 1704 DDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQK-NG 1762
            D REWGARM KA++VPPVTRKYE+I+ Y IV D++ V RKM+V +P+DY EKL A K   
Sbjct: 1203 DVREWGARMPKAAMVPPVTRKYEIIEDYWIVIDKDLVERKMKVEVPDDYEEKLRASKLKR 1262

Query: 1763 SEELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIE 1822
             E   +++P++K+Y PR++LG +V EQEVYGIDPYTHNLLLD+MP      L EK  F+E
Sbjct: 1263 GEYSHLDIPDIKEYHPRRELGLEVMEQEVYGIDPYTHNLLLDTMPKIPAMTLQEKLQFME 1322

Query: 1823 DVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDC-DVRTMKMCRGILKAMDSR 1881
            + LL+ +NK+V+ FTGTG  P+ + L+PVI+ I    VDD  D    +  +G+L  M +R
Sbjct: 1323 ETLLQAINKEVKQFTGTGKAPIDFSLEPVIQRI----VDDAQDTSMQQFAQGLLSNMRNR 1378

Query: 1882 PDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAP 1941
              +KY+AYRKGLGVVCNK+GGF EDDFVVEF GEVYP W+W+EKQDG R LQK +++P P
Sbjct: 1379 TKEKYLAYRKGLGVVCNKDGGFKEDDFVVEFFGEVYPAWRWYEKQDGCRYLQKKDKEPLP 1438

Query: 1942 EFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTV 2001
            EFYNI LERPKGDA GYDLVVVDAMHKAN+ASRICHSCRPNCEAKVTAV G Y IG+Y +
Sbjct: 1439 EFYNILLERPKGDAAGYDLVVVDAMHKANFASRICHSCRPNCEAKVTAVKGRYIIGVYAL 1498

Query: 2002 RGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLD 2061
            R I  GEE+TFDYNSVTESKEEY  S+CLCGSQ CRGSYLNL   GA + V+KE HGLLD
Sbjct: 1499 RPIQNGEELTFDYNSVTESKEEYNNSICLCGSQCCRGSYLNLANAGASQDVIKERHGLLD 1558

Query: 2062 RHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEE 2121
            RH L+LEAC    V+  +  E+ +AG+GSCLL GLP+W++ Y+ARLV F+NLER  LP+E
Sbjct: 1559 RHVLLLEACCEGPVTRLELEEMRQAGVGSCLLDGLPDWLLKYTARLVEFMNLERQLLPDE 1618

Query: 2122 ILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKK 2181
            ++R ++++KRK  +D+  E+ + DAE QAEGVYNQRLQN+A+TLDKVRYV+R +F DP++
Sbjct: 1619 LMR-SVKKKRKD-ADLSYELGRVDAENQAEGVYNQRLQNIAITLDKVRYVLRQLFTDPRE 1676

Query: 2182 APPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLK 2226
            APP +++L  +E VS LW  E S+V EL+ CM PH+  D L +LK
Sbjct: 1677 APPLLKKLDQKELVSRLWSAENSIVNELLSCMMPHIPADRLAELK 1721



 Score =  196 bits (497), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 163/574 (28%), Positives = 244/574 (42%), Gaps = 95/574 (16%)

Query: 850  EWFSGRWSCKGGDWKRNDEAAQDRCSRKKQVLNDGFPLCQMPKSGYEDPRWNQKDDLYYP 909
            +W  GRW+ KGGDWK      Q+     K VLN+G  LC+ P  G  DPR   +      
Sbjct: 587  KWKPGRWTSKGGDWKLLHPDGQNYV---KVVLNEGSLLCERPHYGV-DPRRQVQ-----V 637

Query: 910  SHSRRLDLPPWAYACPDERNDGSGGSRSTQSKLAAVRGVKGTMLPVVRINACVVNDHGSF 969
            S   + +LP WA     +RN  +  S  T    +A R  K          A    DHG  
Sbjct: 638  SERPKFELPQWAL----DRNQKADSSAETTKSASATRPAK---------TATRAFDHGKE 684

Query: 970  VSEPRSKVRAKERHSSRSARSYSSANDVRRSSAESDSHSKARNNQDSQGSWKSIACINTP 1029
            ++  R+ +   ER SS   +  +S +D R  +   + H+ +     S+    S +C    
Sbjct: 685  IAPERASM---ERPSSFLTKK-ASFSDTRPKTLPPERHTPSARTFASKAQRPS-SCSADI 739

Query: 1030 KDRLCTVDDLQLQLGEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVP 1089
            K + C  D   L  G+W+Y DG G ERGP SF+ELQ +V +  +   TS +RK D +WVP
Sbjct: 740  KAK-CNHD---LGRGDWFYKDGGGRERGPYSFAELQAMVGRELLIPGTSAYRKSDDLWVP 795

Query: 1090 LTFATETSASTVRNHGEKIMPSGDSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGY 1149
            L                   P  D      T+   +   +    V       +    + +
Sbjct: 796  LP-----------------RPEMDDGNFNVTEVTTSSF-DGARRVRKVVTTNIQQTIMAF 837

Query: 1150 TRGKLHELVMKSYKNREFAAAINEVLDP-------------------------WI----N 1180
            T G+LHE VMK YKN+  +A + E LD                          WI    +
Sbjct: 838  TSGQLHEHVMKHYKNQVMSAILFEGLDARAKLIESNRRLSTCSTVALLSGEDSWIGYGRS 897

Query: 1181 AKQPKKETEHVYRKSEGDTRAGKRARLLVRE----------------SDGDEETEEELQT 1224
               P    +    + +GD       R L R                 +D  E +E+EL T
Sbjct: 898  HPSPSSSNDTSDEREDGDQDHRPNRRPLFRSNGLSQEQTSRKRRLVYNDDVESSEDELPT 957

Query: 1225 IQDESTFEDLCGDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTC 1284
             +       +  D     +E    A ++  W  L    L  ++H L+ D+KSLA  S+TC
Sbjct: 958  GRRTRQRRLIRNDVFDSSDEHLYEAGQNNSWETLGQLMLMRIYHHLKGDLKSLALISMTC 1017

Query: 1285 RHWRAAVRFYKGISRQVDLSSVGPNCTDSLIRKTLNA-FDKEKLNSILLVGCTNITSGML 1343
            + WRAAV  +K   + +D +S+G +CTD+++       +    L  ILL  CT ++   L
Sbjct: 1018 KSWRAAVEKFKPKVKCLDFTSIGVHCTDAVLSSVQQQNYGGGNLKQILLKDCTVLSPDAL 1077

Query: 1344 EEILQSFPHLSSIDIRGCGQFGELALKFPNINWV 1377
             + L++ P +  +DI GC QFG+L+  FP +NWV
Sbjct: 1078 GKFLEACPTIQDVDINGCDQFGDLSHSFPQVNWV 1111


>gi|302825692|ref|XP_002994441.1| hypothetical protein SELMODRAFT_432361 [Selaginella moellendorffii]
 gi|300137625|gb|EFJ04494.1| hypothetical protein SELMODRAFT_432361 [Selaginella moellendorffii]
          Length = 1531

 Score =  586 bits (1511), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 294/524 (56%), Positives = 372/524 (70%), Gaps = 62/524 (11%)

Query: 1704 DDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQK-NG 1762
            D REWGARMTKA++VPPVTRKYE+I+ Y IV D++ V RKM+V +P+DY EKL A K   
Sbjct: 1041 DVREWGARMTKAAMVPPVTRKYEIIEDYWIVIDKDLVERKMQVEVPDDYEEKLRASKLKR 1100

Query: 1763 SEELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIE 1822
             E   +++P++K+Y PR++LG +V EQEVYGIDPYTHNLLLD+MP               
Sbjct: 1101 GEYSHLDIPDIKNYHPRRELGVEVMEQEVYGIDPYTHNLLLDTMP--------------- 1145

Query: 1823 DVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRP 1882
                                P M  LQ  ++ +E+                +L+A++   
Sbjct: 1146 ------------------KIPAM-TLQEKLQFMEET---------------LLQAIN--- 1168

Query: 1883 DDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPE 1942
                   ++GLGVVCNK+GGF EDDFVVEF GEVYP W+W+EKQDG R LQK +++P PE
Sbjct: 1169 -------KEGLGVVCNKDGGFKEDDFVVEFFGEVYPAWRWYEKQDGCRYLQKKDKEPLPE 1221

Query: 1943 FYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVR 2002
            FYNI LERPKGDA GYDLVVVDAMHKAN+ASRICHSCRPNCEAKVTAV G Y IG+Y +R
Sbjct: 1222 FYNILLERPKGDAAGYDLVVVDAMHKANFASRICHSCRPNCEAKVTAVKGRYIIGVYALR 1281

Query: 2003 GIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDR 2062
             I  GEE+TFDYNSVTESKEEY  S+CLCGSQ CRGSYLNL   GA + V+KE HGLLDR
Sbjct: 1282 PIQNGEELTFDYNSVTESKEEYNNSICLCGSQCCRGSYLNLANAGASQDVIKERHGLLDR 1341

Query: 2063 HQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEI 2122
            H L+LEAC    V+  +  E+ +AG+GSCLL GLP+W++ Y+ARLV F+NLER  LP+E+
Sbjct: 1342 HVLLLEACCEGPVTRLELEEMRQAGVGSCLLDGLPDWLLKYTARLVEFMNLERQLLPDEL 1401

Query: 2123 LRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKA 2182
            +R ++++KRK  +D+  E+ + DAE QAEGVYNQRLQN+A+TLDKVRYV+R +F DP++A
Sbjct: 1402 MR-SVKKKRKD-ADLSYELGRVDAENQAEGVYNQRLQNIAITLDKVRYVLRQLFTDPREA 1459

Query: 2183 PPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLK 2226
            PP +++L  +E VS LW  E S+V EL+ CM PH+  D L +LK
Sbjct: 1460 PPLLKKLDQKELVSRLWSAENSIVNELLSCMMPHIPADRLAELK 1503



 Score =  162 bits (410), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 118/410 (28%), Positives = 178/410 (43%), Gaps = 68/410 (16%)

Query: 1041 LQLGEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSAST 1100
            L  G+W+Y DG G ERGP SF+ELQ +V +  +   TS +RK D +WVPL          
Sbjct: 556  LGRGDWFYKDGGGRERGPYSFAELQAMVGRELLIPGTSAYRKSDDLWVPLP--------- 606

Query: 1101 VRNHGEKIMPSGDSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMK 1160
                     P  D      T+   +   +    V       +    + +T G+LHE VMK
Sbjct: 607  --------RPEMDDGNFNVTEVTTSSF-DGARRVRKVVTTNIQQTIMAFTSGQLHEHVMK 657

Query: 1161 SYKNREFAAAINEVLDP-------------------------WINAKQ----PKKETEHV 1191
             YKN+  +A + E LD                          WI   +    P    +  
Sbjct: 658  HYKNQVMSAILFEGLDARAKLIESNRRLSTCSTVALLSGEDSWIGYGRSHPSPSSSNDTS 717

Query: 1192 YRKSEGDTRAGKRARLLVRES----------------DGDEETEEELQTIQDESTFEDLC 1235
              + +GD       R L R +                D  E +E+EL T +       + 
Sbjct: 718  DEREDGDQDHRPNRRPLFRSNGLSQEQTSRKRRLVYNDDVESSEDELPTGRRTRQRRLIR 777

Query: 1236 GDASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRFYK 1295
             D     +E    A ++  W  L    L  ++H L+ D+KSLA  S+TC+ WRAAV  +K
Sbjct: 778  NDVFDSSDEHLYEAGQNNSWETLGQLMLMRIYHHLKGDLKSLALISMTCKSWRAAVEKFK 837

Query: 1296 GISRQVDLSSVGPNCTDSLIRKTLNA-FDKEKLNSILLVGCTNITSGMLEEILQSFPHLS 1354
               + +D +S+G +CTD+++       +    L  ILL  CT ++   L + L++ P + 
Sbjct: 838  PKVKCLDFTSIGVHCTDAVLSSVQQQNYGGGNLKQILLKDCTVLSPDALGKFLEACPTIQ 897

Query: 1355 SIDIRGCGQFGELALKFPNINWV--KSQKSRGAK--FNDSRSKIRSLKQI 1400
             +DI GC QFG+L+  FP +NWV   S+ S  A    +DS  K++SL  I
Sbjct: 898  DVDINGCDQFGDLSHSFPQVNWVYDDSEVSDSATQGSDDSHRKMKSLNSI 947



 Score = 42.7 bits (99), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 39/129 (30%), Positives = 54/129 (41%), Gaps = 27/129 (20%)

Query: 643 WFYLDHCGMECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSI 702
           W Y++  G   GP  L  LK L     +  DH +    +N W T+E+A S       P  
Sbjct: 361 WVYINKNGDTQGPMELAALKLLASREFMPDDHLVMRCGTNAWITLEHAQS-------PDN 413

Query: 703 TSDSVTQLVSPPEASGNLLADTGDTAQST---GEEFPVTLQSQCCPDGSAAAAESSEDLH 759
            S ++ +LV+P  A+ N  A     A+ST    E+F                 ES E L 
Sbjct: 414 GSSALQRLVNPVAATRN--AKDATLAESTIDGNEQF---------------QNESFEFLD 456

Query: 760 IDVRVGALL 768
           ID RV  +L
Sbjct: 457 IDGRVERIL 465


>gi|212723442|ref|NP_001132870.1| hypothetical protein [Zea mays]
 gi|194695622|gb|ACF81895.1| unknown [Zea mays]
 gi|413916953|gb|AFW56885.1| hypothetical protein ZEAMMB73_718091 [Zea mays]
          Length = 302

 Score =  456 bits (1174), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 211/301 (70%), Positives = 246/301 (81%), Gaps = 1/301 (0%)

Query: 2139 LEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVSFL 2198
            ++ EK+DAEVQAEGV N RLQ +  TLDKVRYVMRC+FGDPK APPP+ RLS +  VS +
Sbjct: 1    MDSEKNDAEVQAEGVLNSRLQQIVHTLDKVRYVMRCIFGDPKNAPPPLVRLSGKSLVSAI 60

Query: 2199 WKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNL 2258
            WKG+ S+V ELIQ M PHVEE+VL+DLK+KI+AHDPS SEDI+  +R SLLWLRDE+R L
Sbjct: 61   WKGDSSIVAELIQSMEPHVEEEVLSDLKAKIRAHDPSESEDIEGGIRNSLLWLRDELRTL 120

Query: 2259 PCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKLGADLQVY 2318
             CTYKCRHDAAADLIH+YAYTKCFFRV++YK   SPPV+ISPLDLGPKYADKLG   Q Y
Sbjct: 121  SCTYKCRHDAAADLIHLYAYTKCFFRVRDYKTVKSPPVHISPLDLGPKYADKLGPGFQEY 180

Query: 2319 RKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPK 2378
             KTY ENYCL QLI+W+ Q N++P+  L RA +GC+SLPD+ SFY K  KPS+ R YG +
Sbjct: 181  CKTYPENYCLAQLIYWYSQ-NSEPESRLTRARKGCMSLPDVSSFYVKSAKPSQERAYGNR 239

Query: 2379 TVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSLTGCPLDREMVHWLKHRPAI 2438
            TVRFMLSRMEKQ QRPWPKDRIW FKS PR FGSPM+D+ L   PLD+EMVHWLK RP +
Sbjct: 240  TVRFMLSRMEKQAQRPWPKDRIWVFKSDPRFFGSPMMDTVLNNSPLDKEMVHWLKTRPNV 299

Query: 2439 F 2439
            F
Sbjct: 300  F 300


>gi|237506940|gb|ACQ99221.1| hypothetical protein [Tragopogon dubius]
          Length = 199

 Score =  367 bits (942), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 168/199 (84%), Positives = 189/199 (94%)

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
            LVVVDAMHKANYASRICHSCRPNCEAKVTAVDG YQIGIY+VR I YGEE+TFDYNSVTE
Sbjct: 1    LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGQYQIGIYSVRPIVYGEEVTFDYNSVTE 60

Query: 2020 SKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEED 2079
            SKEEYEASVCLCGSQVCRGS+LNLTGEGAF+KVLKE HG+L+RHQLMLEACELN VSEED
Sbjct: 61   SKEEYEASVCLCGSQVCRGSFLNLTGEGAFQKVLKECHGILNRHQLMLEACELNCVSEED 120

Query: 2080 YLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICL 2139
            Y+ELG+AGLGSCLL GLP+W+VAY+ARLVRFI+ ERTKLP+ IL HNLEEKRKYF+DIC+
Sbjct: 121  YIELGKAGLGSCLLSGLPDWLVAYAARLVRFIHFERTKLPQVILTHNLEEKRKYFTDICM 180

Query: 2140 EVEKSDAEVQAEGVYNQRL 2158
            + EK++AE+QAEGV+NQR+
Sbjct: 181  DTEKNEAEIQAEGVFNQRI 199


>gi|237506942|gb|ACQ99222.1| hypothetical protein [Tragopogon pratensis]
          Length = 199

 Score =  365 bits (936), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 167/199 (83%), Positives = 188/199 (94%)

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
            LVVVDAMHKANYASRICHSCRPNCEAKVTAVDG YQIGIY+VR I YGEE+TFDYNSVTE
Sbjct: 1    LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGQYQIGIYSVRPIVYGEEVTFDYNSVTE 60

Query: 2020 SKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEED 2079
            SKEEYEASVCLCGSQVCRGS+LNLTGEGAF+KVLKE HG+L+RHQLMLEACELN VSEED
Sbjct: 61   SKEEYEASVCLCGSQVCRGSFLNLTGEGAFQKVLKECHGILNRHQLMLEACELNCVSEED 120

Query: 2080 YLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICL 2139
            Y+EL +AGLGSCLL GLP+W+VAY+ARLVRFI+ ERTKLP+ IL HNLEEKRKYF+DIC+
Sbjct: 121  YIELSKAGLGSCLLSGLPDWLVAYAARLVRFIHFERTKLPQVILTHNLEEKRKYFTDICM 180

Query: 2140 EVEKSDAEVQAEGVYNQRL 2158
            + EK++AE+QAEGV+NQR+
Sbjct: 181  DTEKNEAEIQAEGVFNQRI 199


>gi|308809269|ref|XP_003081944.1| SET domain-containing protein (ISS) [Ostreococcus tauri]
 gi|116060411|emb|CAL55747.1| SET domain-containing protein (ISS) [Ostreococcus tauri]
          Length = 1744

 Score =  323 bits (828), Expect = 8e-85,   Method: Compositional matrix adjust.
 Identities = 237/766 (30%), Positives = 377/766 (49%), Gaps = 70/766 (9%)

Query: 1724 KYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEELDME----LPEVKDYKPR 1779
            ++ VI++YV   DE + + +  V+L +++     +  + +   D       P +     R
Sbjct: 987  QFTVIEEYVDRKDELECQIERSVTLAKNHPYVKGSTTSTANVDDGHPPGWWPSLGVKAER 1046

Query: 1780 KQLGDQVFEQEVYGIDPYT----HNLLLDSMPD--ELDWNLLEKHLFIEDVLLRTLNKQV 1833
            K+L  +V EQE YG D  T       L   +PD  E D   L KHL      L  +N+  
Sbjct: 1047 KKLVTEVIEQETYGCDFVTGRDATTTLQKVLPDFSEDDVWALYKHL------LSQVNESY 1100

Query: 1834 RHFT--GTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILK-AMDSRPDDK-YVAY 1889
               T        +    + + E+ E + V   D++++   + + K A +SR   + Y  +
Sbjct: 1101 GAMTPDTLATQSLALAAEDLAEKFENQGVRMNDMKSIAFSKALWKLASESRVSPEFYAVH 1160

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNN---EDPAPEFYNI 1946
            RKG GVVC +    GE  F+++FLGE+YP W W EKQD IR +QKN    +   PEFYN+
Sbjct: 1161 RKGFGVVCKEPIKKGE--FLIDFLGEIYPPWAWAEKQDAIRLVQKNRGLRDKGPPEFYNM 1218

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
             +ERP GD +GY ++  DAMH+ NYA R+ H+C PN E  + A++G Y+I   T R I  
Sbjct: 1219 QIERPGGDEEGYSVLFCDAMHENNYAGRLSHTCDPNVEVNLKAINGKYEIHFITNRDIEP 1278

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLM 2066
            GEE+ ++Y+S T++ +E EA+ CLCG+++CRGSYLN  GE    +VL   H L+DR  + 
Sbjct: 1279 GEELAYNYHSCTDNMKEVEAAFCLCGARMCRGSYLNFVGEDNNSQVLNTKHKLIDRQVIA 1338

Query: 2067 LEACELNS--VSEEDYLELGRAGL--GSCLLGGLPNWVVAYSARLVRFINLERTKLPEEI 2122
             +A +  +  ++ +    L + G   G  LL   P+W++ +   L  ++  E  +LP+ I
Sbjct: 1339 FKAIDRAAEPLNPKQVRCLEQVGFYPGKGLLKCCPSWLLHFVGDLAIYMEEEVNQLPKHI 1398

Query: 2123 LRHNLEEKRKYF---SDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVM-RCVFGD 2178
            L    +E  K       +     +  A++ A  V   R Q++A+ L K+R V+ R   G 
Sbjct: 1399 LAAAKQEHEKLLLKSPGMEFTYNEKFAKIDALAVRENRTQSIAIMLSKLRRVLTRARDGG 1458

Query: 2179 PKK-----------APPPVERLSPEETVSFLWKGEG------SLVEELIQCMAPH---VE 2218
             +K           APPP  RL+ +E     W G G      S++  L+  M PH    +
Sbjct: 1459 AQKSVYECLDKFESAPPPFVRLTDDEVAVQFW-GTGTDGFDRSVIRGLLNAMGPHERKRD 1517

Query: 2219 EDVLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAY 2278
             D      SK++A      +  + +LR+SLLWLRDE+  LP     RHD AA L+H YA 
Sbjct: 1518 ADEFVKWTSKVEAIADKARKG-KMDLRESLLWLRDELIKLPRDKCARHDLAAALVHFYAM 1576

Query: 2279 TKCFFR---VQEYKAFTSPPVYISPLDLGP------KYADKLGADLQVYRKTYGENYCLG 2329
            T+ F++     E+  +TS  V +   ++           DK+ A ++   KTY   Y   
Sbjct: 1577 TEQFWQPSPAPEHMGYTSDKVAVREDEVNAWGVGAGGGGDKIVARVE---KTYRPGYSGA 1633

Query: 2330 QLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEK 2389
             ++ WH Q  ADP   L    +G L++PDI   Y+        R  G       +  ++ 
Sbjct: 1634 TMLQWHKQEVADPTQHLVANRKGNLTMPDIACCYSSRPNQPLARA-GSLEHETWIGHLQN 1692

Query: 2390 QPQRPWPK-DRIWAFKSSPRIFGSPMLDSSLTGC-PLDREMVHWLK 2433
             P+  WP     W   +  ++ GSP++D+ + G   +  +++ W+K
Sbjct: 1693 WPEETWPNLSGPWGIGNPQKLIGSPIIDAWMQGKRSIPVKVLAWIK 1738


>gi|145351886|ref|XP_001420292.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580526|gb|ABO98585.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1361

 Score =  313 bits (801), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 235/771 (30%), Positives = 374/771 (48%), Gaps = 84/771 (10%)

Query: 1724 KYEVIDQYVIVADEEDVRRKMRVSL----PEDYAEKLNAQKNGSEELDMELPEVKDYKPR 1779
            ++ VI +YV   DE++ + +  V+L    P D   K+              P +     R
Sbjct: 610  QFTVITEYVDRKDEKECQIERSVTLAANHPFDKKSKMKTAHVDDGTPPGWWPSLGVKAER 669

Query: 1780 KQLGDQVFEQEVYGIDPYT--------HNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNK 1831
            K+L  +V EQE YG+D  T          +L D   D++ W L ++       LL  +N+
Sbjct: 670  KKLVTEVIEQETYGVDFVTGRDATETLKRVLPDYSEDDV-WGLYKQ-------LLAQVNE 721

Query: 1832 QVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPD--DKYVAY 1889
               +   T +T     L    E++  +     D +++   + + K   +  +  + Y  +
Sbjct: 722  S--YGAMTPDTLATQNLAVAAEDLAVKLERKADAKSLAFSKALWKLAAAAIETPEYYYVH 779

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNN---EDPAPEFYNI 1946
            RKG GVVCN+    GE  F+++FLGE+YP W W EKQD IR +QK     +   PEFYN+
Sbjct: 780  RKGFGVVCNQPIKKGE--FLIDFLGEIYPPWAWAEKQDAIRQVQKARGLRDRGPPEFYNM 837

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
             +ERP GDA+GY ++  DAMH+ NYA R+ H+C PN E  + A++G Y+I   T R I  
Sbjct: 838  QIERPGGDAEGYSVLFCDAMHENNYAGRLSHTCDPNVEVNLKAINGKYEIHFITTRDIAP 897

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLM 2066
            GEE+ ++Y+S T++ +E E + CLCG+++CRGSYLN  GE    +VL+  H L+DR  + 
Sbjct: 898  GEELAYNYHSCTDNMKEVEMAFCLCGARMCRGSYLNFVGEDHHSQVLESKHKLIDRQVMS 957

Query: 2067 LEACE--LNSVSEEDYLELGRAGL--GSCLLGGLPNWVVAYSARLVRFINLERTKLPEEI 2122
             +A +   + ++ +    L   G   G  LL   P W++ +   +  +++ E  +LP+ I
Sbjct: 958  FKAIDKAADPLTSKQERSLAAVGFYPGKGLLRNCPGWLLQFVGDVAVYMDTELNELPKHI 1017

Query: 2123 LRHNLEEKRKYFS---DICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVM------- 2172
            L    +E  K             +  A++ A  +   R Q +A+ L K+R ++       
Sbjct: 1018 LAAAKKEHAKLLEKNPQAEFSYTEKFAKIDALAMRENRTQCVAIMLSKLRRLLTRARDDG 1077

Query: 2173 ------RCVFGDPKKAPPPVERLSPEETVSFLWKG-----EGSLVEELIQCMAPHVEEDV 2221
                   C+    K APP V  L+  E  +  W       E S+V  LI+ M PH  +  
Sbjct: 1078 PQKSVYECMDVFEKSAPPYVT-LTEAEIAAHFWGSGPENFEKSIVCGLIRAMGPHERK-- 1134

Query: 2222 LNDLKSKIQAHDPSGSEDIQRELRK-------SLLWLRDEVRNLPCTYKCRHDAAADLIH 2274
             ND    I+    S  E +  E+RK       SLLWLRDE++ L  T   RHD AA LIH
Sbjct: 1135 -NDADKFIKW--TSMVESVAVEVRKGKMTRKESLLWLRDELKKLKQTDGARHDLAAGLIH 1191

Query: 2275 IYAYTKCFFR---VQEYKAFTSPPVYISPLDLGP------KYADKLGADLQVYRKTYGEN 2325
            +YA T  F++     E++ + S  V +   ++           DK+ A ++   KTY   
Sbjct: 1192 LYAETNRFWQPSSAPEHQVYKSDKVAVREDEVNAWGVGAGGGGDKIVARVE---KTYRPG 1248

Query: 2326 YCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGP-KTVRFML 2384
            +    ++ WH Q  ADP   +    RG LS+PDI   Y+   +P +       +     L
Sbjct: 1249 FSAATMLQWHKQEMADPTQYITANRRGNLSMPDIACCYS--SRPGQPLARSSDREHETWL 1306

Query: 2385 SRMEKQPQRPWPKDR-IWAFKSSPRIFGSPMLDSSLTGC-PLDREMVHWLK 2433
            + ++  P+ PWP+    W   +S ++ GSP+LD+ + G   +  + + WLK
Sbjct: 1307 AHLQSWPEEPWPQSSGPWGVANSQKLIGSPVLDAWMKGQRSIPAKCLAWLK 1357


>gi|308805538|ref|XP_003080081.1| SET domain-containing protein (ISS) [Ostreococcus tauri]
 gi|116058540|emb|CAL53729.1| SET domain-containing protein (ISS) [Ostreococcus tauri]
          Length = 844

 Score =  306 bits (784), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 189/574 (32%), Positives = 292/574 (50%), Gaps = 60/574 (10%)

Query: 1882 PDDKYVAYR---KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNED 1938
            P +  V +R   KG GVVC    G      V  ++GE+YP W+W+E+QD I+    N   
Sbjct: 271  PKEHRVQFRIHPKGTGVVCINPNGLKAGTLVNYYIGEMYPPWQWYERQDAIKKSFPNMN- 329

Query: 1939 PAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
              P F+NI LERP  D  G  ++ V+AMHK ++ASR+ HSC PNC+      DG   +G+
Sbjct: 330  -LPSFFNITLERPAHDERGRHVIFVEAMHKGSFASRLSHSCEPNCQTVTFTKDGKLTLGM 388

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHG 2058
            +TVR I YGEE+T+DY+ +TES EEY    CLC S  CRGS+L   G GAF  ++ + H 
Sbjct: 389  FTVRDIAYGEEMTWDYSCITESAEEYRTGFCLCSSPGCRGSFLTYAGNGAFTAIVNKKHS 448

Query: 2059 LLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKL 2118
             L R+ ++  A     +++ +   L  AG+  C L   P+WVV ++A  +++I LE  +L
Sbjct: 449  FLHRNAILFVA-STTPLTKAESESLYVAGIRQCALEKCPDWVVKWAALTLQYIKLEEKEL 507

Query: 2119 PEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGD 2178
            P+ +++  + E  +Y        ++  A+ +A GV + R+ NL VTLDK+RYV+      
Sbjct: 508  PDVLMKLPVTEYGRY--------DEIGAKYEAAGVASTRITNLVVTLDKIRYVL----NR 555

Query: 2179 PKKAPPPVER-LSPEETVSFLWKGEGSLVEELIQCM------------------APHVEE 2219
            P +      R LS +E +  LW GE S+    I  M                  A   E+
Sbjct: 556  PGQRRDSFFRALSDDEVIDHLWSGEASVFRRFIITMVNSGGDKRNEARSASMSTAAMFEK 615

Query: 2220 D-----VLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIH 2274
                  V + LK+  ++ +     +   + R  LL +R  + +     K  H  A DL+ 
Sbjct: 616  TWTDTRVASALKAIKKSVNVVERPETAEQARARLLQVRAALEH--AGDKAFHAQARDLLW 673

Query: 2275 IYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKLGADL---------QVYRKTYGEN 2325
            ++A T  +F ++++    SPPV I   D+  + + ++   L         ++ +K YG  
Sbjct: 674  LHANTLHYFTIEKFDLVLSPPVNID--DMKSQISCEMRTKLPNAVKGNRDKLLQKKYGPL 731

Query: 2326 YCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFMLS 2385
            Y  GQL+ W+ QT   PD +L+   RG LSLPD  S Y+ V  P++   Y     R ++ 
Sbjct: 732  YVWGQLVTWYKQTVYAPDASLSADRRGSLSLPDPESCYSAV--PTK---YTSSERRSLIK 786

Query: 2386 RMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSL 2419
             M       WP    W+FK+  +++GSPM D +L
Sbjct: 787  LMRSNIHAMWPTTMSWSFKNPTKVYGSPMFDEAL 820


>gi|110741769|dbj|BAE98829.1| hypothetical protein [Arabidopsis thaliana]
          Length = 517

 Score =  301 bits (770), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 231/597 (38%), Positives = 307/597 (51%), Gaps = 151/597 (25%)

Query: 1   MGDGGVACMPLQQQQQHNSIMERFPISDKTTICVGNSSNNSNKTNNNSISNNNDNKTNND 60
           M DGGVACMPL       +IME+ PI +KTT+C GN S                 KT   
Sbjct: 1   MSDGGVACMPLL------NIMEKLPIVEKTTLCGGNES-----------------KTAAT 37

Query: 61  SSNNNGSSSSKNNETNKSNVKKNGVSTKTVRKKIVK-IKKVIAVKKKEVQKNSGSS---- 115
           + N + S ++K  E+  +N K +  S    +K+IVK I+KV+  + K+ QK +       
Sbjct: 38  TENGHTSIATKVPESQPAN-KPSASSQPVKKKRIVKVIRKVVKRRPKQPQKQADEQLKDQ 96

Query: 116 ---------------------KSNNNGENIDNKNVENGGAVGEVVTVDKENLKNEEVEEG 154
                                KS   G     K VENGG  G            +EVEEG
Sbjct: 97  PPSQVVQLPAESQLQIKEQDKKSEFKGGTSGVKEVENGGDSG----------FKDEVEEG 146

Query: 155 ELGTLKW----ENGEFVQPEKSQPQSQLQSQSKQIEKGEIIV------------------ 192
           ELGTLK     ENGE + P KS        Q  +IEKGEI+                   
Sbjct: 147 ELGTLKLHEDLENGE-ISPVKSL-------QKSEIEKGEIVGESWKKDEPTKGEFSHLKY 198

Query: 193 ---------FSS-KCRRGETEKGESGLWRGNKDDIEKGEFIPDRWHK-EVVKDEYGYSKS 241
                    FS+ K  +G  E+ E   WR   D+IEKGEFIPDRW K +  KD++ Y +S
Sbjct: 199 HKGYVERRDFSADKNWKGGKEEREFRSWRDPSDEIEKGEFIPDRWQKMDTGKDDHSYIRS 258

Query: 242 RR----------YDYKLERTPPSGKYSGEDVYRRKEFDRSGSQHSKSSSRWESGQERNVR 291
           RR          Y+Y+ ERTPP G++  ED+Y ++EF               SG +R  R
Sbjct: 259 RRNGVDREKTWKYEYEYERTPPGGRFVNEDIYHQREF--------------RSGLDRTTR 304

Query: 292 ISSKIVDDEGLYKGEHNNGKNHGREYFH-GNRFKRHGTDSDSGDRKY-YGDYGDFAGLKS 349
           ISSKIV +E L+K E+NN  N  +EY   GNR KRHG + DS +RK+ Y DYGD+   K 
Sbjct: 305 ISSKIVIEENLHKNEYNNSSNFVKEYSSTGNRLKRHGAEPDSIERKHSYADYGDYGSSKC 364

Query: 350 RRLSDDYNSRSVHSEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRVIYDRHGR 409
           R+LSDD  SRS+HS+HYS+HS E+ +R+S  S+ SSL+KY  +H + S  ++   D+HG 
Sbjct: 365 RKLSDDC-SRSLHSDHYSQHSAERLYRDSYPSKNSSLEKYPRKHQDASFPAKAFSDKHGH 423

Query: 410 SPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPYA 469
           SPS SD SPHDR RY+++R       DRSPY R+RSPY F++S ++R+RSP +R      
Sbjct: 424 SPSRSDWSPHDRSRYHENR-------DRSPYARERSPYIFEKSSHARKRSPRDRRHH--- 473

Query: 470 REKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNNHR 526
                           RSP  +E SP DR+R  DR D  PN++E +   R+R N HR
Sbjct: 474 -------------DYRRSPSYSEWSPHDRSRPSDRRDYIPNFMEDTQSDRNRRNGHR 517


>gi|145347753|ref|XP_001418326.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578555|gb|ABO96619.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 931

 Score =  288 bits (736), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 181/556 (32%), Positives = 282/556 (50%), Gaps = 36/556 (6%)

Query: 1874 ILKAMDSRPD-DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSL 1932
            I+++ D R   +++  + KG GVVC    G     FV  ++GE+Y  WKW+E+QD I+  
Sbjct: 376  IVESHDPRARRERFRIHPKGTGVVCINPNGLKAGTFVNHYIGEIYSPWKWYERQDAIKKC 435

Query: 1933 QKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDG 1992
                E   P F+NI LERP  D  G  +  V+AMHK  +ASR+ HSC PNC+    A  G
Sbjct: 436  YPGME--LPSFFNITLERPPHDDRGRHVSFVEAMHKGCFASRLSHSCEPNCQTVTFAKGG 493

Query: 1993 HYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKV 2052
               +G++T + I YGEE+T+DY+ +TES EEY    CLC S  CRGS+L  +G GAF  V
Sbjct: 494  KLTLGMFTTQDIAYGEEMTWDYSCITESAEEYRTGFCLCSSPTCRGSFLTYSGSGAFTAV 553

Query: 2053 LKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFIN 2112
            + + H  L R+ ++ +A   ++++  D   L  +G+  C L   P+WVV ++A  + +I 
Sbjct: 554  VNKKHAFLHRNAILFKA-STSALTNVDRKMLHDSGIRECALEHCPDWVVKWAALTLEYIK 612

Query: 2113 LERTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVM 2172
            LE  +LP E++        +Y         +  A+ +A GV   R+ NL VTLDK+RYV+
Sbjct: 613  LEENELPNELMSLPATRFGRY--------NELGAKSEATGVAATRITNLIVTLDKIRYVL 664

Query: 2173 RCVFGDPKKAPPPVERLSPEETVSFLWKGEGSLVEELIQ-CMAPHVEEDVLNDL-KSKIQ 2230
                   ++  P    L+  E +  LW G+ S++  +++  +A    +   N + KS++ 
Sbjct: 665  T---RSGQERAPFFRVLTESEVIEHLWSGDESILRRILRSILAGAGAKKGSNSVGKSRLV 721

Query: 2231 AHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKA 2290
                  + D + +     +  R ++   P T       A+DL+  YA T+ +F   +   
Sbjct: 722  MAKMPKTGDARVDAAMKAIQERIDIDERPKT-------ASDLLWFYANTRNWFTHAKLDN 774

Query: 2291 FTSPPVYISPL-DLGPKYADK------LGADLQVYRKTYGENYCLGQLIFWHIQTNADPD 2343
              SP V I  +  + P    K       G    + +K YG  Y  GQL+ W+ QT   PD
Sbjct: 775  VISPAVNIDDVASVIPTTQRKHIPNAFRGNREAMLQKRYGALYVWGQLVTWYKQTIYSPD 834

Query: 2344 CTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAF 2403
             +L+   RG LSLPD  S  +    PS    Y  K  + +   + K   + WP    W+F
Sbjct: 835  SSLSADRRGTLSLPDPESCCSAA--PS---AYVNKERKELFKALRKNKHQSWPTATSWSF 889

Query: 2404 KSSPRIFGSPMLDSSL 2419
            K+  +++GSPM D +L
Sbjct: 890  KNPAKVYGSPMFDDAL 905


>gi|145494033|ref|XP_001433011.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124400127|emb|CAK65614.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1065

 Score =  284 bits (727), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 203/716 (28%), Positives = 350/716 (48%), Gaps = 76/716 (10%)

Query: 1733 IVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEE----LDMELPEVKDYKPRKQLGDQVFE 1788
            I  D E   RK RV   E           G+E+    + +++ + +  K +++    V E
Sbjct: 281  ITWDSECPNRKNRVECLE---------HEGTEQPCMNMSIQMKQHQVNKMKQEENADVEE 331

Query: 1789 QEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNKQVRHFTGTGNTPMMYPL 1848
               +GID YT  ++++ +P  L+++  +K+ F+E +LL  +N+         +    Y +
Sbjct: 332  TPCWGIDAYTRKVIINILP--LNYDDAQKNKFLEKLLL-AINR-------PSDKENAYDM 381

Query: 1849 QPVIEEIEKEA---VDDCDVRTMKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGE 1905
                + I KE+       +    KM + I K +     + +  + KG G+VC  + G   
Sbjct: 382  SLACDYIIKESKLMSSHYNKEDRKMAKQIQKVL-KYDTEGFRIHTKGFGLVCVNKQGIKN 440

Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQK--NNEDPAPEFYNIYLERPKGDADGYDLVVV 1963
            +  ++ +LGE+Y  W+W+EKQD I+   K  N +D  P+FYNI L+  + D  G D + V
Sbjct: 441  NSLIIPYLGEIYQPWRWYEKQDFIKKQMKEHNQKDILPDFYNIMLDIHRDDIKGIDFLFV 500

Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
            D ++K NY+SR+ HSC PNC    T  +G Y IG+Y +R I YGEE+TFDY S TESK+E
Sbjct: 501  DPINKGNYSSRLSHSCNPNCGTVTTVSNGTYVIGMYAMREIQYGEELTFDYCSFTESKQE 560

Query: 2024 YEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELN-SVSEEDYLE 2082
               ++CLCGS+ C+  YL L+    +  +L + H  L R+ ++L++C  N   S ED   
Sbjct: 561  QLQALCLCGSEKCKIYYLQLSNCKEYNGILDKEHCFLTRNAILLKSCSDNVDKSNEDSEL 620

Query: 2083 LGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICLEVE 2142
              +  +GS +L   P W+  +   +++FI+               +E++ Y S++ L+ E
Sbjct: 621  YSKYRIGSSVLNDCPLWLKNWVGYILKFID---------------QERQTYKSELNLKYE 665

Query: 2143 KSDAEVQAEGVYNQ------RLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVS 2196
            ++ AEV+    +        R+QNL  TLDK+++ +     +   + PP+  +   + + 
Sbjct: 666  QT-AEVEQWNHFTATQHSEDRIQNLIFTLDKIKFFL----NNSDSSEPPISIIGDSDLLD 720

Query: 2197 FLWK-------GEGSLVEELIQCMAPHVEEDVLNDL----KSKIQAHD-PSGSEDIQRE- 2243
              WK        E S   EL Q    H ++ ++  +    K K Q HD      +I R+ 
Sbjct: 721  SFWKDYSSGTSSECSFFNELYQLFQKHNQKKMIELIHVIYKKKQQLHDYKENIHNIHRQE 780

Query: 2244 -LRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLD 2302
             L   +L+L      +   Y   H+A + ++++ AYT  +F+  EY  F+SPP+     D
Sbjct: 781  LLITRMLFLTLSHMLMQQQYTFHHEALSLILYMMAYTYTYFKPYEYTGFSSPPI----ED 836

Query: 2303 LGPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSF 2362
            L  +         +   + Y   +  GQLI W  QT A P  +L++  RG LS P I SF
Sbjct: 837  LEWRKVGAFKKKCKSEGRAYSSQFVWGQLIGWFKQTVAAPQASLSQDRRGTLSYPAISSF 896

Query: 2363 YAKVQKPSRHRVYGPKTVR-FMLSRMEKQPQRPWPKDRI-WAFKSSPRIFGSPMLD 2416
                 K    +    ++ R + +  +  +P   WP +   W++K++ +I+G+ + +
Sbjct: 897  DKAGDKAYPFQQSKAQSSRSYFIQHLLDKPSYMWPPETASWSYKNTYKIYGTILFE 952


>gi|403375645|gb|EJY87798.1| hypothetical protein OXYTRI_23635 [Oxytricha trifallax]
          Length = 1691

 Score =  275 bits (704), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 178/607 (29%), Positives = 308/607 (50%), Gaps = 53/607 (8%)

Query: 1781 QLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNKQVRHFTGTG 1840
            +LG  V E+  +GID  T   L+  +P   D  +  +  FIE  L+  + +Q     G  
Sbjct: 960  KLGIDVIEKISWGIDMGTAVNLMTLLPK--DMPMKAQSDFIEKRLVFAIQQQ-----GDQ 1012

Query: 1841 NTPMMYPLQPVIEEIEKEAVDDCDVRTMK-MCRGILKAMDSRPDDKYVAYRKGLGVVCNK 1899
               +   L+ +I + E     D D    + M +GI    D+  +  +  + KG+G+ C +
Sbjct: 1013 GYDVREALKFIINDRENPRFRDIDRELAQIMLQGITMVKDN-VERHFRVHSKGIGIFCKR 1071

Query: 1900 EGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNN--EDPAPEFYNIYLERPKGDADG 1957
              G    + ++E+ GE+Y  W W+EKQD ++  Q         P+FYNI  ER   D  G
Sbjct: 1072 NEGIKASNLIIEYFGEIYQPWNWYEKQDVLKQGQNKQTLSKDLPDFYNITFERHHDDPQG 1131

Query: 1958 YDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSV 2017
            YD+++VD +   NY+SR+ HSC PNC   +   D  Y IG++ ++ + +GEE+ F+Y S+
Sbjct: 1132 YDILMVDPILYGNYSSRLSHSCNPNCSTIIHVRDNQYSIGMFAIKDVSFGEELCFNYCSL 1191

Query: 2018 TESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSE 2077
            TES++EYE+++CLCG++VC+G YL L  +     ++K+ H  +DR+ L+ +AC+   ++E
Sbjct: 1192 TESEKEYESAICLCGTEVCQGKYLQLANDKKHMAIMKKYHTFVDRNYLLYKACKFPEITE 1251

Query: 2078 EDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDI 2137
            ED   L   G+   +L  +P+W+  +++ +  +I  E    P             +F +I
Sbjct: 1252 EDEKRLNDFGIKESVLKDVPDWLKKWASLICEYIIFEEDIYP------------SFFKEI 1299

Query: 2138 CLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRC--VFGDPKKAPPPVERLSPEETV 2195
                ++ D  ++A+   + ++ N+A+T+DKV +V++   VF       PP++ LS +E  
Sbjct: 1300 YPTFKEEDLRIEAKNQRDSKIWNIAITIDKVMHVLKSMGVF------EPPIKDLSYKERC 1353

Query: 2196 SFLWKGEGSLVEELIQCMAPHVEE--DVLNDLKSKIQA--------HDPSGS--EDIQRE 2243
              LW  + SL E LI  +  H+E   + L+ L+  I           D +G   ++  +E
Sbjct: 1354 IRLWDSKDSLRESLIDVLN-HIENYPEKLDALQKVISVPLVEAQFDEDENGRYYKEKYQE 1412

Query: 2244 LRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQE-YKAFTSPPVYISPLD 2302
            ++  +  +   +R +  + K    A  D + +YA T  +F   E YK        I   D
Sbjct: 1413 IQSVIALISGILRPIK-SIKVMIPALCDSLWLYANTHTYFTPNENYKKCKGDEQKIRKCD 1471

Query: 2303 LGPKYADK-----LGADLQVYR--KTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLS 2355
            +  +  ++     +  + QVYR  K Y  +Y  GQL+ W+ QT   P+ +L+   RG LS
Sbjct: 1472 VRIENQNQASLNPVEQEKQVYRGFKEYDPSYVWGQLVGWYKQTVDKPNASLSADRRGTLS 1531

Query: 2356 LPDIGSF 2362
            +PD+ SF
Sbjct: 1532 MPDLESF 1538


>gi|384247471|gb|EIE20958.1| hypothetical protein COCSUDRAFT_57497 [Coccomyxa subellipsoidea
            C-169]
          Length = 1198

 Score =  275 bits (703), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 175/520 (33%), Positives = 269/520 (51%), Gaps = 59/520 (11%)

Query: 1820 FIEDVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMD 1879
            ++E  L+  +NKQ     G     ++  LQ V E  +  A D C V         ++ + 
Sbjct: 456  WVERELMPAINKQ-----GASGWDILLALQDVKEHAQA-AGDMCSVEAANAVEKRVQKVG 509

Query: 1880 SRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDP 1939
            S   + +  + KG+G+ C +  G     FV E+LGEV+  W+WFE QD IR   K   D 
Sbjct: 510  S---NYFRIHPKGVGLKCCRSEGLPPLTFVEEYLGEVHTPWRWFEMQDIIR---KTMGDE 563

Query: 1940 APEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIY 1999
             P+FYNI LERP+ D  GYD++ +DA  K  YASR+ HSC PNC+A V A  G   I +Y
Sbjct: 564  LPDFYNIVLERPRDDPTGYDVMFIDAAAKGAYASRMSHSCTPNCQAVVMACGGRLTIAVY 623

Query: 2000 TVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGL 2059
            T+R ++ GEE+TFDY  VTES++E+  ++CLCG++ CRGS+L   G  AF +++ + H +
Sbjct: 624  TLRHVYPGEELTFDYACVTESEKEFRTAICLCGTRNCRGSFLTFAGSRAFMQIMTQRHSM 683

Query: 2060 LDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGG------LPNWVVAYSARLVRFINL 2113
            L R  L++ A     ++E D   L   GL  C LGG      +P W+  ++A ++ F+  
Sbjct: 684  LRRQALLVRAGA-EPLTERDRARLQEFGLRECALGGGGGQSRVPAWLEKWTALILEFVQE 742

Query: 2114 ERTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMR 2173
            E+  LPE++L           S+I      + A  +A+GV + RLQN+ +TLDKV+    
Sbjct: 743  EQRLLPEQLL--------ALPSNIVAYTPFTAAS-EAKGVSDNRLQNVVITLDKVKL--- 790

Query: 2174 CVFGDPKKAPPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHV-------------EED 2220
            C+    +    P+  L+  E   +LW G  SL    ++  A  +             +ED
Sbjct: 791  CLSKPGQCLNAPLRLLTDSEVAEYLWTGTKSLARRWLRTAASQLANPSVARALSSAEDED 850

Query: 2221 VLNDLKSKIQAH-----------DPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAA 2269
             +  + ++ + H            P+       E R  L+ L ++VR L       H AA
Sbjct: 851  DIQAVLARHRGHKHLEELAVLVLQPAAD---AAEGRARLMALAEKVRALDVACGGGHTAA 907

Query: 2270 ADLIHIYAYTKCFFRVQ-EYKAFTSPPVYISPLDLGPKYA 2308
            AD++ +YA ++ +F  + EYK FTSPPV+++  DL  K A
Sbjct: 908  ADMLLLYASSQTWFTSEREYKGFTSPPVHLNLDDLMLKRA 947



 Score = 73.2 bits (178), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 46/134 (34%), Positives = 69/134 (51%), Gaps = 11/134 (8%)

Query: 2309 DKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQK 2368
            DK G       K YG  +  GQL  W+ QT  DP  +L+   RG +S+PDI S Y     
Sbjct: 1055 DKHGRTQVGLAKKYGPGFVWGQLNGWYKQTVFDPTASLSAERRGTISMPDIESCYGG--- 1111

Query: 2369 PSRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDS----SLTGCPL 2424
             SR R Y  K    ++  +EK+P+  W    IW+F++  +++GSPM D+    ++ G P 
Sbjct: 1112 -SRSR-YTVKDRNHLIDHIEKRPEGMWKIGTIWSFRNEAKVYGSPMFDAVWAQTMPGAPP 1169

Query: 2425 D--REMVHWLKHRP 2436
            D   E++  L+  P
Sbjct: 1170 DPMPELLQKLRSAP 1183


>gi|357486421|ref|XP_003613498.1| Elongation factor 1-alpha [Medicago truncatula]
 gi|355514833|gb|AES96456.1| Elongation factor 1-alpha [Medicago truncatula]
          Length = 1488

 Score =  273 bits (698), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 168/347 (48%), Positives = 187/347 (53%), Gaps = 111/347 (31%)

Query: 1641 NRKSLDSGSETSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGL 1700
            N+KS+DS SETSDDLD SSE   +     ++ T  +      G  +    A  +    GL
Sbjct: 1058 NKKSIDSDSETSDDLDVSSEVKLAIVMMILTQTKIE-----PGNQKVMDTA--YLPYNGL 1110

Query: 1701 DF-SDDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQ 1759
            DF SD+ EWG  MTKASLV PVTRKY+VIDQYVIVA                        
Sbjct: 1111 DFISDECEWGHCMTKASLVSPVTRKYDVIDQYVIVA------------------------ 1146

Query: 1760 KNGSEELDMELPEVKDYKPRKQLGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHL 1819
                                    D+V EQEVYGIDP THNLLLDSMP ELDW+L EK  
Sbjct: 1147 ------------------------DEVIEQEVYGIDPSTHNLLLDSMPAELDWSLQEK-- 1180

Query: 1820 FIEDVLLRTLNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMD 1879
                       +   H+      P                        + MC+GILKAMD
Sbjct: 1181 ----------TQSFGHWICKLGVP-----------------------RISMCQGILKAMD 1207

Query: 1880 SRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDP 1939
             RPDDKYVAYRKGLGVVCNKE GF EDDFVVEFLGEV                    +D 
Sbjct: 1208 KRPDDKYVAYRKGLGVVCNKEEGFAEDDFVVEFLGEV--------------------KDS 1247

Query: 1940 APEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAK 1986
            APEFYNIYLERPKGDADGYDLVVVDA HKAN+ASRICHSCRPNCEA+
Sbjct: 1248 APEFYNIYLERPKGDADGYDLVVVDATHKANHASRICHSCRPNCEAE 1294



 Score =  182 bits (461), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 124/343 (36%), Positives = 163/343 (47%), Gaps = 129/343 (37%)

Query: 1041 LQLGEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSAST 1100
            L LG+WY+LDG G ERGPSSF +LQ  VDQ  I+K     ++F                +
Sbjct: 764  LHLGDWYFLDGLGRERGPSSFLDLQSSVDQCIIKK-----KQF----------------S 802

Query: 1101 VRNHGEKIMPSGDSSGLPPTQSQDAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMK 1160
            V N  + + P                                  Q +GYTRGK+HELV+K
Sbjct: 803  VANFLDSLYP----------------------------------QVVGYTRGKVHELVIK 828

Query: 1161 SYKNREFAAAINEVLDPWINAKQPKKE-TEHVYRKSE----------------------- 1196
            SYK+REFAA INEVL PWINA+QPKKE  + +Y  ++                       
Sbjct: 829  SYKSREFAAVINEVLYPWINARQPKKEFKKQIYIGNQVTIYFLKFTTIPVFVSNSYEEMI 888

Query: 1197 ---------GDTRAGKRARLLVRESDGDEETEEELQTIQ-DESTFEDLCGDASFPGEESA 1246
                     GDT A KRAR+LV +SD +   E+    I+ +EST E L GD +F  EES 
Sbjct: 889  CALSSLTLKGDTHASKRARVLVDDSDEEGGFEDCSFIIENNESTVEALSGDVTFSREESG 948

Query: 1247 SSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRFYKGISRQVDLSSV 1306
             +  + G WGLLDG  LA +FHFLRSD+KSL F                           
Sbjct: 949  ITVSKEGRWGLLDGRMLARIFHFLRSDLKSLVF--------------------------- 981

Query: 1307 GPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGMLEEILQS 1349
                         NA++K+K+ S++L+GCTNIT+ +LE+ L S
Sbjct: 982  -------------NAYEKDKIKSMILMGCTNITADILEKFLVS 1011



 Score = 60.8 bits (146), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 31/58 (53%), Positives = 38/58 (65%), Gaps = 12/58 (20%)

Query: 1481 MEEFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGLGSVKDDISRMCRDAIK 1538
            +E+FL S L+EIM+ N  +FFVPKV E            HGL SVK+ ISRMCRDA+K
Sbjct: 1005 LEKFLVSRLREIMKANACDFFVPKVPE------------HGLSSVKEGISRMCRDAMK 1050


>gi|118383419|ref|XP_001024864.1| SET domain containing protein [Tetrahymena thermophila]
 gi|89306631|gb|EAS04619.1| SET domain containing protein [Tetrahymena thermophila SB210]
          Length = 2631

 Score =  256 bits (654), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 170/581 (29%), Positives = 291/581 (50%), Gaps = 66/581 (11%)

Query: 1882 PDDKYV-AYR---KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNE 1937
            P++ Y  A+R   KG+G++C    G  +++F+ E++GE++P W+WFEKQD I+   K N 
Sbjct: 1050 PENLYSEAFRIHTKGMGLICINPKGIEQNEFITEYIGEIFPPWRWFEKQDTIKKYMKENN 1109

Query: 1938 --DPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVD-GHY 1994
              D  P+F+NI LE  K D  GYD++ VD + K N++SR+ HSC PNC    T  + G+Y
Sbjct: 1110 KRDILPDFWNIMLEIHKDDPKGYDILFVDPILKGNFSSRLSHSCEPNCGTVPTITNTGNY 1169

Query: 1995 QIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLK 2054
             I ++ +  I YGEE++FDY +VTES +E++ ++CLCGS  CRG YL L+         K
Sbjct: 1170 VIAMFAMHPIEYGEELSFDYMAVTESIQEHKRAICLCGSSKCRGRYLELSN--------K 1221

Query: 2055 ELHGLLDRHQLMLEAC--ELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFIN 2112
            ++H  L+R   +  AC  ELN   EED   L +    S +    P W+  ++A ++R IN
Sbjct: 1222 KIHCFLNRTYTLYIACTEELN---EEDENILEQYSFRSNIRENSPKWLQKWAALVLRIIN 1278

Query: 2113 LERTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQ-------AEGVYNQRLQNLAVTL 2165
             E     EE++    ++ +K  S + L  E+ + ++        A+   + R+QN+ +++
Sbjct: 1279 QEYDLFLEELVEAEKKKVQKEESLVNLTQEQINQKIDLPYLKYLAQSRKDARIQNIVISI 1338

Query: 2166 DKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLW-KGEGSLVEELIQCMAPHVEEDVLN- 2223
            DK++Y    +      + PP+  +  ++ +   W K E +L ++L++ +   +   V+N 
Sbjct: 1339 DKIKYYTNQI----NDSSPPLLNMQNDQLLENYWIKTENTLKDDLLEVLQ-QIHSKVINY 1393

Query: 2224 -----------DLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADL 2272
                       +   KIQ     G  D    + + ++ +  +          + DA + +
Sbjct: 1394 QLVEYAHIIITNATKKIQIFQKFGQFDHGLLVVRVVVLIISDFFLQMKNCNLQSDALSLI 1453

Query: 2273 IHIYAYTKCFFRVQEYKAFTSPPVYISPLDL--GPKYADKLGADLQVY-----RKTYGEN 2325
            +H +A+T  +F+   YK FTS    I   D+     + +    ++Q +     +KTY   
Sbjct: 1454 LHFHAFTHKYFKTHSYKKFTSEEQIIQKEDIINVELFDEDQQGNIQEFTAYSDKKTYTSL 1513

Query: 2326 YCLGQLIFWHIQTNADPDCTLARASRGCLSLPDI-------GSFYAKVQKPSRHRVYGPK 2378
            +  GQL  W+ Q+  +P  TL+   RG L  P +        + Y  V      +   PK
Sbjct: 1514 FVWGQLNMWYKQSVTNPATTLSLERRGPLIYPQLSNSFKESSTLYPFV---DNKQAENPK 1570

Query: 2379 TVRFMLSRMEKQPQRPWPKDRI--WAFKSSPRIFGSPMLDS 2417
             V   +  ++ +P+  WP D    W+FK+    +GS M DS
Sbjct: 1571 QV--FMDHLKTKPECYWPGDNFNKWSFKNQMSQYGSFMYDS 1609


>gi|145532427|ref|XP_001451969.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124419646|emb|CAK84572.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1024

 Score =  255 bits (652), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 181/594 (30%), Positives = 297/594 (50%), Gaps = 44/594 (7%)

Query: 1779 RKQLGDQ--VFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLL---RTLNKQV 1833
            R   GD   V E   +GID YT N++++ +P  L++   +K+ FIE ++L   R  +K+ 
Sbjct: 321  RTNFGDNADVEETLCWGIDVYTRNVIINILP--LNYVESQKNQFIEKLILAINRPNDKER 378

Query: 1834 RHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAYRKGL 1893
             +  G     ++   + +     K+     D +  K  + ++K +D      +  + KG 
Sbjct: 379  GYDMGLACDYIIRESRMLSSLYNKD-----DRKMAKSIKRVIK-LDG---GGFRIHTKGC 429

Query: 1894 GVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQK--NNEDPAPEFYNIYLERP 1951
            G+VC  + G   +  ++ +LGE+Y  W+W+EKQD I+   K  N +D  P+FYNI L+R 
Sbjct: 430  GLVCVNKFGIKTNSLIIPYLGEIYQPWRWYEKQDFIKKQMKEQNKKDILPDFYNIMLDRH 489

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
              D DG D++ VD ++K N++SR+ HSC PNC    T  +G Y IG+Y +R I +GEE+T
Sbjct: 490  LDDEDGIDILFVDPINKGNFSSRLSHSCNPNCGTVTTVSNGTYVIGMYAMRDIQFGEELT 549

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACE 2071
            FDY S TESK+E   ++CLCGS+ C+  YL L+ +  +  +L   H  L R+ ++  +C 
Sbjct: 550  FDYCSFTESKQEQLQALCLCGSENCKKYYLGLSNQREYNAILDRTHCFLKRNAILFNSCL 609

Query: 2072 LNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKR 2131
             N   ++  L+  +  +GS LL G P W+  +  +L+ FI+       EE + +  E   
Sbjct: 610  DNFKIDQSLLD--KYKIGSSLLTGCPFWLKCWICQLLVFID-------EEYIIYKAELDT 660

Query: 2132 KYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSP 2191
            K+  +   E EK + +  A+    +R+QNL  TLDK+++ ++          PP+ +++ 
Sbjct: 661  KFILN--EETEKWN-QFTAQLHSEERIQNLIFTLDKIKFFLK----QSDTVEPPLTKITN 713

Query: 2192 EETVSFLW--KGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLL 2249
            E+ +   W    E  L  EL Q    H  + ++  +   +   D     D+Q +L  + L
Sbjct: 714  EDLIMNFWGMTNESLLSNELYQLFQKHGLKKLMELI---VLIQDKRHLYDVQEQLLLTRL 770

Query: 2250 WLRDEVRNLPCTYKCRHDAAADLI-HIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYA 2308
                    L    +  +     LI  + A+T  +F+  EYK F SPP  I  L+ G    
Sbjct: 771  LFLVLSHLLLQQKQSFYYEGLSLILQMMAFTYTYFKPTEYKGFQSPP--IDDLEWGK--V 826

Query: 2309 DKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSF 2362
              +    +   KTY   +  GQL+ W+ QT   P  +L    RG L  P I SF
Sbjct: 827  GLIKRKCKAEGKTYSSLFAWGQLVGWYKQTVLAPQLSLCVDRRGTLLYPQISSF 880


>gi|412987959|emb|CCO19355.1| predicted protein [Bathycoccus prasinos]
          Length = 2064

 Score =  245 bits (625), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 156/464 (33%), Positives = 239/464 (51%), Gaps = 41/464 (8%)

Query: 1771 PEVKDYKPRKQLGDQVFEQEVYGIDPYTHN----LLLDSMPDELDWNLLEKHLFIEDVLL 1826
            P V +  PR + G  V EQE YG D  T      +L +++P+  D    ++H  I   L+
Sbjct: 1120 PCVGERNPRLKSGRDVREQETYGCDFVTGRDAVAVLAEALPEFSD----DEHWGIYAKLM 1175

Query: 1827 RTLNKQVRHFT--GTGNTPMMYPLQPVIEEIEKEAVDDC-DVRTMKMCRGILKAMDSRPD 1883
              +N+     T        +    + + E+ E+  +    D      C   L     +  
Sbjct: 1176 NQVNESYGKMTPDTLATQSLALAAEDLAEKFERANITSPKDGMKNLACAKALWTFSKKAR 1235

Query: 1884 DK---YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNN---E 1937
            +    +V +RKG GVV  +E    + +FVV+FLGE+YP W W EKQD I+  QK     +
Sbjct: 1236 ENPNLFVVHRKGYGVVNIREKNIQKGEFVVDFLGEIYPPWAWMEKQDAIKQAQKAKGLKD 1295

Query: 1938 DPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIG 1997
              APEFYN+ +ERP GD  G+ L+  DAMH  N+A+R+ HSC PN +  +T VDG Y+I 
Sbjct: 1296 IGAPEFYNMQMERPGGDKHGFGLLFCDAMHYNNFAARMSHSCEPNVQVILTVVDGKYEIH 1355

Query: 1998 IYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELH 2057
             Y  R I  GEE+ ++Y+S ++S +E EA+ CLCG++ CRGSYL+  GE    +V    H
Sbjct: 1356 FYATREIQKGEELCYNYHSCSDSMKEVEAAFCLCGAKKCRGSYLSFVGENNNSQVFDSEH 1415

Query: 2058 GLLDRHQLMLEAC-------ELNSVSEE---DYLELGRAGLGSCLLGGLPNWVVAYSARL 2107
             +LDR+ ++L+A        E N  ++E     LE     +G  +L   P W+  Y A++
Sbjct: 1416 RILDRYAMLLDAIDEAKEKREGNDDNDEVVKTRLESLGFRIGCGILADAPKWLTNYYAKV 1475

Query: 2108 VRFINLERTKLP----EEILRHNLEEKRK----YFSDICLEVEKSDAEVQAEGVYNQRLQ 2159
              FI+ ER  LP    E    H++  +++    Y  +      + +AE++A  V   R+Q
Sbjct: 1476 ASFIDHERETLPPLIYEAAKEHHINRRKRGDPGYRGEFVY--TEKNAEIEAMAVRENRIQ 1533

Query: 2160 NLAVTLDKVRYVMRC--VFGDPK--KAPPPVERLSPEETVSFLW 2199
             LAV + K+R ++     F  PK   +PPP  +LS +ET+   W
Sbjct: 1534 ALAVCMSKIRRLLTLGEGFDSPKYGTSPPPYAKLSAKETIEKFW 1577



 Score = 90.9 bits (224), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 63/205 (30%), Positives = 96/205 (46%), Gaps = 14/205 (6%)

Query: 2245 RKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLG 2304
            R +LLWLRD + +LP T   RHD  AD++H++A T+ F++             I  +D+ 
Sbjct: 1733 RAALLWLRDALLDLPVTPCARHDLCADIVHLFANTEHFYKFDHLAPCYQTQAGIQ-IDVR 1791

Query: 2305 PKYADKLGADLQV--------YRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSL 2356
                   G   Q         + K Y  +Y    L+ WH Q  ADP   +  + +GC  L
Sbjct: 1792 EDEVMAFGVGAQAASHKIASSHSKKYKHDYIPAALLSWHKQELADPTKLVHVSFKGCAYL 1851

Query: 2357 PDIGSFYAKVQKPSRHRVYGP--KTVRFMLSRMEKQPQRPW-PKDRIWAFKSSPRIFGSP 2413
            PDI   Y  V+  ++  V G   +     LS + ++   PW PK   WA  ++ ++ GSP
Sbjct: 1852 PDISCCYG-VRAEAKPIVNGCDLENRSKWLSCLTEKINEPWEPKTGPWAGTNAQKLIGSP 1910

Query: 2414 MLDS-SLTGCPLDREMVHWLKHRPA 2437
            MLD+       LD  ++ WL+ R A
Sbjct: 1911 MLDAWRKKQSMLDESVLDWLRTRKA 1935


>gi|340508154|gb|EGR33923.1| SET domain protein [Ichthyophthirius multifiliis]
          Length = 935

 Score =  243 bits (620), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 158/536 (29%), Positives = 278/536 (51%), Gaps = 59/536 (11%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQK--NNEDPAPEFYNIYL 1948
            KG+G+ C    G  +++F+ E++GE+Y  W+WFEKQD ++   K  N ++  P+F+NI L
Sbjct: 413  KGIGLTCINSQGIQKNEFITEYVGEIYEPWRWFEKQDLLKKFIKENNQQNILPDFWNIML 472

Query: 1949 ERPKGDADGYDLVVVDAMHKANYASRICHSCRPNC-EAKVTAVDGHYQIGIYTVRGIHYG 2007
            E  K D  GYD++ +D + K N++SR+ HSC+ NC    V   +G Y IG+Y ++ I YG
Sbjct: 473  EIHKDDPKGYDILFIDPIIKGNFSSRLNHSCQANCGTVPVINNEGKYVIGLYAMQQISYG 532

Query: 2008 EEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNL--TGEGAFEKVLKELHGLLDRHQL 2065
            EE+TFDY +VTESK+E+  ++CLCGS  CRG YL L  TG   + ++L+++   L R  +
Sbjct: 533  EELTFDYMAVTESKQEHNRALCLCGSSKCRGKYLELSTTGIKEYNQILEDISCFLHRTYI 592

Query: 2066 MLEACELN-SVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILR 2124
            +  +C  N  ++ ED   L      S +  G P W++ +  + +R IN E     +E   
Sbjct: 593  LEYSCRKNVQLNAEDEQLLESESFRSNIKQGCPIWLLKWICQSLRIINQEYNIFLQE--- 649

Query: 2125 HNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPP 2184
              L  K KY +     + K  A+ + +     R+QNL +T++KV+Y +     DPK   P
Sbjct: 650  --LRNKNKYTN---FRILKYQAQTKKDN----RIQNLVITINKVKYFINKT-NDPK---P 696

Query: 2185 PVERLSPEETVSFLWKGEGSL-----VEELIQCMAPH--------VEEDVLNDLKSKIQA 2231
            P+++L+ E  ++ LW  +        + E++Q +  +           +++N +  ++Q 
Sbjct: 697  PLQQLNQEYILNILWLNDKQYSIKEGINEILQNIPDNETNYQYVIYSRNLINSIDKQVQI 756

Query: 2232 HD--PSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYK 2289
            ++     S+ +   +R  LL + D ++ +           + L+H +A+T  +F    YK
Sbjct: 757  YNSFKQYSQSLLL-IRFGLLTISDFLQQIQ-----NQQITSSLLHFHAFTHIYFTNYAYK 810

Query: 2290 AFTSPPVYISPLDL--GPKYADK--------LGADLQVYRKTYGENYCLGQLIFWHIQTN 2339
             FTS  + I   D+     Y +K        L   L+   KTY   +  GQL  W+ Q+ 
Sbjct: 811  QFTSEEILIQKGDVINVELYEEKQSQNQEQGLDNFLKKLSKTYQSLFVWGQLNIWYKQSV 870

Query: 2340 ADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPW 2395
            A+P   L+   RG +  P + +F + +Q   +++       + ++++++KQ  + W
Sbjct: 871  ANPGNLLSAERRGTIVYPCLQNFIS-IQNDKKYQ-----CSKSIINQIQKQRNKYW 920


>gi|412993322|emb|CCO16855.1| predicted protein [Bathycoccus prasinos]
          Length = 1476

 Score =  231 bits (589), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 123/335 (36%), Positives = 190/335 (56%), Gaps = 21/335 (6%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            Y  + KG+GVVC ++ G     FV  +LGE+Y  W+W+E+ D ++    N E   P F+N
Sbjct: 477  YRLHPKGVGVVCIRKEGLQPGMFVNHYLGEMYSPWRWYERCDAMKKRNPNQE--LPSFFN 534

Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
            I LERPK D  G D V V+AMH+  +ASR+ HSC  NC+  V + +G   IG+YT   I 
Sbjct: 535  ITLERPKDDVRGKDTVFVEAMHECEFASRMSHSCAGNCQTTVISHEGKLSIGVYTNSKIE 594

Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQL 2065
             GEE+ +DY+ VTES++E+ A++CLC S  CRGS+L+  G   F  V+ E H  L R+ +
Sbjct: 595  CGEELCWDYSCVTESEKEFRAAICLCSSPNCRGSFLSYAGSSTFTAVMNEKHNFLHRNAM 654

Query: 2066 MLEACELNSVSEEDYLELGRAGLGSCLLGGL-----PNWVVAYSARLVRFINLERTKLPE 2120
            +  AC    +++ED   L   G+    L  L     P+W+V +++ ++R++ LE   LPE
Sbjct: 655  LCRACS-EPLTDEDLALLSDYGIRDSALNTLSGERAPDWLVKWASLILRYVQLEEKLLPE 713

Query: 2121 EILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPK 2180
             +    +++  KY       +E + AE    GV   RLQN+ VTLDK++Y +R     P+
Sbjct: 714  ALCNLPMQKGVKY------NLEGAKAETY--GVVATRLQNIVVTLDKIKYFLR----QPE 761

Query: 2181 KAPPPVERLSPE-ETVSFLWKGEGSLVEELIQCMA 2214
            ++  P  R + E + +  LW G  S++   I  ++
Sbjct: 762  QSDKPFMREATEADIIEHLWTGSESILVRAIGALS 796



 Score = 88.6 bits (218), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 67/202 (33%), Positives = 102/202 (50%), Gaps = 19/202 (9%)

Query: 2221 VLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTK 2280
            VL+D+ +K +    + SE      ++ L  + D +RNL       H A ADL+ +YA T+
Sbjct: 994  VLDDIIAKSKMTPSTASE-----AKQWLAEVSDSIRNL----GIEHCACADLLLMYARTQ 1044

Query: 2281 CFFRVQEYKAFTSPPVYISPLDLGPK-YADKLGADLQ-VYRKTYGENYCLGQLIFWHIQT 2338
             +F  +++  F SPPV +   D G K  A K+   ++    K Y  +Y  GQ+  W  QT
Sbjct: 1045 RWFTPEKFVGFMSPPVQLREHDPGCKQTASKISIHVKNTLTKKYQPHYPWGQMCSWFKQT 1104

Query: 2339 NADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQ-PQRPWPK 2397
              DP  +L+   RG LSLPD+ S Y      +    Y  KT R  L R+ ++   R WP 
Sbjct: 1105 IYDPTASLSADRRGTLSLPDVESAY------NNGGAYV-KTDRKQLFRILRENASRNWPT 1157

Query: 2398 DRIWAFKSSPRIFGSPMLDSSL 2419
               W+FK+  +++GSP  D +L
Sbjct: 1158 TMQWSFKNYAKMYGSPWFDDAL 1179


>gi|159486133|ref|XP_001701098.1| histone methyltransferase [Chlamydomonas reinhardtii]
 gi|158271992|gb|EDO97800.1| histone methyltransferase [Chlamydomonas reinhardtii]
          Length = 1028

 Score =  224 bits (570), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 112/284 (39%), Positives = 170/284 (59%), Gaps = 11/284 (3%)

Query: 1848 LQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDD 1907
            L  V+E +  EA +  D    +    ++  +     + +  + KG GV+C   GG     
Sbjct: 725  LLKVLETVAAEAAERGDAPCAQAAEAVIARLRQIGWNYFRLHPKGRGVICRVPGGLEPFT 784

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            FV E+LGE++  W+WFE QD I+ L +      P+FYNI LERP+ D DGYD++ V+A  
Sbjct: 785  FVEEYLGELHSPWRWFEIQDAIKKLTQQE---LPDFYNITLERPRDDPDGYDVLFVEAAF 841

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
             A++ASR+ HSC PNC A V +V+G   I +YT R I  GEE+TFDY SVTES++EY  +
Sbjct: 842  MASFASRMSHSCTPNCAAVVVSVNGRLTIAMYTKRRIEAGEELTFDYRSVTESEKEYREA 901

Query: 2028 VCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAG 2087
            +CLCG++ CRGSYL  +G  AF +V++E H  L R  L+L A     ++E+D+  L    
Sbjct: 902  ICLCGTRSCRGSYLYYSGSDAFTQVMEEKHNFLHRQVLLLRASA-EDLTEDDHTRLRAHA 960

Query: 2088 LGSCLLGG-------LPNWVVAYSARLVRFINLERTKLPEEILR 2124
            +G   LG         P+W+V ++A +++++ LE+ +LP  +L+
Sbjct: 961  IGPTSLGDGSPGNNRAPDWLVKWAALVLQYVELEKRELPSFLLK 1004


>gi|340503864|gb|EGR30374.1| SET domain protein [Ichthyophthirius multifiliis]
          Length = 827

 Score =  223 bits (569), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 178/645 (27%), Positives = 301/645 (46%), Gaps = 81/645 (12%)

Query: 1777 KPRKQLGDQVFEQEVYGIDPYTH-NLLLDSMPDELDWNLLEKHLFIEDVLLRTLNKQVRH 1835
            K +K +   V E   +GID YT  NL      +E D   ++KH FI+  LL+  N     
Sbjct: 185  KLQKIIDQDVQETLCWGIDLYTKKNLHYILHENECD---IKKHNFIQRSLLKAAN----- 236

Query: 1836 FTGTGNTPMMYPLQPVI-------EEIEKEAVDDCDVRTMKMCRGILKAMDSRPD-DKYV 1887
              G     M    + +I       EE  K+ + +   R  K  + ILK +    D + + 
Sbjct: 237  LCGNNGWDMQKVCEFIIQNSKKKDEENNKDYIFNNQDR--KFSKVILKTLKINVDPEAFR 294

Query: 1888 AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQK--NNEDPAPEFYN 1945
             + KG+GV+C    G  ++D ++++ GE+Y  ++WFE+QD ++   K  N +D  P+FYN
Sbjct: 295  IHSKGMGVICLNRQGIEKNDLIIQYFGEIYRPYRWFERQDFVKKFMKENNQKDVLPDFYN 354

Query: 1946 IYLERPKGDADGYDLVV-------------VDAMHKANYASRICHSCRPNCEAKVTAVDG 1992
            I LE  K D  GYD++V             VD M K NY+SR+ HSC PNC    T  DG
Sbjct: 355  IMLEIHKNDPKGYDILVKKQKKQQNNIKKYVDPMQKGNYSSRLSHSCDPNCGTVATISDG 414

Query: 1993 HYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGA--FE 2050
             Y I +Y ++ I YGEE+ FDY++VTESK+E+  + CLCG+  CRG Y+  +      + 
Sbjct: 415  KYNISMYAMKSIEYGEELAFDYSAVTESKQEHMQATCLCGTYKCRGKYIEFSNNNLKEYN 474

Query: 2051 KVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRF 2110
             +L+++H  L R+  +L  C    ++ ED   L +  +   +    P+W++ + + +++ 
Sbjct: 475  FILEKMHCFLKRNSDLLR-CSNEILNSEDLKLLEKHNMRKNITENCPSWLMKWISIILKT 533

Query: 2111 INLERTKLPEEILRHN--LEEKRKYFSDICLEVEKSDAEVQ------------------- 2149
            I+ E++   E  +  N  L   +K   D+  + E+ D  +Q                   
Sbjct: 534  IDEEKSLFLEHQMNTNIFLLHSQKELRDLEEKNEEEDQSLQIKKEEKIKEIQKHVQFINY 593

Query: 2150 -AEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLW-KGEGSLVE 2207
             A      R+QNL +++DKV+Y ++ V         P++ L+ ++    L  K + S+++
Sbjct: 594  LANSKVENRIQNLVISIDKVKYFLKKV----NDFQAPLDYLNFDQIFENLCGKNKESILD 649

Query: 2208 ELI--------QCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLP 2259
            E+         QC    V  ++    KS +  +     +      R   L + +  + + 
Sbjct: 650  EIYDLITSYKNQCGQILVYFNIFR--KSFLPKYASISKKQGLLAFRLFCLNISEFFKKIQ 707

Query: 2260 CTYKCRHDAAADL-IHIYAYTKCFFRVQEYKAFTSPPVYISPLDL-GPKYADKLGADLQV 2317
              +   H +A  + ++ Y++T  +F   EY +  S  + IS  ++      D      + 
Sbjct: 708  SNF---HSSATFITLYFYSFTHTYFTPHEYASVCSEKMKISETEMQNLHLLDTEKKKKKH 764

Query: 2318 Y--RKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIG 2360
            Y  ++ Y   +  GQL  W  QT A P  TL++  RG LS P I 
Sbjct: 765  YEEQRIYSPQFIWGQLTVWFKQTIASPQATLSQDRRGTLSFPSIN 809


>gi|300175979|emb|CBK22196.2| unnamed protein product [Blastocystis hominis]
          Length = 671

 Score =  216 bits (551), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 120/371 (32%), Positives = 198/371 (53%), Gaps = 24/371 (6%)

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
            PEFYNI LERP     GYD++ +D + + N+ SR+ HSC PNC      V+G   I +  
Sbjct: 6    PEFYNIMLERPPDSRGGYDVLYIDPIFRGNFGSRMSHSCSPNCATTTITVNGRLAIVLVA 65

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLL 2060
            +R I +GEE+ FDY  V+ESK E+E + CLCGS  CRGS+++     +F +V+ +    +
Sbjct: 66   LRPIAWGEELCFDYACVSESKTEFEMATCLCGSLQCRGSFVSYADGNSFMQVMAKRFPFV 125

Query: 2061 DRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPE 2120
             R  ++L++C  ++VS++D   L + G+   +L G P W+  + A ++RF+  E   LP 
Sbjct: 126  KRTAVLLDSCN-SAVSDDDARRLAKHGIKCSMLEGAPAWLQKWIASILRFMEFEEASLPA 184

Query: 2121 EIL-RHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDP 2179
            E+    +   +  Y SD  L +E       + GV+  RLQN+A+T D+V++ +      P
Sbjct: 185  ELRGMKDCLGRDLYPSDAALRLE-------SHGVFATRLQNVAITADRVKHFLA---QQP 234

Query: 2180 K--KAPPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKI-----QAH 2232
               +A PP   L+  E + FLW G  S+++ L++     + +       + +     Q  
Sbjct: 235  AELRAVPPFRLLTDAEVLDFLWFGAHSVMKRLLRAALAEISDLPAVQFFTALDRALQQPR 294

Query: 2233 DPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFT 2292
            D +  E ++ ++      +RD +  LP   +C H AAADL+ +Y +T  FF   EY++  
Sbjct: 295  DAATLEWVRLQMTS----VRDGLLTLPAQRRC-HQAAADLLTMYLHTAVFFVATEYRSVK 349

Query: 2293 SPPVYISPLDL 2303
              P+ +   DL
Sbjct: 350  GAPISLHCYDL 360



 Score = 61.2 bits (147), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 32/100 (32%), Positives = 52/100 (52%), Gaps = 6/100 (6%)

Query: 2319 RKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRV-YGP 2377
            +++Y   Y  GQL  W  Q   DP  +L    +GC+ LPD+ S Y    +  R  + Y P
Sbjct: 531  QRSYPSGYVWGQLAMWFKQAGNDPSLSLTNERKGCVLLPDVESCY----ESKRFDLNYTP 586

Query: 2378 KTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDS 2417
            +     ++R+E  P +PW     W F+ S  ++G+PM+D+
Sbjct: 587  EERGKWIARLEHCPAQPWLSTH-WTFRRSAHVYGTPMMDA 625


>gi|255073265|ref|XP_002500307.1| set domain protein [Micromonas sp. RCC299]
 gi|226515569|gb|ACO61565.1| set domain protein [Micromonas sp. RCC299]
          Length = 1496

 Score =  213 bits (543), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 151/490 (30%), Positives = 230/490 (46%), Gaps = 96/490 (19%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG+G+VC +  G     ++ ++LGE+Y  W+WFE+QD I+  + + E   P+F+NI LER
Sbjct: 823  KGIGIVCIRPEGLPPGTYIQDYLGELYSPWRWFERQDAIKKREPDKE--LPDFFNITLER 880

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGH----------------- 1993
            P  DA G+D++ V+A H+  +ASR+ HSC PNC+    AV                    
Sbjct: 881  PAEDAAGHDVLFVEAAHRCTFASRLSHSCAPNCQTVGVAVADQTDQKLDQKLDQNNLDQK 940

Query: 1994 -----------YQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
                         I  YT R + YGEE+ ++Y+ VTES++EY A++CLC S  C+G++L+
Sbjct: 941  LGQTADPPRTKLSIAQYTTRHVSYGEELCWNYSCVTESEKEYRAAICLCSSTTCKGAFLD 1000

Query: 2043 LTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGG------- 2095
              G  AF  V+   H  LDR+ L++ AC    ++ +D   L  AG+ S  L         
Sbjct: 1001 YAGSSAFTAVMNVRHNFLDRNALLIRACS-EPLTSDDRARLATAGIKSAALTMPGERTRT 1059

Query: 2096 -----LPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQA 2150
                  P W++ +++  + +I +E+  LP  +    ++          +  +   A   A
Sbjct: 1060 GERVECPEWLIKWASLTLEYIEMEKELLPAALTAKPIDG---------IVYDAGFAAATA 1110

Query: 2151 EGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVER-LSPEETVSFL----------- 2198
             GV   R+ NL VTLDK++YVMR     P +   P  R LS  E V  L           
Sbjct: 1111 AGVVATRISNLVVTLDKIKYVMR----QPGQNRAPFLRHLSDNEVVDHLLGDILKRAADT 1166

Query: 2199 -------------WKGEGSL---VEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQR 2242
                         + G+G+     E  +       E DVL  +   + A  P  SE   +
Sbjct: 1167 FAKKVGVKAGLPFFGGKGARNAGAEAKMPAAVGQREGDVLRFILG-VLAKPP--SEFTPQ 1223

Query: 2243 ELRKSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLD 2302
            E  ++L     ++R+L       H A ADL+ +YA T  +   + Y  F SPPV + PL 
Sbjct: 1224 EASQTLETCSRKIRDLGAV----HCAMADLLLLYARTAHWCTPEAYAGFQSPPVRLVPL- 1278

Query: 2303 LGPKYADKLG 2312
              PK  DKLG
Sbjct: 1279 --PK--DKLG 1284



 Score = 67.0 bits (162), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 57/103 (55%), Gaps = 7/103 (6%)

Query: 2317 VYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYG 2376
            V +K Y  ++  GQL+ W  QT  DP  +L+   RG +SLPD  S Y       ++ V G
Sbjct: 1356 VMKKKYQPHFAWGQLVSWFKQTIYDPSASLSAERRGAMSLPDPESAYG-----DKNYVTG 1410

Query: 2377 PKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSL 2419
             +  R ML ++ + P + WP    W+F++  +++GSP +D ++
Sbjct: 1411 DR--RSMLRQIARDPSKMWPTTWAWSFRNPGKVYGSPFIDDAI 1451


>gi|340507839|gb|EGR33721.1| SET domain protein [Ichthyophthirius multifiliis]
          Length = 667

 Score =  202 bits (514), Expect = 2e-48,   Method: Composition-based stats.
 Identities = 152/518 (29%), Positives = 252/518 (48%), Gaps = 67/518 (12%)

Query: 1889 YRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDP--APEFYNI 1946
            + KG G++C  + G  ++DF+ E++G++Y  W+WFEKQ+ I+ + K        P+F+NI
Sbjct: 122  HTKGKGLICINKKGIKQNDFITEYIGQIYQPWRWFEKQNFIKKIIKEKYKNYILPDFWNI 181

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEA-KVTAVDGHYQIGIYTVRGIH 2005
             LE  K D  GYD++ +D++ K N++S I HSC+PNC          +Y I +Y ++ I 
Sbjct: 182  MLEIHKDDQKGYDILYIDSISKGNFSSSINHSCQPNCGTFSFITNQKNYVIAVYAIQQIE 241

Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNL--TGEGAFEKVLKELHGLLDRH 2063
            YG+E+TFDY ++TES +E + S CLC S  CRG YL+L  T    F ++L ++H  LDR 
Sbjct: 242  YGQELTFDYMAITESIKEQQLSKCLCMSPNCRGLYLDLQNTNFKQFNQILDKIHNFLDRT 301

Query: 2064 QLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEIL 2123
             ++ +AC     + ED L L        ++   P W+  + + ++R IN E     +++L
Sbjct: 302  LIIQKAC-FEQFTNEDKLILEEFSFRFNIINDSPEWLQKWISYILRIINQENELFLKQLL 360

Query: 2124 -----RHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGD 2178
                 + N +E+++ F+              A+   +QR+QN+ +++DKV+Y +  +   
Sbjct: 361  GPESEKLNKKEQKQKFN-------------LAQYRKDQRIQNIVISIDKVKYYINQL--- 404

Query: 2179 PKKAPPPVERLSPE-------------------------ETVSFLWKGEGSLVEEL-IQC 2212
             +   PP  +L+ E                             FL K   +L+E   I  
Sbjct: 405  -QDFTPPFIKLNNEVLKYNIYIYIYIKQKIRRFFQNIWGNQKGFLKKDFLNLIEYFQINS 463

Query: 2213 MAPHVEEDVLNDLKSKIQ-------AHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYKCR 2265
              P     ++  +KS I+         D     +I  +LR  LL L D +  L    K  
Sbjct: 464  KNPK-NVKIIEQIKSLIKNTSECVFKEDNENFSNILIQLRFLLLILSDLIFQLQKDEKII 522

Query: 2266 H-DAAADLIHIYAYTKCFFRVQEYKAFTSPPVYI---SPLDLGPKYADKLGADLQVYRKT 2321
            + D  A ++H YA+T  FF   +YK   S P+ I     ++L     D+    L+  +++
Sbjct: 523  NIDGFAIILHFYAFTHEFFTAYKYKQHQSEPIKIFKDEIINLQFLQDDQQNTFLE-EQQS 581

Query: 2322 YGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDI 2359
            Y   +  GQL  W+ QT A P  TL    +G L  P++
Sbjct: 582  YSSLFVWGQLNTWYKQTVAFPATTLGIERKGTLIYPNL 619


>gi|146185998|ref|XP_001032856.2| SET domain containing protein [Tetrahymena thermophila]
 gi|146143072|gb|EAR85193.2| SET domain containing protein [Tetrahymena thermophila SB210]
          Length = 2057

 Score =  193 bits (490), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 98/273 (35%), Positives = 159/273 (58%), Gaps = 21/273 (7%)

Query: 1869 KMCRGILKAMDSRPD-DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQD 1927
            K  + + KA++ + D + +  + KG+GV+C    G  ++D ++E++GE+Y  ++WFE+QD
Sbjct: 819  KFAKILNKAIELKIDQEAFRIHPKGMGVICINRNGIDQNDLIIEYIGEIYRPYRWFERQD 878

Query: 1928 GIRSLQKNN--EDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEA 1985
             I+   K+N  +D  P+FYNI LE  K +  G D++ VD M K NY+SR+ HSC PNC  
Sbjct: 879  FIKKYMKDNNQQDVLPDFYNIMLELHKDEVKGIDILYVDPMQKGNYSSRLSHSCDPNCGT 938

Query: 1986 KVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSY--LNL 2043
              T   G+Y I ++ ++ I YGEE+ FDY++VTESK E++ S CLCG+Q CRG Y  LN 
Sbjct: 939  VATISKGYYNISMFALKSIEYGEELAFDYSAVTESKNEHKQSTCLCGTQKCRGKYIELNN 998

Query: 2044 TGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAY 2103
              +  +  +L ++H  L R+  +L +     ++E+D   L +  L   +  G   W    
Sbjct: 999  NNQKEYNYILDKIHCFLKRNSDLLRSGS-EPLTEDDMNLLEKYNLKQNVQKGCEKW---- 1053

Query: 2104 SARLVRFINLERTKLPEEILRHNLEEKRKYFSD 2136
               L+++I++        IL+   EE+  +FSD
Sbjct: 1054 ---LLKWISI--------ILKSVGEEQELFFSD 1075



 Score = 88.2 bits (217), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 79/294 (26%), Positives = 135/294 (45%), Gaps = 42/294 (14%)

Query: 2157 RLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLW----------KGEG--- 2203
            R+QN+ ++LDKV++ +  V    K   PP+  LS +E    LW          K E    
Sbjct: 1372 RIQNIIISLDKVKFYLNNV----KDIRPPLSYLSQKEIFENLWGRVNKFQKRRKPENQVY 1427

Query: 2204 SLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCTYK 2263
            S+VEEL+  +  +         +  I+  DP  ++ ++    K+LL  R    N+   + 
Sbjct: 1428 SIVEELMDLIKYYSHYKECQYTQKFIELFDPFMTKYVEESYEKALLAWRLFCLNIHSVFN 1487

Query: 2264 CRHD----AAADLI--HIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKLGADLQV 2317
               D    A A LI  + YA+T  +F   EY+ F S  + IS  ++     + L  D + 
Sbjct: 1488 DIIDPQFHAGALLIVLYFYAFTHTYFTPHEYQPFNSEKMTISETEMFN--LELLDEDKKN 1545

Query: 2318 Y-----------RKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIG-SFYAK 2365
                        +++Y   +  GQL  W  QT A P  TL++  RG LS P +  SF   
Sbjct: 1546 TKKPNKKKNYEEQRSYSSQFIWGQLTVWFKQTVASPQATLSQDRRGTLSYPQLNQSFKTN 1605

Query: 2366 VQK-PSRHRVYGPKTVR-FMLSRMEKQPQRPWPKDRI-WAFKSSPRIFGSPMLD 2416
            +   P + +    KT R   L+ M+++P+  WP ++  W+FK++ + +G+ + +
Sbjct: 1606 ILTYPFQEK--NDKTGRQTFLNHMKEKPKDMWPPEQAKWSFKNALKNYGTLLFE 1657


>gi|397573767|gb|EJK48860.1| hypothetical protein THAOC_32309, partial [Thalassiosira oceanica]
          Length = 1092

 Score =  192 bits (488), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 101/284 (35%), Positives = 145/284 (51%), Gaps = 22/284 (7%)

Query: 1871 CRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIR 1930
             R +  A D+  DD +  + KG G V   +GG   +  +  + GEVYP W+W EK D I 
Sbjct: 512  ARLVRLASDAVDDDFFRIHPKGHGSVVIGDGGLKANSLITYYRGEVYPAWRWCEKLDAIE 571

Query: 1931 SLQK--NNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVT 1988
             +QK  N     P+FYN+ +ERPK D  GY L+ VDA  K+   S   HSC P CE +V 
Sbjct: 572  RVQKEKNLRPNLPDFYNMAMERPKKDPRGYCLLFVDASRKSGLGSSFSHSCNPTCEVRVV 631

Query: 1989 AVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGA 2048
            ++ G  Q+ + T+R +  GEE+TFDYN+VTES +EY  +VCLCG + CRGS+L+      
Sbjct: 632  SLHGKLQLSMTTLRDLEQGEELTFDYNAVTESLDEYRFAVCLCGQRRCRGSFLHYATADC 691

Query: 2049 FEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGG------------- 2095
            +++VL     +  R   ++  C    +S ED   L R G  +   G              
Sbjct: 692  YQQVLSRNSPMAARFANLVRGCTKQVMSREDSAILARHGFNTAAFGAVSFNHHAAATSLV 751

Query: 2096 -------LPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRK 2132
                   +P W+  Y A  +R+I  ER  LP  +L + +E   K
Sbjct: 752  SRDSIDNVPIWLRTYVADCLRYIEYERRALPVALLCNQMERMSK 795


>gi|147775274|emb|CAN61590.1| hypothetical protein VITISV_033129 [Vitis vinifera]
          Length = 576

 Score =  190 bits (482), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 94/144 (65%), Positives = 112/144 (77%), Gaps = 1/144 (0%)

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
            +P+   + ++LVV DA+HKANYASRICH CRPN EAK+TAV+G YQIGIYTVR I  GEE
Sbjct: 319  QPQNKPNPWNLVV-DAIHKANYASRICHLCRPNREAKITAVEGQYQIGIYTVRQIQCGEE 377

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEA 2069
            I FDYNSVTESK+EYE SVCLCGSQVCR SYLNLTGEGAF+KVLK  HG+LD++QLM E 
Sbjct: 378  IIFDYNSVTESKKEYEVSVCLCGSQVCRMSYLNLTGEGAFQKVLKGCHGILDQYQLMSEL 437

Query: 2070 CELNSVSEEDYLELGRAGLGSCLL 2093
              L+++  +      R  LG  +L
Sbjct: 438  YTLSAMLRKFIENHTRLSLGKTVL 461



 Score = 85.5 bits (210), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 36/60 (60%), Positives = 46/60 (76%)

Query: 1043 LGEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVR 1102
              +WYYLDGAGHE+ PSSFSELQ LVDQ  IQKH+SV  K +K+W+P+TFA +   + V+
Sbjct: 258  FSDWYYLDGAGHEQWPSSFSELQSLVDQDSIQKHSSVLGKINKIWIPITFAADVPDAAVK 317



 Score = 71.2 bits (173), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 42/103 (40%), Positives = 58/103 (56%), Gaps = 8/103 (7%)

Query: 756 EDLHIDVRVGALLDGFTVIPGKEIETLGEILQTTFERVDWQNNG--GPTWHGACVGEQKP 813
           E L ID RV ALL  FT IPG+E+ETLGE+LQ +FE   W+  G  G +WH   +G Q  
Sbjct: 162 EGLQIDERVRALLKSFTFIPGRELETLGEVLQASFEHAQWEKLGAEGLSWHQLRIGGQP- 220

Query: 814 GDQKVDELY-ISDTKMKEAAELKS---GDKDHWVVCFDSDEWF 852
            DQ++D  +   +   KEA + +     DKD+     D  +W+
Sbjct: 221 -DQRIDRFFRYPEITSKEALDSRLSTFSDKDYAFAFGDFSDWY 262


>gi|307109213|gb|EFN57451.1| hypothetical protein CHLNCDRAFT_142939 [Chlorella variabilis]
          Length = 865

 Score =  186 bits (473), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 163/567 (28%), Positives = 244/567 (43%), Gaps = 154/567 (27%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG+G++C ++GG     F                      +++K   D  P+FYNI LER
Sbjct: 383  KGVGLICKQQGGIPPLTF---------------------DAVKKITGDELPDFYNIVLER 421

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
            PK D DGYD++ +DA  K   ASR+ HSC PNC+A V A  G   I +YT+R     EE+
Sbjct: 422  PKDDPDGYDVLFIDAAAKGALASRMSHSCTPNCQAIVMACGGRLTIALYTLR-----EEL 476

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEAC 2070
            TFDY+SVTES++E              GSYL  TG  AF++V+   H ++          
Sbjct: 477  TFDYSSVTESEKE--------------GSYLYFTGSRAFQQVMNTKHTVM---------- 512

Query: 2071 ELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSARL-----VRFINLERTKLPEEILRH 2125
                                        W +A  A L       +I  E   L E++L H
Sbjct: 513  ---------------------------GWAIARLAALSTEIICEYIEEEEAHLKEDLLGH 545

Query: 2126 NLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPP 2185
             L               ++ A  +A+GV   RLQN+ +TLDKV+ V++      ++    
Sbjct: 546  PLG-----------IYNEASATAEAKGVVINRLQNVVITLDKVKMVLQAPNQTDEELQSA 594

Query: 2186 VERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELR 2245
            V+R S E          G+L +     M P ++     D + K                 
Sbjct: 595  VQRHSAELP--------GALSKMCSLVMQPALD---FADARCK----------------- 626

Query: 2246 KSLLWLRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQE-YKAFTSPPVYISPLDLG 2304
              L+ L +++R L         A ADL+ +YA T  +F  +  YK  TSPPV ++  DL 
Sbjct: 627  --LMQLYEQLRALDVENNGGLTAVADLLLLYASTLHWFTCERGYKGVTSPPVPLNLADLA 684

Query: 2305 --------PKY--------------ADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADP 2342
                    P                +DKL     + RK Y   Y  GQL  W  QT  DP
Sbjct: 685  LDRTQEQTPAAAAAAVAAAAAAVVDSDKLLGSSNL-RKVYRPLYLWGQLSGWFKQTVNDP 743

Query: 2343 DCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTV---RFMLSRMEKQPQRPWPKDR 2399
              +L+   RG +SLPD+ S +A  +  +R+      T+     ++ +++K+P   W    
Sbjct: 744  TASLSAERRGTISLPDVDSSFAGGK--TRYTAKASSTLCDRGDLIDQLDKRPDAMWRTGT 801

Query: 2400 IWAFKSSPRIFGSPMLDSSLTGCPLDR 2426
            +W+F++  +++GSPMLD+    C L R
Sbjct: 802  LWSFRNEAKVYGSPMLDA--VWCELSR 826


>gi|147814949|emb|CAN70304.1| hypothetical protein VITISV_006637 [Vitis vinifera]
          Length = 694

 Score =  182 bits (461), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 105/213 (49%), Positives = 134/213 (62%), Gaps = 14/213 (6%)

Query: 651 MECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSITSDSVTQL 710
           ME GPS+LCDLK  VE GVLVSDH IKH+DS+RW T++NA S LV VNFP +  D+VTQL
Sbjct: 1   MERGPSKLCDLKKFVE-GVLVSDHLIKHIDSDRWLTIKNAASLLVPVNFPLLVLDTVTQL 59

Query: 711 VSPPEASGNLLADTGDTAQSTG---EEFPVT--LQSQCCPDGSAAAAESSEDLHIDVRVG 765
           VSPPEA GN LA+ GDT +S     EE P    LQS  C + ++ A+E  E L ID RV 
Sbjct: 60  VSPPEAPGNPLAEAGDTTESNKLLEEETPAATLLQSMSCNNDNSIASEPLEGLQIDERVR 119

Query: 766 ALLDGFTVIPGKEIETLGEILQTTFERVDWQNNG--GPTWHGACVGEQKPGDQKVDELY- 822
           ALL  F  IPG+E+ETLGE+LQ +FE   W+  G  G +WH   +G Q   DQ++D  + 
Sbjct: 120 ALLKSFAFIPGRELETLGEVLQASFEHAQWEKLGAEGLSWHRLRIGGQP--DQRIDRFFR 177

Query: 823 ISDTKMKEAAELKS---GDKDHWVVCFDSDEWF 852
             +   KEA + +     DKD+     D  +W+
Sbjct: 178 YPEITSKEALDSRLSTFSDKDYAFDFGDFSDWY 210



 Score =  122 bits (305), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 55/72 (76%), Positives = 62/72 (86%)

Query: 1954 DADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFD 2013
            + + ++LVVVDA+HKANYASRICH CRPN EAKVTAV+G YQIGIYT+R I  GEEI  D
Sbjct: 214  ELNPWNLVVVDAIHKANYASRICHLCRPNREAKVTAVEGQYQIGIYTIRQIQCGEEIILD 273

Query: 2014 YNSVTESKEEYE 2025
            YNSVTESKEEYE
Sbjct: 274  YNSVTESKEEYE 285


>gi|255084155|ref|XP_002508652.1| set domain protein [Micromonas sp. RCC299]
 gi|226523929|gb|ACO69910.1| set domain protein [Micromonas sp. RCC299]
          Length = 1342

 Score =  178 bits (451), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 194/760 (25%), Positives = 322/760 (42%), Gaps = 157/760 (20%)

Query: 1783 GDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNKQVRHFTGTGNT 1842
            G  V   E+ G D +T   + ++MP     N + +H    D  +  + + +R     G  
Sbjct: 166  GVDVAMVELKGFDAHTREKIAENMP-----NAVAEHDV--DEFIDLVAQTMRLDAVAGAD 218

Query: 1843 PMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGG 1902
            P +      I E +               + ++K     P + + A  KG+G+V  K+GG
Sbjct: 219  PSLELAAKTIAESKAATN-----AARACAKALVKLCAKDPKE-FKAKSKGVGLVVIKDGG 272

Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKN--NEDPAPEFYNIYLERPKGDADGYDL 1960
              +D ++  + GE+YP W+WFEK+   ++++++   +D  P FYN  +ER   D  GYD+
Sbjct: 273  IPKDAYLGAYCGELYPGWRWFEKEAAAQAVRRDVKRDDEVPTFYNAAVERDLHDPRGYDV 332

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
            + +D M K +  +R  HSC+PN E +V   +G Y + + T R +  GEEI +DY   T+S
Sbjct: 333  LFIDGMVKGSVLTRASHSCQPNAEMRVRVREGKYSVEMVTTREVRTGEEICWDYRCQTDS 392

Query: 2021 KEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLL-DRHQLMLEACELNSVSEED 2079
            ++E   ++CLCGS+ CR SYL+  GE        E   +L D  +L+    + +++   +
Sbjct: 393  EKEMRRAICLCGSKNCRVSYLHYNGESELAAFADERCAVLHDAARLLASCVDADALRLPE 452

Query: 2080 YLEL---GRAGLGS------------------------------CLLGGLPNWVVAYSAR 2106
             L L   G  G G+                               +L GLP W   ++++
Sbjct: 453  TLNLNPKGTPGKGTPGKRGNNGGKRADDHWKSALIAAGVRADDEGMLAGLPQWARKFASK 512

Query: 2107 LVRFINLERTKLPEEILRHNLEE--KRKYF------------------------------ 2134
             V   + E+     ++L H+L E  KR+ +                              
Sbjct: 513  CVATAHEEK-----KVLTHSLYESAKRRAYEAIDAARAEAAEYETDPEAWKKRFPRTVST 567

Query: 2135 --------SDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVM--RCVFGDPKK--- 2181
                    SD  L    +DA+ +A G++  R+Q+LAVT+DKVR V+      GD +K   
Sbjct: 568  PPTVPHEPSDADL---GADAKAEASGIHAARIQSLAVTMDKVRRVLAVHARGGDAEKPAA 624

Query: 2182 ------APPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEE--DVLNDLKSKIQAHD 2233
                  A PP+  L+ E+  + L      L        A +V+   D L+ +     A D
Sbjct: 625  GVDVSAAAPPLRLLNDEDAAAHLVAYASRLAS------AANVDNPFDALS-VDRLSAADD 677

Query: 2234 PSGS-EDIQRELRKSL-----------LWLRD-EVRNLPCTYKCRHDAAADLIHIYAYTK 2280
             +GS ED +R +R              LW R  E      T +    A  DL ++ + T+
Sbjct: 678  ENGSDEDARRWVRSDAARRILREMADNLWKRSLEESGASDTDRIARLACGDLAYLASQTR 737

Query: 2281 CFFR-VQEYKAFTSPPVYISPLDLGPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTN 2339
             FF  V   + F SP V +        +  + GA + V R  Y ++  LG L  W  +  
Sbjct: 738  NFFAPVAGGERFKSPYVAMG------SHGSRDGAGV-VRRMDYPKHSALGFLCTWREEYL 790

Query: 2340 ADPDCTLARASRGCLSLP--DIGSFYA-KVQKPSRHRVYGPKTVRFMLSRM-----EKQP 2391
              P   LA  +RG L LP  D G+      Q P ++R     ++  + + +      +  
Sbjct: 791  ERPCDRLATDARGGLCLPRFDRGALANYSTQNPGKNRRRAHASIDALSAHLCGEMAARGG 850

Query: 2392 QRPW-PKDRIWAF---------KSSPRIFGSPMLDSSLTG 2421
             + W P D  W +         +  P + GSP++D++L G
Sbjct: 851  GKAWAPIDGCWRWDFDLDAGDERDGP-VLGSPVVDAALGG 889


>gi|147866108|emb|CAN83034.1| hypothetical protein VITISV_019861 [Vitis vinifera]
          Length = 343

 Score =  177 bits (449), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 93/160 (58%), Positives = 112/160 (70%), Gaps = 7/160 (4%)

Query: 651 MECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSITSDSVTQL 710
           ME GPS+LCDLK  VE GVLVSDH IKH+DS+RW T++NA S LV +NFP + SD+VTQL
Sbjct: 1   MERGPSKLCDLKKFVE-GVLVSDHLIKHVDSDRWLTIKNAASLLVPMNFPPLVSDTVTQL 59

Query: 711 VSPPEASGNLLADTGDTAQSTG---EEFPVT-LQSQCCPDGSAAAAESSEDLHIDVRVGA 766
           VSPPEA GN L + GDT +S     EE P T LQS  C + S+ A+E  E L ID RV A
Sbjct: 60  VSPPEAPGNPLVEAGDTTESNKLMEEETPATLLQSMSCNNDSSIASEPLEGLQIDERVRA 119

Query: 767 LLDGFTVIPGKEIETLGEILQTTFERVDWQNNG--GPTWH 804
           LL  F  IPG+E+ETLGE+LQ +FE   W+  G  G +WH
Sbjct: 120 LLKSFAFIPGRELETLGEVLQASFEHAQWEKLGAEGLSWH 159


>gi|219116062|ref|XP_002178826.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217409593|gb|EEC49524.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 2187

 Score =  173 bits (438), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 126/405 (31%), Positives = 184/405 (45%), Gaps = 62/405 (15%)

Query: 1785 QVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLN-----------KQV 1833
            +V EQ V+G+D YT   +   +  E D++      FIE  LL  +N              
Sbjct: 698  EVVEQPVWGMDCYTRRNIASCL--ETDFDPATALHFIEKWLLPAINACPIDLAHKISNAA 755

Query: 1834 RHFTGTGNTPM-------------------MYPLQPVIEEIEKEAVDDCDVRTMKMCRGI 1874
            R   G     M                   ++   P+ + + ++      V        +
Sbjct: 756  RILEGLPFESMEDGEYGEKENINDRKTPEKLWAYSPLGKALREKIKVAAPVWLTAAAYLL 815

Query: 1875 LKAMDSRPDDKYVAYRKGLG-VVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQ 1933
             KA  +   D +  + KG G V+ N +   G +  V  + GEVYP W+W EK D I   Q
Sbjct: 816  RKAYTALGPDFFRVHPKGHGSVLLNSK--VGANTLVTFYRGEVYPSWRWGEKMDAIEITQ 873

Query: 1934 -KNNEDPA-PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVD 1991
             +    PA P+FYN+ LERP+ D  GY L+ VDA  KA + S + HSC P CE +VTAV+
Sbjct: 874  SRKALKPALPDFYNMALERPQIDPRGYGLLFVDASRKAGHGSSLSHSCAPTCEVRVTAVN 933

Query: 1992 GHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEK 2051
            G   + + T+R +  GEE+TFDYN+VTES  EY ++VCLCG   CRGS+L+      ++ 
Sbjct: 934  GELTLAMTTLRELEMGEELTFDYNAVTESLNEYRSAVCLCGYGKCRGSFLHFATADCYQL 993

Query: 2052 VLK-----------------------ELHGLLDRHQLMLEACELNSVSEEDYLELGRAGL 2088
            VL                        E   +L  H  +  A    SV+  + LE G+ G+
Sbjct: 994  VLNRNAPIATRFANLVKGSMKQVMSDEDTRVLHNHGFLTAAFGAISVNRRNLLEGGQKGV 1053

Query: 2089 GSCLLGGLPNWVVAYSARLVRFINLERTKLPEEIL-RHNLEEKRK 2132
                L  +P W+  + A  +R+I  ER  LP  ++  H    KRK
Sbjct: 1054 LDT-LDIVPVWLRTFVADTLRYIEYERRALPIALICDHVSSAKRK 1097


>gi|299473409|emb|CBN77807.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 3474

 Score =  172 bits (437), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 162/634 (25%), Positives = 279/634 (44%), Gaps = 101/634 (15%)

Query: 1675 SDMDFRSDGRARESRGAGDFTTDEGLDFSDDREWGARMTKASLVPPVTRKYEVIDQYVIV 1734
            + +    +G  +ES G G    + G +F +D   G R   ++L+     K   +    I 
Sbjct: 711  TTLAIAGEGATQESVGGGASGAENGGNFLNDE--GIRAQVSNLL----EKMCTMVSINIK 764

Query: 1735 ADEEDVRRK----MRVSLPEDY-AEKLNAQKNGSEELDMELPEVKDYKPRKQLGD-QVFE 1788
             D+ +V+++     R  +  D      N++  G  + D+ +P  +D      LG  Q+ E
Sbjct: 765  NDKNEVQKQPLTLTRYRVKSDSKGTSANSKGAGDSKDDVVVPAQED----GSLGQAQLEE 820

Query: 1789 QEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNKQVRHFTGTGNTPMMYPL 1848
            + V+GID YT + +   +   L  +  +   +I   LL  LN+Q     G    P +  L
Sbjct: 821  RSVWGIDCYTRSNVEHMLDLTLGLSKEQAQHWITTTLLPALNRQ-NGPRGVDMLPALRDL 879

Query: 1849 QPVIEEIEKE-----------------------------AVDDCDVRTMKMCRGILKAMD 1879
              VI + E E                                   +  +++     +A +
Sbjct: 880  CKVIPDGETEEELEASRARAEAEEFELGPKAEGEDLLAAQALLHAIEGLQLLHQEHRATE 939

Query: 1880 SRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIR--SLQKNNE 1937
            S     + ++ KG GV+C  + G   D FV E+LG++YP W+W EK   I    L+   +
Sbjct: 940  SLVRCYFHSHPKGTGVICKAKEGLKADTFVSEYLGDLYPSWRWNEKLSAIEEAKLKHGLK 999

Query: 1938 DPAPEFYNIYLERPKGDADGYDLVVVDAM-HKANYASRICHSCRPNCEAKVTAVDGHYQI 1996
               P+FYN  +ERPK DA G+ L+ V+A  H  N++S + HSC  NC    +  +G   +
Sbjct: 1000 PDLPDFYNFMMERPKEDARGFGLLHVEAGNHVGNFSSSLSHSCNSNCTTATSVRNGRLCV 1059

Query: 1997 GIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKEL 2056
             + T R I +GEE+T +Y ++T  + EY  +VCLCGS +C+ S++  TG  ++ K+L+ L
Sbjct: 1060 TLSTTRAIAFGEELTMNYGAITSCETEYGKAVCLCGSNLCQQSFMTFTGMDSYSKILRGL 1119

Query: 2057 HGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGL-PNWVVAYSARLVRFINLER 2115
              L+    L+  A +   +S  +   L + GL S  LG L P W+  ++A  +R++  ER
Sbjct: 1120 GPLMVFRGLIQSAAD-TPISSGELETLQKFGLKSSALGDLCPIWLKKFAAMQLRYVEFER 1178

Query: 2116 TKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMR-- 2173
             KLP  ++         Y S          A++++  V ++R+++L   L  V + ++  
Sbjct: 1179 RKLPPTLMASG---DHTYQS----------ADIESHQVMDRRIRSLVEVLSGVYHFLQEQ 1225

Query: 2174 ------------------------CVFGDPKKAP--PPVERLSPEETVSFLWKGEGSLVE 2207
                                        DP      PP++ L  +E V  LW G  S++ 
Sbjct: 1226 KTAHSRPLPEGFEPPAPAAGGRAGAANEDPATLAERPPLKLLVDDEVVEALWSGRQSMMR 1285

Query: 2208 ELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQ 2241
             L++ +      + +   K  ++  DP   EDIQ
Sbjct: 1286 RLLRRL------EAIYCAKLVVETPDP---EDIQ 1310


>gi|303285194|ref|XP_003061887.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226456298|gb|EEH53599.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 1561

 Score =  169 bits (428), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 82/227 (36%), Positives = 122/227 (53%), Gaps = 27/227 (11%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG+GVVC +  G     +V ++LGE+Y  W+W+E+QD I+  +   E   P+F+NI LER
Sbjct: 762  KGVGVVCIRPEGLPAGTYVNDYLGEIYAPWRWYERQDAIKKREPGKE--LPDFFNITLER 819

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDG------------------ 1992
            P  DA GYD + V+A H+  ++SR+ HSC PN       VD                   
Sbjct: 820  PAEDAAGYDTLFVEAAHRCTFSSRLSHSCAPNVHTVGVVVDASESGGDTSDDKAAEEKKA 879

Query: 1993 ------HYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGE 2046
                     I  YT R + YGEE+ ++Y+ VTES++EY A++CLC +  C+G++L+  G 
Sbjct: 880  RESDAAKLTIAQYTTRRVEYGEELCWNYSCVTESEKEYRAAICLCSAPTCKGAFLDYAGS 939

Query: 2047 GAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLL 2093
             AF  V+   H  LDR+ +++ AC    V+  D   L   G+ S  L
Sbjct: 940  SAFTVVMARRHNFLDRNAILMRACS-EPVTPADRALLAANGIKSSAL 985



 Score =  114 bits (286), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 105/369 (28%), Positives = 163/369 (44%), Gaps = 68/369 (18%)

Query: 2097 PNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQ 2156
            P W+V ++A  + +++LER  LP+ +L   L+          +  + + A   A GV   
Sbjct: 1166 PEWLVKWAALTLEYVDLERALLPDALLEQPLDG---------IAYDAAFASATAAGVVAT 1216

Query: 2157 RLQNLAVTLDKVRYVMRCVFGDPKKAPPPVER-LSPEETVSFLWKGEGSLV-----EELI 2210
            RLQN+ +TLDK++YV+R     P +   P  R L+ EE V  LW GE  ++     E  I
Sbjct: 1217 RLQNIIITLDKIKYVLR----QPGQCRAPFLRPLTEEEVVDHLWSGEHGVLKRAAEEATI 1272

Query: 2211 QCMAPHVEEDVLNDLKSKIQAHDPSGSED--------IQREL------------RKSLLW 2250
                         D      A  P  S D          R+L            R  LL 
Sbjct: 1273 AAKCKGALAKRQRDRGGGATAAPPKPSVDRPDAASCAALRDLLDGPRPRDAKAARAGLLT 1332

Query: 2251 LRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADK 2310
              + +R+ P        AAAD + +YA+ + +   +++  FTSPPV++ PL  G + A K
Sbjct: 1333 ASNILRDAPSGAH---AAAADALFMYAHVEHWCTPEKFNGFTSPPVHLEPLPPGERRA-K 1388

Query: 2311 L-----GADLQVYRKTYGE---------------NYCLGQLIFWHIQTNADPDCTLARAS 2350
            L     G    + +K Y                 ++  GQL+ W  QT  DP  +L+   
Sbjct: 1389 LPMFCKGDAANIAKKKYQARSISHWSPYDRVGVPHFAWGQLVSWFKQTVYDPSASLSAER 1448

Query: 2351 RGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIF 2410
            RG +SLPD  SFYA      R R       + ML  +E++P   WP    ++F++  +++
Sbjct: 1449 RGTMSLPDPESFYACAPGEYRKR-----ERKAMLKSLERKPDAMWPTTWSFSFRNPAKVY 1503

Query: 2411 GSPMLDSSL 2419
            GSP LD ++
Sbjct: 1504 GSPWLDDAI 1512


>gi|224002090|ref|XP_002290717.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220974139|gb|EED92469.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 3070

 Score =  166 bits (419), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 109/348 (31%), Positives = 158/348 (45%), Gaps = 36/348 (10%)

Query: 1785 QVFEQEVYGIDPYTH----NLLLDSMPDELDWNLLEKHLF-------------------- 1820
            +V EQEV+GID YT      L+      E+    LEK L                     
Sbjct: 760  EVAEQEVWGIDCYTRRNVMTLIETEFSSEIATEFLEKWLLPAINACPIDLAHKMSTAAKI 819

Query: 1821 IEDVLLRT-------LNKQVRHFTGTGNTPMMYPLQPVIEEIEKEAVDDCDVRTMKMC-R 1872
            +E + + T       ++ Q R  +   N P        +    +  +       +K   R
Sbjct: 820  LEGLPISTDTEDCPSISMQTRQNSPDKNKPKSPESSVFLRTALESKIKQFGPPWLKAAAR 879

Query: 1873 GILKAMDSRPDDK--YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIR 1930
             I  A DS  +D   +  + KG G V   E G   +  V  + GEVYP W+W EK D I 
Sbjct: 880  LIRLASDSLDEDDGFFRIHPKGHGSVVIGEEGLKANSLVTYYRGEVYPAWRWCEKLDAIE 939

Query: 1931 SLQKNN--EDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVT 1988
              QK        P+FYN+ +ERPK D  GY L+ VDA  K+   S   HSC P CE +V 
Sbjct: 940  LTQKQLGLRPNLPDFYNMAMERPKKDPRGYGLLFVDASRKSGLGSSFSHSCNPTCEVRVV 999

Query: 1989 AVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGA 2048
            A++G   + + T+R +  GEE+TFDYN+VTES  EY  ++CLCG + CRGS+L+      
Sbjct: 1000 ALNGKLSLSMTTLRDLEQGEELTFDYNAVTESLNEYRFAICLCGHKKCRGSFLHFATADC 1059

Query: 2049 FEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAGLGSCLLGGL 2096
            +++VL     +  R   ++       +S ED   L + G  +   G +
Sbjct: 1060 YQQVLSRNSPIAARFANLVRGSMKQVMSREDSELLLKHGFNTAAFGAV 1107


>gi|302839886|ref|XP_002951499.1| hypothetical protein VOLCADRAFT_117846 [Volvox carteri f.
            nagariensis]
 gi|300263108|gb|EFJ47310.1| hypothetical protein VOLCADRAFT_117846 [Volvox carteri f.
            nagariensis]
          Length = 1516

 Score =  159 bits (402), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 92/253 (36%), Positives = 146/253 (57%), Gaps = 20/253 (7%)

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
            +A++ASR+ HSC PNC A V +V+G   I +Y  R I  GEE+TFDY+SVTES++EY  +
Sbjct: 821  QASFASRMSHSCTPNCAAVVVSVNGRLTIAMYAKRRIEPGEELTFDYSSVTESEKEYREA 880

Query: 2028 VCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYLELGRAG 2087
            +CLCGS+ CRGSYL  +G  AF +V+++ H  L R  ++L A     + E D+  L    
Sbjct: 881  ICLCGSRNCRGSYLYYSGSTAFTQVMEQRHNFLHRQTILLRAST-EPLLESDWTRLKSHS 939

Query: 2088 LGSCLLG-------GLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICLE 2140
            LG   LG         P+W+V ++A ++ ++ LE+ +LP+ +L+   +  R         
Sbjct: 940  LGPTSLGDGGPGNNKAPDWLVKWAALVLEYVELEKRELPQVLLQLPPQLGR--------- 990

Query: 2141 VEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLWK 2200
                 A ++AE +   R+Q + +TLDKV+ V+R   G  + A  P+  LS  E V+ LW 
Sbjct: 991  YTSESAAIEAEAIAQNRVQQIVITLDKVKQVLRQP-GQLQTA--PMRLLSESEVVAHLWS 1047

Query: 2201 GEGSLVEELIQCM 2213
            G  S+ + +++ +
Sbjct: 1048 GSNSIAKRVLKAV 1060



 Score = 58.5 bits (140), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 34/95 (35%), Positives = 47/95 (49%), Gaps = 6/95 (6%)

Query: 2322 YGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVR 2381
            YG  +  GQL  W+ QT  DP  +L+   RG LSLPD+ S Y    K      Y  K   
Sbjct: 1392 YGPWFMWGQLSGWYKQTVYDPTASLSAERRGTLSLPDVESCYGARAK------YTFKDRA 1445

Query: 2382 FMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLD 2416
             +L  +E+ P   W     + F++  +I+GSPM D
Sbjct: 1446 AVLRHLEQCPDAQWKTSLPFGFRNDAKIYGSPMFD 1480


>gi|303286928|ref|XP_003062753.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455389|gb|EEH52692.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 1401

 Score =  147 bits (371), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 126/489 (25%), Positives = 210/489 (42%), Gaps = 83/489 (16%)

Query: 1790 EVYGIDPYTHNLLLDSM---PDELDWNLLEKHLFIEDVLLRTLNKQVRHFTGTGNTPMMY 1846
            E+ G+D YT   +L++M    ++    L E     E VL +   + +R     G  P ++
Sbjct: 183  ELEGVDAYTRERVLEAMTASAEDGGAGLSEDD--AEKVLAKVF-QTMRLRAVAGKDPGLH 239

Query: 1847 PLQPVIEEIEKEAVDDCDVRTMKMC------RGILKAMDSRPDDKYVAYRKGLGVVCNKE 1900
                 +  + K    D +   ++ C        +L  + ++   +  A  KG G+VC + 
Sbjct: 240  HAAKTVANVPKRTPRDTEDLNLRDCGWMEAPALVLAKLCAKEPKEIRARSKGHGLVCVRA 299

Query: 1901 GGF--GEDDFV----------VEFLGEVYPVWKWFEKQDGIRSLQKN--NEDPAPEFYNI 1946
             G   G   F+          V  +  +YP W+WFEK+   + ++++  +++  P FYN 
Sbjct: 300  DGIPKGASSFLAPPDWSPYDRVRVVNALYPGWRWFEKEVAAQRVRRDVRDDEDVPVFYNA 359

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
             +ER   D  GYD++ VD M K +  +R  HSC PN E +V   +G Y + + +   I  
Sbjct: 360  AVERDVADPKGYDMLFVDGMVKGSLLTRASHSCEPNAEMRVRVREGSYAVEMVSTCHIAR 419

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLK-ELHGLLDRHQL 2065
            GEE+ +DYN  T+S+ E + ++C CG++ CR SYL+  G+  F   L+   H       L
Sbjct: 420  GEEVCWDYNCQTDSEREMKRAICCCGAKRCRVSYLHYAGDDDFASYLRARQHVAATTAAL 479

Query: 2066 MLEACELNSVSEEDYL----------ELGRAGLG---------SCLLGGLPNWVVAYSAR 2106
            +  +C   S                 +L  AGL            +L GLP W + ++A 
Sbjct: 480  LRASCTSTSRPPPSSSSSITMSDIIRQLSDAGLKLGDASDDTERGVLSGLPEWTLRFAAS 539

Query: 2107 LVRFINLERTKLPEEILRHNLEEKRKYFSDICLEVEKS--------------------DA 2146
             V++I  E+  L   +    L +  K  +    +                        DA
Sbjct: 540  AVKYIADEKAALRVTLASQALVKAAKARAAAAKDGGAGGGDGSAAAAAAAKAAKAAHLDA 599

Query: 2147 EVQAEGVYNQRLQNLAVTLDKVRYVM-----------------RCVFGDPKKAPPPVERL 2189
            E +A GV   RLQ+L VTLDKVR+V+                 + V    + APPP++ L
Sbjct: 600  ESEASGVAAGRLQSLVVTLDKVRHVLSTDAAGGTDPVRGMANEKMVADGVRAAPPPLKAL 659

Query: 2190 SPEETVSFL 2198
            +    ++ L
Sbjct: 660  TAAHALTHL 668


>gi|147823106|emb|CAN66333.1| hypothetical protein VITISV_000601 [Vitis vinifera]
          Length = 333

 Score =  135 bits (341), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 63/80 (78%), Positives = 70/80 (87%), Gaps = 1/80 (1%)

Query: 1973 SRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCG 2032
            + ICH  RPNC+A +TAV+G YQI IYTVR I YGEEITFDYNSVTESK+EYE SVCLCG
Sbjct: 29   TTICHLRRPNCKA-ITAVEGQYQIRIYTVRQIQYGEEITFDYNSVTESKKEYEESVCLCG 87

Query: 2033 SQVCRGSYLNLTGEGAFEKV 2052
            SQVCR SYLNLTGEGAF+K+
Sbjct: 88   SQVCRMSYLNLTGEGAFQKL 107


>gi|147855182|emb|CAN83840.1| hypothetical protein VITISV_023231 [Vitis vinifera]
          Length = 533

 Score =  135 bits (341), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 89/206 (43%), Positives = 122/206 (59%), Gaps = 30/206 (14%)

Query: 1573 REEMMKSWKDESPAGLYSATSKYKKKLSKMVSERKYMNRSNGTSLANGDFDYGEYASDRE 1632
            +EE+ + WK+ESP+GL S+ SK+K KL+K+V+ERKY ++S          DYG+ ASD E
Sbjct: 157  KEEITRGWKNESPSGLRSSGSKHKNKLNKIVTERKYRSKSGS--------DYGQNASDGE 208

Query: 1633 IRKRLSKLNRKSLDSGSETSDDLDGSSEDGKSDSESTVSDTDSDMDFRSDGRARESRGAG 1692
            IR+RLSKLN+K +DS S++ +DLD SS        S     D  +  R+ G         
Sbjct: 209  IRRRLSKLNKKFMDSASDSCEDLDRSS----EGGSSGSEGYDQFVMERNPGF----NWLF 260

Query: 1693 DFTTDEGLDFSDDREWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDY 1752
             F T   L    +                 +KYEVI+QY IVADE++V+RKM+VSLPE +
Sbjct: 261  PFYTQNSLCVCSEE--------------FVQKYEVIEQYAIVADEDEVQRKMKVSLPEGH 306

Query: 1753 AEKLNAQKNGSEELDMELPEVKDYKP 1778
             EKL+AQKNG+EE DME+P +    P
Sbjct: 307  NEKLSAQKNGTEESDMEIPNLISGTP 332


>gi|147856971|emb|CAN81810.1| hypothetical protein VITISV_020891 [Vitis vinifera]
          Length = 682

 Score =  132 bits (333), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 71/128 (55%), Positives = 85/128 (66%), Gaps = 6/128 (4%)

Query: 683 RWETVENAVSPLVTVNFPSITSDSVTQLVSPPEASGNLLADTGDTAQSTG---EEFPVT- 738
           RW T++NA S LV VNFP   SD+VTQLVSPPEA GN LA+ GDT +S     EE P T 
Sbjct: 365 RWLTIKNAASLLVPVNFPPFVSDTVTQLVSPPEAPGNPLAEAGDTTESNKLLEEETPATS 424

Query: 739 LQSQCCPDGSAAAAESSEDLHIDVRVGALLDGFTVIPGKEIETLGEILQTTFERVDWQNN 798
           LQS  C + ++ A+E  E L ID RV ALL  F  IPGKE+ETLGE+LQ +FE   W+  
Sbjct: 425 LQSMSCNNDNSIASEPLEGLQIDERVRALLKSFAFIPGKELETLGEVLQASFEHAQWEKL 484

Query: 799 G--GPTWH 804
           G  G +WH
Sbjct: 485 GAEGLSWH 492


>gi|224095774|ref|XP_002310474.1| hypothetical protein POPTRDRAFT_562330 [Populus trichocarpa]
 gi|222853377|gb|EEE90924.1| hypothetical protein POPTRDRAFT_562330 [Populus trichocarpa]
          Length = 80

 Score =  117 bits (293), Expect = 9e-23,   Method: Composition-based stats.
 Identities = 48/64 (75%), Positives = 57/64 (89%)

Query: 2323 GENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVRF 2382
            G  YC+GQLIFWH+QTN +PD TLA+AS+GCLSLP+IGSFYAKVQKPS+ R+YGPKTV+ 
Sbjct: 3    GAIYCMGQLIFWHVQTNTEPDFTLAKASKGCLSLPEIGSFYAKVQKPSQQRIYGPKTVKM 62

Query: 2383 MLSR 2386
            ML R
Sbjct: 63   MLER 66


>gi|145485412|ref|XP_001428714.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124395802|emb|CAK61316.1| unnamed protein product [Paramecium tetraurelia]
          Length = 844

 Score = 94.4 bits (233), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 117/554 (21%), Positives = 222/554 (40%), Gaps = 99/554 (17%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGI-RSLQKNN-----EDPAPEFY 1944
            KG G+VC    G   ++F+  + GEVY   +WFEKQ    + +Q  N     + P  EF+
Sbjct: 291  KGKGMVCCLNEGLAGNEFICFYFGEVYTPQRWFEKQTIFHKRMQDGNRKTCSQSPYAEFF 350

Query: 1945 NIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGI 2004
             I+ +      + +    +D     N A  I +SC PNC      V+    + + T R I
Sbjct: 351  -IHDDLLVMFKNRFKF--IDPTRYGNMAQHISYSCDPNCRLIAVTVNQQNLLAVITSRKI 407

Query: 2005 HYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQ 2064
            +Y EE+T  +   ++ +       CLCGS  C+    NL  E A +  +      ++R+ 
Sbjct: 408  NYFEELTLPFPYTSQDQ-------CLCGSIHCKRKQ-NLELENAHQ--ISIYSNYIERNV 457

Query: 2065 LMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAYSA--RLVRFINL----ERTKL 2118
            ++L++  + S + ++                +P W+  +        +IN+    ++ K 
Sbjct: 458  ILLQSTLITSQNTQN---------------DIPEWLSNWQELNHQQNYINILSCVDKVKF 502

Query: 2119 PEEILRHNLEEKRKYFSDICLEVEKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGD 2178
               +L+H    +   F                  ++NQ  +N      K++         
Sbjct: 503  ---VLQHLKTIQPPIFL--------------VTNIFNQFWKNCETNTQKIQ--------- 536

Query: 2179 PKKAPPPVERLSPEETVSFLWKGEGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSE 2238
                   +E     E V FL +       +L QC    +  +++N +K  I       + 
Sbjct: 537  -------MESSIMNEIVVFLKRH-----SQLHQC---QIGLEIINQMKKII-----DQNT 576

Query: 2239 DIQRELRKSLLWLRDE-VRNL-PCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPV 2296
            D   +L + L  L  E + N+  C++   + A + +++  ++T  +F   +Y+ F   P 
Sbjct: 577  DYALQLTRMLFLLLSEIILNIESCSF--NNKAFSTILYFMSFTHTYFSSTQYQGFDGKPF 634

Query: 2297 YISPLDLGPKYADKLGADLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSL 2356
              +  +  P+  +K    L    K Y   +  GQLI W+ QT  +P  ++A+  RG L  
Sbjct: 635  EETEFEYIPQPKNKSKLSL---SKQYTPQFIWGQLINWNKQTLQNPQSSMAQERRGVLCY 691

Query: 2357 PDIGSFYAKVQKPSRHRVYG-PKTVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPML 2415
            P +   +    K   ++     K + +  S+ E QP         W++K+   I+G+   
Sbjct: 692  PSLLLSFDNKHKTFPYQCKTREKYLEYFQSKKEIQPDLS-----TWSYKNQHNIYGTIFF 746

Query: 2416 DSSLTGCPLDREMV 2429
            +   +   +  + V
Sbjct: 747  EQYFSLSKVGEDFV 760


>gi|255086485|ref|XP_002509209.1| set domain protein [Micromonas sp. RCC299]
 gi|226524487|gb|ACO70467.1| set domain protein [Micromonas sp. RCC299]
          Length = 283

 Score = 93.6 bits (231), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 61/238 (25%), Positives = 112/238 (47%), Gaps = 21/238 (8%)

Query: 2204 SLVEELIQCMAPH-------------VEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLW 2250
            S+V+ L+Q M PH             V     N L  +I         D ++ L+  L+ 
Sbjct: 7    SMVQGLLQAMKPHARSAKDDDDHSAEVRHAEFNALVEQIARESTEPENDPEKSLKSGLIR 66

Query: 2251 LRDEVRNLPCTYKCRHDAAADLIHIYAYTKCFFRVQ---EYKAFTSPPVYISPLDLGPKY 2307
            LRD + ++P T   RHD AA+L+H++A+T+ ++ V+    + AFT+  + +   ++    
Sbjct: 67   LRDALASMPSTPSARHDVAAELVHLHAHTRRYWSVRRGDHHGAFTAEEIPVRENEVNSFG 126

Query: 2308 ADKLGADLQVYRKT---YGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYA 2364
                GA  Q+ ++    Y      G L+ W+ Q  +DP   +    RGC+ +PD+   Y+
Sbjct: 127  IGAEGASEQIVKQVRPEYKAGTAGGALLVWYKQEMSDPLQWVNANRRGCVVIPDVSCAYS 186

Query: 2365 KVQKPSRHRVYGPKTVRFMLSRMEKQPQRPWPKDR-IWAFKSSPRIFGSPMLDSSLTG 2421
                 +  +  G +     L+ + + P+ PWP+    W   ++ R+ GSP+LD+ +  
Sbjct: 187  PRPGVAVAKC-GAREREAWLAHLAEHPEDPWPQHTGPWGPANAQRLIGSPVLDAFMAA 243


>gi|361069841|gb|AEW09232.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
 gi|383148794|gb|AFG56252.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
 gi|383148795|gb|AFG56253.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
 gi|383148796|gb|AFG56254.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
 gi|383148797|gb|AFG56255.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
 gi|383148798|gb|AFG56256.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
 gi|383148799|gb|AFG56257.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
 gi|383148800|gb|AFG56258.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
 gi|383148801|gb|AFG56259.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
 gi|383148802|gb|AFG56260.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
 gi|383148803|gb|AFG56261.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
 gi|383148804|gb|AFG56262.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
 gi|383148805|gb|AFG56263.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
 gi|383148806|gb|AFG56264.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
 gi|383148807|gb|AFG56265.1| Pinus taeda anonymous locus UMN_839_01 genomic sequence
          Length = 82

 Score = 91.3 bits (225), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 43/59 (72%), Positives = 51/59 (86%)

Query: 1706 REWGARMTKASLVPPVTRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSE 1764
            REWGARMT ASLVPPVTRKYEVI++YV+VADE++V RKMRV LP+DY +KL A K+  E
Sbjct: 15   REWGARMTSASLVPPVTRKYEVIEEYVVVADEDEVSRKMRVCLPKDYEKKLAAAKDRRE 73


>gi|170573421|ref|XP_001892464.1| SET domain containing protein [Brugia malayi]
 gi|158601976|gb|EDP38706.1| SET domain containing protein [Brugia malayi]
          Length = 1603

 Score = 91.3 bits (225), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 53/151 (35%), Positives = 82/151 (54%), Gaps = 19/151 (12%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+    +    +  F+ E++GEV  +  +  +       Q+N+       Y + L   
Sbjct: 919  GCGIGVKTDVNIDKGQFICEYIGEVVSMETFNIRSRTDYRYQRNH-------YALNL--- 968

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
                 G+   VVDA HK N A  I HSC PNCE +  +V+GHY+IG++ +RGIH GEE+T
Sbjct: 969  ---CPGF---VVDAYHKGNIARFINHSCAPNCEMQRWSVNGHYRIGLFALRGIHEGEELT 1022

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DYN   ++ E  + ++C CG+  CR  +LN
Sbjct: 1023 YDYN--WDAFEFDDVTICCCGAXNCR-HFLN 1050


>gi|402585708|gb|EJW79647.1| hypothetical protein WUBG_09444, partial [Wuchereria bancrofti]
          Length = 511

 Score = 89.4 bits (220), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 53/151 (35%), Positives = 82/151 (54%), Gaps = 19/151 (12%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+    +    +  F+ E++GEV  +  +  +       Q+N+       Y + L   
Sbjct: 191  GCGIGVKTDVNIDKGQFICEYIGEVVSMETFNIRSRTDYRYQRNH-------YALNL--- 240

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
                 G+   VVDA HK N A  I HSC PNCE +  +V+GHY+IG++ +RGIH GEE+T
Sbjct: 241  ---CPGF---VVDAYHKGNIARFINHSCAPNCEMQRWSVNGHYRIGLFALRGIHEGEELT 294

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DYN   ++ E  + ++C CG+  CR  +LN
Sbjct: 295  YDYN--WDAFEFDDVTICCCGAPNCR-HFLN 322


>gi|312116897|ref|XP_003151349.1| hypothetical protein LOAG_15812 [Loa loa]
 gi|307753486|gb|EFO12720.1| hypothetical protein LOAG_15812 [Loa loa]
          Length = 213

 Score = 88.6 bits (218), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 41/81 (50%), Positives = 57/81 (70%), Gaps = 3/81 (3%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            VVDA HK N A  I HSC PNCE +  +V+GHY+IG++ +RGIH GEE+T+DYN   ++ 
Sbjct: 70   VVDAYHKGNIARFINHSCAPNCEMQRWSVNGHYRIGLFALRGIHEGEELTYDYN--WDAF 127

Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
            E  + ++C CG+  CR  +LN
Sbjct: 128  EFDDVTICCCGAPNCR-HFLN 147


>gi|145521184|ref|XP_001446447.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124413925|emb|CAK79050.1| unnamed protein product [Paramecium tetraurelia]
          Length = 828

 Score = 88.6 bits (218), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 57/181 (31%), Positives = 90/181 (49%), Gaps = 19/181 (10%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGI-RSLQKNNEDPAPEFYNIYLE 1949
            KG GVVC    GF  ++F+  + GEVY   +WFEKQ    + +Q  N      F + Y E
Sbjct: 283  KGKGVVCCNFDGFVTNEFINFYFGEVYTPQRWFEKQTVFNKRMQDGNRKSG--FQSPYAE 340

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
                D    +L+ +D     N A  I +SC PNC+     ++  YQ+ I+T++ I+Y EE
Sbjct: 341  FHIND----ELLFIDPTRYGNIALHISYSCDPNCKFVTVQINSSYQLAIFTLKKINYLEE 396

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELH-GLLDRHQLMLE 2068
            +T  + S +         +CLCGS  C+     L+   AF   L + +   + R+ L+L+
Sbjct: 397  LTLPFPSTSN-------DLCLCGSIYCK----RLSQLEAFNNRLTQNYPNYIQRNALLLQ 445

Query: 2069 A 2069
            +
Sbjct: 446  S 446



 Score = 49.7 bits (117), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 35/162 (21%), Positives = 73/162 (45%), Gaps = 21/162 (12%)

Query: 2266 HDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKLGADLQVYRKTYGEN 2325
            ++  + +++  ++T  +F   +Y+ F   P   +  +  P+  +K    L    K Y   
Sbjct: 588  NEGLSTILYFMSFTHTYFSSTQYEGFNGKPFEENEFEYIPQPKNKQKLAL---SKMYTPQ 644

Query: 2326 YCLGQLIFWHIQTNADPDCTLARASRGCLSLPD-IGSFYAKVQKPSRHRVYG------PK 2378
            Y  GQLI W+ QT  +P  ++A+  RG L  P  I SF       ++H+++        K
Sbjct: 645  YIWGQLINWNKQTLQNPQSSMAQERRGVLCYPSLILSF------DNKHKLFPYQCKTREK 698

Query: 2379 TVRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSLT 2420
             + +  ++ + QP        IW++K+   ++G+   +   +
Sbjct: 699  FLEYFYTKSDIQPDL-----SIWSYKNQYNVYGTIFFEQCFS 735


>gi|296085302|emb|CBI29034.3| unnamed protein product [Vitis vinifera]
          Length = 195

 Score = 85.9 bits (211), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 40/77 (51%), Positives = 51/77 (66%), Gaps = 5/77 (6%)

Query: 1043 LGEWYYLDGAGHERGPSSFSELQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVR 1102
              +WYYLDGAGHE+ PSSFSELQ LVDQ  IQKH+SV  K +K+W+P+TFA +   + V 
Sbjct: 111  FSDWYYLDGAGHEQWPSSFSELQSLVDQDSIQKHSSVLGKINKIWIPITFAADVPDAAV- 169

Query: 1103 NHGEKIMPSGDSSGLPP 1119
                KI P    + + P
Sbjct: 170  ----KIQPQNKVTFIEP 182



 Score = 75.9 bits (185), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 45/115 (39%), Positives = 65/115 (56%), Gaps = 8/115 (6%)

Query: 744 CPDGSAAAAESSEDLHIDVRVGALLDGFTVIPGKEIETLGEILQTTFERVDWQNNG--GP 801
           C + ++ A+E  E L ID RV ALL  FT IPG+E+ETLGE+LQ +FE   W+  G  G 
Sbjct: 3   CNNDNSIASEPLEGLQIDERVRALLKSFTFIPGRELETLGEVLQASFEHAQWEKLGAEGL 62

Query: 802 TWHGACVGEQKPGDQKVDELY-ISDTKMKEAAELK---SGDKDHWVVCFDSDEWF 852
           +WH   +G Q   DQ++D  +   +   KEA + +     DKD+     D  +W+
Sbjct: 63  SWHQLRIGGQP--DQRIDRFFRYPEITSKEALDSRLSTFSDKDYAFAFGDFSDWY 115


>gi|328865276|gb|EGG13662.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 1418

 Score = 85.9 bits (211), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 56/155 (36%), Positives = 74/155 (47%), Gaps = 26/155 (16%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            + A +KG G+        G   F++E+ GEV    K  E+     S +         FY 
Sbjct: 1063 FSAEKKGWGL--KAVDNIGAKTFIIEYCGEVISKQKCLERMTESESEK--------YFYF 1112

Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
            + L+R         L  +DA  K N A  I HSC PNCE +   VDG  +IGI+ +R I 
Sbjct: 1113 LTLDR---------LECLDASRKGNLARFINHSCDPNCETQKWNVDGEVRIGIFAIRDIK 1163

Query: 2006 YGEEITFDYNSVTESKEEYEAS--VCLCGSQVCRG 2038
             GEE+TFDYN      E +  S  VC CG+  CRG
Sbjct: 1164 RGEELTFDYNY-----ERFGTSKQVCYCGAANCRG 1193


>gi|308487582|ref|XP_003105986.1| CRE-SET-2 protein [Caenorhabditis remanei]
 gi|308254560|gb|EFO98512.1| CRE-SET-2 protein [Caenorhabditis remanei]
          Length = 1505

 Score = 85.5 bits (210), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 54/146 (36%), Positives = 79/146 (54%), Gaps = 17/146 (11%)

Query: 1903 FGEDDFVVEFLGEVYPVWKWFEK---QDGIRSLQKNNEDPAPEFYNI---YLERPKGDAD 1956
              +D+ +VE++G+   V++ F        IRSL  +  + A E   I   YL R   ++ 
Sbjct: 1371 IAQDEMIVEYIGQTVIVFQNFSSILFHLQIRSLVADEREKAYERRGIGSSYLFRIDENS- 1429

Query: 1957 GYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNS 2016
                 V+DA  + N+A  I HSC+PNC AKV  ++G  +I IY+   I+ GEEIT+DY  
Sbjct: 1430 -----VIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIYSRSVINKGEEITYDYKF 1484

Query: 2017 VTESKEEYEASVCLCGSQVCRGSYLN 2042
              E     +   CLCG++ CRG YLN
Sbjct: 1485 PIED----DKIDCLCGAKACRG-YLN 1505


>gi|422293956|gb|EKU21256.1| set domain protein, partial [Nannochloropsis gaditana CCMP526]
          Length = 92

 Score = 82.4 bits (202), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 45/103 (43%), Positives = 57/103 (55%), Gaps = 14/103 (13%)

Query: 1936 NEDPA-PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHY 1994
            N  PA P+FYNI LERP+GDA+GY              + + H        +V    G  
Sbjct: 3    NLKPALPDFYNILLERPRGDANGYG--------PPGAGADLVHG-----HQQVVGQAGKL 49

Query: 1995 QIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
             I + + R I YGEE+T DY S T S+EEY A+VCLCGS +CR
Sbjct: 50   TIAVCSDREIAYGEELTMDYCSFTHSEEEYLAAVCLCGSHICR 92


>gi|19075312|ref|NP_587812.1| histone lysine methyltransferase Set1 [Schizosaccharomyces pombe
            972h-]
 gi|74698592|sp|Q9Y7R4.1|SET1_SCHPO RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component set1; AltName:
            Full=Lysine N-methyltransferase 2; AltName: Full=SET
            domain-containing protein 1; AltName: Full=Set1 complex
            component set1; Short=Set1C component set1; AltName:
            Full=Spset1
 gi|4704279|emb|CAB41652.1| histone lysine methyltransferase Set1 [Schizosaccharomyces pombe]
          Length = 920

 Score = 82.0 bits (201), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 51/141 (36%), Positives = 74/141 (52%), Gaps = 26/141 (18%)

Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL---V 1961
            ++D V+E++GE+            IR    +N +        Y+    GD+  + +   V
Sbjct: 803  KNDMVIEYIGEI------------IRQRVADNREKN------YVREGIGDSYLFRIDEDV 844

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            +VDA  K N A  I HSC PNC A++  V+G  +I IY  R I +GEE+T+DY    +  
Sbjct: 845  IVDATKKGNIARFINHSCAPNCIARIIRVEGKRKIVIYADRDIMHGEELTYDY----KFP 900

Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
            EE +   CLCG+  CRG YLN
Sbjct: 901  EEADKIPCLCGAPTCRG-YLN 920


>gi|147863201|emb|CAN80485.1| hypothetical protein VITISV_032461 [Vitis vinifera]
          Length = 508

 Score = 81.6 bits (200), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 39/68 (57%), Positives = 50/68 (73%), Gaps = 2/68 (2%)

Query: 1237 DASFPGEESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAF--ASLTCRHWRAAVRFY 1294
            +A+F  E+   + + S  WGLLDG  LA VFHFL++D+KSL F  A+LTC H RAAVRF+
Sbjct: 178  NATFYQEDIVLAEMGSENWGLLDGDVLARVFHFLKTDVKSLVFFLAALTCEHRRAAVRFF 237

Query: 1295 KGISRQVD 1302
            KG+ RQVD
Sbjct: 238  KGVPRQVD 245


>gi|341896007|gb|EGT51942.1| hypothetical protein CAEBREN_26218 [Caenorhabditis brenneri]
          Length = 1670

 Score = 81.3 bits (199), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 53/144 (36%), Positives = 77/144 (53%), Gaps = 28/144 (19%)

Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYD 1959
              +D+ ++E++G+             IRSL  +  + A E   I   YL R     D + 
Sbjct: 1551 IAQDEMIIEYIGQ------------KIRSLVADEREKAYERRGIGSSYLFR----IDEH- 1593

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYN-SVT 2018
              V+DA  + N+A  I HSC+PNC AKV  ++G  +I IY+   I+ GEEIT+DY   + 
Sbjct: 1594 -TVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIYSRSTINKGEEITYDYKFPIE 1652

Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
            E K +     CLCG++ CRG YLN
Sbjct: 1653 EDKID-----CLCGAKTCRG-YLN 1670


>gi|213402529|ref|XP_002172037.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
            yFS275]
 gi|212000084|gb|EEB05744.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
            yFS275]
          Length = 977

 Score = 81.3 bits (199), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 46/96 (47%), Positives = 55/96 (57%), Gaps = 11/96 (11%)

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            YL R   DA      +VDA  K N A  I HSC PNC AK+  V+GH +I IY  R I  
Sbjct: 893  YLFRIDKDA------IVDATKKGNIARFINHSCAPNCIAKIIRVEGHQKIVIYADRDIEE 946

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    +  EE +   CLCG+  CRG YLN
Sbjct: 947  GEELTYDY----KFPEEVDKIPCLCGAPTCRG-YLN 977


>gi|390342260|ref|XP_003725626.1| PREDICTED: uncharacterized protein LOC578079 isoform 1
            [Strongylocentrotus purpuratus]
          Length = 3023

 Score = 80.9 bits (198), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 49/153 (32%), Positives = 79/153 (51%), Gaps = 20/153 (13%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            +    KG G+   +E    +++FV+E++GEV    ++  +       ++ ++D    FY 
Sbjct: 1668 FYTEEKGHGLKAKEE--LKDNEFVMEYVGEVLNFHEFKHR------AKQYSKDKNLHFYF 1719

Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
            + L   K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T R + 
Sbjct: 1720 MAL---KSDE------IIDATEKGNVSRFMNHSCDPNCETQKWTVNGQLRVGFFTKRQVK 1770

Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
             GEE+TFDY      +   EA  CLCGS+ CRG
Sbjct: 1771 PGEELTFDYQFEVYGQ---EAQKCLCGSEKCRG 1800


>gi|390342258|ref|XP_783359.3| PREDICTED: uncharacterized protein LOC578079 isoform 2
            [Strongylocentrotus purpuratus]
          Length = 3024

 Score = 80.9 bits (198), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 49/153 (32%), Positives = 79/153 (51%), Gaps = 20/153 (13%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            +    KG G+   +E    +++FV+E++GEV    ++  +       ++ ++D    FY 
Sbjct: 1668 FYTEEKGHGLKAKEE--LKDNEFVMEYVGEVLNFHEFKHR------AKQYSKDKNLHFYF 1719

Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
            + L   K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T R + 
Sbjct: 1720 MAL---KSDE------IIDATEKGNVSRFMNHSCDPNCETQKWTVNGQLRVGFFTKRQVK 1770

Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
             GEE+TFDY      +   EA  CLCGS+ CRG
Sbjct: 1771 PGEELTFDYQFEVYGQ---EAQKCLCGSEKCRG 1800


>gi|198418893|ref|XP_002124393.1| PREDICTED: similar to SET domain containing 2 [Ciona intestinalis]
          Length = 2228

 Score = 80.5 bits (197), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 56/162 (34%), Positives = 80/162 (49%), Gaps = 25/162 (15%)

Query: 1882 PDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSL--QKNNEDP 1939
            P + +    KG G+   +    G    V+E+ GEV  + ++     G RSL   + N+  
Sbjct: 1064 PTEVFQTKWKGWGIRATENLSPGM--LVMEYCGEVLDLQEF-----GRRSLLYSRGNQQ- 1115

Query: 1940 APEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIY 1999
               FY + L + +         ++DA  K N +  I HSC PNCE +   V+G  ++G +
Sbjct: 1116 --HFYFMALSQDE---------IIDATTKGNTSRFINHSCDPNCETQKWTVNGRLRVGFF 1164

Query: 2000 TVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            T+R I+ GEEITFDY      K   EA  C CGS  CRG YL
Sbjct: 1165 TMRDINKGEEITFDYQFQRYGK---EAQACYCGSSNCRG-YL 1202


>gi|326488341|dbj|BAJ93839.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 1070

 Score = 79.7 bits (195), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 56/164 (34%), Positives = 82/164 (50%), Gaps = 22/164 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   +E    E  F++E++GEV  +  +  +Q    S  K +      FY + L 
Sbjct: 353  KKGYGLQLLEE--VSEGRFLIEYVGEVLDITTYESRQRDYASKGKKH------FYFMAL- 403

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
                  DG +  V+DA  K N    I HSC PNC  +   V+G   IGI+ +R I  GEE
Sbjct: 404  ------DGGE--VIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFAMRNIKKGEE 455

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL--NLTGEGAFEK 2051
            +TFDYN V  S    +   C CG+  CRG Y+  +++G G   +
Sbjct: 456  LTFDYNYVRVSGAAPQK--CFCGTAKCRG-YIGGDISGSGIITQ 496


>gi|340372263|ref|XP_003384664.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
            [Amphimedon queenslandica]
          Length = 1171

 Score = 79.7 bits (195), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 56/177 (31%), Positives = 86/177 (48%), Gaps = 24/177 (13%)

Query: 1879 DSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNED 1938
            +S P   +    +G G+   +    G  DFV+E++GE+  +    E+      L+K  E 
Sbjct: 774  ESVPTQTFYTGNRGWGLKTMRSLSPG--DFVIEYVGEIVDMAAVQER------LKKTQEA 825

Query: 1939 PAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
                FY + LER          +++DA  K+N+A  I HSC PNCE +   V+G  +IGI
Sbjct: 826  SVSSFYFLTLERN---------LIIDARVKSNHARFINHSCDPNCETQKWTVNGETRIGI 876

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKE 2055
            + ++ I    E+TFDY       E+     CLCG+Q C G      GE   ++ LK+
Sbjct: 877  FAIKDIKEDTELTFDYQFDCLGNEK---KACLCGAQNCSG----FLGEKPKQEKLKQ 926


>gi|17552320|ref|NP_498039.1| Protein SET-2, isoform c [Caenorhabditis elegans]
 gi|351058302|emb|CCD65736.1| Protein SET-2, isoform c [Caenorhabditis elegans]
          Length = 1510

 Score = 79.3 bits (194), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 54/145 (37%), Positives = 72/145 (49%), Gaps = 28/145 (19%)

Query: 1902 GFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGY 1958
                D+ +VE++G+             IRSL     + A E   I   YL R        
Sbjct: 1390 SIAPDEMIVEYIGQT------------IRSLVAEEREKAYERRGIGSSYLFR-------I 1430

Query: 1959 DLV-VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSV 2017
            DL  V+DA  + N+A  I HSC+PNC AKV  ++G  +I IY+   I  GEEIT+DY   
Sbjct: 1431 DLHHVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIYSRTIIKKGEEITYDYKFP 1490

Query: 2018 TESKEEYEASVCLCGSQVCRGSYLN 2042
             E     +   CLCG++ CRG YLN
Sbjct: 1491 IED----DKIDCLCGAKTCRG-YLN 1510


>gi|17552318|ref|NP_498040.1| Protein SET-2, isoform a [Caenorhabditis elegans]
 gi|30173238|sp|Q18221.2|SET2_CAEEL RecName: Full=Probable histone-lysine N-methyltransferase set-2;
            AltName: Full=SET domain-containing protein 2
 gi|351058300|emb|CCD65734.1| Protein SET-2, isoform a [Caenorhabditis elegans]
          Length = 1507

 Score = 79.3 bits (194), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 54/145 (37%), Positives = 72/145 (49%), Gaps = 28/145 (19%)

Query: 1902 GFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGY 1958
                D+ +VE++G+             IRSL     + A E   I   YL R        
Sbjct: 1387 SIAPDEMIVEYIGQT------------IRSLVAEEREKAYERRGIGSSYLFR-------I 1427

Query: 1959 DLV-VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSV 2017
            DL  V+DA  + N+A  I HSC+PNC AKV  ++G  +I IY+   I  GEEIT+DY   
Sbjct: 1428 DLHHVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIYSRTIIKKGEEITYDYKFP 1487

Query: 2018 TESKEEYEASVCLCGSQVCRGSYLN 2042
             E     +   CLCG++ CRG YLN
Sbjct: 1488 IED----DKIDCLCGAKTCRG-YLN 1507


>gi|17552316|ref|NP_498041.1| Protein SET-2, isoform b [Caenorhabditis elegans]
 gi|351058301|emb|CCD65735.1| Protein SET-2, isoform b [Caenorhabditis elegans]
          Length = 739

 Score = 79.0 bits (193), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 54/145 (37%), Positives = 72/145 (49%), Gaps = 28/145 (19%)

Query: 1902 GFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGY 1958
                D+ +VE++G+             IRSL     + A E   I   YL R        
Sbjct: 619  SIAPDEMIVEYIGQT------------IRSLVAEEREKAYERRGIGSSYLFR-------I 659

Query: 1959 DLV-VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSV 2017
            DL  V+DA  + N+A  I HSC+PNC AKV  ++G  +I IY+   I  GEEIT+DY   
Sbjct: 660  DLHHVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIYSRTIIKKGEEITYDYKFP 719

Query: 2018 TESKEEYEASVCLCGSQVCRGSYLN 2042
             E     +   CLCG++ CRG YLN
Sbjct: 720  IED----DKIDCLCGAKTCRG-YLN 739


>gi|189237403|ref|XP_973596.2| PREDICTED: similar to AGAP011688-PA [Tribolium castaneum]
 gi|270007628|gb|EFA04076.1| hypothetical protein TcasGA2_TC014310 [Tribolium castaneum]
          Length = 1569

 Score = 79.0 bits (193), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 52/157 (33%), Positives = 77/157 (49%), Gaps = 20/157 (12%)

Query: 1882 PDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAP 1941
            P + +   +KGLG+       +GE  F++E++GEV    ++  + D   +      D   
Sbjct: 574  PVEVFKTEKKGLGLRAAANIPYGE--FILEYVGEVLDPEEFDNRADDYSN------DKNK 625

Query: 1942 EFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTV 2001
             +Y + L   + DA      ++DA  K N +  I HSC PN E +   V+G  +IG ++ 
Sbjct: 626  HYYFMSL---RADA------IIDATMKGNISRFINHSCDPNAETQKWTVNGELRIGFFST 676

Query: 2002 RGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            R I  GEEITFDY      K   EA  C C S +CRG
Sbjct: 677  RTILAGEEITFDYRFQRYGK---EAQKCYCESSLCRG 710


>gi|25395700|pir||H88444 protein C26E6.12 [imported] - Caenorhabditis elegans
          Length = 1802

 Score = 78.6 bits (192), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 54/144 (37%), Positives = 72/144 (50%), Gaps = 28/144 (19%)

Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYD 1959
               D+ +VE++G+             IRSL     + A E   I   YL R        D
Sbjct: 1683 IAPDEMIVEYIGQT------------IRSLVAEEREKAYERRGIGSSYLFR-------ID 1723

Query: 1960 LV-VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            L  V+DA  + N+A  I HSC+PNC AKV  ++G  +I IY+   I  GEEIT+DY    
Sbjct: 1724 LHHVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIYSRTIIKKGEEITYDYKFPI 1783

Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
            E     +   CLCG++ CRG YLN
Sbjct: 1784 ED----DKIDCLCGAKTCRG-YLN 1802


>gi|291227185|ref|XP_002733567.1| PREDICTED: HSPC069-like [Saccoglossus kowalevskii]
          Length = 2376

 Score = 78.2 bits (191), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 52/151 (34%), Positives = 73/151 (48%), Gaps = 26/151 (17%)

Query: 1891 KGLGV-VCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            KG G+  C       E  FV+E++GEV     + E +   +   K+N      +Y + L 
Sbjct: 1175 KGFGLRTC---AEIPEGKFVLEYVGEV---LNYSEFKSRTKHYNKDNRK---HYYFMALT 1225

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              +         ++DA  K N +  I HSC PNCE +   V+GH ++G +T R I  GEE
Sbjct: 1226 SDE---------IIDATKKGNVSRFINHSCDPNCETQKWTVNGHIRVGFFTKRAIPAGEE 1276

Query: 2010 ITFDYNSVTESKEEY--EASVCLCGSQVCRG 2038
            +TFDY       E Y  EA  C CG+  CRG
Sbjct: 1277 LTFDYQF-----ERYGKEAQKCYCGASNCRG 1302


>gi|344241969|gb|EGV98072.1| putative histone-lysine N-methyltransferase ASH1L [Cricetulus
            griseus]
          Length = 1546

 Score = 77.8 bits (190), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV        +Q               EF
Sbjct: 1239 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEV------VSEQ---------------EF 1275

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 1276 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 1335

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 1336 YALKDVLAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 1373


>gi|384499018|gb|EIE89509.1| hypothetical protein RO3G_14220 [Rhizopus delemar RA 99-880]
          Length = 962

 Score = 77.8 bits (190), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 53/156 (33%), Positives = 74/156 (47%), Gaps = 30/156 (19%)

Query: 1893 LGVVCNKEGGFG--------EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFY 1944
            + V+  ++ GFG         + F++E++GEV P       Q+ IR   +  E  A    
Sbjct: 168  VDVIRTEKKGFGLRALTDLPTNSFIMEYIGEVIP------NQEFIR---RTKEYEASGLE 218

Query: 1945 NIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGI 2004
            + Y    K D       ++DA  K   A  I HSC PNC  +   V  + +IGI+T RGI
Sbjct: 219  HYYFMTLKTDE------IIDATKKGCLARFINHSCNPNCVTQKWVVGKNMRIGIFTNRGI 272

Query: 2005 HYGEEITFDYNSVTESKEEY--EASVCLCGSQVCRG 2038
              GEE+TFDY       E Y  +A VC CG   C+G
Sbjct: 273  KAGEELTFDYKF-----ERYGAQAQVCYCGEFACKG 303


>gi|328856222|gb|EGG05344.1| hypothetical protein MELLADRAFT_78094 [Melampsora larici-populina
            98AG31]
          Length = 1098

 Score = 77.8 bits (190), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 55/180 (30%), Positives = 81/180 (45%), Gaps = 27/180 (15%)

Query: 1864 DVRTMKMCRGI-LKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKW 1922
            + R ++MC+    +     P +     RKG GV    +     D FV E++GEV      
Sbjct: 265  ECRCLQMCQNQRFQKRQYAPIEIVATERKGFGV--RLKSDVPADSFVYEYIGEVV----- 317

Query: 1923 FEKQDGIRSLQKNNEDPAPE----FYNIYLERPKGDADGYDLVVVDAMHKANYASRICHS 1978
                 G ++ Q+  ++ A E    FY + L+R +          +DA  K      + HS
Sbjct: 318  -----GEKAFQRRIKEYAQEGLKHFYFMQLQREE---------YIDATKKGGLGRFLNHS 363

Query: 1979 CRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            C PNC      V  H ++GI+T R +  GEE+TF+YN V    + YEA  C CG   C G
Sbjct: 364  CNPNCYIGKWVVGRHLRMGIFTKRAVKGGEELTFNYN-VDRYGQVYEAQECFCGEAQCVG 422


>gi|324507672|gb|ADY43247.1| Histone-lysine N-methyltransferase set-2 [Ascaris suum]
          Length = 539

 Score = 77.4 bits (189), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 51/78 (65%), Gaps = 3/78 (3%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            V+DA +  N+A  I HSC+PNC AKV  VDG  +I IY+   I+ G+EIT+DY    E +
Sbjct: 463  VIDATNMGNFARFINHSCQPNCYAKVVVVDGEKRIVIYSKTPINKGDEITYDYKFPIEEE 522

Query: 2022 EEYEASVCLCGSQVCRGS 2039
            ++ +   CLCG+  CRG+
Sbjct: 523  DKID---CLCGAPSCRGT 537


>gi|440470515|gb|ELQ39582.1| histone-lysine N-methyltransferase [Magnaporthe oryzae Y34]
 gi|440488496|gb|ELQ68221.1| histone-lysine N-methyltransferase [Magnaporthe oryzae P131]
          Length = 1278

 Score = 77.4 bits (189), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 52/139 (37%), Positives = 70/139 (50%), Gaps = 19/139 (13%)

Query: 1905 EDDFVVEFLGE-VYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVV 1963
            +DD ++E++GE V P      +    RS             + YL R   DA      V+
Sbjct: 1158 KDDMIIEYVGEEVRPSVAQVREARYDRS----------GIGSSYLFRIDEDA------VI 1201

Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
            DA  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    E KEE
Sbjct: 1202 DATKKGGIARFINHSCMPNCTAKIIRVEGTKRIVIYALRDIARNEELTYDYKFELEEKEE 1261

Query: 2024 YEASVCLCGSQVCRGSYLN 2042
             +   CLCG+  C+G +LN
Sbjct: 1262 -DRVPCLCGTTNCKG-FLN 1278


>gi|389634753|ref|XP_003715029.1| histone-lysine N-methyltransferase [Magnaporthe oryzae 70-15]
 gi|351647362|gb|EHA55222.1| histone-lysine N-methyltransferase [Magnaporthe oryzae 70-15]
          Length = 1278

 Score = 77.4 bits (189), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 52/139 (37%), Positives = 70/139 (50%), Gaps = 19/139 (13%)

Query: 1905 EDDFVVEFLGE-VYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVV 1963
            +DD ++E++GE V P      +    RS             + YL R   DA      V+
Sbjct: 1158 KDDMIIEYVGEEVRPSVAQVREARYDRS----------GIGSSYLFRIDEDA------VI 1201

Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
            DA  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    E KEE
Sbjct: 1202 DATKKGGIARFINHSCMPNCTAKIIRVEGTKRIVIYALRDIARNEELTYDYKFELEEKEE 1261

Query: 2024 YEASVCLCGSQVCRGSYLN 2042
             +   CLCG+  C+G +LN
Sbjct: 1262 -DRVPCLCGTTNCKG-FLN 1278


>gi|357149500|ref|XP_003575133.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like
            [Brachypodium distachyon]
          Length = 1022

 Score = 77.4 bits (189), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 53/168 (31%), Positives = 82/168 (48%), Gaps = 22/168 (13%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            + + +KG G+   +E    E  F++E++GEV  +  +  +Q    S  + +      FY 
Sbjct: 251  FCSGKKGFGLQLKEE--VTEGRFLIEYVGEVLDITAYECRQRYYASKGQKH------FYF 302

Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
            + L   +         V+DA  K N    I HSC PNC  +   V+G   IGI+ +R I 
Sbjct: 303  MALNGGE---------VIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFAMRNIK 353

Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYL--NLTGEGAFEK 2051
             GEE+TFDYN V  S    +   C CG+  CRG Y+  +++G G   +
Sbjct: 354  KGEELTFDYNYVRVSGAAPQK--CFCGTAKCRG-YIGGDISGSGIIAQ 398


>gi|443894422|dbj|GAC71770.1| histone H3 (Lys4) methyltransferase complex, subunit SET1 [Pseudozyma
            antarctica T-34]
          Length = 1366

 Score = 77.4 bits (189), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 49/133 (36%), Positives = 67/133 (50%), Gaps = 20/133 (15%)

Query: 1907 DFVVEFLGEVY--PVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
            D V+E++GEV    V    EKQ      Q N        ++ YL R   D      +VVD
Sbjct: 1249 DMVIEYVGEVVRQQVADEREKQ---YERQGN--------FSTYLFRVDDD------LVVD 1291

Query: 1965 AMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY 2024
            A HK N A  + H C PNC AK+  ++G  +I ++    I  GEE+T+DY     S ++ 
Sbjct: 1292 ATHKGNIARLMNHCCTPNCNAKILTLNGEKRIVLFAKSPIRPGEELTYDYK-FQSSADDE 1350

Query: 2025 EASVCLCGSQVCR 2037
            +A  CLCGS  CR
Sbjct: 1351 DAIPCLCGSPGCR 1363


>gi|354478852|ref|XP_003501628.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
            N-methyltransferase ASH1L-like [Cricetulus griseus]
          Length = 2962

 Score = 77.4 bits (189), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2139 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2175

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2176 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2235

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2236 YALKDVLAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2273


>gi|326933478|ref|XP_003212830.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L-like
            [Meleagris gallopavo]
          Length = 2974

 Score = 77.0 bits (188), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2154 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2190

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2191 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCNPNCEMQKWSVNGVYRIGL 2250

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2251 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2288


>gi|363742848|ref|XP_422858.3| PREDICTED: probable histone-lysine N-methyltransferase ASH1L [Gallus
            gallus]
          Length = 2954

 Score = 77.0 bits (188), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2134 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2170

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2171 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCNPNCEMQKWSVNGVYRIGL 2230

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2231 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2268


>gi|157818737|ref|NP_001101159.1| probable histone-lysine N-methyltransferase ASH1L [Rattus norvegicus]
 gi|149048100|gb|EDM00676.1| ash1 (absent, small, or homeotic)-like (Drosophila) (predicted)
            [Rattus norvegicus]
          Length = 2918

 Score = 77.0 bits (188), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2098 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2134

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2135 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2194

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2195 YALKDVPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2232


>gi|71015569|ref|XP_758824.1| hypothetical protein UM02677.1 [Ustilago maydis 521]
 gi|74702458|sp|Q4PB36.1|SET1_USTMA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|46098614|gb|EAK83847.1| hypothetical protein UM02677.1 [Ustilago maydis 521]
          Length = 1468

 Score = 77.0 bits (188), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 46/137 (33%), Positives = 67/137 (48%), Gaps = 28/137 (20%)

Query: 1907 DFVVEFLGEVYPVW------KWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL 1960
            D V+E++GEV          K +E+Q                 ++ YL R   D      
Sbjct: 1351 DMVIEYVGEVVRQQVADEREKQYERQGN---------------FSTYLFRVDDD------ 1389

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
            +VVDA HK N A  + H C PNC AK+  ++G  +I ++    I  GEE+T+DY   + +
Sbjct: 1390 LVVDATHKGNIARLMNHCCTPNCNAKILTLNGEKRIVLFAKTAIRAGEELTYDYKFQSSA 1449

Query: 2021 KEEYEASVCLCGSQVCR 2037
             +E +A  CLCGS  CR
Sbjct: 1450 DDE-DAIPCLCGSPGCR 1465


>gi|343429488|emb|CBQ73061.1| related to regulatory protein SET1 [Sporisorium reilianum SRZ2]
          Length = 1453

 Score = 77.0 bits (188), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 68/131 (51%), Gaps = 16/131 (12%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            D V+E++GEV  V +    +   +  ++ N       ++ YL R   D      +VVDA 
Sbjct: 1336 DMVIEYVGEV--VRQQVADEREKQYERQGN-------FSTYLFRVDDD------LVVDAT 1380

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
            HK N A  + H C PNC AK+  ++G  +I ++    I  GEE+T+DY     S ++ +A
Sbjct: 1381 HKGNIARLMNHCCTPNCNAKILTLNGEKRIVLFAKSPIRAGEELTYDYK-FQSSADDEDA 1439

Query: 2027 SVCLCGSQVCR 2037
              CLCGS  CR
Sbjct: 1440 IPCLCGSPGCR 1450


>gi|222623047|gb|EEE57179.1| hypothetical protein OsJ_07116 [Oryza sativa Japonica Group]
          Length = 1963

 Score = 77.0 bits (188), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 50/154 (32%), Positives = 74/154 (48%), Gaps = 19/154 (12%)

Query: 1885 KYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFY 1944
            K+   +KG G+   ++    E  F++E++GEV  +  +  +Q    S  + +      FY
Sbjct: 1297 KFHTGKKGYGLQLKED--VSEGRFLIEYVGEVLDITAYESRQRYYASKGQKH------FY 1348

Query: 1945 NIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGI 2004
             + L   +         V+DA  K N    I HSC PNC  +   V+G   IGI+ +R I
Sbjct: 1349 FMALNGGE---------VIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFAMRNI 1399

Query: 2005 HYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
              GEE+TFDYN V  S    +   C CG+  CRG
Sbjct: 1400 KKGEELTFDYNYVRVSGAAPQK--CFCGTAKCRG 1431


>gi|218190961|gb|EEC73388.1| hypothetical protein OsI_07633 [Oryza sativa Indica Group]
          Length = 1906

 Score = 76.6 bits (187), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 50/154 (32%), Positives = 74/154 (48%), Gaps = 19/154 (12%)

Query: 1885 KYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFY 1944
            K+   +KG G+   ++    E  F++E++GEV  +  +  +Q    S  + +      FY
Sbjct: 1312 KFHTGKKGYGLQLKED--VSEGRFLIEYVGEVLDITAYESRQRYYASKGQKH------FY 1363

Query: 1945 NIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGI 2004
             + L   +         V+DA  K N    I HSC PNC  +   V+G   IGI+ +R I
Sbjct: 1364 FMALNGGE---------VIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFAMRNI 1414

Query: 2005 HYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
              GEE+TFDYN V  S    +   C CG+  CRG
Sbjct: 1415 KKGEELTFDYNYVRVSGAAPQK--CFCGTAKCRG 1446


>gi|195566590|ref|XP_002106863.1| GD17127 [Drosophila simulans]
 gi|194204255|gb|EDX17831.1| GD17127 [Drosophila simulans]
          Length = 2246

 Score = 76.6 bits (187), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 51/149 (34%), Positives = 77/149 (51%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+        GE  F++E++GEV    + FE++  + S  +N       +Y + L 
Sbjct: 1371 KKGCGITAELLIPPGE--FIMEYVGEVIDSEE-FERRQHLYSKDRNRH-----YYFMAL- 1421

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              +G+A      V+DA  K N +  I HSC PN E +   V+G  +IG ++V+ I  GEE
Sbjct: 1422 --RGEA------VIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQPGEE 1473

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            ITFDY  +   +   +A  C C S  CRG
Sbjct: 1474 ITFDYQYLRYGR---DAQRCYCESTNCRG 1499


>gi|332219957|ref|XP_003259124.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            ASH1L [Nomascus leucogenys]
          Length = 2892

 Score = 76.6 bits (187), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2143 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2179

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2180 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2239

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2240 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2277


>gi|426216789|ref|XP_004002640.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Ovis aries]
          Length = 2965

 Score = 76.6 bits (187), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2144 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2180

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2181 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2240

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2241 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2278


>gi|73622271|ref|NP_619620.3| histone-lysine N-methyltransferase ASH1L [Mus musculus]
 gi|341940590|sp|Q99MY8.3|ASH1L_MOUSE RecName: Full=Histone-lysine N-methyltransferase ASH1L; AltName:
            Full=ASH1-like protein; AltName: Full=Absent small and
            homeotic disks protein 1 homolog
          Length = 2958

 Score = 76.6 bits (187), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2138 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2174

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2175 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2234

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2235 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2272


>gi|417407091|gb|JAA50172.1| Putative histone-lysine n-methyltransferase ash1l isoform 1 [Desmodus
            rotundus]
          Length = 2962

 Score = 76.6 bits (187), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2141 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2177

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2178 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2237

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2238 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2275


>gi|338724967|ref|XP_001499134.2| PREDICTED: probable histone-lysine N-methyltransferase ASH1L isoform
            1 [Equus caballus]
          Length = 2963

 Score = 76.6 bits (187), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2142 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2178

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2179 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2238

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2239 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2276


>gi|195478285|ref|XP_002100470.1| GE17076 [Drosophila yakuba]
 gi|194187994|gb|EDX01578.1| GE17076 [Drosophila yakuba]
          Length = 2397

 Score = 76.6 bits (187), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 77/149 (51%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+    +   GE  F++E++GEV    + FE++  + S  +N       +Y + L 
Sbjct: 1437 KKGCGITAELQIPPGE--FIMEYVGEVIDSEE-FERRQHLYSKDRNRH-----YYFMAL- 1487

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              +G+A      ++DA  K N +  I HSC PN E +   V+G  +IG ++V+ I  GEE
Sbjct: 1488 --RGEA------IIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQPGEE 1539

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            ITFDY      +   +A  C C +  CRG
Sbjct: 1540 ITFDYQYQRYGR---DAQRCYCEAANCRG 1565


>gi|417515828|gb|JAA53722.1| histone-lysine N-methyltransferase ASH1L [Sus scrofa]
          Length = 2951

 Score = 76.6 bits (187), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2130 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2166

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2167 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2226

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2227 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2264


>gi|440903623|gb|ELR54260.1| Putative histone-lysine N-methyltransferase ASH1L [Bos grunniens
            mutus]
          Length = 2965

 Score = 76.6 bits (187), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2144 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2180

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2181 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2240

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2241 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2278


>gi|326676505|ref|XP_692254.4| PREDICTED: probable histone-lysine N-methyltransferase ASH1L [Danio
            rerio]
          Length = 2933

 Score = 76.6 bits (187), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 83/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2064 ERFRAEGKGWGIRTKQPLRAGQ--FIIEYLGEVV-------------SEQ--------EF 2100

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             +  +E+    +  Y L     +V+D+    N A  + HSC PNCE +  +V+G Y+IG+
Sbjct: 2101 RSRMMEQYFSHSGHYCLNLDSGMVIDSYRMGNEARFVNHSCEPNCEMQKWSVNGVYRIGL 2160

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            + ++ I+ G E+T+DYN  + + EE +  VC CGS+ CRG
Sbjct: 2161 FALKDINSGTELTYDYNFHSFNTEEQQ--VCKCGSEGCRG 2198


>gi|395845197|ref|XP_003795328.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Otolemur
            garnettii]
          Length = 2961

 Score = 76.6 bits (187), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2140 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2176

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2177 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2236

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2237 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2274


>gi|327286108|ref|XP_003227773.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L-like
            [Anolis carolinensis]
          Length = 2957

 Score = 76.6 bits (187), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2136 ERFRAEEKGWGIRTKESLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2172

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2173 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCNPNCEMQKWSVNGVYRIGL 2232

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2233 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2270


>gi|299746032|ref|XP_002910994.1| Setd1a protein [Coprinopsis cinerea okayama7#130]
 gi|298406870|gb|EFI27500.1| Setd1a protein [Coprinopsis cinerea okayama7#130]
          Length = 1614

 Score = 76.6 bits (187), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 46/81 (56%), Gaps = 4/81 (4%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            VVDA  K N    I HSC PNC AK+  + G  +I IY  + I  GEEIT+DY+   E  
Sbjct: 1538 VVDATKKGNLGRLINHSCDPNCTAKIITISGVKKIVIYAKQDIELGEEITYDYHFPIEQD 1597

Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
             +     CLCGS  CRG YLN
Sbjct: 1598 NKIP---CLCGSARCRG-YLN 1614


>gi|242061944|ref|XP_002452261.1| hypothetical protein SORBIDRAFT_04g022620 [Sorghum bicolor]
 gi|241932092|gb|EES05237.1| hypothetical protein SORBIDRAFT_04g022620 [Sorghum bicolor]
          Length = 1840

 Score = 76.6 bits (187), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 77/156 (49%), Gaps = 20/156 (12%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            + + +KG G+   ++    E  F++E++GEV  +  +  +Q    S  + +      FY 
Sbjct: 1136 FYSGKKGYGLQLQED--VTEGRFLIEYVGEVLDITSYESRQRYYASKGQKH------FYF 1187

Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
            + L   +         V+DA  K N    I HSC PNC  +   V+G   IGI+++R I 
Sbjct: 1188 MALNGGE---------VIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFSLRNIK 1238

Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
             GEE+TFDYN V  S    +   C CG+  CRG YL
Sbjct: 1239 KGEELTFDYNYVRVSGAAPQK--CFCGTAKCRG-YL 1271


>gi|345567899|gb|EGX50801.1| hypothetical protein AOL_s00054g887 [Arthrobotrys oligospora ATCC
            24927]
          Length = 1338

 Score = 76.6 bits (187), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 39/84 (46%), Positives = 49/84 (58%), Gaps = 2/84 (2%)

Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            +  V+DA  K   A  I HSC PNC AK+  V+G  +I IY +R IH  EE+T+DY    
Sbjct: 1257 ETTVIDATKKGGIARFINHSCTPNCTAKIIKVEGTKRIVIYALRDIHKDEELTYDYKFER 1316

Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
            E   E E   CLCGS  C+G +LN
Sbjct: 1317 EIDSE-ERIPCLCGSSGCKG-FLN 1338


>gi|148683294|gb|EDL15241.1| ash1 (absent, small, or homeotic)-like (Drosophila) [Mus musculus]
          Length = 2918

 Score = 76.6 bits (187), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2098 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2134

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2135 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2194

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2195 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2232


>gi|417407083|gb|JAA50168.1| Putative histone-lysine n-methyltransferase ash1l isoform 1 [Desmodus
            rotundus]
          Length = 2832

 Score = 76.6 bits (187), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2141 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2177

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2178 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2237

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2238 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2275


>gi|403293713|ref|XP_003937857.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Saimiri
            boliviensis boliviensis]
          Length = 2970

 Score = 76.6 bits (187), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2149 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2185

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2186 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2245

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2246 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2283


>gi|73960946|ref|XP_537251.2| PREDICTED: probable histone-lysine N-methyltransferase ASH1L isoform
            1 [Canis lupus familiaris]
          Length = 2965

 Score = 76.6 bits (187), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2144 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2180

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2181 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2240

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2241 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2278


>gi|301785832|ref|XP_002928328.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L-like
            [Ailuropoda melanoleuca]
          Length = 2965

 Score = 76.6 bits (187), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2144 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2180

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2181 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2240

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2241 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2278


>gi|291397821|ref|XP_002715465.1| PREDICTED: absent, small, or homeotic 1-like [Oryctolagus cuniculus]
          Length = 2961

 Score = 76.6 bits (187), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2140 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2176

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2177 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2236

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2237 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2274


>gi|13442965|gb|AAK26242.1|AF247132_1 putative chromatin remodeling factor [Mus musculus]
          Length = 2669

 Score = 76.6 bits (187), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 1849 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 1885

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 1886 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 1945

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 1946 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 1983


>gi|380814664|gb|AFE79206.1| putative histone-lysine N-methyltransferase ASH1L [Macaca mulatta]
 gi|383419979|gb|AFH33203.1| putative histone-lysine N-methyltransferase ASH1L [Macaca mulatta]
          Length = 2963

 Score = 76.6 bits (187), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2142 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2178

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2179 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2238

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2239 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2276


>gi|351696657|gb|EHA99575.1| Putative histone-lysine N-methyltransferase ASH1L [Heterocephalus
            glaber]
          Length = 2930

 Score = 76.6 bits (187), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2109 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2145

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2146 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2205

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2206 YALKDMTAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2243


>gi|300795068|ref|NP_001179672.1| probable histone-lysine N-methyltransferase ASH1L [Bos taurus]
 gi|296489728|tpg|DAA31841.1| TPA: ash1 (absent, small, or homeotic)-like [Bos taurus]
          Length = 2965

 Score = 76.6 bits (187), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2144 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2180

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2181 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2240

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2241 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2278


>gi|388853505|emb|CCF52904.1| related to regulatory protein SET1 [Ustilago hordei]
          Length = 1489

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 46/131 (35%), Positives = 68/131 (51%), Gaps = 16/131 (12%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            D V+E++GE+        +Q    + +K  E      ++ YL R   D      +VVDA 
Sbjct: 1372 DMVIEYVGEMV-------RQQVADNREKQYERQG--NFSTYLFRVDDD------LVVDAT 1416

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
            HK N A  + H C PNC AK+  V+G  +I ++    I  GEE+T+DY   + + +E +A
Sbjct: 1417 HKGNIARLMNHCCTPNCNAKILTVNGEKRIVLFAKSPIKAGEELTYDYKFQSSADDE-DA 1475

Query: 2027 SVCLCGSQVCR 2037
              CLCGS  CR
Sbjct: 1476 IPCLCGSDGCR 1486


>gi|355745722|gb|EHH50347.1| hypothetical protein EGM_01160 [Macaca fascicularis]
          Length = 2904

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2144 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2180

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2181 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2240

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2241 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2278


>gi|110349788|ref|NP_060959.2| histone-lysine N-methyltransferase ASH1L [Homo sapiens]
 gi|225000936|gb|AAI72595.1| Ash1 (absent, small, or homeotic)-like (Drosophila) [synthetic
            construct]
          Length = 2964

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2143 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2179

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2180 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2239

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2240 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2277


>gi|117949323|sp|Q9NR48.2|ASH1L_HUMAN RecName: Full=Histone-lysine N-methyltransferase ASH1L; AltName:
            Full=ASH1-like protein; Short=huASH1; AltName:
            Full=Absent small and homeotic disks protein 1 homolog;
            AltName: Full=Lysine N-methyltransferase 2H
          Length = 2969

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2148 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2184

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2185 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2244

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2245 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2282


>gi|350583322|ref|XP_003125756.3| PREDICTED: probable histone-lysine N-methyltransferase ASH1L-like,
            partial [Sus scrofa]
          Length = 2824

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 1997 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2033

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2034 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2093

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2094 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2131


>gi|344286471|ref|XP_003414981.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L
            [Loxodonta africana]
          Length = 2917

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2096 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2132

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2133 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2192

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2193 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2230


>gi|397492363|ref|XP_003817092.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            ASH1L [Pan paniscus]
          Length = 2964

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2143 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2179

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2180 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2239

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2240 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2277


>gi|281338719|gb|EFB14303.1| hypothetical protein PANDA_018255 [Ailuropoda melanoleuca]
          Length = 2981

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2160 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2196

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2197 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2256

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2257 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2294


>gi|7739725|gb|AAF68983.1|AF257305_1 ASH1 [Homo sapiens]
          Length = 2969

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2148 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2184

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2185 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2244

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2245 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2282


>gi|390476801|ref|XP_002760038.2| PREDICTED: histone-lysine N-methyltransferase ASH1L [Callithrix
            jacchus]
          Length = 2970

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2149 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2185

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2186 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2245

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2246 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2283


>gi|324500453|gb|ADY40214.1| Histone-lysine N-methyltransferase lin-59 [Ascaris suum]
          Length = 1467

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 55/170 (32%), Positives = 79/170 (46%), Gaps = 20/170 (11%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +GLGV    +    +  FV E++GEV            + +    N      F N Y   
Sbjct: 799  RGLGV--RTDVPLQKGQFVCEYVGEVV----------SMETFDARNAHSYRAFRNHYA-- 844

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
                  GY   V+DA  K N A  + HSC PNCE +  +V+G ++IG++ +R +  GEE+
Sbjct: 845  -LNLCPGY---VIDAYQKGNIARFVNHSCVPNCEMQRWSVNGQHRIGLFALRVVAKGEEL 900

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLL 2060
            T+DYN   +S + Y  + C CG   CRG         A EK L    G+L
Sbjct: 901  TYDYN--WDSFDFYGVTPCSCGVPNCRGFLNKNVLMNAKEKELARSSGVL 948


>gi|119573453|gb|EAW53068.1| ash1 (absent, small, or homeotic)-like (Drosophila) [Homo sapiens]
          Length = 2969

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2148 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2184

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2185 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2244

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2245 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2282


>gi|195376627|ref|XP_002047094.1| GJ13235 [Drosophila virilis]
 gi|194154252|gb|EDW69436.1| GJ13235 [Drosophila virilis]
          Length = 2005

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 50/149 (33%), Positives = 77/149 (51%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+    +   GE  F++E++GEV    + FE++  + S  +N       +Y + L 
Sbjct: 1082 KKGCGITAELQIPPGE--FIMEYVGEVIDSEE-FERRQHLYSEDRNRH-----YYFMAL- 1132

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              +G+A      ++DA  K N +  I HSC PN E +   V+G  +IG ++V+ I  GEE
Sbjct: 1133 --RGEA------IIDATTKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKTILPGEE 1184

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            ITFDY      +   +A  C C S  CRG
Sbjct: 1185 ITFDYQYQRYGR---DAQRCYCESANCRG 1210


>gi|195126250|ref|XP_002007587.1| GI12297 [Drosophila mojavensis]
 gi|193919196|gb|EDW18063.1| GI12297 [Drosophila mojavensis]
          Length = 1972

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 50/149 (33%), Positives = 77/149 (51%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+    +   GE  F++E++GEV    + FE++  + S  +N       +Y + L 
Sbjct: 1058 KKGCGITAELQIQPGE--FIMEYVGEVIDSEE-FERRQHLYSEDRNRH-----YYFMAL- 1108

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              +G+A      ++DA  K N +  I HSC PN E +   V+G  +IG ++V+ I  GEE
Sbjct: 1109 --RGEA------IIDATTKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKTIMPGEE 1160

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            ITFDY      +   +A  C C S  CRG
Sbjct: 1161 ITFDYQYQRYGR---DAQRCYCESANCRG 1186


>gi|194895514|ref|XP_001978270.1| GG17783 [Drosophila erecta]
 gi|190649919|gb|EDV47197.1| GG17783 [Drosophila erecta]
          Length = 2384

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 77/149 (51%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+    +   GE  F++E++GEV    + FE++  + S  +N       +Y + L 
Sbjct: 1424 KKGCGITAELQIPPGE--FIMEYVGEVIDSEE-FERRQHLYSKDRNRH-----YYFMAL- 1474

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              +G+A      ++DA  K N +  I HSC PN E +   V+G  +IG ++V+ I  GEE
Sbjct: 1475 --RGEA------IIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQPGEE 1526

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            ITFDY      +   +A  C C +  CRG
Sbjct: 1527 ITFDYQYQRYGR---DAQRCYCEAANCRG 1552


>gi|410986772|ref|XP_003999683.1| PREDICTED: histone-lysine N-methyltransferase ASH1L isoform 1 [Felis
            catus]
          Length = 2965

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2144 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2180

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2181 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2240

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2241 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2278


>gi|410986774|ref|XP_003999684.1| PREDICTED: histone-lysine N-methyltransferase ASH1L isoform 2 [Felis
            catus]
          Length = 2974

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2153 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2189

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2190 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2249

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2250 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2287


>gi|241753587|ref|XP_002401135.1| huntingtin interacting protein, putative [Ixodes scapularis]
 gi|215508354|gb|EEC17808.1| huntingtin interacting protein, putative [Ixodes scapularis]
          Length = 1594

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/155 (32%), Positives = 78/155 (50%), Gaps = 20/155 (12%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +K++  +KG G+   +    G   FV+E++GEV    + F K+  ++   ++N      +
Sbjct: 621  EKFLTEKKGWGLRTVETLASGA--FVMEYVGEVL-TPEDFRKR--VKQYARDNHQ---HY 672

Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
            Y + L   + D       ++DA  K N +  I HSC PNCE +   V+G  +IG +T R 
Sbjct: 673  YFMAL---RSDE------IIDATQKGNVSRFINHSCDPNCETQKWTVNGELRIGFFTRRP 723

Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +  GEE+TFDY      K   EA  C C S  CRG
Sbjct: 724  LRAGEELTFDYQFQRYGK---EAQKCYCESSKCRG 755


>gi|410033849|ref|XP_003949641.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Pan troglodytes]
          Length = 2964

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2143 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2179

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2180 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2239

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2240 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2277


>gi|410226116|gb|JAA10277.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
 gi|410264036|gb|JAA19984.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
 gi|410264040|gb|JAA19986.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
 gi|410306368|gb|JAA31784.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
 gi|410355463|gb|JAA44335.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
          Length = 2964

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2143 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2179

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2180 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2239

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2240 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2277


>gi|355558542|gb|EHH15322.1| hypothetical protein EGK_01394 [Macaca mulatta]
          Length = 2796

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2068 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2104

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2105 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2164

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2165 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2202


>gi|224084984|ref|XP_002307459.1| SET domain protein [Populus trichocarpa]
 gi|222856908|gb|EEE94455.1| SET domain protein [Populus trichocarpa]
          Length = 594

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 52/149 (34%), Positives = 74/149 (49%), Gaps = 19/149 (12%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+  +++   G+  F++E++GEV  V  +  +Q    S    +      FY + L 
Sbjct: 170  KKGFGLRLDEDISRGQ--FLIEYVGEVLDVHAYEARQKDYASKGHKH------FYFMTL- 220

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
                  DG +  V+DA  K N    I HSC PNC  +   V+G   IG++ +R I  GEE
Sbjct: 221  ------DGSE--VIDACAKGNLGRFINHSCDPNCRTEKWVVNGEICIGLFALRDIKMGEE 272

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +TFDYN V        A  C CGS  CRG
Sbjct: 273  VTFDYNYVRVVGA--AAKRCYCGSPQCRG 299


>gi|348579791|ref|XP_003475662.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
            N-methyltransferase ASH1L-like [Cavia porcellus]
          Length = 2964

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2143 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2179

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2180 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2239

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2240 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2277


>gi|449490008|ref|XP_004176439.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            ASH1L-like [Taeniopygia guttata]
          Length = 2968

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 81/160 (50%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2148 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2184

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2185 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCNPNCEMQKWSVNGVYRIGL 2244

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG   CRG
Sbjct: 2245 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFDKCRG 2282


>gi|410911836|ref|XP_003969396.1| PREDICTED: histone-lysine N-methyltransferase ASH1L-like [Takifugu
            rubripes]
          Length = 2782

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 61/205 (29%), Positives = 97/205 (47%), Gaps = 44/205 (21%)

Query: 1853 EEIEKEAVDDCDVR-TMKMCRGILKAMDSRPDDKYVA----------YR---KGLGVVCN 1898
            + IEK  +DDC  R +   C         + D++++           +R   KG G+   
Sbjct: 1937 DRIEKSCLDDCLNRMSFAECSPSTCPSADQCDNQHIQRHDWVQCLERFRTEGKGWGIRTK 1996

Query: 1899 KEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY 1958
            +    G+  F++E+LGEV              S Q        EF +  +E+    +  Y
Sbjct: 1997 EPLRAGQ--FIIEYLGEVV-------------SEQ--------EFRSRMMEQYFSHSGNY 2033

Query: 1959 DL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFD 2013
             L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG++ +  I  G E+T+D
Sbjct: 2034 CLNLDSGMVIDSYRMGNEARFINHSCEPNCEMQKWSVNGVYRIGLFALGEIPSGTELTYD 2093

Query: 2014 YNSVTESKEEYEASVCLCGSQVCRG 2038
            YN  + + EE +A  C+CGS+ CRG
Sbjct: 2094 YNFHSFNTEEQQA--CMCGSESCRG 2116


>gi|195352880|ref|XP_002042939.1| GM11634 [Drosophila sechellia]
 gi|194126986|gb|EDW49029.1| GM11634 [Drosophila sechellia]
          Length = 1965

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 50/153 (32%), Positives = 78/153 (50%), Gaps = 20/153 (13%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            +   +KG G+        GE  F++E++GEV    + FE++  + S  +N       +Y 
Sbjct: 1272 FRTEKKGCGITAELLIPPGE--FIMEYVGEVIDSEE-FERRQHLYSKDRNRH-----YYF 1323

Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
            + L   +G+A      V+DA  K N +  I HSC PN E +   V+G  +IG ++V+ I 
Sbjct: 1324 MAL---RGEA------VIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQ 1374

Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
             GEEITFDY  +   +   +A  C C +  CRG
Sbjct: 1375 PGEEITFDYQYLRYGR---DAQRCYCEATNCRG 1404


>gi|168044865|ref|XP_001774900.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673794|gb|EDQ60312.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1980

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 46/133 (34%), Positives = 69/133 (51%), Gaps = 21/133 (15%)

Query: 1908 FVVEFLGEVY--PVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDA 1965
            F++E++GEV   P ++  +K+  + S QK+       FY + L   +         ++DA
Sbjct: 928  FIIEYVGEVLDMPSFEARQKEYSMNS-QKH-------FYFMTLSANE---------IIDA 970

Query: 1966 MHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE 2025
              K N    I HSC PNC+ +   VDG   IG++ +R +  GEE+TFDYN V       +
Sbjct: 971  CSKGNLGRFINHSCEPNCQTEKWMVDGEVCIGLFAIRDVKKGEEVTFDYNFVRVGGA--D 1028

Query: 2026 ASVCLCGSQVCRG 2038
            A  C CG+  CRG
Sbjct: 1029 AKKCECGANKCRG 1041


>gi|403167549|ref|XP_003327326.2| histone-lysine N-methyltransferase SETD2 [Puccinia graminis f. sp.
            tritici CRL 75-36-700-3]
 gi|375167080|gb|EFP82907.2| histone-lysine N-methyltransferase SETD2 [Puccinia graminis f. sp.
            tritici CRL 75-36-700-3]
          Length = 974

 Score = 76.3 bits (186), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 56/184 (30%), Positives = 86/184 (46%), Gaps = 38/184 (20%)

Query: 1893 LGVVCNKEGGFG--------EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPE-- 1942
            + +V   + GFG        +D FV E+LGEV           G+++L K  +D   E  
Sbjct: 200  IEIVLTPKKGFGMRLQADVPKDTFVYEYLGEVI----------GVKALHKRLKDYGQEGI 249

Query: 1943 --FYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + L++     D Y    +DA  K  +   + HSC PNC      V    ++GI+T
Sbjct: 250  KHFYFMELQK-----DQY----IDATKKGGFGRFLNHSCNPNCYIGKWVVGRQLRMGIFT 300

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYL---NLTGEGAFEKVLKELH 2057
             R +  GEE+TF+YN     +  +EA  C CG   C G +L     T  GA +++  +  
Sbjct: 301  KRAVRGGEELTFNYNV---DRYGHEAQECFCGEANCVG-FLGGKTQTDLGAMDELYIDAL 356

Query: 2058 GLLD 2061
            G++D
Sbjct: 357  GIVD 360


>gi|395532129|ref|XP_003768124.1| PREDICTED: histone-lysine N-methyltransferase ASH1L isoform 1
            [Sarcophilus harrisii]
          Length = 2969

 Score = 75.9 bits (185), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 81/160 (50%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2149 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2185

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2186 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCNPNCEMQKWSVNGVYRIGL 2245

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG   CRG
Sbjct: 2246 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFDKCRG 2283


>gi|395532131|ref|XP_003768125.1| PREDICTED: histone-lysine N-methyltransferase ASH1L isoform 2
            [Sarcophilus harrisii]
          Length = 2974

 Score = 75.9 bits (185), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 81/160 (50%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2154 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2190

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2191 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCNPNCEMQKWSVNGVYRIGL 2250

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG   CRG
Sbjct: 2251 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFDKCRG 2288


>gi|126307634|ref|XP_001366993.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L
            [Monodelphis domestica]
          Length = 2968

 Score = 75.9 bits (185), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 81/160 (50%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2148 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2184

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2185 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCNPNCEMQKWSVNGVYRIGL 2244

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG   CRG
Sbjct: 2245 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFDKCRG 2282


>gi|24641786|ref|NP_572888.2| Set2, isoform A [Drosophila melanogaster]
 gi|22832197|gb|AAF48273.2| Set2, isoform A [Drosophila melanogaster]
          Length = 2362

 Score = 75.9 bits (185), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 50/149 (33%), Positives = 77/149 (51%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+        GE  F++E++GEV    + FE++  + S  +N       +Y + L 
Sbjct: 1420 KKGCGITAELLIPPGE--FIMEYVGEVIDSEE-FERRQHLYSKDRNRH-----YYFMAL- 1470

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              +G+A      V+DA  K N +  I HSC PN E +   V+G  +IG ++V+ I  GEE
Sbjct: 1471 --RGEA------VIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQPGEE 1522

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            ITFDY  +   +   +A  C C +  CRG
Sbjct: 1523 ITFDYQYLRYGR---DAQRCYCEAANCRG 1548


>gi|281360813|ref|NP_001162740.1| Set2, isoform B [Drosophila melanogaster]
 gi|118582047|sp|Q9VYD1.2|C1716_DROME RecName: Full=Probable histone-lysine N-methyltransferase CG1716
 gi|92109778|gb|ABE73213.1| LD27386p [Drosophila melanogaster]
 gi|272506087|gb|ACZ95275.1| Set2, isoform B [Drosophila melanogaster]
          Length = 2313

 Score = 75.9 bits (185), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 50/149 (33%), Positives = 77/149 (51%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+        GE  F++E++GEV    + FE++  + S  +N       +Y + L 
Sbjct: 1371 KKGCGITAELLIPPGE--FIMEYVGEVIDSEE-FERRQHLYSKDRNRH-----YYFMAL- 1421

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              +G+A      V+DA  K N +  I HSC PN E +   V+G  +IG ++V+ I  GEE
Sbjct: 1422 --RGEA------VIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQPGEE 1473

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            ITFDY  +   +   +A  C C +  CRG
Sbjct: 1474 ITFDYQYLRYGR---DAQRCYCEAANCRG 1499


>gi|348530060|ref|XP_003452529.1| PREDICTED: hypothetical protein LOC100707110 [Oreochromis niloticus]
          Length = 2876

 Score = 75.5 bits (184), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 51/153 (33%), Positives = 78/153 (50%), Gaps = 30/153 (19%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+   +    G+  F++E+LGEV              S Q        EF +  +E+
Sbjct: 2027 KGWGIRTKESLRSGQ--FIIEYLGEVV-------------SEQ--------EFRSRMMEQ 2063

Query: 1951 PKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
                +  Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG++ ++ I 
Sbjct: 2064 YFSHSGHYCLNLDSGMVIDSYRMGNEARFINHSCEPNCEMQKWSVNGVYRIGLFALKDIS 2123

Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
             G E+T+DYN  + + EE +  VC CGS+ CRG
Sbjct: 2124 SGTELTYDYNFHSFNTEEQQ--VCKCGSESCRG 2154


>gi|432881031|ref|XP_004073771.1| PREDICTED: histone-lysine N-methyltransferase ASH1L-like [Oryzias
            latipes]
          Length = 2798

 Score = 75.5 bits (184), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 1957 ERFRAEGKGWGIRTKEPLRAGQ--FIIEYLGEVV-------------SEQ--------EF 1993

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             +  +E+    +  Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 1994 RSRMMEQYFSHSGHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2053

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            + ++ +  G E+T+DYN  + + EE +A  C CGS+ CRG
Sbjct: 2054 FALKDVSSGTELTYDYNFHSFNTEEQQA--CKCGSESCRG 2091


>gi|358058803|dbj|GAA95766.1| hypothetical protein E5Q_02423 [Mixia osmundae IAM 14324]
          Length = 2083

 Score = 75.5 bits (184), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 47/130 (36%), Positives = 64/130 (49%), Gaps = 23/130 (17%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVV 1963
            D ++E++GE+            IR    +  + A E   I   YL R   D      +VV
Sbjct: 1412 DMIIEYVGEL------------IRQQVADKREKAYEKMGIGSSYLFRVDDD------LVV 1453

Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
            DA  K  YA  I H C PNC A++  + GH +I IY +  I  G+EIT+DY+  TES + 
Sbjct: 1454 DATKKGTYARLINHCCAPNCTARIITIGGHKKIVIYALTDIEPGDEITYDYHFATESDDL 1513

Query: 2024 YEASVCLCGS 2033
                 CLCGS
Sbjct: 1514 KIP--CLCGS 1521


>gi|320168697|gb|EFW45596.1| Setd1a protein [Capsaspora owczarzaki ATCC 30864]
          Length = 1312

 Score = 75.5 bits (184), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 40/83 (48%), Positives = 50/83 (60%), Gaps = 9/83 (10%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            VVDA +K N A  + H C PNC AK+  VDGH +I IY+ R I  GEEIT+DY      K
Sbjct: 1237 VVDATYKGNLARFMNHCCEPNCYAKIIMVDGHQRIVIYSKRDIKKGEEITYDY------K 1290

Query: 2022 EEYEAS--VCLCGSQVCRGSYLN 2042
              YE +   CLCG+  C+  +LN
Sbjct: 1291 FPYEENKIPCLCGAVNCK-KFLN 1312


>gi|395329295|gb|EJF61682.1| hypothetical protein DICSQDRAFT_85722 [Dichomitus squalens LYAD-421
            SS1]
          Length = 1095

 Score = 75.5 bits (184), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 50/136 (36%), Positives = 65/136 (47%), Gaps = 25/136 (18%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVV 1963
            D V+E++GEV            IR+   +  + A E   I   YL R   D      +VV
Sbjct: 980  DLVIEYVGEV------------IRAQVADKREKAYERQGIGSSYLFRIDED------LVV 1021

Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
            DA  K N    I HSC PNC AK+  + G  +I IY  + I  G EIT+DY+   E    
Sbjct: 1022 DATKKGNLGRLINHSCDPNCTAKIITISGEKKIVIYAKQDIELGSEITYDYHFPIEQ--- 1078

Query: 2024 YEASVCLCGSQVCRGS 2039
             +   CLCGS  CRG+
Sbjct: 1079 -DKIPCLCGSAKCRGT 1093


>gi|312072804|ref|XP_003139232.1| hypothetical protein LOAG_03647 [Loa loa]
 gi|307765598|gb|EFO24832.1| hypothetical protein LOAG_03647 [Loa loa]
          Length = 1422

 Score = 75.5 bits (184), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 46/133 (34%), Positives = 69/133 (51%), Gaps = 21/133 (15%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            F++E++GEV       + ++ IR  ++  +DP  +  + YL   K  A      V+DA  
Sbjct: 627  FIIEYIGEV------IDAEEMIRRGRRYGKDP--KHVHHYLMALKNGA------VIDATA 672

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY--E 2025
            K N +  I HSC PNCE++   VD   ++G + ++ I  GEEI FDY       E Y  +
Sbjct: 673  KGNVSRFINHSCDPNCESQKWTVDRQLRVGFFVIKPIALGEEIVFDYQL-----ERYGRK 727

Query: 2026 ASVCLCGSQVCRG 2038
            A  C CG+  CRG
Sbjct: 728  AQRCFCGAANCRG 740


>gi|402081815|gb|EJT76960.1| histone-lysine N-methyltransferase [Gaeumannomyces graminis var.
            tritici R3-111a-1]
          Length = 1319

 Score = 75.5 bits (184), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 48/138 (34%), Positives = 70/138 (50%), Gaps = 17/138 (12%)

Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
            +DD ++E++GE   V     K    R L+           + YL R   +A      V+D
Sbjct: 1199 KDDMIIEYVGEE--VRPSVAKVREARYLKSG-------IGSTYLFRIDDEA------VID 1243

Query: 1965 AMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY 2024
            A  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    E +++ 
Sbjct: 1244 ATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIGQNEELTYDYKFEPE-EDQK 1302

Query: 2025 EASVCLCGSQVCRGSYLN 2042
            +   CLCG+  C+G +LN
Sbjct: 1303 DRVPCLCGTTACKG-FLN 1319


>gi|195392836|ref|XP_002055060.1| GJ19006 [Drosophila virilis]
 gi|194149570|gb|EDW65261.1| GJ19006 [Drosophila virilis]
          Length = 2101

 Score = 75.5 bits (184), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 50/149 (33%), Positives = 77/149 (51%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+    +   GE  F++E++GEV    + FE++  I S  +N       +Y + L 
Sbjct: 1071 KKGCGITAELQIPPGE--FIMEYVGEVIDSEE-FERRQHIYSRDRNRH-----YYFMAL- 1121

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              +G+A      ++DA  K N +  I HSC PN E +   V+G  +IG ++V+ I  GEE
Sbjct: 1122 --RGEA------IIDATAKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKTIMPGEE 1173

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            ITFDY      +   +A  C C +  CRG
Sbjct: 1174 ITFDYQYQRYGR---DAQRCYCEASNCRG 1199


>gi|15150415|gb|AAK84931.1| SD01656p [Drosophila melanogaster]
          Length = 1443

 Score = 75.1 bits (183), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 50/149 (33%), Positives = 77/149 (51%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+        GE  F++E++GEV    + FE++  + S  +N       +Y + L 
Sbjct: 501  KKGCGITAELLIPPGE--FIMEYVGEVIDSEE-FERRQHLYSKDRNRH-----YYFMAL- 551

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              +G+A      V+DA  K N +  I HSC PN E +   V+G  +IG ++V+ I  GEE
Sbjct: 552  --RGEA------VIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQPGEE 603

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            ITFDY  +   +   +A  C C +  CRG
Sbjct: 604  ITFDYQYLRYGR---DAQRCYCEAANCRG 629


>gi|301614673|ref|XP_002936809.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Xenopus
            (Silurana) tropicalis]
          Length = 1298

 Score = 75.1 bits (183), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 45/158 (28%), Positives = 86/158 (54%), Gaps = 21/158 (13%)

Query: 1882 PDDKYVAYR-KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            P+ K +    KG G++  ++   GE  FV E++GE+       ++++ +  ++   E+  
Sbjct: 1007 PETKIIKTEGKGWGLIATRDIKKGE--FVNEYIGEL------IDEEECMYRIRHAQENDI 1058

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + +++ +         ++DA  K N++  + HSC+PNCE +  +V+G  ++G++ 
Sbjct: 1059 THFYMLTIDKDR---------IIDAGPKGNFSRFMNHSCQPNCETQKWSVNGDTRVGLFA 1109

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            VR I  GEE+TF+YN      E+   ++C CG+  C G
Sbjct: 1110 VRDIPAGEELTFNYNLDCLGNEK---TICRCGAPNCSG 1144


>gi|427794953|gb|JAA62928.1| hypothetical protein, partial [Rhipicephalus pulchellus]
          Length = 1557

 Score = 75.1 bits (183), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 50/145 (34%), Positives = 70/145 (48%), Gaps = 30/145 (20%)

Query: 1903 FGEDDFVVEFLGE-VYPVW----KWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADG 1957
               D+ V+E++G+ V P+     + F  Q GI S            + + +E        
Sbjct: 1438 IAADEMVIEYVGQMVRPIMADRREQFYTQIGIGSSY---------LFRVDVE-------- 1480

Query: 1958 YDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSV 2017
                ++DA    N A  I HSC PNC AKV  V+G  +I IY+ + I+  EEIT+DY   
Sbjct: 1481 ---TIIDATKCGNLARFINHSCNPNCYAKVITVEGQKKIVIYSKQPINVNEEITYDYKFP 1537

Query: 2018 TESKEEYEASVCLCGSQVCRGSYLN 2042
             E     E  VCLCG+  CRG +LN
Sbjct: 1538 LED----EKIVCLCGAPQCRG-FLN 1557


>gi|219111565|ref|XP_002177534.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217412069|gb|EEC51997.1| predicted protein, partial [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 144

 Score = 75.1 bits (183), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 51/150 (34%), Positives = 81/150 (54%), Gaps = 20/150 (13%)

Query: 1891 KGLGVV-CNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            KG G+V C+K    G+ D V+E++G V       EK+D +   ++++ +  P FY + L 
Sbjct: 13   KGWGLVPCDK---IGKGDLVLEYVGNVIDA---KEKEDRLSEWERDHPND-PNFYIMSLR 65

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
                     D   +DA HKAN +  I HSC PNC      V+G+ + GI+  R I  GE 
Sbjct: 66   ---------DQWYIDARHKANLSRFINHSCAPNCFLTQINVNGYARNGIFAKRDIQAGEF 116

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGS 2039
            +++DY+  T+  + +   VC CG++ CRG+
Sbjct: 117  LSYDYHFDTKQGDRF---VCRCGAKSCRGT 143


>gi|115446669|ref|NP_001047114.1| Os02g0554000 [Oryza sativa Japonica Group]
 gi|50725771|dbj|BAD33302.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
 gi|113536645|dbj|BAF09028.1| Os02g0554000 [Oryza sativa Japonica Group]
          Length = 637

 Score = 75.1 bits (183), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 50/154 (32%), Positives = 74/154 (48%), Gaps = 19/154 (12%)

Query: 1885 KYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFY 1944
            K+   +KG G+   ++    E  F++E++GEV  +  +  +Q    S  + +      FY
Sbjct: 199  KFHTGKKGYGLQLKED--VSEGRFLIEYVGEVLDITAYESRQRYYASKGQKH------FY 250

Query: 1945 NIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGI 2004
             + L   +         V+DA  K N    I HSC PNC  +   V+G   IGI+ +R I
Sbjct: 251  FMALNGGE---------VIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFAMRNI 301

Query: 2005 HYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
              GEE+TFDYN V  S    +   C CG+  CRG
Sbjct: 302  KKGEELTFDYNYVRVSGAAPQK--CFCGTAKCRG 333


>gi|346974289|gb|EGY17741.1| histone-lysine N-methyltransferase [Verticillium dahliae VdLs.17]
          Length = 1148

 Score = 75.1 bits (183), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 73/146 (50%), Gaps = 23/146 (15%)

Query: 1900 EGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYD 1959
            E    +DD ++E++GE     +  ++   IR ++             YL++  G +  + 
Sbjct: 1023 EENINKDDMIIEYVGE-----QVRQQISEIREVR-------------YLKQGMGSSYLFR 1064

Query: 1960 L---VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNS 2016
            +    V+DA  K   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY  
Sbjct: 1065 IDENTVIDATKKGGIARFINHSCMPNCTAKIIKVDGSKRIVIYALRDIARTEELTYDYKF 1124

Query: 2017 VTESKEEYEASVCLCGSQVCRGSYLN 2042
              E     +   CLCG+ +C+G +LN
Sbjct: 1125 EREIG-SLDRIPCLCGTALCKG-FLN 1148


>gi|357604624|gb|EHJ64265.1| mixed-lineage leukemia protein, mll [Danaus plexippus]
          Length = 4387

 Score = 74.7 bits (182), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 55/164 (33%), Positives = 76/164 (46%), Gaps = 32/164 (19%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVW------KWFEKQDGIRSLQKNNEDP 1939
            Y ++  G G+ C ++    E D V+E+ GEV          K +E   G R +       
Sbjct: 4249 YRSHIHGRGLFCKRD--IEEGDMVIEYAGEVIRAVLADQREKKYEAMSGRRGVG------ 4300

Query: 1940 APEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIY 1999
                   Y+ R        D +VVDA  K N A  I HSC PNC ++V  + GH  I I+
Sbjct: 4301 -----GCYMFRID------DNLVVDATLKGNAARFINHSCDPNCYSRVVDIHGHKHILIF 4349

Query: 2000 TVRGIHYGEEITFDYNSVTESKEEYEASV-CLCGSQVCRGSYLN 2042
             +R I  GEE+T+DY    E     E  + C CG++ CR  YLN
Sbjct: 4350 ALRRITIGEELTYDYKFPFE-----EVKIPCTCGAKKCR-KYLN 4387


>gi|358401203|gb|EHK50509.1| hypothetical protein TRIATDRAFT_171650, partial [Trichoderma
            atroviride IMI 206040]
          Length = 1241

 Score = 74.7 bits (182), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 47/143 (32%), Positives = 70/143 (48%), Gaps = 23/143 (16%)

Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
              +DD ++E++GE        E +  I  +++N           YL+   G +  +   D
Sbjct: 1119 INKDDMIIEYVGE--------EVRQQIAEIRENR----------YLKSGIGSSYLFRIDD 1160

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
              V+DA  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    E
Sbjct: 1161 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAMNEELTYDYKFERE 1220

Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
                 +   CLCG+  C+G +LN
Sbjct: 1221 IG-SLDRIPCLCGTAACKG-FLN 1241


>gi|358389897|gb|EHK27489.1| hypothetical protein TRIVIDRAFT_34353 [Trichoderma virens Gv29-8]
          Length = 1221

 Score = 74.7 bits (182), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 47/143 (32%), Positives = 70/143 (48%), Gaps = 23/143 (16%)

Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
              +DD ++E++GE        E +  I  +++N           YL+   G +  +   D
Sbjct: 1099 INKDDMIIEYVGE--------EVRQQIAEIRENR----------YLKSGIGSSYLFRIDD 1140

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
              V+DA  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    E
Sbjct: 1141 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAMNEELTYDYKFERE 1200

Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
                 +   CLCG+  C+G +LN
Sbjct: 1201 IG-SLDRIPCLCGTAACKG-FLN 1221


>gi|301629157|ref|XP_002943714.1| PREDICTED: hypothetical protein LOC100496979 [Xenopus (Silurana)
            tropicalis]
          Length = 1666

 Score = 74.7 bits (182), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 49/160 (30%), Positives = 78/160 (48%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +        F++E+LGEV                         EF
Sbjct: 54   ERFRAEGKGWGIRTKEP--LKASQFIIEYLGEVVSET---------------------EF 90

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 91   RNRTIEQYHNHSDHYCLSLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 150

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  VC CG + CRG
Sbjct: 151  YALKDMPAGTELTYDYNFHSFNTEKQQ--VCKCGVEKCRG 188


>gi|431892339|gb|ELK02779.1| Putative histone-lysine N-methyltransferase ASH1L [Pteropus alecto]
          Length = 1291

 Score = 74.7 bits (182), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 52/160 (32%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 474  ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 510

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 511  RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 570

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y +R +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 571  YALRDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 608


>gi|340514680|gb|EGR44940.1| predicted protein [Trichoderma reesei QM6a]
          Length = 1236

 Score = 74.7 bits (182), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 47/143 (32%), Positives = 70/143 (48%), Gaps = 23/143 (16%)

Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
              +DD ++E++GE        E +  I  +++N           YL+   G +  +   D
Sbjct: 1114 INKDDMIIEYVGE--------EVRQQIAEIRENR----------YLKSGIGSSYLFRIDD 1155

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
              V+DA  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    E
Sbjct: 1156 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAMNEELTYDYKFERE 1215

Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
                 +   CLCG+  C+G +LN
Sbjct: 1216 IG-SLDRIPCLCGTAACKG-FLN 1236


>gi|302416827|ref|XP_003006245.1| histone-lysine N-methyltransferase [Verticillium albo-atrum VaMs.102]
 gi|261355661|gb|EEY18089.1| histone-lysine N-methyltransferase [Verticillium albo-atrum VaMs.102]
          Length = 1135

 Score = 74.7 bits (182), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 73/146 (50%), Gaps = 23/146 (15%)

Query: 1900 EGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYD 1959
            E    +DD ++E++GE     +  ++   IR ++             YL++  G +  + 
Sbjct: 1010 EENINKDDMIIEYVGE-----QVRQQISEIREVR-------------YLKQGMGSSYLFR 1051

Query: 1960 L---VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNS 2016
            +    V+DA  K   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY  
Sbjct: 1052 IDENTVIDATKKGGIARFINHSCMPNCTAKIIKVDGSKRIVIYALRDIARTEELTYDYKF 1111

Query: 2017 VTESKEEYEASVCLCGSQVCRGSYLN 2042
              E     +   CLCG+ +C+G +LN
Sbjct: 1112 EREIG-SLDRIPCLCGTALCKG-FLN 1135


>gi|260800140|ref|XP_002594994.1| hypothetical protein BRAFLDRAFT_99284 [Branchiostoma floridae]
 gi|229280233|gb|EEN51005.1| hypothetical protein BRAFLDRAFT_99284 [Branchiostoma floridae]
          Length = 1541

 Score = 74.7 bits (182), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 41/132 (31%), Positives = 73/132 (55%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            DFV E++GE+       ++++  R ++K +ED    FY + L++ +         ++DA 
Sbjct: 1161 DFVYEYVGEL------IDEEEVQRRIKKAHEDNVTNFYMLTLDKNR---------IIDAG 1205

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             KAN +  + HSC+PNCE +   V+G  ++G++ +  I  G E+TF+YN      E+   
Sbjct: 1206 PKANMSRFMNHSCQPNCETQKWMVNGDIRVGLFAMDDIPTGSELTFNYNLDCLGNEK--- 1262

Query: 2027 SVCLCGSQVCRG 2038
            + C CG+ +C G
Sbjct: 1263 TPCNCGAPICSG 1274


>gi|66828443|ref|XP_647576.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|60475584|gb|EAL73519.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 898

 Score = 74.7 bits (182), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 77/152 (50%), Gaps = 23/152 (15%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G++ N++    E  F++E+ GEV        KQ  +R +++   +    FY + L+
Sbjct: 626  KKGWGLIANED--IEEKQFIMEYCGEV------ISKQTCLRRMKEAENEKF--FYFLTLD 675

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              +          +DA  + N A  + HSC PNCE +   V G  +IGI+ ++ I  G E
Sbjct: 676  SKE---------CLDASKRGNLARFMNHSCDPNCETQKWTVGGEVKIGIFAIKPIPKGTE 726

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDYN      ++ E   C CGS  CRG YL
Sbjct: 727  LTFDYNYERFGAQKQE---CYCGSVNCRG-YL 754


>gi|281206847|gb|EFA81031.1| SET domain-containing protein [Polysphondylium pallidum PN500]
          Length = 1363

 Score = 74.7 bits (182), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 51/154 (33%), Positives = 73/154 (47%), Gaps = 26/154 (16%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            + A +KG G+   ++       FV+E+ GEV    +  ++            D    FY 
Sbjct: 988  FNAKKKGWGLKAKEK--ISAHQFVIEYCGEVITRAQSMDRM--------READGEKYFYF 1037

Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
            + L+  +         V+DA  K N A  I HSC PNCE +  +VDG  +IGI+ ++ I 
Sbjct: 1038 LTLDSKE---------VLDASRKGNLARFINHSCDPNCETQKWSVDGETRIGIFALKDIE 1088

Query: 2006 YGEEITFDYN--SVTESKEEYEASVCLCGSQVCR 2037
             G E+TFDYN   V  SK+      C CGS  CR
Sbjct: 1089 AGTELTFDYNYERVGSSKQS-----CYCGSVNCR 1117


>gi|194766778|ref|XP_001965501.1| GF22528 [Drosophila ananassae]
 gi|190619492|gb|EDV35016.1| GF22528 [Drosophila ananassae]
          Length = 2414

 Score = 74.3 bits (181), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 48/149 (32%), Positives = 76/149 (51%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+    +   GE  F++E++GEV    + FE++  + S  +        +Y + L 
Sbjct: 1433 KKGCGITAELQIPPGE--FIMEYVGEVI-DSEEFERRQHLYSRDRKRH-----YYFMAL- 1483

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              +G+A      ++DA  K N +  I HSC PN E +   V+G  +IG ++V+ I  GEE
Sbjct: 1484 --RGEA------IIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKTIQPGEE 1535

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            ITFDY      +   +A  C C +  CRG
Sbjct: 1536 ITFDYQYQRYGR---DAQRCYCEATNCRG 1561


>gi|427779581|gb|JAA55242.1| Putative histone-lysine n-methyltransferase setd2 [Rhipicephalus
            pulchellus]
          Length = 2038

 Score = 74.3 bits (181), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 52/155 (33%), Positives = 78/155 (50%), Gaps = 20/155 (12%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +K++  +KG G+   +    G   FV+E++GEV    + F K+  ++   ++N      +
Sbjct: 872  EKFMTEKKGWGLRTLETVSSG--TFVMEYVGEVL-TPEDFRKR--VKQYARDNNQ---HY 923

Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
            Y + L      AD     ++DA  K N +  I HSC PNCE +   V+G  +IG +T R 
Sbjct: 924  YFMALR-----ADE----IIDATQKGNVSRFINHSCDPNCETQKWTVNGELRIGFFTRRP 974

Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +  GEE+TFDY      K   EA  C C S  CRG
Sbjct: 975  LRAGEELTFDYQFQRYGK---EAQRCHCESSNCRG 1006


>gi|336372757|gb|EGO01096.1| hypothetical protein SERLA73DRAFT_50848 [Serpula lacrymans var.
            lacrymans S7.3]
          Length = 260

 Score = 74.3 bits (181), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 52/139 (37%), Positives = 68/139 (48%), Gaps = 26/139 (18%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVV 1963
            + V+E++GEV            IR+   +  +   E   I   YL R   D      +VV
Sbjct: 145  EMVIEYVGEV------------IRAQVADKREKVYERQGIGSSYLFRIDED------LVV 186

Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
            DA  K N    I HSC PNC AK+  ++G  +I IY  + I  GEEIT+DY+   E    
Sbjct: 187  DATKKGNLGRLINHSCDPNCTAKIITINGEKKIVIYAKQDIELGEEITYDYHFPIEQ--- 243

Query: 2024 YEASVCLCGSQVCRGSYLN 2042
             +   CLCGS  CRG YLN
Sbjct: 244  -DKIPCLCGSAKCRG-YLN 260


>gi|195012609|ref|XP_001983710.1| GH16034 [Drosophila grimshawi]
 gi|193897192|gb|EDV96058.1| GH16034 [Drosophila grimshawi]
          Length = 2059

 Score = 74.3 bits (181), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 48/149 (32%), Positives = 76/149 (51%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+    +   GE  F++E++GEV    + FE++  + S  +N       +Y + L 
Sbjct: 1167 KKGCGITAELQMPSGE--FIMEYVGEVIDSEE-FERRQHLYSEDRNRH-----YYFMAL- 1217

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              + D+      ++DA  K N +  I HSC PN E +   V+G  +IG ++++ I  GEE
Sbjct: 1218 --RSDS------IIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSLKTIMPGEE 1269

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            ITFDY      +   +A  C C S  CRG
Sbjct: 1270 ITFDYQYQRYGR---DAQRCYCESANCRG 1295


>gi|390355933|ref|XP_784903.3| PREDICTED: uncharacterized protein LOC579712 isoform 3
            [Strongylocentrotus purpuratus]
          Length = 3326

 Score = 74.3 bits (181), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 69/131 (52%), Gaps = 18/131 (13%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            F++E+LGEV  V + +++       QK++       Y + L       DG   +V+D   
Sbjct: 2556 FIIEYLGEVISVKELWKRALDDYQYQKHH-------YCLNL-------DGG--MVIDGYR 2599

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
              N    + HSC PNCE +   V+G Y+IG++ +R I  GEE+T+DYN  + + E  +  
Sbjct: 2600 YGNEGRFVNHSCNPNCEMQKWMVNGLYRIGMFALRDIQPGEELTYDYNFHSFNMETQQE- 2658

Query: 2028 VCLCGSQVCRG 2038
             C CG + CRG
Sbjct: 2659 -CNCGHETCRG 2668


>gi|390355935|ref|XP_003728661.1| PREDICTED: uncharacterized protein LOC579712 isoform 1
            [Strongylocentrotus purpuratus]
          Length = 3164

 Score = 74.3 bits (181), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 69/131 (52%), Gaps = 18/131 (13%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            F++E+LGEV  V + +++       QK++       Y + L       DG   +V+D   
Sbjct: 2377 FIIEYLGEVISVKELWKRALDDYQYQKHH-------YCLNL-------DGG--MVIDGYR 2420

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
              N    + HSC PNCE +   V+G Y+IG++ +R I  GEE+T+DYN  + + E  +  
Sbjct: 2421 YGNEGRFVNHSCNPNCEMQKWMVNGLYRIGMFALRDIQPGEELTYDYNFHSFNMETQQE- 2479

Query: 2028 VCLCGSQVCRG 2038
             C CG + CRG
Sbjct: 2480 -CNCGHETCRG 2489


>gi|402856517|ref|XP_003892835.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Papio anubis]
          Length = 1277

 Score = 74.3 bits (181), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 456  ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 492

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 493  RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 552

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 553  YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 590


>gi|74140676|dbj|BAC28183.2| unnamed protein product [Mus musculus]
          Length = 418

 Score = 74.3 bits (181), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 49/160 (30%), Positives = 80/160 (50%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV                         EF
Sbjct: 91   ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVVS---------------------EQEF 127

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 128  RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 187

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 188  YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 225


>gi|390355937|ref|XP_003728662.1| PREDICTED: uncharacterized protein LOC579712 isoform 2
            [Strongylocentrotus purpuratus]
          Length = 3111

 Score = 74.3 bits (181), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 69/131 (52%), Gaps = 18/131 (13%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            F++E+LGEV  V + +++       QK++       Y + L       DG   +V+D   
Sbjct: 2324 FIIEYLGEVISVKELWKRALDDYQYQKHH-------YCLNL-------DGG--MVIDGYR 2367

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
              N    + HSC PNCE +   V+G Y+IG++ +R I  GEE+T+DYN  + + E  +  
Sbjct: 2368 YGNEGRFVNHSCNPNCEMQKWMVNGLYRIGMFALRDIQPGEELTYDYNFHSFNMETQQE- 2426

Query: 2028 VCLCGSQVCRG 2038
             C CG + CRG
Sbjct: 2427 -CNCGHETCRG 2436


>gi|302910631|ref|XP_003050330.1| histone H3 methyltransferase complex protein [Nectria haematococca
            mpVI 77-13-4]
 gi|256731267|gb|EEU44617.1| histone H3 methyltransferase complex protein [Nectria haematococca
            mpVI 77-13-4]
          Length = 1281

 Score = 74.3 bits (181), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 46/143 (32%), Positives = 70/143 (48%), Gaps = 23/143 (16%)

Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL-- 1960
              +DD ++E++GE        E +  I  +++N           YL+   G +  + +  
Sbjct: 1159 IAKDDMIIEYVGE--------EVRQQIAEIRENR----------YLKSGIGSSYLFRIDE 1200

Query: 1961 -VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
              V+DA  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    E
Sbjct: 1201 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAMNEELTYDYKFERE 1260

Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
                 +   CLCG+  C+G +LN
Sbjct: 1261 IG-SLDRIPCLCGTAACKG-FLN 1281


>gi|320593249|gb|EFX05658.1| set domain containing protein [Grosmannia clavigera kw1407]
          Length = 1450

 Score = 73.9 bits (180), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 37/84 (44%), Positives = 47/84 (55%), Gaps = 2/84 (2%)

Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            D  V+DA  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    
Sbjct: 1369 DGTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIGQNEELTYDYKFEP 1428

Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
            E   E     CLCG+  C+G +LN
Sbjct: 1429 EDNPEDRVP-CLCGTTACKG-FLN 1450


>gi|195448204|ref|XP_002071555.1| GK25076 [Drosophila willistoni]
 gi|194167640|gb|EDW82541.1| GK25076 [Drosophila willistoni]
          Length = 2217

 Score = 73.9 bits (180), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 77/149 (51%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+    +   GE  F++E++GEV    + FE++  + S  +N       +Y + L 
Sbjct: 1167 KKGCGITAELQIPPGE--FIMEYVGEVIDAEE-FERRQHLYSKDRNRH-----YYFMAL- 1217

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              +G+A      ++DA  K N +  I HSC PN E +   V+G  +IG ++V+ I  GEE
Sbjct: 1218 --RGEA------IIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKTILPGEE 1269

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            ITFDY      +   +A  C C +  CRG
Sbjct: 1270 ITFDYQY---QRYGRDAQRCYCEAINCRG 1295


>gi|296422581|ref|XP_002840838.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295637063|emb|CAZ85029.1| unnamed protein product [Tuber melanosporum]
          Length = 1200

 Score = 73.9 bits (180), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 38/84 (45%), Positives = 48/84 (57%), Gaps = 2/84 (2%)

Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            D  V+DA      A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    
Sbjct: 1119 DTTVIDATKAGGIARFINHSCTPNCTAKIIKVEGSKRIVIYALRDIRENEELTYDYKFER 1178

Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
            E + E E   CLCGS  C+G +LN
Sbjct: 1179 ELESE-ERIPCLCGSSGCKG-FLN 1200


>gi|408391029|gb|EKJ70413.1| hypothetical protein FPSE_09407 [Fusarium pseudograminearum CS3096]
          Length = 1263

 Score = 73.9 bits (180), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 46/143 (32%), Positives = 70/143 (48%), Gaps = 23/143 (16%)

Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
              +DD ++E++GE        + +  I  +++N           YL+   G +  +   D
Sbjct: 1141 IAKDDMIIEYVGE--------QVRQQISEIRENR----------YLKSGIGSSYLFRIDD 1182

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
              V+DA  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    E
Sbjct: 1183 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIALNEELTYDYKFERE 1242

Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
                 +   CLCG+  C+G +LN
Sbjct: 1243 IG-STDRIPCLCGTAACKG-FLN 1263


>gi|168009924|ref|XP_001757655.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691349|gb|EDQ77712.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1715

 Score = 73.9 bits (180), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 46/133 (34%), Positives = 69/133 (51%), Gaps = 21/133 (15%)

Query: 1908 FVVEFLGEVY--PVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDA 1965
            F++E++GEV   P ++  +K+  + S QK+       FY + L   +         ++DA
Sbjct: 766  FIIEYVGEVLDMPSFEARQKEYSMNS-QKH-------FYFMTLSANE---------IIDA 808

Query: 1966 MHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE 2025
             +K N    I HSC PNC+ +   VDG   IG++ +R I   EE+TFDYN V       +
Sbjct: 809  CNKGNLGRFINHSCEPNCQTEKWMVDGEVCIGLFAIRDIKEREEVTFDYNFVRVGG--AD 866

Query: 2026 ASVCLCGSQVCRG 2038
            A  C CG+  CRG
Sbjct: 867  AKKCECGASKCRG 879


>gi|260791327|ref|XP_002590691.1| hypothetical protein BRAFLDRAFT_125552 [Branchiostoma floridae]
 gi|229275887|gb|EEN46702.1| hypothetical protein BRAFLDRAFT_125552 [Branchiostoma floridae]
          Length = 2482

 Score = 73.9 bits (180), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
            +++DA    N A  I H C PNC AK+  V+G+ +I IY+ R I   EEIT+DY    E 
Sbjct: 2406 MIIDATKNGNLARFINHCCNPNCYAKIITVEGYKKIVIYSRRDIAVNEEITYDYKFPIED 2465

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
                E   CLCG++ CRG+
Sbjct: 2466 ----EKIPCLCGAENCRGT 2480


>gi|410516926|sp|Q4I5R3.2|SET1_GIBZE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
          Length = 1263

 Score = 73.9 bits (180), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 46/143 (32%), Positives = 70/143 (48%), Gaps = 23/143 (16%)

Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
              +DD ++E++GE        + +  I  +++N           YL+   G +  +   D
Sbjct: 1141 IAKDDMIIEYVGE--------QVRQQISEIRENR----------YLKSGIGSSYLFRIDD 1182

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
              V+DA  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    E
Sbjct: 1183 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIALNEELTYDYKFERE 1242

Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
                 +   CLCG+  C+G +LN
Sbjct: 1243 IG-STDRIPCLCGTAACKG-FLN 1263


>gi|170591502|ref|XP_001900509.1| SET domain containing protein [Brugia malayi]
 gi|158592121|gb|EDP30723.1| SET domain containing protein [Brugia malayi]
          Length = 1056

 Score = 73.9 bits (180), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 37/79 (46%), Positives = 48/79 (60%), Gaps = 6/79 (7%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYN-SVTES 2020
            V+DA    N A  I HSC+PNC AK+  VDG  +I IY+   I+ G+EIT+DY   + E 
Sbjct: 981  VIDATQMGNLARFINHSCQPNCYAKIVVVDGEKRIVIYSKLAINKGDEITYDYKFPIEED 1040

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
            K +     CLCG+  CRGS
Sbjct: 1041 KID-----CLCGAPGCRGS 1054


>gi|154422490|ref|XP_001584257.1| SET domain containing protein [Trichomonas vaginalis G3]
 gi|121918503|gb|EAY23271.1| SET domain containing protein [Trichomonas vaginalis G3]
          Length = 259

 Score = 73.9 bits (180), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/86 (47%), Positives = 50/86 (58%), Gaps = 3/86 (3%)

Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            D + +DA HK   A  + HSC PNC+  V    G   I I+  + I   EE+T+DYN   
Sbjct: 158  DDLYIDATHKGGIARFLNHSCDPNCKTCVVEAGGQRHIVIFAKKKIEPFEELTYDYNLPY 217

Query: 2019 ESKEEYEASVCLCGSQVCRGSYLNLT 2044
            ESKE  +A VCLCGS  CRG YLN T
Sbjct: 218  ESKE--KAIVCLCGSPKCRG-YLNYT 240


>gi|158301050|ref|XP_001238385.2| AGAP011688-PA [Anopheles gambiae str. PEST]
 gi|157013454|gb|EAU75883.2| AGAP011688-PA [Anopheles gambiae str. PEST]
          Length = 2404

 Score = 73.9 bits (180), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 52/159 (32%), Positives = 80/159 (50%), Gaps = 21/159 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+  +     GE  F++E++GEV    ++ E+ +   S +KN       +Y + L 
Sbjct: 1289 KKGFGIQASSAIAPGE--FIMEYVGEVLNSAQFDERAEAY-SREKNKH-----YYFMALR 1340

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
                 +DG    ++DA  K N +  I HSC PN E +   V+G  +IG ++ + I  GEE
Sbjct: 1341 -----SDG----IIDATTKGNISRFINHSCDPNAETQKWTVNGELRIGFFSTKYILPGEE 1391

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSY-LNLTGEG 2047
            ITFDY      +   +A  C C ++ CRG      TGEG
Sbjct: 1392 ITFDYQFQRYGR---KAQKCYCEAESCRGWIGAKPTGEG 1427


>gi|391325531|ref|XP_003737286.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like
            [Metaseiulus occidentalis]
          Length = 976

 Score = 73.9 bits (180), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 38/82 (46%), Positives = 49/82 (59%), Gaps = 5/82 (6%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N A  I HSC PNC A+V  V+G  +I IY+ R I   EEIT+DY      
Sbjct: 900  TIIDATKCGNLARFINHSCNPNCYARVITVEGQKKIVIYSKRDISVNEEITYDYKF---P 956

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
            +EE + + CLCG+  CRG YLN
Sbjct: 957  REEVKIT-CLCGTPQCRG-YLN 976


>gi|430813239|emb|CCJ29409.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 375

 Score = 73.9 bits (180), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 49/133 (36%), Positives = 64/133 (48%), Gaps = 19/133 (14%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            D V+E++GE+       +    IR  Q   +         YL R   D       VVDA 
Sbjct: 260  DMVIEYVGEIVR-----QTVADIRERQYERQGIGSS----YLFRIDDDT------VVDAT 304

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K N A  I HSC P+C AK+  V+G  +I IY  R I  GEEIT+DY    E  +    
Sbjct: 305  KKGNIARFINHSCDPSCTAKIIRVEGEKKIVIYAHRDIEKGEEITYDYKFPIEDVK---- 360

Query: 2027 SVCLCGSQVCRGS 2039
              CLCG++ CRG+
Sbjct: 361  IPCLCGAKACRGT 373


>gi|295913201|gb|ADG57859.1| transcription factor [Lycoris longituba]
          Length = 164

 Score = 73.9 bits (180), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 51/134 (38%), Positives = 67/134 (50%), Gaps = 19/134 (14%)

Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDA 1965
            +DFV+E++GE+        +   IR  Q             YL R     DGY   VVDA
Sbjct: 48   EDFVIEYVGELVR-----RQISDIRECQYEKMGIGSS----YLFRLD---DGY---VVDA 92

Query: 1966 MHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE 2025
              +   A  I HSC PNC  KV  V+G  +I IY  R IH GEE+T++Y    E ++   
Sbjct: 93   TKRGGIARFINHSCEPNCYTKVITVEGQKKIFIYAKRHIHAGEELTYNYKFPLEEQK--- 149

Query: 2026 ASVCLCGSQVCRGS 2039
              +C CGS+ CRGS
Sbjct: 150  -ILCNCGSKRCRGS 162


>gi|346322948|gb|EGX92546.1| histone-lysine N-methyltransferase [Cordyceps militaris CM01]
          Length = 1151

 Score = 73.9 bits (180), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 46/141 (32%), Positives = 70/141 (49%), Gaps = 23/141 (16%)

Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL---V 1961
            +DD ++E++GE        E +  I  +++N           YL+   G +  + +    
Sbjct: 1031 KDDMIIEYVGE--------EVRQQISEIRENR----------YLKSGIGSSYLFRIDENT 1072

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            V+DA  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    E  
Sbjct: 1073 VIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDITTNEELTYDYKFEREIG 1132

Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
               +   CLCG+  C+G +LN
Sbjct: 1133 -SLDRIPCLCGTAACKG-FLN 1151


>gi|413937237|gb|AFW71788.1| hypothetical protein ZEAMMB73_686749 [Zea mays]
          Length = 1815

 Score = 73.6 bits (179), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 73/152 (48%), Gaps = 20/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   ++    E  F++E++GEV  +  +  +Q       + +      FY + L 
Sbjct: 1077 KKGYGLQLQED--VTEGRFLIEYVGEVLDITSYESRQRYYACKGQKH------FYFMALN 1128

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              +         V+DA  K N    I HSC PNC  +   V+G   IGI+ +R I  GEE
Sbjct: 1129 GGE---------VIDACTKGNLGRFINHSCSPNCCTEKWMVNGEVCIGIFALRSIKKGEE 1179

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDYN V  S    +   C CG+  CRG YL
Sbjct: 1180 LTFDYNYVRVSGAAPQK--CFCGTAKCRG-YL 1208


>gi|353243391|emb|CCA74938.1| related to regulatory protein SET1 [Piriformospora indica DSM 11827]
          Length = 1224

 Score = 73.6 bits (179), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 37/84 (44%), Positives = 48/84 (57%), Gaps = 5/84 (5%)

Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            D +VVDA    N    I HSC PNC AK+  + G  +I IY    IH G+E+T+DY+   
Sbjct: 1146 DDLVVDATKIGNLGRLINHSCDPNCTAKIITIGGQKKIVIYAKVDIHPGDEVTYDYHFPI 1205

Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
            E+    E   CLCG+  CRG +LN
Sbjct: 1206 EN----EKIPCLCGAAKCRG-FLN 1224


>gi|413937236|gb|AFW71787.1| hypothetical protein ZEAMMB73_686749 [Zea mays]
          Length = 1756

 Score = 73.6 bits (179), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 73/152 (48%), Gaps = 20/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   ++    E  F++E++GEV  +  +  +Q       + +      FY + L 
Sbjct: 1018 KKGYGLQLQED--VTEGRFLIEYVGEVLDITSYESRQRYYACKGQKH------FYFMALN 1069

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              +         V+DA  K N    I HSC PNC  +   V+G   IGI+ +R I  GEE
Sbjct: 1070 GGE---------VIDACTKGNLGRFINHSCSPNCCTEKWMVNGEVCIGIFALRSIKKGEE 1120

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDYN V  S    +   C CG+  CRG YL
Sbjct: 1121 LTFDYNYVRVSGAAPQK--CFCGTAKCRG-YL 1149


>gi|400596097|gb|EJP63881.1| histone H3 methyltransferase complex protein [Beauveria bassiana
            ARSEF 2860]
          Length = 1220

 Score = 73.6 bits (179), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 46/141 (32%), Positives = 70/141 (49%), Gaps = 23/141 (16%)

Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL---V 1961
            +DD ++E++GE        E +  I  +++N           YL+   G +  + +    
Sbjct: 1100 KDDMIIEYVGE--------EVRQQISEIRENR----------YLKSGIGSSYLFRIDENT 1141

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            V+DA  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    E  
Sbjct: 1142 VIDATKKGGIARFINHSCLPNCTAKIIKVEGSKRIVIYALREIAMNEELTYDYKFEREIG 1201

Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
               +   CLCG+  C+G +LN
Sbjct: 1202 -SLDRIPCLCGTAACKG-FLN 1220


>gi|407921620|gb|EKG14761.1| hypothetical protein MPH_08036 [Macrophomina phaseolina MS6]
          Length = 1167

 Score = 73.6 bits (179), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 50/137 (36%), Positives = 71/137 (51%), Gaps = 17/137 (12%)

Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDA 1965
            +D ++E++GE     K  +K   IR ++ + +       + YL R   D+      VVDA
Sbjct: 1048 NDMIIEYVGE-----KVRQKVADIREIKYDKQG----VGSSYLFRIDEDS------VVDA 1092

Query: 1966 MHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE 2025
              K   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E   + +
Sbjct: 1093 TKKGGIARFINHSCSPNCTAKIIRVDGTKRIVIYALRDIKTNEELTYDYKFEREIGSD-D 1151

Query: 2026 ASVCLCGSQVCRGSYLN 2042
               CLCGS  C+G +LN
Sbjct: 1152 RIPCLCGSVNCKG-FLN 1167


>gi|47219458|emb|CAG10822.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 2598

 Score = 73.6 bits (179), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 51/153 (33%), Positives = 77/153 (50%), Gaps = 30/153 (19%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+   +    G+  F++E+LGEV              S Q        EF +  +E+
Sbjct: 1755 KGWGIRTKQPLRAGQ--FIIEYLGEVV-------------SEQ--------EFRSRMMEQ 1791

Query: 1951 PKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
                +  Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG++ +  I 
Sbjct: 1792 YFSHSGNYCLNLDSGMVIDSYRMGNEARFINHSCEPNCEMQKWSVNGVYRIGLFALGEIP 1851

Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
             G E+T+DYN  + + EE +A  C CGS+ CRG
Sbjct: 1852 SGTELTYDYNFHSFNTEEQQA--CKCGSESCRG 1882


>gi|301122693|ref|XP_002909073.1| histone-lysine N-methyltransferase, putative [Phytophthora infestans
            T30-4]
 gi|262099835|gb|EEY57887.1| histone-lysine N-methyltransferase, putative [Phytophthora infestans
            T30-4]
          Length = 751

 Score = 73.6 bits (179), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 57/176 (32%), Positives = 86/176 (48%), Gaps = 27/176 (15%)

Query: 1863 CDVRTMKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKW 1922
            C  R +K  R  LK+M      +Y+    G G++ N++   GE  FV+E++GEV      
Sbjct: 190  CSNRAIK--RRQLKSMRV----EYIPGGPGFGLITNEDINAGE--FVIEYVGEV------ 235

Query: 1923 FEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPN 1982
             + ++  R +    ++    FY + LE+          +V+DA +++N +  I HSC PN
Sbjct: 236  IDDKECERRMITYRDNGEVNFYMMELEKN---------IVIDAKYRSNDSRFINHSCDPN 286

Query: 1983 CEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
               +   VDG  +IGI+  R I   EEIT DYN         EA+ C CGS  C G
Sbjct: 287  SVTQKWNVDGMQRIGIFARRNIAPNEEITIDYN----FSHFGEAADCRCGSTACTG 338


>gi|390605099|gb|EIN14490.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
            SS5]
          Length = 164

 Score = 73.6 bits (179), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 52/139 (37%), Positives = 69/139 (49%), Gaps = 26/139 (18%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVV 1963
            + V+E++GEV            IR+   +  + A E   I   YL R   D      +VV
Sbjct: 49   EMVIEYVGEV------------IRAQVADKREKAYERQGIGSSYLFRIDED------LVV 90

Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
            DA  K N    I HSC PNC AK+  ++G  +I IY  + I  G+EIT+DY+   E    
Sbjct: 91   DATKKGNLGRLINHSCDPNCTAKIITINGEKKIVIYAKQDIELGDEITYDYHFPIEQ--- 147

Query: 2024 YEASVCLCGSQVCRGSYLN 2042
             +   CLCGS  CRG YLN
Sbjct: 148  -DKIPCLCGSAKCRG-YLN 164


>gi|116199091|ref|XP_001225357.1| hypothetical protein CHGG_07701 [Chaetomium globosum CBS 148.51]
 gi|121922631|sp|Q2GWF3.1|SET1_CHAGB RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|88178980|gb|EAQ86448.1| hypothetical protein CHGG_07701 [Chaetomium globosum CBS 148.51]
          Length = 1076

 Score = 73.6 bits (179), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 48/141 (34%), Positives = 70/141 (49%), Gaps = 23/141 (16%)

Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---DLV 1961
            +DD ++E++GE        E +  I  L++N           YL+   G +  +   D  
Sbjct: 956  KDDMIIEYVGE--------EVRQQIAELRENR----------YLKSGIGSSYLFRIDDNT 997

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            V+DA  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    E  
Sbjct: 998  VIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERELG 1057

Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
               +   CLCG+  C+G +LN
Sbjct: 1058 ST-DRIPCLCGTAACKG-FLN 1076


>gi|358338843|dbj|GAA57433.1| histone-lysine N-methyltransferase trithorax, partial [Clonorchis
            sinensis]
          Length = 328

 Score = 73.6 bits (179), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 51/141 (36%), Positives = 69/141 (48%), Gaps = 21/141 (14%)

Query: 1902 GFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLV 1961
            GF ED+ V+E++GE+   +    ++   RS             + Y+ R   D      +
Sbjct: 209  GFREDEMVIEYMGELIRNFVCETREIRYRSAG----------VDCYMFRIDSD------L 252

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            V+DA +  N A  I HSC PNC AKV  VD    I I   R I+ GEE+T+DY    ES 
Sbjct: 253  VIDATYAGNAARFINHSCDPNCYAKVVTVDDKKHIVILAQRRIYPGEELTYDYRFPKES- 311

Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
               +  +C CGS  CR  YLN
Sbjct: 312  ---DKLLCNCGSYNCR-KYLN 328


>gi|170587756|ref|XP_001898640.1| SET domain containing protein [Brugia malayi]
 gi|158593910|gb|EDP32504.1| SET domain containing protein [Brugia malayi]
          Length = 1449

 Score = 73.2 bits (178), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 45/133 (33%), Positives = 69/133 (51%), Gaps = 21/133 (15%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            F++E++GEV       + ++ IR  ++  +DP  +  + YL   K  A      V+DA  
Sbjct: 657  FIIEYVGEV------IDAEEMIRRGRRYGKDP--KHVHHYLMALKNGA------VIDATA 702

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY--E 2025
            K N +  I HSC PNCE++   V+   ++G + ++ I  GEEI FDY       E Y  +
Sbjct: 703  KGNVSRFINHSCDPNCESQKWTVNRQLRVGFFVIKPIALGEEIVFDYQL-----ERYGRK 757

Query: 2026 ASVCLCGSQVCRG 2038
            A  C CG+  CRG
Sbjct: 758  AQRCFCGAANCRG 770


>gi|429862241|gb|ELA36898.1| histone-lysine n-methyltransferase [Colletotrichum gloeosporioides
            Nara gc5]
          Length = 1270

 Score = 73.2 bits (178), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 70/146 (47%), Gaps = 23/146 (15%)

Query: 1900 EGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY- 1958
            E    +DD ++E++GE        + +  I  +++            YL+   G +  + 
Sbjct: 1145 EENINKDDMIIEYVGE--------QVRQSISEIREKR----------YLKSGMGSSYLFR 1186

Query: 1959 --DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNS 2016
              D  V+DA  K   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY  
Sbjct: 1187 IDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVDGSKRIVIYALRDIAQHEELTYDYKF 1246

Query: 2017 VTESKEEYEASVCLCGSQVCRGSYLN 2042
              E     +   CLCG+  C+G +LN
Sbjct: 1247 EREIG-SLDRIPCLCGTAACKG-FLN 1270


>gi|357161607|ref|XP_003579145.1| PREDICTED: uncharacterized protein LOC100843412 [Brachypodium
            distachyon]
          Length = 1194

 Score = 73.2 bits (178), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 54/139 (38%), Positives = 67/139 (48%), Gaps = 29/139 (20%)

Query: 1906 DDFVVEFLGEVY--PVWKWFEKQ---DGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL 1960
            +DFV+E++GE+   PV    E Q    GI S               YL R   D      
Sbjct: 1078 EDFVIEYVGELIRRPVSDIREAQYEKSGIGS--------------SYLFRLDDD------ 1117

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             VVDA  +   A  I HSC PNC  KV  VDG  +I IY+ R I+ GEE+T++Y    E 
Sbjct: 1118 YVVDATKRGGLARFINHSCEPNCYTKVITVDGQKKIFIYSKRRIYAGEELTYNYKFPLEE 1177

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
            K+      C CGS  CRGS
Sbjct: 1178 KK----IPCHCGSLRCRGS 1192


>gi|336385606|gb|EGO26753.1| hypothetical protein SERLADRAFT_385814 [Serpula lacrymans var.
            lacrymans S7.9]
          Length = 115

 Score = 73.2 bits (178), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 39/82 (47%), Positives = 48/82 (58%), Gaps = 5/82 (6%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
            +VVDA  K N    I HSC PNC AK+  ++G  +I IY  + I  GEEIT+DY+   E 
Sbjct: 39   LVVDATKKGNLGRLINHSCDPNCTAKIITINGEKKIVIYAKQDIELGEEITYDYHFPIEQ 98

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
                +   CLCGS  CRG YLN
Sbjct: 99   ----DKIPCLCGSAKCRG-YLN 115


>gi|328768995|gb|EGF79040.1| hypothetical protein BATDEDRAFT_35515 [Batrachochytrium dendrobatidis
            JAM81]
          Length = 1367

 Score = 73.2 bits (178), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 48/137 (35%), Positives = 65/137 (47%), Gaps = 25/137 (18%)

Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVV 1962
            +D V+E++GE+            IR    ++ +   E   I   YL R   D       +
Sbjct: 1251 NDMVIEYIGEI------------IRQKVADHREKLYEASGIGSSYLFRVDED------TI 1292

Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
            +DA    N A  I H C PNC AKV +VDG  +I IY  R I  GEE+T+DY    E   
Sbjct: 1293 IDATKTGNLARFINHCCEPNCNAKVISVDGTKRIVIYANRDIKEGEELTYDYKFPIEE-- 1350

Query: 2023 EYEASVCLCGSQVCRGS 2039
              +   CLCG+  CRG+
Sbjct: 1351 --DKIPCLCGAVNCRGT 1365


>gi|19115892|ref|NP_594980.1| histone lysine methyltransferase Set2 [Schizosaccharomyces pombe
            972h-]
 gi|74626626|sp|O14026.1|SET2_SCHPO RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
            specific; AltName: Full=Lysine N-methyltransferase 3;
            AltName: Full=SET domain-containing protein 2
 gi|2408044|emb|CAB16247.1| histone lysine methyltransferase Set2 [Schizosaccharomyces pombe]
          Length = 798

 Score = 73.2 bits (178), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 47/155 (30%), Positives = 75/155 (48%), Gaps = 20/155 (12%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            D ++  +KG G+    +    +D FV E++GEV P  K+ ++      +++ + +    F
Sbjct: 183  DVFLTEKKGFGL--RADANLPKDTFVYEYIGEVIPEQKFRKR------MRQYDSEGIKHF 234

Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
            Y + L+  KG+        +DA  + + A    HSCRPNC      V    ++GI+  R 
Sbjct: 235  YFMMLQ--KGE-------YIDATKRGSLARFCNHSCRPNCYVDKWMVGDKLRMGIFCKRD 285

Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            I  GEE+TFDYN     +   +A  C CG   C G
Sbjct: 286  IIRGEELTFDYNV---DRYGAQAQPCYCGEPCCVG 317


>gi|409047697|gb|EKM57176.1| hypothetical protein PHACADRAFT_142398 [Phanerochaete carnosa
            HHB-10118-sp]
          Length = 1389

 Score = 73.2 bits (178), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 50/139 (35%), Positives = 67/139 (48%), Gaps = 26/139 (18%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVV 1963
            + V+E++GE+            IR+   +  + A E   I   YL R   D      +VV
Sbjct: 1274 EMVIEYVGEI------------IRAQVADKREKAYERQGIGSSYLFRIDED------LVV 1315

Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
            DA  K N    I HSC PNC AK+  ++   +I IY  + I  G EIT+DY+   E    
Sbjct: 1316 DATKKGNLGRLINHSCDPNCTAKIITINSEKKIVIYAKQDIELGSEITYDYHFPIEQ--- 1372

Query: 2024 YEASVCLCGSQVCRGSYLN 2042
             +   CLCGS  CRG YLN
Sbjct: 1373 -DKIPCLCGSAKCRG-YLN 1389


>gi|302141761|emb|CBI18964.3| unnamed protein product [Vitis vinifera]
          Length = 1958

 Score = 73.2 bits (178), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 72/149 (48%), Gaps = 19/149 (12%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   ++   G+  F++E++GEV  +  +  +Q    S    +      FY + L 
Sbjct: 1274 KKGYGLQLQQDISQGQ--FLIEYVGEVLDLQTYEARQKEYASRGHKH------FYFMTLN 1325

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              +         V+DA  K N    I HSC PNC  +   V+G   IG++ +R I  GEE
Sbjct: 1326 GSE---------VIDACAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKKGEE 1376

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +TFDYN V        A  C+CGS  CRG
Sbjct: 1377 VTFDYNYVRVFG--AAAKKCVCGSPQCRG 1403


>gi|402594990|gb|EJW88916.1| SET domain-containing protein [Wuchereria bancrofti]
          Length = 1425

 Score = 73.2 bits (178), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 45/133 (33%), Positives = 69/133 (51%), Gaps = 21/133 (15%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            F++E++GEV       + ++ IR  ++  +DP  +  + YL   K  A      V+DA  
Sbjct: 630  FIIEYVGEV------IDAEEMIRRGRRYGKDP--KHVHHYLMALKNGA------VIDATA 675

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY--E 2025
            K N +  I HSC PNCE++   V+   ++G + ++ I  GEEI FDY       E Y  +
Sbjct: 676  KGNVSRFINHSCDPNCESQKWTVNRQLRVGFFVIKPIALGEEIVFDYQL-----ERYGRK 730

Query: 2026 ASVCLCGSQVCRG 2038
            A  C CG+  CRG
Sbjct: 731  AQRCFCGAANCRG 743


>gi|380494835|emb|CCF32851.1| SET domain-containing protein [Colletotrichum higginsianum]
          Length = 1257

 Score = 73.2 bits (178), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 70/146 (47%), Gaps = 23/146 (15%)

Query: 1900 EGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY- 1958
            E    +DD ++E++GE        + +  I  +++            YL+   G +  + 
Sbjct: 1132 EENINKDDMIIEYVGE--------QVRQSISEIREKR----------YLKSGMGSSYLFR 1173

Query: 1959 --DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNS 2016
              D  V+DA  K   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY  
Sbjct: 1174 IDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVDGSKRIVIYALRDIGQHEELTYDYKF 1233

Query: 2017 VTESKEEYEASVCLCGSQVCRGSYLN 2042
              E     +   CLCG+  C+G +LN
Sbjct: 1234 EREIG-SLDRIPCLCGTAACKG-FLN 1257


>gi|392560212|gb|EIW53395.1| hypothetical protein TRAVEDRAFT_154887 [Trametes versicolor FP-101664
            SS1]
          Length = 1014

 Score = 73.2 bits (178), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 49/136 (36%), Positives = 65/136 (47%), Gaps = 25/136 (18%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVV 1963
            + V+E++GEV            IR+   +  + A E   I   YL R   D      +VV
Sbjct: 899  EMVIEYVGEV------------IRAQVADKREKAYERQGIGSSYLFRIDED------LVV 940

Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
            DA  K N    I HSC PNC AK+  + G  +I IY  + I  G EIT+DY+   E    
Sbjct: 941  DATKKGNLGRLINHSCDPNCTAKIITISGEKKIVIYAKQDIELGSEITYDYHFPIEQ--- 997

Query: 2024 YEASVCLCGSQVCRGS 2039
             +   CLCGS  CRG+
Sbjct: 998  -DKIPCLCGSAKCRGT 1012


>gi|30704948|gb|AAH52194.1| Ash1l protein, partial [Mus musculus]
          Length = 963

 Score = 73.2 bits (178), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 49/160 (30%), Positives = 80/160 (50%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV                         EF
Sbjct: 143  ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVVS---------------------EQEF 179

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 180  RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 239

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 240  YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 277


>gi|224063022|ref|XP_002300966.1| SET domain protein [Populus trichocarpa]
 gi|222842692|gb|EEE80239.1| SET domain protein [Populus trichocarpa]
          Length = 605

 Score = 73.2 bits (178), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 51/149 (34%), Positives = 73/149 (48%), Gaps = 19/149 (12%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   ++   G+  F++E++GEV  V  +  +Q    S    +      FY + L 
Sbjct: 136  KKGFGLRLEEDITRGQ--FLIEYVGEVLDVHAYEARQKEYASKGHKH------FYFMTL- 186

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
                  DG +  V+DA  K N    I HSC PNC  +   V+G   IG++ +R I  GEE
Sbjct: 187  ------DGSE--VIDACVKGNLGRFINHSCDPNCRTEKWVVNGEICIGLFALRDIKKGEE 238

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +TFDYN V        A  C CGS  C+G
Sbjct: 239  VTFDYNYVRVVGA--AAKRCYCGSPQCQG 265


>gi|242019388|ref|XP_002430143.1| histone-lysine N-methyltransferase SUVR5, putative [Pediculus humanus
            corporis]
 gi|212515234|gb|EEB17405.1| histone-lysine N-methyltransferase SUVR5, putative [Pediculus humanus
            corporis]
          Length = 1448

 Score = 72.8 bits (177), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 50/149 (33%), Positives = 76/149 (51%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+    E     + F++E++GEV       +K+ G R ++   ++    FY + L 
Sbjct: 571  KKGFGL--RAEEDLSGNTFIMEYVGEVVN-----QKEFG-RRVKMYAKENNKHFYFMAL- 621

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              KGDA      V+DA +K N +  I HSC PN E +   ++G  ++G +T R +  GEE
Sbjct: 622  --KGDA------VIDATNKGNISRFINHSCDPNAETQKWTINGELRVGFFTRRFVAAGEE 673

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            ITFDY      K   +A  C C +  CRG
Sbjct: 674  ITFDYQFQRYGK---QAQKCYCEASNCRG 699


>gi|384484496|gb|EIE76676.1| hypothetical protein RO3G_01380 [Rhizopus delemar RA 99-880]
          Length = 565

 Score = 72.8 bits (177), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 48/133 (36%), Positives = 64/133 (48%), Gaps = 19/133 (14%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            D V+E++GEV        +Q      +K+ E       + YL R   D      +V+DA 
Sbjct: 450  DIVIEYIGEVI-------RQQVAEIREKHYERIG--IGSSYLFRVDDD------MVIDAT 494

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K   A  I H C PNC AK+  VD   ++ IY  R I  GEEIT+DY    E+    E 
Sbjct: 495  KKGGMARFINHCCTPNCSAKIITVDKQKKVVIYANRDIEPGEEITYDYKFPIEA----EK 550

Query: 2027 SVCLCGSQVCRGS 2039
              C CGS+ C+GS
Sbjct: 551  IPCFCGSKFCKGS 563


>gi|345480373|ref|XP_001606723.2| PREDICTED: hypothetical protein LOC100123115 [Nasonia vitripennis]
          Length = 1746

 Score = 72.8 bits (177), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 48/147 (32%), Positives = 72/147 (48%), Gaps = 22/147 (14%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            GL    N E G    DF++E++GEV       + +D  +  ++ ++D    +Y + L   
Sbjct: 858  GLRATTNLEAG----DFIMEYVGEV------LDPKDFRKRAKEYSKDKNRHYYFMAL--- 904

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            K D       ++DA  K N +  I HSC PN E +   V+G  +IG +  + +  GEEIT
Sbjct: 905  KSDQ------IIDATMKGNISRFINHSCDPNAETQKWTVNGELRIGFFNKKFVAAGEEIT 958

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRG 2038
            FDY+     K   EA  C C +  CRG
Sbjct: 959  FDYHFQRYGK---EAQKCFCEATNCRG 982


>gi|448106516|ref|XP_004200765.1| Piso0_003363 [Millerozyma farinosa CBS 7064]
 gi|448109616|ref|XP_004201396.1| Piso0_003363 [Millerozyma farinosa CBS 7064]
 gi|359382187|emb|CCE81024.1| Piso0_003363 [Millerozyma farinosa CBS 7064]
 gi|359382952|emb|CCE80259.1| Piso0_003363 [Millerozyma farinosa CBS 7064]
          Length = 1062

 Score = 72.8 bits (177), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 48/82 (58%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C P+C AK+  VDG  +I IY +R I   EE+T+DY    E+
Sbjct: 983  TVIDATKKGGIARFINHCCSPSCTAKIIKVDGKKRIVIYALRDIDKNEELTYDYKFERET 1042

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +E E   CLCG+  C+G YLN
Sbjct: 1043 NDE-ERIRCLCGAPGCKG-YLN 1062


>gi|378725927|gb|EHY52386.1| histone-lysine N-methyltransferase SETD1 [Exophiala dermatitidis
            NIH/UT8656]
          Length = 1277

 Score = 72.8 bits (177), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 38/82 (46%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             VVDA  K   A  I HSC PNC AK+  V G  +I IY +R I   EE+T+DY    E 
Sbjct: 1198 TVVDATKKGGIARFINHSCSPNCTAKIIRVGGTKRIVIYALRDIEKDEELTYDYKFEREI 1257

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS VC+G +LN
Sbjct: 1258 DSD-DRIPCLCGSAVCKG-FLN 1277


>gi|330792328|ref|XP_003284241.1| hypothetical protein DICPUDRAFT_27300 [Dictyostelium purpureum]
 gi|325085814|gb|EGC39214.1| hypothetical protein DICPUDRAFT_27300 [Dictyostelium purpureum]
          Length = 151

 Score = 72.8 bits (177), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 51/150 (34%), Positives = 74/150 (49%), Gaps = 26/150 (17%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G++  +     + DFV+E+ GEV        K   +  +Q+N  +    FY + L  
Sbjct: 1    KGWGLISCEN--INKGDFVMEYCGEV------ISKTTCLNRMQENENEKF--FYFLTLNS 50

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +          +DA  + N A  I HSC PNCE +   V G  +IGI++++ I  G E+
Sbjct: 51   KE---------CLDASRRGNLARFINHSCDPNCETQKWIVGGEVKIGIFSIKPIEKGTEL 101

Query: 2011 TFDYN--SVTESKEEYEASVCLCGSQVCRG 2038
            TFDYN      SK+E     C CGS+ CRG
Sbjct: 102  TFDYNYERFGASKQE-----CYCGSKNCRG 126


>gi|115489550|ref|NP_001067262.1| Os12g0613200 [Oryza sativa Japonica Group]
 gi|108862955|gb|ABA99391.2| SET domain containing protein, expressed [Oryza sativa Japonica
            Group]
 gi|113649769|dbj|BAF30281.1| Os12g0613200 [Oryza sativa Japonica Group]
          Length = 1212

 Score = 72.8 bits (177), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 57/170 (33%), Positives = 80/170 (47%), Gaps = 25/170 (14%)

Query: 1874 ILKAMDSRPDDKYVAYRKG----LGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGI 1929
            +LK M S+   K + +++      G+V  +      +DFV+E++GE+        +   I
Sbjct: 1062 LLKIMQSKSRKKRLRFQRSKIHEWGLVALE--SIDAEDFVIEYVGELI-----RRQVSDI 1114

Query: 1930 RSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTA 1989
            R  Q           + YL R   D       VVDA  +   A  I HSC PNC  KV  
Sbjct: 1115 REDQYEKSG----IGSSYLFRLDDD------YVVDATKRGGLARFINHSCDPNCYTKVIT 1164

Query: 1990 VDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGS 2039
            V+G  +I IY  R I+ GEE+T++Y    E K+      C CGSQ CRGS
Sbjct: 1165 VEGQKKIVIYAKRRIYAGEELTYNYKFPLEEKK----IPCHCGSQRCRGS 1210


>gi|330797279|ref|XP_003286689.1| hypothetical protein DICPUDRAFT_150684 [Dictyostelium purpureum]
 gi|325083363|gb|EGC36818.1| hypothetical protein DICPUDRAFT_150684 [Dictyostelium purpureum]
          Length = 1340

 Score = 72.8 bits (177), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 48/136 (35%), Positives = 65/136 (47%), Gaps = 25/136 (18%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---DLVVV 1963
            D V+E++GEV            IR      +  A E    Y+++  G +  +   D  ++
Sbjct: 1225 DMVIEYIGEV------------IR------QKVADEREKRYIKKGIGSSYLFRVDDDTII 1266

Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
            DA  K N A  I H C PNC AKV  ++   +I IY  R I+ GEEIT+DY    E    
Sbjct: 1267 DATLKGNLARFINHCCDPNCIAKVLTINNQKKIIIYAKRDINIGEEITYDYKFPIED--- 1323

Query: 2024 YEASVCLCGSQVCRGS 2039
             E   CLC S  CRG+
Sbjct: 1324 -EKIPCLCKSPKCRGT 1338


>gi|310792530|gb|EFQ28057.1| SET domain-containing protein [Glomerella graminicola M1.001]
          Length = 1262

 Score = 72.8 bits (177), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 70/146 (47%), Gaps = 23/146 (15%)

Query: 1900 EGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY- 1958
            E    +DD ++E++GE        + +  I  +++            YL+   G +  + 
Sbjct: 1137 EENINKDDMIIEYVGE--------QVRQSISEIREKR----------YLKSGMGSSYLFR 1178

Query: 1959 --DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNS 2016
              D  V+DA  K   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY  
Sbjct: 1179 IDDNTVIDATKKGGIARFINHSCMPNCTAKIIKVDGSKRIVIYALRDIGQHEELTYDYKF 1238

Query: 2017 VTESKEEYEASVCLCGSQVCRGSYLN 2042
              E     +   CLCG+  C+G +LN
Sbjct: 1239 EREIG-SLDRIPCLCGTAACKG-FLN 1262


>gi|147837037|emb|CAN63644.1| hypothetical protein VITISV_006299 [Vitis vinifera]
          Length = 258

 Score = 72.8 bits (177), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 36/67 (53%), Positives = 51/67 (76%), Gaps = 5/67 (7%)

Query: 2016 SVTESKEEYEAS----VCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACE 2071
            ++T+S+   +++    VC+    + R SYLNLTGEG+F+KVLKE HG+LDR+QLM EACE
Sbjct: 148  TITQSRRVRKSTKFLPVCVVVKFIER-SYLNLTGEGSFQKVLKECHGILDRYQLMFEACE 206

Query: 2072 LNSVSEE 2078
            LN +SE+
Sbjct: 207  LNMLSEK 213


>gi|342887802|gb|EGU87231.1| hypothetical protein FOXB_02213 [Fusarium oxysporum Fo5176]
          Length = 1258

 Score = 72.8 bits (177), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 45/143 (31%), Positives = 70/143 (48%), Gaps = 23/143 (16%)

Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
              +DD ++E++GE        + +  I  +++N           YL+   G +  +   D
Sbjct: 1136 IAKDDMIIEYVGE--------QVRQQIAEIRENR----------YLKSGIGSSYLFRIDD 1177

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
              V+DA  K   A  I HSC PNC AK+  V+G  +I IY ++ I   EE+T+DY    E
Sbjct: 1178 NTVIDATKKGGIARFINHSCEPNCTAKIIKVEGSKRIVIYALQDIAMSEELTYDYKFERE 1237

Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
                 +   CLCG+  C+G +LN
Sbjct: 1238 IG-SLDRIPCLCGTAACKG-FLN 1258


>gi|348520760|ref|XP_003447895.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
            [Oreochromis niloticus]
          Length = 1167

 Score = 72.8 bits (177), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 81/148 (54%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G++C ++   GE  FV E++GE+       ++++    ++  +E+   +FY + +++
Sbjct: 865  KGWGLICLRDIKKGE--FVNEYIGEL------IDEEECRARIKYAHENNITDFYMLTIDK 916

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE +   V+G  ++G++ V  I  G E+
Sbjct: 917  DR---------IIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPAGTEL 967

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 968  TFNYNLDCLGNEK---TVCRCGAPNCSG 992


>gi|301754075|ref|XP_002912890.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Ailuropoda
            melanoleuca]
          Length = 2549

 Score = 72.8 bits (177), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 57/199 (28%), Positives = 87/199 (43%), Gaps = 43/199 (21%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1544 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1592

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1593 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1646

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNL----------------------TGEG 2047
            +TFDY      K   EA  C CGS  CRG YL                        T +G
Sbjct: 1647 LTFDYQFQRYGK---EAQKCFCGSANCRG-YLGGENRVSIRAAGGKMKKERSRKKDTVDG 1702

Query: 2048 AFEKVLKELHGLLDRHQLM 2066
              E +++   GL D++Q++
Sbjct: 1703 ELEALMENGEGLSDKNQVL 1721


>gi|123703948|ref|NP_001038599.2| histone-lysine N-methyltransferase SETD1B-A [Danio rerio]
 gi|166977691|sp|Q1LY77.2|SE1BA_DANRE RecName: Full=Histone-lysine N-methyltransferase SETD1B-A; AltName:
            Full=SET domain-containing protein 1B-A
 gi|123293815|emb|CAK10781.2| novel protein [Danio rerio]
          Length = 1844

 Score = 72.4 bits (176), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1768 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSRQPINVNEEITYDYKFPIED 1827

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
                E   CLCG++ CRG+
Sbjct: 1828 ----EKIPCLCGAENCRGT 1842


>gi|281343603|gb|EFB19187.1| hypothetical protein PANDA_000629 [Ailuropoda melanoleuca]
          Length = 2535

 Score = 72.4 bits (176), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 57/199 (28%), Positives = 87/199 (43%), Gaps = 43/199 (21%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1530 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1578

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1579 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1632

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNL----------------------TGEG 2047
            +TFDY      K   EA  C CGS  CRG YL                        T +G
Sbjct: 1633 LTFDYQFQRYGK---EAQKCFCGSANCRG-YLGGENRVSIRAAGGKMKKERSRKKDTVDG 1688

Query: 2048 AFEKVLKELHGLLDRHQLM 2066
              E +++   GL D++Q++
Sbjct: 1689 ELEALMENGEGLSDKNQVL 1707


>gi|402593200|gb|EJW87127.1| SET domain-containing protein, partial [Wuchereria bancrofti]
          Length = 602

 Score = 72.4 bits (176), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 37/79 (46%), Positives = 48/79 (60%), Gaps = 6/79 (7%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYN-SVTES 2020
            V+DA    N A  I HSC+PNC AK+  VDG  +I IY+   I+ G+EIT+DY   + E 
Sbjct: 527  VIDATQMGNLARFINHSCQPNCYAKIVVVDGEKRIVIYSKLAINKGDEITYDYKFPIEED 586

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
            K +     CLCG+  CRGS
Sbjct: 587  KID-----CLCGAPGCRGS 600


>gi|68473736|ref|XP_718971.1| potential COMPASS histone methyltransferase subunit Set1p [Candida
            albicans SC5314]
 gi|68473945|ref|XP_718869.1| potential COMPASS histone methyltransferase subunit Set1p [Candida
            albicans SC5314]
 gi|74586641|sp|Q5ABG1.1|SET1_CANAL RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|46440662|gb|EAK99965.1| potential COMPASS histone methyltransferase subunit Set1p [Candida
            albicans SC5314]
 gi|46440768|gb|EAL00070.1| potential COMPASS histone methyltransferase subunit Set1p [Candida
            albicans SC5314]
          Length = 1040

 Score = 72.4 bits (176), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 37/84 (44%), Positives = 49/84 (58%), Gaps = 2/84 (2%)

Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            D  V+DA  K   A  I H C P+C AK+  V+G  +I IY +R I   EE+T+DY    
Sbjct: 959  DNTVIDATKKGGIARFINHCCSPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFER 1018

Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
            E+ +E E   CLCG+  C+G YLN
Sbjct: 1019 ETNDE-ERIRCLCGAPGCKG-YLN 1040


>gi|432887915|ref|XP_004074975.1| PREDICTED: uncharacterized protein LOC101162384 [Oryzias latipes]
          Length = 1787

 Score = 72.4 bits (176), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1711 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSRQPINVNEEITYDYKFPIED 1770

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
                E   CLCG++ CRG+
Sbjct: 1771 ----EKIPCLCGAENCRGT 1785


>gi|238879404|gb|EEQ43042.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 1040

 Score = 72.4 bits (176), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 37/84 (44%), Positives = 49/84 (58%), Gaps = 2/84 (2%)

Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            D  V+DA  K   A  I H C P+C AK+  V+G  +I IY +R I   EE+T+DY    
Sbjct: 959  DNTVIDATKKGGIARFINHCCSPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFER 1018

Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
            E+ +E E   CLCG+  C+G YLN
Sbjct: 1019 ETNDE-ERIRCLCGAPGCKG-YLN 1040


>gi|334333796|ref|XP_001375978.2| PREDICTED: histone-lysine N-methyltransferase SETD2 [Monodelphis
            domestica]
          Length = 2592

 Score = 72.4 bits (176), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 59/201 (29%), Positives = 87/201 (43%), Gaps = 47/201 (23%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1582 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1630

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1631 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1684

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGE----------------------- 2046
            +TFDY      K   EA  C CGS  CRG YL   GE                       
Sbjct: 1685 LTFDYQFQRYGK---EAQKCFCGSANCRG-YLG--GENRVSIRAAGGKMKKERSRKKDSV 1738

Query: 2047 -GAFEKVLKELHGLLDRHQLM 2066
             G  E +L+   GL D++Q++
Sbjct: 1739 DGELEALLENGEGLSDKNQVL 1759


>gi|449492020|ref|XP_004174653.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            SETD2 [Taeniopygia guttata]
          Length = 2489

 Score = 72.4 bits (176), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 60/201 (29%), Positives = 89/201 (44%), Gaps = 47/201 (23%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN       +Y + L 
Sbjct: 1543 KKGWGLRAAKD--LPSNTFVLEYCGEVL-DHKEFKARVKEYARNKNIH-----YYFMAL- 1593

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1594 --KNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1645

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGE----------------------- 2046
            +TFDY      K   EA  C CGS  CRG YL   GE                       
Sbjct: 1646 LTFDYQFQRYGK---EAQKCFCGSSNCRG-YLG--GENRVSIRAAGGKMKKERSRKKDSV 1699

Query: 2047 -GAFEKVLKELHGLLDRHQLM 2066
             G  E +L+   GL D++Q++
Sbjct: 1700 DGELEALLENGEGLSDKNQVL 1720


>gi|356558250|ref|XP_003547420.1| PREDICTED: uncharacterized protein LOC100806034 [Glycine max]
          Length = 1300

 Score = 72.4 bits (176), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 53/137 (38%), Positives = 68/137 (49%), Gaps = 25/137 (18%)

Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVV 1962
            +DFV+E++GE+            IR    +  +   E   I   YL R     DGY   V
Sbjct: 1184 EDFVIEYIGEL------------IRPRISDIRERQYEKMGIGSSYLFRLD---DGY---V 1225

Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
            VDA  +   A  I HSC PNC  KV +V+G  +I IY  R I  GEEIT++Y    E K+
Sbjct: 1226 VDATKRGGIARFINHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKK 1285

Query: 2023 EYEASVCLCGSQVCRGS 2039
                  C CGS+ CRGS
Sbjct: 1286 ----IPCNCGSRKCRGS 1298


>gi|340373417|ref|XP_003385238.1| PREDICTED: hypothetical protein LOC100636150 [Amphimedon
            queenslandica]
          Length = 1053

 Score = 72.4 bits (176), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 35/78 (44%), Positives = 43/78 (55%), Gaps = 4/78 (5%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            V+DA    N+A  I H C PNC AK+  V    +I IY+ R I  GEEIT+DY    E  
Sbjct: 978  VIDATKSGNFARFINHCCDPNCYAKIITVGNQKKIVIYSKRDIRAGEEITYDYKFPIED- 1036

Query: 2022 EEYEASVCLCGSQVCRGS 2039
               E   CLCG+  CRG+
Sbjct: 1037 ---EKIPCLCGAPQCRGT 1051


>gi|405951732|gb|EKC19620.1| Histone-lysine N-methyltransferase MLL4 [Crassostrea gigas]
          Length = 4493

 Score = 72.4 bits (176), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 55/160 (34%), Positives = 75/160 (46%), Gaps = 29/160 (18%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            Y ++  G G+ C +     E + V+E+ GEV            IR    +  +   E   
Sbjct: 4360 YRSHIHGRGLYCKR--NIDEGEMVIEYSGEV------------IRGSLTDKREKYYEGKG 4405

Query: 1946 I--YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
            I  Y+ R     D YD  V+DA    N A  I HSC PNC +KV  VDG   I I+ ++ 
Sbjct: 4406 IGCYMFR----IDDYD--VIDATLHGNAARFINHSCEPNCYSKVINVDGKKHIVIFAMKS 4459

Query: 2004 IHYGEEITFDYNSVTESKEEYEASV-CLCGSQVCRGSYLN 2042
            I  GEE+T+DY    E     E  + C CG++ CR  YLN
Sbjct: 4460 IKRGEELTYDYKFPIE-----EVKIPCTCGAKKCR-RYLN 4493


>gi|432880997|ref|XP_004073754.1| PREDICTED: uncharacterized protein LOC101157226 [Oryzias latipes]
          Length = 2812

 Score = 72.4 bits (176), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 55/162 (33%), Positives = 79/162 (48%), Gaps = 31/162 (19%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            Y +   G G+ C +    GE   V+E+ G V            IRS+  +  +   +FY+
Sbjct: 2677 YRSLIHGRGLFCKRNIEAGE--MVIEYAGTV------------IRSVLTDKRE---KFYD 2719

Query: 1946 -----IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
                  Y+ R     D +D  VVDA  + N A  I HSC PNC ++V  VDG   I I+ 
Sbjct: 2720 GKGIGCYMFR----IDDFD--VVDATMQGNAARFINHSCEPNCYSRVINVDGRKHIVIFA 2773

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +R I+ GEE+T+DY    E  +E     C CG++ CR  +LN
Sbjct: 2774 LRKIYRGEELTYDYKFPIE--DEDNKLHCNCGTRRCR-RFLN 2812


>gi|426370676|ref|XP_004052287.1| PREDICTED: histone-lysine N-methyltransferase MLL [Gorilla gorilla
            gorilla]
          Length = 3837

 Score = 72.4 bits (176), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 52/156 (33%), Positives = 75/156 (48%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3708 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3750

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R     D  D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3751 YMFR----ID--DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3804

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3805 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3837


>gi|443722431|gb|ELU11300.1| hypothetical protein CAPTEDRAFT_160470, partial [Capitella teleta]
          Length = 282

 Score = 72.4 bits (176), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 68/135 (50%), Gaps = 26/135 (19%)

Query: 1908 FVVEFLGEV--YPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDA 1965
            FV+E++GEV  YP ++   KQ          ED     Y + L    GD       ++DA
Sbjct: 79   FVMEYVGEVLDYPNFRLRCKQYA--------EDNHTHHYFMAL---NGDE------IIDA 121

Query: 1966 MHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE 2025
              K N +  I HSC PNCE +   V+G  ++G +T+R I  G E+TFDY       E+Y 
Sbjct: 122  TQKGNTSRFINHSCDPNCETQKWTVNGQLRVGFFTLRSIPAGTELTFDYQF-----EQYG 176

Query: 2026 ASV--CLCGSQVCRG 2038
            + +  C CG+  CRG
Sbjct: 177  SEIQRCFCGADSCRG 191


>gi|327289513|ref|XP_003229469.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Anolis
            carolinensis]
          Length = 2579

 Score = 72.4 bits (176), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 59/201 (29%), Positives = 87/201 (43%), Gaps = 47/201 (23%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1577 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKTRVKEYARSKN--------IHYYFM 1625

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1626 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKMVPSGSE 1679

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGE----------------------- 2046
            +TFDY      K   EA  C CGS  CRG YL   GE                       
Sbjct: 1680 LTFDYQFQRYGK---EAQKCFCGSTNCRG-YLG--GENRVSIRAAGGKMKKERSRKKDSV 1733

Query: 2047 -GAFEKVLKELHGLLDRHQLM 2066
             G  E +L+   GL D++Q++
Sbjct: 1734 DGELEALLENGEGLSDKNQVL 1754


>gi|198467361|ref|XP_001354372.2| GA14357 [Drosophila pseudoobscura pseudoobscura]
 gi|198149208|gb|EAL31425.2| GA14357 [Drosophila pseudoobscura pseudoobscura]
          Length = 2918

 Score = 72.4 bits (176), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 47/153 (30%), Positives = 77/153 (50%), Gaps = 20/153 (13%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            +   +KG G+    +   GE  F++E++GEV    + FE++    S  +N       +Y 
Sbjct: 1916 FRTEKKGCGITAELQIPAGE--FIMEYVGEVI-DSEEFERRQHRYSKDRNRH-----YYF 1967

Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
            + L   +G+A      ++DA  + N +  I HSC PN E +   V+G  +IG ++++ I 
Sbjct: 1968 MAL---RGEA------IIDATMRGNISRYINHSCDPNAETQKWTVNGELRIGFFSLKNIL 2018

Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
             GEEITFDY      +   +A  C C +  CRG
Sbjct: 2019 PGEEITFDYQYQRYGR---DAQRCYCEAANCRG 2048


>gi|312091131|ref|XP_003146871.1| histone methyltransferase [Loa loa]
 gi|307757965|gb|EFO17199.1| histone methyltransferase [Loa loa]
          Length = 278

 Score = 72.0 bits (175), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 37/79 (46%), Positives = 48/79 (60%), Gaps = 6/79 (7%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYN-SVTES 2020
            V+DA    N A  I HSC+PNC AK+  VDG  +I IY+   I+ G+EIT+DY   + E 
Sbjct: 203  VIDATQMGNLARFINHSCQPNCYAKIVVVDGEKRIVIYSKLAINKGDEITYDYKFPIEED 262

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
            K +     CLCG+  CRGS
Sbjct: 263  KID-----CLCGAPGCRGS 276


>gi|384499027|gb|EIE89518.1| hypothetical protein RO3G_14229 [Rhizopus delemar RA 99-880]
          Length = 1674

 Score = 72.0 bits (175), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 36/81 (44%), Positives = 46/81 (56%), Gaps = 4/81 (4%)

Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            D  V+DA  + + A  I H C PNC AK+  VD   +I IY  R I  GEEIT+DY    
Sbjct: 1596 DDTVIDATKRGSIARFINHCCSPNCSAKIITVDKQKKIVIYANRDIEPGEEITYDYKFPI 1655

Query: 2019 ESKEEYEASVCLCGSQVCRGS 2039
            E+    E   CLCGS+ C+G+
Sbjct: 1656 EA----EKIPCLCGSKFCKGT 1672


>gi|444725290|gb|ELW65863.1| Histone-lysine N-methyltransferase MLL [Tupaia chinensis]
          Length = 3806

 Score = 72.0 bits (175), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3677 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3719

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3720 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3773

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3774 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3806


>gi|351705860|gb|EHB08779.1| Histone-lysine N-methyltransferase HRX [Heterocephalus glaber]
          Length = 3899

 Score = 72.0 bits (175), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3770 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3812

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3813 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3866

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3867 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3899


>gi|348534024|ref|XP_003454503.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Oreochromis
            niloticus]
          Length = 2253

 Score = 72.0 bits (175), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 46/148 (31%), Positives = 71/148 (47%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+   K+     + FV+E+ GEV    K F+ +    +  KN       +Y + L+ 
Sbjct: 1065 KGWGLRAAKD--LAPNTFVLEYCGEVL-DHKEFKTRVKEYARNKNIH-----YYFMSLKN 1116

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E+
Sbjct: 1117 NE---------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKAVTAGTEL 1167

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TFDY      K   EA  C CG+  CRG
Sbjct: 1168 TFDYQFQRYGK---EAQKCFCGAPSCRG 1192


>gi|348532887|ref|XP_003453937.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like
            [Oreochromis niloticus]
          Length = 1846

 Score = 72.0 bits (175), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1770 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSRQPINVNEEITYDYKFPIED 1829

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
                E   CLCG++ CRG+
Sbjct: 1830 ----EKIPCLCGAENCRGT 1844


>gi|367024877|ref|XP_003661723.1| SET1-like protein [Myceliophthora thermophila ATCC 42464]
 gi|347008991|gb|AEO56478.1| SET1-like protein [Myceliophthora thermophila ATCC 42464]
          Length = 1260

 Score = 72.0 bits (175), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 47/141 (33%), Positives = 70/141 (49%), Gaps = 23/141 (16%)

Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---DLV 1961
            +DD ++E++GE        E +  I  L+++           YL+   G +  +   D  
Sbjct: 1140 KDDMIIEYVGE--------EVRQQIAELREHR----------YLKSGIGSSYLFRIDDNT 1181

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            V+DA  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    E  
Sbjct: 1182 VIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERELG 1241

Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
               +   CLCG+  C+G +LN
Sbjct: 1242 -STDRIPCLCGTAACKG-FLN 1260


>gi|380812066|gb|AFE77908.1| histone-lysine N-methyltransferase SETD2 [Macaca mulatta]
          Length = 2565

 Score = 72.0 bits (175), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 61/207 (29%), Positives = 89/207 (42%), Gaps = 48/207 (23%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1560 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1608

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1609 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1662

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGE----------------------- 2046
            +TFDY      K   EA  C CGS  CRG YL   GE                       
Sbjct: 1663 LTFDYQFQRYGK---EAQKCFCGSANCRG-YLG--GENRVSIRAAGGKMKKERSRKKDSV 1716

Query: 2047 -GAFEKVLKELHGLLDRHQLMLEACEL 2072
             G  E +++   GL D++Q+ L  C L
Sbjct: 1717 DGELEALMENGEGLSDKNQV-LSLCRL 1742


>gi|432909264|ref|XP_004078147.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Oryzias
            latipes]
          Length = 1665

 Score = 72.0 bits (175), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 47/148 (31%), Positives = 71/148 (47%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+   KE     + FV+E+ GEV    K F+ +    +  KN       +Y + L+ 
Sbjct: 641  KGWGLRAAKE--MAPNTFVLEYCGEVLD-HKEFKTRVKEYARNKNIH-----YYFMSLKN 692

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E+
Sbjct: 693  NE---------IIDATLKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKAVAAGTEL 743

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TFDY      K   EA  C CG+  CRG
Sbjct: 744  TFDYQFQRYGK---EAQKCFCGAPSCRG 768


>gi|109040979|ref|XP_001113652.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like isoform 2
            [Macaca mulatta]
          Length = 2550

 Score = 72.0 bits (175), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 61/207 (29%), Positives = 89/207 (42%), Gaps = 48/207 (23%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1545 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1593

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1594 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1647

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGE----------------------- 2046
            +TFDY      K   EA  C CGS  CRG YL   GE                       
Sbjct: 1648 LTFDYQFQRYGK---EAQKCFCGSANCRG-YLG--GENRVSIRAAGGKMKKERSRKKDSV 1701

Query: 2047 -GAFEKVLKELHGLLDRHQLMLEACEL 2072
             G  E +++   GL D++Q+ L  C L
Sbjct: 1702 DGELEALMENGEGLSDKNQV-LSLCRL 1727


>gi|395520196|ref|XP_003764223.1| PREDICTED: histone-lysine N-methyltransferase MLL [Sarcophilus
            harrisii]
          Length = 3995

 Score = 72.0 bits (175), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 52/153 (33%), Positives = 72/153 (47%), Gaps = 25/153 (16%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI--YLE 1949
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   E   I  Y+ 
Sbjct: 3866 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKREKYYESKGIGCYMF 3911

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
            R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ GEE
Sbjct: 3912 RID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEE 3965

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +T+DY    E  +      C CG++ CR  +LN
Sbjct: 3966 LTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3995


>gi|388581385|gb|EIM21694.1| SET domain-containing protein [Wallemia sebi CBS 633.66]
          Length = 681

 Score = 72.0 bits (175), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 61/243 (25%), Positives = 106/243 (43%), Gaps = 34/243 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+  N +     D F++E++GEV    ++      +R +   +++    FY + L+
Sbjct: 95   KKGYGLRANVD--LDRDTFLIEYIGEVVTQTQF------LRRMNTYSKEGIKHFYFMMLQ 146

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              +          +DA  + N      HSC PNC      V  + ++GI+T R I  GEE
Sbjct: 147  NEE---------FIDATRRGNIGRFANHSCAPNCFVSKWVVGKYVKMGIFTKRKIEKGEE 197

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSY--LNLTGEGAFEKVLKELHGL----LDRH 2063
            +TF+YN     +  ++A  C CG   C G       T  G  +  + +  G+    + +H
Sbjct: 198  LTFNYNV---DRYGHDAQPCYCGEPNCVGFIGGKTQTDIGGMDDQILDALGITPEEIFQH 254

Query: 2064 QLMLEACELNSVSEEDYLELGRAGLGSCLLGGLPNWVVAY----SARLVRFINLERTKLP 2119
            QL     + +   +EDY       L   +L  +P  + A     + R +    L+R ++ 
Sbjct: 255  QLKGSRKKKSKKLDEDY----ELTLKPMVLTDVPKVITAVRQSSTNRKILIKLLQRMRMT 310

Query: 2120 EEI 2122
            EEI
Sbjct: 311  EEI 313


>gi|402860278|ref|XP_003894560.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Papio anubis]
          Length = 2521

 Score = 72.0 bits (175), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 61/207 (29%), Positives = 89/207 (42%), Gaps = 48/207 (23%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1516 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1564

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1565 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1618

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGE----------------------- 2046
            +TFDY      K   EA  C CGS  CRG YL   GE                       
Sbjct: 1619 LTFDYQFQRYGK---EAQKCFCGSANCRG-YLG--GENRVSIRAAGGKMKKERSRKKDSV 1672

Query: 2047 -GAFEKVLKELHGLLDRHQLMLEACEL 2072
             G  E +++   GL D++Q+ L  C L
Sbjct: 1673 DGELEALMENGEGLSDKNQV-LSLCRL 1698


>gi|354496911|ref|XP_003510567.1| PREDICTED: histone-lysine N-methyltransferase MLL [Cricetulus
            griseus]
          Length = 3907

 Score = 72.0 bits (175), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 52/156 (33%), Positives = 75/156 (48%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3778 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3820

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R     D  D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3821 YMFR----ID--DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3874

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3875 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3907


>gi|344304500|gb|EGW34732.1| hypothetical protein SPAPADRAFT_133304 [Spathaspora passalidarum NRRL
            Y-27907]
          Length = 1060

 Score = 72.0 bits (175), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 48/82 (58%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C P+C AK+  V+G  +I IY +R I   EE+T+DY    E+
Sbjct: 981  TVIDATKKGGIARFINHCCSPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERET 1040

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +E E   CLCG+  C+G YLN
Sbjct: 1041 NDE-ERIRCLCGAPGCKG-YLN 1060


>gi|334330381|ref|XP_001380704.2| PREDICTED: histone-lysine N-methyltransferase MLL [Monodelphis
            domestica]
          Length = 3960

 Score = 72.0 bits (175), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 52/153 (33%), Positives = 72/153 (47%), Gaps = 25/153 (16%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI--YLE 1949
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   E   I  Y+ 
Sbjct: 3831 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKREKYYESKGIGCYMF 3876

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
            R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ GEE
Sbjct: 3877 RID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEE 3930

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +T+DY    E  +      C CG++ CR  +LN
Sbjct: 3931 LTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3960


>gi|355752689|gb|EHH56809.1| hypothetical protein EGM_06289 [Macaca fascicularis]
          Length = 3844

 Score = 72.0 bits (175), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3715 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3757

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3758 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3811

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3812 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3844


>gi|197927225|ref|NP_001074809.2| SET domain containing 2 [Mus musculus]
          Length = 2537

 Score = 72.0 bits (175), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1533 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1581

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1582 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1635

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1636 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1663


>gi|395516140|ref|XP_003762252.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Sarcophilus
            harrisii]
          Length = 2570

 Score = 72.0 bits (175), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 59/201 (29%), Positives = 87/201 (43%), Gaps = 47/201 (23%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1561 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1609

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1610 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1663

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGE----------------------- 2046
            +TFDY      K   EA  C CGS  CRG YL   GE                       
Sbjct: 1664 LTFDYQFQRYGK---EAQKCFCGSANCRG-YLG--GENRVSIRAAGGKMKKERSRKKDSV 1717

Query: 2047 -GAFEKVLKELHGLLDRHQLM 2066
             G  E +L+   GL D++Q++
Sbjct: 1718 DGELEALLENGEGLSDKNQVL 1738


>gi|392350034|ref|XP_003750554.1| PREDICTED: histone-lysine N-methyltransferase MLL, partial [Rattus
            norvegicus]
          Length = 3894

 Score = 72.0 bits (175), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 52/156 (33%), Positives = 75/156 (48%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3765 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3807

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R     D  D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3808 YMFR----ID--DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3861

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3862 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3894


>gi|297482744|ref|XP_002693122.1| PREDICTED: histone-lysine N-methyltransferase MLL, partial [Bos
            taurus]
 gi|296480196|tpg|DAA22311.1| TPA: myeloid/lymphoid or mixed-lineage leukemia-like [Bos taurus]
          Length = 3821

 Score = 72.0 bits (175), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3692 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3734

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3735 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3788

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3789 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3821


>gi|390461098|ref|XP_003732596.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 2
            [Callithrix jacchus]
          Length = 1400

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 45/148 (30%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GEV       ++++ +  ++  +E+    FY + +++
Sbjct: 1108 KGWGLVAKRDIRKGE--FVNEYVGEV------IDEEECMARIKHAHENDITHFYMLTIDK 1159

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1160 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1210

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1211 TFNYNLDCLGNEK---TVCRCGASNCSG 1235


>gi|114640631|ref|XP_508792.2| PREDICTED: histone-lysine N-methyltransferase MLL [Pan troglodytes]
          Length = 3969

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3840 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3882

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3883 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3936

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3937 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3969


>gi|403287002|ref|XP_003934751.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Saimiri
            boliviensis boliviensis]
          Length = 1368

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 45/148 (30%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GEV       ++++ +  ++  +E+    FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEV------IDEEECMARIKHAHENDITHFYMLTIDK 1124

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200


>gi|119587788|gb|EAW67384.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
            Drosophila), isoform CRA_e [Homo sapiens]
          Length = 3972

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3843 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3885

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3886 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3939

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3940 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3972


>gi|119587784|gb|EAW67380.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
            Drosophila), isoform CRA_a [Homo sapiens]
          Length = 3969

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3840 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3882

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3883 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3936

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3937 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3969


>gi|449666506|ref|XP_002161122.2| PREDICTED: uncharacterized protein LOC100198749 [Hydra
            magnipapillata]
          Length = 1403

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 36/78 (46%), Positives = 44/78 (56%), Gaps = 4/78 (5%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            V+DA      A  I H C PNC AKV  V+G  +I IY+ R I  GEEIT+DY    E  
Sbjct: 1328 VIDATKDGCNARFINHCCDPNCYAKVILVEGAKKIVIYSRRAIKLGEEITYDYKFPIED- 1386

Query: 2022 EEYEASVCLCGSQVCRGS 2039
               E   CLCG+ +CRG+
Sbjct: 1387 ---EKIPCLCGAALCRGT 1401


>gi|308199413|ref|NP_001184033.1| histone-lysine N-methyltransferase MLL isoform 1 precursor [Homo
            sapiens]
          Length = 3972

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3843 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3885

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3886 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3939

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3940 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3972


>gi|119587787|gb|EAW67383.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
            Drosophila), isoform CRA_d [Homo sapiens]
          Length = 4002

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3873 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3915

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3916 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3969

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3970 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 4002


>gi|56550039|ref|NP_005924.2| histone-lysine N-methyltransferase MLL isoform 2 precursor [Homo
            sapiens]
 gi|146345435|sp|Q03164.5|MLL1_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL; AltName:
            Full=ALL-1; AltName: Full=CXXC-type zinc finger protein
            7; AltName: Full=Lysine N-methyltransferase 2A;
            Short=KMT2A; AltName: Full=Trithorax-like protein;
            AltName: Full=Zinc finger protein HRX; Contains: RecName:
            Full=MLL cleavage product N320; AltName: Full=N-terminal
            cleavage product of 320 kDa; Short=p320; Contains:
            RecName: Full=MLL cleavage product C180; AltName:
            Full=C-terminal cleavage product of 180 kDa; Short=p180
 gi|34305635|gb|AAQ63624.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
            Drosophila) [Homo sapiens]
          Length = 3969

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3840 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3882

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3883 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3936

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3937 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3969


>gi|392341954|ref|XP_003754471.1| PREDICTED: histone-lysine N-methyltransferase MLL [Rattus norvegicus]
          Length = 3987

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 52/156 (33%), Positives = 75/156 (48%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3858 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3900

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R     D  D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3901 YMFR----ID--DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3954

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3955 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3987


>gi|395743560|ref|XP_002822597.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL [Pongo abelii]
          Length = 4012

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3883 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3925

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3926 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3979

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3980 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 4012


>gi|355567103|gb|EHH23482.1| hypothetical protein EGK_06957, partial [Macaca mulatta]
          Length = 3824

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3695 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3737

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3738 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3791

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3792 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3824


>gi|331214149|ref|XP_003319756.1| Setd1a protein [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
          Length = 1014

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 35/81 (43%), Positives = 47/81 (58%), Gaps = 4/81 (4%)

Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            D +VVDA  K N    I H C PNC AK+  ++G  +I IY    I  G+E+T+DY+   
Sbjct: 936  DDLVVDATKKGNLGRLINHCCSPNCTAKIITINGEKKIVIYAKVTIELGDEVTYDYHF-- 993

Query: 2019 ESKEEYEASVCLCGSQVCRGS 2039
              KEE +   CLCGS  C+G+
Sbjct: 994  -PKEEVKIP-CLCGSVKCKGT 1012


>gi|356532622|ref|XP_003534870.1| PREDICTED: uncharacterized protein LOC100805708 [Glycine max]
          Length = 1213

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 52/137 (37%), Positives = 68/137 (49%), Gaps = 25/137 (18%)

Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVV 1962
            +DFV+E++GE+            IR    +  +   E   I   YL R     DGY   V
Sbjct: 1097 EDFVIEYIGEL------------IRPRISDIRERQYEKMGIGSSYLFRLD---DGY---V 1138

Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
            VDA  +   A  + HSC PNC  KV +V+G  +I IY  R I  GEEIT++Y    E K+
Sbjct: 1139 VDATKRGGIARFVNHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKK 1198

Query: 2023 EYEASVCLCGSQVCRGS 2039
                  C CGS+ CRGS
Sbjct: 1199 ----IPCNCGSRKCRGS 1211


>gi|195171947|ref|XP_002026763.1| GL27000 [Drosophila persimilis]
 gi|194111702|gb|EDW33745.1| GL27000 [Drosophila persimilis]
          Length = 944

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 45/149 (30%), Positives = 76/149 (51%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+    +   GE  F++E++GEV       + ++  R   + ++D    +Y + L 
Sbjct: 62   KKGCGITAELQIPAGE--FIMEYVGEV------IDSEEFERRQHRYSKDRNRHYYFMAL- 112

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              +G+A      ++DA  + N +  I HSC PN E +   V+G  +IG ++++ I  GEE
Sbjct: 113  --RGEA------IIDATMRGNISRYINHSCDPNAETQKWTVNGELRIGFFSLKNILPGEE 164

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            ITFDY      +   +A  C C +  CRG
Sbjct: 165  ITFDYQYQRYGR---DAQRCYCEAANCRG 190


>gi|332208875|ref|XP_003253537.1| PREDICTED: histone-lysine N-methyltransferase MLL [Nomascus
            leucogenys]
          Length = 3968

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3839 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3881

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3882 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3935

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3936 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3968


>gi|297458806|ref|XP_585092.4| PREDICTED: histone-lysine N-methyltransferase MLL [Bos taurus]
          Length = 3826

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3697 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3739

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3740 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3793

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3794 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3826


>gi|440904942|gb|ELR55394.1| Histone-lysine N-methyltransferase MLL, partial [Bos grunniens mutus]
          Length = 3846

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3717 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3759

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3760 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3813

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3814 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3846


>gi|1490271|emb|CAA93625.1| ALL-1 protein [Homo sapiens]
          Length = 4005

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3876 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3918

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3919 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3972

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3973 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 4005


>gi|402895434|ref|XP_003910832.1| PREDICTED: histone-lysine N-methyltransferase MLL [Papio anubis]
          Length = 3968

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3839 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3881

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3882 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3935

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3936 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3968


>gi|344293012|ref|XP_003418218.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL-like [Loxodonta africana]
          Length = 3962

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3833 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3875

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3876 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3929

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3930 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3962


>gi|326921432|ref|XP_003206963.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            SETD2-like [Meleagris gallopavo]
          Length = 2147

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 59/204 (28%), Positives = 88/204 (43%), Gaps = 47/204 (23%)

Query: 1887 VAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI 1946
            +  +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + 
Sbjct: 1361 LTEKKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHY 1409

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y    K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  
Sbjct: 1410 YFMALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPS 1463

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGE-------------------- 2046
            G E+TFDY      K   EA  C CGS  CRG YL   GE                    
Sbjct: 1464 GSELTFDYQFQRYGK---EAQKCFCGSANCRG-YLG--GENRVSIRAAGGKMKKERSRKK 1517

Query: 2047 ----GAFEKVLKELHGLLDRHQLM 2066
                G  E +L+   GL D++Q++
Sbjct: 1518 DSVDGELEALLENGEGLSDKNQVL 1541


>gi|390469747|ref|XP_002754504.2| PREDICTED: histone-lysine N-methyltransferase MLL [Callithrix
            jacchus]
          Length = 3994

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3865 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3907

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3908 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3961

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3962 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3994


>gi|355559685|gb|EHH16413.1| hypothetical protein EGK_11693 [Macaca mulatta]
          Length = 2343

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 61/207 (29%), Positives = 89/207 (42%), Gaps = 48/207 (23%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1338 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1386

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1387 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1440

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGE----------------------- 2046
            +TFDY      K   EA  C CGS  CRG YL   GE                       
Sbjct: 1441 LTFDYQFQRYGK---EAQKCFCGSANCRG-YLG--GENRVSIRAAGGKMKKERSRKKDSV 1494

Query: 2047 -GAFEKVLKELHGLLDRHQLMLEACEL 2072
             G  E +++   GL D++Q+ L  C L
Sbjct: 1495 DGELEALMENGEGLSDKNQV-LSLCRL 1520


>gi|328701191|ref|XP_003241521.1| PREDICTED: hypothetical protein LOC100573227 [Acyrthosiphon pisum]
          Length = 1315

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 44/79 (55%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N A  I HSC PNC AK+  +DG  +I IY+ + I   EEIT+DY    E 
Sbjct: 1239 TIIDATKCGNLARFINHSCNPNCYAKIIQIDGQKKIVIYSKQPIGVNEEITYDYKFPLED 1298

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCG+  CRG+
Sbjct: 1299 NK----IPCLCGTHCCRGT 1313


>gi|432873648|ref|XP_004072321.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Oryzias
            latipes]
          Length = 1597

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 46/148 (31%), Positives = 74/148 (50%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G+  N+     + DFV E++GEV       + ++  + +++ +E+    FY + L +
Sbjct: 1308 RGWGLQTNQ--ALRKGDFVAEYVGEV------IDSEECQQRIKRAHENHVTNFYMLTLTK 1359

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         V+DA  K N A  I HSC PNCE +   V+G  +IGI+ +  I  G E+
Sbjct: 1360 DR---------VIDAGPKGNSARFINHSCNPNCETQKWTVNGDVRIGIFALCDIEAGTEL 1410

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN           + C CGS+ C G
Sbjct: 1411 TFNYNLHCVGNRR---TSCHCGSENCSG 1435


>gi|195130337|ref|XP_002009608.1| GI15146 [Drosophila mojavensis]
 gi|193908058|gb|EDW06925.1| GI15146 [Drosophila mojavensis]
          Length = 1885

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 47/153 (30%), Positives = 76/153 (49%), Gaps = 20/153 (13%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            +   +KG G+        GE  F++E++GEV    ++  +Q     ++  +      +Y 
Sbjct: 987  FRTKKKGCGITAEMLIPPGE--FIMEYVGEVIDSEEFERRQHHYSQIRNRH------YYF 1038

Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
            + L   +G+A      ++DA  K N +  I HSC PN E +   V+G  +IG ++V+ I 
Sbjct: 1039 MAL---RGEA------IIDATVKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKTIL 1089

Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
             GEEITFDY      +   +A  C C S+ CRG
Sbjct: 1090 PGEEITFDYQYQRYGR---DAQRCYCESENCRG 1119


>gi|426244626|ref|XP_004016122.1| PREDICTED: histone-lysine N-methyltransferase MLL [Ovis aries]
          Length = 3710

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3581 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3623

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3624 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3677

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3678 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3710


>gi|241948091|ref|XP_002416768.1| COMPASS complex histone methyltransferase subunit, putative;
            histone-lysine n-methyltransferase, h3 lysine-4 specific,
            putative [Candida dubliniensis CD36]
 gi|223640106|emb|CAX44352.1| COMPASS complex histone methyltransferase subunit, putative [Candida
            dubliniensis CD36]
          Length = 1032

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 48/82 (58%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C P+C AK+  V+G  +I IY +R I   EE+T+DY    E+
Sbjct: 953  TVIDATKKGGIARFINHCCSPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERET 1012

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +E E   CLCG+  C+G YLN
Sbjct: 1013 NDE-ERIRCLCGAPGCKG-YLN 1032


>gi|345799715|ref|XP_536554.3| PREDICTED: histone-lysine N-methyltransferase MLL [Canis lupus
            familiaris]
          Length = 3829

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3700 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3742

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3743 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3796

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3797 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3829


>gi|296197020|ref|XP_002746091.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
            [Callithrix jacchus]
          Length = 1365

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 45/148 (30%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GEV       ++++ +  ++  +E+    FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEV------IDEEECMARIKHAHENDITHFYMLTIDK 1124

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200


>gi|432892259|ref|XP_004075732.1| PREDICTED: histone-lysine N-methyltransferase MLL-like [Oryzias
            latipes]
          Length = 4536

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 58/173 (33%), Positives = 79/173 (45%), Gaps = 31/173 (17%)

Query: 1875 LKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQK 1934
            LKAM       Y +   G G+ C K    GE   V+E+ G V            IRS+  
Sbjct: 4390 LKAMSKETVGVYRSPIHGRGLFCKKTIEAGE--MVIEYSGNV------------IRSVLT 4435

Query: 1935 NNEDPAPEFYN-----IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTA 1989
            +  +   ++Y+      Y+ R     D Y+  VVDA    N A  I HSC PNC ++V  
Sbjct: 4436 DKRE---KYYDAKGIGCYMFR----IDDYE--VVDATVHGNAARFINHSCEPNCYSRVLT 4486

Query: 1990 VDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            VDG   I I+  R I  GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 4487 VDGQKHIVIFASRRICCGEELTYDYKFPIE--DASNKLPCNCGTKKCR-KFLN 4536


>gi|417414196|gb|JAA53397.1| Putative histone-lysine n-methyltransferase mll, partial [Desmodus
            rotundus]
          Length = 3966

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3837 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3879

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3880 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3933

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3934 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3966


>gi|157824020|ref|NP_001101659.1| histone-lysine N-methyltransferase SETD2 [Rattus norvegicus]
 gi|149018436|gb|EDL77077.1| kinesin family member 9 (predicted) [Rattus norvegicus]
          Length = 2294

 Score = 71.6 bits (174), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1290 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1338

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1339 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1392

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1393 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1420


>gi|403159096|ref|XP_003890756.1| histone-lysine N-methyltransferase SETD1 [Puccinia graminis f. sp.
            tritici CRL 75-36-700-3]
 gi|375166585|gb|EHS63201.1| histone-lysine N-methyltransferase SETD1 [Puccinia graminis f. sp.
            tritici CRL 75-36-700-3]
          Length = 1502

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 35/81 (43%), Positives = 47/81 (58%), Gaps = 4/81 (4%)

Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            D +VVDA  K N    I H C PNC AK+  ++G  +I IY    I  G+E+T+DY+   
Sbjct: 1424 DDLVVDATKKGNLGRLINHCCSPNCTAKIITINGEKKIVIYAKVTIELGDEVTYDYHF-- 1481

Query: 2019 ESKEEYEASVCLCGSQVCRGS 2039
              KEE +   CLCGS  C+G+
Sbjct: 1482 -PKEEVKIP-CLCGSVKCKGT 1500


>gi|354484245|ref|XP_003504300.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Cricetulus
            griseus]
 gi|344236054|gb|EGV92157.1| Histone-lysine N-methyltransferase SETD2 [Cricetulus griseus]
          Length = 2412

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1408 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1456

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1457 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1510

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1511 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1538


>gi|297269329|ref|XP_001093874.2| PREDICTED: histone-lysine N-methyltransferase MLL [Macaca mulatta]
          Length = 3986

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3857 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3899

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3900 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3953

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3954 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3986


>gi|73985747|ref|XP_864158.1| PREDICTED: histone-lysine N-methyltransferase SETD2 isoform 11 [Canis
            lupus familiaris]
          Length = 2562

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1557 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1605

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1606 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1659

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1660 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1687


>gi|627837|pir||A48205 All-1 protein +GTE form - mouse (fragment)
          Length = 3869

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3740 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3782

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3783 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3836

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3837 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3869


>gi|297817294|ref|XP_002876530.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322368|gb|EFH52789.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 354

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 48/146 (32%), Positives = 73/146 (50%), Gaps = 20/146 (13%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+V +++   GE  F++E++GEV    K  E++     L K N      FY   +   
Sbjct: 123  GYGIVADEDINSGE--FIIEYVGEVVIDEKICEER-----LWKLNHKVEKNFYLCQINWN 175

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
                     +V+DA HK N +  I HSC PN E +   +DG  +IGI+  R I+ GE++T
Sbjct: 176  ---------MVIDATHKGNKSRYINHSCNPNTEMQKWIIDGETRIGIFATRFINKGEQLT 226

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCR 2037
            +DY  V    ++     C CG+  CR
Sbjct: 227  YDYQFVQFGADQD----CYCGAVCCR 248


>gi|328711160|ref|XP_001945277.2| PREDICTED: histone-lysine N-methyltransferase SETD1B-like
            [Acyrthosiphon pisum]
          Length = 1322

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 44/79 (55%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N A  I HSC PNC AK+  +DG  +I IY+ + I   EEIT+DY    E 
Sbjct: 1246 TIIDATKCGNLARFINHSCNPNCYAKIIQIDGQKKIVIYSKQPIGVNEEITYDYKFPLED 1305

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCG+  CRG+
Sbjct: 1306 NK----IPCLCGTHCCRGT 1320


>gi|119585211|gb|EAW64807.1| SET domain containing 2, isoform CRA_c [Homo sapiens]
          Length = 1819

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 50/155 (32%), Positives = 73/155 (47%), Gaps = 21/155 (13%)

Query: 1887 VAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI 1946
            +  +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + 
Sbjct: 1334 LTEKKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHY 1382

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y    K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  
Sbjct: 1383 YFMALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPS 1436

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            G E+TFDY      K   EA  C CGS  CRG YL
Sbjct: 1437 GSELTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1467


>gi|301785015|ref|XP_002927929.1| PREDICTED: histone-lysine N-methyltransferase MLL-like [Ailuropoda
            melanoleuca]
          Length = 3981

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3852 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3894

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3895 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3948

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3949 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3981


>gi|124486682|ref|NP_001074518.1| histone-lysine N-methyltransferase MLL [Mus musculus]
          Length = 3963

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3834 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3876

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3877 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3930

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3931 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3963


>gi|688443|gb|AAA62593.1| All-1 protein, partial [Mus musculus]
          Length = 3866

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3737 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3779

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3780 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3833

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3834 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3866


>gi|426232375|ref|XP_004010202.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
            N-methyltransferase NSD2 [Ovis aries]
          Length = 1273

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 80/148 (54%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  +++ +E+    FY + +++
Sbjct: 1019 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKRAHENDITHFYMLTIDK 1070

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1071 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1121

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1122 TFNYNLDCLGNEK---TVCRCGASNCSG 1146


>gi|385305977|gb|EIF49918.1| putative compass histone methyltransferase subunit set1p [Dekkera
            bruxellensis AWRI1499]
          Length = 1104

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 38/84 (45%), Positives = 48/84 (57%), Gaps = 2/84 (2%)

Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            D  V+DA  K   A  I H C P+C AK+  VDG  +I IY +R I   EE+T+DY    
Sbjct: 1023 DNTVIDASKKGGIARFINHCCDPSCTAKIIKVDGKKRIVIYALRDIAANEELTYDYKFEK 1082

Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
            E+  E E   CLCG+  C+G YLN
Sbjct: 1083 ETNPE-ERIPCLCGAPNCKG-YLN 1104


>gi|363729887|ref|XP_418510.3| PREDICTED: histone-lysine N-methyltransferase SETD2 [Gallus gallus]
          Length = 2554

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 59/201 (29%), Positives = 87/201 (43%), Gaps = 47/201 (23%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1557 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1605

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1606 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1659

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGE----------------------- 2046
            +TFDY      K   EA  C CGS  CRG YL   GE                       
Sbjct: 1660 LTFDYQFQRYGK---EAQKCFCGSANCRG-YLG--GENRVSIRAAGGKMKKERSRKKDSV 1713

Query: 2047 -GAFEKVLKELHGLLDRHQLM 2066
             G  E +L+   GL D++Q++
Sbjct: 1714 DGELEALLENGEGLSDKNQVL 1734


>gi|341940997|sp|P55200.3|MLL1_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL; AltName:
            Full=ALL-1; AltName: Full=Zinc finger protein HRX;
            Contains: RecName: Full=MLL cleavage product N320;
            AltName: Full=N-terminal cleavage product of 320 kDa;
            Short=p320; Contains: RecName: Full=MLL cleavage product
            C180; AltName: Full=C-terminal cleavage product of 180
            kDa; Short=p180
          Length = 3966

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3837 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3879

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3880 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3933

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3934 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3966


>gi|76666643|ref|XP_613048.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
            [Bos taurus]
 gi|297476142|ref|XP_002688498.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Bos
            taurus]
 gi|296486298|tpg|DAA28411.1| TPA: Wolf-Hirschhorn syndrome candidate 1 [Bos taurus]
          Length = 1365

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 80/148 (54%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  +++ +E+    FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKRAHENDITHFYMLTIDK 1124

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200


>gi|395848655|ref|XP_003796965.1| PREDICTED: histone-lysine N-methyltransferase MLL [Otolemur
            garnettii]
          Length = 4062

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 52/156 (33%), Positives = 75/156 (48%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3933 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3975

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R     D  D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3976 YMFR----ID--DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 4029

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 4030 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 4062


>gi|340378403|ref|XP_003387717.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Amphimedon
            queenslandica]
          Length = 862

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 72/149 (48%), Gaps = 26/149 (17%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            GL   C+         FV+E+ GEV  + + FE++  I   +         +Y + L   
Sbjct: 136  GLKATCD----ISRYSFVMEYCGEVCSLEE-FERRRNIYEKESRRH-----YYFMSL--- 182

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            K D       ++DA  K N +  I HSC PNCE +   V+G  ++G + +R I  GEE+T
Sbjct: 183  KTDE------ILDATRKGNLSRFINHSCEPNCETQKWTVNGRLRVGFFALRHIPAGEELT 236

Query: 2012 FDYNSVTESKEEYEASV--CLCGSQVCRG 2038
            FDY       + +  SV  C CGS+ CRG
Sbjct: 237  FDYQF-----QRFGESVQKCYCGSETCRG 260


>gi|426340342|ref|XP_004034089.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Gorilla gorilla
            gorilla]
          Length = 2564

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1559 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1607

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1608 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1661

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1662 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1689


>gi|184394|gb|AAA58669.1| HRX [Homo sapiens]
          Length = 3969

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3840 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3882

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3883 YMFRID------DSEVVDATMHGNRARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3936

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3937 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3969


>gi|448520177|ref|XP_003868242.1| Set1 Lysine histone methyltransferase [Candida orthopsilosis Co
            90-125]
 gi|380352581|emb|CCG22808.1| Set1 Lysine histone methyltransferase [Candida orthopsilosis]
          Length = 1038

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 48/82 (58%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C P+C AK+  V+G  +I IY +R I   EE+T+DY    E+
Sbjct: 959  TVIDATKKGGIARFINHCCNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERET 1018

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +E E   CLCG+  C+G YLN
Sbjct: 1019 NDE-ERIRCLCGAPGCKG-YLN 1038


>gi|296474690|tpg|DAA16805.1| TPA: Wolf-Hirschhorn syndrome candidate 1 protein-like [Bos taurus]
          Length = 2547

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1542 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1590

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1591 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1644

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1645 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1672


>gi|149041498|gb|EDL95339.1| myeloid/lymphoid or mixed-lineage leukemia (mapped) [Rattus
            norvegicus]
          Length = 3725

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3596 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3638

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3639 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3692

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3693 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3725


>gi|157112020|ref|XP_001657377.1| huntingtin interacting protein [Aedes aegypti]
 gi|108878208|gb|EAT42433.1| AAEL006013-PA [Aedes aegypti]
          Length = 2367

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 48/153 (31%), Positives = 76/153 (49%), Gaps = 20/153 (13%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            +   +KG G+  + E   G  DF++E++GEV    + F+++  + S +KN       +Y 
Sbjct: 1277 FRTEKKGFGIQASTEIVPG--DFIMEYVGEVL-NSEQFDERAELYSKEKNQH-----YYF 1328

Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
            + L   + DA      ++DA  K N +  I HSC PN E +   V+G  +IG +  + I 
Sbjct: 1329 MAL---RSDA------IIDATTKGNISRFINHSCDPNAETQKWTVNGELRIGFFCTKYIM 1379

Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
             GEEITFDY      +    A  C C ++ C G
Sbjct: 1380 PGEEITFDYQFQRYGR---RAQKCYCEAENCTG 1409


>gi|432092361|gb|ELK24976.1| Histone-lysine N-methyltransferase SETD2 [Myotis davidii]
          Length = 2865

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1862 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1910

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1911 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1964

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1965 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1992


>gi|403263194|ref|XP_003923935.1| PREDICTED: histone-lysine N-methyltransferase MLL [Saimiri
            boliviensis boliviensis]
          Length = 3985

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3856 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3898

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3899 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3952

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3953 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3985


>gi|336259450|ref|XP_003344526.1| hypothetical protein SMAC_07534 [Sordaria macrospora k-hell]
 gi|380093240|emb|CCC08898.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 1314

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 47/143 (32%), Positives = 69/143 (48%), Gaps = 23/143 (16%)

Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
              +DD ++E++GE        E +  I  L++            YL+   G +  +   D
Sbjct: 1192 INKDDMIIEYVGE--------EVRQQIAELREAR----------YLKSGIGSSYLFRIDD 1233

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
              V+DA  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    E
Sbjct: 1234 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERE 1293

Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
                 +   CLCG+  C+G +LN
Sbjct: 1294 IGST-DRIPCLCGTAACKG-FLN 1314


>gi|119914792|ref|XP_589886.3| PREDICTED: histone-lysine N-methyltransferase SETD2 [Bos taurus]
          Length = 2547

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1542 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1590

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1591 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1644

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1645 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1672


>gi|344228738|gb|EGV60624.1| histone H3-K4 methyltransferase Set1 [Candida tenuis ATCC 10573]
          Length = 1037

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C P+C AK+  VDG  +I IY +R I   EE+T+DY    E+
Sbjct: 958  TVIDATKKGGIARFINHCCSPSCTAKIIKVDGKKRIVIYALRDIEANEELTYDYKFERET 1017

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +  E   CLCG+  C+G YLN
Sbjct: 1018 NDS-ERIRCLCGAPGCKG-YLN 1037


>gi|397498815|ref|XP_003820170.1| PREDICTED: histone-lysine N-methyltransferase MLL [Pan paniscus]
          Length = 4202

 Score = 71.6 bits (174), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 4073 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 4115

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 4116 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 4169

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 4170 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 4202


>gi|432105765|gb|ELK31956.1| Histone-lysine N-methyltransferase MLL, partial [Myotis davidii]
          Length = 3463

 Score = 71.2 bits (173), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3334 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3376

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3377 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3430

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3431 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3463


>gi|148693675|gb|EDL25622.1| mCG1547 [Mus musculus]
          Length = 3706

 Score = 71.2 bits (173), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3577 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3619

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3620 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3673

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3674 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3706


>gi|410220670|gb|JAA07554.1| SET domain containing 2 [Pan troglodytes]
 gi|410261336|gb|JAA18634.1| SET domain containing 2 [Pan troglodytes]
 gi|410295964|gb|JAA26582.1| SET domain containing 2 [Pan troglodytes]
 gi|410339683|gb|JAA38788.1| SET domain containing 2 [Pan troglodytes]
          Length = 2564

 Score = 71.2 bits (173), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1559 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1607

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1608 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1661

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1662 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1689


>gi|348582642|ref|XP_003477085.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Cavia
            porcellus]
          Length = 2565

 Score = 71.2 bits (173), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1560 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1608

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1609 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1662

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1663 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1690


>gi|197313748|ref|NP_054878.5| histone-lysine N-methyltransferase SETD2 [Homo sapiens]
 gi|296452963|sp|Q9BYW2.3|SETD2_HUMAN RecName: Full=Histone-lysine N-methyltransferase SETD2; AltName:
            Full=HIF-1; AltName: Full=Huntingtin yeast partner B;
            AltName: Full=Huntingtin-interacting protein 1;
            Short=HIP-1; AltName: Full=Huntingtin-interacting protein
            B; AltName: Full=Lysine N-methyltransferase 3A; AltName:
            Full=SET domain-containing protein 2; Short=hSET2;
            AltName: Full=p231HBP
          Length = 2564

 Score = 71.2 bits (173), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1559 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1607

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1608 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1661

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1662 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1689


>gi|297671474|ref|XP_002813857.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Pongo abelii]
          Length = 2563

 Score = 71.2 bits (173), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1558 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1606

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1607 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1660

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1661 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1688


>gi|114586572|ref|XP_516423.2| PREDICTED: histone-lysine N-methyltransferase SETD2 isoform 3 [Pan
            troglodytes]
          Length = 2549

 Score = 71.2 bits (173), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1544 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1592

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1593 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1646

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1647 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1674


>gi|440891718|gb|ELR45266.1| Histone-lysine N-methyltransferase SETD2, partial [Bos grunniens
            mutus]
          Length = 2533

 Score = 71.2 bits (173), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1528 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1576

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1577 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1630

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1631 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1658


>gi|410911878|ref|XP_003969417.1| PREDICTED: uncharacterized protein LOC101064190 [Takifugu rubripes]
          Length = 2720

 Score = 71.2 bits (173), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 52/154 (33%), Positives = 74/154 (48%), Gaps = 30/154 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IR++  +      ++Y+      
Sbjct: 2591 GRGLFCKRNIEAGE--MVIEYAGTV------------IRAVLTDKRQ---KYYDGKGIGC 2633

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R     D +D  VVDA  + N A  I HSC PNC ++V  VDG   I I+ +R I+ 
Sbjct: 2634 YMFR----IDDFD--VVDATMQGNAARFINHSCEPNCYSRVINVDGRKHIVIFALRKIYR 2687

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSY 2040
            GEE+T+DY    E  E      C CG++ CRGS 
Sbjct: 2688 GEELTYDYKFPIEDDESKLH--CNCGTRRCRGSL 2719


>gi|397495290|ref|XP_003818492.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Pan paniscus]
          Length = 2564

 Score = 71.2 bits (173), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1559 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1607

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1608 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1661

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1662 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1689


>gi|242010887|ref|XP_002426189.1| mixed-lineage leukemia protein, mll, putative [Pediculus humanus
            corporis]
 gi|212510240|gb|EEB13451.1| mixed-lineage leukemia protein, mll, putative [Pediculus humanus
            corporis]
          Length = 574

 Score = 71.2 bits (173), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 48/142 (33%), Positives = 68/142 (47%), Gaps = 24/142 (16%)

Query: 1903 FGEDDFVVEFLGE-VYPVWKWF-EKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL 1960
               D+ V+E++G+ V P    F EK+   R +       +   + I LE           
Sbjct: 455  IAADEMVIEYVGQMVRPFLADFREKEYEKRGIG------SSYLFRIDLE----------- 497

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N A  I HSC PNC AK+  ++G  +I IY+ + I   EEIT+DY    E 
Sbjct: 498  TIIDATKCGNLARFINHSCNPNCYAKIITIEGQKKIVIYSKKDIKVDEEITYDYKFPIEE 557

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
                E   CLCG+  C+G YLN
Sbjct: 558  ----EKIPCLCGAAQCKG-YLN 574


>gi|312378119|gb|EFR24776.1| hypothetical protein AND_10404 [Anopheles darlingi]
          Length = 2632

 Score = 71.2 bits (173), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 48/149 (32%), Positives = 74/149 (49%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+  +     GE  F++E++GEV      F+++    S  KN       +Y + L 
Sbjct: 1470 KKGFGIQASAPIAPGE--FIMEYVGEVL-NGSQFDQRAEAYSRDKNKH-----YYFMALR 1521

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
                 +DG    ++DA  K N +  I HSC PN E +   V+G  +IG ++ + I  GEE
Sbjct: 1522 -----SDG----IIDATTKGNISRFINHSCDPNAETQKWTVNGELRIGFFSTKYILPGEE 1572

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            ITFDY      +   +A  C C ++ CRG
Sbjct: 1573 ITFDYQF---QRYGRKAQKCFCEAENCRG 1598


>gi|296225059|ref|XP_002758501.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Callithrix
            jacchus]
          Length = 2510

 Score = 71.2 bits (173), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1505 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1553

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1554 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1607

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1608 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1635


>gi|354544237|emb|CCE40960.1| hypothetical protein CPAR2_109980 [Candida parapsilosis]
          Length = 1042

 Score = 71.2 bits (173), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 48/82 (58%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C P+C AK+  V+G  +I IY +R I   EE+T+DY    E+
Sbjct: 963  TVIDATKKGGIARFINHCCNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERET 1022

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +E E   CLCG+  C+G YLN
Sbjct: 1023 NDE-ERIRCLCGAPGCKG-YLN 1042


>gi|255539394|ref|XP_002510762.1| set domain protein, putative [Ricinus communis]
 gi|223551463|gb|EEF52949.1| set domain protein, putative [Ricinus communis]
          Length = 1258

 Score = 71.2 bits (173), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 53/137 (38%), Positives = 68/137 (49%), Gaps = 25/137 (18%)

Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVV 1962
            +DFV+E++GE+            IR    +  +   E   I   YL R     DGY   V
Sbjct: 1142 EDFVIEYVGEL------------IRPRISDIRERLYEKMGIGSSYLFRLD---DGY---V 1183

Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
            VDA  +   A  I HSC PNC  KV +V+G  +I IY  R I  GEEIT++Y    E K+
Sbjct: 1184 VDATKRGGVARFINHSCEPNCYTKVISVEGQKKIFIYAKRHIAAGEEITYNYKFPLEEKK 1243

Query: 2023 EYEASVCLCGSQVCRGS 2039
                  C CGS+ CRGS
Sbjct: 1244 ----IPCNCGSRKCRGS 1256


>gi|384493570|gb|EIE84061.1| hypothetical protein RO3G_08766 [Rhizopus delemar RA 99-880]
          Length = 815

 Score = 71.2 bits (173), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 36/79 (45%), Positives = 45/79 (56%), Gaps = 7/79 (8%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            ++DA  K   A  I HSC PNC  +   V  + +IGI+T R I  GEE+TFDY       
Sbjct: 103  IIDATKKGCLARFINHSCNPNCVTQKWVVGKNMRIGIFTTRCIKAGEELTFDYKF----- 157

Query: 2022 EEY--EASVCLCGSQVCRG 2038
            E Y  +A VC CG QVC+G
Sbjct: 158  ERYGAQAQVCYCGEQVCKG 176


>gi|332216412|ref|XP_003257344.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Nomascus
            leucogenys]
          Length = 2499

 Score = 71.2 bits (173), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1544 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1592

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1593 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1646

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1647 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1674


>gi|302682536|ref|XP_003030949.1| hypothetical protein SCHCODRAFT_77129 [Schizophyllum commune H4-8]
 gi|300104641|gb|EFI96046.1| hypothetical protein SCHCODRAFT_77129 [Schizophyllum commune H4-8]
          Length = 171

 Score = 71.2 bits (173), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 48/82 (58%), Gaps = 5/82 (6%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
            +VVDA  K N    I HSC PNC AK+  ++G  +I IY  R I  G+EIT+DY+   E 
Sbjct: 95   IVVDATKKGNLGRLINHSCDPNCTAKIITINGEKKIVIYAKRDIELGDEITYDYHFPFEQ 154

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
                +   CLCG+  CRG +LN
Sbjct: 155  ----DKIPCLCGTAKCRG-FLN 171


>gi|431905124|gb|ELK10179.1| Histone-lysine N-methyltransferase SETD2 [Pteropus alecto]
          Length = 2482

 Score = 71.2 bits (173), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1477 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1525

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1526 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1579

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1580 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1607


>gi|410972021|ref|XP_003992459.1| PREDICTED: histone-lysine N-methyltransferase MLL [Felis catus]
          Length = 3554

 Score = 71.2 bits (173), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3425 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3467

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3468 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3521

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3522 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3554


>gi|392590566|gb|EIW79895.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
          Length = 160

 Score = 71.2 bits (173), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 50/139 (35%), Positives = 68/139 (48%), Gaps = 26/139 (18%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVV 1963
            + V+E++GEV            +R+   +  + A E   I   YL R   D      +VV
Sbjct: 45   EMVIEYVGEV------------VRAQVADKREKAYERQGIGSSYLFRIDED------LVV 86

Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
            DA  K N    I HSC PNC A++  + G  +I IY  + I  G+EIT+DY+   E    
Sbjct: 87   DATKKGNLGRLINHSCDPNCTARIITISGEKKIVIYAKQDIELGDEITYDYHFPIEQ--- 143

Query: 2024 YEASVCLCGSQVCRGSYLN 2042
             +   CLCGS  CRG YLN
Sbjct: 144  -DKIPCLCGSAKCRG-YLN 160


>gi|42407424|dbj|BAD10031.1| SET domain protein-like [Oryza sativa Japonica Group]
          Length = 437

 Score = 71.2 bits (173), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 46/145 (31%), Positives = 71/145 (48%), Gaps = 20/145 (13%)

Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
            +DDFV+EF+GEV       E+ + +R     N          Y+ + K D       V+D
Sbjct: 311  KDDFVIEFVGEVIDDETCEERLEDMRRRGDKN---------FYMCKVKKD------FVID 355

Query: 1965 AMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY 2024
            A  K N      HSC PNC+ +   V+G  ++G++  + I  GE +T+DY        E 
Sbjct: 356  ATFKGNDCRFFNHSCEPNCQLQKWQVNGKTRLGVFASKAIEVGEPLTYDYRFEQHYGPEI 415

Query: 2025 EASVCLCGSQVCRGSYLNLTGEGAF 2049
            E   C CG+Q C+G+ +++ G G F
Sbjct: 416  E---CFCGAQNCQGN-MSIVG-GCF 435


>gi|119585214|gb|EAW64810.1| SET domain containing 2, isoform CRA_f [Homo sapiens]
          Length = 2342

 Score = 71.2 bits (173), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1337 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1385

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1386 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1439

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1440 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1467


>gi|350294046|gb|EGZ75131.1| histone-lysine N-methyltransferase, H3 lysine-4 specific [Neurospora
            tetrasperma FGSC 2509]
          Length = 1313

 Score = 71.2 bits (173), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 47/143 (32%), Positives = 69/143 (48%), Gaps = 23/143 (16%)

Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
              +DD ++E++GE        E +  I  L++            YL+   G +  +   D
Sbjct: 1191 INKDDMIIEYVGE--------EVRQQIAELREAR----------YLKSGIGSSYLFRIDD 1232

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
              V+DA  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    E
Sbjct: 1233 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERE 1292

Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
                 +   CLCG+  C+G +LN
Sbjct: 1293 IGST-DRIPCLCGTAACKG-FLN 1313


>gi|148677064|gb|EDL09011.1| mCG15806 [Mus musculus]
          Length = 2034

 Score = 71.2 bits (173), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1030 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1078

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1079 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1132

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1133 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1160


>gi|359078405|ref|XP_002697155.2| PREDICTED: histone-lysine N-methyltransferase SETD2 [Bos taurus]
          Length = 1448

 Score = 71.2 bits (173), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 482  KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 530

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 531  ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 584

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 585  LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 612


>gi|432114829|gb|ELK36567.1| Putative histone-lysine N-methyltransferase NSD2 [Myotis davidii]
          Length = 1037

 Score = 71.2 bits (173), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 78/148 (52%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++   E+    FY + +++
Sbjct: 687  KGWGLVATRDIRKGE--FVNEYVGEL------IDEEECMARIKHAQENDITHFYMLTIDK 738

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 739  DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 789

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 790  TFNYNLDCLGNEK---TVCRCGAPNCSG 814


>gi|367037743|ref|XP_003649252.1| lysine methyltransferase enzyme-like protein [Thielavia terrestris
            NRRL 8126]
 gi|346996513|gb|AEO62916.1| lysine methyltransferase enzyme-like protein [Thielavia terrestris
            NRRL 8126]
          Length = 1286

 Score = 71.2 bits (173), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 36/84 (42%), Positives = 47/84 (55%), Gaps = 2/84 (2%)

Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            D  V+DA  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    
Sbjct: 1205 DNTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFER 1264

Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
            E     +   CLCG+  C+G +LN
Sbjct: 1265 ELGST-DRIPCLCGTAACKG-FLN 1286


>gi|119585209|gb|EAW64805.1| SET domain containing 2, isoform CRA_a [Homo sapiens]
          Length = 1538

 Score = 71.2 bits (173), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 50/155 (32%), Positives = 73/155 (47%), Gaps = 21/155 (13%)

Query: 1887 VAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI 1946
            +  +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + 
Sbjct: 1053 LTEKKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHY 1101

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y    K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  
Sbjct: 1102 YFMALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPS 1155

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            G E+TFDY      K   EA  C CGS  CRG YL
Sbjct: 1156 GSELTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1186


>gi|210032580|ref|NP_055863.1| histone-lysine N-methyltransferase SETD1B [Homo sapiens]
 gi|166977692|sp|Q9UPS6.2|SET1B_HUMAN RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
            Full=Lysine N-methyltransferase 2G; AltName: Full=SET
            domain-containing protein 1B; Short=hSET1B
          Length = 1923

 Score = 71.2 bits (173), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1847 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1906

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 1907 VK----IPCLCGSENCRGT 1921


>gi|334192482|gb|AEG67286.1| histone-lysine N-methyltransferase [Homo sapiens]
          Length = 1966

 Score = 71.2 bits (173), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1890 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1949

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 1950 VK----IPCLCGSENCRGT 1964


>gi|443726566|gb|ELU13685.1| hypothetical protein CAPTEDRAFT_150651 [Capitella teleta]
          Length = 292

 Score = 71.2 bits (173), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 35/79 (44%), Positives = 45/79 (56%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
            +++DA    N A  I HSC PNC AK+  V+ H +I IY+ R I   EEIT+DY    E 
Sbjct: 216  LIIDATKCGNLARFINHSCNPNCVAKIITVESHKKIVIYSRRDIGVNEEITYDYKFPLED 275

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
                E   CLCG+  CRG+
Sbjct: 276  ----EKIPCLCGTSACRGT 290


>gi|431908264|gb|ELK11862.1| Histone-lysine N-methyltransferase HRX [Pteropus alecto]
          Length = 3459

 Score = 71.2 bits (173), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3330 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3372

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3373 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3426

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3427 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3459


>gi|74697791|sp|Q8X0S9.1|SET1_NEUCR RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|18376303|emb|CAD21415.1| related to regulatory protein SET1 [Neurospora crassa]
          Length = 1313

 Score = 71.2 bits (173), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 47/143 (32%), Positives = 69/143 (48%), Gaps = 23/143 (16%)

Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
              +DD ++E++GE        E +  I  L++            YL+   G +  +   D
Sbjct: 1191 INKDDMIIEYVGE--------EVRQQIAELREAR----------YLKSGIGSSYLFRIDD 1232

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
              V+DA  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    E
Sbjct: 1233 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERE 1292

Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
                 +   CLCG+  C+G +LN
Sbjct: 1293 IGST-DRIPCLCGTAACKG-FLN 1313


>gi|410047437|ref|XP_003314036.2| PREDICTED: uncharacterized protein LOC473295, partial [Pan
            troglodytes]
          Length = 1955

 Score = 71.2 bits (173), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1879 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1938

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 1939 VK----IPCLCGSENCRGT 1953


>gi|242221772|ref|XP_002476627.1| predicted protein [Postia placenta Mad-698-R]
 gi|220724099|gb|EED78169.1| predicted protein [Postia placenta Mad-698-R]
          Length = 115

 Score = 71.2 bits (173), Expect = 8e-09,   Method: Composition-based stats.
 Identities = 50/137 (36%), Positives = 67/137 (48%), Gaps = 26/137 (18%)

Query: 1909 VVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVVDA 1965
            V+E++GE+            IR+   +  + A E   I   YL R   D      +VVDA
Sbjct: 2    VIEYVGEI------------IRAQVADKREKAYERQGIGSSYLFRIDED------LVVDA 43

Query: 1966 MHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE 2025
              K N    I HSC PNC AK+  ++G  +I IY  + I  G EIT+DY+   E     +
Sbjct: 44   TKKGNLGRLINHSCDPNCTAKIITINGEKKIVIYAKQDIELGSEITYDYHFPIEQ----D 99

Query: 2026 ASVCLCGSQVCRGSYLN 2042
               CLCGS  CRG +LN
Sbjct: 100  KIPCLCGSAKCRG-FLN 115


>gi|198435574|ref|XP_002121834.1| PREDICTED: absent, small, or homeotic discs 1 homolog [Ciona
            intestinalis]
          Length = 2850

 Score = 71.2 bits (173), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 53/174 (30%), Positives = 83/174 (47%), Gaps = 22/174 (12%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G GV  N +    E  F++E++GEV       E++   R+++  N +   + Y + LE 
Sbjct: 2130 RGWGVRTNSD--IPEGQFLLEYVGEVVS-----EREFRRRTIE--NYNAHNDHYCVQLEA 2180

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
                       V+D    AN    + HSC+PNCE +   V+G Y++G++  R I   EE+
Sbjct: 2181 G---------TVIDGYRLANEGRFVNHSCQPNCEMQKWVVNGEYRVGLFAKRPIVSSEEL 2231

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFE--KVLKELHGLLDR 2062
            T+DYN    + +  +   C CGS  CRG     T  GA +  K    LH   +R
Sbjct: 2232 TYDYNFHAYNLDRQQP--CRCGSSECRGVIGGKTQRGAEQGGKTRSTLHPTKER 2283


>gi|242786320|ref|XP_002480782.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
 gi|218720929|gb|EED20348.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
          Length = 1155

 Score = 71.2 bits (173), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1076 AVIDATKRGGIARFINHSCTPNCTAKIIRVDGSKRIVIYALRDISKDEELTYDYKFEREW 1135

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              E +   CLCGS  C+G +LN
Sbjct: 1136 DSE-DRIPCLCGSAGCKG-FLN 1155


>gi|384250559|gb|EIE24038.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
            C-169]
          Length = 295

 Score = 70.9 bits (172), Expect = 8e-09,   Method: Composition-based stats.
 Identities = 54/170 (31%), Positives = 83/170 (48%), Gaps = 25/170 (14%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +K  A  KG G+   ++   G+  F+VE++GEV       E+++ +R      E     +
Sbjct: 85   EKRRAGAKGFGLFATQDLVAGQ--FIVEYIGEV------LEEEEYLRRKDYYQESGQRHY 136

Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
            Y  ++    G+       V+DA  K      I HSC PNCE +   V G   IG+Y ++ 
Sbjct: 137  Y--FMNIGNGE-------VIDAARKGALGRFINHSCNPNCETQKWVVRGELAIGLYALKD 187

Query: 2004 IHYGEEITFDYNSVTESKEEY--EASVCLCGSQVCRGSYLNLTGEGAFEK 2051
            I  G E+TFDYN      E Y  +   CLC ++VCRG ++  TGE   ++
Sbjct: 188  IPAGVELTFDYNF-----ERYGDKPMRCLCEAKVCRG-FIGGTGEAVAQE 231


>gi|119618696|gb|EAW98290.1| hCG1812756 [Homo sapiens]
          Length = 1048

 Score = 70.9 bits (172), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 46/80 (57%), Gaps = 4/80 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 972  TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1031

Query: 2021 KEEYEASVCLCGSQVCRGSY 2040
             +      CLCGS+ CRG+ 
Sbjct: 1032 VK----IPCLCGSENCRGTL 1047


>gi|336472713|gb|EGO60873.1| hypothetical protein NEUTE1DRAFT_144212 [Neurospora tetrasperma FGSC
            2508]
          Length = 1282

 Score = 70.9 bits (172), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 47/143 (32%), Positives = 69/143 (48%), Gaps = 23/143 (16%)

Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
              +DD ++E++GE        E +  I  L++            YL+   G +  +   D
Sbjct: 1160 INKDDMIIEYVGE--------EVRQQIAELREAR----------YLKSGIGSSYLFRIDD 1201

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
              V+DA  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    E
Sbjct: 1202 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERE 1261

Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
                 +   CLCG+  C+G +LN
Sbjct: 1262 IGST-DRIPCLCGTAACKG-FLN 1282


>gi|440632035|gb|ELR01954.1| hypothetical protein GMDG_05127 [Geomyces destructans 20631-21]
          Length = 1301

 Score = 70.9 bits (172), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             VVDA  +   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    E 
Sbjct: 1222 TVVDATKRGGIARFINHSCMPNCTAKIIKVEGTRRIVIYALRDIKLNEELTYDYKFEREI 1281

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCG+  C+G +LN
Sbjct: 1282 GSD-DRIPCLCGTVACKG-FLN 1301


>gi|417406999|gb|JAA50136.1| Putative clathrin coat binding protein/huntingtin [Desmodus rotundus]
          Length = 2557

 Score = 70.9 bits (172), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1552 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1600

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1601 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1654

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1655 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1682


>gi|212543321|ref|XP_002151815.1| SET domain protein [Talaromyces marneffei ATCC 18224]
 gi|210066722|gb|EEA20815.1| SET domain protein [Talaromyces marneffei ATCC 18224]
          Length = 1188

 Score = 70.9 bits (172), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1109 AVIDATKRGGIARFINHSCTPNCTAKIIRVDGSKRIVIYALRDISKDEELTYDYKFEREW 1168

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              E +   CLCGS  C+G +LN
Sbjct: 1169 DSE-DRIPCLCGSAGCKG-FLN 1188


>gi|403281795|ref|XP_003932362.1| PREDICTED: histone-lysine N-methyltransferase SETD1B [Saimiri
            boliviensis boliviensis]
          Length = 1823

 Score = 70.9 bits (172), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1747 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1806

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 1807 VK----IPCLCGSENCRGT 1821


>gi|211830050|gb|AAH38367.2| Setd1b protein [Mus musculus]
          Length = 1103

 Score = 70.9 bits (172), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 46/80 (57%), Gaps = 4/80 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1027 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1086

Query: 2021 KEEYEASVCLCGSQVCRGSY 2040
             +      CLCGS+ CRG+ 
Sbjct: 1087 VK----IPCLCGSENCRGTL 1102


>gi|322788177|gb|EFZ13959.1| hypothetical protein SINV_06678 [Solenopsis invicta]
          Length = 1093

 Score = 70.9 bits (172), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 66/131 (50%), Gaps = 18/131 (13%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            FV+E++GE+       +  +  R L +  E     FY + ++  +          +DA  
Sbjct: 915  FVIEYVGEI------IDDAEYKRRLHRKKELKNENFYFLTIDNNR---------TIDAEP 959

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
            K N +  + HSC PNCE +   V+G  +IG++ +R I  GEE+TF+YN  ++ +      
Sbjct: 960  KGNLSRFMNHSCAPNCETQKWTVNGDTRIGLFALRDIESGEELTFNYNLASDGETR---K 1016

Query: 2028 VCLCGSQVCRG 2038
             CLCG+  C G
Sbjct: 1017 ACLCGASNCSG 1027


>gi|224071200|ref|XP_002193972.1| PREDICTED: histone-lysine N-methyltransferase SETD1B [Taeniopygia
            guttata]
          Length = 2004

 Score = 70.9 bits (172), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1928 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1987

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 1988 VK----IPCLCGSENCRGT 2002


>gi|12697196|emb|CAC28349.1| huntingtin interacting protein 1 [Homo sapiens]
 gi|50512435|gb|AAT77612.1| HSPC069 isoform a [Homo sapiens]
          Length = 2061

 Score = 70.9 bits (172), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1056 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1104

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1105 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1158

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1159 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1186


>gi|194764639|ref|XP_001964436.1| GF23177 [Drosophila ananassae]
 gi|190614708|gb|EDV30232.1| GF23177 [Drosophila ananassae]
          Length = 3708

 Score = 70.9 bits (172), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 55/151 (36%), Positives = 73/151 (48%), Gaps = 23/151 (15%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C K+   GE   V+E+ GE+            IRS   +  +   +   I     
Sbjct: 3581 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3626

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            K D    D +VVDA  + N A  I HSC PNC +KV  + GH  I I+ +R I  GEE+T
Sbjct: 3627 KID----DNLVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 3682

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    +   E E   C CGS+ CR  YLN
Sbjct: 3683 YDY----KFPFEEEKIPCSCGSKRCR-KYLN 3708


>gi|109658484|gb|AAI17163.1| SET domain containing 2 [Homo sapiens]
 gi|109658962|gb|AAI17165.1| SET domain containing 2 [Homo sapiens]
          Length = 2061

 Score = 70.9 bits (172), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1056 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1104

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1105 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1158

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1159 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1186


>gi|402887949|ref|XP_003907341.1| PREDICTED: uncharacterized protein LOC101023789 [Papio anubis]
          Length = 1927

 Score = 70.9 bits (172), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1851 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1910

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 1911 VK----IPCLCGSENCRGT 1925


>gi|160333334|ref|NP_001103749.1| histone-lysine N-methyltransferase MLL [Danio rerio]
 gi|158714185|gb|ABW79914.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
          Length = 4218

 Score = 70.9 bits (172), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 55/162 (33%), Positives = 76/162 (46%), Gaps = 31/162 (19%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            Y +   G G+ C K    GE   V+E+ G V            IRS+  +  +   ++Y+
Sbjct: 4083 YRSAIHGRGLFCRKNIEPGE--MVIEYSGNV------------IRSVLTDKRE---KYYD 4125

Query: 1946 -----IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
                  Y+ R     D Y+  VVDA    N A  I HSC PNC ++V  VDG   I I+ 
Sbjct: 4126 DKGIGCYMFR----IDDYE--VVDATIHGNSARFINHSCEPNCYSRVVNVDGQKHIVIFA 4179

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
             R I+ GEE+T+DY    E  E      C CG++ CR  +LN
Sbjct: 4180 TRKIYKGEELTYDYKFPIE--EPGNKLPCNCGAKKCR-KFLN 4218


>gi|302801428|ref|XP_002982470.1| hypothetical protein SELMODRAFT_421873 [Selaginella moellendorffii]
 gi|300149569|gb|EFJ16223.1| hypothetical protein SELMODRAFT_421873 [Selaginella moellendorffii]
          Length = 1285

 Score = 70.9 bits (172), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 49/152 (32%), Positives = 70/152 (46%), Gaps = 26/152 (17%)

Query: 1895 VVCNKEG-------GFGEDDFVVEFLGEVYPVWKW-FEKQDGIRSLQKNNEDPAPEFYNI 1946
            V C K+G          +  FV+E++GEV     +   +++  R  QK+       FY +
Sbjct: 580  VRCGKKGFGLKALENIAKGSFVIEYVGEVLDSRSFELRQKEYARQRQKH-------FYFM 632

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
             L   +         V+DA  K N    I HSC PNC+ +   V+G   IG++ +R +  
Sbjct: 633  TLNSSE---------VIDACRKGNLGRFINHSCEPNCQTEKWCVNGEICIGLFAIRDVAK 683

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
             EEITF+YN   E      A  C CGS  CRG
Sbjct: 684  NEEITFNYN--FERLYGAAAKKCHCGSAHCRG 713


>gi|225380774|gb|ACN88688.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
          Length = 4219

 Score = 70.9 bits (172), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 55/162 (33%), Positives = 76/162 (46%), Gaps = 31/162 (19%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            Y +   G G+ C K    GE   V+E+ G V            IRS+  +  +   ++Y+
Sbjct: 4084 YRSAIHGRGLFCRKNIEPGE--MVIEYSGNV------------IRSVLTDKRE---KYYD 4126

Query: 1946 -----IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
                  Y+ R     D Y+  VVDA    N A  I HSC PNC ++V  VDG   I I+ 
Sbjct: 4127 DKGIGCYMFR----IDDYE--VVDATIHGNSARFINHSCEPNCYSRVVNVDGQKHIVIFA 4180

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
             R I+ GEE+T+DY    E  E      C CG++ CR  +LN
Sbjct: 4181 TRKIYKGEELTYDYKFPIE--EPGNKLPCNCGAKKCR-KFLN 4219


>gi|134084734|emb|CAK43391.1| unnamed protein product [Aspergillus niger]
          Length = 1079

 Score = 70.9 bits (172), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1000 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1059

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1060 DSD-DRIPCLCGSTGCKG-FLN 1079


>gi|409078063|gb|EKM78427.1| hypothetical protein AGABI1DRAFT_41599 [Agaricus bisporus var.
            burnettii JB137-S8]
 gi|426194069|gb|EKV44001.1| histone methyltransferase [Agaricus bisporus var. bisporus H97]
          Length = 163

 Score = 70.9 bits (172), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 51/139 (36%), Positives = 68/139 (48%), Gaps = 26/139 (18%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVV 1963
            + V+E++GEV            IR+   +  + A E   I   YL R   D      +VV
Sbjct: 48   EMVIEYVGEV------------IRAAVADKREKAYERQGIGSSYLFRIDED------LVV 89

Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
            DA  K N    I HSC PNC AK+  + G  +I IY  + I  G+EIT+DY+   E    
Sbjct: 90   DATKKGNLGRLINHSCDPNCTAKIITISGVKKIVIYAKQDIELGDEITYDYHFPFEQ--- 146

Query: 2024 YEASVCLCGSQVCRGSYLN 2042
             +   CLCGS  CRG +LN
Sbjct: 147  -DKIPCLCGSAKCRG-FLN 163


>gi|338714932|ref|XP_001495700.3| PREDICTED: histone-lysine N-methyltransferase SETD2 [Equus caballus]
          Length = 2064

 Score = 70.9 bits (172), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1059 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1107

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1108 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1161

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1162 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1189


>gi|317037780|ref|XP_001399137.2| histone-lysine N-methyltransferase, H3 lysine-4 specific [Aspergillus
            niger CBS 513.88]
          Length = 1239

 Score = 70.9 bits (172), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1160 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1219

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1220 DSD-DRIPCLCGSTGCKG-FLN 1239


>gi|403268536|ref|XP_003926329.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Saimiri
            boliviensis boliviensis]
          Length = 2057

 Score = 70.9 bits (172), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1052 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1100

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1101 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1154

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1155 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1182


>gi|396578140|ref|NP_001035488.2| histone-lysine N-methyltransferase SETD1B [Mus musculus]
 gi|166977693|sp|Q8CFT2.2|SET1B_MOUSE RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
            Full=SET domain-containing protein 1B
          Length = 1985

 Score = 70.9 bits (172), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1909 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1968

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 1969 VK----IPCLCGSENCRGT 1983


>gi|170050731|ref|XP_001861443.1| set domain protein [Culex quinquefasciatus]
 gi|167872245|gb|EDS35628.1| set domain protein [Culex quinquefasciatus]
          Length = 1181

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 43/148 (29%), Positives = 74/148 (50%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   G+  FV+E++GEV         ++  R +++  E     +Y + ++ 
Sbjct: 961  KGWGLVAQEDIHQGQ--FVIEYVGEV------INGEELARRIKQKQEQKDENYYFLTVD- 1011

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
                      + +DA  K N A  I HSC PNCE  +  V G   +G++ ++ +  GEE+
Sbjct: 1012 --------SELTIDAGPKGNLARFINHSCEPNCETLLWKVGGSQSVGLFALKDLKAGEEL 1063

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN  T   ++    +C CG+  C G
Sbjct: 1064 TFNYNFETFGDQK---KICHCGAAKCSG 1088


>gi|441630858|ref|XP_003280765.2| PREDICTED: uncharacterized protein LOC100584028 [Nomascus leucogenys]
          Length = 1863

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1787 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1846

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 1847 VK----IPCLCGSENCRGT 1861


>gi|395543169|ref|XP_003773493.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
            N-methyltransferase NSD2 [Sarcophilus harrisii]
          Length = 1464

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1074 KGWGLVAKRDIKKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1125

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1126 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1176

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1177 TFNYNLDCLGNEK---TVCRCGASNCSG 1201


>gi|71897211|ref|NP_001025832.1| histone-lysine N-methyltransferase SETD1B [Gallus gallus]
 gi|82231199|sp|Q5F3P8.1|SET1B_CHICK RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
            Full=SET domain-containing protein 1B
 gi|60098811|emb|CAH65236.1| hypothetical protein RCJMB04_10j6 [Gallus gallus]
          Length = 2008

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1932 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1991

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 1992 VK----IPCLCGSENCRGT 2006


>gi|301754587|ref|XP_002913168.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            SETD1B-like [Ailuropoda melanoleuca]
          Length = 1805

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1729 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1788

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 1789 VK----IPCLCGSENCRGT 1803


>gi|149063329|gb|EDM13652.1| rCG21620 [Rattus norvegicus]
          Length = 1091

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 46/80 (57%), Gaps = 4/80 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1015 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1074

Query: 2021 KEEYEASVCLCGSQVCRGSY 2040
             +      CLCGS+ CRG+ 
Sbjct: 1075 VK----IPCLCGSENCRGTL 1090


>gi|157126101|ref|XP_001654536.1| set domain protein [Aedes aegypti]
 gi|108873380|gb|EAT37605.1| AAEL010414-PA [Aedes aegypti]
          Length = 1480

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 75/149 (50%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+V  ++   G+  FV+E++GEV         ++  R LQ         +Y + + 
Sbjct: 1229 QKGWGLVAQEDIRQGQ--FVIEYVGEV------ISNEELERRLQHKVAQKDENYYFLTV- 1279

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
                D++    + +DA  K N A  I HSC PNCE  +  V G   +G++ +  I  GEE
Sbjct: 1280 ----DSE----LTIDAGPKGNLARFINHSCEPNCETMLWTVGGAQSVGLFAIMDIKAGEE 1331

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +TF+YN   ESK + E  VC C +  C G
Sbjct: 1332 LTFNYN--FESKSD-EKKVCHCNASKCSG 1357


>gi|392352531|ref|XP_003751234.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-like [Rattus
            norvegicus]
          Length = 1900

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1824 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1883

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 1884 VK----IPCLCGSENCRGT 1898


>gi|312083807|ref|XP_003144016.1| hypothetical protein LOAG_08436 [Loa loa]
          Length = 761

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 59/176 (33%), Positives = 82/176 (46%), Gaps = 22/176 (12%)

Query: 1867 TMKMC-RGILKAMDSRPDDKYVAYR----KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWK 1921
            + +MC    L+  D+  DD ++  +    KG G     +   G D  + E++G V    +
Sbjct: 448  SQQMCANNFLRHHDTNDDDLFMEEKPTILKGFGAFAKCDINKGTD--LTEYVGHVMTKEE 505

Query: 1922 WFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRP 1981
            +FEK    R L  N E     ++ + L       D Y    VDA +  N A    HSC P
Sbjct: 506  YFEKLR-FRCLFNNLE---ASYFGMQLTN-----DFY----VDARNYGNIARSFNHSCEP 552

Query: 1982 NCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
            N +     VDG Y++ I T+R I  GEE+TFDY+  TE  E      CLCGS  CR
Sbjct: 553  NTKVDAVVVDGIYRLKISTIRDIKKGEELTFDYD--TEIIEGLVGMECLCGSTNCR 606


>gi|410951014|ref|XP_003982197.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Felis catus]
          Length = 2064

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1059 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1107

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1108 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1161

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1162 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1189


>gi|344297409|ref|XP_003420391.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            SETD1B-like [Loxodonta africana]
          Length = 1750

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1674 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1733

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 1734 VK----IPCLCGSENCRGT 1748


>gi|164426120|ref|XP_961572.2| hypothetical protein NCU01206 [Neurospora crassa OR74A]
 gi|157071206|gb|EAA32336.2| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 1150

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 47/143 (32%), Positives = 69/143 (48%), Gaps = 23/143 (16%)

Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
              +DD ++E++GE        E +  I  L++            YL+   G +  +   D
Sbjct: 1028 INKDDMIIEYVGE--------EVRQQIAELREAR----------YLKSGIGSSYLFRIDD 1069

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
              V+DA  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    E
Sbjct: 1070 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERE 1129

Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
                 +   CLCG+  C+G +LN
Sbjct: 1130 IGST-DRIPCLCGTAACKG-FLN 1150


>gi|345321023|ref|XP_001506028.2| PREDICTED: hypothetical protein LOC100074411 [Ornithorhynchus
            anatinus]
          Length = 1258

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1182 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1241

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 1242 VK----IPCLCGSENCRGT 1256


>gi|255730355|ref|XP_002550102.1| hypothetical protein CTRG_04399 [Candida tropicalis MYA-3404]
 gi|240132059|gb|EER31617.1| hypothetical protein CTRG_04399 [Candida tropicalis MYA-3404]
          Length = 1056

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 48/82 (58%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C P+C AK+  V+G  +I IY +R I   EE+T+DY    E+
Sbjct: 977  TVIDATKKGGIARFINHCCSPSCTAKIIKVEGIKRIVIYALRDIEANEELTYDYKFERET 1036

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +E E   CLCG+  C+G YLN
Sbjct: 1037 NDE-ERIRCLCGAPGCKG-YLN 1056


>gi|118090799|ref|XP_420839.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Gallus
            gallus]
          Length = 1369

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1078 KGWGLVAKRDIKKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1129

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1130 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1180

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1181 TFNYNLDCLGNEK---TVCKCGAPNCSG 1205


>gi|154278862|ref|XP_001540244.1| hypothetical protein HCAG_04084 [Ajellomyces capsulatus NAm1]
 gi|150412187|gb|EDN07574.1| hypothetical protein HCAG_04084 [Ajellomyces capsulatus NAm1]
          Length = 1266

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1187 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1246

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1247 DSD-DRIPCLCGSTGCKG-FLN 1266


>gi|225554361|gb|EEH02660.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus G186AR]
          Length = 1267

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1188 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1247

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1248 DSD-DRIPCLCGSTGCKG-FLN 1267


>gi|168275530|dbj|BAG10485.1| SET domain-containing protein 2 [synthetic construct]
          Length = 2642

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/82 (41%), Positives = 46/82 (56%), Gaps = 4/82 (4%)

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
            L ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E+TFDY     
Sbjct: 1690 LQIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSELTFDYQFQRY 1749

Query: 2020 SKEEYEASVCLCGSQVCRGSYL 2041
             K   EA  C CGS  CRG YL
Sbjct: 1750 GK---EAQKCFCGSANCRG-YL 1767


>gi|115400872|ref|XP_001216024.1| hypothetical protein ATEG_07403 [Aspergillus terreus NIH2624]
 gi|114189965|gb|EAU31665.1| hypothetical protein ATEG_07403 [Aspergillus terreus NIH2624]
          Length = 1230

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1151 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1210

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1211 DSD-DRIPCLCGSTGCKG-FLN 1230


>gi|406607680|emb|CCH40952.1| Histone-lysine N-methyltransferase [Wickerhamomyces ciferrii]
          Length = 1071

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 48/82 (58%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C+P+C AK+  V+G  +I IY +R I   EE+T+DY    E+
Sbjct: 992  TVIDATKKGGIARFINHCCQPSCTAKIIKVEGQKRIVIYALRDIGANEELTYDYKFERET 1051

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +  E   CLCG+  C+G YLN
Sbjct: 1052 NDN-ERVRCLCGAPGCKG-YLN 1071


>gi|392332670|ref|XP_003752655.1| PREDICTED: uncharacterized protein LOC100359816 [Rattus norvegicus]
          Length = 2265

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 2189 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 2248

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 2249 VK----IPCLCGSENCRGT 2263


>gi|340381930|ref|XP_003389474.1| PREDICTED: histone-lysine N-methyltransferase trithorax-like
            [Amphimedon queenslandica]
          Length = 192

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 56/147 (38%), Positives = 71/147 (48%), Gaps = 24/147 (16%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            GLG+ C +E   G  D V+E+ G V            IRS   +  +   E   I     
Sbjct: 65   GLGLFCLQEIDSG--DMVIEYAGTV------------IRSTLTDYRERFYESRGIGCYMF 110

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            + D+D     VVDA    N A  I HSC PNC +KV AVDG  +I I+ +R I  GEE+T
Sbjct: 111  RIDSDE----VVDATMSGNMARFINHSCEPNCYSKVVAVDGQKKIMIFALRRIVPGEELT 166

Query: 2012 FDYNSVTESKEEYEASV-CLCGSQVCR 2037
            +DY    E     EA + C CGS  CR
Sbjct: 167  YDYKFPIE-----EAKIPCKCGSARCR 188


>gi|307180358|gb|EFN68384.1| Histone-lysine N-methyltransferase trithorax [Camponotus floridanus]
          Length = 3218

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 60/175 (34%), Positives = 81/175 (46%), Gaps = 23/175 (13%)

Query: 1868 MKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQD 1927
            M M   ILK    +    Y ++  G G+ C ++   GE   V+E+ GEV           
Sbjct: 3067 MAMRFRILKETSKKSVGVYHSHIHGRGLFCLRDIEAGE--MVIEYAGEV----------- 3113

Query: 1928 GIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKV 1987
             IRS   +  +   +  NI     K D    D +VVDA  K N A  I HSC PNC ++V
Sbjct: 3114 -IRSSLTDKREKYYDSKNIGCYMFKID----DHLVVDATMKGNAARFINHSCEPNCYSRV 3168

Query: 1988 TAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
              + G   I I+ +R I  GEE+T+DY    E  +      C CGS+ CR  YLN
Sbjct: 3169 VDILGKKHILIFALRRIIQGEELTYDYKFPFEDIK----IPCTCGSRRCR-KYLN 3218


>gi|426374487|ref|XP_004054104.1| PREDICTED: uncharacterized protein LOC101124677 [Gorilla gorilla
            gorilla]
          Length = 1922

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1846 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1905

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 1906 VK----IPCLCGSENCRGT 1920


>gi|410976579|ref|XP_003994695.1| PREDICTED: uncharacterized protein LOC101096419 [Felis catus]
          Length = 1919

 Score = 70.9 bits (172), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1843 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1902

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 1903 VK----IPCLCGSENCRGT 1917


>gi|350630881|gb|EHA19253.1| hypothetical protein ASPNIDRAFT_56859 [Aspergillus niger ATCC 1015]
          Length = 1101

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1022 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1081

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1082 DSD-DRIPCLCGSTGCKG-FLN 1101


>gi|325089235|gb|EGC42545.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus H88]
          Length = 1267

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1188 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1247

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1248 DSD-DRIPCLCGSTGCKG-FLN 1267


>gi|195109821|ref|XP_001999480.1| GI24532 [Drosophila mojavensis]
 gi|193916074|gb|EDW14941.1| GI24532 [Drosophila mojavensis]
          Length = 3756

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 55/151 (36%), Positives = 72/151 (47%), Gaps = 23/151 (15%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C K+   GE   V+E+ GE+            IRS   +  +   +   I     
Sbjct: 3629 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3674

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            K D    D +VVDA  + N A  I HSC PNC +KV  + GH  I I+ +R I  GEE+T
Sbjct: 3675 KID----DNLVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 3730

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    E     E   C CGS+ CR  YLN
Sbjct: 3731 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3756


>gi|20521978|dbj|BAB21823.2| KIAA1732 protein [Homo sapiens]
          Length = 1915

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 910  KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 958

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 959  ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1012

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1013 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1040


>gi|390360513|ref|XP_785219.3| PREDICTED: histone-lysine N-methyltransferase NSD3-like
            [Strongylocentrotus purpuratus]
          Length = 1736

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 41/149 (27%), Positives = 80/149 (53%), Gaps = 19/149 (12%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            DFV E++GE+       ++++  R +++ +E+   +FY + L++ +         ++DA 
Sbjct: 1257 DFVNEYVGEL------VDEEECRRRIKQAHEENITDFYFLTLDKDR---------IIDAG 1301

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K N +  + HSC+PNCE +   V+G  ++G++ +R I  G EI+F+YN      E+   
Sbjct: 1302 PKGNLSRFMNHSCQPNCETQKWTVNGDTRVGLFAIRNIAAGNEISFNYNLDCLGNEKKR- 1360

Query: 2027 SVCLCGSQVCRGSYLNLTGEGAFEKVLKE 2055
              C CG+  C G ++ +  + A    ++E
Sbjct: 1361 --CECGAPNCSG-FIGVRPKTAAAAAMEE 1386


>gi|348554403|ref|XP_003463015.1| PREDICTED: hypothetical protein LOC100714908 [Cavia porcellus]
          Length = 1931

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 45/79 (56%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I   EEIT+DY    E 
Sbjct: 1855 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHISVNEEITYDYKFPIED 1914

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 1915 VK----IPCLCGSENCRGT 1929


>gi|340959767|gb|EGS20948.1| hypothetical protein CTHT_0027870 [Chaetomium thermophilum var.
            thermophilum DSM 1495]
          Length = 1295

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/84 (42%), Positives = 47/84 (55%), Gaps = 2/84 (2%)

Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            D  V+DA  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    
Sbjct: 1214 DNTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAKNEELTYDYKFER 1273

Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
            E     +   CLCG+  C+G +LN
Sbjct: 1274 ELGSA-DRIPCLCGTAACKG-FLN 1295


>gi|195392284|ref|XP_002054789.1| trx [Drosophila virilis]
 gi|194152875|gb|EDW68309.1| trx [Drosophila virilis]
          Length = 3822

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 55/151 (36%), Positives = 72/151 (47%), Gaps = 23/151 (15%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C K+   GE   V+E+ GE+            IRS   +  +   +   I     
Sbjct: 3695 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3740

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            K D    D +VVDA  + N A  I HSC PNC +KV  + GH  I I+ +R I  GEE+T
Sbjct: 3741 KID----DNLVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 3796

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    E     E   C CGS+ CR  YLN
Sbjct: 3797 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3822


>gi|355746723|gb|EHH51337.1| hypothetical protein EGM_10693 [Macaca fascicularis]
          Length = 2343

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 60/207 (28%), Positives = 89/207 (42%), Gaps = 48/207 (23%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   ++     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1338 KKGWGLRAARD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1386

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1387 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1440

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGE----------------------- 2046
            +TFDY      K   EA  C CGS  CRG YL   GE                       
Sbjct: 1441 LTFDYQFQRYGK---EAQKCFCGSANCRG-YLG--GENRVSIRAAGGKMKKERSRKKDSV 1494

Query: 2047 -GAFEKVLKELHGLLDRHQLMLEACEL 2072
             G  E +++   GL D++Q+ L  C L
Sbjct: 1495 DGELEALMENGEGLSDKNQV-LSLCRL 1520


>gi|60688116|gb|AAH90954.1| SETD2 protein [Homo sapiens]
          Length = 1845

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 840  KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 888

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 889  ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 942

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 943  LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 970


>gi|327348240|gb|EGE77097.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ATCC
            18188]
          Length = 1280

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1201 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1260

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1261 DSD-DRIPCLCGSTGCKG-FLN 1280


>gi|358373521|dbj|GAA90119.1| SET domain protein [Aspergillus kawachii IFO 4308]
          Length = 1239

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1160 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1219

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1220 DSD-DRIPCLCGSTGCKG-FLN 1239


>gi|28972602|dbj|BAC65717.1| mKIAA1076 protein [Mus musculus]
          Length = 855

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 46/80 (57%), Gaps = 4/80 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 779  TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 838

Query: 2021 KEEYEASVCLCGSQVCRGSY 2040
             +      CLCGS+ CRG+ 
Sbjct: 839  VK----IPCLCGSENCRGTL 854


>gi|297263735|ref|XP_002808043.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            SETD1B-like [Macaca mulatta]
          Length = 2216

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 2140 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 2199

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 2200 VK----IPCLCGSENCRGT 2214


>gi|261201264|ref|XP_002627032.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
            SLH14081]
 gi|239592091|gb|EEQ74672.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
            SLH14081]
 gi|239611745|gb|EEQ88732.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ER-3]
          Length = 1259

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1180 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1239

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1240 DSD-DRIPCLCGSTGCKG-FLN 1259


>gi|126332220|ref|XP_001374612.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2
            [Monodelphis domestica]
          Length = 1366

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1074 KGWGLVAKRDIKKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1125

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1126 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1176

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1177 TFNYNLDCLGNEK---TVCRCGASNCSG 1201


>gi|5689489|dbj|BAA83028.1| KIAA1076 protein [Homo sapiens]
          Length = 804

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 728  TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 787

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 788  VK----IPCLCGSENCRGT 802


>gi|297672976|ref|XP_002814554.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
            N-methyltransferase NSD2 [Pongo abelii]
          Length = 1365

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 1124

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200


>gi|119587786|gb|EAW67382.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
            Drosophila), isoform CRA_c [Homo sapiens]
          Length = 3130

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3001 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3043

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3044 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3097

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3098 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3130


>gi|27371314|gb|AAH41681.1| Setd1b protein, partial [Mus musculus]
          Length = 917

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 46/80 (57%), Gaps = 4/80 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 841  TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 900

Query: 2021 KEEYEASVCLCGSQVCRGSY 2040
             +      CLCGS+ CRG+ 
Sbjct: 901  VK----IPCLCGSENCRGTL 916


>gi|26251880|gb|AAH40775.1| Setd1b protein, partial [Mus musculus]
          Length = 911

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 46/80 (57%), Gaps = 4/80 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 835  TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 894

Query: 2021 KEEYEASVCLCGSQVCRGSY 2040
             +      CLCGS+ CRG+ 
Sbjct: 895  VK----IPCLCGSENCRGTL 910


>gi|351704076|gb|EHB06995.1| Putative histone-lysine N-methyltransferase NSD2 [Heterocephalus
            glaber]
          Length = 1372

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1079 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1130

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1131 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1181

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1182 TFNYNLDCLGNEK---TVCRCGASNCSG 1206


>gi|383860108|ref|XP_003705533.1| PREDICTED: uncharacterized protein LOC100883855 [Megachile rotundata]
          Length = 1766

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 74/149 (49%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+    +   GE  F++E++GEV       + +D  R  ++ ++D    +Y + L 
Sbjct: 822  KKGFGLRAMADMLAGE--FIMEYVGEV------VDPKDFRRRAKEYSKDKNRHYYFMAL- 872

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  I HSC PN E +   V+G  +IG +  + I  GEE
Sbjct: 873  --KSDQ------IIDATMKGNVSRFINHSCDPNSETQKWTVNGELRIGFFNKKFIAAGEE 924

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            ITFDY+     K   EA  C C +  CRG
Sbjct: 925  ITFDYHFQRYGK---EAQKCFCEAANCRG 950


>gi|195064789|ref|XP_001996640.1| GH19675 [Drosophila grimshawi]
 gi|193892772|gb|EDV91638.1| GH19675 [Drosophila grimshawi]
          Length = 3837

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 55/151 (36%), Positives = 72/151 (47%), Gaps = 23/151 (15%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C K+   GE   V+E+ GE+            IRS   +  +   +   I     
Sbjct: 3710 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3755

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            K D    D +VVDA  + N A  I HSC PNC +KV  + GH  I I+ +R I  GEE+T
Sbjct: 3756 KID----DNLVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 3811

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    E     E   C CGS+ CR  YLN
Sbjct: 3812 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3837


>gi|149243887|ref|XP_001526541.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
            YB-4239]
 gi|146448935|gb|EDK43191.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
            YB-4239]
          Length = 1156

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 48/82 (58%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C P+C AK+  VDG  +I IY +R I   EE+T+DY    E+
Sbjct: 1077 TVIDATKKGGIARFINHCCSPSCTAKIIKVDGKKRIVIYALRDIEANEELTYDYKFERET 1136

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             ++ E   CLCG+  C+G +LN
Sbjct: 1137 NDD-ERIRCLCGAPGCKG-FLN 1156


>gi|355744804|gb|EHH49429.1| Putative histone-lysine N-methyltransferase NSD2 [Macaca
            fascicularis]
          Length = 1365

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 1124

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200


>gi|410905477|ref|XP_003966218.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Takifugu
            rubripes]
          Length = 1950

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 46/148 (31%), Positives = 71/148 (47%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+   K+     + FV+E+ GEV    K F+ +    +  KN       +Y + L+ 
Sbjct: 932  KGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKTRVKEYARNKNIH-----YYFMALKN 983

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E+
Sbjct: 984  NE---------IIDATLKGNLSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKAVTAGTEL 1034

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TFDY      K   EA  C CG+  CRG
Sbjct: 1035 TFDYQFQRYGK---EAQKCFCGTLSCRG 1059


>gi|393910299|gb|EFO20057.2| hypothetical protein LOAG_08436 [Loa loa]
          Length = 770

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 59/176 (33%), Positives = 81/176 (46%), Gaps = 22/176 (12%)

Query: 1867 TMKMC-RGILKAMDSRPDDKYV----AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWK 1921
            + +MC    L+  D+  DD ++       KG G     +   G D  + E++G V    +
Sbjct: 457  SQQMCANNFLRHHDTNDDDLFMEEKPTILKGFGAFAKCDINKGTD--LTEYVGHVMTKEE 514

Query: 1922 WFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRP 1981
            +FEK    R L  N E     ++ + L       D Y    VDA +  N A    HSC P
Sbjct: 515  YFEKLR-FRCLFNNLE---ASYFGMQLTN-----DFY----VDARNYGNIARSFNHSCEP 561

Query: 1982 NCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
            N +     VDG Y++ I T+R I  GEE+TFDY+  TE  E      CLCGS  CR
Sbjct: 562  NTKVDAVVVDGIYRLKISTIRDIKKGEELTFDYD--TEIIEGLVGMECLCGSTNCR 615


>gi|383421363|gb|AFH33895.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
            mulatta]
 gi|384949270|gb|AFI38240.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
            mulatta]
 gi|387540940|gb|AFJ71097.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
            mulatta]
          Length = 1365

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 1124

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200


>gi|355557406|gb|EHH14186.1| Putative histone-lysine N-methyltransferase NSD2 [Macaca mulatta]
          Length = 1365

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 1124

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200


>gi|348530102|ref|XP_003452550.1| PREDICTED: histone-lysine N-methyltransferase MLL4-like [Oreochromis
            niloticus]
          Length = 399

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 38/84 (45%), Positives = 50/84 (59%), Gaps = 3/84 (3%)

Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            D  VVDA  + N A  I HSC PNC ++V  VDG   I I+ +R I+ GEE+T+DY    
Sbjct: 319  DFDVVDATMQGNAARFINHSCEPNCYSRVINVDGRKHIVIFALRKIYRGEELTYDYKFPI 378

Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
            E  +E    +C CG++ CR  YLN
Sbjct: 379  E--DESNKLLCNCGARRCR-RYLN 399


>gi|344276291|ref|XP_003409942.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Loxodonta
            africana]
          Length = 2551

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 49/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   ++     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1545 KKGWGLRAARD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1593

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1594 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1647

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1648 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1675


>gi|19913348|ref|NP_579877.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
 gi|19913350|ref|NP_579878.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
 gi|19913358|ref|NP_579890.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
 gi|109633019|ref|NP_001035889.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
 gi|74706096|sp|O96028.1|NSD2_HUMAN RecName: Full=Histone-lysine N-methyltransferase NSD2; AltName:
            Full=Multiple myeloma SET domain-containing protein;
            Short=MMSET; AltName: Full=Nuclear SET domain-containing
            protein 2; Short=NSD2; AltName: Full=Protein trithorax-5;
            AltName: Full=Wolf-Hirschhorn syndrome candidate 1
            protein; Short=WHSC1
 gi|3249713|gb|AAC24150.1| MMSET type II [Homo sapiens]
 gi|4378019|gb|AAD19343.1| putative WHSC1 protein [Homo sapiens]
 gi|4521954|gb|AAD21770.1| putative WHSC1 protein [Homo sapiens]
 gi|4521955|gb|AAD21771.1| putative WHSC1 protein [Homo sapiens]
 gi|5123789|emb|CAB45386.1| TRX5 protein [Homo sapiens]
 gi|6683809|gb|AAF23370.1| MMSET type II [Homo sapiens]
 gi|119602958|gb|EAW82552.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
 gi|119602959|gb|EAW82553.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
 gi|119602962|gb|EAW82556.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
 gi|168273154|dbj|BAG10416.1| histone-lysine N-methyltransferase NSD2 [synthetic construct]
 gi|187252511|gb|AAI66668.1| Wolf-Hirschhorn syndrome candidate 1 [synthetic construct]
          Length = 1365

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 1124

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200


>gi|357619110|gb|EHJ71815.1| putative huntingtin interacting protein [Danaus plexippus]
          Length = 225

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 54/157 (34%), Positives = 78/157 (49%), Gaps = 20/157 (12%)

Query: 1882 PDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAP 1941
            P   + A +KG GV    +   GE  F++E++GEV    +++++       Q  ++D   
Sbjct: 78   PLKVFYADKKGCGVEATTDITNGE--FLMEYVGEVLDYDQFYKRA------QAYSDDNNL 129

Query: 1942 EFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTV 2001
              Y + L   KGD       V+DA  K N +  I HSC PN E +   V+G  +IG ++ 
Sbjct: 130  HHYFMSL---KGD------TVIDATLKGNISRFINHSCEPNAETQKWTVNGELRIGFFSK 180

Query: 2002 RGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            R I  GEEITFDY      K    A  C CG++ CRG
Sbjct: 181  REISAGEEITFDYQFQRFGK---VAQRCYCGAENCRG 214


>gi|241998002|ref|XP_002433644.1| set domain protein, putative [Ixodes scapularis]
 gi|215495403|gb|EEC05044.1| set domain protein, putative [Ixodes scapularis]
          Length = 729

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 43/137 (31%), Positives = 72/137 (52%), Gaps = 19/137 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            DFV+E++GE+        +Q+  R L + + + +  FY + L+R +         ++DA 
Sbjct: 596  DFVMEYVGEI------INEQECERRLSRLHLEHSSNFYFLTLDRDR---------IIDAG 640

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             + N +  + HSC PNCE +   V+G  ++GI+ +R I  G E+TF+YN      E  + 
Sbjct: 641  PRGNLSRFMNHSCDPNCETQKWTVNGDTRVGIFAIRDIAPGTELTFNYNLDCRGNERIK- 699

Query: 2027 SVCLCGSQVCRGSYLNL 2043
              C CG+  C G Y+ L
Sbjct: 700  --CACGASNCSG-YMGL 713


>gi|10720313|sp|Q24742.1|TRX_DROVI RecName: Full=Histone-lysine N-methyltransferase trithorax
 gi|899254|emb|CAA90349.1| predicted trithorax protein [Drosophila virilis]
          Length = 3828

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 55/151 (36%), Positives = 72/151 (47%), Gaps = 23/151 (15%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C K+   GE   V+E+ GE+            IRS   +  +   +   I     
Sbjct: 3701 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3746

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            K D    D +VVDA  + N A  I HSC PNC +KV  + GH  I I+ +R I  GEE+T
Sbjct: 3747 KID----DNLVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 3802

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    E     E   C CGS+ CR  YLN
Sbjct: 3803 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3828


>gi|345791349|ref|XP_543382.3| PREDICTED: histone-lysine N-methyltransferase SETD1B [Canis lupus
            familiaris]
          Length = 1920

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1844 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSNQHINVNEEITYDYKFPIED 1903

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 1904 VK----IPCLCGSENCRGT 1918


>gi|452822785|gb|EME29801.1| myeloid/lymphoid or mixed-lineage leukemia protein 3 [Galdieria
            sulphuraria]
          Length = 969

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 51/141 (36%), Positives = 71/141 (50%), Gaps = 33/141 (23%)

Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----IYLERPKGDADGYD 1959
            +++FV+E+ GE+            IR +     D   +FY+      Y+ R   D     
Sbjct: 852  DEEFVIEYAGEL------------IRPVIA---DIREKFYDRRKIGCYMFRLNDD----- 891

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQ-IGIYTVRGIHYGEEITFDYNSVT 2018
              +VDA  K NYA  I HSC PNC +K+  VDG  Q IGI+  R I  GEE+T+DY    
Sbjct: 892  -FIVDATMKGNYARFINHSCEPNCRSKIITVDGDKQVIGIFAKRNIAAGEELTYDYQF-- 948

Query: 2019 ESKEEYEASV-CLCGSQVCRG 2038
               EE+  ++ C CG+  CRG
Sbjct: 949  ---EEFGETIPCNCGAPNCRG 966


>gi|426343599|ref|XP_004038381.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
            [Gorilla gorilla gorilla]
          Length = 1365

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 1124

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200


>gi|391863483|gb|EIT72791.1| histone H3 (Lys4) methyltransferase complex, subunit SET1
            [Aspergillus oryzae 3.042]
          Length = 1223

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1144 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1203

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1204 DSD-DRIPCLCGSTGCKG-FLN 1223


>gi|351698529|gb|EHB01448.1| Histone-lysine N-methyltransferase SETD1B [Heterocephalus glaber]
          Length = 1486

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1410 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1469

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 1470 VK----IPCLCGSENCRGT 1484


>gi|171692915|ref|XP_001911382.1| hypothetical protein [Podospora anserina S mat+]
 gi|170946406|emb|CAP73207.1| unnamed protein product [Podospora anserina S mat+]
          Length = 1083

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/84 (42%), Positives = 47/84 (55%), Gaps = 2/84 (2%)

Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            D  V+DA  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    
Sbjct: 1002 DNTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFER 1061

Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
            E     +   CLCG+  C+G +LN
Sbjct: 1062 EIGAT-DRIPCLCGTAACKG-FLN 1083


>gi|169769549|ref|XP_001819244.1| histone-lysine N-methyltransferase, H3 lysine-4 specific [Aspergillus
            oryzae RIB40]
 gi|121933328|sp|Q2UMH3.1|SET1_ASPOR RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|83767103|dbj|BAE57242.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 1229

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1150 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1209

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1210 DSD-DRIPCLCGSTGCKG-FLN 1229


>gi|294658913|ref|XP_461254.2| DEHA2F20834p [Debaryomyces hansenii CBS767]
 gi|218511781|sp|Q6BKL7.2|SET1_DEBHA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|202953480|emb|CAG89643.2| DEHA2F20834p [Debaryomyces hansenii CBS767]
          Length = 1088

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             VVDA  K   A  I H C P+C AK+  V+G  +I IY +R I   EE+T+DY    E+
Sbjct: 1009 TVVDATKKGGIARFINHCCNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFEKET 1068

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +  E   CLCG+  C+G YLN
Sbjct: 1069 NDA-ERIRCLCGAPGCKG-YLN 1088


>gi|114592860|ref|XP_001146084.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 6
            [Pan troglodytes]
 gi|114592864|ref|XP_001146248.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 7
            [Pan troglodytes]
 gi|114592866|ref|XP_001146323.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 8
            [Pan troglodytes]
 gi|114592870|ref|XP_001146473.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
            10 [Pan troglodytes]
 gi|397483594|ref|XP_003812984.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Pan
            paniscus]
 gi|410227780|gb|JAA11109.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
 gi|410259494|gb|JAA17713.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
 gi|410299310|gb|JAA28255.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
 gi|410334709|gb|JAA36301.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
          Length = 1365

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 1124

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200


>gi|390351134|ref|XP_003727587.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like
            [Strongylocentrotus purpuratus]
          Length = 282

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/140 (32%), Positives = 66/140 (47%), Gaps = 25/140 (17%)

Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYD 1959
               D+ V+E++GE             +R    ++ + A E   I   YL R         
Sbjct: 163  IAADEMVIEYVGE------------SVRQSIADSREKAYERMGIGSSYLFRIDA------ 204

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
            + ++DA    N A  I HSC PNC AK+  V+   +I IY+ + I+ G+EIT+DY    E
Sbjct: 205  VTIIDATKSGNLARFINHSCNPNCYAKIITVESEKKIVIYSKQTINVGDEITYDYKFPIE 264

Query: 2020 SKEEYEASVCLCGSQVCRGS 2039
                 E   CLCG+  CRG+
Sbjct: 265  D----EKISCLCGAAQCRGT 280


>gi|334327124|ref|XP_003340832.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            SETD1B-like, partial [Monodelphis domestica]
          Length = 1723

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 45/79 (56%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I   EEIT+DY    E 
Sbjct: 1647 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHISVNEEITYDYKFPIED 1706

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 1707 VK----IPCLCGSENCRGT 1721


>gi|300796853|ref|NP_001178481.1| probable histone-lysine N-methyltransferase NSD2 [Rattus norvegicus]
          Length = 1346

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1054 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1105

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1106 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1156

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1157 TFNYNLDCLGNEK---TVCRCGASNCSG 1181


>gi|47223666|emb|CAF99275.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1830

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1754 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSRQPINVNEEITYDYKFPIED 1813

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCG++ CRG+
Sbjct: 1814 VK----IPCLCGAENCRGT 1828


>gi|162318272|gb|AAI56161.1| Wolf-Hirschhorn syndrome candidate 1 (human) [synthetic construct]
 gi|162318442|gb|AAI56968.1| Wolf-Hirschhorn syndrome candidate 1 (human) [synthetic construct]
          Length = 1346

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1054 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1105

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1106 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1156

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1157 TFNYNLDCLGNEK---TVCRCGASNCSG 1181


>gi|50512437|gb|AAT77613.1| HSPC069 isoform b [Homo sapiens]
          Length = 1211

 Score = 70.5 bits (171), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 50/155 (32%), Positives = 73/155 (47%), Gaps = 21/155 (13%)

Query: 1887 VAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI 1946
            +  +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + 
Sbjct: 1053 LTEKKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHY 1101

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y    K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  
Sbjct: 1102 YFMALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPS 1155

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            G E+TFDY      K   EA  C CGS  CRG YL
Sbjct: 1156 GSELTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1186


>gi|410958014|ref|XP_003985618.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Felis
            catus]
          Length = 1300

 Score = 70.1 bits (170), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1008 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 1059

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1060 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1110

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1111 TFNYNLDCLGNEK---TVCRCGASNCSG 1135


>gi|158818|gb|AAA29025.1| zinc-binding protein [Drosophila melanogaster]
          Length = 3759

 Score = 70.1 bits (170), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 55/151 (36%), Positives = 71/151 (47%), Gaps = 23/151 (15%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C K+   GE   V+E+ GE+            IRS   +  +   +   I     
Sbjct: 3632 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3677

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            K D    D +VVDA  + N A  I H C PNC +KV  + GH  I I+ VR I  GEE+T
Sbjct: 3678 KID----DNLVVDATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFAVRRIVQGEELT 3733

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    E     E   C CGS+ CR  YLN
Sbjct: 3734 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3759


>gi|326919530|ref|XP_003206033.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
            [Meleagris gallopavo]
          Length = 1348

 Score = 70.1 bits (170), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1057 KGWGLVAKRDIKKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1108

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1109 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1159

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1160 TFNYNLDCLGNEK---TVCKCGAPNCSG 1184


>gi|255938628|ref|XP_002560084.1| Pc14g00900 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211584705|emb|CAP74231.1| Pc14g00900 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 1202

 Score = 70.1 bits (170), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1123 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1182

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1183 DSD-DRIPCLCGSTGCKG-FLN 1202


>gi|224050217|ref|XP_002195834.1| PREDICTED: histone-lysine N-methyltransferase NSD2 [Taeniopygia
            guttata]
          Length = 1339

 Score = 70.1 bits (170), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1077 KGWGLVAKRDIKKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1128

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1129 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1179

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1180 TFNYNLDCLGNEK---TVCKCGAPNCSG 1204


>gi|332025910|gb|EGI66066.1| Histone-lysine N-methyltransferase trithorax [Acromyrmex echinatior]
          Length = 3452

 Score = 70.1 bits (170), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 60/175 (34%), Positives = 80/175 (45%), Gaps = 23/175 (13%)

Query: 1868 MKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQD 1927
            M M   ILK         Y ++  G G+ C ++   GE   V+E+ GEV           
Sbjct: 3301 MAMRFRILKETSKASVGVYYSHIHGRGLFCLRDIEPGE--MVIEYAGEV----------- 3347

Query: 1928 GIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKV 1987
             IRS   +  +   +  NI     K D    D +VVDA  K N A  I HSC PNC ++V
Sbjct: 3348 -IRSSLTDKREKYYDSKNIGCYMFKID----DHLVVDATMKGNAARFINHSCEPNCYSRV 3402

Query: 1988 TAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
              + G   I I+ +R I  GEE+T+DY    E  +      C CGS+ CR  YLN
Sbjct: 3403 VDILGKKHILIFALRRIIQGEELTYDYKFPFEDIK----IPCTCGSRKCR-KYLN 3452


>gi|62088596|dbj|BAD92745.1| myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
            Drosophila) variant [Homo sapiens]
          Length = 2880

 Score = 70.1 bits (170), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 2751 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 2793

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 2794 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 2847

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 2848 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 2880


>gi|469800|emb|CAA83516.1| predicted trithorax protein [Drosophila melanogaster]
 gi|1052593|emb|CAA90513.1| trithorax protein trxII [Drosophila melanogaster]
 gi|1311653|gb|AAB35873.1| large trx isoform=trithorax gene product large isoform {alternatively
            spliced, exon II-containing isoform} [Drosophila,
            embryos, Peptide, 3726 aa]
          Length = 3726

 Score = 70.1 bits (170), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 55/151 (36%), Positives = 71/151 (47%), Gaps = 23/151 (15%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C K+   GE   V+E+ GE+            IRS   +  +   +   I     
Sbjct: 3599 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3644

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            K D    D +VVDA  + N A  I H C PNC +KV  + GH  I I+ VR I  GEE+T
Sbjct: 3645 KID----DNLVVDATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFAVRRIVQGEELT 3700

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    E     E   C CGS+ CR  YLN
Sbjct: 3701 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3726


>gi|469801|emb|CAA83515.1| predicted trithorax protein [Drosophila melanogaster]
 gi|1052594|emb|CAA90514.1| trithorax protein trxI [Drosophila melanogaster]
          Length = 3358

 Score = 70.1 bits (170), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 55/151 (36%), Positives = 71/151 (47%), Gaps = 23/151 (15%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C K+   GE   V+E+ GE+            IRS   +  +   +   I     
Sbjct: 3231 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3276

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            K D    D +VVDA  + N A  I H C PNC +KV  + GH  I I+ VR I  GEE+T
Sbjct: 3277 KID----DNLVVDATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFAVRRIVQGEELT 3332

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    E     E   C CGS+ CR  YLN
Sbjct: 3333 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3358


>gi|195156904|ref|XP_002019336.1| GL12290 [Drosophila persimilis]
 gi|194115927|gb|EDW37970.1| GL12290 [Drosophila persimilis]
          Length = 1548

 Score = 70.1 bits (170), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N A  I HSC PNC AKV  ++   +I IY+ + I   EEIT+DY    E 
Sbjct: 1472 TIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGVNEEITYDYKFPLED 1531

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
                E   CLCG+Q CRG+
Sbjct: 1532 ----EKIPCLCGAQGCRGT 1546


>gi|295424166|ref|NP_780440.2| histone-lysine N-methyltransferase NSD2 isoform 2 [Mus musculus]
 gi|118572947|sp|Q8BVE8.2|NSD2_MOUSE RecName: Full=Histone-lysine N-methyltransferase NSD2; AltName:
            Full=Multiple myeloma SET domain-containing protein;
            Short=MMSET; AltName: Full=Nuclear SET domain-containing
            protein 2; Short=NSD2; AltName: Full=Wolf-Hirschhorn
            syndrome candidate 1 protein homolog; Short=WHSC1
          Length = 1365

 Score = 70.1 bits (170), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1124

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200


>gi|198452207|ref|XP_002137435.1| GA27210, isoform A [Drosophila pseudoobscura pseudoobscura]
 gi|198131831|gb|EDY67993.1| GA27210, isoform A [Drosophila pseudoobscura pseudoobscura]
          Length = 3779

 Score = 70.1 bits (170), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 55/151 (36%), Positives = 72/151 (47%), Gaps = 23/151 (15%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C K+   GE   V+E+ GE+            IRS   +  +   +   I     
Sbjct: 3652 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3697

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            K D    D +VVDA  + N A  I HSC PNC +KV  + GH  I I+ +R I  GEE+T
Sbjct: 3698 KID----DNLVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 3753

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    E     E   C CGS+ CR  YLN
Sbjct: 3754 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3779


>gi|355786615|gb|EHH66798.1| hypothetical protein EGM_03852, partial [Macaca fascicularis]
          Length = 673

 Score = 70.1 bits (170), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 597  TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 656

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 657  VK----IPCLCGSENCRGT 671


>gi|295424164|ref|NP_001074571.2| histone-lysine N-methyltransferase NSD2 isoform 1 [Mus musculus]
          Length = 1366

 Score = 70.1 bits (170), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1074 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1125

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1126 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1176

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1177 TFNYNLDCLGNEK---TVCRCGASNCSG 1201


>gi|354483938|ref|XP_003504149.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
            [Cricetulus griseus]
          Length = 1365

 Score = 70.1 bits (170), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1124

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200


>gi|281339990|gb|EFB15574.1| hypothetical protein PANDA_004672 [Ailuropoda melanoleuca]
          Length = 1363

 Score = 70.1 bits (170), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1071 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1122

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1123 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1173

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1174 TFNYNLDCLGNEK---TVCRCGASNCSG 1198


>gi|147899914|ref|NP_001087630.1| histone-lysine N-methyltransferase SETD1B [Xenopus laevis]
 gi|82234463|sp|Q66J90.1|SET1B_XENLA RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
            Full=SET domain-containing protein 1B
 gi|51703454|gb|AAH81016.1| MGC81602 protein [Xenopus laevis]
          Length = 1938

 Score = 70.1 bits (170), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1862 TIIDATKCGNFARFINHSCNPNCYAKVVTVESQKKIVIYSKQYINVNEEITYDYKFPIED 1921

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCG++ CRG+
Sbjct: 1922 VK----IPCLCGAENCRGT 1936


>gi|350588548|ref|XP_003357368.2| PREDICTED: histone-lysine N-methyltransferase MLL [Sus scrofa]
          Length = 2525

 Score = 70.1 bits (170), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 2396 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 2438

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 2439 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 2492

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 2493 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 2525


>gi|67539250|ref|XP_663399.1| hypothetical protein AN5795.2 [Aspergillus nidulans FGSC A4]
 gi|74680884|sp|Q5B0Y5.1|SET1_EMENI RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|40743698|gb|EAA62888.1| hypothetical protein AN5795.2 [Aspergillus nidulans FGSC A4]
 gi|259484715|tpe|CBF81174.1| TPA: Histone-lysine N-methyltransferase, H3 lysine-4 specific (EC
            2.1.1.43)(COMPASS component SET1)(SET domain-containing
            protein 1) [Source:UniProtKB/Swiss-Prot;Acc:Q5B0Y5]
            [Aspergillus nidulans FGSC A4]
          Length = 1220

 Score = 70.1 bits (170), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1141 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1200

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1201 DSD-DRIPCLCGSAGCKG-FLN 1220


>gi|328778088|ref|XP_392252.4| PREDICTED: histone-lysine N-methyltransferase trithorax [Apis
            mellifera]
          Length = 3195

 Score = 70.1 bits (170), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 59/175 (33%), Positives = 81/175 (46%), Gaps = 23/175 (13%)

Query: 1868 MKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQD 1927
            M M   ILK         Y ++  G G+ C ++   GE   V+E+ GEV           
Sbjct: 3044 MAMRFRILKETSKESVGVYHSHIHGRGLFCLRDIEAGE--MVIEYAGEV----------- 3090

Query: 1928 GIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKV 1987
             IR+   +  +   +  NI     K D    D +VVDA  K N A  I HSC PNC ++V
Sbjct: 3091 -IRASLTDKREKYYDSKNIGCYMFKID----DHLVVDATMKGNAARFINHSCEPNCYSRV 3145

Query: 1988 TAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
              + G   I I+ +R I+ GEE+T+DY    E  +      C CGS+ CR  YLN
Sbjct: 3146 VDILGKKHILIFALRRINQGEELTYDYKFPFEDIK----IPCTCGSRRCR-KYLN 3195


>gi|395513793|ref|XP_003761107.1| PREDICTED: uncharacterized protein LOC100928096 [Sarcophilus
            harrisii]
          Length = 1224

 Score = 70.1 bits (170), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 46/80 (57%), Gaps = 4/80 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1148 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1207

Query: 2021 KEEYEASVCLCGSQVCRGSY 2040
             +      CLCGS+ CRG+ 
Sbjct: 1208 VK----IPCLCGSENCRGTL 1223


>gi|344244292|gb|EGW00396.1| putative histone-lysine N-methyltransferase NSD2 [Cricetulus griseus]
          Length = 1344

 Score = 70.1 bits (170), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1052 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1103

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1104 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1154

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1155 TFNYNLDCLGNEK---TVCRCGASNCSG 1179


>gi|157127309|ref|XP_001654916.1| hypothetical protein AaeL_AAEL010807 [Aedes aegypti]
 gi|108872954|gb|EAT37179.1| AAEL010807-PA [Aedes aegypti]
          Length = 1670

 Score = 70.1 bits (170), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N A  I HSC PNC AKV  ++   +I IY+ + I   EEIT+DY    E 
Sbjct: 1594 TIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQAIGINEEITYDYKFPLED 1653

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
                E   CLCG+Q CRG+
Sbjct: 1654 ----EKIPCLCGAQGCRGT 1668


>gi|260944792|ref|XP_002616694.1| hypothetical protein CLUG_03935 [Clavispora lusitaniae ATCC 42720]
 gi|238850343|gb|EEQ39807.1| hypothetical protein CLUG_03935 [Clavispora lusitaniae ATCC 42720]
          Length = 469

 Score = 70.1 bits (170), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C P+C AK+  VDG  +I IY +R I   EE+T+DY    E+
Sbjct: 390  TVIDATKKGGIARFINHCCNPSCTAKIIKVDGKKRIVIYALRDIEANEELTYDYKFERET 449

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +  E   CLCG+  C+G YLN
Sbjct: 450  NDA-ERIRCLCGAPGCKG-YLN 469


>gi|190344535|gb|EDK36223.2| hypothetical protein PGUG_00321 [Meyerozyma guilliermondii ATCC 6260]
          Length = 1055

 Score = 70.1 bits (170), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 35/81 (43%), Positives = 48/81 (59%), Gaps = 2/81 (2%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            V+DA  K   A  I H C P+C AK+  V+G  +I IY +R I   EE+T+DY    E+ 
Sbjct: 977  VIDATKKGGIARFINHCCNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERETN 1036

Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
            ++ E   CLCG+  C+G YLN
Sbjct: 1037 DD-ERIRCLCGAPGCKG-YLN 1055


>gi|383861703|ref|XP_003706324.1| PREDICTED: uncharacterized protein LOC100882965 [Megachile rotundata]
          Length = 3434

 Score = 70.1 bits (170), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 60/177 (33%), Positives = 82/177 (46%), Gaps = 27/177 (15%)

Query: 1868 MKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQD 1927
            M M   ILK         Y ++  G G+ C ++   GE   V+E+ GEV           
Sbjct: 3283 MAMRFRILKETSKESVGVYHSHIHGRGLFCLRDIEAGE--MVIEYAGEV----------- 3329

Query: 1928 GIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKV 1987
             IR+   +  +   +  NI     K D    D +VVDA  K N A  I HSC PNC ++V
Sbjct: 3330 -IRASLTDKREKYYDSKNIGCYMFKID----DHLVVDATMKGNAARFINHSCEPNCYSRV 3384

Query: 1988 TAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE--ASVCLCGSQVCRGSYLN 2042
              + G   I I+ +R I+ GEE+T+DY      K  +E     C CGS+ CR  YLN
Sbjct: 3385 VDILGKKHILIFALRRINQGEELTYDY------KFPFEDIKIPCTCGSRRCR-KYLN 3434


>gi|301606681|ref|XP_002932945.1| PREDICTED: histone-lysine N-methyltransferase MLL isoform 2 [Xenopus
            (Silurana) tropicalis]
          Length = 3840

 Score = 70.1 bits (170), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 51/157 (32%), Positives = 72/157 (45%), Gaps = 33/157 (21%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C +    GE   V+E+ G V            IRS+  +  +   ++Y+      
Sbjct: 3711 GRGLFCRRNIDAGE--MVIEYSGNV------------IRSILTDKRE---KYYD------ 3747

Query: 1952 KGDADGY------DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
             G   G       D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+
Sbjct: 3748 -GKGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVIPIDGQKHIVIFAMRKIY 3806

Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
             GEE+T+DY    E      A  C CG++ CR  +LN
Sbjct: 3807 RGEELTYDYKFPIEDANNKLA--CNCGTKKCR-KFLN 3840


>gi|301606679|ref|XP_002932944.1| PREDICTED: histone-lysine N-methyltransferase MLL isoform 1 [Xenopus
            (Silurana) tropicalis]
          Length = 3855

 Score = 70.1 bits (170), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 51/157 (32%), Positives = 72/157 (45%), Gaps = 33/157 (21%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C +    GE   V+E+ G V            IRS+  +  +   ++Y+      
Sbjct: 3726 GRGLFCRRNIDAGE--MVIEYSGNV------------IRSILTDKRE---KYYD------ 3762

Query: 1952 KGDADGY------DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
             G   G       D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+
Sbjct: 3763 -GKGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVIPIDGQKHIVIFAMRKIY 3821

Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
             GEE+T+DY    E      A  C CG++ CR  +LN
Sbjct: 3822 RGEELTYDYKFPIEDANNKLA--CNCGTKKCR-KFLN 3855


>gi|198454568|ref|XP_002137902.1| GA26260 [Drosophila pseudoobscura pseudoobscura]
 gi|198132853|gb|EDY68460.1| GA26260 [Drosophila pseudoobscura pseudoobscura]
          Length = 1755

 Score = 70.1 bits (170), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N A  I HSC PNC AKV  ++   +I IY+ + I   EEIT+DY    E 
Sbjct: 1679 TIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGVNEEITYDYKFPLED 1738

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
                E   CLCG+Q CRG+
Sbjct: 1739 ----EKIPCLCGAQGCRGT 1753


>gi|146422003|ref|XP_001486944.1| hypothetical protein PGUG_00321 [Meyerozyma guilliermondii ATCC 6260]
          Length = 1055

 Score = 70.1 bits (170), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 35/81 (43%), Positives = 48/81 (59%), Gaps = 2/81 (2%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            V+DA  K   A  I H C P+C AK+  V+G  +I IY +R I   EE+T+DY    E+ 
Sbjct: 977  VIDATKKGGIARFINHCCNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFERETN 1036

Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
            ++ E   CLCG+  C+G YLN
Sbjct: 1037 DD-ERIRCLCGAPGCKG-YLN 1055


>gi|6841376|gb|AAF29041.1|AF161554_1 HSPC069 [Homo sapiens]
          Length = 591

 Score = 70.1 bits (170), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 50/155 (32%), Positives = 73/155 (47%), Gaps = 21/155 (13%)

Query: 1887 VAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI 1946
            +  +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + 
Sbjct: 106  LTEKKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHY 154

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y    K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  
Sbjct: 155  YFMALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPS 208

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            G E+TFDY      K   EA  C CGS  CRG YL
Sbjct: 209  GSELTFDYQFQRYGK---EAQKCFCGSANCRG-YL 239


>gi|390178053|ref|XP_003736554.1| GA27210, isoform B [Drosophila pseudoobscura pseudoobscura]
 gi|388859306|gb|EIM52627.1| GA27210, isoform B [Drosophila pseudoobscura pseudoobscura]
          Length = 3474

 Score = 70.1 bits (170), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 55/151 (36%), Positives = 72/151 (47%), Gaps = 23/151 (15%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C K+   GE   V+E+ GE+            IRS   +  +   +   I     
Sbjct: 3347 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3392

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            K D    D +VVDA  + N A  I HSC PNC +KV  + GH  I I+ +R I  GEE+T
Sbjct: 3393 KID----DNLVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 3448

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    E     E   C CGS+ CR  YLN
Sbjct: 3449 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3474


>gi|355564772|gb|EHH21272.1| hypothetical protein EGK_04290, partial [Macaca mulatta]
          Length = 663

 Score = 70.1 bits (170), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 587  TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 646

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 647  VK----IPCLCGSENCRGT 661


>gi|348571627|ref|XP_003471597.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 2
            [Cavia porcellus]
          Length = 1367

 Score = 70.1 bits (170), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1074 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1125

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1126 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1176

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1177 TFNYNLDCLGNEK---TVCRCGASNCSG 1201


>gi|348571625|ref|XP_003471596.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
            [Cavia porcellus]
          Length = 1366

 Score = 70.1 bits (170), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1124

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200


>gi|332028801|gb|EGI68830.1| Putative histone-lysine N-methyltransferase NSD2 [Acromyrmex
            echinatior]
          Length = 1304

 Score = 70.1 bits (170), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 66/131 (50%), Gaps = 18/131 (13%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            FV+E++GE+       +  +  R L +  E     FY + ++  +          +DA  
Sbjct: 973  FVIEYVGEI------IDDAEYKRRLHRKKELKNENFYFLTIDNNR---------TIDAEP 1017

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
            K N +  + HSC PNCE +   V+G  +IG++ +R I  GEE+TF+YN  ++ +      
Sbjct: 1018 KGNLSRFMNHSCAPNCETQKWTVNGDTRIGLFALRDIESGEELTFNYNLASDGETR---K 1074

Query: 2028 VCLCGSQVCRG 2038
             CLCG+  C G
Sbjct: 1075 ACLCGAPNCSG 1085


>gi|358056897|dbj|GAA97247.1| hypothetical protein E5Q_03924 [Mixia osmundae IAM 14324]
          Length = 949

 Score = 70.1 bits (170), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 51/152 (33%), Positives = 69/152 (45%), Gaps = 28/152 (18%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPE----FYNI 1946
            KG GV   ++    +D FV E++GEV           G   LQK  +D   E    FY +
Sbjct: 279  KGFGVRAAED--MLKDAFVYEYIGEVV----------GAGQLQKRMKDYYEEGIEHFYFM 326

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
             L+R +          +DA  K N    + HSC PNC      V    ++GI+T R I  
Sbjct: 327  ALQREE---------FIDATKKGNKGRFLNHSCSPNCYVSKWVVGEKMRMGIFTKRKIQA 377

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            GEE+TF+YN     +  +EA  C CG   C G
Sbjct: 378  GEELTFNYNV---DRYGHEAQPCYCGEANCVG 406


>gi|301762334|ref|XP_002916587.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
            N-methyltransferase NSD2-like [Ailuropoda melanoleuca]
          Length = 1364

 Score = 70.1 bits (170), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1072 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1123

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1124 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1174

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1175 TFNYNLDCLGNEK---TVCRCGASNCSG 1199


>gi|195446231|ref|XP_002070688.1| GK10891 [Drosophila willistoni]
 gi|194166773|gb|EDW81674.1| GK10891 [Drosophila willistoni]
          Length = 447

 Score = 70.1 bits (170), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 55/151 (36%), Positives = 72/151 (47%), Gaps = 23/151 (15%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C K+   GE   V+E+ GE+            IRS   +  +   +   I     
Sbjct: 320  GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 365

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            K D    D +VVDA  + N A  I HSC PNC +KV  + GH  I I+ +R I  GEE+T
Sbjct: 366  KID----DNLVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 421

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    E     E   C CGS+ CR  YLN
Sbjct: 422  YDYKFPFEE----EKIPCSCGSKRCR-KYLN 447


>gi|47225482|emb|CAG11965.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1625

 Score = 70.1 bits (170), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 47/155 (30%), Positives = 73/155 (47%), Gaps = 20/155 (12%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            D  +   KG G+   K+     + FV+E+ GEV    K F+ +    +  KN       +
Sbjct: 295  DVILTENKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKTRVKEYARNKNIH-----Y 346

Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
            Y + L+  +         ++DA  K N +  + HSC PNCE +   V+G  ++G +T + 
Sbjct: 347  YFMSLKNNE---------IIDATLKGNLSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKA 397

Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +  G E+TFDY      K   EA  C CG+  CRG
Sbjct: 398  VTAGTELTFDYQFQRYGK---EAQKCFCGTPNCRG 429


>gi|18406465|ref|NP_566010.1| histone-lysine N-methyltransferase ASHH3 [Arabidopsis thaliana]
 gi|94707125|sp|Q945S8.2|ASHH3_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHH3; AltName:
            Full=ASH1 homolog 3; AltName: Full=Protein SET DOMAIN
            GROUP 7
 gi|15028059|gb|AAK76560.1| unknown protein [Arabidopsis thaliana]
 gi|20197070|gb|AAC23419.2| expressed protein [Arabidopsis thaliana]
 gi|20259301|gb|AAM14386.1| unknown protein [Arabidopsis thaliana]
 gi|330255289|gb|AEC10383.1| histone-lysine N-methyltransferase ASHH3 [Arabidopsis thaliana]
          Length = 363

 Score = 70.1 bits (170), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 57/187 (30%), Positives = 84/187 (44%), Gaps = 36/187 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+V  +E   GE  F++E++GEV       + +     L K        FY   + R 
Sbjct: 127  GSGIVAEEEIEAGE--FIIEYVGEV------IDDKTCEERLWKMKHRGETNFYLCEITRD 178

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
                     +V+DA HK N +  I HSC PN + +   +DG  +IGI+  RGI  GE +T
Sbjct: 179  ---------MVIDATHKGNKSRYINHSCNPNTQMQKWIIDGETRIGIFATRGIKKGEHLT 229

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCR------GSYLNLTGEGAFEKVLKEL--------- 2056
            +DY  V    ++     C CG+  CR       S   +  + AF  V  EL         
Sbjct: 230  YDYQFVQFGADQD----CHCGAVGCRRKLGVKPSKPKIASDEAFNLVAHELAQTLPKVHQ 285

Query: 2057 HGLLDRH 2063
            +GL++RH
Sbjct: 286  NGLVNRH 292


>gi|307111585|gb|EFN59819.1| hypothetical protein CHLNCDRAFT_18588, partial [Chlorella variabilis]
          Length = 380

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 52/160 (32%), Positives = 73/160 (45%), Gaps = 39/160 (24%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVY----------PVWKWFEKQDGIRSLQKNNEDPA 1940
            KG G+   ++   G+  F++E+LGEV             WK +  + G R          
Sbjct: 183  KGFGLFAAEDMKAGQ--FLIEYLGEVLEEEEYHRRQGAAWKEYFIETGQRHYY------- 233

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              F N+      G+ +     V+DA  + N    I HSC PNCE +   V G   IG++T
Sbjct: 234  --FMNV------GNGE-----VIDASRRGNLGRFINHSCEPNCETQKWVVHGELAIGLFT 280

Query: 2001 VRGIHYGEEITFDYNSVTESKEEY--EASVCLCGSQVCRG 2038
            +  I  G E+TFDYN      E Y  +   CLCGS+ CRG
Sbjct: 281  LEDISAGTELTFDYNF-----ERYGDKPMKCLCGSKNCRG 315


>gi|119185079|ref|XP_001243361.1| hypothetical protein CIMG_07257 [Coccidioides immitis RS]
 gi|121936913|sp|Q1DR06.1|SET1_COCIM RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|392866240|gb|EAS28850.2| histone-lysine N-methyltransferase, H3 lysine-4 specific
            [Coccidioides immitis RS]
          Length = 1271

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1192 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREW 1251

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1252 DSD-DRIPCLCGSAGCKG-FLN 1271


>gi|444724926|gb|ELW65512.1| Histone-lysine N-methyltransferase SETD1B [Tupaia chinensis]
          Length = 1554

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 45/79 (56%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I   EEIT+DY    E 
Sbjct: 1478 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHISVNEEITYDYKFPIED 1537

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 1538 IK----IPCLCGSENCRGT 1552


>gi|348573849|ref|XP_003472703.1| PREDICTED: histone-lysine N-methyltransferase MLL-like, partial
            [Cavia porcellus]
          Length = 2799

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 2670 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 2712

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 2713 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 2766

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 2767 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 2799


>gi|320032561|gb|EFW14513.1| histone-lysine N-methyltransferase [Coccidioides posadasii str.
            Silveira]
          Length = 1271

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1192 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREW 1251

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1252 DSD-DRIPCLCGSAGCKG-FLN 1271


>gi|303313714|ref|XP_003066866.1| SET domain containing protein [Coccidioides posadasii C735 delta
            SOWgp]
 gi|240106533|gb|EER24721.1| SET domain containing protein [Coccidioides posadasii C735 delta
            SOWgp]
          Length = 1271

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1192 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREW 1251

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1252 DSD-DRIPCLCGSAGCKG-FLN 1271


>gi|149047443|gb|EDM00113.1| similar to Wolf-Hirschhorn syndrome candidate 1 protein isoform 3
            (predicted) [Rattus norvegicus]
          Length = 1298

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1006 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1057

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1058 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1108

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1109 TFNYNLDCLGNEK---TVCRCGASNCSG 1133


>gi|345798392|ref|XP_536224.3| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
            N-methyltransferase NSD2 [Canis lupus familiaris]
          Length = 1364

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1072 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1123

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1124 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1174

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1175 TFNYNLDCLGNEK---TVCRCGASNCSG 1199


>gi|389746109|gb|EIM87289.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 191

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 52/139 (37%), Positives = 68/139 (48%), Gaps = 26/139 (18%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVV 1963
            + V+E++GEV            IR+   +  + A E   I   YL R   D      +VV
Sbjct: 76   EMVIEYVGEV------------IRAQIADKREKAYERQGIGSSYLFRIDED------LVV 117

Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
            DA  K N    I HSC PNC AK+  + G  +I IY  + I  G+EIT+DY+   E    
Sbjct: 118  DATKKGNLGRLINHSCDPNCTAKIITILGEKKIVIYAKQDIELGDEITYDYHFPIEQ--- 174

Query: 2024 YEASVCLCGSQVCRGSYLN 2042
             +   CLCGS  CRG YLN
Sbjct: 175  -DKIPCLCGSARCRG-YLN 191


>gi|195356446|ref|XP_002044683.1| GM18767 [Drosophila sechellia]
 gi|194133849|gb|EDW55365.1| GM18767 [Drosophila sechellia]
          Length = 1637

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N A  I HSC PNC AKV  ++   +I IY+ + I   EEIT+DY    E 
Sbjct: 1561 TIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEE 1620

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
                E   CLCG+Q CRG+
Sbjct: 1621 ----EKIPCLCGAQGCRGT 1635


>gi|350587283|ref|XP_003128857.3| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Sus
            scrofa]
          Length = 1338

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 78/148 (52%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  +++  E     FY + +++
Sbjct: 1046 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIRRAQEHDITRFYMLTIDK 1097

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1098 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1148

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1149 TFNYNLDCLGNEK---TVCRCGASNCSG 1173


>gi|195453659|ref|XP_002073883.1| GK12911 [Drosophila willistoni]
 gi|194169968|gb|EDW84869.1| GK12911 [Drosophila willistoni]
          Length = 1765

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N A  I HSC PNC AKV  ++   +I IY+ + I   EEIT+DY    E 
Sbjct: 1689 TIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGVNEEITYDYKFPLED 1748

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
                E   CLCG+Q CRG+
Sbjct: 1749 ----EKIPCLCGAQGCRGT 1763


>gi|15213542|gb|AAK92049.1|AF322907_1 NSD1 [Homo sapiens]
          Length = 2596

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 71/132 (53%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++ +  ++  +E+    FY + +++ +         ++DA 
Sbjct: 1863 EFVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDKDR---------IIDAG 1907

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+TF+YN      E+   
Sbjct: 1908 PKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK--- 1964

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1965 TVCRCGASNCSG 1976


>gi|242073096|ref|XP_002446484.1| hypothetical protein SORBIDRAFT_06g016720 [Sorghum bicolor]
 gi|241937667|gb|EES10812.1| hypothetical protein SORBIDRAFT_06g016720 [Sorghum bicolor]
          Length = 521

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 49/151 (32%), Positives = 73/151 (48%), Gaps = 26/151 (17%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G+V ++    G+  FV+E+ GEV     W E +   R  Q        + Y IYL  
Sbjct: 95   RGWGLVADENIMAGQ--FVIEYCGEVI---SWKESK---RRAQAYETQGLKDAYIIYLNA 146

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +          +DA  K N+A  I HSC+PNCE +   V G  ++GI+  + I +G E+
Sbjct: 147  DES---------IDATRKGNFARFINHSCQPNCETRKWNVLGEVRVGIFAKQDIPFGTEL 197

Query: 2011 TFDYNSVTESKEEYEASV---CLCGSQVCRG 2038
            ++DYN       E+   V   CLCG+  C G
Sbjct: 198  SYDYNF------EWYGGVMVRCLCGAASCSG 222


>gi|149756942|ref|XP_001488967.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
            [Equus caballus]
          Length = 1365

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1124

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200


>gi|121709862|ref|XP_001272547.1| SET domain protein [Aspergillus clavatus NRRL 1]
 gi|119400697|gb|EAW11121.1| SET domain protein [Aspergillus clavatus NRRL 1]
          Length = 1232

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1153 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1212

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1213 DSD-DRIPCLCGSTGCKG-FLN 1232


>gi|323348281|gb|EGA82530.1| Set1p [Saccharomyces cerevisiae Lalvin QA23]
          Length = 980

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C PNC AK+  V G  +I IY +R I   EE+T+DY    E 
Sbjct: 901  TVIDATKKGGIARFINHCCNPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFERE- 959

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
            K++ E   CLCG+  C+G +LN
Sbjct: 960  KDDEERLPCLCGAPNCKG-FLN 980


>gi|195453973|ref|XP_002074027.1| GK14418 [Drosophila willistoni]
 gi|194170112|gb|EDW85013.1| GK14418 [Drosophila willistoni]
          Length = 1420

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 50/169 (29%), Positives = 84/169 (49%), Gaps = 20/169 (11%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G+VC +     E DFV+E++GEV    + F+K    R LQK   D    +Y + +E+
Sbjct: 1206 RGFGLVCRE--AIAEGDFVIEYVGEVINHAE-FQK----RMLQKQ-RDRDENYYFLGVEK 1257

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
                       ++DA  K N A  + HSC PNCE +  +V+  +++G++ ++ I    E+
Sbjct: 1258 D---------FIIDAGPKGNLARFMNHSCEPNCETQKWSVNCIHRVGLFAIKDIPANTEL 1308

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRGSY-LNLTGEGAFEKVLKELHG 2058
            TF+Y  + +         C CG++ C G     L  +G  E    +L+G
Sbjct: 1309 TFNY--LWDDLMNNGKKACYCGAERCSGQIGGKLKDQGLKETTSAQLNG 1355


>gi|170095481|ref|XP_001878961.1| histone methyltransferase [Laccaria bicolor S238N-H82]
 gi|164646265|gb|EDR10511.1| histone methyltransferase [Laccaria bicolor S238N-H82]
          Length = 144

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 50/139 (35%), Positives = 67/139 (48%), Gaps = 26/139 (18%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVV 1963
            + V+E++GEV            IR+      +   E   I   YL R   D      +VV
Sbjct: 29   EMVIEYVGEV------------IRAQVAEKREKTYERQGIGSSYLFRIDED------LVV 70

Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
            DA  K N    I HSC PNC AK+  + G  +I IY  + I  G+EIT+DY+   E    
Sbjct: 71   DATKKGNLGRLINHSCDPNCTAKIITISGEKKIVIYAKQDIELGDEITYDYHFPFEQ--- 127

Query: 2024 YEASVCLCGSQVCRGSYLN 2042
             +  +CLCGS  CRG +LN
Sbjct: 128  -DKILCLCGSVKCRG-FLN 144


>gi|148705490|gb|EDL37437.1| mCG16344 [Mus musculus]
          Length = 1298

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1006 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1057

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1058 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1108

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1109 TFNYNLDCLGNEK---TVCRCGASNCSG 1133


>gi|431897323|gb|ELK06585.1| Putative histone-lysine N-methyltransferase NSD2 [Pteropus alecto]
          Length = 502

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++ + +  +++ +E+    FY + +++
Sbjct: 143  KGWGLVAKRDIRKGE--FVNEYVGEL------IDEDECMARIKRAHENDITHFYMLTIDK 194

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 195  DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 245

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 246  TFNYNLDCLGNEK---TVCRCGASNCSG 270


>gi|326674803|ref|XP_003200208.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Danio
            rerio]
          Length = 1428

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 31/77 (40%), Positives = 43/77 (55%), Gaps = 3/77 (3%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            ++DA  K N +  + HSC PNCE +   V+G  +IG +T + +  G E+TFDY      K
Sbjct: 645  IIDATLKGNCSRFMNHSCEPNCETQKWTVNGQLRIGFFTTKAVTAGTELTFDYQFQRYGK 704

Query: 2022 EEYEASVCLCGSQVCRG 2038
               EA  C CG+  CRG
Sbjct: 705  ---EAQKCFCGAPSCRG 718


>gi|195496958|ref|XP_002095897.1| GE25383 [Drosophila yakuba]
 gi|194181998|gb|EDW95609.1| GE25383 [Drosophila yakuba]
          Length = 1628

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N A  I HSC PNC AKV  ++   +I IY+ + I   EEIT+DY    E 
Sbjct: 1552 TIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEE 1611

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
                E   CLCG+Q CRG+
Sbjct: 1612 ----EKIPCLCGAQGCRGT 1626


>gi|50312247|ref|XP_456155.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|74636430|sp|Q6CIT4.1|SET1_KLULA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|49645291|emb|CAG98863.1| KLLA0F24134p [Kluyveromyces lactis]
          Length = 1000

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 48/82 (58%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I H C P+C AK+  VDG  +I IY +R I   EE+T+DY    E+
Sbjct: 921  TVIDATKRGGIARFINHCCEPSCTAKIIKVDGRKRIVIYALRDIGTNEELTYDYKFERET 980

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +E E   CLCG+  C+G +LN
Sbjct: 981  -DEGERLPCLCGAPSCKG-FLN 1000


>gi|259146872|emb|CAY80128.1| Set1p [Saccharomyces cerevisiae EC1118]
          Length = 1080

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C PNC AK+  V G  +I IY +R I   EE+T+DY    E 
Sbjct: 1001 TVIDATKKGGIARFINHCCNPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREK 1060

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +E E   CLCG+  C+G +LN
Sbjct: 1061 DDE-ERLPCLCGAPNCKG-FLN 1080


>gi|122937787|gb|ABM68621.1| AAEL000054-PA [Aedes aegypti]
          Length = 3489

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 54/157 (34%), Positives = 75/157 (47%), Gaps = 23/157 (14%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            Y ++  G G+ CN++   GE   V+E+ GE+            IRS   +  +   +   
Sbjct: 3356 YRSHIHGRGLFCNRDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRG 3401

Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
            I     K D    +  VVDA  + N A  I HSC PNC +KV  + GH  I I+ +R I 
Sbjct: 3402 IGCYMFKID----EHFVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIV 3457

Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
             GEE+T+DY    E  +      C CGS+ CR  YLN
Sbjct: 3458 QGEELTYDYKFPFEDVK----IPCSCGSKKCR-KYLN 3489


>gi|328875054|gb|EGG23419.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 1359

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 35/81 (43%), Positives = 44/81 (54%), Gaps = 4/81 (4%)

Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            D  ++DA  K N A  I H C PNC AKV  + G  +I IY  R I+ GEE+T+DY    
Sbjct: 1281 DDTIIDATFKGNQARFINHCCDPNCMAKVITMGGQKKIIIYAKRDINVGEELTYDYKFPI 1340

Query: 2019 ESKEEYEASVCLCGSQVCRGS 2039
            E  +      CLC S  CRG+
Sbjct: 1341 EDVK----IPCLCKSAKCRGT 1357


>gi|118404602|ref|NP_001072649.1| histone-lysine N-methyltransferase SETD1B [Xenopus (Silurana)
            tropicalis]
 gi|123884540|sp|Q08D57.1|SET1B_XENTR RecName: Full=Histone-lysine N-methyltransferase SETD1B; AltName:
            Full=SET domain-containing protein 1B
 gi|115312893|gb|AAI23933.1| hypothetical protein MGC145850 [Xenopus (Silurana) tropicalis]
          Length = 1956

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1880 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQYINVNEEITYDYKFPIED 1939

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCG++ CRG+
Sbjct: 1940 VK----IPCLCGAENCRGT 1954


>gi|302666919|ref|XP_003025054.1| hypothetical protein TRV_00712 [Trichophyton verrucosum HKI 0517]
 gi|291189136|gb|EFE44443.1| hypothetical protein TRV_00712 [Trichophyton verrucosum HKI 0517]
          Length = 1376

 Score = 69.7 bits (169), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA      A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1297 TVIDATKHGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1356

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1357 DSD-DRIPCLCGSTGCKG-FLN 1376


>gi|194898301|ref|XP_001978769.1| GG11901 [Drosophila erecta]
 gi|190650472|gb|EDV47727.1| GG11901 [Drosophila erecta]
          Length = 1626

 Score = 69.3 bits (168), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N A  I HSC PNC AKV  ++   +I IY+ + I   EEIT+DY    E 
Sbjct: 1550 TIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEE 1609

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
                E   CLCG+Q CRG+
Sbjct: 1610 ----EKIPCLCGAQGCRGT 1624


>gi|348527268|ref|XP_003451141.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Oreochromis
            niloticus]
          Length = 1605

 Score = 69.3 bits (168), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 45/148 (30%), Positives = 73/148 (49%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G+  N+     + DFV E++GEV       + ++  + +++ +E+    FY + L +
Sbjct: 1315 RGWGLRTNQ--ALKKGDFVTEYVGEV------IDSEECQQRIKRAHENHVTNFYMLTLTK 1366

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         V+DA  K N +  I HSC PNCE +   V+G  +IGI+ +  I  G E+
Sbjct: 1367 DR---------VIDAGPKGNSSRFINHSCSPNCETQKWTVNGDVRIGIFALCDIEAGTEL 1417

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN           + C CGS  C G
Sbjct: 1418 TFNYNLHCVGNRR---TSCHCGSDNCSG 1442


>gi|195388606|ref|XP_002052970.1| GJ23622 [Drosophila virilis]
 gi|194151056|gb|EDW66490.1| GJ23622 [Drosophila virilis]
          Length = 1687

 Score = 69.3 bits (168), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N A  I HSC PNC AKV  ++   +I IY+ + I   EEIT+DY    E 
Sbjct: 1611 TIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEE 1670

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
                E   CLCG+Q CRG+
Sbjct: 1671 ----EKIPCLCGAQGCRGT 1685


>gi|302839691|ref|XP_002951402.1| histone H3 Lys 36 methyltransferase/ASH1 [Volvox carteri f.
            nagariensis]
 gi|300263377|gb|EFJ47578.1| histone H3 Lys 36 methyltransferase/ASH1 [Volvox carteri f.
            nagariensis]
          Length = 2345

 Score = 69.3 bits (168), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 44/137 (32%), Positives = 69/137 (50%), Gaps = 19/137 (13%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            F++E+ GEV       + ++  R ++    +  P FY + L      A G   + +DA  
Sbjct: 1595 FIIEYAGEV------IDDRELGRRMEHARMNGEPHFYIMEL------AAG---LYIDARR 1639

Query: 1968 KANYASRICHSCRPNCEAKV--TAVDGHYQIGIYTVRGIHYGEEITFDY--NSVTESKEE 2023
            K N A  I  SC PNCE +    A  G  ++GI+  R I  GEE+ +DY  ++    K+ 
Sbjct: 1640 KGNIARLINSSCDPNCETQKWHDASTGEIRVGIFASRDIPPGEELVYDYFFSTYGAIKQS 1699

Query: 2024 YEASVCLCGSQVCRGSY 2040
              + VC+CGS+ CRG+ 
Sbjct: 1700 AASFVCMCGSKNCRGTM 1716


>gi|367010698|ref|XP_003679850.1| hypothetical protein TDEL_0B05100 [Torulaspora delbrueckii]
 gi|359747508|emb|CCE90639.1| hypothetical protein TDEL_0B05100 [Torulaspora delbrueckii]
          Length = 1019

 Score = 69.3 bits (168), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C P+C AK+  V G  +I IY +R I   EE+T+DY    E+
Sbjct: 940  TVIDATKKGGIARFINHCCDPSCTAKIIKVGGKKRIVIYALRDIAANEELTYDYKFERET 999

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +E E   CLCG+  C+G +LN
Sbjct: 1000 DDE-ERLPCLCGAPTCKG-FLN 1019


>gi|320580861|gb|EFW95083.1| histone-lysine n-methyltransferase, h3 lysine-4 specific, putative
            [Ogataea parapolymorpha DL-1]
          Length = 658

 Score = 69.3 bits (168), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 48/82 (58%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C P+C AK+  V+G  +I IY +R I   EE+T+DY    E+
Sbjct: 579  TVIDASKKGGIARFINHCCVPSCTAKIIKVEGKKRIVIYALRDIAANEELTYDYKFERET 638

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +E E   CLCG+  C+G YLN
Sbjct: 639  NDE-ERIPCLCGAPGCKG-YLN 658


>gi|380019005|ref|XP_003693408.1| PREDICTED: uncharacterized protein LOC100869667 [Apis florea]
          Length = 1392

 Score = 69.3 bits (168), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 74/149 (49%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+    +   GE  F++E++GEV       + +D  R  ++ ++D    +Y + L 
Sbjct: 447  KKGFGLRAMVDLLAGE--FIMEYVGEV------VDPKDFRRRAKEYSKDKNKHYYFMAL- 497

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  I HSC PN E +   V+G  +IG +  + I  GEE
Sbjct: 498  --KSDQ------IIDATMKGNVSRFINHSCDPNSETQKWTVNGELRIGFFNKKFIAAGEE 549

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            ITFDY+     K   EA  C C +  CRG
Sbjct: 550  ITFDYHFQRYGK---EAQKCFCEAPNCRG 575


>gi|328790605|ref|XP_003251435.1| PREDICTED: hypothetical protein LOC100578450 [Apis mellifera]
          Length = 1394

 Score = 69.3 bits (168), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 74/149 (49%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+    +   GE  F++E++GEV       + +D  R  ++ ++D    +Y + L 
Sbjct: 447  KKGFGLRAMVDLLAGE--FIMEYVGEV------VDPKDFRRRAKEYSKDKNKHYYFMAL- 497

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  I HSC PN E +   V+G  +IG +  + I  GEE
Sbjct: 498  --KSDQ------IIDATMKGNVSRFINHSCDPNSETQKWTVNGELRIGFFNKKFIAAGEE 549

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            ITFDY+     K   EA  C C +  CRG
Sbjct: 550  ITFDYHFQRYGK---EAQKCFCEAPNCRG 575


>gi|327304525|ref|XP_003236954.1| histone-lysine N-methyltransferase [Trichophyton rubrum CBS 118892]
 gi|326459952|gb|EGD85405.1| histone-lysine N-methyltransferase [Trichophyton rubrum CBS 118892]
          Length = 1337

 Score = 69.3 bits (168), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA      A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1258 TVIDATKHGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1317

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1318 DSD-DRIPCLCGSTGCKG-FLN 1337


>gi|405967140|gb|EKC32340.1| Histone-lysine N-methyltransferase SETD1B-A [Crassostrea gigas]
          Length = 1401

 Score = 69.3 bits (168), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 42/79 (53%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N A  I H C PNC AK+  V+   +I IY+ R I   EEIT+DY    E 
Sbjct: 1325 TIIDATKCGNLARFINHCCNPNCYAKIITVESQKKIVIYSKRDIDVNEEITYDYKFPIED 1384

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
                E   CLCG+  CRG+
Sbjct: 1385 ----EKIPCLCGAPNCRGT 1399


>gi|328858772|gb|EGG07883.1| hypothetical protein MELLADRAFT_74594 [Melampsora larici-populina
            98AG31]
          Length = 191

 Score = 69.3 bits (168), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 47/136 (34%), Positives = 66/136 (48%), Gaps = 25/136 (18%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVV 1963
            + V+E++GEV            IR    +  + A E   I   YL R   D      +VV
Sbjct: 76   EMVIEYVGEV------------IRQAVADRREKAYERMGIGSSYLFRVDDD------LVV 117

Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
            DA  K N    I H C PNC AK+  ++G  +I IY    I  G+E+T+DY+     KE+
Sbjct: 118  DATKKGNLGRLINHCCAPNCTAKIITINGEKKIVIYAKATIELGDEVTYDYHF---PKED 174

Query: 2024 YEASVCLCGSQVCRGS 2039
             +   CLCGS  C+G+
Sbjct: 175  VKIP-CLCGSSKCKGT 189


>gi|145548702|ref|XP_001460031.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124427859|emb|CAK92634.1| unnamed protein product [Paramecium tetraurelia]
          Length = 672

 Score = 69.3 bits (168), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 47/156 (30%), Positives = 71/156 (45%), Gaps = 22/156 (14%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGI-RSLQKNNEDPAPEFYNIYLE 1949
            KG G+VC +  GF  ++F+  + GEVY   +WFEKQ    + +Q  N     +  + Y+E
Sbjct: 119  KGKGMVCCQGEGFATNEFICFYFGEVYSPQRWFEKQTIFNKRMQDGNRKTCSQ--SPYVE 176

Query: 1950 RPKGDADGYDLVV--------VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTV 2001
                D    DL+V        +D     N A  I +SC PNC      V+    + + T 
Sbjct: 177  FFIND----DLLVMFKKYFQFIDPTRYGNMAQHISYSCDPNCRLVTVIVNQQNLLAVMTA 232

Query: 2002 RGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
            + I+Y EE+T  +      +       CLCGS  C+
Sbjct: 233  KKINYLEELTLPFPLTCMDQ-------CLCGSLHCK 261



 Score = 59.7 bits (143), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 62/283 (21%), Positives = 120/283 (42%), Gaps = 49/283 (17%)

Query: 2154 YNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLWKGEGSLVEELIQCM 2213
            + Q   N+   +DKV++ ++ +    K   PP+  ++        WK  GS  +++    
Sbjct: 314  HQQNYINIFSCVDKVKFALQHL----KTVQPPIFLVT--NIFDQFWKNYGSNTQKI---- 363

Query: 2214 APHVEEDVLND----LKSKIQAHDPSGSEDIQRELRKSL---------------LWLRDE 2254
               +E  ++N+    LK   Q H  S   +I +++++ +               L L + 
Sbjct: 364  --QLESSIINEIVIFLKRHSQQHQCSIGLEIIKQMKQIIDQNSIYALELTRMLFLLLSEI 421

Query: 2255 VRNL-PCTYKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKLGA 2313
            + N+  C++   + A A +++  ++T  +F   +Y+ F S P   +  +  P+  +K   
Sbjct: 422  ILNIESCSFN--NKAFATILYFMSFTHTYFSSTQYQGFDSKPFEENEFEYIPQPKNKSKL 479

Query: 2314 DLQVYRKTYGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHR 2373
             L    K Y   +  GQLI W+ QT  +P  ++A+  RG L  P   S         +  
Sbjct: 480  ALS---KQYTPQFIWGQLINWNKQTLQNPQSSMAQERRGVLCYP---SLLLSFDNKHKTF 533

Query: 2374 VYGPKT----VRFMLSRMEKQPQRPWPKDRIWAFKSSPRIFGS 2412
             Y  KT    + +  S+ E QP         W++K+   I+G+
Sbjct: 534  PYQCKTREIYLEYFQSKKEIQPDL-----STWSYKNQHNIYGT 571


>gi|189238620|ref|XP_969339.2| PREDICTED: similar to CG40351 CG40351-PC [Tribolium castaneum]
 gi|270009170|gb|EFA05618.1| hypothetical protein TcasGA2_TC015826 [Tribolium castaneum]
          Length = 1268

 Score = 69.3 bits (168), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 33/78 (42%), Positives = 43/78 (55%), Gaps = 4/78 (5%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            ++DA    N A  I HSC PNC AKV  ++   +I IY+ + I   EEIT+DY    E  
Sbjct: 1193 IIDATKCGNLARFINHSCNPNCYAKVITIESQKKIVIYSKQSIGVNEEITYDYKFPIED- 1251

Query: 2022 EEYEASVCLCGSQVCRGS 2039
               E   CLCG+  CRG+
Sbjct: 1252 ---EKIPCLCGAATCRGT 1266


>gi|150866258|ref|XP_001385792.2| histone methyltransferase involved in gene regulation
            [Scheffersomyces stipitis CBS 6054]
 gi|149387514|gb|ABN67763.2| histone methyltransferase involved in gene regulation
            [Scheffersomyces stipitis CBS 6054]
          Length = 1055

 Score = 69.3 bits (168), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 46/82 (56%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C P+C AK+  VD   +I IY +R I   EE+T+DY    E+
Sbjct: 976  TVIDATKKGGIARFINHCCSPSCTAKIIKVDNQKRIVIYALRDIDANEELTYDYKFERET 1035

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +  E   CLCG+  C+G YLN
Sbjct: 1036 NDA-ERIRCLCGAPGCKG-YLN 1055


>gi|350413847|ref|XP_003490133.1| PREDICTED: hypothetical protein LOC100748492 [Bombus impatiens]
          Length = 3522

 Score = 69.3 bits (168), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 65/198 (32%), Positives = 88/198 (44%), Gaps = 28/198 (14%)

Query: 1847 PLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGED 1906
            P    I E E   V   ++  M M   ILK         Y ++  G G+ C ++   GE 
Sbjct: 3351 PKMIAISEAESRRVASTNL-PMAMRFRILKETSKESVGVYHSHIHGRGLFCLRDIEAGE- 3408

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
              V+E+ GEV            IR+   +  +   +  NI     K D    D +VVDA 
Sbjct: 3409 -MVIEYAGEV------------IRASLTDKREKYYDSKNIGCYMFKID----DHLVVDAT 3451

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE- 2025
             K N A  I HSC PNC ++V  + G   I I+ +R I  GEE+T+DY      K  +E 
Sbjct: 3452 MKGNAARFINHSCEPNCYSRVVDILGKKHILIFALRRIIQGEELTYDY------KFPFED 3505

Query: 2026 -ASVCLCGSQVCRGSYLN 2042
                C CGS+ CR  YLN
Sbjct: 3506 IKIPCTCGSRRCR-KYLN 3522


>gi|358333784|dbj|GAA31138.2| histone-lysine N-methyltransferase SETD1B [Clonorchis sinensis]
          Length = 1685

 Score = 69.3 bits (168), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 46/82 (56%), Gaps = 4/82 (4%)

Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            D  V+DA    N    I HSC+PNC AK+  V+G  +I IY+ R I+  EEIT+DY    
Sbjct: 1607 DDFVIDATMCGNNGRFINHSCQPNCYAKIITVEGKKKIVIYSKRDINVMEEITYDY---- 1662

Query: 2019 ESKEEYEASVCLCGSQVCRGSY 2040
            +   E E   C CG+  CRG+ 
Sbjct: 1663 KFPYEEEKIPCQCGASTCRGTL 1684


>gi|444722051|gb|ELW62755.1| putative histone-lysine N-methyltransferase NSD2 [Tupaia chinensis]
          Length = 1421

 Score = 69.3 bits (168), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 43/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 916  KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 967

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ +  I  G E+
Sbjct: 968  DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFALCDIPAGTEL 1018

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1019 TFNYNLDCLGNEK---TVCRCGASNCSG 1043


>gi|195062427|ref|XP_001996188.1| GH22347 [Drosophila grimshawi]
 gi|193899683|gb|EDV98549.1| GH22347 [Drosophila grimshawi]
          Length = 1714

 Score = 69.3 bits (168), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N A  I HSC PNC AKV  ++   +I IY+ + I   EEIT+DY    E 
Sbjct: 1638 TIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLEE 1697

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
                E   CLCG+Q CRG+
Sbjct: 1698 ----EKIPCLCGAQGCRGT 1712


>gi|62862148|ref|NP_001015221.1| Set1, isoform A [Drosophila melanogaster]
 gi|62862150|ref|NP_001015222.1| Set1, isoform B [Drosophila melanogaster]
 gi|161076059|ref|NP_001104406.1| Set1, isoform C [Drosophila melanogaster]
 gi|281366745|ref|NP_001163846.1| Set1, isoform D [Drosophila melanogaster]
 gi|281366747|ref|NP_001163847.1| Set1, isoform E [Drosophila melanogaster]
 gi|281366749|ref|NP_001163848.1| Set1, isoform F [Drosophila melanogaster]
 gi|281366751|ref|NP_001163849.1| Set1, isoform G [Drosophila melanogaster]
 gi|281366753|ref|NP_001163850.1| Set1, isoform H [Drosophila melanogaster]
 gi|281366755|ref|NP_001163851.1| Set1, isoform I [Drosophila melanogaster]
 gi|51951109|gb|EAL24598.1| Set1, isoform A [Drosophila melanogaster]
 gi|51951110|gb|EAL24599.1| Set1, isoform B [Drosophila melanogaster]
 gi|158529717|gb|EDP28071.1| Set1, isoform C [Drosophila melanogaster]
 gi|281309231|gb|EFA98694.1| Set1, isoform D [Drosophila melanogaster]
 gi|281309232|gb|EFA98695.1| Set1, isoform E [Drosophila melanogaster]
 gi|281309233|gb|EFA98696.1| Set1, isoform F [Drosophila melanogaster]
 gi|281309234|gb|EFA98697.1| Set1, isoform G [Drosophila melanogaster]
 gi|281309235|gb|EFA98698.1| Set1, isoform H [Drosophila melanogaster]
 gi|281309236|gb|EFA98699.1| Set1, isoform I [Drosophila melanogaster]
          Length = 1641

 Score = 69.3 bits (168), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 44/79 (55%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N A  I HSC PNC AKV  ++   +I IY+ + I   EEIT+DY    E 
Sbjct: 1565 TIIDATKCGNLARFINHSCNPNCYAKVITIESEKKIVIYSKQPIGINEEITYDYKFPLED 1624

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
                E   CLCG+Q CRG+
Sbjct: 1625 ----EKIPCLCGAQGCRGT 1639


>gi|414587222|tpg|DAA37793.1| TPA: hypothetical protein ZEAMMB73_251567 [Zea mays]
          Length = 489

 Score = 69.3 bits (168), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 49/151 (32%), Positives = 72/151 (47%), Gaps = 26/151 (17%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G+V ++    G+  FV+E+ GEV   WK     +  R  Q        + Y IYL  
Sbjct: 71   RGWGLVADENIMAGQ--FVIEYCGEVIS-WK-----EAKRRAQAYETQCLKDAYIIYLNA 122

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +          +DA  K N A  I HSC+PNCE +   V G  ++GI+  + I +G E+
Sbjct: 123  DES---------IDATRKGNLARFINHSCQPNCETRKWNVLGEVRVGIFAKQNIPFGTEL 173

Query: 2011 TFDYNSVTESKEEYEASV---CLCGSQVCRG 2038
            ++DYN       E+   V   CLCG+  C G
Sbjct: 174  SYDYNF------EWYGGVMVRCLCGAASCSG 198


>gi|326472906|gb|EGD96915.1| histone-lysine N-methyltransferase [Trichophyton tonsurans CBS
            112818]
          Length = 1330

 Score = 69.3 bits (168), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA      A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1251 TVIDATKHGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1310

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1311 DSD-DRIPCLCGSTGCKG-FLN 1330


>gi|119467882|ref|XP_001257747.1| SET domain protein [Neosartorya fischeri NRRL 181]
 gi|119405899|gb|EAW15850.1| SET domain protein [Neosartorya fischeri NRRL 181]
          Length = 1241

 Score = 69.3 bits (168), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1162 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIGRDEELTYDYKFEREW 1221

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1222 DSD-DRIPCLCGSTGCKG-FLN 1241


>gi|359492362|ref|XP_002284621.2| PREDICTED: uncharacterized protein LOC100245350 [Vitis vinifera]
          Length = 2184

 Score = 69.3 bits (168), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 35/77 (45%), Positives = 43/77 (55%), Gaps = 2/77 (2%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            V+DA  K N    I HSC PNC  +   V+G   IG++ +R I  GEE+TFDYN V    
Sbjct: 1313 VIDACAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFG 1372

Query: 2022 EEYEASVCLCGSQVCRG 2038
                A  C+CGS  CRG
Sbjct: 1373 --AAAKKCVCGSPQCRG 1387


>gi|348527922|ref|XP_003451468.1| PREDICTED: hypothetical protein LOC100692734 [Oreochromis niloticus]
          Length = 2421

 Score = 69.3 bits (168), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 49/158 (31%), Positives = 78/158 (49%), Gaps = 21/158 (13%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            +    +G G+ C  +   G+  FV E++GEV       E +  IR  Q NN      FY 
Sbjct: 2005 FRTLSRGWGLRCVHDIKKGQ--FVSEYVGEVI---DEEECRSRIRHAQDNN---ICNFYM 2056

Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
            + L++ +         ++DA  K N A  + HSC+PNCE +   V+G  ++G++ +  I 
Sbjct: 2057 LTLDKDR---------IIDAGPKGNEARFMNHSCQPNCETQKWTVNGDTRVGLFALIDIA 2107

Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNL 2043
             G E+TF+YN       +   +VC CG+  C G +L L
Sbjct: 2108 AGTELTFNYNLECLGNRK---TVCKCGASNCSG-FLGL 2141


>gi|332026544|gb|EGI66662.1| Histone-lysine N-methyltransferase SETD2 [Acromyrmex echinatior]
          Length = 1841

 Score = 69.3 bits (168), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 74/149 (49%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+    +   GE  F++E++GEV       + +D  R  ++ ++D    +Y + L 
Sbjct: 881  KKGFGLRAVVDIMAGE--FIMEYVGEV------VDPKDFRRRAKEYSKDKNRHYYFMAL- 931

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  I HSC PN E +   V+G  +IG +  + I  GEE
Sbjct: 932  --KSDQ------IIDATMKGNISRFINHSCDPNAETQKWTVNGELRIGFFNKKFIAAGEE 983

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            ITFDY+     K   EA  C C +  CRG
Sbjct: 984  ITFDYHFQRYGK---EAQKCYCEALNCRG 1009


>gi|157103255|ref|XP_001647894.1| mixed-lineage leukemia protein, mll [Aedes aegypti]
 gi|108884726|gb|EAT48951.1| AAEL000054-PA, partial [Aedes aegypti]
          Length = 3069

 Score = 69.3 bits (168), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 54/157 (34%), Positives = 75/157 (47%), Gaps = 23/157 (14%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            Y ++  G G+ CN++   GE   V+E+ GE+            IRS   +  +   +   
Sbjct: 2936 YRSHIHGRGLFCNRDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRG 2981

Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
            I     K D    +  VVDA  + N A  I HSC PNC +KV  + GH  I I+ +R I 
Sbjct: 2982 IGCYMFKID----EHFVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIV 3037

Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
             GEE+T+DY    E  +      C CGS+ CR  YLN
Sbjct: 3038 QGEELTYDYKFPFEDVK----IPCSCGSKKCR-KYLN 3069


>gi|401842102|gb|EJT44375.1| SET1-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 1087

 Score = 69.3 bits (168), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C PNC AK+  V G  +I IY +R I   EE+T+DY    E 
Sbjct: 1008 TVIDATKKGGIARFINHCCDPNCTAKIIKVGGRRRIVIYALRDIGANEELTYDYKFEREQ 1067

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +E E   CLCG+  C+G +LN
Sbjct: 1068 DDE-ERLPCLCGASNCKG-FLN 1087


>gi|326477398|gb|EGE01408.1| histone-lysine N-methyltransferase [Trichophyton equinum CBS 127.97]
          Length = 1331

 Score = 69.3 bits (168), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA      A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1252 TVIDATKHGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1311

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1312 DSD-DRIPCLCGSTGCKG-FLN 1331


>gi|194900731|ref|XP_001979909.1| GG21380 [Drosophila erecta]
 gi|190651612|gb|EDV48867.1| GG21380 [Drosophila erecta]
          Length = 3741

 Score = 69.3 bits (168), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 54/151 (35%), Positives = 72/151 (47%), Gaps = 23/151 (15%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C K+   GE   V+E+ GE+            IRS   +  +   +   I     
Sbjct: 3614 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3659

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            K D    D +VVDA  + N A  I H C PNC +KV  + GH  I I+ +R I  GEE+T
Sbjct: 3660 KID----DNLVVDATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 3715

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    +   E E   C CGS+ CR  YLN
Sbjct: 3716 YDY----KFPFEEEKIPCSCGSKRCR-KYLN 3741


>gi|225685245|gb|EEH23529.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
          Length = 756

 Score = 68.9 bits (167), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 46/149 (30%), Positives = 80/149 (53%), Gaps = 22/149 (14%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G GV  N+   F  +  +VE+ GE+    K  E++  +R++ KNNE     +Y +Y ++
Sbjct: 375  RGYGVRSNRT--FAPNQIIVEYTGEII-TQKECERR--MRTVYKNNEC----YYLMYFDQ 425

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVR-GIHYGEE 2009
                      +++DA  + + A  + HSC PNCE +   V G  ++ ++  + GI  GEE
Sbjct: 426  N---------MIIDAT-RGSIARFVNHSCEPNCEMEKWTVAGKPRMALFAGKNGITTGEE 475

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +T+DYN    S++  +   C CG++ CRG
Sbjct: 476  LTYDYNFDPYSQKNVQE--CRCGAETCRG 502


>gi|410898830|ref|XP_003962900.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
            [Takifugu rubripes]
          Length = 1329

 Score = 68.9 bits (167), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 47/158 (29%), Positives = 81/158 (51%), Gaps = 21/158 (13%)

Query: 1882 PDDKYVAYR-KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD K +    KG G++  ++   GE  FV E++GE+       E +  I+  Q+NN    
Sbjct: 1023 PDTKIIKTPGKGWGLITLRDIKKGE--FVNEYIGELI---DEEECRARIKYAQENN---V 1074

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + +++ +         ++DA  K NY+  + HSC+PNCE +   V+G  ++G++ 
Sbjct: 1075 TNFYMLTIDKDR---------IIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFA 1125

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +  +  G E+TF+YN      E+   + C CG+  C G
Sbjct: 1126 ICDVPAGTELTFNYNLDCLGNEK---TACCCGAPNCSG 1160


>gi|296805347|ref|XP_002843498.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
 gi|238844800|gb|EEQ34462.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
          Length = 1344

 Score = 68.9 bits (167), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA      A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1265 TVIDATKHGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1324

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1325 DSD-DRIPCLCGSTGCKG-FLN 1344


>gi|414587221|tpg|DAA37792.1| TPA: hypothetical protein ZEAMMB73_251567 [Zea mays]
          Length = 503

 Score = 68.9 bits (167), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 49/151 (32%), Positives = 72/151 (47%), Gaps = 26/151 (17%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G+V ++    G+  FV+E+ GEV   WK     +  R  Q        + Y IYL  
Sbjct: 85   RGWGLVADENIMAGQ--FVIEYCGEVIS-WK-----EAKRRAQAYETQCLKDAYIIYLNA 136

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +          +DA  K N A  I HSC+PNCE +   V G  ++GI+  + I +G E+
Sbjct: 137  DES---------IDATRKGNLARFINHSCQPNCETRKWNVLGEVRVGIFAKQNIPFGTEL 187

Query: 2011 TFDYNSVTESKEEYEASV---CLCGSQVCRG 2038
            ++DYN       E+   V   CLCG+  C G
Sbjct: 188  SYDYNF------EWYGGVMVRCLCGAASCSG 212


>gi|340710026|ref|XP_003393599.1| PREDICTED: hypothetical protein LOC100646252 [Bombus terrestris]
          Length = 3530

 Score = 68.9 bits (167), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 60/177 (33%), Positives = 81/177 (45%), Gaps = 27/177 (15%)

Query: 1868 MKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQD 1927
            M M   ILK         Y ++  G G+ C ++   GE   V+E+ GEV           
Sbjct: 3379 MAMRFRILKETSKESVGVYHSHIHGRGLFCLRDIEAGE--MVIEYAGEV----------- 3425

Query: 1928 GIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKV 1987
             IR+   +  +   +  NI     K D    D +VVDA  K N A  I HSC PNC ++V
Sbjct: 3426 -IRASLTDKREKYYDSKNIGCYMFKID----DHLVVDATMKGNAARFINHSCEPNCYSRV 3480

Query: 1988 TAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE--ASVCLCGSQVCRGSYLN 2042
              + G   I I+ +R I  GEE+T+DY      K  +E     C CGS+ CR  YLN
Sbjct: 3481 VDILGKKHILIFALRRIIQGEELTYDY------KFPFEDIKIPCTCGSRRCR-KYLN 3530


>gi|347968475|ref|XP_563394.4| AGAP002741-PA [Anopheles gambiae str. PEST]
 gi|333467986|gb|EAL40845.4| AGAP002741-PA [Anopheles gambiae str. PEST]
          Length = 4925

 Score = 68.9 bits (167), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 54/157 (34%), Positives = 75/157 (47%), Gaps = 23/157 (14%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            Y ++  G G+ CN++   GE   V+E+ GE+            IRS   +  +   +   
Sbjct: 4792 YRSHIHGRGLFCNRDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRG 4837

Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
            I     K D +     VVDA  + N A  I HSC PNC +KV  + GH  I I+ +R I 
Sbjct: 4838 IGCYMFKIDEN----FVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIV 4893

Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
             GEE+T+DY    E  +      C CGS+ CR  YLN
Sbjct: 4894 QGEELTYDYKFPFEDVK----IPCSCGSKKCR-KYLN 4925


>gi|17136558|ref|NP_476770.1| trithorax, isoform B [Drosophila melanogaster]
 gi|19550181|ref|NP_599108.1| trithorax, isoform C [Drosophila melanogaster]
 gi|62472551|ref|NP_001014621.1| trithorax, isoform E [Drosophila melanogaster]
 gi|23171245|gb|AAN13600.1| trithorax, isoform B [Drosophila melanogaster]
 gi|23171246|gb|AAN13601.1| trithorax, isoform C [Drosophila melanogaster]
 gi|61679333|gb|AAX52951.1| trithorax, isoform E [Drosophila melanogaster]
          Length = 3358

 Score = 68.9 bits (167), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 54/151 (35%), Positives = 71/151 (47%), Gaps = 23/151 (15%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C K+   GE   V+E+ GE+            IRS   +  +   +   I     
Sbjct: 3231 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3276

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            K D    D +VVDA  + N A  I H C PNC +KV  + GH  I I+ +R I  GEE+T
Sbjct: 3277 KID----DNLVVDATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 3332

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    E     E   C CGS+ CR  YLN
Sbjct: 3333 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3358


>gi|402221447|gb|EJU01516.1| SET domain-containing protein [Dacryopinax sp. DJM-731 SS1]
          Length = 164

 Score = 68.9 bits (167), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 50/139 (35%), Positives = 66/139 (47%), Gaps = 26/139 (18%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVVV 1963
            D V+E++GEV            +R    +  +   E   I   YL R   D      +VV
Sbjct: 49   DMVIEYVGEV------------VRQQVADKREKVYERQGIGSSYLFRIDDD------LVV 90

Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
            DA  K N    I HSC PNC A++  ++   +I IY    I  GEEIT+DY+   E    
Sbjct: 91   DATMKGNIGRLINHSCSPNCTARIITINSSKKIVIYAKTPIEPGEEITYDYHFPIEQ--- 147

Query: 2024 YEASVCLCGSQVCRGSYLN 2042
             E   CLCGS+ CRG +LN
Sbjct: 148  -EKIPCLCGSEKCRG-FLN 164


>gi|350421470|ref|XP_003492853.1| PREDICTED: hypothetical protein LOC100746901 [Bombus impatiens]
          Length = 1777

 Score = 68.9 bits (167), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 74/149 (49%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+    +   GE  F++E++GEV       + +D  R  ++ ++D    +Y + L 
Sbjct: 830  KKGFGLRAMVDLLAGE--FIMEYVGEV------VDPKDFRRRAKEYSKDKNKHYYFMAL- 880

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  I HSC PN E +   V+G  +IG +  + I  GEE
Sbjct: 881  --KSDQ------IIDATLKGNVSRFINHSCDPNSETQKWTVNGELRIGFFNKKFIAAGEE 932

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            ITFDY+     K   EA  C C +  CRG
Sbjct: 933  ITFDYHFQRYGK---EAQKCFCEAPNCRG 958


>gi|340726897|ref|XP_003401788.1| PREDICTED: hypothetical protein LOC100652142 [Bombus terrestris]
          Length = 1777

 Score = 68.9 bits (167), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 74/149 (49%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+    +   GE  F++E++GEV       + +D  R  ++ ++D    +Y + L 
Sbjct: 830  KKGFGLRAMVDLLAGE--FIMEYVGEV------VDPKDFRRRAKEYSKDKNKHYYFMAL- 880

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  I HSC PN E +   V+G  +IG +  + I  GEE
Sbjct: 881  --KSDQ------IIDATLKGNVSRFINHSCDPNSETQKWTVNGELRIGFFNKKFIAAGEE 932

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            ITFDY+     K   EA  C C +  CRG
Sbjct: 933  ITFDYHFQRYGK---EAQKCFCEAPNCRG 958


>gi|70991351|ref|XP_750524.1| SET domain protein [Aspergillus fumigatus Af293]
 gi|74671075|sp|Q4WNH8.1|SET1_ASPFU RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component set1; AltName:
            Full=SET domain-containing protein 1
 gi|66848157|gb|EAL88486.1| SET domain protein [Aspergillus fumigatus Af293]
          Length = 1241

 Score = 68.9 bits (167), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1162 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIGRDEELTYDYKFEREW 1221

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1222 DSD-DRIPCLCGSTGCKG-FLN 1241


>gi|159124080|gb|EDP49198.1| SET domain protein [Aspergillus fumigatus A1163]
          Length = 1241

 Score = 68.9 bits (167), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1162 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIGRDEELTYDYKFEREW 1221

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1222 DSD-DRIPCLCGSTGCKG-FLN 1241


>gi|256271664|gb|EEU06704.1| Set1p [Saccharomyces cerevisiae JAY291]
          Length = 1080

 Score = 68.9 bits (167), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C PNC AK+  V G  +I IY +R I   EE+T+DY    E 
Sbjct: 1001 TVIDATKKGGIARFINHCCDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREK 1060

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +E E   CLCG+  C+G +LN
Sbjct: 1061 DDE-ERLPCLCGAPNCKG-FLN 1080


>gi|162463380|ref|NP_001105665.1| SET domain-containing protein SET102 [Zea mays]
 gi|22121720|gb|AAM89289.1| SET domain-containing protein SET102 [Zea mays]
 gi|414587223|tpg|DAA37794.1| TPA: SET domain-containing protein SET102 [Zea mays]
          Length = 513

 Score = 68.9 bits (167), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 49/151 (32%), Positives = 72/151 (47%), Gaps = 26/151 (17%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G+V ++    G+  FV+E+ GEV   WK     +  R  Q        + Y IYL  
Sbjct: 95   RGWGLVADENIMAGQ--FVIEYCGEVIS-WK-----EAKRRAQAYETQCLKDAYIIYLNA 146

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +          +DA  K N A  I HSC+PNCE +   V G  ++GI+  + I +G E+
Sbjct: 147  DES---------IDATRKGNLARFINHSCQPNCETRKWNVLGEVRVGIFAKQNIPFGTEL 197

Query: 2011 TFDYNSVTESKEEYEASV---CLCGSQVCRG 2038
            ++DYN       E+   V   CLCG+  C G
Sbjct: 198  SYDYNF------EWYGGVMVRCLCGAASCSG 222


>gi|17136556|ref|NP_476769.1| trithorax, isoform D [Drosophila melanogaster]
 gi|19550184|ref|NP_599109.1| trithorax, isoform A [Drosophila melanogaster]
 gi|290457684|sp|P20659.4|TRX_DROME RecName: Full=Histone-lysine N-methyltransferase trithorax; AltName:
            Full=Lysine N-methyltransferase 2A
 gi|10726522|gb|AAF55041.2| trithorax, isoform A [Drosophila melanogaster]
 gi|23171244|gb|AAN13599.1| trithorax, isoform D [Drosophila melanogaster]
          Length = 3726

 Score = 68.9 bits (167), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 54/151 (35%), Positives = 71/151 (47%), Gaps = 23/151 (15%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C K+   GE   V+E+ GE+            IRS   +  +   +   I     
Sbjct: 3599 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3644

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            K D    D +VVDA  + N A  I H C PNC +KV  + GH  I I+ +R I  GEE+T
Sbjct: 3645 KID----DNLVVDATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 3700

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    E     E   C CGS+ CR  YLN
Sbjct: 3701 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3726


>gi|410923178|ref|XP_003975059.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Takifugu
            rubripes]
          Length = 1499

 Score = 68.9 bits (167), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 45/148 (30%), Positives = 72/148 (48%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G+  N+    GE  FV E++GEV       + ++  + +++ +E+    FY + L +
Sbjct: 1207 RGWGLKANQPLKKGE--FVTEYVGEV------IDAEECQQRIKRAHENHMTNFYMLTLTK 1258

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         V+DA  K N +  I HSC PNCE +   V+G   IG++ +  I  G E+
Sbjct: 1259 DR---------VIDAAQKGNLSRFINHSCSPNCETQKWTVNGDVHIGLFALCDIDAGTEL 1309

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN           + C CGS  C G
Sbjct: 1310 TFNYNLHCVGNRR---TTCNCGSDNCSG 1334


>gi|315045626|ref|XP_003172188.1| histone-lysine N-methyltransferase [Arthroderma gypseum CBS 118893]
 gi|311342574|gb|EFR01777.1| histone-lysine N-methyltransferase [Arthroderma gypseum CBS 118893]
          Length = 1334

 Score = 68.9 bits (167), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA      A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1255 TVIDATKHGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1314

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1315 DSD-DRIPCLCGSTGCKG-FLN 1334


>gi|151944065|gb|EDN62358.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
          Length = 1080

 Score = 68.9 bits (167), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C PNC AK+  V G  +I IY +R I   EE+T+DY    E 
Sbjct: 1001 TVIDATKKGGIARFINHCCDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREK 1060

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +E E   CLCG+  C+G +LN
Sbjct: 1061 DDE-ERLPCLCGAPNCKG-FLN 1080


>gi|241612901|ref|XP_002407306.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
 gi|215502770|gb|EEC12264.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
          Length = 208

 Score = 68.9 bits (167), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 37/81 (45%), Positives = 47/81 (58%), Gaps = 5/81 (6%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            ++DA    N A  I HSC PNC AKV  V+G  +I IY+ + I+  EEIT+DY    E  
Sbjct: 133  IIDATKCGNLARFINHSCNPNCYAKVITVEGQKKIVIYSKQPINVNEEITYDYKFPLEE- 191

Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
               E   CLCG+  CRG +LN
Sbjct: 192  ---EKISCLCGAPQCRG-FLN 208


>gi|6321911|ref|NP_011987.1| Set1p [Saccharomyces cerevisiae S288c]
 gi|731707|sp|P38827.1|SET1_YEAST RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=Lysine N-methyltransferase 2; AltName: Full=SET
            domain-containing protein 1
 gi|529135|gb|AAB68867.1| Set1p [Saccharomyces cerevisiae]
 gi|190405898|gb|EDV09165.1| histone-lysine N-methyltransferase [Saccharomyces cerevisiae RM11-1a]
 gi|285810026|tpg|DAA06813.1| TPA: Set1p [Saccharomyces cerevisiae S288c]
 gi|392298926|gb|EIW10021.1| Set1p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 1080

 Score = 68.9 bits (167), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C PNC AK+  V G  +I IY +R I   EE+T+DY    E 
Sbjct: 1001 TVIDATKKGGIARFINHCCDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREK 1060

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +E E   CLCG+  C+G +LN
Sbjct: 1061 DDE-ERLPCLCGAPNCKG-FLN 1080


>gi|374370210|ref|ZP_09628219.1| methyltransferase [Cupriavidus basilensis OR16]
 gi|373098212|gb|EHP39324.1| methyltransferase [Cupriavidus basilensis OR16]
          Length = 188

 Score = 68.9 bits (167), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 76/156 (48%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G GV  N     GE + ++E+ GE +  WK         +L+++  DPA   +  Y    
Sbjct: 50   GKGVYAN--APIGEGERIIEYKGE-HISWK--------EALKRHPHDPADPNHTFYFSLE 98

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
             GD       V+DA    N A  I H+C PNCEA+    +   ++ I+ +R I  GEE+ 
Sbjct: 99   DGD-------VIDAKFGGNRARWINHACEPNCEAR----EKKGRVFIHALRDIASGEELF 147

Query: 2012 FDYNSVTES------KEEYEASVCLCGSQVCRGSYL 2041
            +DY  V ++      K+E+E   C CGS  CRG+ L
Sbjct: 148  YDYGLVIDARYTKKLKKEFE---CRCGSPKCRGTML 180


>gi|207344594|gb|EDZ71692.1| YHR119Wp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 1080

 Score = 68.9 bits (167), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C PNC AK+  V G  +I IY +R I   EE+T+DY    E 
Sbjct: 1001 TVIDATKKGGIARFINHCCDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREK 1060

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +E E   CLCG+  C+G +LN
Sbjct: 1061 DDE-ERLPCLCGAPNCKG-FLN 1080


>gi|449295340|gb|EMC91362.1| hypothetical protein BAUCODRAFT_80239 [Baudoinia compniacensis UAMH
            10762]
          Length = 1279

 Score = 68.9 bits (167), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 47/140 (33%), Positives = 72/140 (51%), Gaps = 23/140 (16%)

Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDA 1965
            +D ++E++GE     K  +K   +R L+   +       + YL R   D       +VDA
Sbjct: 1160 NDLIIEYVGE-----KVRQKVADLRELRYEKQG----VGSSYLFRMMDDE------IVDA 1204

Query: 1966 MHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE 2025
              K   A  I HSC PNC AK+  V+G  +I IY ++ I   EE+T+DY    + + EY 
Sbjct: 1205 TKKGGIARFINHSCSPNCTAKIIKVEGTPRIVIYALKDIGKNEELTYDY----KFEREYG 1260

Query: 2026 AS---VCLCGSQVCRGSYLN 2042
            ++    CLCG+  C+G +LN
Sbjct: 1261 STDRIPCLCGTANCKG-FLN 1279


>gi|357627347|gb|EHJ77076.1| hypothetical protein KGM_14526 [Danaus plexippus]
          Length = 1912

 Score = 68.9 bits (167), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 39/131 (29%), Positives = 68/131 (51%), Gaps = 18/131 (13%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            FV+E++GE+       ++++  R + + +E     FY + L++ +         ++DA  
Sbjct: 1688 FVIEYVGEL------IDEEEFRRRMNRKHEVRDENFYFLTLDKER---------MIDAGP 1732

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
            K N A  + HSC PNCE +   V G  ++G++ +R I    E+TF+YN  T      E  
Sbjct: 1733 KGNLARFMNHSCEPNCETQKWTVLGDVRVGLFALRDIPANSELTFNYNLETSG---IEKK 1789

Query: 2028 VCLCGSQVCRG 2038
             C+CG++ C G
Sbjct: 1790 RCMCGAKRCSG 1800


>gi|349578671|dbj|GAA23836.1| K7_Set1p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 1080

 Score = 68.9 bits (167), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C PNC AK+  V G  +I IY +R I   EE+T+DY    E 
Sbjct: 1001 TVIDATKKGGIARFINHCCDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREK 1060

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +E E   CLCG+  C+G +LN
Sbjct: 1061 DDE-ERLPCLCGAPNCKG-FLN 1080


>gi|47216786|emb|CAG03790.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1443

 Score = 68.9 bits (167), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 42/134 (31%), Positives = 71/134 (52%), Gaps = 18/134 (13%)

Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
            + +FV E++GE+       E +  I+  Q+NN      FY + +++ +         ++D
Sbjct: 1117 QGEFVNEYIGELI---DEEECRARIKYAQENNIT---NFYMLTIDKDR---------IID 1161

Query: 1965 AMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY 2024
            A  K NY+  + HSC+PNCE +   V+G  ++G++ V  I  G E+TF+YN      E+ 
Sbjct: 1162 AGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK- 1220

Query: 2025 EASVCLCGSQVCRG 2038
              +VC CG+  C G
Sbjct: 1221 --TVCCCGAPNCSG 1232


>gi|322792358|gb|EFZ16342.1| hypothetical protein SINV_07789 [Solenopsis invicta]
          Length = 3272

 Score = 68.9 bits (167), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 60/175 (34%), Positives = 79/175 (45%), Gaps = 23/175 (13%)

Query: 1868 MKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQD 1927
            M M   ILK         Y +   G G+ C ++   GE   V+E+ GEV           
Sbjct: 3121 MAMRFRILKETSKASVGVYYSRIHGRGLFCLRDIEPGE--MVIEYAGEV----------- 3167

Query: 1928 GIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKV 1987
             IRS   +  +   +  NI     K D    D +VVDA  K N A  I HSC PNC ++V
Sbjct: 3168 -IRSSLTDKREKYYDSKNIGCYMFKID----DHLVVDATMKGNAARFINHSCEPNCYSRV 3222

Query: 1988 TAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
              + G   I I+ +R I  GEE+T+DY    E  +      C CGS+ CR  YLN
Sbjct: 3223 VDILGKKHILIFALRRIIQGEELTYDYKFPFEDIK----IPCTCGSRKCR-KYLN 3272


>gi|406694364|gb|EKC97692.1| hypothetical protein A1Q2_08004 [Trichosporon asahii var. asahii CBS
            8904]
          Length = 1218

 Score = 68.9 bits (167), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 37/96 (38%), Positives = 50/96 (52%), Gaps = 8/96 (8%)

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            YL R  GD      +V DA  K + +  I HSC P   AK+  ++GH +I IY  R ++ 
Sbjct: 1131 YLFRIDGD------IVCDATFKGSVSRLINHSCNPTANAKIININGHNKIVIYAKRTLYP 1184

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            G+E+T+ YN   E  E      CLCG   C G +LN
Sbjct: 1185 GDEVTYSYNFPLEQDESLRVR-CLCGEPTCLG-FLN 1218


>gi|402852477|ref|XP_003890948.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
            [Papio anubis]
          Length = 1013

 Score = 68.9 bits (167), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 721  KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 772

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 773  DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 823

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 824  TFNYNLDCLGNEK---TVCRCGASNCSG 848


>gi|213406581|ref|XP_002174062.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
            yFS275]
 gi|212002109|gb|EEB07769.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
            yFS275]
          Length = 779

 Score = 68.9 bits (167), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 47/149 (31%), Positives = 71/149 (47%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+  N      +  FV E++GEV P  ++ ++      +++ +E     FY + L+
Sbjct: 167  KKGFGLRAN--SYLTKGTFVYEYIGEVIPEVRFRKR------MREYDERGIRHFYFMMLQ 218

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              KG+        +DA  K + A    HSCRPNC      V    ++GI+  R I  GEE
Sbjct: 219  --KGE-------YIDATVKGSLARFCNHSCRPNCYVDKWVVGNKLRMGIFCKRDIQKGEE 269

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +TFDYN     +   +A  C CG   C G
Sbjct: 270  LTFDYNV---DRYGAQAQPCYCGEDCCLG 295


>gi|357631650|gb|EHJ79119.1| hypothetical protein KGM_15585 [Danaus plexippus]
          Length = 1491

 Score = 68.9 bits (167), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 45/82 (54%), Gaps = 5/82 (6%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N A  I HSC PNC AK+  ++   +I IY+ + I   EEIT+DY    E 
Sbjct: 1415 TIIDATKCGNLARFINHSCNPNCYAKIITIESQKKIVIYSKQPIGVDEEITYDYKFPLED 1474

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
                E   CLCG+  CRG YLN
Sbjct: 1475 ----EKIPCLCGAPQCRG-YLN 1491


>gi|255558564|ref|XP_002520307.1| huntingtin interacting protein, putative [Ricinus communis]
 gi|223540526|gb|EEF42093.1| huntingtin interacting protein, putative [Ricinus communis]
          Length = 1746

 Score = 68.9 bits (167), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 35/77 (45%), Positives = 42/77 (54%), Gaps = 2/77 (2%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            V+DA  K N    I HSC PNC  +   V+G   IG++ +R I  GEE+TFDYN V    
Sbjct: 897  VIDACAKGNLGRFINHSCDPNCRTEKWVVNGEICIGLFALRDIKKGEELTFDYNYVRVCG 956

Query: 2022 EEYEASVCLCGSQVCRG 2038
                A  C CGS  CRG
Sbjct: 957  --AAAKRCYCGSPQCRG 971


>gi|355718741|gb|AES06369.1| SET domain containing 1B [Mustela putorius furo]
          Length = 359

 Score = 68.9 bits (167), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 284  TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 343

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 344  VK----IPCLCGSENCRGT 358


>gi|328767162|gb|EGF77213.1| hypothetical protein BATDEDRAFT_91931 [Batrachochytrium dendrobatidis
            JAM81]
          Length = 779

 Score = 68.6 bits (166), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 46/154 (29%), Positives = 68/154 (44%), Gaps = 24/154 (15%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            + A  +G G+  +     G    ++E+ GE+    K  E+ D I S QKN+         
Sbjct: 567  FYAPNRGFGLYTDVPIKAGV--LIIEYRGEIISTAKCIERNDTIYSGQKNH--------- 615

Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
             +LE   G       +V+D   K   A    HSC PNC  +   V   +++GI+    I 
Sbjct: 616  YFLEYGNG-------LVLDGCRKGTIARFANHSCDPNCHVEKWYVGTEFRVGIFATNNIS 668

Query: 2006 YGEEITFDYNSVTESKEEY-EASVCLCGSQVCRG 2038
             G E+T+DY       + Y +   C CGSQ CRG
Sbjct: 669  VGSELTYDYRF-----DSYGQMQPCYCGSQNCRG 697


>gi|393244480|gb|EJD51992.1| histone methyltransferase [Auricularia delicata TFB-10046 SS5]
          Length = 153

 Score = 68.6 bits (166), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 38/83 (45%), Positives = 51/83 (61%), Gaps = 7/83 (8%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
            +VVDA  K N    I HSC PNC AK+ +V+G  +I IY  + I  G+E+T+DY+   E 
Sbjct: 77   LVVDATKKGNLGRLINHSCDPNCTAKIISVNGVKKIVIYAKQDIELGDELTYDYHFPRE- 135

Query: 2021 KEEYEASV-CLCGSQVCRGSYLN 2042
                EA + CLCG+  CRG +LN
Sbjct: 136  ----EAKIPCLCGAAKCRG-FLN 153


>gi|28204960|gb|AAH46473.1| Whsc1 protein, partial [Mus musculus]
          Length = 851

 Score = 68.6 bits (166), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 559  KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 610

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 611  DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 661

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 662  TFNYNLDCLGNEK---TVCRCGASNCSG 686


>gi|391331299|ref|XP_003740087.1| PREDICTED: uncharacterized protein LOC100899404 [Metaseiulus
            occidentalis]
          Length = 2686

 Score = 68.6 bits (166), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 52/158 (32%), Positives = 72/158 (45%), Gaps = 35/158 (22%)

Query: 1888 AYRKGL---GVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFY 1944
             YR G+   G+ C K+   GE   ++E+ GEV               ++ +  D   ++Y
Sbjct: 2552 VYRSGIHGRGLYCKKDIAKGE--MIIEYAGEV---------------IRASLCDRREKYY 2594

Query: 1945 -----NIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIY 1999
                   Y+ R   D       VVDA  K N A  I HSC PNC +K+  VD    I IY
Sbjct: 2595 EGRGLGCYMFRMDNDE------VVDATVKGNAARFINHSCDPNCYSKMITVDNKKHIVIY 2648

Query: 2000 TVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
             +R I  GEE+T+DY    E  + +    C CGS+ CR
Sbjct: 2649 ALREIRTGEELTYDYKFPIEDDKLH----CTCGSRRCR 2682


>gi|37360238|dbj|BAC98097.1| mKIAA1090 protein [Mus musculus]
          Length = 857

 Score = 68.6 bits (166), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 565  KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 616

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 617  DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 667

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 668  TFNYNLDCLGNEK---TVCRCGASNCSG 692


>gi|348675982|gb|EGZ15800.1| hypothetical protein PHYSODRAFT_263017 [Phytophthora sojae]
          Length = 823

 Score = 68.6 bits (166), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 48/147 (32%), Positives = 73/147 (49%), Gaps = 21/147 (14%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+V N++   GE  F++E++GEV       +  +  R + +  ++    FY + LE+ 
Sbjct: 246  GFGLVANEKINAGE--FIIEYVGEV------IDDIECERRMIQYRDNGEVNFYMMELEKN 297

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
                     +V+DA +++N +  I H C PN   +   VDG  +IGI+  R I   EEIT
Sbjct: 298  ---------IVIDAKYRSNDSRFINHCCDPNSVTQKWNVDGMQRIGIFARRNIAPDEEIT 348

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRG 2038
             DYN         EA+ C CGS  C G
Sbjct: 349  IDYN----FSHFGEAADCKCGSTACTG 371


>gi|195143973|ref|XP_002012971.1| GL23881 [Drosophila persimilis]
 gi|194101914|gb|EDW23957.1| GL23881 [Drosophila persimilis]
          Length = 1466

 Score = 68.6 bits (166), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 41/150 (27%), Positives = 77/150 (51%), Gaps = 23/150 (15%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G+VC +     E DF++E++GEV        +++  R + +  +D    FY + +E+
Sbjct: 1279 RGFGLVCREP--IAEGDFIIEYVGEV------INQEEFQRRMLRKQKDRDENFYFLGVEK 1330

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
                       ++DA  K N A  + HSC PNC ++   V+  +++G++ ++ I    E+
Sbjct: 1331 E---------FIIDAGPKGNLARFMNHSCEPNCTSQKWTVNCTHRVGLFAIQDIPAETEL 1381

Query: 2011 TFDY--NSVTESKEEYEASVCLCGSQVCRG 2038
            TF+Y  + +   K++     C CGS+ C G
Sbjct: 1382 TFNYLWDDLLNDKKK----ACHCGSERCSG 1407


>gi|328768890|gb|EGF78935.1| hypothetical protein BATDEDRAFT_90118 [Batrachochytrium dendrobatidis
            JAM81]
          Length = 1361

 Score = 68.6 bits (166), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 46/150 (30%), Positives = 71/150 (47%), Gaps = 24/150 (16%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+   +    G   F++E+ GEV P    F K+     + +++ + A  FY + L++
Sbjct: 251  KGFGIYARENIAGGA--FIIEYCGEVIPA-SLFGKR-----ITEHSNNSAQHFYFMSLKK 302

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
                 D Y    +DA  K N +  + HSC PNC  +   V    +IG++ +R I    E+
Sbjct: 303  -----DEY----IDASKKGNLSRYLNHSCDPNCSLQKWLVGDTIRIGLFALRAIPKNAEL 353

Query: 2011 TFDYNSVTESKEEY--EASVCLCGSQVCRG 2038
            TFDY       E Y  +A  C CG+  C G
Sbjct: 354  TFDYKF-----ERYGSKAQECYCGAAACTG 378


>gi|321472797|gb|EFX83766.1| hypothetical protein DAPPUDRAFT_301653 [Daphnia pulex]
          Length = 303

 Score = 68.6 bits (166), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 33/78 (42%), Positives = 45/78 (57%), Gaps = 4/78 (5%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            ++DA    N A  I HSC PNC A+V  ++   +I IY+ + I  GEEIT+DY    E  
Sbjct: 228  IIDATKCGNLARFINHSCNPNCYARVITIESQKKIVIYSKQPIGVGEEITYDYKFPIEE- 286

Query: 2022 EEYEASVCLCGSQVCRGS 2039
               +  +CLCGS  CRG+
Sbjct: 287  ---DKIICLCGSSQCRGT 301


>gi|213624868|gb|AAI71696.1| Wolf-Hirschhorn syndrome candidate 1 [Danio rerio]
          Length = 1461

 Score = 68.6 bits (166), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 47/148 (31%), Positives = 78/148 (52%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G++  ++   GE  FV E++GE+       E +  IR  Q+N+      FY + +++
Sbjct: 1164 KGWGLISLRDIKKGE--FVNEYVGELI---DEEECRSRIRHAQEND---ITHFYMLTIDK 1215

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE +   V+G  ++G++ V  I  G E+
Sbjct: 1216 DR---------IIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPAGTEL 1266

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1267 TFNYNLDCLGNEK---TVCRCGAPNCSG 1291


>gi|128485462|ref|NP_001076020.1| probable histone-lysine N-methyltransferase NSD2 [Danio rerio]
          Length = 1461

 Score = 68.6 bits (166), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 43/148 (29%), Positives = 78/148 (52%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G++  ++   GE  FV E++GE+       ++++    ++   E+    FY + +++
Sbjct: 1164 KGWGLISLRDIKKGE--FVNEYVGEL------IDEEECRSRIRHAQENDITHFYMLTIDK 1215

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE +   V+G  ++G++ V  I  G E+
Sbjct: 1216 DR---------IIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPAGTEL 1266

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1267 TFNYNLDCLGNEK---TVCRCGAPNCSG 1291


>gi|443714650|gb|ELU06966.1| hypothetical protein CAPTEDRAFT_176480 [Capitella teleta]
          Length = 936

 Score = 68.6 bits (166), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 44/155 (28%), Positives = 74/155 (47%), Gaps = 20/155 (12%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +K+V   +G GV    +       ++ E+LGEV         ++  R    ++   AP  
Sbjct: 224  EKFVTADRGHGV--RSKHPLVNGQYICEYLGEVV-------SEEEFRRRMADDYSAAPHH 274

Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
            Y + L+            V+D     + +  I HSC PNCE +   ++G Y+I +++++ 
Sbjct: 275  YCLNLD---------SGTVIDGYRMGSISRFINHSCEPNCEMQKWNINGVYRIALFSLKD 325

Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            I  GEE+T+DYN   +S   +   +C CGS  CRG
Sbjct: 326  IPPGEELTYDYN--FQSYNVHSQQICKCGSANCRG 358


>gi|321468162|gb|EFX79148.1| hypothetical protein DAPPUDRAFT_319776 [Daphnia pulex]
          Length = 1408

 Score = 68.6 bits (166), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 46/149 (30%), Positives = 75/149 (50%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG+G+   ++   G  DF++E++GEV    + F ++    + +KN       +Y + L 
Sbjct: 476  KKGVGLRALQDMDPG--DFIIEYVGEVIDP-REFHRRAKDYAREKNKH-----YYFMAL- 526

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K DA      ++DA  + N +  I HSC PN E +   V+G  ++G +  + +  G+E
Sbjct: 527  --KSDA------IIDATQQGNVSRFINHSCDPNAETQKWTVNGDLRVGFFARKSLKSGDE 578

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +TFDY      K   EA  C C S  CRG
Sbjct: 579  VTFDYQFQRYGK---EAQRCYCESSNCRG 604


>gi|195145308|ref|XP_002013638.1| GL23289 [Drosophila persimilis]
 gi|194102581|gb|EDW24624.1| GL23289 [Drosophila persimilis]
          Length = 293

 Score = 68.6 bits (166), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 55/151 (36%), Positives = 72/151 (47%), Gaps = 23/151 (15%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C K+   GE   V+E+ GE+            IRS   +  +   +   I     
Sbjct: 166  GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 211

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            K D    D +VVDA  + N A  I HSC PNC +KV  + GH  I I+ +R I  GEE+T
Sbjct: 212  KID----DNLVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 267

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    E     E   C CGS+ CR  YLN
Sbjct: 268  YDYKFPFED----EKIPCSCGSKRCR-KYLN 293


>gi|297842509|ref|XP_002889136.1| hypothetical protein ARALYDRAFT_476894 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297334977|gb|EFH65395.1| hypothetical protein ARALYDRAFT_476894 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 1766

 Score = 68.2 bits (165), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 43/134 (32%), Positives = 66/134 (49%), Gaps = 17/134 (12%)

Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
            E  F++E++GEV  +  +  +Q       + +      FY + L       +G +  V+D
Sbjct: 1048 EGQFLIEYVGEVLDMQSYDTRQKEYACKGQKH------FYFMTL-------NGNE--VID 1092

Query: 1965 AMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY 2024
            A  K N    I HSC PNC  +   V+G   +GI++++ +  G+E+TFDYN V       
Sbjct: 1093 AGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMKDLKKGQELTFDYNYVRVFG--A 1150

Query: 2025 EASVCLCGSQVCRG 2038
             A  C CGS  CRG
Sbjct: 1151 AAKKCYCGSSHCRG 1164


>gi|15292119|gb|AAK93328.1| LD39445p [Drosophila melanogaster]
          Length = 751

 Score = 68.2 bits (165), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 54/151 (35%), Positives = 71/151 (47%), Gaps = 23/151 (15%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C K+   GE   V+E+ GE+            IRS   +  +   +   I     
Sbjct: 624  GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 669

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            K D    D +VVDA  + N A  I H C PNC +KV  + GH  I I+ +R I  GEE+T
Sbjct: 670  KID----DNLVVDATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 725

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    E     E   C CGS+ CR  YLN
Sbjct: 726  YDYKFPFED----EKIPCSCGSKRCR-KYLN 751


>gi|194746360|ref|XP_001955648.1| GF16138 [Drosophila ananassae]
 gi|190628685|gb|EDV44209.1| GF16138 [Drosophila ananassae]
          Length = 1460

 Score = 68.2 bits (165), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 76/151 (50%), Gaps = 25/151 (16%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G+VC +     E  FV+E++GEV       E Q+ +   Q+N ++    +Y + +E+
Sbjct: 1253 RGFGLVCREP--IAEGTFVIEYVGEVI---NHAEFQERLIQKQRNRDE---NYYFLGVEK 1304

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
                       ++DA  K N A  + HSC PNCE +   V+  +++GI+ ++ I    E+
Sbjct: 1305 D---------FIIDAGPKGNLARFMNHSCEPNCETQKWTVNCVHRVGIFAIKDIPANTEL 1355

Query: 2011 TFDY---NSVTESKEEYEASVCLCGSQVCRG 2038
            TF+Y   + +  SK+      C CG+  C G
Sbjct: 1356 TFNYLWDDLMNNSKK-----ACFCGATRCSG 1381


>gi|86278478|gb|ABC88477.1| Wolf-Hirschhorn syndrome candidate 1 protein [Danio rerio]
          Length = 1366

 Score = 68.2 bits (165), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 43/148 (29%), Positives = 78/148 (52%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G++  ++   GE  FV E++GE+       ++++    ++   E+    FY + +++
Sbjct: 1069 KGWGLISLRDIKKGE--FVNEYVGEL------IDEEECRSRIRHAQENDITHFYMLTIDK 1120

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE +   V+G  ++G++ V  I  G E+
Sbjct: 1121 DR---------IIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPAGTEL 1171

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1172 TFNYNLDCLGNEK---TVCRCGAPNCSG 1196


>gi|15232214|ref|NP_191555.1| putative histone-lysine N-methyltransferase ASHH4 [Arabidopsis
            thaliana]
 gi|75264575|sp|Q9M1X9.1|ASHH4_ARATH RecName: Full=Putative histone-lysine N-methyltransferase ASHH4;
            AltName: Full=ASH1 homolog 4; AltName: Full=Protein SET
            DOMAIN GROUP 24
 gi|7019690|emb|CAB75815.1| putative protein [Arabidopsis thaliana]
 gi|332646470|gb|AEE79991.1| putative histone-lysine N-methyltransferase ASHH4 [Arabidopsis
            thaliana]
          Length = 352

 Score = 68.2 bits (165), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 71/146 (48%), Gaps = 21/146 (14%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+V +++   GE  F++E++GEV       + +     L K N      FY   +   
Sbjct: 122  GYGIVADEDINSGE--FIIEYVGEV------IDDKICEERLWKLNHKVETNFYLCQINWN 173

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
                     +V+DA HK N +  I HSC PN E +   +DG  +IGI+  R I+ GE++T
Sbjct: 174  ---------MVIDATHKGNKSRYINHSCSPNTEMQKWIIDGETRIGIFATRFINKGEQLT 224

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCR 2037
            +DY  V    ++     C CG+  CR
Sbjct: 225  YDYQFVQFGADQD----CYCGAVCCR 246


>gi|168037139|ref|XP_001771062.1| histone H3 methyltransferase complex, subunit SET1 [Physcomitrella
            patens subsp. patens]
 gi|162677595|gb|EDQ64063.1| histone H3 methyltransferase complex, subunit SET1 [Physcomitrella
            patens subsp. patens]
          Length = 2607

 Score = 68.2 bits (165), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 45/130 (34%), Positives = 64/130 (49%), Gaps = 23/130 (17%)

Query: 1906 DDFVVEFLGEVY--PVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVV 1963
            +DFV+E++GE+    V  + E+Q  I  +  +           YL R        D +VV
Sbjct: 2496 EDFVIEYVGEIIRRQVSNFRERQYEIMGIGSS-----------YLFRVD------DELVV 2538

Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
            DA  K   A  I HSC PNC  K+  V+G  ++ IY+ R I  GEE+T+DY    E K+ 
Sbjct: 2539 DATQKGGLARFINHSCNPNCYTKIITVEGRKKVVIYSKRAIGAGEELTYDYKFSLEDKK- 2597

Query: 2024 YEASVCLCGS 2033
                 C CG+
Sbjct: 2598 ---IPCYCGA 2604


>gi|240254387|ref|NP_177854.6| histone-lysine N-methyltransferase SETD2 [Arabidopsis thaliana]
 gi|157734196|gb|ABV68921.1| SDG8 [Arabidopsis thaliana]
 gi|332197839|gb|AEE35960.1| histone-lysine N-methyltransferase SETD2 [Arabidopsis thaliana]
          Length = 1805

 Score = 68.2 bits (165), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 46/155 (29%), Positives = 77/155 (49%), Gaps = 19/155 (12%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ + +KG G+   ++    E  F++E++GEV  +  +  +Q       + +      F
Sbjct: 1029 ERFQSGKKGYGLRLLED--VREGQFLIEYVGEVLDMQSYETRQKEYAFKGQKH------F 1080

Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
            Y + L       +G +  V+DA  K N    I HSC PNC  +   V+G   +GI++++ 
Sbjct: 1081 YFMTL-------NGNE--VIDAGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQD 1131

Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +  G+E+TFDYN V        A  C CGS  CRG
Sbjct: 1132 LKKGQELTFDYNYVRVFG--AAAKKCYCGSSHCRG 1164


>gi|406606267|emb|CCH42258.1| Histone-lysine N-methyltransferase [Wickerhamomyces ciferrii]
          Length = 1074

 Score = 68.2 bits (165), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 72/149 (48%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G++  +   F     VVE+ GEV       E +  + ++ K ++     +Y + LE
Sbjct: 255  KKGCGLLSIR--SFNAGSLVVEYTGEVI---HLDEVEHRLNTIYKESDS----YYFLGLE 305

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
                     +  V+DA  K + A    HSC PN E +   V+G  +IG++  R I  GEE
Sbjct: 306  ---------EEYVIDAGQKGSVARFANHSCDPNAEMQKWYVNGEPRIGLFAKRSIEAGEE 356

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            IT+DYN   E  E  E   C CGS+ C G
Sbjct: 357  ITYDYN--FEWFENGEPQKCYCGSKNCHG 383


>gi|317455359|pdb|3OPE|A Chain A, Structural Basis Of Auto-Inhibitory Mechanism Of Histone
            Methyltransferase
 gi|317455360|pdb|3OPE|B Chain B, Structural Basis Of Auto-Inhibitory Mechanism Of Histone
            Methyltransferase
          Length = 222

 Score = 68.2 bits (165), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 49/160 (30%), Positives = 80/160 (50%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV                         EF
Sbjct: 77   ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVVS---------------------EQEF 113

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 114  RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 173

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 174  YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 211


>gi|3540208|gb|AAC34358.1| Hypothetical protein [Arabidopsis thaliana]
          Length = 1767

 Score = 68.2 bits (165), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 43/134 (32%), Positives = 66/134 (49%), Gaps = 17/134 (12%)

Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
            E  F++E++GEV  +  +  +Q       + +      FY + L       +G +  V+D
Sbjct: 1048 EGQFLIEYVGEVLDMQSYETRQKEYAFKGQKH------FYFMTL-------NGNE--VID 1092

Query: 1965 AMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY 2024
            A  K N    I HSC PNC  +   V+G   +GI++++ +  G+E+TFDYN V       
Sbjct: 1093 AGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKKGQELTFDYNYVRVFG--A 1150

Query: 2025 EASVCLCGSQVCRG 2038
             A  C CGS  CRG
Sbjct: 1151 AAKKCYCGSSHCRG 1164


>gi|198432159|ref|XP_002123225.1| PREDICTED: similar to Wolf-Hirschhorn syndrome candidate 1 protein,
            partial [Ciona intestinalis]
          Length = 752

 Score = 68.2 bits (165), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 72/132 (54%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       + ++ +R ++  +++    FY + +++ +         ++DA 
Sbjct: 305  EFVSEYVGEL------VDSEECMRRIEDAHKNNVTNFYMLTIDKDR---------IIDAG 349

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NY+  + HSC PNCE +   V+G  ++G++ +R I  GEE+ F+YN      ++   
Sbjct: 350  PKGNYSRFMNHSCDPNCETQKWMVNGDTRVGLFALREIQDGEELMFNYNLDCLGNDK--- 406

Query: 2027 SVCLCGSQVCRG 2038
            + C+CGS  C G
Sbjct: 407  TPCMCGSANCSG 418


>gi|94707110|sp|Q2LAE1.1|ASHH2_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHH2; AltName:
            Full=ASH1 homolog 2; AltName: Full=H3-K4-HMTase; AltName:
            Full=Histone H3-K36 methyltransferase 8;
            Short=H3-K36-HMTase 8; AltName: Full=Protein EARLY
            FLOWERING IN SHORT DAYS; AltName: Full=Protein SET DOMAIN
            GROUP 8
 gi|85036158|gb|ABC69038.1| SDG8 [Arabidopsis thaliana]
          Length = 1759

 Score = 68.2 bits (165), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 43/134 (32%), Positives = 66/134 (49%), Gaps = 17/134 (12%)

Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
            E  F++E++GEV  +  +  +Q       + +      FY + L       +G +  V+D
Sbjct: 1048 EGQFLIEYVGEVLDMQSYETRQKEYAFKGQKH------FYFMTL-------NGNE--VID 1092

Query: 1965 AMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY 2024
            A  K N    I HSC PNC  +   V+G   +GI++++ +  G+E+TFDYN V       
Sbjct: 1093 AGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKKGQELTFDYNYVRVFG--A 1150

Query: 2025 EASVCLCGSQVCRG 2038
             A  C CGS  CRG
Sbjct: 1151 AAKKCYCGSSHCRG 1164


>gi|50293843|ref|XP_449333.1| hypothetical protein [Candida glabrata CBS 138]
 gi|74637287|sp|Q6FKB1.1|SET1_CANGA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|49528646|emb|CAG62307.1| unnamed protein product [Candida glabrata]
          Length = 1111

 Score = 68.2 bits (165), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 46/82 (56%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C P+C AK+  V G  +I IY +R I   EE+T+DY    E+
Sbjct: 1032 TVIDATKKGGIARFINHCCEPSCTAKIIKVGGKRRIVIYALRDIAANEELTYDYKFERET 1091

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              E E   CLCG+  C+G +LN
Sbjct: 1092 DAE-ERLPCLCGAPSCKG-FLN 1111


>gi|254569422|ref|XP_002491821.1| hypothetical protein [Komagataella pastoris GS115]
 gi|238031618|emb|CAY69541.1| hypothetical protein PAS_chr2-2_0494 [Komagataella pastoris GS115]
 gi|328351679|emb|CCA38078.1| histone-lysine N-methyltransferase SETD1 [Komagataella pastoris CBS
            7435]
          Length = 1020

 Score = 68.2 bits (165), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 46/82 (56%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C+P+C AK+  V+G  +I IY ++ I   EE+T+DY    E 
Sbjct: 941  TVIDATKKGGIARFINHCCQPSCTAKIIKVEGKKRIVIYALKDIAANEELTYDYKFERED 1000

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              E E   CLCG   C+G YLN
Sbjct: 1001 NNE-ERIPCLCGVPGCKG-YLN 1020


>gi|145327721|ref|NP_001077836.1| histone-lysine N-methyltransferase SETD2 [Arabidopsis thaliana]
 gi|332197840|gb|AEE35961.1| histone-lysine N-methyltransferase SETD2 [Arabidopsis thaliana]
          Length = 1501

 Score = 68.2 bits (165), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 42/134 (31%), Positives = 64/134 (47%), Gaps = 17/134 (12%)

Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
            E  F++E++GEV  +  +  +Q      ++        FY + L   +         V+D
Sbjct: 1048 EGQFLIEYVGEVLDMQSYETRQ------KEYAFKGQKHFYFMTLNGNE---------VID 1092

Query: 1965 AMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY 2024
            A  K N    I HSC PNC  +   V+G   +GI++++ +  G+E+TFDYN V       
Sbjct: 1093 AGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKKGQELTFDYNYVRVFG--A 1150

Query: 2025 EASVCLCGSQVCRG 2038
             A  C CGS  CRG
Sbjct: 1151 AAKKCYCGSSHCRG 1164


>gi|295663144|ref|XP_002792125.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226279300|gb|EEH34866.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 816

 Score = 68.2 bits (165), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 46/149 (30%), Positives = 80/149 (53%), Gaps = 22/149 (14%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G GV  N+   F  +  +VE+ GE+    K  E++  +R++ KNNE     +Y +Y ++
Sbjct: 436  RGYGVRSNRT--FEPNQIIVEYTGEIV-TQKECERR--MRTVYKNNEC----YYLMYFDQ 486

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVR-GIHYGEE 2009
                      +++DA  + + A  + HSC PNCE +   V G  ++ ++  + GI  GEE
Sbjct: 487  N---------MIIDAT-RGSIARFVNHSCEPNCEMEKWTVAGKPRMALFAGKNGITTGEE 536

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +T+DYN    S++  +   C CG++ CRG
Sbjct: 537  LTYDYNFDPYSQKNVQE--CRCGAETCRG 563


>gi|401625463|gb|EJS43472.1| set1p [Saccharomyces arboricola H-6]
          Length = 1089

 Score = 67.8 bits (164), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C PNC AK+  V G  +I IY +R I   EE+T+DY    E 
Sbjct: 1010 TVIDATKKGGIARFINHCCDPNCTAKIIKVGGRRRIVIYALRDIGANEELTYDYKFEREQ 1069

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +E E   CLCG+  C+G +LN
Sbjct: 1070 DDE-ERLPCLCGAPNCKG-FLN 1089


>gi|302757968|ref|XP_002962407.1| hypothetical protein SELMODRAFT_438147 [Selaginella moellendorffii]
 gi|300169268|gb|EFJ35870.1| hypothetical protein SELMODRAFT_438147 [Selaginella moellendorffii]
          Length = 1326

 Score = 67.8 bits (164), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 54/181 (29%), Positives = 82/181 (45%), Gaps = 34/181 (18%)

Query: 1873 GILKAMDSRPDDKYVAYRKGLGVVCNKEGGFG--------EDDFVVEFLGEVYPVWKWFE 1924
            G+ K   SR  +K  A +K L    +K   +G         +DF+VE++GEV        
Sbjct: 1169 GVRKLGGSRAMEKMRARKKLLKFQRSKIHAWGVVAMEVIEPEDFIVEYVGEVL-----RP 1223

Query: 1925 KQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---DLVVVDAMHKANYASRICHSCRP 1981
            K   +R ++             YL +  G +  +   D  V+DA  +      I HSC P
Sbjct: 1224 KVADVREVR-------------YLRQGLGSSYFFRVGDGFVIDATQRGGLGRFINHSCEP 1270

Query: 1982 NCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            NC  K+  V+G  ++ IY    I  G E+T+DY    E ++      CLCG++ CRG +L
Sbjct: 1271 NCYPKIITVEGQKRVFIYARTHIAPGTELTYDYKFPHEDQK----IPCLCGAERCRG-FL 1325

Query: 2042 N 2042
            N
Sbjct: 1326 N 1326


>gi|326432726|gb|EGD78296.1| hypothetical protein PTSG_09362 [Salpingoeca sp. ATCC 50818]
          Length = 1279

 Score = 67.8 bits (164), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 49/163 (30%), Positives = 73/163 (44%), Gaps = 19/163 (11%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            ++   KG G+   ++    E  FV+E++GE+          D     ++     A   ++
Sbjct: 1068 FLTQSKGWGLKAGED--IAEGQFVIEYVGEII---------DATECRRRLAASQAANDHS 1116

Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
             Y+    G +       VDA +KAN A  I HSC PNCE +   V G  ++GI+    I 
Sbjct: 1117 FYILSLSGSS------FVDARNKANLARFINHSCGPNCETQKWNVLGETRVGIFAKEDIP 1170

Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGA 2048
             G E+TFDY    +S      + C CG+  CRG    L  E A
Sbjct: 1171 KGTELTFDYQ--LDSLGSRGRTTCHCGASSCRGVIEKLGREAA 1211


>gi|270015132|gb|EFA11580.1| trithorax [Tribolium castaneum]
          Length = 2343

 Score = 67.8 bits (164), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 53/159 (33%), Positives = 77/159 (48%), Gaps = 35/159 (22%)

Query: 1889 YRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN--- 1945
            +R+GL  + + E G    + V+E+ GEV            IRS+  +  +   ++YN   
Sbjct: 2215 HRRGLFCLRDFEAG----EMVIEYSGEV------------IRSVLTDKRE---KYYNSKG 2255

Query: 1946 --IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
               Y+ R        D +VVDA    N A  I HSC PNC +KV  + GH  I I+ +R 
Sbjct: 2256 IGCYMFRID------DNLVVDATMTGNAARFINHSCDPNCYSKVVEILGHKHIIIFALRR 2309

Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            I  GEE+T+DY    E     +   C CG++ CR  +LN
Sbjct: 2310 IICGEELTYDYKFPIEE----DKIPCTCGTRRCR-KFLN 2343


>gi|340718068|ref|XP_003397494.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
            H3 lysine-36 and H4 lysine-20 specific-like [Bombus
            terrestris]
          Length = 1238

 Score = 67.8 bits (164), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 66/131 (50%), Gaps = 18/131 (13%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            FV+E++GEV       ++ +  R L +  E     FY + ++  +         ++DA  
Sbjct: 868  FVIEYVGEV------IDEAEYKRRLHRKKELKNENFYFLTIDNNR---------MIDAEP 912

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
            K N +  + HSC PNCE +   V+G  +IG++ +  I  GEE+TF+YN   + +      
Sbjct: 913  KGNLSRFMNHSCSPNCETQKWTVNGDTRIGLFALCDIECGEELTFNYNLACDGETR---K 969

Query: 2028 VCLCGSQVCRG 2038
             CLCG+  C G
Sbjct: 970  PCLCGASNCSG 980


>gi|31418293|gb|AAH53454.1| Whsc1 protein, partial [Mus musculus]
          Length = 558

 Score = 67.8 bits (164), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 266  KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 317

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 318  DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 368

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 369  TFNYNLDCLGNEK---TVCRCGASNCSG 393


>gi|452837203|gb|EME39145.1| hypothetical protein DOTSEDRAFT_75034 [Dothistroma septosporum NZE10]
          Length = 1275

 Score = 67.8 bits (164), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 46/143 (32%), Positives = 71/143 (49%), Gaps = 17/143 (11%)

Query: 1900 EGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYD 1959
            E   G ++ ++E++GE     K  +K   +R ++   +       + YL R   D     
Sbjct: 1150 EENIGINELIIEYVGE-----KVRQKVADMREIKYEKQG----VGSSYLFRMMDDE---- 1196

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
              +VDA  K   A  I HSC PNC AK+  V+G  +I IY ++ I+  +E+T+DY    E
Sbjct: 1197 --IVDATKKGGIARFINHSCDPNCTAKIIKVEGTPRIVIYALKDIYKNDELTYDYKFERE 1254

Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
                 +   CLCGS  C+G +LN
Sbjct: 1255 IG-STDRIPCLCGSANCKG-FLN 1275


>gi|441664377|ref|XP_003279042.2| PREDICTED: histone-lysine N-methyltransferase NSD2-like [Nomascus
            leucogenys]
          Length = 780

 Score = 67.8 bits (164), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 568  KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 619

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 620  DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 670

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 671  TFNYNLDCLGNEK---TVCRCGASNCSG 695


>gi|452846178|gb|EME48111.1| hypothetical protein DOTSEDRAFT_167709 [Dothistroma septosporum
            NZE10]
          Length = 963

 Score = 67.8 bits (164), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 52/174 (29%), Positives = 80/174 (45%), Gaps = 27/174 (15%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVY--PVWKWFEKQDGIRSLQKNNEDPAPEFYNIY 1947
            +KG G+  +KE   G  DFV E++GEV    V++        R +Q+ +E+    FY  +
Sbjct: 218  KKGYGLRADKELRPG--DFVYEYIGEVIGENVFR--------RRMQQYDEEGIKHFY--F 265

Query: 1948 LERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYG 2007
            +   KG+        VDA  K N      HSC PNC      V+   ++GI+  R I  G
Sbjct: 266  MSLTKGE-------FVDATKKGNLGRFCNHSCNPNCYVDKWVVNDKLRMGIFVERNIQAG 318

Query: 2008 EEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLD 2061
            EE+ F+YN     +   +   C CG   C G    + G+   E+  K  H +++
Sbjct: 319  EELVFNYNV---DRYGADPQPCYCGEPNCTGY---IGGKTQTERGTKLSHTIIE 366


>gi|120974668|gb|ABM46716.1| MLL [Gorilla gorilla]
          Length = 338

 Score = 67.8 bits (164), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 50/151 (33%), Positives = 70/151 (46%), Gaps = 21/151 (13%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   +   I     
Sbjct: 209  GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKREKYYDSKGIGCYMF 254

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            + D    D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ GEE+T
Sbjct: 255  RID----DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELT 310

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    E         C CG++ CR  +LN
Sbjct: 311  YDYKFPIEDAS--NKLPCNCGAKKCR-KFLN 338


>gi|198451130|ref|XP_001358254.2| GA18567 [Drosophila pseudoobscura pseudoobscura]
 gi|198131348|gb|EAL27392.2| GA18567 [Drosophila pseudoobscura pseudoobscura]
          Length = 1541

 Score = 67.8 bits (164), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 45/160 (28%), Positives = 80/160 (50%), Gaps = 24/160 (15%)

Query: 1881 RPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            R D  Y+  R G G+VC +     E DF++E++GEV        +++  R + +  +D  
Sbjct: 1345 RMDVVYMNAR-GFGLVCREP--IAEGDFIIEYVGEV------INQEEFQRRMLRKQKDRD 1395

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + +E+           ++DA  K N A  + HSC PNC ++   V+   ++G++ 
Sbjct: 1396 ENFYFLGVEKE---------FIIDAGPKGNLARFMNHSCEPNCTSQKWTVNCTNRVGLFA 1446

Query: 2001 VRGIHYGEEITFDY--NSVTESKEEYEASVCLCGSQVCRG 2038
            ++ I    E+TF+Y  + +   K++     C CGS+ C G
Sbjct: 1447 IQDIPAETELTFNYLWDDLLNDKKK----ACYCGSERCSG 1482


>gi|91076142|ref|XP_970289.1| PREDICTED: similar to mixed-lineage leukemia protein, mll [Tribolium
            castaneum]
          Length = 1824

 Score = 67.8 bits (164), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 53/159 (33%), Positives = 77/159 (48%), Gaps = 35/159 (22%)

Query: 1889 YRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN--- 1945
            +R+GL  + + E G    + V+E+ GEV            IRS+  +  +   ++YN   
Sbjct: 1696 HRRGLFCLRDFEAG----EMVIEYSGEV------------IRSVLTDKRE---KYYNSKG 1736

Query: 1946 --IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
               Y+ R        D +VVDA    N A  I HSC PNC +KV  + GH  I I+ +R 
Sbjct: 1737 IGCYMFRID------DNLVVDATMTGNAARFINHSCDPNCYSKVVEILGHKHIIIFALRR 1790

Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            I  GEE+T+DY    E     +   C CG++ CR  +LN
Sbjct: 1791 IICGEELTYDYKFPIEE----DKIPCTCGTRRCR-KFLN 1824


>gi|62531333|gb|AAH93421.1| Whsc1 protein [Danio rerio]
          Length = 320

 Score = 67.8 bits (164), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 47/148 (31%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G++  ++   GE  FV E++GE+       E +  IR+ Q+N+      FY + +++
Sbjct: 23   KGWGLISLRDIKKGE--FVNEYVGELI---DEEECRSRIRNAQEND---ITHFYMLTIDK 74

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE +   V+G  ++G++ V  I  G E+
Sbjct: 75   DR---------IIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPAGTEL 125

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 126  TFNYNLDCLGNEK---TVCRCGAPNCSG 150


>gi|194380712|dbj|BAG58509.1| unnamed protein product [Homo sapiens]
          Length = 323

 Score = 67.8 bits (164), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 50/151 (33%), Positives = 71/151 (47%), Gaps = 21/151 (13%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   +   I     
Sbjct: 194  GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKREKYYDSKGIGCYMF 239

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            + D    D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ GEE+T
Sbjct: 240  RID----DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELT 295

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    E  +      C CG++ CR  +LN
Sbjct: 296  YDYKFPIE--DASNKLPCNCGAKKCR-KFLN 323


>gi|124111218|gb|ABM91999.1| MLL [Pan troglodytes]
          Length = 338

 Score = 67.8 bits (164), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 50/151 (33%), Positives = 71/151 (47%), Gaps = 21/151 (13%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   +   I     
Sbjct: 209  GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKREKYYDSKGIGCYMF 254

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            + D    D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ GEE+T
Sbjct: 255  RID----DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELT 310

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    E  +      C CG++ CR  +LN
Sbjct: 311  YDYKFPIE--DASNKLPCNCGAKKCR-KFLN 338


>gi|241554585|ref|XP_002399516.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
 gi|215501703|gb|EEC11197.1| mixed-lineage leukemia protein, mll, putative [Ixodes scapularis]
          Length = 544

 Score = 67.8 bits (164), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 48/81 (59%), Gaps = 5/81 (6%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            VVDA    N A  I HSC PNC +KV AV G   I IY +R I+ GEE+T+DY      K
Sbjct: 469  VVDATTHGNAARFINHSCDPNCYSKVIAVFGQKHIIIYALRKIYKGEELTYDYKF---PK 525

Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
            EE +   C CG++ CR  +LN
Sbjct: 526  EEVKIP-CSCGARRCR-KFLN 544


>gi|363756170|ref|XP_003648301.1| hypothetical protein Ecym_8199 [Eremothecium cymbalariae DBVPG#7215]
 gi|356891501|gb|AET41484.1| Hypothetical protein Ecym_8199 [Eremothecium cymbalariae DBVPG#7215]
          Length = 995

 Score = 67.8 bits (164), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 35/84 (41%), Positives = 47/84 (55%), Gaps = 2/84 (2%)

Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            +  V+DA  K   A  I H C P+C AK+  V G  +I IY +R I   EE+T+DY    
Sbjct: 914  EYTVIDATKKGGIARFINHCCDPSCTAKIIKVGGRKRIVIYALRDIAANEELTYDYKFER 973

Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
            E  +E E   CLCG+  C+G +LN
Sbjct: 974  EVDDE-ERLPCLCGAATCKG-FLN 995


>gi|26347387|dbj|BAC37342.1| unnamed protein product [Mus musculus]
          Length = 601

 Score = 67.8 bits (164), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 309  KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 360

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 361  DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 411

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 412  TFNYNLDCLGNEK---TVCRCGASNCSG 436


>gi|168057166|ref|XP_001780587.1| trithorax-like protein, histone-lysine N-methyltransferase
            [Physcomitrella patens subsp. patens]
 gi|162667953|gb|EDQ54570.1| trithorax-like protein, histone-lysine N-methyltransferase
            [Physcomitrella patens subsp. patens]
          Length = 902

 Score = 67.8 bits (164), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 37/99 (37%), Positives = 50/99 (50%), Gaps = 4/99 (4%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            VVDA H    A  I HSC PNC ++     G  +I I+  R I  GEE+T+DY  +++  
Sbjct: 797  VVDATHAGTIAHLINHSCEPNCYSRTVTASGEDRIIIFAKRNIEVGEELTYDYRFMSKD- 855

Query: 2022 EEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLL 2060
               E   C CG   CRGS   + G+G   K+   L  L+
Sbjct: 856  ---EVLTCYCGCAGCRGSVNVVDGDGDSTKLSVPLSELI 891


>gi|380797995|gb|AFE70873.1| putative histone-lysine N-methyltransferase NSD2 isoform 1, partial
            [Macaca mulatta]
          Length = 421

 Score = 67.8 bits (164), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 129  KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 180

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 181  DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 231

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 232  TFNYNLDCLGNEK---TVCRCGASNCSG 256


>gi|302821061|ref|XP_002992195.1| hypothetical protein SELMODRAFT_430432 [Selaginella moellendorffii]
 gi|300139962|gb|EFJ06692.1| hypothetical protein SELMODRAFT_430432 [Selaginella moellendorffii]
          Length = 1052

 Score = 67.8 bits (164), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 44/78 (56%), Gaps = 4/78 (5%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            VVDA H  + A  I HSC PNC +++  VD    I I+  R IH  EE+T+DY   ++  
Sbjct: 953  VVDATHVGSMAHLINHSCEPNCYSRIITVDAKDSIIIFAKRDIHPWEELTYDYRFASKGA 1012

Query: 2022 EEYEASVCLCGSQVCRGS 2039
            E     VC CG+  CRGS
Sbjct: 1013 E----LVCNCGALKCRGS 1026


>gi|302800676|ref|XP_002982095.1| hypothetical protein SELMODRAFT_445108 [Selaginella moellendorffii]
 gi|300150111|gb|EFJ16763.1| hypothetical protein SELMODRAFT_445108 [Selaginella moellendorffii]
          Length = 1045

 Score = 67.8 bits (164), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 44/78 (56%), Gaps = 4/78 (5%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            VVDA H  + A  I HSC PNC +++  VD    I I+  R IH  EE+T+DY   ++  
Sbjct: 946  VVDATHVGSMAHLINHSCEPNCYSRIITVDAKDSIIIFAKRDIHPWEELTYDYRFASKGA 1005

Query: 2022 EEYEASVCLCGSQVCRGS 2039
            E     VC CG+  CRGS
Sbjct: 1006 E----LVCNCGALKCRGS 1019


>gi|270001477|gb|EEZ97924.1| hypothetical protein TcasGA2_TC000311 [Tribolium castaneum]
          Length = 1647

 Score = 67.8 bits (164), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 47/156 (30%), Positives = 79/156 (50%), Gaps = 22/156 (14%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +K++   KG GV        GE  F++E++GEV    ++ E+   I     ++       
Sbjct: 916  EKFMTENKGWGVRTKLPIKSGE--FILEYVGEVVSDQEFKERMATIYVNDTHH------- 966

Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
            Y ++L       DG   +V+D          + HSC+PNCE +  +V+G +++ ++ +R 
Sbjct: 967  YCLHL-------DGG--LVIDGHRMGGDGRFVNHSCQPNCEMQKWSVNGQFRMALFALRD 1017

Query: 2004 IHYGEEITFDYN-SVTESKEEYEASVCLCGSQVCRG 2038
            I   EE+T+DYN S+    E  E   C CGS++CRG
Sbjct: 1018 IESSEELTYDYNFSLFNPAEGQE---CKCGSEMCRG 1050


>gi|384500869|gb|EIE91360.1| hypothetical protein RO3G_16071 [Rhizopus delemar RA 99-880]
          Length = 883

 Score = 67.8 bits (164), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 44/138 (31%), Positives = 62/138 (44%), Gaps = 22/138 (15%)

Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVV 1962
               + F++E++GEV    ++         L +  E  A  F + Y    K D       +
Sbjct: 269  LSSNSFIMEYIGEVITQNEF---------LHRTREYDAQGFKHYYFMTLKNDE------I 313

Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
            +DA  K   A  + HSCRPNC  +   +    +IGI+T R I  GEE+TFDY       E
Sbjct: 314  IDATRKGCLARFMNHSCRPNCVTQKWVIGKKMRIGIFTSRNIKAGEELTFDYKF-----E 368

Query: 2023 EYEASV--CLCGSQVCRG 2038
             Y A    C CG   C+G
Sbjct: 369  RYGAVAQKCFCGEVNCKG 386


>gi|195570949|ref|XP_002103466.1| GD20433 [Drosophila simulans]
 gi|194199393|gb|EDX12969.1| GD20433 [Drosophila simulans]
          Length = 152

 Score = 67.8 bits (164), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 54/151 (35%), Positives = 71/151 (47%), Gaps = 23/151 (15%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C K+   GE   V+E+ GE+            IRS   +  +   +   I     
Sbjct: 25   GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 70

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            K D    D +VVDA  + N A  I H C PNC +KV  + GH  I I+ +R I  GEE+T
Sbjct: 71   KID----DNLVVDATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 126

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    E     E   C CGS+ CR  YLN
Sbjct: 127  YDYKFPFED----EKIPCSCGSKRCR-KYLN 152


>gi|255711468|ref|XP_002552017.1| KLTH0B05280p [Lachancea thermotolerans]
 gi|238933395|emb|CAR21579.1| KLTH0B05280p [Lachancea thermotolerans CBS 6340]
          Length = 986

 Score = 67.8 bits (164), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 34/82 (41%), Positives = 46/82 (56%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C P+C AK+  V G  +I IY +R I   EE+T+DY    E+
Sbjct: 907  TVIDATKKGGIARFINHCCDPSCTAKIIRVGGRKRIVIYALRDIAANEELTYDYKFERET 966

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +E E   C CG+  C+G +LN
Sbjct: 967  DDE-ERLPCFCGAPTCKG-FLN 986


>gi|350420881|ref|XP_003492659.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
            isoform 2 [Bombus impatiens]
          Length = 1239

 Score = 67.4 bits (163), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 66/131 (50%), Gaps = 18/131 (13%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            FV+E++GEV       ++ +  R L +  E     FY + ++  +         ++DA  
Sbjct: 869  FVIEYVGEV------IDEAEYKRRLHRKKELKNENFYFLTIDNNR---------MIDAEP 913

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
            K N +  + HSC PNCE +   V+G  +IG++ +  I  GEE+TF+YN   + +      
Sbjct: 914  KGNLSRFMNHSCSPNCETQKWTVNGDTRIGLFALCDIERGEELTFNYNLACDGETR---K 970

Query: 2028 VCLCGSQVCRG 2038
             CLCG+  C G
Sbjct: 971  PCLCGAPNCSG 981


>gi|452820773|gb|EME27811.1| histone-lysine N-methyltransferase isoform 1 [Galdieria sulphuraria]
          Length = 769

 Score = 67.4 bits (163), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 44/142 (30%), Positives = 70/142 (49%), Gaps = 28/142 (19%)

Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL----- 1960
            ++F++E++GE+            IR  QK +++    ++       +G  D Y       
Sbjct: 651  NEFIIEYVGEI------------IR--QKISDEREKRYFR------QGIGDSYMFRLDED 690

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA  K + A  + HSC  N  AK+  +D   +I  Y+ R I  GEEIT+DY   TE 
Sbjct: 691  QIIDATRKGSVARFVNHSCESNAVAKIITIDNSKKIVFYSKRLIRAGEEITYDYKFNTE- 749

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +E    +CLCG+  CR  +LN
Sbjct: 750  -DENNKILCLCGAPTCR-KFLN 769


>gi|157126650|ref|XP_001654691.1| mixed-lineage leukemia protein, mll [Aedes aegypti]
 gi|108873214|gb|EAT37439.1| AAEL010578-PA [Aedes aegypti]
          Length = 172

 Score = 67.4 bits (163), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 58/175 (33%), Positives = 79/175 (45%), Gaps = 23/175 (13%)

Query: 1868 MKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQD 1927
            M M    LK         Y ++  G G+ CN++   GE   V+E+ GE+           
Sbjct: 21   MAMRYRTLKETSKESVGVYRSHIHGRGLFCNRDIEAGE--MVIEYAGEL----------- 67

Query: 1928 GIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKV 1987
             IRS   +  +   +   I     K D    +  VVDA  + N A  I HSC PNC +KV
Sbjct: 68   -IRSTLTDKRERYYDSRGIGCYMFKID----EHFVVDATMRGNAARFINHSCEPNCYSKV 122

Query: 1988 TAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
              + GH  I I+ +R I  GEE+T+DY    E  +      C CGS+ CR  YLN
Sbjct: 123  VDILGHKHIIIFALRRIVQGEELTYDYKFPFEDVK----IPCSCGSKKCR-KYLN 172


>gi|432879768|ref|XP_004073538.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific-like [Oryzias latipes]
          Length = 2321

 Score = 67.4 bits (163), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 68/131 (51%), Gaps = 18/131 (13%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            FV E++GEV       ++++    ++   E     FY + L++ +         V+DA  
Sbjct: 1890 FVSEYVGEV------IDEEECRARIRHAQEHDICNFYMLTLDKDR---------VIDAGP 1934

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
            K N A  + HSC+PNCE +   V+G  ++G++ ++ I  GEE+TF+YN       +   +
Sbjct: 1935 KGNQARFMNHSCQPNCETQKWTVNGDTRVGLFALQDIAKGEELTFNYNLECRGNGK---T 1991

Query: 2028 VCLCGSQVCRG 2038
            VC CG+  C G
Sbjct: 1992 VCKCGAPNCSG 2002


>gi|350420879|ref|XP_003492658.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
            isoform 1 [Bombus impatiens]
          Length = 1230

 Score = 67.4 bits (163), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 66/131 (50%), Gaps = 18/131 (13%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            FV+E++GEV       ++ +  R L +  E     FY + ++  +         ++DA  
Sbjct: 860  FVIEYVGEV------IDEAEYKRRLHRKKELKNENFYFLTIDNNR---------MIDAEP 904

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
            K N +  + HSC PNCE +   V+G  +IG++ +  I  GEE+TF+YN   + +      
Sbjct: 905  KGNLSRFMNHSCSPNCETQKWTVNGDTRIGLFALCDIERGEELTFNYNLACDGETR---K 961

Query: 2028 VCLCGSQVCRG 2038
             CLCG+  C G
Sbjct: 962  PCLCGAPNCSG 972


>gi|452820772|gb|EME27810.1| histone-lysine N-methyltransferase isoform 2 [Galdieria sulphuraria]
          Length = 797

 Score = 67.4 bits (163), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 45/140 (32%), Positives = 71/140 (50%), Gaps = 24/140 (17%)

Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL---VV 1962
            ++F++E++GE+            IR  QK +++        Y  +  GD+  + L    +
Sbjct: 679  NEFIIEYVGEI------------IR--QKISDEREKR----YFRQGIGDSYMFRLDEDQI 720

Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
            +DA  K + A  + HSC  N  AK+  +D   +I  Y+ R I  GEEIT+DY   TE  +
Sbjct: 721  IDATRKGSVARFVNHSCESNAVAKIITIDNSKKIVFYSKRLIRAGEEITYDYKFNTE--D 778

Query: 2023 EYEASVCLCGSQVCRGSYLN 2042
            E    +CLCG+  CR  +LN
Sbjct: 779  ENNKILCLCGAPTCR-KFLN 797


>gi|12642795|gb|AAK00344.1|AF330040_1 IL-5 promoter REII-region-binding protein [Homo sapiens]
 gi|119602961|gb|EAW82555.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_g [Homo sapiens]
 gi|133777178|gb|AAH94825.2| Wolf-Hirschhorn syndrome candidate 1 [Homo sapiens]
          Length = 584

 Score = 67.4 bits (163), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 292  KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 343

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 344  DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 394

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 395  TFNYNLDCLGNEK---TVCRCGASNCSG 419


>gi|355729163|gb|AES09785.1| Wolf-Hirschhorn syndrome candidate 1 [Mustela putorius furo]
          Length = 409

 Score = 67.4 bits (163), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 118  KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 169

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 170  DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 220

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 221  TFNYNLDCLGNEK---TVCRCGASNCSG 245


>gi|432094921|gb|ELK26329.1| Histone-lysine N-methyltransferase SETD1B [Myotis davidii]
          Length = 1462

 Score = 67.4 bits (163), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 45/79 (56%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1386 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1445

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLC S+ CRG+
Sbjct: 1446 VK----IPCLCNSENCRGT 1460


>gi|297824409|ref|XP_002880087.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297325926|gb|EFH56346.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 363

 Score = 67.4 bits (163), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 69/146 (47%), Gaps = 21/146 (14%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+V  +E   GE  F++E++GEV       + +     L K        FY   + R 
Sbjct: 127  GSGIVAEEEIKPGE--FIIEYVGEV------IDDKTCEERLWKMKHRGETNFYLCEITRD 178

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
                     +V+DA HK N +  I HSC PN + +   +DG  +IGI+  RGI  GE +T
Sbjct: 179  ---------MVIDATHKGNKSRYINHSCNPNTQMQKWIIDGETRIGIFATRGIKKGEHLT 229

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCR 2037
            +DY  V    ++     C CG+  CR
Sbjct: 230  YDYQFVQFGADQD----CHCGAVGCR 251


>gi|195503632|ref|XP_002098733.1| GE10528 [Drosophila yakuba]
 gi|194184834|gb|EDW98445.1| GE10528 [Drosophila yakuba]
          Length = 1441

 Score = 67.4 bits (163), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 80/164 (48%), Gaps = 25/164 (15%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G+V N+E    E DFV+E++GEV          +  R +++   D    +Y + +E+
Sbjct: 1248 RGFGLV-NREP-IAEGDFVIEYVGEV------INHAEFQRRMEQKQRDRDENYYFLGVEK 1299

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
                       ++DA  K N A  + HSC PNCE +   V+  +++G++ ++ I    E+
Sbjct: 1300 D---------FIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGLFAIKDIPVNTEL 1350

Query: 2011 TFDY---NSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEK 2051
            TF+Y   + +  SK+      C CG+  C G       +GA ++
Sbjct: 1351 TFNYLWDDLMNNSKK-----ACFCGATRCSGEIGGKLKDGAVKE 1389


>gi|452980621|gb|EME80382.1| hypothetical protein MYCFIDRAFT_204567 [Pseudocercospora fijiensis
            CIRAD86]
          Length = 1200

 Score = 67.4 bits (163), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 74/146 (50%), Gaps = 23/146 (15%)

Query: 1900 EGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYD 1959
            E     +D ++E++GE     K  +K   +R ++ + +       + YL R   D     
Sbjct: 1075 EENIAVNDLIIEYVGE-----KVRQKVADMREIKYDKQG----VGSSYLFRMIDDE---- 1121

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
              +VDA  K   A  I HSC PNC AK+  V+G  +I IY ++ I   +E+T+DY    +
Sbjct: 1122 --IVDATKKGGIARFINHSCDPNCTAKIIKVEGTPRIVIYALKDIGKNDELTYDY----K 1175

Query: 2020 SKEEYEAS---VCLCGSQVCRGSYLN 2042
             + EY ++    CLCGS  C+G +LN
Sbjct: 1176 FEREYGSTDRIPCLCGSANCKG-FLN 1200


>gi|123454343|ref|XP_001314927.1| SET domain containing protein [Trichomonas vaginalis G3]
 gi|121897588|gb|EAY02704.1| SET domain containing protein [Trichomonas vaginalis G3]
          Length = 486

 Score = 67.4 bits (163), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 40/122 (32%), Positives = 62/122 (50%), Gaps = 7/122 (5%)

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            K D+D Y    +DA  +   A  I HSC PNCE+++  ++G + + +  ++ I+  EE+T
Sbjct: 349  KADSDHY----LDATFRGGIARWINHSCDPNCESRIIKLNGRFAVVLVAIKDINPCEELT 404

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACE 2071
            +DY    E   E +A  CLCGS  CRG +LN       +K   E+        ++L   E
Sbjct: 405  YDYKLPYEP--EDKAIKCLCGSPNCRG-WLNRDKNTLDDKTFSEVKFKNISEDVLLRLVE 461

Query: 2072 LN 2073
             N
Sbjct: 462  NN 463


>gi|397568484|gb|EJK46160.1| hypothetical protein THAOC_35187 [Thalassiosira oceanica]
          Length = 473

 Score = 67.4 bits (163), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 50/150 (33%), Positives = 67/150 (44%), Gaps = 19/150 (12%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA-PEFYNIYLE 1949
            KG G++     G    D V+E+ GEV       E     R      + P  P FY + L 
Sbjct: 299  KGWGLI--SVDGVKSGDLVIEYAGEVID-----ESTKESRLAAWTRDHPTDPNFYVMAL- 350

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
               G A  Y    +DA H AN A  + HSC PNC      V GH ++ I  VR +  GE 
Sbjct: 351  ---GQAGWY----IDARHVANQARFVNHSCDPNCRLVPLNVAGHMRVAIVAVRDVRPGEF 403

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGS 2039
            +++DY   T   + +    C CGS  CRG+
Sbjct: 404  LSYDYQFDTRQGDRF---TCRCGSSNCRGT 430


>gi|398394325|ref|XP_003850621.1| histone methyltransferase, partial [Zymoseptoria tritici IPO323]
 gi|339470500|gb|EGP85597.1| histone methyltransferase [Zymoseptoria tritici IPO323]
          Length = 1163

 Score = 67.4 bits (163), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 46/143 (32%), Positives = 70/143 (48%), Gaps = 17/143 (11%)

Query: 1900 EGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYD 1959
            E     +D ++E++GE     K  +K   +R ++   +       + YL R   D     
Sbjct: 1038 EENIAVNDLIIEYVGE-----KVRQKIADLREIRYEKQG----VGSSYLFRMIDDE---- 1084

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
              +VDA  K   A  I HSC PNC AK+  V+G  +I IY ++ I   +E+T+DY    E
Sbjct: 1085 --IVDATKKGGIARFINHSCSPNCTAKIIKVEGTPRIVIYALKDIGKNDELTYDYKFERE 1142

Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
              +  +   CLCGS  C+G +LN
Sbjct: 1143 M-DSTDRIPCLCGSANCKG-FLN 1163


>gi|91077840|ref|XP_971447.1| PREDICTED: similar to set domain protein [Tribolium castaneum]
          Length = 1549

 Score = 67.4 bits (163), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 47/156 (30%), Positives = 79/156 (50%), Gaps = 22/156 (14%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +K++   KG GV        GE  F++E++GEV    ++ E+   I     ++       
Sbjct: 818  EKFMTENKGWGVRTKLPIKSGE--FILEYVGEVVSDQEFKERMATIYVNDTHH------- 868

Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
            Y ++L       DG   +V+D          + HSC+PNCE +  +V+G +++ ++ +R 
Sbjct: 869  YCLHL-------DGG--LVIDGHRMGGDGRFVNHSCQPNCEMQKWSVNGQFRMALFALRD 919

Query: 2004 IHYGEEITFDYN-SVTESKEEYEASVCLCGSQVCRG 2038
            I   EE+T+DYN S+    E  E   C CGS++CRG
Sbjct: 920  IESSEELTYDYNFSLFNPAEGQE---CKCGSEMCRG 952


>gi|449458127|ref|XP_004146799.1| PREDICTED: uncharacterized protein LOC101220062 [Cucumis sativus]
          Length = 1289

 Score = 67.4 bits (163), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 51/134 (38%), Positives = 64/134 (47%), Gaps = 19/134 (14%)

Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDA 1965
            +DFV+E++GE+        +   IR  Q             YL R     DGY   VVDA
Sbjct: 1173 EDFVIEYVGELIR-----PRISDIRERQYEKMGIGSS----YLFRLD---DGY---VVDA 1217

Query: 1966 MHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE 2025
              +   A  I HSC PNC  KV  V+G  +I IY  R I  GEEIT++Y    E K+   
Sbjct: 1218 TKRGGVARFINHSCEPNCYTKVITVEGQKKIFIYAKRHISAGEEITYNYKFPLEEKK--- 1274

Query: 2026 ASVCLCGSQVCRGS 2039
               C C S+ CRGS
Sbjct: 1275 -IPCNCRSRRCRGS 1287


>gi|426331996|ref|XP_004026979.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Gorilla gorilla
            gorilla]
          Length = 2776

 Score = 67.0 bits (162), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 44/137 (32%), Positives = 69/137 (50%), Gaps = 28/137 (20%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2143 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2179

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2180 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2239

Query: 1999 YTVRGIHYGEEITFDYN 2015
            Y ++ +  G E+T+DYN
Sbjct: 2240 YALKDMPAGTELTYDYN 2256


>gi|156230137|gb|AAI52413.1| WHSC1 protein [Homo sapiens]
          Length = 713

 Score = 67.0 bits (162), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 421  KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 472

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 473  DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 523

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 524  TFNYNLDCLGNEK---TVCRCGASNCSG 548


>gi|383864320|ref|XP_003707627.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific-like [Megachile rotundata]
          Length = 1302

 Score = 67.0 bits (162), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 66/131 (50%), Gaps = 18/131 (13%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            FV+E++GEV       ++ +  R L +  E     FY + ++  +         ++DA  
Sbjct: 933  FVIEYVGEV------IDEAEYKRRLHRKKELKNENFYFLTIDNNR---------MIDAEP 977

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
            K N +  + HSC PNCE +   V+G  +IG++ +  I  GEE+TF+YN   + +      
Sbjct: 978  KGNLSRFMNHSCSPNCETQKWTVNGDTRIGLFALCDIEPGEELTFNYNLACDGETR---K 1034

Query: 2028 VCLCGSQVCRG 2038
             CLCG+  C G
Sbjct: 1035 PCLCGAPNCSG 1045


>gi|40789042|dbj|BAA83042.2| KIAA1090 protein [Homo sapiens]
          Length = 715

 Score = 67.0 bits (162), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 423  KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 474

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 475  DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 525

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 526  TFNYNLDCLGNEK---TVCRCGASNCSG 550


>gi|374106286|gb|AEY95196.1| FABR136Wp [Ashbya gossypii FDAG1]
          Length = 975

 Score = 67.0 bits (162), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C P+C AK+  V G  +I IY +R I   EE+T+DY    E+
Sbjct: 896  TVIDATKKGGIARFINHCCDPSCTAKIIKVGGMKRIVIYALRDIAANEELTYDYKFERET 955

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +E E   CLCG+  C+G +LN
Sbjct: 956  DDE-ERLPCLCGAPNCKG-FLN 975


>gi|302306708|ref|NP_983083.2| ABR136Wp [Ashbya gossypii ATCC 10895]
 gi|442570023|sp|Q75D88.2|SET1_ASHGO RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|299788647|gb|AAS50907.2| ABR136Wp [Ashbya gossypii ATCC 10895]
          Length = 975

 Score = 67.0 bits (162), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C P+C AK+  V G  +I IY +R I   EE+T+DY    E+
Sbjct: 896  TVIDATKKGGIARFINHCCDPSCTAKIIKVGGMKRIVIYALRDIAANEELTYDYKFERET 955

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +E E   CLCG+  C+G +LN
Sbjct: 956  DDE-ERLPCLCGAPNCKG-FLN 975


>gi|196001997|ref|XP_002110866.1| hypothetical protein TRIADDRAFT_54228 [Trichoplax adhaerens]
 gi|190586817|gb|EDV26870.1| hypothetical protein TRIADDRAFT_54228 [Trichoplax adhaerens]
          Length = 1004

 Score = 67.0 bits (162), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 47/152 (30%), Positives = 76/152 (50%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   ++    ++ FV+E+ GEV  + + FE++    + +K        +Y + L 
Sbjct: 132  KKGFGLRTLED--LEDNQFVLEYCGEVIDL-REFERRKRDYAKKK-----IKHYYFMTLS 183

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              +         ++DA  K  ++  I HSC PNC  +   V+G  +IG +T+R I    E
Sbjct: 184  PNE---------IIDASRKGTFSRFINHSCDPNCVTQKWTVNGMLRIGFFTLRKIPANTE 234

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      +E  E   C CGS+ CRG YL
Sbjct: 235  LTFDYQFERYGREVQE---CYCGSEKCRG-YL 262


>gi|334311241|ref|XP_003339591.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
            H3 lysine-36 and H4 lysine-20 specific-like [Monodelphis
            domestica]
          Length = 2705

 Score = 67.0 bits (162), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 51/193 (26%), Positives = 92/193 (47%), Gaps = 21/193 (10%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1968 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2012

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 2013 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2069

Query: 2027 SVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEA-CELNSVSEEDYLELGR 2085
            +VC CG+  C G +L +  +       ++   L  R Q+   +  E+    E++    G 
Sbjct: 2070 TVCKCGAPNCSG-FLGVRPKNHPNPTEEKSKKLKRRQQVKRRSQGEITKEREDECFSCGD 2128

Query: 2086 AG-LGSCLLGGLP 2097
            AG L SC   G P
Sbjct: 2129 AGQLVSCKKPGCP 2141


>gi|328781326|ref|XP_003249962.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
            [Apis mellifera]
          Length = 1218

 Score = 67.0 bits (162), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 65/131 (49%), Gaps = 18/131 (13%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            FV+E++GEV       ++ +  R L +  E     FY + ++  +          +DA  
Sbjct: 852  FVIEYVGEV------IDEAEYKRRLHRKKELKNENFYFLTIDNNR---------TIDAEP 896

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
            K N +  + HSC PNCE +   V+G  +IG++ +  I  GEE+TF+YN   + +      
Sbjct: 897  KGNLSRFMNHSCSPNCETQKWTVNGDTRIGLFALCDIEPGEELTFNYNLACDGETR---K 953

Query: 2028 VCLCGSQVCRG 2038
             CLCG+  C G
Sbjct: 954  PCLCGASNCSG 964


>gi|119602957|gb|EAW82551.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_d [Homo sapiens]
          Length = 742

 Score = 67.0 bits (162), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 450  KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 501

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 502  DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 552

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 553  TFNYNLDCLGNEK---TVCRCGASNCSG 577


>gi|297282129|ref|XP_002802212.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
            [Macaca mulatta]
          Length = 713

 Score = 67.0 bits (162), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 421  KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 472

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 473  DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 523

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 524  TFNYNLDCLGNEK---TVCRCGASNCSG 548


>gi|405966542|gb|EKC31816.1| Putative histone-lysine N-methyltransferase ASH1L [Crassostrea gigas]
          Length = 2162

 Score = 67.0 bits (162), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 43/155 (27%), Positives = 75/155 (48%), Gaps = 26/155 (16%)

Query: 1892 GLGVVCNKEGGFG--------EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            GL V+  K+ G+G           F++E+LGEV    ++          ++  E+ + E 
Sbjct: 1401 GLEVIVTKDRGYGIRTSDSISNGQFILEYLGEVVSEAEF---------RRRMTEEYSQER 1451

Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
            ++  L    G        V+D     N    + HSC PNCE +   V+G Y++G++ ++ 
Sbjct: 1452 HHYCLNLDSG-------AVIDGYRMGNIGRYVNHSCEPNCEMQKWNVNGVYRMGLFALKD 1504

Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            I    E+T+DYN  + + +  +  +C CGS+ CRG
Sbjct: 1505 ISPNMELTYDYNFHSFNVDAQQ--LCRCGSENCRG 1537


>gi|93003038|tpd|FAA00102.1| TPA: zinc finger protein [Ciona intestinalis]
          Length = 883

 Score = 67.0 bits (162), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 49/158 (31%), Positives = 77/158 (48%), Gaps = 20/158 (12%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G GV  N +    E  F++E++GEV       E++   R+++  N +   + Y + LE 
Sbjct: 130  RGWGVRTNSD--IPEGQFLLEYVGEVVS-----EREFRRRTIE--NYNAHNDHYCVQLEA 180

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
                       V+D    AN    + HSC+PNCE +   V+G Y++G++  R I   EE+
Sbjct: 181  G---------TVIDGYRLANEGRFVNHSCQPNCEMQKWVVNGEYRVGLFAKRPIVSSEEL 231

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGA 2048
            T+DYN    + +  +   C CGS  CRG     T  GA
Sbjct: 232  TYDYNFHAYNLDRQQP--CRCGSSECRGVIGGKTQRGA 267


>gi|260836403|ref|XP_002613195.1| hypothetical protein BRAFLDRAFT_278042 [Branchiostoma floridae]
 gi|229298580|gb|EEN69204.1| hypothetical protein BRAFLDRAFT_278042 [Branchiostoma floridae]
          Length = 313

 Score = 67.0 bits (162), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 55/156 (35%), Positives = 73/156 (46%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+     D    +YN      
Sbjct: 184  GRGLFCKRNIDSGE--MVIEYAGMV------------IRSVLT---DKRENYYNSKGIGC 226

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R     D Y+  VVDA    N A  I HSC PNC ++V  V+G   I I+ +R I+ 
Sbjct: 227  YMFR----IDDYE--VVDATMHGNAARFINHSCDPNCYSRVIQVEGKKHIVIFAMRKIYK 280

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E +       C CGS+ CR  YLN
Sbjct: 281  GEELTYDYKFPIEDQN--SKIDCTCGSKRCR-KYLN 313


>gi|157278865|gb|AAI15212.1| Whsc1 protein [Danio rerio]
          Length = 486

 Score = 67.0 bits (162), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 43/148 (29%), Positives = 78/148 (52%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G++  ++   GE  FV E++GE+       ++++    ++   E+    FY + +++
Sbjct: 189  KGWGLISLRDIKKGE--FVNEYVGEL------IDEEECRSRIRHAQENDITHFYMLTIDK 240

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE +   V+G  ++G++ V  I  G E+
Sbjct: 241  DR---------IIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPAGTEL 291

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 292  TFNYNLDCLGNEK---TVCRCGAPNCSG 316


>gi|453082196|gb|EMF10244.1| hypothetical protein SEPMUDRAFT_151237 [Mycosphaerella populorum
            SO2202]
          Length = 1254

 Score = 67.0 bits (162), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 45/142 (31%), Positives = 68/142 (47%), Gaps = 27/142 (19%)

Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDA 1965
            +D ++E++GE     K  +K   +R ++ + +     +    L          D  +VDA
Sbjct: 1135 NDLIIEYVGE-----KVRQKVADMREIKYDKQGVGSSYLFRML----------DDEIVDA 1179

Query: 1966 MHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE 2025
              K   A  I HSC PNC AK+  V+G  +I IY ++ I   +E+T+DY      K E E
Sbjct: 1180 TKKGGIARFINHSCSPNCTAKIIKVEGTPRIVIYALKDISKNDELTYDY------KFERE 1233

Query: 2026 ASV-----CLCGSQVCRGSYLN 2042
                    CLCGS  C+G +LN
Sbjct: 1234 IGATDRIPCLCGSANCKG-FLN 1254


>gi|349604316|gb|AEP99904.1| Histone-lysine N-methyltransferase HRX-like protein, partial [Equus
            caballus]
          Length = 297

 Score = 67.0 bits (162), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 50/151 (33%), Positives = 70/151 (46%), Gaps = 21/151 (13%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   +   I     
Sbjct: 168  GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKREKYYDSKGIGCYMF 213

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            + D    D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ GEE+T
Sbjct: 214  RID----DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELT 269

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    E         C CG++ CR  +LN
Sbjct: 270  YDYKFPIEDAS--NKLPCNCGAKKCR-KFLN 297


>gi|313221636|emb|CBY36121.1| unnamed protein product [Oikopleura dioica]
          Length = 207

 Score = 67.0 bits (162), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 43/131 (32%), Positives = 70/131 (53%), Gaps = 17/131 (12%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            F++E++GE+         +  IR L+++ +     +Y + L+         +L ++DA  
Sbjct: 44   FIIEYIGEIIS-----HDESRIR-LEESAKIGVTNYYILELD---------NLRMIDAGP 88

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
            + N A  I HSC PNC      V G  +IGI++ R I  GEE+TF+Y  + +S +E +  
Sbjct: 89   RGNIARFINHSCDPNCGIDPWIVQGDTRIGIFSKRDIQEGEELTFNYQ-LQQSSDEGKTK 147

Query: 2028 VCLCGSQVCRG 2038
             CLCGS+ C G
Sbjct: 148  -CLCGSKNCAG 157


>gi|414589296|tpg|DAA39867.1| TPA: putative histone-lysine N-methyltransferase family protein [Zea
            mays]
          Length = 343

 Score = 67.0 bits (162), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 50/151 (33%), Positives = 71/151 (47%), Gaps = 27/151 (17%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+V   E   GE  FV+E++GEV                    +D   E   ++  + 
Sbjct: 127  GHGLVAEDEIKKGE--FVIEYVGEVI-------------------DDRTCE-NRLWTMKR 164

Query: 1952 KGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
              D D Y       +V+DA +K N +  I HSC PN   +   VDG  ++GI+ +R I  
Sbjct: 165  LLDTDFYLCEVSSNMVIDATNKGNRSRFINHSCEPNTAMQKWTVDGETRVGIFALRDIKI 224

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
            GEE+T+DYN +    +   A VC CGS  CR
Sbjct: 225  GEELTYDYNIMYRFVQFGAAQVCHCGSSNCR 255


>gi|242048842|ref|XP_002462165.1| hypothetical protein SORBIDRAFT_02g020844 [Sorghum bicolor]
 gi|241925542|gb|EER98686.1| hypothetical protein SORBIDRAFT_02g020844 [Sorghum bicolor]
          Length = 341

 Score = 67.0 bits (162), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 51/151 (33%), Positives = 71/151 (47%), Gaps = 31/151 (20%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+V   E   GE  FV+E++GEV                    +D A E   ++  + 
Sbjct: 128  GHGLVAEDEIKKGE--FVIEYVGEVI-------------------DDRACE-NRLWTMKR 165

Query: 1952 KGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
              D D Y       +V+DA +K N +  I HSC PN + +   VDG  ++GI+ +R I  
Sbjct: 166  LNDTDFYLCEVSSNMVIDATNKGNLSRFINHSCEPNTKMQKWTVDGETRVGIFALRDIKI 225

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
            GEE+T+DY  V        A VC CGS  CR
Sbjct: 226  GEELTYDYKFVQFGA----AQVCHCGSSKCR 252


>gi|449016155|dbj|BAM79557.1| unknown RNA binding protein [Cyanidioschyzon merolae strain 10D]
          Length = 1151

 Score = 66.6 bits (161), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 46/139 (33%), Positives = 68/139 (48%), Gaps = 23/139 (16%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL---VVV 1963
            D+++E+ GE+            +RS   +  + A      Y ++  GD+  + +    VV
Sbjct: 1033 DYIIEYRGEL------------VRSAVADLRERA------YRQQGMGDSFMFRIDADTVV 1074

Query: 1964 DAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE 2023
            DA H  + A  + HSC PN  A++  + G   I  Y+ R I  GEEIT+DYN   E  + 
Sbjct: 1075 DATHIGSVARFVNHSCDPNAIARIVQLGGASHILFYSKRSICVGEEITYDYNFDIED-DA 1133

Query: 2024 YEASVCLCGSQVCRGSYLN 2042
             E   CLCG+  CR  YLN
Sbjct: 1134 SEKVPCLCGAPNCR-QYLN 1151


>gi|418528271|ref|ZP_13094221.1| nuclear protein SET [Comamonas testosteroni ATCC 11996]
 gi|371454647|gb|EHN67649.1| nuclear protein SET [Comamonas testosteroni ATCC 11996]
          Length = 168

 Score = 66.6 bits (161), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 50/155 (32%), Positives = 74/155 (47%), Gaps = 29/155 (18%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G GV   ++    E + ++E++GEV     W E QD      ++  DP+   +  Y +  
Sbjct: 23   GKGVFAAQD--IAEGETIIEYVGEVI---DWQEAQD------RHPHDPSQPNHTFYFQVD 71

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
                   D  V+DA HK N +  I HSC PNC      +DG  ++ I  +R I  GEE+ 
Sbjct: 72   -------DERVIDATHKGNSSRWINHSCAPNC--YTDEIDG--RVYIVALRNIAAGEELN 120

Query: 2012 FDYNSVTESKEEYEASV-----CLCGSQVCRGSYL 2041
            +DY  + E  E Y A +     C CG+  CRG+ L
Sbjct: 121  YDYGLMVE--ERYTAKLKAEYACYCGAANCRGTML 153


>gi|195501654|ref|XP_002097885.1| GE26460 [Drosophila yakuba]
 gi|194183986|gb|EDW97597.1| GE26460 [Drosophila yakuba]
          Length = 343

 Score = 66.6 bits (161), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 54/151 (35%), Positives = 71/151 (47%), Gaps = 23/151 (15%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C K+   GE   V+E+ GE+            IRS   +  +   +   I     
Sbjct: 216  GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 261

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            K D    D +VVDA  + N A  I H C PNC +KV  + GH  I I+ +R I  GEE+T
Sbjct: 262  KID----DNLVVDATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 317

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    E     E   C CGS+ CR  YLN
Sbjct: 318  YDYKFPFEE----EKIPCSCGSKRCR-KYLN 343


>gi|356507632|ref|XP_003522568.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like [Glycine
            max]
          Length = 2081

 Score = 66.6 bits (161), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 47/153 (30%), Positives = 74/153 (48%), Gaps = 19/153 (12%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            +   +KG G+   ++   G+  F++E++GEV  + + +E +    +L+ +       FY 
Sbjct: 1230 FKCGKKGYGLKAIEDVAQGQ--FLIEYVGEVLDM-QTYEARQREYALKGHRH-----FYF 1281

Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
            + L   +         V+DA  K N    I HSC PNC  +   V+G   IG++ +R + 
Sbjct: 1282 MTLNGSE---------VIDASAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRNVK 1332

Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
              EE+TFDYN V        A  C CGS  CRG
Sbjct: 1333 KDEELTFDYNYVRVFG--AAAKKCYCGSSNCRG 1363


>gi|449463442|ref|XP_004149443.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like [Cucumis
            sativus]
          Length = 1814

 Score = 66.6 bits (161), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/77 (45%), Positives = 42/77 (54%), Gaps = 2/77 (2%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            V+DA  K N    I HSC PNC  +   V+G   IG++ +R I  GEE+TFDYN V    
Sbjct: 1174 VIDACGKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVFG 1233

Query: 2022 EEYEASVCLCGSQVCRG 2038
                A  C CGS  CRG
Sbjct: 1234 --AAAKKCYCGSFHCRG 1248


>gi|326496078|dbj|BAJ90660.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 362

 Score = 66.6 bits (161), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 49/153 (32%), Positives = 72/153 (47%), Gaps = 35/153 (22%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYP-------VWKWFEKQDGIRSLQKNNEDPAPEFY 1944
            G G++   E   GE  FV+E++GEV         +WK  ++Q                + 
Sbjct: 139  GFGLIAEDEIKKGE--FVIEYVGEVIDDRTCEERLWK-MKRQ---------------RYT 180

Query: 1945 NIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGI 2004
            N YL     +      +V+DA +K N +  I HSC PN E +   VDG  ++GI+ +R I
Sbjct: 181  NFYLCEVSSN------MVIDATNKGNKSRFINHSCEPNTEMQKWTVDGETRVGIFALRDI 234

Query: 2005 HYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
              GEE+T+DY  V    ++     C CGS  CR
Sbjct: 235  ERGEELTYDYKFVQFGADQ----DCHCGSSNCR 263


>gi|264680920|ref|YP_003280830.1| nuclear protein SET [Comamonas testosteroni CNB-2]
 gi|299530912|ref|ZP_07044326.1| nuclear protein SET [Comamonas testosteroni S44]
 gi|262211436|gb|ACY35534.1| nuclear protein SET [Comamonas testosteroni CNB-2]
 gi|298721133|gb|EFI62076.1| nuclear protein SET [Comamonas testosteroni S44]
          Length = 168

 Score = 66.6 bits (161), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 50/155 (32%), Positives = 74/155 (47%), Gaps = 29/155 (18%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G GV   ++    E + ++E++GEV     W E QD      ++  DP+   +  Y +  
Sbjct: 23   GKGVFAAQD--IAEGETIIEYVGEVI---DWQEAQD------RHPHDPSQPNHTFYFQVD 71

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
                   D  V+DA HK N +  I HSC PNC      +DG  ++ I  +R I  GEE+ 
Sbjct: 72   -------DERVIDATHKGNSSRWINHSCAPNC--YTDEIDG--RVYIVALRNIAAGEELN 120

Query: 2012 FDYNSVTESKEEYEASV-----CLCGSQVCRGSYL 2041
            +DY  + E  E Y A +     C CG+  CRG+ L
Sbjct: 121  YDYGLMVE--ERYTAKLKAEYACYCGAANCRGTML 153


>gi|403223606|dbj|BAM41736.1| uncharacterized protein TOT_040000118 [Theileria orientalis strain
            Shintoku]
          Length = 944

 Score = 66.6 bits (161), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 49/158 (31%), Positives = 75/158 (47%), Gaps = 15/158 (9%)

Query: 1882 PDDKYVAYR-KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            P  K V +  KG+G V  +E    E++ V E++GEV      F K     S  + ++D  
Sbjct: 639  PKLKLVYFEGKGIGAVATEE--IRENELVCEYVGEVITQTD-FHKSLASSSFAEIDDDNQ 695

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              +Y + + +          V +D+ H  N A  I HSC PNC +    V G Y++G++ 
Sbjct: 696  CHWYVMKVHKE---------VYIDSTHLGNVARFINHSCDPNCSSIPINVRGSYRMGVFA 746

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
             R I  GEE+T++Y     SK       C C ++ CRG
Sbjct: 747  SRKILKGEEVTYNYGFT--SKGVGGGFRCKCNAKNCRG 782


>gi|190349638|gb|ACE75882.1| multiple-myeloma-related WHSC1/MMSET isoform RE-IIBP [Homo sapiens]
          Length = 704

 Score = 66.6 bits (161), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 412  KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 463

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 464  DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 514

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 515  TFNYNLDCLGNEK---TVCRCGASNCSG 539


>gi|395505173|ref|XP_003756919.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Sarcophilus harrisii]
          Length = 2717

 Score = 66.2 bits (160), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 59/220 (26%), Positives = 101/220 (45%), Gaps = 35/220 (15%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            +   ++G G+    +   GE  FV E++GE+       ++++    ++   E     FY 
Sbjct: 1950 FRTLQRGWGLRTKTDIKKGE--FVNEYVGEL------IDEEECRARIRYAQEHDITNFYM 2001

Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
            + L++ +         ++DA  K NYA  + H C+PNCE +  +V+G  ++G++ +  I 
Sbjct: 2002 LTLDKDR---------IIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIK 2052

Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRG-------SYLNLTGEGAFEKVLKELHG 2058
             G E+TF+YN       +   +VC CG+  C G       ++ N T E +  K LK    
Sbjct: 2053 AGTELTFNYNLECLGNGK---TVCKCGAPNCSGFLGVRPKNHPNPTEEKS--KKLKRKQQ 2107

Query: 2059 LLDRHQLMLEACELNSVSEEDYLELGRAG-LGSCLLGGLP 2097
            +  R Q      E+    E++    G AG L SC   G P
Sbjct: 2108 VKRRSQG-----EITKEREDECFSCGDAGQLVSCKKPGCP 2142


>gi|270014006|gb|EFA10454.1| hypothetical protein TcasGA2_TC012700 [Tribolium castaneum]
          Length = 1740

 Score = 66.2 bits (160), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 69/131 (52%), Gaps = 18/131 (13%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            FV+E++GE+       ++Q+  R +QK +E     +Y + +++ +         ++DA  
Sbjct: 1385 FVIEYVGEM------IDEQEYQRRVQKMHEQKEENYYFLTIDKDR---------MLDAGP 1429

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
            K N A  + HSC PNCE +   V+G  ++G++    I  G E+TF+YN     KE+    
Sbjct: 1430 KGNVARFMNHSCDPNCETQKWTVNGDTRVGLFANCDIPAGTELTFNYNLECIGKEK---K 1486

Query: 2028 VCLCGSQVCRG 2038
            +C CG+  C G
Sbjct: 1487 ICHCGAPNCSG 1497


>gi|260833262|ref|XP_002611576.1| hypothetical protein BRAFLDRAFT_117164 [Branchiostoma floridae]
 gi|229296947|gb|EEN67586.1| hypothetical protein BRAFLDRAFT_117164 [Branchiostoma floridae]
          Length = 734

 Score = 66.2 bits (160), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 44/155 (28%), Positives = 76/155 (49%), Gaps = 26/155 (16%)

Query: 1892 GLGVVCNKEGGFG--------EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            GL  +  K+ G+G        + +F++E++GEV    ++  ++  +     +N       
Sbjct: 86   GLERIVTKDRGYGVRSKTPIPQGNFILEYVGEVVSEQEF--RRRTVEIYHDHNH-----H 138

Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
            Y + L         +   V+D          + HSC PNCE +  +V+G Y+IG++ +R 
Sbjct: 139  YCLNL---------HSGAVIDGYKYGCEGRFVNHSCEPNCEMQKWSVNGVYRIGLFALRD 189

Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            I  GEE+T+DYN    + E+ +  +C CGS  CRG
Sbjct: 190  IPAGEELTYDYNFHAFNMEKQQ--ICKCGSAKCRG 222


>gi|347972366|ref|XP_316738.5| AGAP004656-PA [Anopheles gambiae str. PEST]
 gi|333469400|gb|EAA11974.5| AGAP004656-PA [Anopheles gambiae str. PEST]
          Length = 1259

 Score = 66.2 bits (160), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 47/150 (31%), Positives = 79/150 (52%), Gaps = 24/150 (16%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   G+  FV+E++GEV    ++  +   +++ ++ N      +Y + +E 
Sbjct: 1032 KGFGLVALEDLKSGQ--FVIEYVGEVINSEEFDRRVMMMQAAKETN------YYFLTVEP 1083

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
                    DL + DA  K N +  I HSC PNCE +   +     IG++ ++ I+ GEE+
Sbjct: 1084 --------DLTI-DAGPKGNVSRFINHSCEPNCETQKWTIGETRVIGLFAIKDINAGEEL 1134

Query: 2011 TFDYN--SVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN  S+  +K      VCLCG+  C G
Sbjct: 1135 TFNYNLESLGNNKR-----VCLCGAGKCSG 1159


>gi|442621474|ref|NP_001263029.1| Mes-4, isoform B [Drosophila melanogaster]
 gi|440217972|gb|AGB96409.1| Mes-4, isoform B [Drosophila melanogaster]
          Length = 1423

 Score = 66.2 bits (160), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 42/151 (27%), Positives = 74/151 (49%), Gaps = 25/151 (16%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G+V  +    G  DFV+E++GEV          +  R +++   D    +Y + +E+
Sbjct: 1240 RGFGLVNREPIAVG--DFVIEYVGEV------INHAEFQRRMEQKQRDRDENYYFLGVEK 1291

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
                       ++DA  K N A  + HSC PNCE +   V+  +++GI+ ++ I    E+
Sbjct: 1292 D---------FIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIKDIPVNSEL 1342

Query: 2011 TFDY---NSVTESKEEYEASVCLCGSQVCRG 2038
            TF+Y   + +  SK+      C CG++ C G
Sbjct: 1343 TFNYLWDDLMNNSKK-----ACFCGAKRCSG 1368


>gi|410080444|ref|XP_003957802.1| hypothetical protein KAFR_0F00700 [Kazachstania africana CBS 2517]
 gi|372464389|emb|CCF58667.1| hypothetical protein KAFR_0F00700 [Kazachstania africana CBS 2517]
          Length = 1133

 Score = 66.2 bits (160), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/81 (43%), Positives = 46/81 (56%), Gaps = 2/81 (2%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            V+DA  K   A  I H C P+C AK+  V G  +I IY +R I   EE+T+DY    E  
Sbjct: 1055 VIDATKKGGIARFINHCCDPSCTAKIIKVGGKRRIVIYALRDIAKNEELTYDYKFEREQD 1114

Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
            +E E   CLCG+  C+G +LN
Sbjct: 1115 DE-ERLPCLCGAPNCKG-FLN 1133


>gi|412991390|emb|CCO16235.1| unnamed protein product [Bathycoccus prasinos]
          Length = 825

 Score = 66.2 bits (160), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 63/137 (45%), Gaps = 17/137 (12%)

Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
            E DF+VE++GE+       ++++  R L        P FY + +   +         ++D
Sbjct: 518  EGDFIVEYMGEI------VDEEECTRRLLACKGKNEPNFYLMEITPSQ---------IID 562

Query: 1965 AMHKANYASRICHSCRPNCEAK--VTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
            A    N A  I  SC PNCE +  V A     ++GI+    I  G E+T+DYN      E
Sbjct: 563  ARFCGNNARFINSSCHPNCETQRWVDASTNETRVGIFATEDIKSGTELTYDYNFAHFGGE 622

Query: 2023 EYEASVCLCGSQVCRGS 2039
               +  C CG  +C+G+
Sbjct: 623  GTTSFTCFCGHPMCKGT 639


>gi|156393989|ref|XP_001636609.1| predicted protein [Nematostella vectensis]
 gi|156223714|gb|EDO44546.1| predicted protein [Nematostella vectensis]
          Length = 213

 Score = 66.2 bits (160), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 48/137 (35%), Positives = 65/137 (47%), Gaps = 25/137 (18%)

Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL---VV 1962
            D+ V+E++GEV            IR    +  +        Y ER  G +  + L    +
Sbjct: 97   DEMVIEYVGEV------------IRQAIADYRE------RCYEERGIGSSYMFRLDETTI 138

Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
            +DA    N+A  I H C PNC AKV AV+   +I IY+ R I   EEIT+DY    E   
Sbjct: 139  IDATTMGNFARFINHCCDPNCYAKVIAVENMKKIVIYSKRDIQVDEEITYDYKFPIED-- 196

Query: 2023 EYEASVCLCGSQVCRGS 2039
              E   CLCG+  CRG+
Sbjct: 197  --EKIPCLCGAPQCRGT 211


>gi|24650756|ref|NP_733239.1| Mes-4, isoform A [Drosophila melanogaster]
 gi|29427833|sp|Q8MT36.2|MES4_DROME RecName: Full=Probable histone-lysine N-methyltransferase Mes-4;
            AltName: Full=Maternal-effect sterile 4 homolog
 gi|23172478|gb|AAF56762.2| Mes-4, isoform A [Drosophila melanogaster]
 gi|94400569|gb|ABF17912.1| FI01019p [Drosophila melanogaster]
          Length = 1427

 Score = 66.2 bits (160), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 42/151 (27%), Positives = 74/151 (49%), Gaps = 25/151 (16%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G+V  +    G  DFV+E++GEV          +  R +++   D    +Y + +E+
Sbjct: 1244 RGFGLVNREPIAVG--DFVIEYVGEV------INHAEFQRRMEQKQRDRDENYYFLGVEK 1295

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
                       ++DA  K N A  + HSC PNCE +   V+  +++GI+ ++ I    E+
Sbjct: 1296 D---------FIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIKDIPVNSEL 1346

Query: 2011 TFDY---NSVTESKEEYEASVCLCGSQVCRG 2038
            TF+Y   + +  SK+      C CG++ C G
Sbjct: 1347 TFNYLWDDLMNNSKK-----ACFCGAKRCSG 1372


>gi|348516272|ref|XP_003445663.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like
            [Oreochromis niloticus]
          Length = 595

 Score = 66.2 bits (160), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 45/140 (32%), Positives = 65/140 (46%), Gaps = 25/140 (17%)

Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYD 1959
               D+ V+E++G++            IR +  +  +   E   I   YL R   D     
Sbjct: 476  IAADEMVIEYVGQI------------IRQVIADMREQRYEEEGIGSSYLFRVDQDT---- 519

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
              ++DA    N A  I HSC PNC AK+  V+   +I IY+ + I+  EEIT+DY    E
Sbjct: 520  --IIDATKCGNLARFINHSCNPNCYAKIITVESQKKIVIYSRQPININEEITYDYKFPIE 577

Query: 2020 SKEEYEASVCLCGSQVCRGS 2039
              +      CLCG+  CRGS
Sbjct: 578  ETK----IPCLCGADGCRGS 593


>gi|302798461|ref|XP_002980990.1| hypothetical protein SELMODRAFT_3415 [Selaginella moellendorffii]
 gi|300151044|gb|EFJ17691.1| hypothetical protein SELMODRAFT_3415 [Selaginella moellendorffii]
          Length = 242

 Score = 66.2 bits (160), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 64/132 (48%), Gaps = 19/132 (14%)

Query: 1908 FVVEFLGEVYPVWKW-FEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            FV+E++GEV     +   +++  R  QK+       FY + L   +         V+DA 
Sbjct: 97   FVIEYVGEVLDSRSFELRQKEYARQRQKH-------FYFMTLNSSE---------VIDAC 140

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K N    I HSC PNC+ +   V+G   IG++ +R +   EEITF+YN   E      A
Sbjct: 141  RKGNLGRFINHSCEPNCQTEKWCVNGEICIGLFAIRDVAKNEEITFNYN--FERLYGAAA 198

Query: 2027 SVCLCGSQVCRG 2038
              C CGS  CRG
Sbjct: 199  KKCHCGSAHCRG 210


>gi|195352984|ref|XP_002042990.1| GM16309 [Drosophila sechellia]
 gi|194127055|gb|EDW49098.1| GM16309 [Drosophila sechellia]
          Length = 1418

 Score = 66.2 bits (160), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 43/151 (28%), Positives = 75/151 (49%), Gaps = 25/151 (16%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G+V N+E      DFV+E++GEV          +  R +++   D    +Y + +E+
Sbjct: 1235 RGFGLV-NREP-IAAGDFVIEYVGEV------INHAEFQRRMEQKQRDRDENYYFLGVEK 1286

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
                       ++DA  K N A  + HSC PNCE +   V+  +++GI+ ++ I    E+
Sbjct: 1287 D---------FIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIKDIPVNTEL 1337

Query: 2011 TFDY---NSVTESKEEYEASVCLCGSQVCRG 2038
            TF+Y   + +  SK+      C CG++ C G
Sbjct: 1338 TFNYLWDDLMNNSKK-----ACFCGAKRCSG 1363


>gi|91090902|ref|XP_973711.1| PREDICTED: similar to NSD1 [Tribolium castaneum]
          Length = 1795

 Score = 66.2 bits (160), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 69/131 (52%), Gaps = 18/131 (13%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            FV+E++GE+       ++Q+  R +QK +E     +Y + +++ +         ++DA  
Sbjct: 1440 FVIEYVGEM------IDEQEYQRRVQKMHEQKEENYYFLTIDKDR---------MLDAGP 1484

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
            K N A  + HSC PNCE +   V+G  ++G++    I  G E+TF+YN     KE+    
Sbjct: 1485 KGNVARFMNHSCDPNCETQKWTVNGDTRVGLFANCDIPAGTELTFNYNLECIGKEK---K 1541

Query: 2028 VCLCGSQVCRG 2038
            +C CG+  C G
Sbjct: 1542 ICHCGAPNCSG 1552


>gi|157734198|gb|ABV68922.1| SDG25 [Arabidopsis thaliana]
          Length = 1388

 Score = 66.2 bits (160), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 49/137 (35%), Positives = 66/137 (48%), Gaps = 25/137 (18%)

Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGYDLVV 1962
            +DFV+E++GE+            IRS      +   E   I   YL R     DGY   V
Sbjct: 1272 EDFVIEYVGEL------------IRSSISEIRERQYEKMGIGSSYLFRLD---DGY---V 1313

Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
            +DA  +   A  I HSC PNC  K+ +V+G  +I IY  R I  GEEI+++Y    E   
Sbjct: 1314 LDATKRGGIARFINHSCEPNCYTKIISVEGKKKIFIYAKRHIDAGEEISYNYKFPLED-- 1371

Query: 2023 EYEASVCLCGSQVCRGS 2039
              +   C CG+  CRGS
Sbjct: 1372 --DKIPCNCGAPKCRGS 1386


>gi|449477606|ref|XP_002188016.2| PREDICTED: histone-lysine N-methyltransferase SETD1B-like
            [Taeniopygia guttata]
          Length = 228

 Score = 66.2 bits (160), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 152  TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 211

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 212  VK----IPCLCGSENCRGT 226


>gi|260830013|ref|XP_002609956.1| hypothetical protein BRAFLDRAFT_124382 [Branchiostoma floridae]
 gi|229295318|gb|EEN65966.1| hypothetical protein BRAFLDRAFT_124382 [Branchiostoma floridae]
          Length = 902

 Score = 66.2 bits (160), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 44/155 (28%), Positives = 77/155 (49%), Gaps = 26/155 (16%)

Query: 1892 GLGVVCNKEGGFG--------EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            GL  +  K+ G+G        + +F++E++GEV       E++   R+++  ++      
Sbjct: 114  GLERIVTKDRGYGVRSKTPIPQGNFILEYVGEVVS-----EQEFRRRTVEIYHDHNHHYC 168

Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
             N++              V+D          + HSC PNCE +  +V+G Y+IG++ +R 
Sbjct: 169  LNLH-----------SGAVIDGYKYGCEGRFVNHSCEPNCEMQKWSVNGVYRIGLFALRD 217

Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            I  GEE+T+DYN    + E+ +  +C CGS  CRG
Sbjct: 218  IPAGEELTYDYNFHAFNMEKQQ--ICKCGSAKCRG 250


>gi|326436327|gb|EGD81897.1| hypothetical protein PTSG_11893 [Salpingoeca sp. ATCC 50818]
          Length = 296

 Score = 66.2 bits (160), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 49/141 (34%), Positives = 69/141 (48%), Gaps = 29/141 (20%)

Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI-----YLERPKGDADGYD 1959
            +D+ V+E++GE+            +R  Q   ED    +  I     YL R   D     
Sbjct: 177  KDELVIEYVGEI------------VR--QTVAEDRERRYARIGIGSSYLFRIDED----- 217

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
              V+DA    + A  I HSC  NC A+V +VDG  +IGIY+ R I   EEIT+DY     
Sbjct: 218  -YVIDATRMGSIARFINHSCDANCYAQVVSVDGKKRIGIYSKRPIAANEEITYDYKF--- 273

Query: 2020 SKEEYEASV-CLCGSQVCRGS 2039
             +EE    + C CG++ CRG+
Sbjct: 274  PREEGPNKIPCFCGARTCRGT 294


>gi|171462836|ref|YP_001796949.1| nuclear protein SET [Polynucleobacter necessarius subsp. necessarius
            STIR1]
 gi|171192374|gb|ACB43335.1| nuclear protein SET [Polynucleobacter necessarius subsp. necessarius
            STIR1]
          Length = 163

 Score = 66.2 bits (160), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 53/158 (33%), Positives = 77/158 (48%), Gaps = 27/158 (17%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G GV   K    GE   ++E+ GE    WK  EK+        + +DP   FY   LE  
Sbjct: 25   GKGVFVAKPIKKGEA--IIEYKGERIS-WKLAEKRH-----PHDPKDPNHTFY-FSLE-- 73

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
                   D  V+DA +  N A  I HSC+P+CE +  + +G  ++ IY  R +  GEE+ 
Sbjct: 74   -------DGRVIDAKYGGNAARWINHSCKPSCETREDSFNGEPRVFIYAKRALKVGEELF 126

Query: 2012 FDYNSVTES------KEEYEASVCLCGSQVCRGSYLNL 2043
            +DY+   E       K++YE   C CG++ CRG+ L L
Sbjct: 127  YDYSLDIEGKITKQMKKDYE---CRCGAKKCRGTMLAL 161


>gi|428171302|gb|EKX40220.1| hypothetical protein GUITHDRAFT_75734 [Guillardia theta CCMP2712]
          Length = 156

 Score = 66.2 bits (160), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 34/91 (37%), Positives = 49/91 (53%), Gaps = 9/91 (9%)

Query: 1949 ERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGE 2008
            E P+G A      +VDA  + N    I H C PNCEAK+  ++G  +I I  +  + +GE
Sbjct: 73   EPPEGRA-----AIVDATIRHNIGHYINHCCDPNCEAKILKINGQRRIIISAIHDVQFGE 127

Query: 2009 EITFDYNSVTESKEEYEASVCLCGSQVCRGS 2039
            E+T+DY    E K+      C CG+  CRG+
Sbjct: 128  ELTYDYKLPFEDKK----IPCHCGAPTCRGT 154


>gi|221069761|ref|ZP_03545866.1| Histone-lysine N-methyltransferase [Comamonas testosteroni KF-1]
 gi|220714784|gb|EED70152.1| Histone-lysine N-methyltransferase [Comamonas testosteroni KF-1]
          Length = 168

 Score = 66.2 bits (160), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 52/155 (33%), Positives = 74/155 (47%), Gaps = 29/155 (18%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G GV   ++   GE   ++E++GEV     W E QD      ++  DP+   +  Y +  
Sbjct: 23   GKGVFAAQDIAQGET--LIEYVGEVI---DWQEAQD------RHPHDPSQPNHTFYFQVD 71

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
                   D  V+DA HK N +  I HSC PNC      +DG  +I I  +R I  GEE+ 
Sbjct: 72   -------DERVIDATHKGNSSRWINHSCDPNC--YTDEIDG--RIYIIALRNIAAGEELN 120

Query: 2012 FDYNSVTESKEEYEASV-----CLCGSQVCRGSYL 2041
            +DY  + E  E Y A +     C CG+  CRG+ L
Sbjct: 121  YDYGLMVE--ERYTAKLKAEYACYCGAANCRGTML 153


>gi|47226564|emb|CAG08580.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1404

 Score = 66.2 bits (160), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 72/148 (48%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G+  N+    GE  FV+E++GEV       + ++  + +++ +E+    FY + L +
Sbjct: 1145 RGWGLKANQPIKKGE--FVIEYVGEV------IDAEECQQRIKRAHENHMTNFYMLTLTK 1196

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         V+DA  K N +  I HSC PNCE +   V+G   IG++ +  I    E+
Sbjct: 1197 DR---------VIDAGQKGNLSRFINHSCSPNCETQKWTVNGDVHIGLFALCDIETDTEL 1247

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN           + C CGS  C G
Sbjct: 1248 TFNYNLHCVGNRR---ATCNCGSDNCSG 1272


>gi|28277052|gb|AAH44818.1| Mll1 protein, partial [Mus musculus]
          Length = 142

 Score = 66.2 bits (160), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 51/153 (33%), Positives = 72/153 (47%), Gaps = 25/153 (16%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI--YLE 1949
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   +   I  Y+ 
Sbjct: 13   GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKREKYYDSKGIGCYMF 58

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
            R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ GEE
Sbjct: 59   RID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEE 112

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +T+DY    E  +      C CG++ CR  +LN
Sbjct: 113  LTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 142


>gi|431912177|gb|ELK14315.1| Histone-lysine N-methyltransferase SETD1B [Pteropus alecto]
          Length = 245

 Score = 66.2 bits (160), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 45/79 (56%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I   EEIT+DY    E 
Sbjct: 169  TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHISVNEEITYDYKFPIED 228

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 229  VK----IPCLCGSENCRGT 243


>gi|357157974|ref|XP_003577976.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like
            [Brachypodium distachyon]
          Length = 338

 Score = 66.2 bits (160), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 47/153 (30%), Positives = 74/153 (48%), Gaps = 35/153 (22%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYP-------VWKWFEKQDGIRSLQKNNEDPAPEFY 1944
            G G+V   + G  + +F++E++GEV         +WK  ++Q                + 
Sbjct: 115  GFGLV--ADDGIQKGEFIIEYVGEVIDDRTCEERLWK-MKRQ---------------RYT 156

Query: 1945 NIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGI 2004
            N YL     +      +V+DA +K N +  I HSC+PN E +   VDG  ++GI+ +R I
Sbjct: 157  NFYLCEVSSN------MVIDATNKGNKSRFINHSCQPNTEMQKWTVDGETRVGIFALRDI 210

Query: 2005 HYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
              GEE+T+DY  V    ++     C CGS  CR
Sbjct: 211  KKGEELTYDYKFVQFGADQ----DCHCGSSKCR 239


>gi|432952957|ref|XP_004085262.1| PREDICTED: histone-lysine N-methyltransferase NSD2-like, partial
            [Oryzias latipes]
          Length = 1167

 Score = 66.2 bits (160), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 42/148 (28%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   G+  FV E++GE+       ++++    ++  +E+    FY + +++
Sbjct: 967  KGWGLVALRDIKKGK--FVNEYIGEL------IDEEECRARIKYAHENNITNFYMLTIDK 1018

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE +   V+G  ++G++ V  I  G E+
Sbjct: 1019 DR---------IIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCHIPAGTEL 1069

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   ++C CG+  C G
Sbjct: 1070 TFNYNLDCLGNEK---TICRCGAPNCSG 1094


>gi|327265653|ref|XP_003217622.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific-like [Anolis carolinensis]
          Length = 2106

 Score = 65.9 bits (159), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 40/148 (27%), Positives = 75/148 (50%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G+   ++   GE  FV E++GE+       ++++    ++   E     FY + L++
Sbjct: 1383 RGWGLQAKRDIKKGE--FVNEYVGEL------IDEEECRARIRHAQEHDITNFYMLTLDK 1434

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NYA  + H C+PNCE +  +V+G  ++G++ +  +  G E+
Sbjct: 1435 DR---------IIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFAITNVKAGTEL 1485

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN       +   +VC CG+  C G
Sbjct: 1486 TFNYNLECLGNGK---TVCKCGAPNCSG 1510


>gi|367004711|ref|XP_003687088.1| hypothetical protein TPHA_0I01480 [Tetrapisispora phaffii CBS 4417]
 gi|357525391|emb|CCE64654.1| hypothetical protein TPHA_0I01480 [Tetrapisispora phaffii CBS 4417]
          Length = 1030

 Score = 65.9 bits (159), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 34/82 (41%), Positives = 46/82 (56%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C P+C AK+  V G  +I IY +R I   EE+T+DY    E 
Sbjct: 951  TVIDATKKGGIARFINHCCDPSCTAKIIKVGGKKRIVIYALRDIDVNEELTYDYKFEREE 1010

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             ++ E   CLCG+  C+G +LN
Sbjct: 1011 DDQ-ERLPCLCGAPNCKG-FLN 1030


>gi|449679772|ref|XP_002161520.2| PREDICTED: histone-lysine N-methyltransferase MLL-like [Hydra
            magnipapillata]
          Length = 281

 Score = 65.9 bits (159), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 35/84 (41%), Positives = 48/84 (57%), Gaps = 5/84 (5%)

Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            D  VVDA  K N A  I HSC PNC +++ ++DG  +I IY  + +  GEE+T+DY    
Sbjct: 203  DTDVVDATTKGNAARFINHSCEPNCFSRIISIDGCKKIIIYAQKRVTVGEELTYDYKFAI 262

Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
            E     +   C CG++ CR  YLN
Sbjct: 263  ED----DKLPCFCGAKKCR-KYLN 281


>gi|33305503|gb|AAQ02781.1|AF373874_1 Mll protein [Xenopus laevis]
          Length = 84

 Score = 65.9 bits (159), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 36/84 (42%), Positives = 47/84 (55%), Gaps = 3/84 (3%)

Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ GEE+T+DY    
Sbjct: 4    DSEVVDATMHGNAARFINHSCEPNCYSRVIPIDGQKHIVIFAMRKIYRGEELTYDYKFPI 63

Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
            E      A  C CG++ CR  +LN
Sbjct: 64   EDANNKLA--CNCGTKKCR-KFLN 84


>gi|354472091|ref|XP_003498274.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
            [Cricetulus griseus]
          Length = 1436

 Score = 65.9 bits (159), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 45/158 (28%), Positives = 79/158 (50%), Gaps = 21/158 (13%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   RKG G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1145 PDAEIIKTERKGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1196

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1197 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1247

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +  I  G E+TF+YN           +VC CGS  C G
Sbjct: 1248 ICDIPAGMELTFNYNLDCLGNGR---TVCHCGSDNCSG 1282


>gi|396478086|ref|XP_003840449.1| hypothetical protein LEMA_P101010.1 [Leptosphaeria maculans JN3]
 gi|312217021|emb|CBX96970.1| hypothetical protein LEMA_P101010.1 [Leptosphaeria maculans JN3]
          Length = 962

 Score = 65.9 bits (159), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 67/149 (44%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+  NK+   G  DFV E++GEV       EK    R LQ ++E     FY  ++ 
Sbjct: 243  KKGFGLRANKDMAPG--DFVFEYIGEVI-----DEKTFRRRMLQYDHEG-IKHFY--FMS 292

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              KG+        VDA  K N      HSC PNC      V    ++GI+  R +  GEE
Sbjct: 293  LTKGE-------FVDATKKGNLGRFCNHSCNPNCFVDKWVVGDKLRMGIFVERRVQAGEE 345

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            + F+YN     +   +   C CG   C G
Sbjct: 346  LVFNYNV---DRYGADPQPCYCGEPNCSG 371


>gi|356518575|ref|XP_003527954.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like [Glycine
            max]
          Length = 2037

 Score = 65.9 bits (159), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 48/149 (32%), Positives = 72/149 (48%), Gaps = 19/149 (12%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   +    G+  F++E++GEV  + + +E +    +L+ +       FY + L 
Sbjct: 1190 KKGYGLKAIENVAQGQ--FLIEYVGEVLDM-QAYEARQREYALKGHRH-----FYFMTLN 1241

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              +         V+DA  K N    I HSC PNC  +   V+G   IG++ +R I   EE
Sbjct: 1242 GSE---------VIDASAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKKDEE 1292

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +TFDYN V        A  C CGS  CRG
Sbjct: 1293 LTFDYNYVRVFG--AAAKKCYCGSPNCRG 1319


>gi|354472093|ref|XP_003498275.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
            [Cricetulus griseus]
          Length = 1387

 Score = 65.9 bits (159), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 45/158 (28%), Positives = 79/158 (50%), Gaps = 21/158 (13%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   RKG G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1096 PDAEIIKTERKGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1147

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1148 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1198

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +  I  G E+TF+YN           +VC CGS  C G
Sbjct: 1199 ICDIPAGMELTFNYNLDCLGNGR---TVCHCGSDNCSG 1233


>gi|198437220|ref|XP_002124518.1| PREDICTED: similar to Histone-lysine N-methyltransferase HRX (Zinc
            finger protein HRX) (ALL-1) (Trithorax-like protein)
            (Lysine N-methyltransferase 2A) (CXXC-type zinc finger
            protein 7) [Ciona intestinalis]
          Length = 3406

 Score = 65.9 bits (159), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 43/152 (28%), Positives = 72/152 (47%), Gaps = 20/152 (13%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            Y +   G G+ C ++   GE   ++E+ G++        +Q+     +K  E  +   Y 
Sbjct: 3271 YRSTIHGRGLYCKRDFDSGE--MIMEYTGQII-------RQELTDKREKYYESKSIGCYM 3321

Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
              ++         D  VVDA    + A  I HSC PNC +++   +G   I I+ +R I+
Sbjct: 3322 FRMD---------DFYVVDATVLGSGARFINHSCDPNCYSRIVQFEGKKHIVIFALREIY 3372

Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
             GEE+T+DY    E  +E     C CG+++CR
Sbjct: 3373 KGEELTYDYKFPIE--DENHKIACTCGARLCR 3402


>gi|326928449|ref|XP_003210391.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific-like, partial [Meleagris gallopavo]
          Length = 2336

 Score = 65.9 bits (159), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 67/132 (50%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1669 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1713

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +   V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 1714 PKGNYARFMNHCCQPNCETQKWCVNGDTRVGLFAIVNIKAGTELTFNYNLECLGNGK--- 1770

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1771 TVCKCGAPNCSG 1782


>gi|345493934|ref|XP_001600694.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Nasonia
            vitripennis]
          Length = 1382

 Score = 65.9 bits (159), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 76/148 (51%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G+V  +    G+  F++E++GEV       E +  +R LQ+  E     +Y + ++ 
Sbjct: 1016 RGWGLVSLEPIKHGQ--FIIEYVGEVID-----EAEYKLR-LQQKKERKNENYYFLTIDN 1067

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K N +  + HSC+PNCE +   V+G  +IG++ +R I  GEE+
Sbjct: 1068 SR---------MIDAEPKGNLSRFMNHSCQPNCETQKWKVNGDTRIGLFALRDIEPGEEL 1118

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN   + +       CLC +  C G
Sbjct: 1119 TFNYNLACDGETR---KPCLCKAPNCSG 1143


>gi|403072167|pdb|4FMU|A Chain A, Crystal Structure Of Methyltransferase Domain Of Human Set
            Domain- Containing Protein 2 Compound: Pr-Snf
 gi|407944022|pdb|4H12|A Chain A, The Crystal Structure Of Methyltransferase Domain Of Human
            Set Domain- Containing Protein 2 In Complex With
            S-Adenosyl-L-Homocysteine
          Length = 278

 Score = 65.9 bits (159), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 31/77 (40%), Positives = 43/77 (55%), Gaps = 3/77 (3%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E+TFDY      K
Sbjct: 181  IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSELTFDYQFQRYGK 240

Query: 2022 EEYEASVCLCGSQVCRG 2038
               EA  C CGS  CRG
Sbjct: 241  ---EAQKCFCGSANCRG 254


>gi|358253063|dbj|GAA51760.1| histone-lysine N-methyltransferase NSD1/2 [Clonorchis sinensis]
          Length = 1596

 Score = 65.9 bits (159), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 39/131 (29%), Positives = 67/131 (51%), Gaps = 18/131 (13%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            FV E++G++       ++++  R L+  +E+    +Y + L+  +         ++DA  
Sbjct: 1074 FVNEYIGDL------IDEEEANRRLRFAHENNVTNYYMMKLDAQR---------IIDAGP 1118

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
            K N +  + H C PN   +   V+G  +IG++ VR I  GEE+TFDYN V   +E     
Sbjct: 1119 KGNLSRFMNHCCDPNLNTQKWTVNGDNRIGLFAVRDIAAGEELTFDYNFVALGQERLN-- 1176

Query: 2028 VCLCGSQVCRG 2038
             C CG++ C G
Sbjct: 1177 -CRCGAENCTG 1186


>gi|223365716|pdb|2W5Y|A Chain A, Binary Complex Of The Mixed Lineage Leukaemia (Mll1) Set
            Domain With The Cofactor Product S-Adenosylhomocysteine.
 gi|223365717|pdb|2W5Z|A Chain A, Ternary Complex Of The Mixed Lineage Leukaemia (Mll1) Set
            Domain With The Cofactor Product S-Adenosylhomocysteine
            And Histone Peptide
          Length = 192

 Score = 65.5 bits (158), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 50/151 (33%), Positives = 70/151 (46%), Gaps = 21/151 (13%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   +   I     
Sbjct: 63   GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKREKYYDSKGIGCYMF 108

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            + D    D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ GEE+T
Sbjct: 109  RID----DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELT 164

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    E         C CG++ CR  +LN
Sbjct: 165  YDYKFPIEDAS--NKLPCNCGAKKCR-KFLN 192


>gi|363739108|ref|XP_414538.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Gallus gallus]
          Length = 2412

 Score = 65.5 bits (158), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 67/132 (50%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1681 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1725

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +   V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 1726 PKGNYARFMNHCCQPNCETQKWCVNGDTRVGLFAIVNIKAGTELTFNYNLECLGNGK--- 1782

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1783 TVCKCGAPNCSG 1794


>gi|121483959|gb|ABM54292.1| MLL [Pan paniscus]
          Length = 162

 Score = 65.5 bits (158), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 51/153 (33%), Positives = 72/153 (47%), Gaps = 25/153 (16%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI--YLE 1949
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   +   I  Y+ 
Sbjct: 33   GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKREKYYDSKGIGCYMF 78

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
            R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ GEE
Sbjct: 79   RID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEE 132

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +T+DY    E  +      C CG++ CR  +LN
Sbjct: 133  LTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 162


>gi|324523879|gb|ADY48320.1| Histone-lysine N-methyltransferase ASH1L, partial [Ascaris suum]
          Length = 287

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 43/148 (29%), Positives = 73/148 (49%), Gaps = 17/148 (11%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG GV   K    G++  + E++G V P  ++FE+ + I +   NN + +  ++ + +  
Sbjct: 9    KGFGVFAKKYIPAGQE--LTEYVGRVMPRDEYFEQLNFIGTF--NNLEMS--YFGMQIT- 61

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
                    +   VDA +  N +  + HSC PNC+     VDG Y++ +  ++ I  G+E+
Sbjct: 62   --------NEFYVDARNCGNMSRSVNHSCEPNCKVNAVTVDGVYRLKVSALKDIAAGDEL 113

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            T+DY   TE         C CG+  CRG
Sbjct: 114  TYDYG--TELWSGMVGMRCRCGTAGCRG 139


>gi|451994892|gb|EMD87361.1| hypothetical protein COCHEDRAFT_1144880 [Cochliobolus heterostrophus
            C5]
          Length = 923

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 45/149 (30%), Positives = 68/149 (45%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+  NK+   GE  FV E++GEV       +++   R + + +E+    FY  ++ 
Sbjct: 217  KKGFGLRANKDMAPGE--FVFEYIGEV------IDERTFRRRMGQYDEEGIKHFY--FMS 266

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              KG+        VDA  K N      HSC PNC      V    ++GI+  R +  GEE
Sbjct: 267  LTKGE-------FVDATKKGNLGRFCNHSCNPNCFVDKWVVGDKLRMGIFVERQVKAGEE 319

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            + F+YN     +   +   C CG   C G
Sbjct: 320  LVFNYNV---DRYGADPQPCYCGEPNCSG 345


>gi|348535504|ref|XP_003455240.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Oreochromis niloticus]
          Length = 2122

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 68/131 (51%), Gaps = 18/131 (13%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            F+ E++GEV       E +  IR  Q+N+      FY + L++ +         ++DA  
Sbjct: 1671 FISEYVGEVI---DEEECRARIRHAQEND---ICNFYMLTLDKDR---------IIDAGP 1715

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
            K N A  + HSC+PNCE +   V+G  ++G++ ++ +  GEE+TF+YN       +   +
Sbjct: 1716 KGNQARFMNHSCQPNCETQKWTVNGDTRVGLFALQDVPKGEELTFNYNLECRGNGK---T 1772

Query: 2028 VCLCGSQVCRG 2038
             C CG+  C G
Sbjct: 1773 ACKCGAPNCSG 1783


>gi|149726051|ref|XP_001502479.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Equus caballus]
          Length = 2700

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1969 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2013

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 2014 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2070

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 2071 TVCKCGAPNCSG 2082


>gi|451846131|gb|EMD59442.1| hypothetical protein COCSADRAFT_258710 [Cochliobolus sativus ND90Pr]
          Length = 923

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 45/149 (30%), Positives = 68/149 (45%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+  NK+   GE  FV E++GEV       +++   R + + +E+    FY  ++ 
Sbjct: 217  KKGFGLRANKDMAPGE--FVFEYIGEV------IDERTFRRRMGQYDEEGIKHFY--FMS 266

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              KG+        VDA  K N      HSC PNC      V    ++GI+  R +  GEE
Sbjct: 267  LTKGE-------FVDATKKGNLGRFCNHSCNPNCFVDKWVVGDKLRMGIFVERQVKAGEE 319

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            + F+YN     +   +   C CG   C G
Sbjct: 320  LVFNYNV---DRYGADPQPCYCGEPNCSG 345


>gi|27477095|ref|NP_758859.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific isoform a [Homo sapiens]
 gi|16755530|gb|AAL27991.1|AF380302_1 androgen receptor-associated coregulator 267-a [Homo sapiens]
 gi|119605437|gb|EAW85031.1| nuclear receptor binding SET domain protein 1, isoform CRA_a [Homo
            sapiens]
          Length = 2427

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1697 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1741

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 1742 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1798

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1799 TVCKCGAPNCSG 1810


>gi|170058057|ref|XP_001864756.1| Mll1 protein [Culex quinquefasciatus]
 gi|167877297|gb|EDS40680.1| Mll1 protein [Culex quinquefasciatus]
          Length = 114

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 37/81 (45%), Positives = 46/81 (56%), Gaps = 5/81 (6%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            VVDA  + N A  I HSC PNC +KV  + GH  I I+ +R I  GEE+T+DY    E  
Sbjct: 39   VVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELTYDYKFPFEDV 98

Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
            +      C CGS+ CR  YLN
Sbjct: 99   K----IPCSCGSKKCR-KYLN 114


>gi|380815578|gb|AFE79663.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific isoform a [Macaca mulatta]
 gi|383420747|gb|AFH33587.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific isoform a [Macaca mulatta]
          Length = 2426

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1696 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1740

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 1741 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1797

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1798 TVCKCGAPNCSG 1809


>gi|187956219|gb|AAI50629.1| Nuclear receptor binding SET domain protein 1 [Homo sapiens]
          Length = 2427

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1697 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1741

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 1742 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1798

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1799 TVCKCGAPNCSG 1810


>gi|395861196|ref|XP_003802879.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific-like [Otolemur garnettii]
          Length = 2410

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1682 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1726

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 1727 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1783

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1784 TVCKCGAPNCSG 1795


>gi|413916020|gb|AFW55952.1| putative SET-domain containing protein family [Zea mays]
          Length = 710

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 46/129 (35%), Positives = 60/129 (46%), Gaps = 19/129 (14%)

Query: 1906 DDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDA 1965
            +DFV+E++G++        +   IR  Q             YL R   D       VVDA
Sbjct: 600  EDFVIEYVGQLI-----HRRVSDIRESQYEKSGIGSS----YLFRLDDD------FVVDA 644

Query: 1966 MHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE 2025
              +   A  I HSC PNC  KV  VDG  +I IY  R I+ GEEIT++Y    E K+   
Sbjct: 645  TKRGGLARFINHSCEPNCYTKVITVDGQKKIFIYAKRRIYAGEEITYNYKFPLEEKK--- 701

Query: 2026 ASVCLCGSQ 2034
               C CGS+
Sbjct: 702  -IPCHCGSR 709


>gi|355750457|gb|EHH54795.1| hypothetical protein EGM_15701 [Macaca fascicularis]
          Length = 2695

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1965 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2009

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 2010 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2066

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 2067 TVCKCGAPNCSG 2078


>gi|380815580|gb|AFE79664.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific isoform b [Macaca mulatta]
 gi|383420749|gb|AFH33588.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific isoform b [Macaca mulatta]
          Length = 2695

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1965 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2009

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 2010 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2066

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 2067 TVCKCGAPNCSG 2078


>gi|405952170|gb|EKC20012.1| Histone-lysine N-methyltransferase SETD2 [Crassostrea gigas]
          Length = 1451

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 45/155 (29%), Positives = 74/155 (47%), Gaps = 20/155 (12%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            + +V   KG+G+           DFV+E++GEV   +K F+ +  ++   K  ++     
Sbjct: 487  EAFVTDWKGMGLRAT--AALQPGDFVMEYVGEVLD-YKQFKSR--VKQQAKMGQE---HH 538

Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
            Y + L   +         V+DA +K N +  + HSC PNCE +   V+G  ++G +  + 
Sbjct: 539  YFMALNSDE---------VIDASYKGNVSRYMNHSCDPNCETQKWTVNGVLRVGFFVKKA 589

Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +    E+ FDY      K   EA  C CGS+ CRG
Sbjct: 590  VEPLTELNFDYQFERYGK---EAQKCFCGSENCRG 621


>gi|441595720|ref|XP_004087266.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
            H3 lysine-36 and H4 lysine-20 specific [Nomascus
            leucogenys]
          Length = 2697

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1967 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2011

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 2012 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2068

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 2069 TVCKCGAPNCSG 2080


>gi|345493936|ref|XP_003427184.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2 [Nasonia
            vitripennis]
          Length = 1317

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 76/148 (51%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G+V  +    G+  F++E++GEV       E +  +R LQ+  E     +Y + ++ 
Sbjct: 951  RGWGLVSLEPIKHGQ--FIIEYVGEVID-----EAEYKLR-LQQKKERKNENYYFLTIDN 1002

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K N +  + HSC+PNCE +   V+G  +IG++ +R I  GEE+
Sbjct: 1003 SR---------MIDAEPKGNLSRFMNHSCQPNCETQKWKVNGDTRIGLFALRDIEPGEEL 1053

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN   + +       CLC +  C G
Sbjct: 1054 TFNYNLACDGETR---KPCLCKAPNCSG 1078


>gi|118918400|ref|NP_032765.3| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Mus musculus]
          Length = 2691

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1967 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2011

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 2012 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2068

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 2069 TVCKCGAPNCSG 2080


>gi|145588193|ref|YP_001154790.1| nuclear protein SET [Polynucleobacter necessarius subsp. asymbioticus
            QLW-P1DMWA-1]
 gi|145046599|gb|ABP33226.1| nuclear protein SET [Polynucleobacter necessarius subsp. asymbioticus
            QLW-P1DMWA-1]
          Length = 164

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 47/141 (33%), Positives = 71/141 (50%), Gaps = 25/141 (17%)

Query: 1909 VVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHK 1968
            ++E+ GE    WK  EK+        + +DP   FY   LE         D  V+DA + 
Sbjct: 40   IIEYKGERIS-WKLAEKRH-----PHDPKDPNHTFY-FSLE---------DGRVIDAKYG 83

Query: 1969 ANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES------KE 2022
             N A  I HSC+P+CE +  + +G  ++ IY  R +  GEE+ +DY+   E       K+
Sbjct: 84   GNAARWINHSCKPSCETREDSFEGKPRVFIYAKRSLKVGEELFYDYSLDVEGRISKQMKK 143

Query: 2023 EYEASVCLCGSQVCRGSYLNL 2043
            +YE   C CG++ CRG+ L L
Sbjct: 144  DYE---CRCGAKKCRGTMLAL 161


>gi|403290056|ref|XP_003936149.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Saimiri boliviensis boliviensis]
          Length = 2697

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1967 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2011

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 2012 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2068

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 2069 TVCKCGAPNCSG 2080


>gi|417407050|gb|JAA50158.1| Putative histone-lysine n-methyltransferase h3 lysine-36 and h4
            lysine-20 specific [Desmodus rotundus]
          Length = 2699

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1969 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2013

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 2014 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2070

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 2071 TVCKCGAPNCSG 2082


>gi|410949106|ref|XP_003981265.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Felis catus]
          Length = 2432

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1701 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1745

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 1746 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1802

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1803 TVCKCGAPNCSG 1814


>gi|440898362|gb|ELR49876.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Bos grunniens mutus]
          Length = 2698

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1969 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2013

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 2014 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2070

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 2071 TVCKCGAPNCSG 2082


>gi|426229361|ref|XP_004008759.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Ovis aries]
          Length = 2698

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1969 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2013

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 2014 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2070

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 2071 TVCKCGAPNCSG 2082


>gi|410216828|gb|JAA05633.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
 gi|410260118|gb|JAA18025.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
          Length = 2428

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1698 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1742

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 1743 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1799

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1800 TVCKCGAPNCSG 1811


>gi|296193510|ref|XP_002806650.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
            H3 lysine-36 and H4 lysine-20 specific [Callithrix
            jacchus]
          Length = 2692

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1966 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2010

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 2011 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2067

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 2068 TVCKCGAPNCSG 2079


>gi|19923586|ref|NP_071900.2| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific isoform b [Homo sapiens]
 gi|32469769|sp|Q96L73.1|NSD1_HUMAN RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific; AltName: Full=Androgen receptor
            coactivator 267 kDa protein; AltName: Full=Androgen
            receptor-associated protein of 267 kDa; AltName:
            Full=H3-K36-HMTase; AltName: Full=H4-K20-HMTase; AltName:
            Full=Lysine N-methyltransferase 3B; AltName: Full=Nuclear
            receptor-binding SET domain-containing protein 1;
            Short=NR-binding SET domain-containing protein
 gi|17530097|gb|AAL40694.1|AF395588_1 putative nuclear protein NSD1 [Homo sapiens]
 gi|16751269|gb|AAL06645.1| androgen receptor associated coregulator 267-b [Homo sapiens]
 gi|119605438|gb|EAW85032.1| nuclear receptor binding SET domain protein 1, isoform CRA_b [Homo
            sapiens]
          Length = 2696

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1966 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2010

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 2011 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2067

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 2068 TVCKCGAPNCSG 2079


>gi|410303854|gb|JAA30527.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
 gi|410341931|gb|JAA39912.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
          Length = 2428

 Score = 65.5 bits (158), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1698 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1742

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 1743 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1799

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1800 TVCKCGAPNCSG 1811


>gi|410216830|gb|JAA05634.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
 gi|410260120|gb|JAA18026.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
          Length = 2697

 Score = 65.1 bits (157), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1967 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2011

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 2012 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2068

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 2069 TVCKCGAPNCSG 2080


>gi|402873563|ref|XP_003900641.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific, partial [Papio anubis]
          Length = 2343

 Score = 65.1 bits (157), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1610 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1654

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 1655 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1711

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1712 TVCKCGAPNCSG 1723


>gi|355691890|gb|EHH27075.1| hypothetical protein EGK_17188 [Macaca mulatta]
          Length = 2695

 Score = 65.1 bits (157), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1965 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2009

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 2010 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2066

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 2067 TVCKCGAPNCSG 2078


>gi|351708443|gb|EHB11362.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Heterocephalus glaber]
          Length = 2698

 Score = 65.1 bits (157), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1966 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2010

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 2011 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2067

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 2068 TVCKCGAPNCSG 2079


>gi|119605439|gb|EAW85033.1| nuclear receptor binding SET domain protein 1, isoform CRA_c [Homo
            sapiens]
          Length = 2593

 Score = 65.1 bits (157), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1863 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1907

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 1908 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1964

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1965 TVCKCGAPNCSG 1976


>gi|114603589|ref|XP_527132.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific isoform 8 [Pan troglodytes]
 gi|397470588|ref|XP_003806901.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Pan paniscus]
 gi|410303856|gb|JAA30528.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
 gi|410341933|gb|JAA39913.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
          Length = 2697

 Score = 65.1 bits (157), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1967 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2011

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 2012 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2068

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 2069 TVCKCGAPNCSG 2080


>gi|68565655|sp|O88491.1|NSD1_MOUSE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific; AltName: Full=H3-K36-HMTase; AltName:
            Full=H4-K20-HMTase; AltName: Full=Nuclear
            receptor-binding SET domain-containing protein 1;
            Short=NR-binding SET domain-containing protein
 gi|3329465|gb|AAC40182.1| NSD1 protein [Mus musculus]
          Length = 2588

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1864 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1908

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 1909 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1965

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1966 TVCKCGAPNCSG 1977


>gi|301615056|ref|XP_002936997.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Xenopus (Silurana) tropicalis]
          Length = 2440

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 57/225 (25%), Positives = 101/225 (44%), Gaps = 24/225 (10%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            +    +G G+ C  +   GE  FV E++GE+       ++++    ++   E     FY 
Sbjct: 1766 FRTLSRGWGLRCRTDIKKGE--FVNEYVGEM------IDEEECRARIRYAQEQDITNFYM 1817

Query: 1946 IYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIH 2005
            + L++ +         V+DA  K N+A  + H C+PNCE +   V+G  ++G++ +  I 
Sbjct: 1818 LTLDKDR---------VIDAGPKGNFARFMNHCCQPNCETQKWTVNGDTRVGLFALCDIK 1868

Query: 2006 YGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQL 2065
               E+TF+YN       +   +VC CG+  C G +L +  +   + V  E  G   +  +
Sbjct: 1869 AXVELTFNYNLECLGNGK---TVCKCGAPNCSG-FLGVRPKN--QPVSSEDKGKKRKQYV 1922

Query: 2066 MLEACELNSVSEEDYLELGRAG-LGSCLLGGLPNWVVAYSARLVR 2109
              +  E+    E++    G  G L SC   G P    A   +L R
Sbjct: 1923 KRKKSEVVKEHEDECFSCGDGGQLVSCKKPGCPKVYHAECLKLTR 1967


>gi|70571511|dbj|BAE06763.1| zinc finger protein [Ciona intestinalis]
          Length = 709

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 45/144 (31%), Positives = 71/144 (49%), Gaps = 18/144 (12%)

Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
            E  F++E++GEV       E++   R+++  N +   + Y + LE            V+D
Sbjct: 1    EGQFLLEYVGEVVS-----EREFRRRTIE--NYNAHNDHYCVQLEAG---------TVID 44

Query: 1965 AMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY 2024
                AN    + HSC+PNCE +   V+G Y++G++  R I   EE+T+DYN    + +  
Sbjct: 45   GYRLANEGRFVNHSCQPNCEMQKWVVNGEYRVGLFAKRPIVGSEELTYDYNFHAYNLDRQ 104

Query: 2025 EASVCLCGSQVCRGSYLNLTGEGA 2048
            +   C CGS  CRG     T  GA
Sbjct: 105  QP--CRCGSSECRGVIGGKTQRGA 126


>gi|410904194|ref|XP_003965577.1| PREDICTED: histone-lysine N-methyltransferase SETD1B-A-like, partial
            [Takifugu rubripes]
          Length = 109

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 46/78 (58%), Gaps = 4/78 (5%)

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E  
Sbjct: 34   IIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSRQPINVNEEITYDYKFPIEDV 93

Query: 2022 EEYEASVCLCGSQVCRGS 2039
            +      CLCG++ CRG+
Sbjct: 94   K----IPCLCGAENCRGT 107


>gi|301785552|ref|XP_002928188.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific-like [Ailuropoda melanoleuca]
 gi|281342107|gb|EFB17691.1| hypothetical protein PANDA_018107 [Ailuropoda melanoleuca]
          Length = 2699

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1970 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2014

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 2015 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2071

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 2072 TVCKCGAPNCSG 2083


>gi|148709230|gb|EDL41176.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_b [Mus
            musculus]
          Length = 2382

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1658 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1702

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 1703 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1759

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1760 TVCKCGAPNCSG 1771


>gi|73953273|ref|XP_865778.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific isoform 5 [Canis lupus familiaris]
          Length = 2698

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1967 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2011

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 2012 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2068

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 2069 TVCKCGAPNCSG 2080


>gi|350580826|ref|XP_003123715.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific, partial [Sus scrofa]
          Length = 2392

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1661 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1705

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 1706 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1762

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1763 TVCKCGAPNCSG 1774


>gi|196013861|ref|XP_002116791.1| hypothetical protein TRIADDRAFT_31338 [Trichoplax adhaerens]
 gi|190580769|gb|EDV20850.1| hypothetical protein TRIADDRAFT_31338 [Trichoplax adhaerens]
          Length = 725

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 55/100 (55%), Gaps = 8/100 (8%)

Query: 1943 FYNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIG 1997
            +  I + + KG+ + Y L     V++DA  K N A  + HSC+PNCE     V+G   IG
Sbjct: 483  YQRIKMAQSKGEKNFYMLNIDKDVIIDAGQKGNLARFMNHSCQPNCETHKWTVNGLTCIG 542

Query: 1998 IYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
            ++ +  I  GEE+TFDY       ++ E   C CGS++CR
Sbjct: 543  LFAIDDIKQGEELTFDYRLHAVGNDQAE---CHCGSKLCR 579


>gi|444706655|gb|ELW47981.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Tupaia chinensis]
          Length = 2687

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1958 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2002

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 2003 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2059

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 2060 TVCKCGAPNCSG 2071


>gi|148709229|gb|EDL41175.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_a [Mus
            musculus]
          Length = 2588

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1864 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1908

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 1909 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1965

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1966 TVCKCGAPNCSG 1977


>gi|354471955|ref|XP_003498206.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Cricetulus griseus]
          Length = 2690

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1969 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2013

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 2014 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2070

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 2071 TVCKCGAPNCSG 2082


>gi|326671180|ref|XP_694414.5| PREDICTED: histone-lysine N-methyltransferase NSD3 [Danio rerio]
          Length = 1562

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 40/132 (30%), Positives = 67/132 (50%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            DFV+E++GE+       + ++  + ++  NE+    FY + L + +         V+DA 
Sbjct: 1282 DFVMEYVGEL------IDSEECKQRIRTANENHVTNFYMLTLTKDR---------VIDAG 1326

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K N +  + HSC PNCE +   V+G  +IG++T+  I    E+TF+YN           
Sbjct: 1327 PKGNLSRFMNHSCSPNCETQKWTVNGDVRIGLFTLCDISADTELTFNYNLDCLGNGR--- 1383

Query: 2027 SVCLCGSQVCRG 2038
            + C CGS+ C G
Sbjct: 1384 TSCHCGSENCSG 1395


>gi|297676794|ref|XP_002816309.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific isoform 1 [Pongo abelii]
          Length = 2697

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1967 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2011

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 2012 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2068

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 2069 TVCKCGAPNCSG 2080


>gi|119895257|ref|XP_592234.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific, partial [Bos taurus]
          Length = 2389

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1660 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1704

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 1705 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1761

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1762 TVCKCGAPNCSG 1773


>gi|395736540|ref|XP_003776772.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific isoform 2 [Pongo abelii]
          Length = 2594

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1864 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1908

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 1909 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1965

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1966 TVCKCGAPNCSG 1977


>gi|291387890|ref|XP_002710469.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 2
            [Oryctolagus cuniculus]
          Length = 2431

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1700 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1744

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 1745 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1801

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1802 TVCKCGAPNCSG 1813


>gi|52545752|emb|CAH56331.1| hypothetical protein [Homo sapiens]
          Length = 881

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 151  EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 195

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 196  PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 252

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 253  TVCKCGAPNCSG 264


>gi|291001085|ref|XP_002683109.1| predicted protein [Naegleria gruberi]
 gi|284096738|gb|EFC50365.1| predicted protein [Naegleria gruberi]
          Length = 147

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 34/78 (43%), Positives = 46/78 (58%), Gaps = 6/78 (7%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYN-SVTE 2019
            +++DA  + N A  I HSC PNC A++  VD   ++ IY +R I  GEEIT+DY   + E
Sbjct: 71   LIIDATKRGNLARFINHSCDPNCCARIIEVDKQKKVCIYALRKILVGEEITYDYKFPIEE 130

Query: 2020 SKEEYEASVCLCGSQVCR 2037
            SK       C CGSQ C+
Sbjct: 131  SKIP-----CKCGSQKCK 143


>gi|149039889|gb|EDL94005.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
            CRA_b [Rattus norvegicus]
          Length = 2586

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1861 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1905

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 1906 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1962

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1963 TVCKCGAPNCSG 1974


>gi|290985403|ref|XP_002675415.1| predicted protein [Naegleria gruberi]
 gi|284089011|gb|EFC42671.1| predicted protein [Naegleria gruberi]
          Length = 438

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 43/148 (29%), Positives = 73/148 (49%), Gaps = 18/148 (12%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG+GV CN++    +  F+ E++GEV  V       D   +  K +   +   Y + +  
Sbjct: 266  KGIGVKCNQDV-IKKGTFITEYVGEVISV-------DKFETRTKRSYKKSLHHYCMNMNE 317

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA    N A  I HSC PN   +   V+G  ++GI+ ++ I  GEEI
Sbjct: 318  NE---------IIDATWMGNIARFINHSCAPNARTQTWDVNGQNRVGIFAIKDIVKGEEI 368

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            T++YN +  + +E +   C CG+  C+G
Sbjct: 369  TYNYNFLIYN-DETKQQECKCGAPNCQG 395


>gi|344265319|ref|XP_003404732.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Loxodonta africana]
          Length = 2702

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1970 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2014

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 2015 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2071

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 2072 TVCKCGAPNCSG 2083


>gi|194907101|ref|XP_001981487.1| GG12082 [Drosophila erecta]
 gi|190656125|gb|EDV53357.1| GG12082 [Drosophila erecta]
          Length = 1441

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 52/186 (27%), Positives = 87/186 (46%), Gaps = 33/186 (17%)

Query: 1880 SRPDDKYVAYRKG--LGVVCNKEGGFG--------EDDFVVEFLGEVYPVWKWFEKQDGI 1929
            SR +++    RK   L VV   E GFG        E DFV+E++GEV       E Q  +
Sbjct: 1225 SRCENQMFEQRKSPRLEVVYMNERGFGLVNREPIAEGDFVIEYVGEVI---NHAEFQRRM 1281

Query: 1930 RSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTA 1989
               Q+  ++    +Y + +E+           ++DA  K N A  + HSC PNCE +   
Sbjct: 1282 EQKQRGRDE---NYYFLGVEKD---------FIIDAGPKGNLARFMNHSCEPNCETQKWT 1329

Query: 1990 VDGHYQIGIYTVRGIHYGEEITFDY---NSVTESKEEYEASVCLCGSQVCRGSYLNLTGE 2046
            V+  +++G++ ++ I    E+TF+Y   + +  SK+      C CG+  C G       +
Sbjct: 1330 VNCIHRVGLFAIKDIPANTELTFNYLWDDLMNNSKK-----ACFCGATRCSGEIGGKLKD 1384

Query: 2047 GAFEKV 2052
            GA ++ 
Sbjct: 1385 GAVKET 1390


>gi|168025972|ref|XP_001765507.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683357|gb|EDQ69768.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 993

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 45/138 (32%), Positives = 67/138 (48%), Gaps = 26/138 (18%)

Query: 1905 EDDFVVEFLGEVY---PVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLV 1961
            +D+FV+E+ GEV       K   +  G RS+            N Y+     D       
Sbjct: 837  KDEFVIEYTGEVIDDAMCEKRLWEMKGRRSI-----------CNFYMCEIAKD------F 879

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            ++DA  K N +  + HSC+PNC  +   VDG  ++G++  R I  GEE+T+DY  V    
Sbjct: 880  IIDATRKGNASRYLNHSCQPNCRLEKWRVDGETRVGVFAGRNIIAGEELTYDYKYV---- 935

Query: 2022 EEYEASV-CLCGSQVCRG 2038
             E+  +V C CG+  CRG
Sbjct: 936  -EFGPNVKCRCGAPNCRG 952


>gi|157822347|ref|NP_001100807.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Rattus norvegicus]
 gi|149039888|gb|EDL94004.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
            CRA_a [Rattus norvegicus]
          Length = 2381

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1656 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1700

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 1701 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1757

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1758 TVCKCGAPNCSG 1769


>gi|357163489|ref|XP_003579748.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like
            [Brachypodium distachyon]
          Length = 517

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 47/150 (31%), Positives = 72/150 (48%), Gaps = 24/150 (16%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G++  +    G+  FV+E+ GEV   WK     +  R  Q   +    E Y IYL  
Sbjct: 93   RGWGLLAEENIMAGQ--FVIEYCGEVIS-WK-----EAKRRSQAYEDQGLMEAYIIYLNT 144

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +          +DA  K + A  I HSC+PNCE +   V G  ++GI+  + I  G E+
Sbjct: 145  AES---------IDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFAKQDIPIGMEL 195

Query: 2011 TFDYNSVTESKEEYEASV--CLCGSQVCRG 2038
            ++DYN      E +  ++  CLCG+  C G
Sbjct: 196  SYDYNF-----EWFGGAIVRCLCGAASCSG 220


>gi|348574862|ref|XP_003473209.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
            H3 lysine-36 and H4 lysine-20 specific-like [Cavia
            porcellus]
          Length = 2509

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1779 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1823

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 1824 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1880

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1881 TVCKCGAPNCSG 1892


>gi|456062341|ref|YP_007501311.1| Nuclear protein SET [beta proteobacterium CB]
 gi|455439638|gb|AGG32576.1| Nuclear protein SET [beta proteobacterium CB]
          Length = 164

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 47/141 (33%), Positives = 71/141 (50%), Gaps = 25/141 (17%)

Query: 1909 VVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHK 1968
            ++E+ GE    WK  EK+        + +DP   FY   LE         D   +DA + 
Sbjct: 40   IIEYKGERIS-WKLAEKRH-----PHDPKDPNHTFY-FSLE---------DGRCIDAKYG 83

Query: 1969 ANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES------KE 2022
             N A  I HSC+P+CE +  + DG  ++ IY  R +  GEE+ +DY+   E       K+
Sbjct: 84   GNAARWINHSCKPSCETREDSFDGEPRVFIYAKRNLKLGEELFYDYSLDVEGRITKQMKK 143

Query: 2023 EYEASVCLCGSQVCRGSYLNL 2043
            +YE   C CG++ CRG+ L+L
Sbjct: 144  DYE---CRCGAKKCRGTMLSL 161


>gi|291387888|ref|XP_002710468.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 1
            [Oryctolagus cuniculus]
          Length = 2700

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1969 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2013

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 2014 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2070

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 2071 TVCKCGAPNCSG 2082


>gi|344240382|gb|EGV96485.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Cricetulus griseus]
          Length = 2318

 Score = 65.1 bits (157), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1597 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1641

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 1642 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1698

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1699 TVCKCGAPNCSG 1710


>gi|453087448|gb|EMF15489.1| hypothetical protein SEPMUDRAFT_161660 [Mycosphaerella populorum
            SO2202]
          Length = 966

 Score = 65.1 bits (157), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 50/172 (29%), Positives = 76/172 (44%), Gaps = 23/172 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+  N E     +DF+ E++GEV    K F  +     L + +E+    FY  ++ 
Sbjct: 215  KKGYGLRANTE--LQANDFIFEYIGEVIGE-KTFRNR-----LHQYDEEGIKHFY--FMS 264

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              KG+        VDA  K N      HSC PNC      V    ++GI+  R IH GEE
Sbjct: 265  LSKGE-------FVDATKKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFAERKIHAGEE 317

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLD 2061
            + F+YN     +   +   C C    C G    + G+   E+  K  H +++
Sbjct: 318  LVFNYNV---DRYGADPQPCYCDEPNCTGF---IGGKTQTERATKLSHTIIE 363


>gi|94732456|emb|CAK03662.1| novel protein similar to vertebrate Wolf-Hirschhorn syndrome
            candidate 1 (WHSC1) [Danio rerio]
          Length = 728

 Score = 65.1 bits (157), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 43/148 (29%), Positives = 78/148 (52%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G++  ++   GE  FV E++GE+       ++++    ++   E+    FY + +++
Sbjct: 431  KGWGLISLRDIKKGE--FVNEYVGEL------IDEEECRSRIRHAQENDITHFYMLTIDK 482

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE +   V+G  ++G++ V  I  G E+
Sbjct: 483  DR---------IIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPAGTEL 533

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 534  TFNYNLDCLGNEK---TVCRCGAPNCSG 558


>gi|118101388|ref|XP_424390.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2 [Gallus
            gallus]
          Length = 1386

 Score = 65.1 bits (157), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 46/169 (27%), Positives = 85/169 (50%), Gaps = 22/169 (13%)

Query: 1882 PDDKYVAY-RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1094 PDAEIIKTDRRGWGLRTKRNIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1145

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1146 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1196

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF 2049
            +  I  G E+TF+YN         E   C CG++ C G +L +  + AF
Sbjct: 1197 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGAENCSG-FLGVRPKTAF 1241


>gi|118101386|ref|XP_001232891.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Gallus
            gallus]
          Length = 1436

 Score = 65.1 bits (157), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 46/169 (27%), Positives = 85/169 (50%), Gaps = 22/169 (13%)

Query: 1882 PDDKYVAY-RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1144 PDAEIIKTDRRGWGLRTKRNIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1195

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1196 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1246

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF 2049
            +  I  G E+TF+YN         E   C CG++ C G +L +  + AF
Sbjct: 1247 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGAENCSG-FLGVRPKTAF 1291


>gi|113470951|gb|ABI34877.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Danio rerio]
          Length = 129

 Score = 64.7 bits (156), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 40/132 (30%), Positives = 67/132 (50%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            DFV+E++GE+       + ++  + ++  NE+    FY + L + +         V+DA 
Sbjct: 11   DFVMEYVGEL------IDSEECKQRIRTANENHVTNFYMLTLTKDR---------VIDAG 55

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K N +  + HSC PNCE +   V+G  +IG++T+  I    E+TF+YN           
Sbjct: 56   PKGNLSRFMNHSCSPNCETQKWTVNGDVRIGLFTLCDISADTELTFNYNLDCLGNGR--- 112

Query: 2027 SVCLCGSQVCRG 2038
            + C CGS+ C G
Sbjct: 113  TSCHCGSENCSG 124


>gi|21392158|gb|AAM48433.1| RE61305p [Drosophila melanogaster]
          Length = 1016

 Score = 64.7 bits (156), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 42/151 (27%), Positives = 74/151 (49%), Gaps = 25/151 (16%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G+V  +    G  DFV+E++GEV          +  R +++   D    +Y + +E+
Sbjct: 833  RGFGLVNREPIAVG--DFVIEYVGEV------INHAEFQRRMEQKQRDRDENYYFLGVEK 884

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
                       ++DA  K N A  + HSC PNCE +   V+  +++GI+ ++ I    E+
Sbjct: 885  D---------FIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIKDIPVNSEL 935

Query: 2011 TFDY---NSVTESKEEYEASVCLCGSQVCRG 2038
            TF+Y   + +  SK+      C CG++ C G
Sbjct: 936  TFNYLWDDLMNNSKK-----ACFCGAKRCSG 961


>gi|195037347|ref|XP_001990122.1| GH19166 [Drosophila grimshawi]
 gi|193894318|gb|EDV93184.1| GH19166 [Drosophila grimshawi]
          Length = 1434

 Score = 64.7 bits (156), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 39/148 (26%), Positives = 69/148 (46%), Gaps = 19/148 (12%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G+VC +     E DF++E++GEV          +  R + +   D    FY + +E+
Sbjct: 1244 RGFGLVCRE--AIAEGDFIIEYVGEV------INHAEFQRRVAQKTNDRDENFYFLGVEK 1295

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
                       ++DA  K N A  + HSC PNC  +   V+   ++G++ ++ I    E+
Sbjct: 1296 D---------FIIDAGPKGNLARFMNHSCEPNCATQKWTVNCINRVGLFAIKDIPENTEL 1346

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+Y  + +         C CG++ C G
Sbjct: 1347 TFNY--LWDDLMNNGKKACFCGAKRCSG 1372


>gi|224080887|ref|XP_002197925.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Taeniopygia
            guttata]
          Length = 1435

 Score = 64.7 bits (156), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 46/169 (27%), Positives = 85/169 (50%), Gaps = 22/169 (13%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1143 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1194

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1195 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1245

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF 2049
            +  I  G E+TF+YN         E   C CG++ C G +L +  + AF
Sbjct: 1246 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGAENCSG-FLGVRPKTAF 1290


>gi|326932813|ref|XP_003212507.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like isoform 1
            [Meleagris gallopavo]
          Length = 1436

 Score = 64.7 bits (156), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 46/169 (27%), Positives = 85/169 (50%), Gaps = 22/169 (13%)

Query: 1882 PDDKYVAY-RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1144 PDAEIIKTDRRGWGLRTKRNIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1195

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1196 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1246

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF 2049
            +  I  G E+TF+YN         E   C CG++ C G +L +  + AF
Sbjct: 1247 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGAENCSG-FLGVRPKTAF 1291


>gi|224095256|ref|XP_002310367.1| SET domain protein [Populus trichocarpa]
 gi|222853270|gb|EEE90817.1| SET domain protein [Populus trichocarpa]
          Length = 281

 Score = 64.7 bits (156), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 69/146 (47%), Gaps = 21/146 (14%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+V +++   GE  FV+E++GEV       + +     L K        FY   + R 
Sbjct: 38   GSGIVADEDIKQGE--FVIEYVGEV------IDDKTCEERLWKMKHCGETNFYLCEINRD 89

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
                     +V+DA +K N +  I HSC PN E +   +DG  +IGI+  R I  GE +T
Sbjct: 90   ---------MVIDATYKGNKSRYINHSCSPNTEMQKWIIDGETRIGIFATRDIRKGEHLT 140

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCR 2037
            +DY  V    ++     C CGS  CR
Sbjct: 141  YDYQFVQFGADQ----DCHCGSSGCR 162


>gi|326932815|ref|XP_003212508.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like isoform 2
            [Meleagris gallopavo]
          Length = 1386

 Score = 64.7 bits (156), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 46/169 (27%), Positives = 85/169 (50%), Gaps = 22/169 (13%)

Query: 1882 PDDKYVAY-RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1094 PDAEIIKTDRRGWGLRTKRNIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1145

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1146 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1196

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF 2049
            +  I  G E+TF+YN         E   C CG++ C G +L +  + AF
Sbjct: 1197 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGAENCSG-FLGVRPKTAF 1241


>gi|256084142|ref|XP_002578291.1| SET domain protein [Schistosoma mansoni]
          Length = 1746

 Score = 64.7 bits (156), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 39/132 (29%), Positives = 67/132 (50%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++G++       ++ +  R L+  +E+    +Y + L+  +         ++DA 
Sbjct: 1051 EFVNEYIGDL------IDEDEANRRLRFAHENNITNYYMMKLDSQR---------IIDAG 1095

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K N +  + HSC PN   +   V+G  +IG++ VR I  GEE+TF+YN V   +E    
Sbjct: 1096 PKGNLSRFMNHSCDPNLNTQKWTVNGDNRIGLFAVRDISVGEELTFNYNFVALGQERLN- 1154

Query: 2027 SVCLCGSQVCRG 2038
              C CG+  C G
Sbjct: 1155 --CRCGASNCVG 1164


>gi|449270866|gb|EMC81514.1| Histone-lysine N-methyltransferase NSD3 [Columba livia]
          Length = 1440

 Score = 64.7 bits (156), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 46/169 (27%), Positives = 85/169 (50%), Gaps = 22/169 (13%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1148 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1199

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1200 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1250

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF 2049
            +  I  G E+TF+YN         E   C CG++ C G +L +  + AF
Sbjct: 1251 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGAENCSG-FLGVRPKSAF 1295


>gi|124513208|ref|XP_001349960.1| SET domain protein, putative [Plasmodium falciparum 3D7]
 gi|23615377|emb|CAD52368.1| SET domain protein, putative [Plasmodium falciparum 3D7]
          Length = 2548

 Score = 64.3 bits (155), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 52/159 (32%), Positives = 79/159 (49%), Gaps = 17/159 (10%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            + G GV C ++   GE   + E++GEV    + FEK+  +   Q+  E    + YN Y+ 
Sbjct: 2128 KTGYGVFCKRDIKNGE--LICEYVGEVLG-KREFEKR--LEVYQE--ESKKTDMYNWYII 2180

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
            +   D      V +D+  K + +  I HSC PN  ++   V G Y+IGI+ +R I  GEE
Sbjct: 2181 QINKD------VYIDSGKKGSISRFINHSCSPNSVSQKWIVRGFYRIGIFALRDIPSGEE 2234

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGA 2048
            IT++Y S       +E   CLC S  C   +L   GE +
Sbjct: 2235 ITYNY-SYNFLFNNFE---CLCKSPNCMNYHLLKKGESS 2269


>gi|94312468|ref|YP_585678.1| putative histone-lysine N-methyltransferase [Cupriavidus
            metallidurans CH34]
 gi|430804722|ref|ZP_19431837.1| putative histone-lysine N-methyltransferase [Cupriavidus sp. HMR-1]
 gi|93356320|gb|ABF10409.1| putative histone-lysine N-methyltransferase [Cupriavidus
            metallidurans CH34]
 gi|429503042|gb|ELA01344.1| putative histone-lysine N-methyltransferase [Cupriavidus sp. HMR-1]
          Length = 170

 Score = 64.3 bits (155), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 48/147 (32%), Positives = 71/147 (48%), Gaps = 29/147 (19%)

Query: 1901 GGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL 1960
            G   E + V+E+ GE +  WK         +L+++  DP    +  Y     GD      
Sbjct: 41   GQIAEGERVIEYKGE-HISWK--------EALKRHPHDPNDPNHTFYFSLDDGD------ 85

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA    N A  I H+C PNCEA+    +   ++ I+ +R I  GEE+ +DY  V ++
Sbjct: 86   -VIDAKFGGNRARWINHACDPNCEAR----EKKGRVFIHALRDIEPGEELFYDYGLVIDA 140

Query: 2021 ------KEEYEASVCLCGSQVCRGSYL 2041
                  K+EYE   C CGS  CRG+ L
Sbjct: 141  RYTKKLKQEYE---CRCGSPKCRGTML 164


>gi|51849607|dbj|BAD42330.1| hypothetical protein [Nannochloris bacillaris]
          Length = 334

 Score = 64.3 bits (155), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 47/150 (31%), Positives = 70/150 (46%), Gaps = 24/150 (16%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+   ++   G+  F+VE++GEV       E+++  R           EFY    +R
Sbjct: 145  KGFGLFAAEDVKAGQ--FIVEYVGEV------LEEEEYARR---------KEFYIATGQR 187

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
                 +  +  V+DA  +      I HSC PNCE +   V G   IG++ +  +  G  +
Sbjct: 188  HYYFMNVGNGEVIDAARRGGLGRFINHSCEPNCETQKWVVRGELAIGLFALEDVPAGSVL 247

Query: 2011 TFDYNSVTESKEEY--EASVCLCGSQVCRG 2038
            TFDYN      E Y  +   CLCGS+ CRG
Sbjct: 248  TFDYNF-----ERYGDKPMKCLCGSKACRG 272


>gi|302825340|ref|XP_002994293.1| hypothetical protein SELMODRAFT_432224 [Selaginella moellendorffii]
 gi|300137824|gb|EFJ04637.1| hypothetical protein SELMODRAFT_432224 [Selaginella moellendorffii]
          Length = 820

 Score = 64.3 bits (155), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 43/132 (32%), Positives = 66/132 (50%), Gaps = 19/132 (14%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            DF++E++GEV       E+   ++   +NN      FY   +        G+D  V+DA 
Sbjct: 200  DFLIEYIGEVIDDKTCEERLWDLKERGENN------FYLCEV--------GHD-KVIDAT 244

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K N +  I HSC PN + +    DG  +IG++ V  I  G+EIT+DY  +    E+   
Sbjct: 245  FKGNMSRFINHSCNPNAQLRKWQCDGELRIGVFAVSRILKGQEITYDYKYIQFGTEQQ-- 302

Query: 2027 SVCLCGSQVCRG 2038
              C CGS+ C+G
Sbjct: 303  --CHCGSKNCKG 312


>gi|10438794|dbj|BAB15346.1| unnamed protein product [Homo sapiens]
          Length = 1069

 Score = 64.3 bits (155), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 339  EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 383

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 384  PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 440

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 441  TVCKCGAPNCSG 452


>gi|195395005|ref|XP_002056127.1| GJ10771 [Drosophila virilis]
 gi|194142836|gb|EDW59239.1| GJ10771 [Drosophila virilis]
          Length = 1430

 Score = 64.3 bits (155), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 39/148 (26%), Positives = 69/148 (46%), Gaps = 19/148 (12%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G+VC +     E DF++E++GEV          +  R + +   D    FY + +E+
Sbjct: 1236 RGFGLVCREP--IAEGDFIIEYVGEV------INHAEFQRRMAQKTRDRDENFYFLGVEK 1287

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
                       ++DA  K N A  + HSC PNC  +   V+   ++G++ ++ I    E+
Sbjct: 1288 D---------FIIDAGPKGNLARFMNHSCEPNCATQKWTVNCINRVGLFAIKDIPENTEL 1338

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+Y  + +         C CG++ C G
Sbjct: 1339 TFNY--LWDDLMNNGKKACFCGAKRCSG 1364


>gi|195108992|ref|XP_001999076.1| GI23270 [Drosophila mojavensis]
 gi|193915670|gb|EDW14537.1| GI23270 [Drosophila mojavensis]
          Length = 1433

 Score = 64.3 bits (155), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 41/148 (27%), Positives = 72/148 (48%), Gaps = 19/148 (12%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G+VC +     E DF++E++GEV       E Q  +    KN ++    FY + +E+
Sbjct: 1221 RGFGLVCREP--IKEGDFIIEYVGEVI---NHAEFQRRMAQKTKNRDE---NFYFLGVEK 1272

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
                       ++DA  K N A  + HSC PNC  +   V+ + ++G++ ++ I    E+
Sbjct: 1273 D---------FIIDAGPKGNLARFMNHSCEPNCATQKWTVNCNNRVGLFAIKDIPENTEL 1323

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+Y  + +         C CG++ C G
Sbjct: 1324 TFNY--LWDDLMNNGKKACFCGAKRCSG 1349


>gi|353232109|emb|CCD79464.1| putative set domain protein [Schistosoma mansoni]
          Length = 1503

 Score = 64.3 bits (155), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 39/132 (29%), Positives = 67/132 (50%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++G++       ++ +  R L+  +E+    +Y + L+  +         ++DA 
Sbjct: 1051 EFVNEYIGDL------IDEDEANRRLRFAHENNITNYYMMKLDSQR---------IIDAG 1095

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K N +  + HSC PN   +   V+G  +IG++ VR I  GEE+TF+YN V   +E    
Sbjct: 1096 PKGNLSRFMNHSCDPNLNTQKWTVNGDNRIGLFAVRDISVGEELTFNYNFVALGQERLN- 1154

Query: 2027 SVCLCGSQVCRG 2038
              C CG+  C G
Sbjct: 1155 --CRCGASNCVG 1164


>gi|357116306|ref|XP_003559923.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like
            [Brachypodium distachyon]
          Length = 349

 Score = 64.3 bits (155), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 46/153 (30%), Positives = 74/153 (48%), Gaps = 35/153 (22%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYP-------VWKWFEKQDGIRSLQKNNEDPAPEFY 1944
            G G+V ++  G  + +F++E++GEV         +WK  ++Q                + 
Sbjct: 126  GFGLVADE--GIQQGEFIIEYVGEVIDDRTCEERLWK-MKRQ---------------RYT 167

Query: 1945 NIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGI 2004
            N YL     +      +V+DA +K N +  I HSC+PN E +   VDG  ++GI+ +  I
Sbjct: 168  NFYLCEVSSN------MVIDATNKGNKSRFINHSCQPNTEMQKWTVDGETRVGIFALHDI 221

Query: 2005 HYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
              GEE+T+DY  V    ++     C CGS  CR
Sbjct: 222  KKGEELTYDYKFVQFGADQ----DCHCGSSNCR 250


>gi|339327714|ref|YP_004687407.1| methyltransferase [Cupriavidus necator N-1]
 gi|338167871|gb|AEI78926.1| methyltransferase [Cupriavidus necator N-1]
          Length = 290

 Score = 64.3 bits (155), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 46/147 (31%), Positives = 72/147 (48%), Gaps = 29/147 (19%)

Query: 1901 GGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL 1960
            G   E + V+E+ GE +  WK         +L+++  DP+   +  Y     G       
Sbjct: 157  GPIAEGERVIEYKGE-HISWK--------TALERHPHDPSDPNHTFYFSLDDGS------ 201

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA +  N A  I H+C PNCEA+    +   ++ I+ +R I  GEE+ +DY  V ++
Sbjct: 202  -VIDAKYGGNRARWINHACEPNCEAR----EKKGRVFIHALRDIAQGEELFYDYGLVIDA 256

Query: 2021 ------KEEYEASVCLCGSQVCRGSYL 2041
                  K+E+E   C CGS  CRG+ L
Sbjct: 257  RYTAKLKKEFE---CRCGSPQCRGTML 280


>gi|113869618|ref|YP_728107.1| methyltransferase [Ralstonia eutropha H16]
 gi|113528394|emb|CAJ94739.1| putative methyltransferase [Ralstonia eutropha H16]
          Length = 171

 Score = 64.3 bits (155), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 46/147 (31%), Positives = 73/147 (49%), Gaps = 29/147 (19%)

Query: 1901 GGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL 1960
            G   E + V+E+ GE +  WK        ++L+++  DP+   +  Y     G       
Sbjct: 38   GQIAEGERVIEYKGE-HISWK--------KALERHPHDPSDPNHTFYFSLDDGS------ 82

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA +  N A  I H+C PNCEA+    +   ++ I+ +R I  GEE+ +DY  V ++
Sbjct: 83   -VIDAKYGGNRARWINHACEPNCEAR----EKKGRVFIHALRDIAEGEELFYDYGLVIDA 137

Query: 2021 ------KEEYEASVCLCGSQVCRGSYL 2041
                  K+E+E   C CGS  CRG+ L
Sbjct: 138  RYTAKLKKEFE---CRCGSPQCRGTML 161


>gi|256074584|ref|XP_002573604.1| huntingtin interacting protein-related [Schistosoma mansoni]
          Length = 1575

 Score = 63.9 bits (154), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 54/203 (26%), Positives = 86/203 (42%), Gaps = 33/203 (16%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            Y    KG G++       G   FV+E++GEV    ++  +      L             
Sbjct: 458  YAGKDKGWGLMATDNVKKGS--FVIEYVGEVIDFSEFRRRIRRYERL------------- 502

Query: 1946 IYLERPKGDADGYDLVV-----VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
                   G A  Y + V     +DA  K N+A  + HSC PNC  +  +V+G  +IG + 
Sbjct: 503  -------GHAHHYFMAVESDRFIDAGSKGNWARFVNHSCEPNCVTQKWSVNGEIRIGFFA 555

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLL 2060
               I  G+E+T DY  V     E +   C CG+  C G  +  T +   EKV  +   ++
Sbjct: 556  KEDIPSGQEVTIDYQFVQYGVSEQK---CYCGASTCSG-IMGATSKYLQEKVRMKDTTMV 611

Query: 2061 DRHQLMLEACELNSVSEEDYLEL 2083
            +R   +L+  +L+S    D + L
Sbjct: 612  ERR--ILQLLQLDSFRNADDITL 632


>gi|449510894|ref|XP_004186257.1| PREDICTED: LOW QUALITY PROTEIN: ash1 (absent, small, or
            homeotic)-like (Drosophila), partial [Taeniopygia
            guttata]
          Length = 519

 Score = 63.9 bits (154), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 36/99 (36%), Positives = 57/99 (57%), Gaps = 7/99 (7%)

Query: 1945 NIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIY 1999
            N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+Y
Sbjct: 1    NRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCNPNCEMQKWSVNGVYRIGLY 60

Query: 2000 TVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
             ++ +  G E+T+DYN  + + E+ +  +C CG   CRG
Sbjct: 61   ALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFDKCRG 97


>gi|449474840|ref|XP_002193971.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Taeniopygia guttata]
          Length = 1651

 Score = 63.9 bits (154), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 67/132 (50%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 695  EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 739

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +   V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 740  PKGNYARFMNHCCQPNCETQKWCVNGDTRVGLFALVNIKAGTELTFNYNLECLGNGK--- 796

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 797  TVCKCGAPNCSG 808


>gi|444511191|gb|ELV09829.1| Histone-lysine N-methyltransferase NSD3 [Tupaia chinensis]
          Length = 1235

 Score = 63.9 bits (154), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 47/175 (26%), Positives = 86/175 (49%), Gaps = 22/175 (12%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +Q+ +E+  
Sbjct: 943  PDAEVIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIQRAHENSV 994

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 995  TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1045

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKE 2055
            +  I  G E+TF+YN         E   C CG++ C G +L +  + A     +E
Sbjct: 1046 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGAENCSG-FLGVRPKSACASTAEE 1096


>gi|149634094|ref|XP_001506476.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
            [Ornithorhynchus anatinus]
          Length = 1437

 Score = 63.9 bits (154), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 46/169 (27%), Positives = 84/169 (49%), Gaps = 22/169 (13%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1145 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1196

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1197 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1247

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF 2049
            +  I  G E+TF+YN         E   C CG+  C G +L +  + AF
Sbjct: 1248 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGADNCSG-FLGVRPKTAF 1292


>gi|359067302|ref|XP_002689078.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Bos taurus]
          Length = 1470

 Score = 63.9 bits (154), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 741  EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 785

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 786  PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 842

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 843  TVCKCGAPNCSG 854


>gi|156391978|ref|XP_001635826.1| predicted protein [Nematostella vectensis]
 gi|156222924|gb|EDO43763.1| predicted protein [Nematostella vectensis]
          Length = 348

 Score = 63.9 bits (154), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 44/134 (32%), Positives = 63/134 (47%), Gaps = 18/134 (13%)

Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
            ++ FV+E+ GEV         +D     Q+ +      +Y + L      AD     ++D
Sbjct: 99   QNQFVIEYCGEV------MNYRDFQSRAQRYDRQKRRHYYFMTLR-----ADE----IID 143

Query: 1965 AMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY 2024
            A  K + +  I HSC PNC  +   V+G  +IG +T+R I  GEE+TFDY      K   
Sbjct: 144  ATLKGSISRFINHSCEPNCVTQKWTVNGLLRIGFFTLRTIKAGEELTFDYQLQRYGK--- 200

Query: 2025 EASVCLCGSQVCRG 2038
             A  C C S  CRG
Sbjct: 201  IAQTCYCESPSCRG 214


>gi|313227685|emb|CBY22833.1| unnamed protein product [Oikopleura dioica]
          Length = 1179

 Score = 63.9 bits (154), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 43/131 (32%), Positives = 67/131 (51%), Gaps = 17/131 (12%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            F++E++GE+         +  IR L+++ +     +Y + L+         +L ++DA  
Sbjct: 1016 FIIEYIGEIIS-----HDESRIR-LEESAKIGVTNYYILELD---------NLRMIDAGP 1060

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
            + N A  I HSC PNC      V G  +IGI++ R I  GEE+TF+Y     S E    +
Sbjct: 1061 RGNIARFINHSCDPNCGIDPWIVQGDTRIGIFSKRDIQEGEELTFNYQLQQSSDE--GKT 1118

Query: 2028 VCLCGSQVCRG 2038
             CLCGS+ C G
Sbjct: 1119 KCLCGSKNCAG 1129


>gi|392580378|gb|EIW73505.1| hypothetical protein TREMEDRAFT_24920 [Tremella mesenterica DSM 1558]
          Length = 180

 Score = 63.9 bits (154), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 34/82 (41%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
            +V DA  K + +  I HSC P+  AK+ +++G  +I IY  R +H GEEI +DY    ES
Sbjct: 101  LVCDATFKGSVSRLINHSCDPSASAKIISINGQSKIVIYAKRTLHPGEEILYDYKFPLES 160

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
                    CLCG+  CRG +LN
Sbjct: 161  DPALRVP-CLCGAATCRG-WLN 180


>gi|296485540|tpg|DAA27655.1| TPA: nuclear receptor binding SET domain protein 1 [Bos taurus]
          Length = 1275

 Score = 63.9 bits (154), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 544  EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 588

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 589  PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 645

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 646  TVCKCGAPNCSG 657


>gi|225380776|gb|ACN88689.1| myeloid/lymphoid or mixed-lineage leukemia [Danio rerio]
          Length = 148

 Score = 63.5 bits (153), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 37/84 (44%), Positives = 46/84 (54%), Gaps = 3/84 (3%)

Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            D  VVDA    N A  I HSC PNC ++V  VDG   I I+  R I+ GEE+T+DY    
Sbjct: 68   DYEVVDATIHGNSARFINHSCEPNCYSRVINVDGRKHIVIFATRKIYKGEELTYDYKFPI 127

Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
            E  E      C CG++ CR  +LN
Sbjct: 128  E--EPGNKLPCNCGAKKCR-KFLN 148


>gi|344238567|gb|EGV94670.1| Histone-lysine N-methyltransferase NSD3 [Cricetulus griseus]
          Length = 620

 Score = 63.5 bits (153), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 43/149 (28%), Positives = 75/149 (50%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            RKG G+   +    GE  FV E++GE+       ++++    +++ +E+    FY + + 
Sbjct: 338  RKGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSVTNFYMLTVT 389

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
            + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ +  I  G E
Sbjct: 390  KDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFAICDIPAGME 440

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +TF+YN           +VC CGS  C G
Sbjct: 441  LTFNYNLDCLGNGR---TVCHCGSDNCSG 466


>gi|309780384|ref|ZP_07675135.1| SET domain protein [Ralstonia sp. 5_7_47FAA]
 gi|404394987|ref|ZP_10986790.1| hypothetical protein HMPREF0989_02076 [Ralstonia sp. 5_2_56FAA]
 gi|308921087|gb|EFP66733.1| SET domain protein [Ralstonia sp. 5_7_47FAA]
 gi|348615101|gb|EGY64632.1| hypothetical protein HMPREF0989_02076 [Ralstonia sp. 5_2_56FAA]
          Length = 179

 Score = 63.5 bits (153), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 43/136 (31%), Positives = 69/136 (50%), Gaps = 23/136 (16%)

Query: 1909 VVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHK 1968
            ++E+ GE +  WK         +L+++  DP+   +  Y     G        V+DA + 
Sbjct: 55   IIEYKGE-HITWK--------EALRRHPHDPSDPNHTFYFSLEDGS-------VIDAKYG 98

Query: 1969 ANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE---YE 2025
             N A  I H+C+PNCEA+    DG  ++ I+ +R I  GEE+ +DY  V E ++     E
Sbjct: 99   GNRARWINHACKPNCEAR--EADG--RVFIHALRDIEAGEELFYDYGLVIEGRQTKALKE 154

Query: 2026 ASVCLCGSQVCRGSYL 2041
               C CG++ CRG+ L
Sbjct: 155  QFACRCGAKKCRGTML 170


>gi|355729169|gb|AES09787.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Mustela putorius furo]
          Length = 596

 Score = 63.5 bits (153), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/149 (28%), Positives = 76/149 (51%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            R+G G+   +    GE  FV E++GE+       ++++    +++ +E+    FY + + 
Sbjct: 314  RRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSVTNFYMLTVT 365

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
            + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ +R I  G E
Sbjct: 366  KDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFALRDIPAGME 416

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +TF+YN         E   C CG++ C G
Sbjct: 417  LTFNYNLDCLGNGRTE---CHCGAENCSG 442


>gi|302846429|ref|XP_002954751.1| histone H3 methyltransferase [Volvox carteri f. nagariensis]
 gi|300259934|gb|EFJ44157.1| histone H3 methyltransferase [Volvox carteri f. nagariensis]
          Length = 261

 Score = 63.5 bits (153), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 48/149 (32%), Positives = 73/149 (48%), Gaps = 24/149 (16%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+   ++   G+  F++E++GEV    ++  +++   S+ + +      F NI    
Sbjct: 90   KGFGLFALEDIKAGQ--FIIEYIGEVLEEDEYQRRKEYYMSVGQRHY----YFMNI---- 139

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
              G+ +     V+DA  K N +  I HSC PNCE +   V G   IG++ VR I    E+
Sbjct: 140  --GNGE-----VIDACRKGNISRFINHSCEPNCETQKWLVRGELAIGLFAVRDIPKDTEL 192

Query: 2011 TFDYNSVTESKEEY--EASVCLCGSQVCR 2037
            TFDYN      E Y  +   C CGS  CR
Sbjct: 193  TFDYNF-----ERYGDKPMRCYCGSTNCR 216


>gi|145353759|ref|XP_001421172.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|145357147|ref|XP_001422783.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581408|gb|ABO99465.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144583027|gb|ABP01142.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 503

 Score = 63.5 bits (153), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 43/139 (30%), Positives = 62/139 (44%), Gaps = 27/139 (19%)

Query: 1908 FVVEFLGEVYPVWK-----WFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVV 1962
            F+VE+ GE+    +     W++KQ G                N YL     +       V
Sbjct: 306  FIVEYAGEILDEHECAERLWYDKQSGEE--------------NFYLMEISAN------YV 345

Query: 1963 VDAMHKANYASRICHSCRPNCEAK--VTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
            +DA  K + A  I  SC PNCE +  V A     ++GI+    I  G E+T+DYN     
Sbjct: 346  IDAKFKGSIARFINSSCHPNCETQRWVDASTNETRVGIFATEDIASGTELTYDYNFAHFG 405

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             E+  + VC+CG   CRG+
Sbjct: 406  DEKGTSFVCMCGHPKCRGT 424


>gi|390569511|ref|ZP_10249796.1| nuclear protein SET [Burkholderia terrae BS001]
 gi|389938371|gb|EIN00215.1| nuclear protein SET [Burkholderia terrae BS001]
          Length = 209

 Score = 63.5 bits (153), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/117 (35%), Positives = 61/117 (52%), Gaps = 20/117 (17%)

Query: 1931 SLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAV 1990
            +L+++  +P    +  Y     GD       V+D   K N A  I HSC PNCEA+   +
Sbjct: 92   ALRRHPHNPDEPNHTFYFALDSGD-------VIDGKVKGNSARWINHSCAPNCEAE--EI 142

Query: 1991 DGHYQIGIYTVRGIHYGEEITFDYNSVTES------KEEYEASVCLCGSQVCRGSYL 2041
            DGH  + I  +R I  GEE+ +DY  V ++      K+EYE   C CG++ CRG+ L
Sbjct: 143  DGH--VFIDALRDIGAGEELFYDYGLVIDARQTKKLKKEYE---CRCGARKCRGTML 194


>gi|126303359|ref|XP_001372863.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Monodelphis
            domestica]
          Length = 1435

 Score = 63.5 bits (153), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 46/169 (27%), Positives = 84/169 (49%), Gaps = 22/169 (13%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1143 PDAEVIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSI 1194

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1195 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1245

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF 2049
            +  I  G E+TF+YN         E   C CG+  C G +L +  + AF
Sbjct: 1246 LCDIPAGVELTFNYNLDCLGNGRTE---CHCGADNCSG-FLGVRPKTAF 1290


>gi|324505555|gb|ADY42386.1| Histone-lysine N-methyltransferase Mes-4 [Ascaris suum]
          Length = 743

 Score = 63.5 bits (153), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 48/160 (30%), Positives = 75/160 (46%), Gaps = 21/160 (13%)

Query: 1883 DDKYVAYR----KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNED 1938
            DD+++  R    KG GV   K    G++  + E++G V P  ++FE+ + I +   NN  
Sbjct: 453  DDEWMEERRTTNKGFGVFAKKYIPAGQE--LTEYVGRVMPRDEYFEQLNFIGTF--NN-- 506

Query: 1939 PAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
                     LE         +   VDA +  N +  + HSC PNC+     VDG Y++ +
Sbjct: 507  ---------LEMSYFGMQITNEFYVDARNCGNMSRSVNHSCEPNCKVNAVTVDGVYRLKV 557

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
              ++ I  G+E+T+DY   TE         C CG+  CRG
Sbjct: 558  SALKDIAAGDELTYDYG--TELWSGMVGMRCRCGTAGCRG 595


>gi|16549858|dbj|BAB70868.1| unnamed protein product [Homo sapiens]
          Length = 1059

 Score = 63.5 bits (153), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 929  EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 973

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 974  PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1030

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1031 TVCKCGAPNCSG 1042


>gi|115478464|ref|NP_001062827.1| Os09g0307800 [Oryza sativa Japonica Group]
 gi|51091678|dbj|BAD36461.1| putative SET domain protein 110 [Oryza sativa Japonica Group]
 gi|51091893|dbj|BAD36704.1| putative SET domain protein 110 [Oryza sativa Japonica Group]
 gi|113631060|dbj|BAF24741.1| Os09g0307800 [Oryza sativa Japonica Group]
          Length = 340

 Score = 63.5 bits (153), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 49/153 (32%), Positives = 71/153 (46%), Gaps = 35/153 (22%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYP-------VWKWFEKQDGIRSLQKNNEDPAPEFY 1944
            G GVV  ++   GE  FV+E++GEV         +WK   + D                 
Sbjct: 119  GNGVVAEEDIKKGE--FVIEYVGEVIDDRTCEQRLWKMKRQGDT---------------- 160

Query: 1945 NIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGI 2004
            N YL     +      +V+DA +K N +  I HSC PN E +   V+G  ++GI+ +R I
Sbjct: 161  NFYLCEVSSN------MVIDATNKGNMSRFINHSCEPNTEMQKWTVEGETRVGIFALRDI 214

Query: 2005 HYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
              GEE+T+DY  V    ++     C CGS  CR
Sbjct: 215  KTGEELTYDYKFVQFGADQD----CHCGSSNCR 243


>gi|432099958|gb|ELK28852.1| Histone-lysine N-methyltransferase NSD3 [Myotis davidii]
          Length = 1641

 Score = 63.2 bits (152), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 79/158 (50%), Gaps = 21/158 (13%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1349 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1400

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1401 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1451

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +  I  G E+TF+YN         E   C CG++ C G
Sbjct: 1452 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGAENCSG 1486


>gi|327284319|ref|XP_003226886.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Anolis
            carolinensis]
          Length = 1438

 Score = 63.2 bits (152), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/149 (28%), Positives = 74/149 (49%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            R+G G+   +    GE  FV E++GE+       ++++    +++ +E+    FY + + 
Sbjct: 1155 RRGWGLRTKRNIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSVTNFYMLTVT 1206

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
            + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ V  I  G E
Sbjct: 1207 KDR---------IIDAGPKGNYSRFMNHSCHPNCETQKWTVNGDVRVGLFAVCDIPAGME 1257

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +TF+YN         E   C CG+  C G
Sbjct: 1258 LTFNYNLDCLGNGRTE---CHCGADNCSG 1283


>gi|395507428|ref|XP_003758026.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
            [Sarcophilus harrisii]
          Length = 1437

 Score = 63.2 bits (152), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 46/169 (27%), Positives = 84/169 (49%), Gaps = 22/169 (13%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1145 PDAEVIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSI 1196

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1197 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1247

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF 2049
            +  I  G E+TF+YN         E   C CG+  C G +L +  + AF
Sbjct: 1248 LCDIPAGVELTFNYNLDCLGNGRTE---CHCGADNCSG-FLGVRPKTAF 1292


>gi|194291212|ref|YP_002007119.1| hypothetical protein RALTA_A3139 [Cupriavidus taiwanensis LMG 19424]
 gi|193225047|emb|CAQ71058.1| conserved hypothetical protein [Cupriavidus taiwanensis LMG 19424]
          Length = 171

 Score = 63.2 bits (152), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 45/145 (31%), Positives = 72/145 (49%), Gaps = 29/145 (20%)

Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVV 1962
              E + V+E+ GE +  WK        ++L+++  DP+   +  Y     G        V
Sbjct: 40   IAEGERVIEYKGE-HISWK--------KALERHPHDPSDPNHTFYFSLDDGS-------V 83

Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES-- 2020
            +DA +  N A  I H+C PNCEA+    +   ++ I+ +R I  GEE+ +DY  V ++  
Sbjct: 84   IDAKYGGNRARWINHACEPNCEAR----EKKGRVFIHALRDIAQGEELFYDYGLVIDARY 139

Query: 2021 ----KEEYEASVCLCGSQVCRGSYL 2041
                K+E+E   C CGS  CRG+ L
Sbjct: 140  TAKLKKEFE---CRCGSPQCRGTML 161


>gi|162460550|ref|NP_001105653.1| LOC542662 [Zea mays]
 gi|24021802|gb|AAN41254.1| SET domain protein 110 [Zea mays]
 gi|195652527|gb|ACG45731.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20specific [Zea mays]
          Length = 342

 Score = 63.2 bits (152), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 50/151 (33%), Positives = 69/151 (45%), Gaps = 31/151 (20%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+V   E   GE  FV+E++GEV                    +D   E   ++  + 
Sbjct: 130  GHGLVAEDEIKKGE--FVIEYVGEVI-------------------DDRTCE-NRLWTMKR 167

Query: 1952 KGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
              D D Y       +V+DA +K N +  I HSC PN   +   VDG  ++GI+ +R I  
Sbjct: 168  LDDTDFYLCEVSSNMVIDATNKGNLSRFINHSCEPNTAMQKWTVDGETRVGIFALRDIKI 227

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
            GEE+T+DY  V        A VC CGS  CR
Sbjct: 228  GEELTYDYKFVQFGA----AQVCHCGSSKCR 254


>gi|186477803|ref|YP_001859273.1| nuclear protein SET [Burkholderia phymatum STM815]
 gi|184194262|gb|ACC72227.1| nuclear protein SET [Burkholderia phymatum STM815]
          Length = 160

 Score = 63.2 bits (152), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 48/141 (34%), Positives = 70/141 (49%), Gaps = 29/141 (20%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            D ++E+ GE    WK         +L+++  +P    +  Y     GD       V+D  
Sbjct: 28   DRLIEYKGERIS-WK--------EALRRHPHNPDEPNHTFYFALDSGD-------VIDGK 71

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES------ 2020
             K N A  I HSC PNCEA+   +DGH  + I  +R I  GEE+ +DY  V ++      
Sbjct: 72   VKGNSARWINHSCAPNCEAE--EIDGH--VYIDALRDIEAGEELFYDYGLVIDARQTKKL 127

Query: 2021 KEEYEASVCLCGSQVCRGSYL 2041
            K+EYE   C CG++ CRG+ L
Sbjct: 128  KKEYE---CRCGARKCRGTML 145


>gi|395507430|ref|XP_003758027.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
            [Sarcophilus harrisii]
          Length = 1389

 Score = 63.2 bits (152), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 46/169 (27%), Positives = 84/169 (49%), Gaps = 22/169 (13%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1097 PDAEVIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSI 1148

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1149 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1199

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAF 2049
            +  I  G E+TF+YN         E   C CG+  C G +L +  + AF
Sbjct: 1200 LCDIPAGVELTFNYNLDCLGNGRTE---CHCGADNCSG-FLGVRPKTAF 1244


>gi|291225527|ref|XP_002732754.1| PREDICTED: Ash1l protein-like [Saccoglossus kowalevskii]
          Length = 2643

 Score = 63.2 bits (152), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 52/103 (50%), Gaps = 7/103 (6%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
            +V+D          + HSC PNCE +  +V+G Y+IG++ ++ I  G E+T+DYN    +
Sbjct: 1940 MVIDGYRMGCEGRFVNHSCEPNCEMQKWSVNGVYRIGLFALKDIQPGSELTYDYNFHAFN 1999

Query: 2021 KEEYEASVCLCGSQVCRG-----SYLNLTGEGAFEKVLKELHG 2058
             E  +   C CGS  CRG     S       GAF+K  K   G
Sbjct: 2000 LETQQE--CCCGSDKCRGFIGGKSQAQQRVNGAFKKDKKTASG 2040


>gi|359489946|ref|XP_002268035.2| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Vitis
            vinifera]
          Length = 377

 Score = 63.2 bits (152), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 53/172 (30%), Positives = 79/172 (45%), Gaps = 38/172 (22%)

Query: 1876 KAMDSRPDDKYVAY---RKGLGVVCNKEGGFGEDDFVVEFLGEVYP-------VWKWFEK 1925
            K   SRP  K       + G G+V +++   GE  FV+E++GEV         +WK    
Sbjct: 114  KPFQSRPVKKMKMVETEKCGSGIVADEDIKQGE--FVIEYVGEVIDDKTCEDRLWK---- 167

Query: 1926 QDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEA 1985
               ++ L + N      FY   + R          +V+DA +K N +  I HSC PN E 
Sbjct: 168  ---MKHLGETN------FYLCEINRD---------MVIDATYKGNKSRYINHSCDPNTEM 209

Query: 1986 KVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
            +   +DG  +IGI+  R I  GE +T+DY  V    ++     C CG+  CR
Sbjct: 210  QKWRIDGETRIGIFATRDIKRGEHLTYDYQFVQFGADQD----CHCGAVGCR 257


>gi|118572948|sp|Q6P2L6.2|NSD3_MOUSE RecName: Full=Histone-lysine N-methyltransferase NSD3; AltName:
            Full=Nuclear SET domain-containing protein 3; AltName:
            Full=Wolf-Hirschhorn syndrome candidate 1-like protein 1
            homolog; Short=WHSC1-like protein 1
          Length = 1439

 Score = 63.2 bits (152), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 46/175 (26%), Positives = 86/175 (49%), Gaps = 22/175 (12%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1148 PDAEVIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1199

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1200 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1250

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKE 2055
            +  I  G E+TF+YN           +VC CG+  C G +L +  + A    + E
Sbjct: 1251 LCDIPAGMELTFNYNLDCLGNGR---TVCHCGADNCSG-FLGVRPKSACTSAVDE 1301


>gi|170694289|ref|ZP_02885443.1| nuclear protein SET [Burkholderia graminis C4D1M]
 gi|170140712|gb|EDT08886.1| nuclear protein SET [Burkholderia graminis C4D1M]
          Length = 185

 Score = 63.2 bits (152), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 50/142 (35%), Positives = 71/142 (50%), Gaps = 29/142 (20%)

Query: 1909 VVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHK 1968
            ++E+ GE    WK     + +R    N ++P   FY   L+  K         V+D    
Sbjct: 30   LIEYKGERIS-WK-----EALRRHPHNPDEPNHTFY-FALDSGK---------VIDGKVS 73

Query: 1969 ANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES------KE 2022
             N A  I HSC PNCEA+   +DGH  + ++ +R I  GEEI +DY  V ++      K+
Sbjct: 74   GNSARWINHSCAPNCEAE--EIDGH--VYVHALRDIAEGEEIFYDYGLVIDARQTKKLKK 129

Query: 2023 EYEASVCLCGSQVCRGSYLNLT 2044
            EYE   C CGS+ CRG+ L  T
Sbjct: 130  EYE---CRCGSRKCRGTMLAPT 148


>gi|170574239|ref|XP_001892724.1| SET domain containing protein [Brugia malayi]
 gi|158601534|gb|EDP38427.1| SET domain containing protein [Brugia malayi]
          Length = 222

 Score = 63.2 bits (152), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 33/75 (44%), Positives = 42/75 (56%), Gaps = 2/75 (2%)

Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
            VDA +  N A    HSC PN +     VDG Y++ I T++ I  GEE+TFDY+  TE  E
Sbjct: 66   VDARNYGNIARSFNHSCEPNTKVDAVVVDGIYRLKISTIKDIKKGEELTFDYD--TEIIE 123

Query: 2023 EYEASVCLCGSQVCR 2037
                  C CGS+ CR
Sbjct: 124  GLVGMECFCGSRNCR 138


>gi|124486903|ref|NP_001074738.1| histone-lysine N-methyltransferase NSD3 isoform 2 [Mus musculus]
 gi|189442807|gb|AAI67226.1| Wolf-Hirschhorn syndrome candidate 1-like 1 (human) [synthetic
            construct]
          Length = 1446

 Score = 63.2 bits (152), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 46/175 (26%), Positives = 86/175 (49%), Gaps = 22/175 (12%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1155 PDAEVIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1206

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1207 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1257

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKE 2055
            +  I  G E+TF+YN           +VC CG+  C G +L +  + A    + E
Sbjct: 1258 LCDIPAGMELTFNYNLDCLGNGR---TVCHCGADNCSG-FLGVRPKSACTSAVDE 1308


>gi|398807623|ref|ZP_10566499.1| SET domain-containing protein [Variovorax sp. CF313]
 gi|398089158|gb|EJL79686.1| SET domain-containing protein [Variovorax sp. CF313]
          Length = 206

 Score = 63.2 bits (152), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 48/144 (33%), Positives = 68/144 (47%), Gaps = 27/144 (18%)

Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVV 1962
              E + ++E+ GEV   WK         +L+++  DPA   +  Y     G        V
Sbjct: 31   LAEGETLIEYKGEVIS-WK--------EALRRHPHDPAQPNHTFYFHIDDGR-------V 74

Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
            +D   K N A  I HSC PNCEA    VDG  ++ I  +R I  GEE+ +DY  + +  E
Sbjct: 75   IDGNVKGNDARWINHSCEPNCEAD--EVDG--RVYIKALRNISAGEELNYDYGLIID--E 128

Query: 2023 EYEASV-----CLCGSQVCRGSYL 2041
             Y   +     C CGS+ CRG+ L
Sbjct: 129  PYTPKLLSEFPCWCGSEQCRGTLL 152


>gi|148700883|gb|EDL32830.1| mCG14519 [Mus musculus]
          Length = 1381

 Score = 63.2 bits (152), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 46/175 (26%), Positives = 86/175 (49%), Gaps = 22/175 (12%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1090 PDAEVIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1141

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1142 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1192

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKE 2055
            +  I  G E+TF+YN           +VC CG+  C G +L +  + A    + E
Sbjct: 1193 LCDIPAGMELTFNYNLDCLGNGR---TVCHCGADNCSG-FLGVRPKSACTSAVDE 1243


>gi|350593412|ref|XP_003483678.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            NSD3-like [Sus scrofa]
          Length = 1438

 Score = 63.2 bits (152), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 46/175 (26%), Positives = 86/175 (49%), Gaps = 22/175 (12%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1146 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1197

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1198 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKE 2055
            +  I  G E+TF+YN         E   C CG++ C G +L +  + A     +E
Sbjct: 1249 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGAENCSG-FLGVRPKSACASTTEE 1299


>gi|297737225|emb|CBI26426.3| unnamed protein product [Vitis vinifera]
          Length = 438

 Score = 63.2 bits (152), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 53/172 (30%), Positives = 79/172 (45%), Gaps = 38/172 (22%)

Query: 1876 KAMDSRPDDKYVAY---RKGLGVVCNKEGGFGEDDFVVEFLGEVYP-------VWKWFEK 1925
            K   SRP  K       + G G+V +++   GE  FV+E++GEV         +WK    
Sbjct: 191  KPFQSRPVKKMKMVETEKCGSGIVADEDIKQGE--FVIEYVGEVIDDKTCEDRLWK---- 244

Query: 1926 QDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEA 1985
               ++ L + N      FY   + R          +V+DA +K N +  I HSC PN E 
Sbjct: 245  ---MKHLGETN------FYLCEINRD---------MVIDATYKGNKSRYINHSCDPNTEM 286

Query: 1986 KVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
            +   +DG  +IGI+  R I  GE +T+DY  V    ++     C CG+  CR
Sbjct: 287  QKWRIDGETRIGIFATRDIKRGEHLTYDYQFVQFGADQD----CHCGAVGCR 334


>gi|296082099|emb|CBI21104.3| unnamed protein product [Vitis vinifera]
          Length = 1111

 Score = 62.8 bits (151), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 44/135 (32%), Positives = 67/135 (49%), Gaps = 14/135 (10%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
             VVE++GE+  +    +++   +S +K     A  F+ I  E            ++DA  
Sbjct: 991  MVVEYVGEIVGLRVADKRESDYQSGRKLQYKTACYFFRIDKEH-----------IIDATR 1039

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
            K   A  + HSC PNC AKV +V    ++  +  R I+ GEEIT+DY+   E  +E +  
Sbjct: 1040 KGGIARFVNHSCLPNCVAKVISVRNEKKVVFFAERDINPGEEITYDYHFNHE--DEGKKI 1097

Query: 2028 VCLCGSQVCRGSYLN 2042
             C C S+ CR  YLN
Sbjct: 1098 PCFCNSRNCR-RYLN 1111


>gi|313226807|emb|CBY21952.1| unnamed protein product [Oikopleura dioica]
          Length = 216

 Score = 62.8 bits (151), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 45/133 (33%), Positives = 66/133 (49%), Gaps = 23/133 (17%)

Query: 1908 FVVEFLGEVYPVWKWFEKQ--DGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDA 1965
            F++E+LGEV    K F+K+  +  RS ++++       Y + L R            +DA
Sbjct: 75   FIIEYLGEVVSA-KEFKKRSHEYARSGKQHH-------YFMELSRQ---------ATIDA 117

Query: 1966 MHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYE 2025
             HK   +  I HSC PN E +   V+G  +IG + +R I   EEITFDY  +       +
Sbjct: 118  YHKGAISRFINHSCEPNSETQKWTVNGLLRIGFFAIRDIQPEEEITFDYQFIHFG----Q 173

Query: 2026 ASVCLCGSQVCRG 2038
               CLCG+  CRG
Sbjct: 174  GQKCLCGAPSCRG 186


>gi|218201888|gb|EEC84315.1| hypothetical protein OsI_30811 [Oryza sativa Indica Group]
          Length = 360

 Score = 62.8 bits (151), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 49/153 (32%), Positives = 71/153 (46%), Gaps = 35/153 (22%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYP-------VWKWFEKQDGIRSLQKNNEDPAPEFY 1944
            G GVV  ++   GE  FV+E++GEV         +WK   + D                 
Sbjct: 119  GNGVVAEEDIKKGE--FVIEYVGEVIDDRTCEQRLWKMKRQGDT---------------- 160

Query: 1945 NIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGI 2004
            N YL     +      +V+DA +K N +  I HSC PN E +   V+G  ++GI+ +R I
Sbjct: 161  NFYLCEVSSN------MVIDATNKGNMSRFINHSCEPNTEMQKWTVEGETRVGIFALRDI 214

Query: 2005 HYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
              GEE+T+DY  V    ++     C CGS  CR
Sbjct: 215  KTGEELTYDYKFVQFGADQD----CHCGSSNCR 243


>gi|47225089|emb|CAF97504.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 352

 Score = 62.8 bits (151), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/84 (44%), Positives = 45/84 (53%), Gaps = 3/84 (3%)

Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            D  VVDA    N A  I HSC PNC ++V  VDG   I I+  R I+ GEE+T+DY    
Sbjct: 272  DYEVVDATVHGNAARFINHSCEPNCYSRVITVDGKKHIVIFASRRIYQGEELTYDYKFPI 331

Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
            E  E      C C S+ CR  +LN
Sbjct: 332  E--EASSKLPCNCNSKKCR-KFLN 352


>gi|291409090|ref|XP_002720827.1| PREDICTED: WHSC1L1 protein [Oryctolagus cuniculus]
          Length = 1435

 Score = 62.8 bits (151), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 46/175 (26%), Positives = 86/175 (49%), Gaps = 22/175 (12%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1143 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1194

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1195 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1245

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKE 2055
            +  I  G E+TF+YN         E   C CG++ C G +L +  + A     +E
Sbjct: 1246 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGAENCSG-FLGVRPKSACASTTEE 1296


>gi|298706866|emb|CBJ25830.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 810

 Score = 62.8 bits (151), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 65/132 (49%), Gaps = 16/132 (12%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
             + E++GEV        +    R L+ N+     EFY + L +          + +DA  
Sbjct: 563  LIGEYVGEVIDEAMVEHRMAEQRRLRPNDG----EFYIMELGQS---------LFIDAKE 609

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
            K N    I HSC PNC+ +   + G+ ++GIY  + +  GE +++DY   T  K  ++  
Sbjct: 610  KGNLMRLINHSCNPNCDVQAWNIAGYTRLGIYAKKDLAKGESLSYDYKFSTNEKARFK-- 667

Query: 2028 VCLCGSQVCRGS 2039
             C+CG++ CRG+
Sbjct: 668  -CMCGAENCRGT 678


>gi|414884958|tpg|DAA60972.1| TPA: putative histone-lysine N-methyltransferase family protein
            isoform 1 [Zea mays]
 gi|414884959|tpg|DAA60973.1| TPA: putative histone-lysine N-methyltransferase family protein
            isoform 2 [Zea mays]
          Length = 337

 Score = 62.8 bits (151), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 33/77 (42%), Positives = 44/77 (57%), Gaps = 4/77 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
            +V+DA +K N +  I HSC PN   +   VDG  ++GI+ +R I  GEE+T+DY  V   
Sbjct: 182  MVIDATNKGNLSRFINHSCEPNTAMQKWTVDGETRVGIFALRDIKIGEELTYDYKFVQFG 241

Query: 2021 KEEYEASVCLCGSQVCR 2037
                 A VC CGS  CR
Sbjct: 242  A----AQVCHCGSSKCR 254


>gi|222641285|gb|EEE69417.1| hypothetical protein OsJ_28789 [Oryza sativa Japonica Group]
          Length = 360

 Score = 62.8 bits (151), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 49/153 (32%), Positives = 71/153 (46%), Gaps = 35/153 (22%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYP-------VWKWFEKQDGIRSLQKNNEDPAPEFY 1944
            G GVV  ++   GE  FV+E++GEV         +WK   + D                 
Sbjct: 119  GNGVVAEEDIKKGE--FVIEYVGEVIDDRTCEQRLWKMKRQGDT---------------- 160

Query: 1945 NIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGI 2004
            N YL     +      +V+DA +K N +  I HSC PN E +   V+G  ++GI+ +R I
Sbjct: 161  NFYLCEVSSN------MVIDATNKGNMSRFINHSCEPNTEMQKWTVEGETRVGIFALRDI 214

Query: 2005 HYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
              GEE+T+DY  V    ++     C CGS  CR
Sbjct: 215  KTGEELTYDYKFVQFGADQD----CHCGSSNCR 243


>gi|91785767|ref|YP_560973.1| hypothetical protein Bxe_A0006 [Burkholderia xenovorans LB400]
 gi|91689721|gb|ABE32921.1| conserved hypothetical protein [Burkholderia xenovorans LB400]
          Length = 174

 Score = 62.8 bits (151), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 49/155 (31%), Positives = 76/155 (49%), Gaps = 23/155 (14%)

Query: 1895 VVCNKEGGFGEDDFVVEFL--GEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPK 1952
            +   + G  G+  F VE +  GE    +K  E+     +L+++  +PA   +  Y     
Sbjct: 6    IAVRRSGVHGKGVFAVEPIAAGERLIEYKG-ERISWKEALRRHPHNPAEPNHTFYFALDS 64

Query: 1953 GDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITF 2012
            G        V+D     N A  I HSC PNCEA+   +DGH  + ++ +R I  GEE+ +
Sbjct: 65   GK-------VIDGKVNGNSARWINHSCAPNCEAE--EIDGH--VYVHALRDIAEGEEVFY 113

Query: 2013 DYNSVTES------KEEYEASVCLCGSQVCRGSYL 2041
            DY  V ++      K+EYE   C CG++ CRG+ L
Sbjct: 114  DYGLVIDARQTNKLKKEYE---CRCGARKCRGTML 145


>gi|357498513|ref|XP_003619545.1| Histone-lysine N-methyltransferase ASHH3 [Medicago truncatula]
 gi|355494560|gb|AES75763.1| Histone-lysine N-methyltransferase ASHH3 [Medicago truncatula]
          Length = 348

 Score = 62.8 bits (151), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 50/165 (30%), Positives = 76/165 (46%), Gaps = 24/165 (14%)

Query: 1876 KAMDSRPDDKYVAYRK---GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSL 1932
            KA   RP  K    +    G G+V +++   GE  FV+E++GEV       + +   + L
Sbjct: 103  KAFQHRPVKKMKLVKTEKCGSGIVADEDIKLGE--FVIEYVGEV------IDDKTCEQRL 154

Query: 1933 QKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDG 1992
                +     FY   + R          +V+DA +K N +  I HSC PN E +   +DG
Sbjct: 155  WNMKDRGETNFYLCEINRD---------MVIDATNKGNKSRYINHSCCPNTEMQKWIIDG 205

Query: 1993 HYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
              +IGI+  R I  GE +T+DY  V    ++     C CG+  CR
Sbjct: 206  ETRIGIFASRDIKKGEHLTYDYQFVQFGADQD----CHCGAVQCR 246


>gi|71029610|ref|XP_764448.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68351402|gb|EAN32165.1| hypothetical protein TP04_0811 [Theileria parva]
          Length = 995

 Score = 62.8 bits (151), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 46/148 (31%), Positives = 72/148 (48%), Gaps = 14/148 (9%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG+G V  +E   GE + V E++GEV      F++     S  + ++     +Y + + R
Sbjct: 716  KGVGAVATEE--IGEGELVCEYVGEVISQAD-FQRCLASASFAEIDDGNQSHWYVMKIHR 772

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
                 D Y    +D+ H  N A  I HSC PNC +    V G Y++G++ +R I   EE+
Sbjct: 773  -----DTY----IDSTHLGNVARFINHSCDPNCASVPINVKGTYRMGVFALRKIKQDEEV 823

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            T++Y     SK       C C ++ CRG
Sbjct: 824  TYNYGFT--SKGVGGGFRCRCRAKNCRG 849


>gi|157821603|ref|NP_001099560.1| histone-lysine N-methyltransferase NSD3 [Rattus norvegicus]
 gi|149057818|gb|EDM09061.1| Wolf-Hirschhorn syndrome candidate 1-like 1 (predicted) [Rattus
            norvegicus]
          Length = 1396

 Score = 62.8 bits (151), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 79/158 (50%), Gaps = 21/158 (13%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1105 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1156

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1157 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1207

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +  I  G E+TF+YN           +VC CG+  C G
Sbjct: 1208 LCDIPAGMELTFNYNLDCLGNGR---TVCHCGADNCSG 1242


>gi|195574451|ref|XP_002105202.1| GD18047 [Drosophila simulans]
 gi|194201129|gb|EDX14705.1| GD18047 [Drosophila simulans]
          Length = 567

 Score = 62.8 bits (151), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 43/151 (28%), Positives = 75/151 (49%), Gaps = 25/151 (16%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G+V N+E      DFV+E++GEV          +  R +++   D    +Y + +E+
Sbjct: 384  RGFGLV-NREP-IAAGDFVIEYVGEV------INHAEFQRRMEQKQRDRDENYYFLGVEK 435

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
                       ++DA  K N A  + HSC PNCE +   V+  +++GI+ ++ I    E+
Sbjct: 436  D---------FIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIKDIPVNTEL 486

Query: 2011 TFDY---NSVTESKEEYEASVCLCGSQVCRG 2038
            TF+Y   + +  SK+      C CG++ C G
Sbjct: 487  TFNYLWDDLMNNSKK-----ACFCGAKRCSG 512


>gi|417406466|gb|JAA49891.1| Putative histone-lysine n-methyltransferase nsd3-like isoform 3
            [Desmodus rotundus]
          Length = 1438

 Score = 62.8 bits (151), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 79/158 (50%), Gaps = 21/158 (13%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1146 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1197

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1198 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +  I  G E+TF+YN         E   C CG++ C G
Sbjct: 1249 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGAENCSG 1283


>gi|226493201|ref|NP_001149253.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20specific [Zea mays]
 gi|194704072|gb|ACF86120.1| unknown [Zea mays]
 gi|195625808|gb|ACG34734.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20specific [Zea mays]
 gi|238014446|gb|ACR38258.1| unknown [Zea mays]
 gi|414589294|tpg|DAA39865.1| TPA: putative histone-lysine N-methyltransferase family protein
            isoform 1 [Zea mays]
 gi|414589295|tpg|DAA39866.1| TPA: putative histone-lysine N-methyltransferase family protein
            isoform 2 [Zea mays]
          Length = 339

 Score = 62.8 bits (151), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 50/151 (33%), Positives = 69/151 (45%), Gaps = 31/151 (20%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+V   E   GE  FV+E++GEV                    +D   E   ++  + 
Sbjct: 127  GHGLVAEDEIKKGE--FVIEYVGEVI-------------------DDRTCE-NRLWTMKR 164

Query: 1952 KGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
              D D Y       +V+DA +K N +  I HSC PN   +   VDG  ++GI+ +R I  
Sbjct: 165  LLDTDFYLCEVSSNMVIDATNKGNRSRFINHSCEPNTAMQKWTVDGETRVGIFALRDIKI 224

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
            GEE+T+DY  V        A VC CGS  CR
Sbjct: 225  GEELTYDYKFVQFGA----AQVCHCGSSNCR 251


>gi|356559949|ref|XP_003548258.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Glycine
            max]
          Length = 349

 Score = 62.4 bits (150), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 71/146 (48%), Gaps = 21/146 (14%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+V +++   GE  FV+E++GEV       E+   ++   + N      FY   + R 
Sbjct: 126  GSGIVADEDIKLGE--FVIEYVGEVIDDKTCEERLWNMKHSGETN------FYLCEINRD 177

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
                     +V+DA +K N +  I HSC PN E +   +DG  +IGI+  R I  GE +T
Sbjct: 178  ---------MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIQKGEHLT 228

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCR 2037
            +DY  V    ++     C CG+  CR
Sbjct: 229  YDYQFVQFGADQD----CHCGAAECR 250


>gi|319796552|ref|YP_004158192.1| nuclear protein set [Variovorax paradoxus EPS]
 gi|315599015|gb|ADU40081.1| nuclear protein SET [Variovorax paradoxus EPS]
          Length = 205

 Score = 62.4 bits (150), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 51/151 (33%), Positives = 70/151 (46%), Gaps = 32/151 (21%)

Query: 1901 GGFGEDDF-----VVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDA 1955
            G F  DD      ++E+ GEV   WK         +L+++  DPA   +  Y     G  
Sbjct: 24   GVFAVDDLAEGETLIEYKGEVIN-WK--------EALRRHPHDPAQPNHTFYFHIDDGR- 73

Query: 1956 DGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYN 2015
                  V+D   K N A  I HSC PNCEA    VDG  ++ I  +R I  GEE+ +DY 
Sbjct: 74   ------VIDGNVKGNDARWINHSCEPNCEAD--EVDG--RVYIKALRNIAAGEELNYDYG 123

Query: 2016 SVTESKEEYEASV-----CLCGSQVCRGSYL 2041
             + +  E Y   +     C CGS+ CRG+ L
Sbjct: 124  LIID--EPYTPKLLSEFPCWCGSEQCRGTLL 152


>gi|385207702|ref|ZP_10034570.1| SET domain-containing protein [Burkholderia sp. Ch1-1]
 gi|385180040|gb|EIF29316.1| SET domain-containing protein [Burkholderia sp. Ch1-1]
          Length = 174

 Score = 62.4 bits (150), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 40/117 (34%), Positives = 61/117 (52%), Gaps = 20/117 (17%)

Query: 1931 SLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAV 1990
            +L+++  +PA   +  Y     G        V+D     N A  I HSC PNCEA+   +
Sbjct: 43   ALRRHPHNPAEPNHTFYFALDSGK-------VIDGKVNGNSARWINHSCAPNCEAE--EI 93

Query: 1991 DGHYQIGIYTVRGIHYGEEITFDYNSVTES------KEEYEASVCLCGSQVCRGSYL 2041
            DGH  + ++ +R I  GEE+ +DY  V ++      K+EYE   C CG++ CRG+ L
Sbjct: 94   DGH--VYVHALRDIAEGEEVFYDYGLVIDARQTKKLKKEYE---CRCGARKCRGTML 145


>gi|397521373|ref|XP_003830771.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Pan
            paniscus]
          Length = 1437

 Score = 62.4 bits (150), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 78/158 (49%), Gaps = 21/158 (13%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1145 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1196

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1197 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1247

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +  I  G E+TF+YN         E   C CG+  C G
Sbjct: 1248 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGADNCSG 1282


>gi|410307858|gb|JAA32529.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
          Length = 1437

 Score = 62.4 bits (150), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 78/158 (49%), Gaps = 21/158 (13%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1145 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1196

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1197 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1247

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +  I  G E+TF+YN         E   C CG+  C G
Sbjct: 1248 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGADNCSG 1282


>gi|402878017|ref|XP_003902703.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Papio anubis]
          Length = 1438

 Score = 62.4 bits (150), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 46/175 (26%), Positives = 85/175 (48%), Gaps = 22/175 (12%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1146 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1197

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1198 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKE 2055
            +  I  G E+TF+YN         E   C CG+  C G +L +  + A     +E
Sbjct: 1249 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGADNCSG-FLGVRPKSACASTTEE 1299


>gi|420253143|ref|ZP_14756206.1| SET domain-containing protein [Burkholderia sp. BT03]
 gi|398052652|gb|EJL44901.1| SET domain-containing protein [Burkholderia sp. BT03]
          Length = 160

 Score = 62.4 bits (150), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 42/117 (35%), Positives = 61/117 (52%), Gaps = 20/117 (17%)

Query: 1931 SLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAV 1990
            +L+++  +P    +  Y     GD       V+D   K N A  I HSC PNCEA+   +
Sbjct: 43   ALRRHPHNPDEPNHTFYFALDSGD-------VIDGKVKGNSARWINHSCAPNCEAE--EI 93

Query: 1991 DGHYQIGIYTVRGIHYGEEITFDYNSVTES------KEEYEASVCLCGSQVCRGSYL 2041
            DGH  + I  +R I  GEE+ +DY  V ++      K+EYE   C CG++ CRG+ L
Sbjct: 94   DGH--VFIDALRDIGAGEELFYDYGLVIDARQTKKLKKEYE---CRCGARKCRGTML 145


>gi|350855153|emb|CCD58126.1| huntingtin interacting protein-related [Schistosoma mansoni]
          Length = 887

 Score = 62.4 bits (150), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 54/203 (26%), Positives = 86/203 (42%), Gaps = 33/203 (16%)

Query: 1886 YVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN 1945
            Y    KG G++       G   FV+E++GEV    ++  +      L             
Sbjct: 286  YAGKDKGWGLMATDNVKKGS--FVIEYVGEVIDFSEFRRRIRRYERL------------- 330

Query: 1946 IYLERPKGDADGYDLVV-----VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
                   G A  Y + V     +DA  K N+A  + HSC PNC  +  +V+G  +IG + 
Sbjct: 331  -------GHAHHYFMAVESDRFIDAGSKGNWARFVNHSCEPNCVTQKWSVNGEIRIGFFA 383

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLL 2060
               I  G+E+T DY  V     E +   C CG+  C G  +  T +   EKV  +   ++
Sbjct: 384  KEDIPSGQEVTIDYQFVQYGVSEQK---CYCGASTCSG-IMGATSKYLQEKVRMKDTTMV 439

Query: 2061 DRHQLMLEACELNSVSEEDYLEL 2083
            +R   +L+  +L+S    D + L
Sbjct: 440  ERR--ILQLLQLDSFRNADDITL 460


>gi|414590165|tpg|DAA40736.1| TPA: putative trithorax-like family protein [Zea mays]
          Length = 1566

 Score = 62.0 bits (149), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 42/135 (31%), Positives = 67/135 (49%), Gaps = 14/135 (10%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
             VVE++GE+        ++   +S ++     A  F+ I  E            ++DA  
Sbjct: 1446 MVVEYVGEIVGQRVADRREIEYQSGKRQQYKSACYFFKIDREH-----------IIDATR 1494

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
            K   A  + HSC+PNC AK+ +V    ++  +  R I+ GEEIT+DY+   E  +E +  
Sbjct: 1495 KGGIARFVNHSCQPNCVAKIISVRNEKKVMFFAERHINPGEEITYDYHFNRE--DEGQRI 1552

Query: 2028 VCLCGSQVCRGSYLN 2042
            +C C S+ CR  YLN
Sbjct: 1553 LCFCRSRYCR-RYLN 1566


>gi|397521377|ref|XP_003830773.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 3 [Pan
            paniscus]
          Length = 1388

 Score = 62.0 bits (149), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 78/158 (49%), Gaps = 21/158 (13%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1096 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1147

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1148 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1198

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +  I  G E+TF+YN         E   C CG+  C G
Sbjct: 1199 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGADNCSG 1233


>gi|395847337|ref|XP_003796335.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
            [Otolemur garnettii]
          Length = 1389

 Score = 62.0 bits (149), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 78/158 (49%), Gaps = 21/158 (13%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1097 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1148

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1149 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1199

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +  I  G E+TF+YN         E   C CG+  C G
Sbjct: 1200 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGADNCSG 1234


>gi|426256406|ref|XP_004021831.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2 [Ovis
            aries]
          Length = 1439

 Score = 62.0 bits (149), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 46/175 (26%), Positives = 85/175 (48%), Gaps = 22/175 (12%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1147 PDAEVIRTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1198

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1199 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1249

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKE 2055
            +  I  G E+TF+YN         E   C CG+  C G +L +  + A     +E
Sbjct: 1250 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGADNCSG-FLGVRPKSACASTAEE 1300


>gi|426359420|ref|XP_004046973.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Gorilla gorilla
            gorilla]
          Length = 1397

 Score = 62.0 bits (149), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 78/158 (49%), Gaps = 21/158 (13%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1105 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1156

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1157 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1207

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +  I  G E+TF+YN         E   C CG+  C G
Sbjct: 1208 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGADNCSG 1242


>gi|440907576|gb|ELR57709.1| Histone-lysine N-methyltransferase NSD3 [Bos grunniens mutus]
          Length = 1446

 Score = 62.0 bits (149), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 46/175 (26%), Positives = 85/175 (48%), Gaps = 22/175 (12%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1156 PDAEVIRTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1207

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1208 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1258

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKE 2055
            +  I  G E+TF+YN         E   C CG+  C G +L +  + A     +E
Sbjct: 1259 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGADNCSG-FLGVRPKSACASTAEE 1309


>gi|395847335|ref|XP_003796334.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
            [Otolemur garnettii]
          Length = 1438

 Score = 62.0 bits (149), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 78/158 (49%), Gaps = 21/158 (13%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1146 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1197

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1198 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +  I  G E+TF+YN         E   C CG+  C G
Sbjct: 1249 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGADNCSG 1283


>gi|255078218|ref|XP_002502689.1| set domain protein [Micromonas sp. RCC299]
 gi|226517954|gb|ACO63947.1| set domain protein [Micromonas sp. RCC299]
          Length = 1065

 Score = 62.0 bits (149), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 44/150 (29%), Positives = 67/150 (44%), Gaps = 22/150 (14%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAP-EFYNIYL 1948
            RKG G+   +     +  F++E++GEV         +D  RS +   +D     +Y + L
Sbjct: 181  RKGHGLFTKQA--LKKGQFIIEYIGEVL-------HEDEYRSRKARYDDEGRRHYYFMTL 231

Query: 1949 ERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGE 2008
               +          +DA  + N    + HSC PNCE +   V+G   IGIY +  I  G+
Sbjct: 232  SSSE---------TIDAAERGNAGRFLNHSCDPNCETQKWMVNGELCIGIYALTDIDAGD 282

Query: 2009 EITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            E+TFDYN         +   C CG+  C G
Sbjct: 283  ELTFDYNFERYGDNPIK---CFCGTSRCGG 309


>gi|414590164|tpg|DAA40735.1| TPA: putative trithorax-like family protein [Zea mays]
          Length = 1591

 Score = 62.0 bits (149), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 42/135 (31%), Positives = 67/135 (49%), Gaps = 14/135 (10%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
             VVE++GE+        ++   +S ++     A  F+ I  E            ++DA  
Sbjct: 1471 MVVEYVGEIVGQRVADRREIEYQSGKRQQYKSACYFFKIDREH-----------IIDATR 1519

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
            K   A  + HSC+PNC AK+ +V    ++  +  R I+ GEEIT+DY+   E  +E +  
Sbjct: 1520 KGGIARFVNHSCQPNCVAKIISVRNEKKVMFFAERHINPGEEITYDYHFNRE--DEGQRI 1577

Query: 2028 VCLCGSQVCRGSYLN 2042
            +C C S+ CR  YLN
Sbjct: 1578 LCFCRSRYCR-RYLN 1591


>gi|315364634|pdb|3OOI|A Chain A, Crystal Structure Of Human Histone-Lysine
            N-Methyltransferase Nsd1 Set Domain In Complex With
            S-Adenosyl-L-Methionine
          Length = 232

 Score = 62.0 bits (149), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 116  EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 160

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 161  PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 217

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 218  TVCKCGAPNCSG 229


>gi|386331753|ref|YP_006027922.1| set domain protein [Ralstonia solanacearum Po82]
 gi|334194201|gb|AEG67386.1| set domain protein [Ralstonia solanacearum Po82]
          Length = 188

 Score = 62.0 bits (149), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 42/136 (30%), Positives = 68/136 (50%), Gaps = 23/136 (16%)

Query: 1909 VVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHK 1968
            ++E+ GE +  WK         +L+++  DP+   +  Y     G        V+DA + 
Sbjct: 64   IIEYKGE-HISWK--------EALRRHPHDPSDPNHTFYFSLEDGS-------VIDAKYG 107

Query: 1969 ANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS- 2027
             N A  I H+C+PNCEA+    DG  ++ I+ +R I  GEE+ +DY  V E ++      
Sbjct: 108  GNRARWINHACKPNCEAR--EKDG--RVFIHALRDIEAGEELFYDYGLVIEGRQTKALKA 163

Query: 2028 --VCLCGSQVCRGSYL 2041
               C CG++ CRG+ L
Sbjct: 164  QFACHCGAKTCRGTML 179


>gi|355708046|gb|AES03147.1| nuclear receptor binding SET domain protein 1 [Mustela putorius furo]
          Length = 261

 Score = 62.0 bits (149), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 142  EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 186

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 187  PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 243

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 244  TVCKCGAPNCSG 255


>gi|431902251|gb|ELK08752.1| Histone-lysine N-methyltransferase NSD3 [Pteropus alecto]
          Length = 1322

 Score = 62.0 bits (149), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 78/158 (49%), Gaps = 21/158 (13%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1105 PDAEIIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1156

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1157 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1207

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +  I  G E+TF+YN         E   C CG+  C G
Sbjct: 1208 LCDIPAGMELTFNYNLDCLGNGRTE---CHCGADNCSG 1242


>gi|239818159|ref|YP_002947069.1| histone-lysine N-methyltransferase [Variovorax paradoxus S110]
 gi|239804736|gb|ACS21803.1| Histone-lysine N-methyltransferase [Variovorax paradoxus S110]
          Length = 210

 Score = 61.6 bits (148), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 47/144 (32%), Positives = 68/144 (47%), Gaps = 27/144 (18%)

Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVV 1962
              E + ++E+ GEV   WK         +L+++  DPA   +  Y     G        V
Sbjct: 31   LAEGETLIEYKGEVIS-WK--------EALRRHPHDPAQPNHTFYFHIDDGR-------V 74

Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
            +D   K N A  I HSC PNCEA    +DG  ++ I  +R I  GEE+ +DY  + +  E
Sbjct: 75   IDGNVKGNDARWINHSCEPNCEAD--EIDG--RVYIKALRNIAAGEELNYDYGLIID--E 128

Query: 2023 EYEASV-----CLCGSQVCRGSYL 2041
             Y   +     C CGS+ CRG+ L
Sbjct: 129  PYTPKLLSEFPCWCGSENCRGTLL 152


>gi|302795285|ref|XP_002979406.1| hypothetical protein SELMODRAFT_110353 [Selaginella moellendorffii]
 gi|300153174|gb|EFJ19814.1| hypothetical protein SELMODRAFT_110353 [Selaginella moellendorffii]
          Length = 274

 Score = 61.6 bits (148), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 42/132 (31%), Positives = 64/132 (48%), Gaps = 19/132 (14%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            DF++E++GEV       E+   ++   +NN      FY   +   K         V+DA 
Sbjct: 109  DFLIEYIGEVIDDKTCEERLWDLKERGENN------FYLCEVGHDK---------VIDAT 153

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K N +  I HSC PN + +    DG  +IG++ V  I  G+EIT+DY  +    E+   
Sbjct: 154  FKGNMSRFINHSCDPNAQLRKWQCDGELRIGVFAVSRILKGQEITYDYKYIQFGTEQQ-- 211

Query: 2027 SVCLCGSQVCRG 2038
              C CGS+ C+G
Sbjct: 212  --CHCGSKNCKG 221


>gi|449665927|ref|XP_002164851.2| PREDICTED: histone-lysine N-methyltransferase NSD2-like [Hydra
            magnipapillata]
          Length = 1214

 Score = 61.6 bits (148), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 40/150 (26%), Positives = 78/150 (52%), Gaps = 24/150 (16%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G++ + +   GE  FV+E++GE+       +++   R +++ +E    ++Y + +++
Sbjct: 860  RGWGLMADTDIKQGE--FVIEYVGEL------IDEETCHRRVREYHEKDIFDYYFLTIDK 911

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
                       ++DA  K N +  + HSC PNCE +   V+G  ++ ++  R I  GEE+
Sbjct: 912  DN---------IIDAYPKGNMSRFMNHSCNPNCETQKWTVNGEIRVALFATRDIKMGEEL 962

Query: 2011 TFDYN--SVTESKEEYEASVCLCGSQVCRG 2038
             F+YN  S+   K++     C CG+  C G
Sbjct: 963  CFNYNLDSLGNDKKQ-----CKCGAVNCSG 987


>gi|402588522|gb|EJW82455.1| SET domain-containing protein [Wuchereria bancrofti]
          Length = 626

 Score = 61.6 bits (148), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 33/75 (44%), Positives = 42/75 (56%), Gaps = 2/75 (2%)

Query: 1963 VDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKE 2022
            VDA +  N A    HSC PN +     VDG Y++ I T++ I  GEE+TFDY+  TE  E
Sbjct: 495  VDARNYGNIARSFNHSCEPNTKVDAVVVDGIYRLKISTIKDIKKGEELTFDYD--TEIIE 552

Query: 2023 EYEASVCLCGSQVCR 2037
                  C CGS+ CR
Sbjct: 553  GLVGMECFCGSKNCR 567


>gi|405966105|gb|EKC31425.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Crassostrea gigas]
          Length = 1079

 Score = 61.6 bits (148), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 57/251 (22%), Positives = 111/251 (44%), Gaps = 33/251 (13%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            FV E++GE+       ++++  R + +++E+    +Y + L++ +         V+DA  
Sbjct: 736  FVHEYVGEL------IDEEEVKRRIDESHENNISNYYMLTLDKNR---------VIDAGP 780

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
            K N +  + HSC PNCE +    +G  ++G++ +  I  G E+TF+YN      ++   +
Sbjct: 781  KGNLSRFMNHSCAPNCETQKWTANGDVRVGLFAIYDIPAGTELTFNYNLECLGNDK---T 837

Query: 2028 VCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEED--YLELGR 2085
             C CG+++C G +L +  + A    + +     ++ +      +++   E D      G 
Sbjct: 838  KCNCGAELCSG-FLGVRPKSAVAASVAKGKKKDEKKKRKRNKKKIDGKKEHDDECFRCGE 896

Query: 2086 AG-LGSCLLGGLPNWVVAYSARLVRFINLERTKLPE---EILRHNLEEKRKYFSDICLEV 2141
             G L  C  GG P        ++     L+ +K P    +   H+ +E  K    +C E 
Sbjct: 897  GGELVMCDRGGCP--------KVYHLHCLKLSKPPHGKWDCPWHHCDECGKPAITMCTEC 948

Query: 2142 EKSDAEVQAEG 2152
              S      EG
Sbjct: 949  PNSFCATHTEG 959


>gi|295678165|ref|YP_003606689.1| nuclear protein SET [Burkholderia sp. CCGE1002]
 gi|295438008|gb|ADG17178.1| nuclear protein SET [Burkholderia sp. CCGE1002]
          Length = 177

 Score = 61.6 bits (148), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 45/139 (32%), Positives = 69/139 (49%), Gaps = 29/139 (20%)

Query: 1909 VVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHK 1968
            ++E+ GE    WK         +L+++  +PA   +  Y     G        V+D    
Sbjct: 30   LIEYKGERI-TWK--------EALRRHPHNPAEPNHTFYFALDNGK-------VIDGKVN 73

Query: 1969 ANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES------KE 2022
             N A  I HSC PNCEA+   +DGH  + ++ +R I  GEE+ +DY  V ++      K+
Sbjct: 74   GNSARWINHSCAPNCEAE--EIDGH--VYVHALRDIAEGEEVFYDYGLVIDARQTKKLKK 129

Query: 2023 EYEASVCLCGSQVCRGSYL 2041
            EYE   C CG++ CRG+ L
Sbjct: 130  EYE---CRCGARKCRGTML 145


>gi|209515808|ref|ZP_03264671.1| nuclear protein SET [Burkholderia sp. H160]
 gi|209503835|gb|EEA03828.1| nuclear protein SET [Burkholderia sp. H160]
          Length = 170

 Score = 61.2 bits (147), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 40/117 (34%), Positives = 61/117 (52%), Gaps = 20/117 (17%)

Query: 1931 SLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAV 1990
            +L+++  +PA   +  Y     G        V+D     N A  I HSC PNCEA+   +
Sbjct: 43   ALRRHPHNPAEPNHTFYFALDNGK-------VIDGKVNGNSARWINHSCAPNCEAE--EI 93

Query: 1991 DGHYQIGIYTVRGIHYGEEITFDYNSVTES------KEEYEASVCLCGSQVCRGSYL 2041
            DGH  + ++ +R I  GEE+ +DY  V ++      K+EYE   C CG++ CRG+ L
Sbjct: 94   DGH--VYVHALRDIAEGEEVFYDYGLVIDARQTKKLKKEYE---CRCGARKCRGTML 145


>gi|187930618|ref|YP_001901105.1| nuclear protein SET [Ralstonia pickettii 12J]
 gi|187727508|gb|ACD28673.1| nuclear protein SET [Ralstonia pickettii 12J]
          Length = 179

 Score = 60.8 bits (146), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 43/136 (31%), Positives = 68/136 (50%), Gaps = 23/136 (16%)

Query: 1909 VVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHK 1968
            ++E+ GE +  WK         +L+++  DP+   +  Y     G        V+DA   
Sbjct: 55   IIEYKGE-HITWK--------EALRRHPHDPSDPNHTFYFSLEDGS-------VIDAKFG 98

Query: 1969 ANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE---YE 2025
             N A  I H+C+PNCEA+    DG  ++ I+ +R I  GEE+ +DY  V E ++     E
Sbjct: 99   GNRARWINHACKPNCEAR--EEDG--RVFIHALRDIEPGEELFYDYGLVIEGRQTKALKE 154

Query: 2026 ASVCLCGSQVCRGSYL 2041
               C CG++ CRG+ L
Sbjct: 155  QFACRCGAKRCRGTML 170


>gi|241664808|ref|YP_002983168.1| nuclear protein SET [Ralstonia pickettii 12D]
 gi|240866835|gb|ACS64496.1| nuclear protein SET [Ralstonia pickettii 12D]
          Length = 179

 Score = 60.8 bits (146), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 43/136 (31%), Positives = 68/136 (50%), Gaps = 23/136 (16%)

Query: 1909 VVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHK 1968
            ++E+ GE +  WK         +L+++  DP+   +  Y     G        V+DA   
Sbjct: 55   IIEYKGE-HITWK--------EALRRHPHDPSDPNHTFYFSLEDGS-------VIDAKFG 98

Query: 1969 ANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEE---YE 2025
             N A  I H+C+PNCEA+    DG  ++ I+ +R I  GEE+ +DY  V E ++     E
Sbjct: 99   GNRARWINHACKPNCEAR--EEDG--RVFIHALRDIEPGEELFYDYGLVIEGRQTKALKE 154

Query: 2026 ASVCLCGSQVCRGSYL 2041
               C CG++ CRG+ L
Sbjct: 155  QFACRCGAKKCRGTML 170


>gi|281346901|gb|EFB22485.1| hypothetical protein PANDA_005493 [Ailuropoda melanoleuca]
          Length = 926

 Score = 60.8 bits (146), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 79/158 (50%), Gaps = 21/158 (13%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 634  PDAEVIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECKLRIKRAHENSV 685

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 686  TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 736

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            +  I  G E+TF+YN         E   C CG++ C G
Sbjct: 737  LCDIPAGMELTFNYNLDCLGNGRTE---CHCGAENCSG 771


>gi|340506525|gb|EGR32648.1| SET domain protein [Ichthyophthirius multifiliis]
          Length = 978

 Score = 60.8 bits (146), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/146 (27%), Positives = 70/146 (47%), Gaps = 22/146 (15%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
            F+++++GEV+ +      ++GI+ ++  +          YL +   +       V+D   
Sbjct: 73   FIIQYIGEVFDI----NSEEGIKRVKDYSRSTCT-----YLMKIDKNE------VIDPTF 117

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
            K N A  I HSC PNC  +   V G   IGI+ ++ I   +E+TFDY       + Y+  
Sbjct: 118  KGNLARFINHSCDPNCITQKWHVLGEICIGIFAIKNIKEDDELTFDYQF-----DSYKTP 172

Query: 2028 V--CLCGSQVCRGSYLNLTGEGAFEK 2051
            +  CLCG+  C+G    +  +  FE+
Sbjct: 173  LTKCLCGNVKCKGYLGYIPTDYTFEE 198


>gi|76161881|gb|AAX30110.2| KIAA1076 protein [Schistosoma japonicum]
          Length = 123

 Score = 60.5 bits (145), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 36/81 (44%), Positives = 45/81 (55%), Gaps = 4/81 (4%)

Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            D  V+DA    N A  I HSC+PNC AK+  V+   +I IY+ R I+  EEIT+DY    
Sbjct: 45   DDFVIDATMCGNNARFINHSCQPNCYAKIIMVESKKKIVIYSKRDINVMEEITYDYKFPY 104

Query: 2019 ESKEEYEASVCLCGSQVCRGS 2039
            E     E   C CGS  CRG+
Sbjct: 105  EE----EKIPCQCGSSSCRGT 121


>gi|296004740|ref|XP_966279.2| SET domain protein, putative [Plasmodium falciparum 3D7]
 gi|263429753|sp|C6KTD2.1|HKNMT_PLAF7 RecName: Full=Putative histone-lysine N-methyltransferase PFF1440w
 gi|225631776|emb|CAG25109.2| SET domain protein, putative [Plasmodium falciparum 3D7]
          Length = 6753

 Score = 60.5 bits (145), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 45/143 (31%), Positives = 70/143 (48%), Gaps = 21/143 (14%)

Query: 1902 GFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----IYLERPKGDAD 1956
            G+G   +  EF+ E  PV ++    + IR++     D   ++Y+      Y+ R   +  
Sbjct: 6623 GYGL--YTCEFINEGEPVIEYI--GEYIRNII---SDKREKYYDKIESSCYMFRLNEN-- 6673

Query: 1957 GYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQ-IGIYTVRGIHYGEEITFDYN 2015
                +++DA    N +  I HSC PNC  K+ + D + + I I+  R I   EEIT+DY 
Sbjct: 6674 ----IIIDATKWGNVSRFINHSCEPNCFCKIVSCDQNLKHIVIFAKRDIAAHEEITYDYQ 6729

Query: 2016 SVTESKEEYEASVCLCGSQVCRG 2038
               ES  E +  +CLCGS  C G
Sbjct: 6730 FGVES--EGKKLICLCGSSTCLG 6750


>gi|225430418|ref|XP_002283013.1| PREDICTED: histone-lysine N-methyltransferase ATX1-like [Vitis
            vinifera]
          Length = 496

 Score = 60.5 bits (145), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/135 (32%), Positives = 67/135 (49%), Gaps = 14/135 (10%)

Query: 1908 FVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMH 1967
             VVE++GE+  +    +++   +S +K     A  F+ I  E            ++DA  
Sbjct: 376  MVVEYVGEIVGLRVADKRESDYQSGRKLQYKTACYFFRIDKEH-----------IIDATR 424

Query: 1968 KANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEAS 2027
            K   A  + HSC PNC AKV +V    ++  +  R I+ GEEIT+DY+   E  +E +  
Sbjct: 425  KGGIARFVNHSCLPNCVAKVISVRNEKKVVFFAERDINPGEEITYDYHFNHE--DEGKKI 482

Query: 2028 VCLCGSQVCRGSYLN 2042
             C C S+ CR  YLN
Sbjct: 483  PCFCNSRNCR-RYLN 496


>gi|213404666|ref|XP_002173105.1| carboxypeptidase Y [Schizosaccharomyces japonicus yFS275]
 gi|212001152|gb|EEB06812.1| carboxypeptidase Y [Schizosaccharomyces japonicus yFS275]
          Length = 1055

 Score = 60.5 bits (145), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 49/194 (25%), Positives = 70/194 (36%), Gaps = 24/194 (12%)

Query: 409 RSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPY 468
           R P H  + P D     +HRDR P   +  P   DR P  FD  P   +R P + +  P 
Sbjct: 253 RKPEHHGKPPMDFEHEPEHRDRPPMDFEHGPEHHDRPPMDFDHEPEHHDRPPMDFEHGPE 312

Query: 469 AREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNY----------------- 511
              + P D  R  +H  R P   E  P+         +R P +                 
Sbjct: 313 RHGEPPRDFERKPEHHGRPPKDFEHGPEHHGEPPRDFERKPEHHGKPPKHFEPEREHHGE 372

Query: 512 ----LERSPLHRSRPNNHREASSKTGASEKRNARYDSKGHEDKLGPKDSNARCSRSSAKE 567
                ER P H  +P  H E   +      ++  +D   HE    PK+S         KE
Sbjct: 373 PPRDFERKPEHHGKPPKHFEPEREHRDRPPKDFEHDRAHHEKP--PKESEPEQHEKQPKE 430

Query: 568 SQDKSNVQDLNVSD 581
           S+ +  + DL + D
Sbjct: 431 SKPEQEI-DLQIVD 443



 Score = 57.0 bits (136), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 42/142 (29%), Positives = 53/142 (37%), Gaps = 20/142 (14%)

Query: 394 HEPSLSSRVIYD------RHGRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDR--- 444
           HEP    R   D      RHG  P   +R P   G+   H +  P  HDR P   +    
Sbjct: 169 HEPEHHDRPPMDFEHGPKRHGEPPEDFERKPEHHGKPPKHFEPGPDHHDRPPKDFEHGPE 228

Query: 445 ----SPYTFDRSPYSRERSPYNRDRSPYAREKSPYDRSRHYDHRNRSPFSAERSPQDRAR 500
                P  F+R P  R   P + +R P    K P D     +HR+R P   E  P+    
Sbjct: 229 HHGEPPRDFERKPEHRGEPPRDFERKPEHHGKPPMDFEHEPEHRDRPPMDFEHGPE---- 284

Query: 501 FHDRSDRTPNYLERSPLHRSRP 522
                DR P   +  P H  RP
Sbjct: 285 ---HHDRPPMDFDHEPEHHDRP 303



 Score = 51.2 bits (121), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 35/138 (25%), Positives = 50/138 (36%), Gaps = 7/138 (5%)

Query: 409 RSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPY 468
           R P H    P D  R  +H  + P   +  P  RDR P  F+  P   +R P + D  P 
Sbjct: 239 RKPEHRGEPPRDFERKPEHHGKPPMDFEHEPEHRDRPPMDFEHGPEHHDRPPMDFDHEPE 298

Query: 469 AREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNY-------LERSPLHRSR 521
             ++ P D     +     P   ER P+   R     +  P +        ER P H  +
Sbjct: 299 HHDRPPMDFEHGPERHGEPPRDFERKPEHHGRPPKDFEHGPEHHGEPPRDFERKPEHHGK 358

Query: 522 PNNHREASSKTGASEKRN 539
           P  H E   +      R+
Sbjct: 359 PPKHFEPEREHHGEPPRD 376



 Score = 49.7 bits (117), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 34/121 (28%), Positives = 46/121 (38%), Gaps = 7/121 (5%)

Query: 409 RSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPY 468
           R P H DR P D     +H DR P   +  P      P  F+R P    + P + +  P 
Sbjct: 155 REPEHHDRPPMDFEHEPEHHDRPPMDFEHGPKRHGEPPEDFERKPEHHGKPPKHFEPGPD 214

Query: 469 AREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNY-------LERSPLHRSR 521
             ++ P D     +H    P   ER P+ R       +R P +        E  P HR R
Sbjct: 215 HHDRPPKDFEHGPEHHGEPPRDFERKPEHRGEPPRDFERKPEHHGKPPMDFEHEPEHRDR 274

Query: 522 P 522
           P
Sbjct: 275 P 275


>gi|242051571|ref|XP_002454931.1| hypothetical protein SORBIDRAFT_03g001640 [Sorghum bicolor]
 gi|241926906|gb|EES00051.1| hypothetical protein SORBIDRAFT_03g001640 [Sorghum bicolor]
          Length = 993

 Score = 60.5 bits (145), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 55/185 (29%), Positives = 82/185 (44%), Gaps = 28/185 (15%)

Query: 1873 GILKAMDSRPDDKYVAYRKGLG---------VVCNKEGGFGEDDFVVEFLGEVYPVWKWF 1923
            G+   MD + D  +  +++ LG         V C + G  G   F    + E   V ++ 
Sbjct: 822  GLNACMDRKDDQSFSTFKERLGYLQKTENLRVSCGRSGIHGWGLFAARNIQEGQMVIEYR 881

Query: 1924 EKQ-----DGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHS 1978
             +Q       +R  Q + E       + YL +   D      VV+DA  K N A  I HS
Sbjct: 882  GEQVRRCVADLREAQYHREKK-----DCYLFKISED------VVIDATDKGNIARLINHS 930

Query: 1979 CRPNCEAKVTAVDG-HYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCR 2037
            C PNC A++  V G   QI +   R +  GEE+T+DY    +  E+ +   CLC +  CR
Sbjct: 931  CMPNCYARIMTVSGDRNQIILIAKRDVSAGEELTYDYLFDPDESEDCKVP-CLCKAPNCR 989

Query: 2038 GSYLN 2042
            G Y+N
Sbjct: 990  G-YMN 993


>gi|345781638|ref|XP_003432154.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Canis lupus
            familiaris]
          Length = 742

 Score = 60.5 bits (145), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/141 (27%), Positives = 72/141 (51%), Gaps = 23/141 (16%)

Query: 1898 NKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADG 1957
            NK+G     +FV E++GE+       ++++    +++ +E+    FY + + + +     
Sbjct: 535  NKQG-----EFVNEYVGEL------IDEEECRLRIKRAHENSVTNFYMLTVTKDR----- 578

Query: 1958 YDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSV 2017
                ++DA  K NY+  + HSC PNCE +   V+G  ++G++ +  I  G E+TF+YN  
Sbjct: 579  ----IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFALCDIPAGMELTFNYNLD 634

Query: 2018 TESKEEYEASVCLCGSQVCRG 2038
                   E   C CG++ C G
Sbjct: 635  CLGNGRTE---CHCGAENCSG 652


>gi|356530969|ref|XP_003534051.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Glycine
            max]
          Length = 349

 Score = 60.5 bits (145), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 45/146 (30%), Positives = 70/146 (47%), Gaps = 21/146 (14%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+V +++   GE  FV+E++GEV       E+   ++   + N      FY   + R 
Sbjct: 126  GSGIVADEDIKLGE--FVIEYVGEVIDDKTCEERLWNMKHRGETN------FYLCEINRD 177

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
                     +V+DA +K N +  I HSC PN E +   +DG  +IGI+    I  GE +T
Sbjct: 178  ---------MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATSDIQKGEHLT 228

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCR 2037
            +DY  V    ++     C CG+  CR
Sbjct: 229  YDYQFVQFGADQ----DCHCGAAECR 250


>gi|47222897|emb|CAF99053.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 768

 Score = 60.1 bits (144), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 38/145 (26%), Positives = 72/145 (49%), Gaps = 18/145 (12%)

Query: 1894 GVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKG 1953
            GV+        +  FV+E++GEV       ++++    ++   E+    FY + L++ + 
Sbjct: 480  GVLMTSSDATSQGAFVIEYVGEV------IDEEECRARIKHAQENDIFNFYMLTLDKDR- 532

Query: 1954 DADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFD 2013
                    ++DA  K N A  + H C+PNCE +   V+G  ++G++ ++ I  G+E+ F+
Sbjct: 533  --------IIDAGPKGNQARFMNHCCQPNCETQKWTVNGDTRVGLFALQDIPKGKELNFN 584

Query: 2014 YNSVTESKEEYEASVCLCGSQVCRG 2038
            YN       +   +VC CG+  C G
Sbjct: 585  YNLECLGNGK---TVCKCGAPNCSG 606


>gi|2980780|emb|CAA18207.1| putative protein [Arabidopsis thaliana]
 gi|7269987|emb|CAB79804.1| putative protein [Arabidopsis thaliana]
          Length = 477

 Score = 60.1 bits (144), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/139 (28%), Positives = 65/139 (46%), Gaps = 29/139 (20%)

Query: 1905 EDDFVVEFLGEVYPVWK-----WFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYD 1959
            ++DF+VE++GEV    +     W  K  G++           +FY   +++         
Sbjct: 328  KEDFIVEYIGEVISDAQCEQRLWDMKHKGMK-----------DFYMCEIQKD-------- 368

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
               +DA  K N +  + HSC PNC  +   V+G  ++G++  R I  GE +T+DY  V  
Sbjct: 369  -FTIDATFKGNASRFLNHSCNPNCVLEKWQVEGETRVGVFAARQIEAGEPLTYDYRFVQF 427

Query: 2020 SKEEYEASVCLCGSQVCRG 2038
              E      C CGS+ C+G
Sbjct: 428  GPE----VKCNCGSENCQG 442


>gi|407715215|ref|YP_006835780.1| nuclear protein SET [Burkholderia phenoliruptrix BR3459a]
 gi|407237399|gb|AFT87598.1| nuclear protein SET [Burkholderia phenoliruptrix BR3459a]
          Length = 173

 Score = 60.1 bits (144), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 48/139 (34%), Positives = 70/139 (50%), Gaps = 29/139 (20%)

Query: 1909 VVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAMHK 1968
            ++E+ GE    WK     + +R    N ++P   FY   L+  K         V+D    
Sbjct: 18   LIEYKGERIS-WK-----EALRRHPHNPDEPNHTFY-FALDSGK---------VIDGKVN 61

Query: 1969 ANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES------KE 2022
             N A  I HSC PNCEA+   +DGH  + ++ +R I  GEE+ +DY  V ++      K+
Sbjct: 62   GNSARWINHSCAPNCEAE--EIDGH--VYVHALRDIAEGEELFYDYGLVIDARQTKKLKK 117

Query: 2023 EYEASVCLCGSQVCRGSYL 2041
            EYE   C CGS+ CRG+ L
Sbjct: 118  EYE---CRCGSRKCRGTML 133


>gi|224117806|ref|XP_002331636.1| SET domain protein [Populus trichocarpa]
 gi|222874032|gb|EEF11163.1| SET domain protein [Populus trichocarpa]
          Length = 351

 Score = 59.7 bits (143), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/146 (30%), Positives = 67/146 (45%), Gaps = 21/146 (14%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+V +++   GE  FV+E++GEV       +       L K        FY   + R 
Sbjct: 124  GSGIVADEDIKQGE--FVIEYVGEV------IDDNTCEERLWKMKHRGETNFYLCEINRN 175

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
                     +V+DA +K N +  I HSC PN E +   +DG  +IGI+    I  GE +T
Sbjct: 176  ---------MVIDATYKGNKSRYINHSCSPNTEMQKWIIDGETRIGIFATHDIRKGEHLT 226

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCR 2037
            +DY  V    ++     C CG+  CR
Sbjct: 227  YDYQFVQFGADQ----DCHCGASGCR 248


>gi|449505027|ref|XP_004162355.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Cucumis
            sativus]
          Length = 373

 Score = 59.7 bits (143), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 71/146 (48%), Gaps = 21/146 (14%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+V +++   GE  FV+E++GEV       E+   ++   + N      FY   + R 
Sbjct: 126  GSGIVADEDIKQGE--FVIEYVGEVIDDKTCEERLWNMKHRGETN------FYLCEINRD 177

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
                     +V+DA +K N +  I HSC PN E +   +DG  +IGI+  R I  GE +T
Sbjct: 178  ---------MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIPKGEHLT 228

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCR 2037
            +DY  V    ++     C CG+  CR
Sbjct: 229  YDYQFVQFGADQD----CHCGAVDCR 250


>gi|297802948|ref|XP_002869358.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297315194|gb|EFH45617.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 497

 Score = 59.7 bits (143), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 67/144 (46%), Gaps = 30/144 (20%)

Query: 1903 FGEDDFVVEFLGEVYPVWK-----WFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADG 1957
              ++DF+VE++GEV    +     W  K  G++           +FY   +++       
Sbjct: 346  INKEDFIVEYIGEVISDAQCEQRLWDMKHKGMK-----------DFYMCEIQKD------ 388

Query: 1958 YDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSV 2017
                 +DA  K N +  + HSC PNC  +   V+G  ++G++  R I  GE +T+DY  V
Sbjct: 389  ---FTIDATFKGNASRFLNHSCSPNCVLEKWQVEGETRVGVFAARQIEAGEPLTYDYRFV 445

Query: 2018 TESKEEYEASVCLCGSQVCRGSYL 2041
                E      C CGS+ C+G YL
Sbjct: 446  QFGPEVK----CNCGSESCQG-YL 464


>gi|449442399|ref|XP_004138969.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Cucumis
            sativus]
          Length = 373

 Score = 59.3 bits (142), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 71/146 (48%), Gaps = 21/146 (14%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+V +++   GE  FV+E++GEV       E+   ++   + N      FY   + R 
Sbjct: 126  GSGIVADEDIKQGE--FVIEYVGEVIDDKTCEERLWNMKHRGETN------FYLCEINRD 177

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
                     +V+DA +K N +  I HSC PN E +   +DG  +IGI+  R I  GE +T
Sbjct: 178  ---------MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIPKGEHLT 228

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCR 2037
            +DY  V    ++     C CG+  CR
Sbjct: 229  YDYQFVQFGADQD----CHCGAVDCR 250


>gi|18417683|ref|NP_567859.1| histone-lysine N-methyltransferase ASHR3 [Arabidopsis thaliana]
 gi|75164864|sp|Q949T8.1|ASHR3_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR3; AltName:
            Full=ASH1-related protein 3; AltName: Full=Protein SET
            DOMAIN GROUP 4; AltName: Full=Protein stamen loss
 gi|15292921|gb|AAK92831.1| unknown protein [Arabidopsis thaliana]
 gi|20465681|gb|AAM20309.1| unknown protein [Arabidopsis thaliana]
 gi|56201422|dbj|BAD72877.1| stamen loss [Arabidopsis thaliana]
 gi|332660421|gb|AEE85821.1| histone-lysine N-methyltransferase ASHR3 [Arabidopsis thaliana]
          Length = 497

 Score = 59.3 bits (142), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 67/144 (46%), Gaps = 30/144 (20%)

Query: 1903 FGEDDFVVEFLGEVYPVWK-----WFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADG 1957
              ++DF+VE++GEV    +     W  K  G++           +FY   +++       
Sbjct: 346  INKEDFIVEYIGEVISDAQCEQRLWDMKHKGMK-----------DFYMCEIQKD------ 388

Query: 1958 YDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSV 2017
                 +DA  K N +  + HSC PNC  +   V+G  ++G++  R I  GE +T+DY  V
Sbjct: 389  ---FTIDATFKGNASRFLNHSCNPNCVLEKWQVEGETRVGVFAARQIEAGEPLTYDYRFV 445

Query: 2018 TESKEEYEASVCLCGSQVCRGSYL 2041
                E      C CGS+ C+G YL
Sbjct: 446  QFGPE----VKCNCGSENCQG-YL 464


>gi|254358922|ref|ZP_04975195.1| putative membrane protein [Burkholderia mallei 2002721280]
 gi|148028049|gb|EDK86070.1| putative membrane protein [Burkholderia mallei 2002721280]
          Length = 956

 Score = 56.6 bits (135), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 48/202 (23%), Positives = 82/202 (40%), Gaps = 9/202 (4%)

Query: 378 SSSSRISSLDKYSSRHHEP---------SLSSRVIYDRHGRSPSHSDRSPHDRGRYYDHR 428
           S++ RI+S      RHH P         +       +   R+P+   R+P+   R  +  
Sbjct: 3   SANGRIASFGSLRERHHAPQSRGPLAALNRRRARRSESERRTPNAERRTPNAERRTPNAE 62

Query: 429 DRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPYAREKSPYDRSRHYDHRNRSP 488
            R+P+   R+P    R+P    R+P +  R+P    R+P A  ++P    R  +   R+P
Sbjct: 63  RRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTP 122

Query: 489 FSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNNHREASSKTGASEKRNARYDSKGHE 548
            +  R+P    R  +   RTPN   R+P    R  N    +   G   + + R ++    
Sbjct: 123 NAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAGPETRLSRRGNTIDSS 182

Query: 549 DKLGPKDSNARCSRSSAKESQD 570
               P     R +R S + S +
Sbjct: 183 PIRTPSAPRRRLTRPSMRYSVE 204


>gi|254200343|ref|ZP_04906709.1| putative membrane protein [Burkholderia mallei FMH]
 gi|147749939|gb|EDK57013.1| putative membrane protein [Burkholderia mallei FMH]
          Length = 1012

 Score = 56.2 bits (134), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 50/182 (27%), Positives = 78/182 (42%), Gaps = 20/182 (10%)

Query: 378 SSSSRISSLDKYSSRHHEP----------------SLSSRVIYDRHGRSPSHSDRSPHDR 421
           S++ RI+S      RHH P                S S R   +   R+P+   R+P+  
Sbjct: 3   SANGRIASFGSLRERHHAPQSRGPLAALNRRRARRSESERRTPNAERRTPNAERRTPNAE 62

Query: 422 GRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPYAREKSPYDRSRHY 481
            R  +   R+P+   R+P    R+P    R+P +  R+P    R+P A  ++P    R  
Sbjct: 63  RRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTP 122

Query: 482 DHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSP-LHRSRPNNHRE---ASSKTGASEK 537
           +   R+P +  R+P    R  +   RTPN   R+P   R  PN  R    A  +T  +E+
Sbjct: 123 NAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAER 182

Query: 538 RN 539
           R 
Sbjct: 183 RT 184



 Score = 54.3 bits (129), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 45/160 (28%), Positives = 71/160 (44%), Gaps = 6/160 (3%)

Query: 409 RSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPY 468
           R+P+   R+P+   R  +   R+P+   R+P    R+P    R+P +  R+P    R+P 
Sbjct: 71  RTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPN 130

Query: 469 AREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSP-LHRSRPNNHRE 527
           A  ++P    R  +   R+P +  R+P    R  +   RTPN   R+P   R  PN  R 
Sbjct: 131 AERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERR 190

Query: 528 ---ASSKTGASEKR--NARYDSKGHEDKLGPKDSNARCSR 562
              A  +T  +E+R  NA   +   E +        R SR
Sbjct: 191 TPNAERRTPNAERRTPNAERRTPNAERRTPNAGPETRLSR 230



 Score = 53.9 bits (128), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/162 (24%), Positives = 69/162 (42%)

Query: 409 RSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPY 468
           R+P+   R+P+   R  +   R+P+   R+P    R+P    R+P +  R+P    R+P 
Sbjct: 99  RTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPN 158

Query: 469 AREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNNHREA 528
           A  ++P    R  +   R+P +  R+P    R  +   RTPN   R+P    R  N    
Sbjct: 159 AERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERRTPNAERR 218

Query: 529 SSKTGASEKRNARYDSKGHEDKLGPKDSNARCSRSSAKESQD 570
           +   G   + + R ++        P     R +R S + S +
Sbjct: 219 TPNAGPETRLSRRGNTIDSSPIRTPSAPRRRLTRPSMRYSVE 260


>gi|407262482|ref|XP_003946424.1| PREDICTED: periphilin-1-like [Mus musculus]
          Length = 588

 Score = 54.7 bits (130), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 60/137 (43%), Positives = 70/137 (51%), Gaps = 33/137 (24%)

Query: 391 SRHHEPSLSSRVIYDRHGRSPSHS-DRSPH-----------DRGRYYDHRDRSP--SRHD 436
           S HH    SS    DR   SP ++ DRSPH           DR   Y  RDRSP  +R  
Sbjct: 232 SSHHPRDRSSHYARDR---SPHYARDRSPHYARDRPPQYARDRSSQYA-RDRSPQYARDR 287

Query: 437 RSPYTRDRSP-YTFDRSP-YSRERSP---------YNRDRSP-YAREKS---PYDRSRHY 481
            S Y RDRSP Y  DRSP Y+R+RSP         Y RDRSP YAR++S     DRS HY
Sbjct: 288 SSHYARDRSPHYARDRSPHYARDRSPQYARDRSSHYARDRSPQYARDRSSQYARDRSSHY 347

Query: 482 DHRNRSPFSAERSPQDR 498
                S ++ +RSP  R
Sbjct: 348 ARDRSSHYARDRSPHKR 364


>gi|21220509|ref|NP_626288.1| hypothetical protein SCO2028 [Streptomyces coelicolor A3(2)]
 gi|5738516|emb|CAB52863.1| putative membrane protein [Streptomyces coelicolor A3(2)]
          Length = 509

 Score = 53.1 bits (126), Expect = 0.002,   Method: Composition-based stats.
 Identities = 38/105 (36%), Positives = 51/105 (48%)

Query: 410 SPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPYA 469
           +P  S+R      R+  + DRSP   DRSP   DRS    DRSP   + +P + DRS  A
Sbjct: 335 APPASERVRRATKRFPRNPDRSPPNPDRSPPDPDRSSPNPDRSPSDPDGTPSDPDRSSPA 394

Query: 470 REKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLER 514
            ++SP   + H    +RSP   +  P D  R    SDR P   +R
Sbjct: 395 SDRSPSAPAAHAPAPDRSPPDPDGRPSDPDRSSPASDRLPPASDR 439



 Score = 52.0 bits (123), Expect = 0.005,   Method: Composition-based stats.
 Identities = 44/130 (33%), Positives = 58/130 (44%), Gaps = 3/130 (2%)

Query: 409 RSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPY 468
           RSP + DRSP D  R   + DRSPS  D +P   DRS    DRSP +        DRSP 
Sbjct: 355 RSPPNPDRSPPDPDRSSPNPDRSPSDPDGTPSDPDRSSPASDRSPSAPAAHAPAPDRSPP 414

Query: 469 AREKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNNHREA 528
             +  P D  R     +R P +++RSP   A      D +P   +  P   S P+    A
Sbjct: 415 DPDGRPSDPDRSSPASDRLPPASDRSPSAPAAHAPAPDWSPPDPDGRP---SDPDRSSPA 471

Query: 529 SSKTGASEKR 538
           S +   +  R
Sbjct: 472 SDRLPPAPDR 481


>gi|148708748|gb|EDL40695.1| mCG51743 [Mus musculus]
          Length = 527

 Score = 53.1 bits (126), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 60/137 (43%), Positives = 70/137 (51%), Gaps = 33/137 (24%)

Query: 391 SRHHEPSLSSRVIYDRHGRSPSHS-DRSPH-----------DRGRYYDHRDRSP--SRHD 436
           S HH    SS    DR   SP ++ DRSPH           DR   Y  RDRSP  +R  
Sbjct: 171 SSHHPRDRSSHYARDR---SPHYARDRSPHYARDRPPQYARDRSSQYA-RDRSPQYARDR 226

Query: 437 RSPYTRDRSP-YTFDRSP-YSRERSP---------YNRDRSP-YAREKS---PYDRSRHY 481
            S Y RDRSP Y  DRSP Y+R+RSP         Y RDRSP YAR++S     DRS HY
Sbjct: 227 SSHYARDRSPHYARDRSPHYARDRSPQYARDRSSHYARDRSPQYARDRSSQYARDRSSHY 286

Query: 482 DHRNRSPFSAERSPQDR 498
                S ++ +RSP  R
Sbjct: 287 ARDRSSHYARDRSPHKR 303


>gi|18605943|gb|AAH22960.1| BC022960 protein, partial [Mus musculus]
          Length = 368

 Score = 51.6 bits (122), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 54/115 (46%), Positives = 68/115 (59%), Gaps = 20/115 (17%)

Query: 428 RDRSP--SRHDRSPYTRDRSP-YTFDRSP-YSRERSP-YNRDRSP-YAREKSPY---DRS 478
           RDRS   +R   S Y RDRS  Y  DRSP Y+R+RS  Y RDRSP YAR++SP+   DRS
Sbjct: 33  RDRSSQYARDRSSQYARDRSSQYARDRSPQYARDRSSHYARDRSPHYARDRSPHYARDRS 92

Query: 479 RHYDHRNRSPFSAERSPQ---DRARFH--DRS-----DRTPNYL-ERSPLHRSRP 522
            HY     S ++ +RSPQ   DR+  +  DRS     DR+ +Y  +RSP  R  P
Sbjct: 93  PHYARDRSSHYARDRSPQYARDRSSQYARDRSSHYARDRSSHYARDRSPHKRDAP 147


>gi|340382809|ref|XP_003389910.1| PREDICTED: hypothetical protein LOC100636721 [Amphimedon
           queenslandica]
          Length = 841

 Score = 48.5 bits (114), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 25/51 (49%), Positives = 30/51 (58%)

Query: 427 HRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPYAREKSPYDR 477
           +RD  P   D  PY RD  PY  D  PY+R+  PYNRD  PY R+  PY+R
Sbjct: 439 YRDDRPYNRDDRPYNRDNRPYNRDDRPYNRDDRPYNRDDRPYNRDDRPYNR 489



 Score = 44.3 bits (103), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 27/65 (41%), Positives = 34/65 (52%)

Query: 404 YDRHGRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNR 463
           Y+R G    + D  P++R     +RD  P   D  PY RD  PY  D  PY+R+  PYNR
Sbjct: 430 YNRDGEDRPYRDDRPYNRDDRPYNRDNRPYNRDDRPYNRDDRPYNRDDRPYNRDDRPYNR 489

Query: 464 DRSPY 468
           D  PY
Sbjct: 490 DDRPY 494


>gi|407264369|ref|XP_003945664.1| PREDICTED: periphilin-1-like [Mus musculus]
          Length = 668

 Score = 48.1 bits (113), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 88/267 (32%), Positives = 117/267 (43%), Gaps = 45/267 (16%)

Query: 300 EGLYKGEHNNGKNHGREYFHGNRFKRHGTDSDSGDRKYYGDYG-----DFAGLKSRRLSD 354
           +G Y  +     + G+ +F   R  R    S S    Y    G      F+    R  S 
Sbjct: 182 DGYYSHDAFRVCDEGQSFFRDQRRSRRNYHSASWQPNYRNRRGGLRRKTFSSHHPRDRSS 241

Query: 355 DY-NSRSVH-----SEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPSLS-SRVIYDRH 407
            Y   RS H     S HY+R    ++ R+ SS          +R   P  +  R  +   
Sbjct: 242 HYARDRSPHYARDRSPHYARDRPPQYARDRSSQYARDRSPQYARDRSPQYARDRSPHYAR 301

Query: 408 GRSPSHS-DRSPH---DRGRYYDH-------RDRSP--SRHDRSPYTRDRSP-YTFDRSP 453
            RSP ++ DRS     DR   Y         RDRS   +R   S Y RDRS  Y  DRSP
Sbjct: 302 DRSPQYARDRSSQYARDRSSQYARDRSSQYARDRSSQYARDRSSQYARDRSSQYARDRSP 361

Query: 454 -YSRERSP-YNRDRSP-YAREKSPY---DRSRHYDHRNRSPFSAERSPQ----------- 496
            Y+R+RS  Y RDRSP YAR++SP+   DRS HY     S ++ +RSPQ           
Sbjct: 362 QYARDRSSHYARDRSPHYARDRSPHYARDRSPHYARDRSSHYARDRSPQYARDRSSQYAR 421

Query: 497 DRARFHDRSDRTPNYL-ERSPLHRSRP 522
           DR+  + R DR+ +Y  +RSP  R  P
Sbjct: 422 DRSSHYAR-DRSSHYARDRSPHKRDAP 447


>gi|357622727|gb|EHJ74138.1| hypothetical protein KGM_12956 [Danaus plexippus]
          Length = 1922

 Score = 48.1 bits (113), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 64/231 (27%), Positives = 97/231 (41%), Gaps = 32/231 (13%)

Query: 413 HSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRS-PYSRERSPYNRDRSPYARE 471
           H D S +DR R   +  RSP    R+P    + PY  D+   Y +  SPY++  S Y R 
Sbjct: 418 HRDAS-YDRSRGGSYEPRSP----RAPSYERKPPY--DKGGAYEKRLSPYDKRSSSYERR 470

Query: 472 KSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSD-----RTPNYLERSPLHRSRPNNHR 526
            + YD+   YD R  SP+S  R     +R   R D     RTP      P+   RP + R
Sbjct: 471 AASYDKQTPYDRRRHSPYSRMRGSSYGSRSPSRDDPRKRPRTP------PVETRRPLSPR 524

Query: 527 EASSKTGASEKRN---ARYDSKGHEDKLGPK------DSNARCSRSSAKESQDKSNVQDL 577
           E  + +  +  R+   A YD      K  P+          + S  S  +  D + V+  
Sbjct: 525 EGETTSPMNSVRSEEGAEYDRGDRSGKQIPRIDFYHQSYRHKSSIRSPSQEVDNNYVELQ 584

Query: 578 NVSDEKTANCESHKEEQP-QSSSVDCKEPPQVDGPPLEELVSMEEDMDICD 627
           + S       ++    +P +S + +  E   +D  P E ++S   D DICD
Sbjct: 585 HSSLVTVPIVDTTVAPKPIESPNRNPDEEKSMDAEPFEPILS---DEDICD 632


>gi|294678028|ref|YP_003578643.1| ribosomal large subunit pseudouridine synthase B [Rhodobacter
           capsulatus SB 1003]
 gi|294476848|gb|ADE86236.1| ribosomal large subunit pseudouridine synthase B [Rhodobacter
           capsulatus SB 1003]
          Length = 594

 Score = 46.2 bits (108), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 61/221 (27%), Positives = 84/221 (38%), Gaps = 29/221 (13%)

Query: 331 DSGDRKYYGDYGDFAGLKSRRLSDD---YNSRSVHSEHYSRHSVEKFHRNSSSSRISSLD 387
           + GDRK Y          +RR   D   Y  R    + YSR   E   R   + R     
Sbjct: 337 EDGDRKPYAPRDGEKKPYARREDGDRKPYAPRDGEKKPYSRR--EDGDRKPYAPRDGERK 394

Query: 388 KYSSRHHEPSLSSRVIYDRHGRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTR----D 443
            Y+ R            DR   +P   ++ P+ R    D +  +P   +R PY R    D
Sbjct: 395 PYARREDG---------DRKPYAPRDGEKKPYARREDGDRKPYAPRDGERKPYARREDGD 445

Query: 444 RSPYTFDRSPYSRERSPYNR----DRSPYAR---EKSPYDRSRHYDHRNRSPFSAERSPQ 496
           R PY    +P   E+ PY R    DR PYA    EK PY R    D +  +P   E+ P 
Sbjct: 446 RKPY----APRDGEKKPYARREDGDRKPYAPRDGEKKPYARREDGDRKPYAPRDGEKKPY 501

Query: 497 DRARFHDRSDRTPNYLERSPLHRSRPNNHREASSKTGASEK 537
            R    DR    P   E+ P  R    + +  + + G + K
Sbjct: 502 ARREDGDRKPYAPRDGEKKPYARREDGDRKPYAPRDGEAGK 542



 Score = 44.3 bits (103), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 57/213 (26%), Positives = 80/213 (37%), Gaps = 26/213 (12%)

Query: 356 YNSRSVHSEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRVIYDRHGRSPSHSD 415
           Y  R    + Y+R   E   R   + R      YS R            DR   +P   +
Sbjct: 344 YAPRDGEKKPYARR--EDGDRKPYAPRDGEKKPYSRREDG---------DRKPYAPRDGE 392

Query: 416 RSPHDRGRYYDHRDRSPSRHDRSPYTR----DRSPYT---FDRSPYSRERSPYNRDRSPY 468
           R P+ R    D +  +P   ++ PY R    DR PY     +R PY+R     + DR PY
Sbjct: 393 RKPYARREDGDRKPYAPRDGEKKPYARREDGDRKPYAPRDGERKPYARRE---DGDRKPY 449

Query: 469 AR---EKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNNH 525
           A    EK PY R    D +  +P   E+ P  R    DR    P   E+ P  R    + 
Sbjct: 450 APRDGEKKPYARREDGDRKPYAPRDGEKKPYARREDGDRKPYAPRDGEKKPYARREDGDR 509

Query: 526 REASSKTGASEKRNARYDSKGHEDKLGPKDSNA 558
           +  + + G  +    R D  G      P+D  A
Sbjct: 510 KPYAPRDGEKKPYARRED--GDRKPYAPRDGEA 540



 Score = 43.9 bits (102), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 56/206 (27%), Positives = 80/206 (38%), Gaps = 30/206 (14%)

Query: 338 YGDYGDFAGLKSRRLSDDYNSRSVHSEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPS 397
           +G   DFAG +  R    +  +S H E   R     F R     R      Y+ R     
Sbjct: 290 FGRQRDFAGAEGDRKPKSFGMKS-HREEGERKP---FARREDGERKP----YARREDG-- 339

Query: 398 LSSRVIYDRHGRSPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTR----DRSPYT---FD 450
                  DR   +P   ++ P+ R    D +  +P   ++ PY+R    DR PY     +
Sbjct: 340 -------DRKPYAPRDGEKKPYARREDGDRKPYAPRDGEKKPYSRREDGDRKPYAPRDGE 392

Query: 451 RSPYSRERSPYNRDRSPYAR---EKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDR 507
           R PY+R     + DR PYA    EK PY R    D +  +P   ER P  R    DR   
Sbjct: 393 RKPYARRE---DGDRKPYAPRDGEKKPYARREDGDRKPYAPRDGERKPYARREDGDRKPY 449

Query: 508 TPNYLERSPLHRSRPNNHREASSKTG 533
            P   E+ P  R    + +  + + G
Sbjct: 450 APRDGEKKPYARREDGDRKPYAPRDG 475


>gi|395732000|ref|XP_003775997.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein C2orf16
            homolog [Pongo abelii]
          Length = 1984

 Score = 45.8 bits (107), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 70/184 (38%), Positives = 86/184 (46%), Gaps = 28/184 (15%)

Query: 363  SEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPS-LSSRVIYDRHGRSPSHSDR-SPHD 420
            SE   R   E+ HR  S    S       RH  PS  S R   +R  RSPS   R SP +
Sbjct: 1687 SERSHRSPSERRHRRPSER--SHRSPSERRHRRPSERSHRSPSERRHRSPSQRSRPSPSE 1744

Query: 421  RGRYYDHRDRSPS-RHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPYAREKSPYDRSR 479
            R      R RSPS R  RSP  R R P   +R    R RSP  R +    R +SP +RS 
Sbjct: 1745 R------RHRSPSERRHRSPSQRSR-PSPSER----RHRSPSQRSQ---RRHRSPSERSH 1790

Query: 480  HYDHRNRSPFSAE---RSPQDRAR--FHDRSDRTPNYLERSPLHRSRPNNHREASSKT-G 533
            H     R   S+E   RSP +R+R    +RS R+P+   RS  HRS   +HR  S ++  
Sbjct: 1791 HSPSERRHLSSSERRHRSPLERSRHSLSERSHRSPSE-RRS--HRSFERSHRRISERSHS 1847

Query: 534  ASEK 537
             SEK
Sbjct: 1848 PSEK 1851


>gi|356515246|ref|XP_003526312.1| PREDICTED: probable histone-lysine N-methyltransferase ATXR3-like
          [Glycine max]
          Length = 2325

 Score = 41.6 bits (96), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 19/36 (52%), Positives = 23/36 (63%), Gaps = 3/36 (8%)

Query: 1  MGDGGVACMPLQQQQQHNSIMERFPISDKTTICVGN 36
          MGDGGVAC+PLQQQQ    ++ER P +       GN
Sbjct: 1  MGDGGVACIPLQQQQH---VIERLPNAAAEKALSGN 33


>gi|390344371|ref|XP_798120.3| PREDICTED: uncharacterized protein LOC593558 [Strongylocentrotus
           purpuratus]
          Length = 1785

 Score = 41.2 bits (95), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 41/99 (41%)

Query: 421 RGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPYAREKSPYDRSRH 480
           RGR     DR P +H R P    R P  ++R P+   RSP    RSP    +SP + SR 
Sbjct: 560 RGREPPMHDRVPPQHGRMPPQHGRVPSDYERVPHGHVRSPSEYSRSPSEYSRSPSEYSRS 619

Query: 481 YDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHR 519
                  P    +  Q   R   +  R P+   ++P  R
Sbjct: 620 PSEHRGPPGRGPQPQQQYGRVPSQHGRAPHEAGKAPHGR 658


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.314    0.131    0.389 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 40,404,072,461
Number of Sequences: 23463169
Number of extensions: 1839635905
Number of successful extensions: 7974976
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 10864
Number of HSP's successfully gapped in prelim test: 20165
Number of HSP's that attempted gapping in prelim test: 6426794
Number of HSP's gapped (non-prelim): 743596
length of query: 2445
length of database: 8,064,228,071
effective HSP length: 160
effective length of query: 2285
effective length of database: 8,605,088,327
effective search space: 19662626827195
effective search space used: 19662626827195
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 86 (37.7 bits)