BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 026129
         (243 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|449505027|ref|XP_004162355.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Cucumis
           sativus]
          Length = 373

 Score =  419 bits (1076), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 198/233 (84%), Positives = 217/233 (93%), Gaps = 1/233 (0%)

Query: 1   MPAAKKNSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRR 60
           MPA KKNS+ SRIG+ F+KL++QIGNPV+FELPDWF K K +PY FI+RNIYLTK+ KRR
Sbjct: 1   MPAIKKNSECSRIGNVFHKLMRQIGNPVDFELPDWFSKWKPMPYTFIRRNIYLTKKFKRR 60

Query: 61  LEDDGIFCSCTASPGSSGVC-DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLV 119
           +EDDGIFCSC+ SPGS GVC D+DCHCGMLLSSCSSGCKCG  CLNKPFQ+RPVKKMKLV
Sbjct: 61  VEDDGIFCSCSPSPGSPGVCCDKDCHCGMLLSSCSSGCKCGVLCLNKPFQHRPVKKMKLV 120

Query: 120 QTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVI 179
           +TEKCG+GIVADEDIK+GEFVIEYVGEVIDD+TCEERLW MKH GETNFYLCEINRDMVI
Sbjct: 121 KTEKCGSGIVADEDIKQGEFVIEYVGEVIDDKTCEERLWNMKHRGETNFYLCEINRDMVI 180

Query: 180 DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDI KGE+LTYDYQ+
Sbjct: 181 DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIPKGEHLTYDYQF 233


>gi|449442399|ref|XP_004138969.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Cucumis
           sativus]
          Length = 373

 Score =  418 bits (1075), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 198/233 (84%), Positives = 216/233 (92%), Gaps = 1/233 (0%)

Query: 1   MPAAKKNSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRR 60
           MPA KKNS+ SRIG+ F+KL++QIGNPV FELPDWF K K +PY FI+RNIYLTK+ KRR
Sbjct: 1   MPAIKKNSECSRIGNVFHKLMRQIGNPVNFELPDWFSKWKPMPYTFIRRNIYLTKKFKRR 60

Query: 61  LEDDGIFCSCTASPGSSGVC-DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLV 119
           +EDDGIFCSC+ SPGS GVC D+DCHCGMLLSSCSSGCKCG  CLNKPFQ+RPVKKMKLV
Sbjct: 61  VEDDGIFCSCSPSPGSPGVCCDKDCHCGMLLSSCSSGCKCGVLCLNKPFQHRPVKKMKLV 120

Query: 120 QTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVI 179
           +TEKCG+GIVADEDIK+GEFVIEYVGEVIDD+TCEERLW MKH GETNFYLCEINRDMVI
Sbjct: 121 KTEKCGSGIVADEDIKQGEFVIEYVGEVIDDKTCEERLWNMKHRGETNFYLCEINRDMVI 180

Query: 180 DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDI KGE+LTYDYQ+
Sbjct: 181 DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIPKGEHLTYDYQF 233


>gi|224117806|ref|XP_002331636.1| SET domain protein [Populus trichocarpa]
 gi|222874032|gb|EEF11163.1| SET domain protein [Populus trichocarpa]
          Length = 351

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 199/232 (85%), Positives = 212/232 (91%), Gaps = 1/232 (0%)

Query: 1   MPAAKKNSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRR 60
           MPAAKKN D  RI  AFNKLLK+IGN VEFELPD F K K+  Y  IKRNIYLTK+IKRR
Sbjct: 1   MPAAKKNLDQ-RITLAFNKLLKEIGNQVEFELPDAFNKSKSTAYTLIKRNIYLTKKIKRR 59

Query: 61  LEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQ 120
           LEDDGIFCSC+AS GSS VCDRDCHCGMLLSSCSSGCKCG+SC NKPFQ+RP+KKMKLVQ
Sbjct: 60  LEDDGIFCSCSASLGSSRVCDRDCHCGMLLSSCSSGCKCGSSCSNKPFQHRPLKKMKLVQ 119

Query: 121 TEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVID 180
           TEKCG+GIVADEDIK+GEFVIEYVGEVIDD TCEERLWKMKH GETNFYLCEINR+MVID
Sbjct: 120 TEKCGSGIVADEDIKQGEFVIEYVGEVIDDNTCEERLWKMKHRGETNFYLCEINRNMVID 179

Query: 181 ATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           ATYKGNKSRYINHSC PNTEMQKWIIDGETRIGIFAT DI+KGE+LTYDYQ+
Sbjct: 180 ATYKGNKSRYINHSCSPNTEMQKWIIDGETRIGIFATHDIRKGEHLTYDYQF 231


>gi|356530969|ref|XP_003534051.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Glycine
           max]
          Length = 349

 Score =  407 bits (1045), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 192/233 (82%), Positives = 210/233 (90%), Gaps = 1/233 (0%)

Query: 1   MPAAKKNSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKR- 59
           MPA KKN + +  G  FNKL K++G PV+FELPDWF K K + Y +IKRNIYLTK++KR 
Sbjct: 1   MPAMKKNPEQTHFGSVFNKLGKELGEPVDFELPDWFNKSKPMQYTYIKRNIYLTKKVKRS 60

Query: 60  RLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLV 119
           R +DDGIFCSCT SPGS+ VC RDCHCGMLLSSCSSGCKCG+SCLNKPFQNRPVKKMKLV
Sbjct: 61  RFDDDGIFCSCTPSPGSTSVCGRDCHCGMLLSSCSSGCKCGSSCLNKPFQNRPVKKMKLV 120

Query: 120 QTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVI 179
           +TEKCG+GIVADEDIK GEFVIEYVGEVIDD+TCEERLW MKH GETNFYLCEINRDMVI
Sbjct: 121 KTEKCGSGIVADEDIKLGEFVIEYVGEVIDDKTCEERLWNMKHRGETNFYLCEINRDMVI 180

Query: 180 DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT DI+KGE+LTYDYQ+
Sbjct: 181 DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATSDIQKGEHLTYDYQF 233


>gi|356559949|ref|XP_003548258.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Glycine
           max]
          Length = 349

 Score =  402 bits (1032), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 192/233 (82%), Positives = 210/233 (90%), Gaps = 1/233 (0%)

Query: 1   MPAAKKNSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKR- 59
           MPA KKN + + IG  FN L K++G PV+FELPD F K K + Y +IKRNIYLTK++KR 
Sbjct: 1   MPAMKKNPELTCIGSVFNMLGKELGEPVDFELPDSFNKSKPMQYTYIKRNIYLTKKVKRS 60

Query: 60  RLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLV 119
           R +DDGIFCSCT SPGS+ VC RDCHCGMLLSSCSSGCKCG+SCLNKPFQNRPVKKMKLV
Sbjct: 61  RFDDDGIFCSCTPSPGSTSVCGRDCHCGMLLSSCSSGCKCGSSCLNKPFQNRPVKKMKLV 120

Query: 120 QTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVI 179
           +TEKCG+GIVADEDIK GEFVIEYVGEVIDD+TCEERLW MKH GETNFYLCEINRDMVI
Sbjct: 121 KTEKCGSGIVADEDIKLGEFVIEYVGEVIDDKTCEERLWNMKHSGETNFYLCEINRDMVI 180

Query: 180 DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDI+KGE+LTYDYQ+
Sbjct: 181 DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIQKGEHLTYDYQF 233


>gi|297737225|emb|CBI26426.3| unnamed protein product [Vitis vinifera]
          Length = 438

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 190/235 (80%), Positives = 211/235 (89%), Gaps = 3/235 (1%)

Query: 1   MPAAKK---NSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRI 57
           MP+AKK   N +   +G AF+KLLK++GNPV+FELP  F K K   Y FI+RNIYLTK+I
Sbjct: 83  MPSAKKLKQNIEQKGVGGAFSKLLKELGNPVDFELPSSFNKWKPTSYTFIRRNIYLTKKI 142

Query: 58  KRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMK 117
           KRRLEDDGIFCSC++  GSSGVC RDC CGML SSCSSGCKCG SCLNKPFQ+RPVKKMK
Sbjct: 143 KRRLEDDGIFCSCSSGSGSSGVCGRDCLCGMLQSSCSSGCKCGTSCLNKPFQSRPVKKMK 202

Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
           +V+TEKCG+GIVADEDIK+GEFVIEYVGEVIDD+TCE+RLWKMKHLGETNFYLCEINRDM
Sbjct: 203 MVETEKCGSGIVADEDIKQGEFVIEYVGEVIDDKTCEDRLWKMKHLGETNFYLCEINRDM 262

Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           VIDATYKGNKSRYINHSC PNTEMQKW IDGETRIGIFATRDIK+GE+LTYDYQ+
Sbjct: 263 VIDATYKGNKSRYINHSCDPNTEMQKWRIDGETRIGIFATRDIKRGEHLTYDYQF 317


>gi|359489946|ref|XP_002268035.2| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Vitis
           vinifera]
          Length = 377

 Score =  368 bits (945), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 186/228 (81%), Positives = 207/228 (90%)

Query: 5   KKNSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDD 64
           K+N +   +G AF+KLLK++GNPV+FELP  F K K   Y FI+RNIYLTK+IKRRLEDD
Sbjct: 13  KQNIEQKGVGGAFSKLLKELGNPVDFELPSSFNKWKPTSYTFIRRNIYLTKKIKRRLEDD 72

Query: 65  GIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKC 124
           GIFCSC++  GSSGVC RDC CGML SSCSSGCKCG SCLNKPFQ+RPVKKMK+V+TEKC
Sbjct: 73  GIFCSCSSGSGSSGVCGRDCLCGMLQSSCSSGCKCGTSCLNKPFQSRPVKKMKMVETEKC 132

Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
           G+GIVADEDIK+GEFVIEYVGEVIDD+TCE+RLWKMKHLGETNFYLCEINRDMVIDATYK
Sbjct: 133 GSGIVADEDIKQGEFVIEYVGEVIDDKTCEDRLWKMKHLGETNFYLCEINRDMVIDATYK 192

Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           GNKSRYINHSC PNTEMQKW IDGETRIGIFATRDIK+GE+LTYDYQ+
Sbjct: 193 GNKSRYINHSCDPNTEMQKWRIDGETRIGIFATRDIKRGEHLTYDYQF 240


>gi|357498513|ref|XP_003619545.1| Histone-lysine N-methyltransferase ASHH3 [Medicago truncatula]
 gi|355494560|gb|AES75763.1| Histone-lysine N-methyltransferase ASHH3 [Medicago truncatula]
          Length = 348

 Score =  355 bits (910), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 175/232 (75%), Positives = 195/232 (84%), Gaps = 3/232 (1%)

Query: 1   MPAAKKNSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRR 60
           MP  KKNS+ +R    F KL+  +G  V+FELPD  IK +   Y  IK ++Y+TK  KR+
Sbjct: 1   MPIMKKNSEQNRFEAVFTKLMNNLGETVDFELPD-SIKNRTTQYTHIK-HVYVTKNRKRK 58

Query: 61  LEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQ 120
             DDGIFCSC +S  S+ VC RDCHCGMLLSSCSSGCKCG SCLNK FQ+RPVKKMKLV+
Sbjct: 59  -SDDGIFCSCESSSDSTAVCGRDCHCGMLLSSCSSGCKCGISCLNKAFQHRPVKKMKLVK 117

Query: 121 TEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVID 180
           TEKCG+GIVADEDIK GEFVIEYVGEVIDD+TCE+RLW MK  GETNFYLCEINRDMVID
Sbjct: 118 TEKCGSGIVADEDIKLGEFVIEYVGEVIDDKTCEQRLWNMKDRGETNFYLCEINRDMVID 177

Query: 181 ATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           AT KGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA+RDIKKGE+LTYDYQ+
Sbjct: 178 ATNKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFASRDIKKGEHLTYDYQF 229


>gi|297824409|ref|XP_002880087.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297325926|gb|EFH56346.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 363

 Score =  337 bits (863), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 179/234 (76%), Positives = 200/234 (85%), Gaps = 2/234 (0%)

Query: 1   MPAAKKNSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRR 60
           MPA KK SD + +G  FNKLL QIG   EFELPDW  K K  PY+FI+RNIYLTK++KRR
Sbjct: 1   MPATKKISDRNHLGQVFNKLLNQIGESEEFELPDWLNKGKPTPYIFIRRNIYLTKKVKRR 60

Query: 61  LEDDGIFCSCTASPGSS--GVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKL 118
           +EDDGIFCSC++S   S   VC  +CHCGML SSCSS CKCG+ C NKPFQ R VKKMKL
Sbjct: 61  VEDDGIFCSCSSSSPGSSSTVCGSNCHCGMLFSSCSSSCKCGSECNNKPFQQRHVKKMKL 120

Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
           +QTEKCG+GIVA+E+IK GEF+IEYVGEVIDD+TCEERLWKMKH GETNFYLCEI RDMV
Sbjct: 121 IQTEKCGSGIVAEEEIKPGEFIIEYVGEVIDDKTCEERLWKMKHRGETNFYLCEITRDMV 180

Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           IDAT+KGNKSRYINHSC PNT+MQKWIIDGETRIGIFATR IKKGE+LTYDYQ+
Sbjct: 181 IDATHKGNKSRYINHSCNPNTQMQKWIIDGETRIGIFATRGIKKGEHLTYDYQF 234


>gi|18406465|ref|NP_566010.1| histone-lysine N-methyltransferase ASHH3 [Arabidopsis thaliana]
 gi|94707125|sp|Q945S8.2|ASHH3_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHH3; AltName:
           Full=ASH1 homolog 3; AltName: Full=Protein SET DOMAIN
           GROUP 7
 gi|15028059|gb|AAK76560.1| unknown protein [Arabidopsis thaliana]
 gi|20197070|gb|AAC23419.2| expressed protein [Arabidopsis thaliana]
 gi|20259301|gb|AAM14386.1| unknown protein [Arabidopsis thaliana]
 gi|330255289|gb|AEC10383.1| histone-lysine N-methyltransferase ASHH3 [Arabidopsis thaliana]
          Length = 363

 Score =  333 bits (853), Expect = 5e-89,   Method: Compositional matrix adjust.
 Identities = 176/234 (75%), Positives = 201/234 (85%), Gaps = 2/234 (0%)

Query: 1   MPAAKKNSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRR 60
           MPA+KK SD + +G  F+KLL QIG   EFELP+W  K K  PY+FI+RNIYLTK++KRR
Sbjct: 1   MPASKKISDRNHLGQVFDKLLNQIGESEEFELPEWLNKGKPTPYIFIRRNIYLTKKVKRR 60

Query: 61  LEDDGIFCSCTASPGSS--GVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKL 118
           +EDDGIFCSC++S   S   VC  +CHCGML SSCSS CKCG+ C NKPFQ R VKKMKL
Sbjct: 61  VEDDGIFCSCSSSSPGSSSTVCGSNCHCGMLFSSCSSSCKCGSECNNKPFQQRHVKKMKL 120

Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
           +QTEKCG+GIVA+E+I+ GEF+IEYVGEVIDD+TCEERLWKMKH GETNFYLCEI RDMV
Sbjct: 121 IQTEKCGSGIVAEEEIEAGEFIIEYVGEVIDDKTCEERLWKMKHRGETNFYLCEITRDMV 180

Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           IDAT+KGNKSRYINHSC PNT+MQKWIIDGETRIGIFATR IKKGE+LTYDYQ+
Sbjct: 181 IDATHKGNKSRYINHSCNPNTQMQKWIIDGETRIGIFATRGIKKGEHLTYDYQF 234


>gi|218201888|gb|EEC84315.1| hypothetical protein OsI_30811 [Oryza sativa Indica Group]
          Length = 360

 Score =  304 bits (779), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 145/218 (66%), Positives = 178/218 (81%), Gaps = 4/218 (1%)

Query: 15  HAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASP 74
           + F++L+K I  PV+F+LP    + K+  YV IKRN Y+T++   R+EDDGIFCSCT S 
Sbjct: 13  NVFDQLIKAIKAPVDFDLPPVLKEWKSNYYVPIKRNAYMTRK---RVEDDGIFCSCTPS- 68

Query: 75  GSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDI 134
           GSS  CD+DC CGML S CSS CKC N C NKPFQ+R ++K KL++TEKCG G+VA+EDI
Sbjct: 69  GSSATCDKDCQCGMLFSCCSSTCKCENKCANKPFQHRTLRKTKLIKTEKCGNGVVAEEDI 128

Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
           K+GEFVIEYVGEVIDD+TCE+RLWKMK  G+TNFYLCE++ +MVIDAT KGN SR+INHS
Sbjct: 129 KKGEFVIEYVGEVIDDRTCEQRLWKMKRQGDTNFYLCEVSSNMVIDATNKGNMSRFINHS 188

Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           C PNTEMQKW ++GETR+GIFA RDIK GE LTYDY++
Sbjct: 189 CEPNTEMQKWTVEGETRVGIFALRDIKTGEELTYDYKF 226


>gi|222641285|gb|EEE69417.1| hypothetical protein OsJ_28789 [Oryza sativa Japonica Group]
          Length = 360

 Score =  304 bits (778), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 145/218 (66%), Positives = 178/218 (81%), Gaps = 4/218 (1%)

Query: 15  HAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASP 74
           + F++L+K I  PV+F+LP    + K+  YV IKRN Y+T++   R+EDDGIFCSCT S 
Sbjct: 13  NVFDQLIKAIKAPVDFDLPPVLKEWKSNYYVPIKRNAYITRK---RVEDDGIFCSCTPS- 68

Query: 75  GSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDI 134
           GSS  CD+DC CGML S CSS CKC N C NKPFQ+R ++K KL++TEKCG G+VA+EDI
Sbjct: 69  GSSATCDKDCQCGMLFSCCSSTCKCENKCANKPFQHRTLRKTKLIKTEKCGNGVVAEEDI 128

Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
           K+GEFVIEYVGEVIDD+TCE+RLWKMK  G+TNFYLCE++ +MVIDAT KGN SR+INHS
Sbjct: 129 KKGEFVIEYVGEVIDDRTCEQRLWKMKRQGDTNFYLCEVSSNMVIDATNKGNMSRFINHS 188

Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           C PNTEMQKW ++GETR+GIFA RDIK GE LTYDY++
Sbjct: 189 CEPNTEMQKWTVEGETRVGIFALRDIKTGEELTYDYKF 226


>gi|115478464|ref|NP_001062827.1| Os09g0307800 [Oryza sativa Japonica Group]
 gi|51091678|dbj|BAD36461.1| putative SET domain protein 110 [Oryza sativa Japonica Group]
 gi|51091893|dbj|BAD36704.1| putative SET domain protein 110 [Oryza sativa Japonica Group]
 gi|113631060|dbj|BAF24741.1| Os09g0307800 [Oryza sativa Japonica Group]
          Length = 340

 Score =  304 bits (778), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 145/218 (66%), Positives = 178/218 (81%), Gaps = 4/218 (1%)

Query: 15  HAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASP 74
           + F++L+K I  PV+F+LP    + K+  YV IKRN Y+T++   R+EDDGIFCSCT S 
Sbjct: 13  NVFDQLIKAIKAPVDFDLPPVLKEWKSNYYVPIKRNAYITRK---RVEDDGIFCSCTPS- 68

Query: 75  GSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDI 134
           GSS  CD+DC CGML S CSS CKC N C NKPFQ+R ++K KL++TEKCG G+VA+EDI
Sbjct: 69  GSSATCDKDCQCGMLFSCCSSTCKCENKCANKPFQHRTLRKTKLIKTEKCGNGVVAEEDI 128

Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
           K+GEFVIEYVGEVIDD+TCE+RLWKMK  G+TNFYLCE++ +MVIDAT KGN SR+INHS
Sbjct: 129 KKGEFVIEYVGEVIDDRTCEQRLWKMKRQGDTNFYLCEVSSNMVIDATNKGNMSRFINHS 188

Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           C PNTEMQKW ++GETR+GIFA RDIK GE LTYDY++
Sbjct: 189 CEPNTEMQKWTVEGETRVGIFALRDIKTGEELTYDYKF 226


>gi|326496078|dbj|BAJ90660.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 362

 Score =  294 bits (752), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 141/221 (63%), Positives = 173/221 (78%), Gaps = 3/221 (1%)

Query: 12  RIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCT 71
           R+     K LK   + V+F+LP    + K   Y+ IKRN+YLTK+   R+EDDGIFCSC+
Sbjct: 29  RVFDELTKALKDPADFVDFDLPSALKEWKLGYYIPIKRNVYLTKK---RVEDDGIFCSCS 85

Query: 72  ASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVAD 131
            S GS   C +DC CGML S CSS CKCGN+C NK FQ RP+ K KL++TEKCG G++A+
Sbjct: 86  LSSGSPVTCGKDCQCGMLFSCCSSNCKCGNNCANKSFQLRPLFKTKLIKTEKCGFGLIAE 145

Query: 132 EDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYI 191
           ++IK+GEFVIEYVGEVIDD+TCEERLWKMK    TNFYLCE++ +MVIDAT KGNKSR+I
Sbjct: 146 DEIKKGEFVIEYVGEVIDDRTCEERLWKMKRQRYTNFYLCEVSSNMVIDATNKGNKSRFI 205

Query: 192 NHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           NHSC PNTEMQKW +DGETR+GIFA RDI++GE LTYDY++
Sbjct: 206 NHSCEPNTEMQKWTVDGETRVGIFALRDIERGEELTYDYKF 246


>gi|357157974|ref|XP_003577976.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like
           [Brachypodium distachyon]
          Length = 338

 Score =  293 bits (751), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 144/220 (65%), Positives = 173/220 (78%), Gaps = 6/220 (2%)

Query: 16  AFNKLLKQIGNP---VEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTA 72
            FNKL +++ +P   V+F+LP    + K   Y+ IKRN+YLTK+   R+EDDGIFCSC  
Sbjct: 6   VFNKLTRKLQDPADFVDFDLPPALKEWKLGYYIPIKRNVYLTKK---RVEDDGIFCSCPF 62

Query: 73  SPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADE 132
             GSS  C +DC CGML S CSS CKC N C NK FQ RP+KK KL++TEKCG G+VAD+
Sbjct: 63  ISGSSVACGKDCQCGMLFSCCSSNCKCENRCANKSFQLRPLKKTKLIKTEKCGFGLVADD 122

Query: 133 DIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYIN 192
            I++GEF+IEYVGEVIDD+TCEERLWKMK    TNFYLCE++ +MVIDAT KGNKSR+IN
Sbjct: 123 GIQKGEFIIEYVGEVIDDRTCEERLWKMKRQRYTNFYLCEVSSNMVIDATNKGNKSRFIN 182

Query: 193 HSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           HSC PNTEMQKW +DGETR+GIFA RDIKKGE LTYDY++
Sbjct: 183 HSCQPNTEMQKWTVDGETRVGIFALRDIKKGEELTYDYKF 222


>gi|357116306|ref|XP_003559923.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like
           [Brachypodium distachyon]
          Length = 349

 Score =  293 bits (750), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 143/220 (65%), Positives = 172/220 (78%), Gaps = 6/220 (2%)

Query: 16  AFNKLLKQIGNP---VEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTA 72
            FNKL K++ +P   V+F+LP    + K   Y+ IKRN+YL   +K+R+EDDGIFC+C  
Sbjct: 17  VFNKLTKKLQDPADFVDFDLPSALKEWKLGYYIPIKRNVYL---MKKRVEDDGIFCACPF 73

Query: 73  SPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADE 132
             GSS  C +DC CGML S CSS CKC N C NK FQ RP+KK KL++TEKCG G+VADE
Sbjct: 74  ISGSSVACGKDCQCGMLFSCCSSNCKCENRCANKSFQLRPMKKTKLIKTEKCGFGLVADE 133

Query: 133 DIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYIN 192
            I++GEF+IEYVGEVIDD+TCEERLWKMK    TNFYLCE++ +MVIDAT KGNKSR+IN
Sbjct: 134 GIQQGEFIIEYVGEVIDDRTCEERLWKMKRQRYTNFYLCEVSSNMVIDATNKGNKSRFIN 193

Query: 193 HSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           HSC PNTEMQKW +DGETR+GIFA  DIKKGE LTYDY++
Sbjct: 194 HSCQPNTEMQKWTVDGETRVGIFALHDIKKGEELTYDYKF 233


>gi|15232214|ref|NP_191555.1| putative histone-lysine N-methyltransferase ASHH4 [Arabidopsis
           thaliana]
 gi|75264575|sp|Q9M1X9.1|ASHH4_ARATH RecName: Full=Putative histone-lysine N-methyltransferase ASHH4;
           AltName: Full=ASH1 homolog 4; AltName: Full=Protein SET
           DOMAIN GROUP 24
 gi|7019690|emb|CAB75815.1| putative protein [Arabidopsis thaliana]
 gi|332646470|gb|AEE79991.1| putative histone-lysine N-methyltransferase ASHH4 [Arabidopsis
           thaliana]
          Length = 352

 Score =  290 bits (743), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 162/245 (66%), Positives = 189/245 (77%), Gaps = 7/245 (2%)

Query: 1   MPAAKKNSDNSRIGHAFNKLLKQIGNPVEFELPDWF--IKPKAIPYVFIKRNIYLTKRIK 58
           M ++KK SD ++I  +  KL KQIG   + E PD    +KP     +FIKRNIYL K++K
Sbjct: 1   MSSSKKGSDRNQIRKSLRKLKKQIGELEKLESPDRLNNVKP-----IFIKRNIYLKKKLK 55

Query: 59  RRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKL 118
           ++++D GIFCSC+  PGSS +C  DC+CG+LLSSCSS CKC + C NKPFQ R +KKMKL
Sbjct: 56  KKVKDHGIFCSCSLDPGSSTLCGSDCNCGILLSSCSSSCKCSSECTNKPFQQRHIKKMKL 115

Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
           VQTEKCG GIVADEDI  GEF+IEYVGEVIDD+ CEERLWK+ H  ETNFYLC+IN +MV
Sbjct: 116 VQTEKCGYGIVADEDINSGEFIIEYVGEVIDDKICEERLWKLNHKVETNFYLCQINWNMV 175

Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
           IDAT+KGNKSRYINHSC PNTEMQKWIIDGETRIGIFATR I KGE LTYDYQ+      
Sbjct: 176 IDATHKGNKSRYINHSCSPNTEMQKWIIDGETRIGIFATRFINKGEQLTYDYQFVQFGAD 235

Query: 239 LIAYC 243
              YC
Sbjct: 236 QDCYC 240


>gi|224095256|ref|XP_002310367.1| SET domain protein [Populus trichocarpa]
 gi|222853270|gb|EEE90817.1| SET domain protein [Populus trichocarpa]
          Length = 281

 Score =  283 bits (723), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 133/145 (91%), Positives = 141/145 (97%)

Query: 88  MLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEV 147
           MLLSSCSSGCKCG+SC NKPFQ+R VKKMKLVQTEKCG+GIVADEDIK+GEFVIEYVGEV
Sbjct: 1   MLLSSCSSGCKCGSSCSNKPFQHRQVKKMKLVQTEKCGSGIVADEDIKQGEFVIEYVGEV 60

Query: 148 IDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIID 207
           IDD+TCEERLWKMKH GETNFYLCEINRDMVIDATYKGNKSRYINHSC PNTEMQKWIID
Sbjct: 61  IDDKTCEERLWKMKHCGETNFYLCEINRDMVIDATYKGNKSRYINHSCSPNTEMQKWIID 120

Query: 208 GETRIGIFATRDIKKGENLTYDYQY 232
           GETRIGIFATRDI+KGE+LTYDYQ+
Sbjct: 121 GETRIGIFATRDIRKGEHLTYDYQF 145


>gi|242048842|ref|XP_002462165.1| hypothetical protein SORBIDRAFT_02g020844 [Sorghum bicolor]
 gi|241925542|gb|EER98686.1| hypothetical protein SORBIDRAFT_02g020844 [Sorghum bicolor]
          Length = 341

 Score =  280 bits (716), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 139/231 (60%), Positives = 175/231 (75%), Gaps = 7/231 (3%)

Query: 15  HAFNKLLKQIGNPVEFELPDWFIKPKAI--PYVFIKRNIYLTKRIKRRLEDDGIFCSCTA 72
           + F +L+ +I  P +F+ P  F+K   +   Y  IKRN+Y TKR   R+ED GI C C  
Sbjct: 21  NVFEQLISKI-EPADFD-PRPFLKQLNVLGRYEPIKRNVYCTKR---RVEDYGISCRCKP 75

Query: 73  SPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADE 132
           SPGSS VC RDCHCGML S CSS C+C N+C NK FQ+RP+KK KL++TEKCG G+VA++
Sbjct: 76  SPGSSVVCGRDCHCGMLFSCCSSQCECDNACTNKSFQHRPLKKTKLIKTEKCGHGLVAED 135

Query: 133 DIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYIN 192
           +IK+GEFVIEYVGEVIDD+ CE RLW MK L +T+FYLCE++ +MVIDAT KGN SR+IN
Sbjct: 136 EIKKGEFVIEYVGEVIDDRACENRLWTMKRLNDTDFYLCEVSSNMVIDATNKGNLSRFIN 195

Query: 193 HSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
           HSC PNT+MQKW +DGETR+GIFA RDIK GE LTYDY++     + + +C
Sbjct: 196 HSCEPNTKMQKWTVDGETRVGIFALRDIKIGEELTYDYKFVQFGAAQVCHC 246


>gi|297817294|ref|XP_002876530.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322368|gb|EFH52789.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 354

 Score =  279 bits (713), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 160/244 (65%), Positives = 186/244 (76%), Gaps = 3/244 (1%)

Query: 1   MPAAKKNSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRR 60
           M ++KK SD ++I  +  KL KQIG   + E P   +  K  P +FIKRNIYL K+ K+R
Sbjct: 1   MSSSKKGSDRNQIRKSLRKLKKQIGELEKLESPPEPLN-KGKP-IFIKRNIYLKKKFKKR 58

Query: 61  LEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQ 120
           ++D GIFCSC+  PGSS +C  DC+CG+LLSSCSS CKC + C NKPFQ R +KKMKLVQ
Sbjct: 59  VKDHGIFCSCSLDPGSSTICGSDCNCGILLSSCSSSCKCSSECTNKPFQQRHIKKMKLVQ 118

Query: 121 TEKCGAGIVADEDIKRGEFVIEYVGE-VIDDQTCEERLWKMKHLGETNFYLCEINRDMVI 179
           TEKCG GIVADEDI  GEF+IEYVGE VID++ CEERLWK+ H  E NFYLC+IN +MVI
Sbjct: 119 TEKCGYGIVADEDINSGEFIIEYVGEVVIDEKICEERLWKLNHKVEKNFYLCQINWNMVI 178

Query: 180 DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSL 239
           DAT+KGNKSRYINHSC PNTEMQKWIIDGETRIGIFATR I KGE LTYDYQ+       
Sbjct: 179 DATHKGNKSRYINHSCNPNTEMQKWIIDGETRIGIFATRFINKGEQLTYDYQFVQFGADQ 238

Query: 240 IAYC 243
             YC
Sbjct: 239 DCYC 242


>gi|226493201|ref|NP_001149253.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4
           lysine-20specific [Zea mays]
 gi|194704072|gb|ACF86120.1| unknown [Zea mays]
 gi|195625808|gb|ACG34734.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4
           lysine-20specific [Zea mays]
 gi|238014446|gb|ACR38258.1| unknown [Zea mays]
 gi|414589294|tpg|DAA39865.1| TPA: putative histone-lysine N-methyltransferase family protein
           isoform 1 [Zea mays]
 gi|414589295|tpg|DAA39866.1| TPA: putative histone-lysine N-methyltransferase family protein
           isoform 2 [Zea mays]
          Length = 339

 Score =  277 bits (709), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 136/233 (58%), Positives = 177/233 (75%), Gaps = 5/233 (2%)

Query: 12  RIGHAFNKLLKQIGNPVEFELPDWFIKPKAIP-YVFIKRNIYLTKRIKRRLEDDGIFCSC 70
           +I + F++L+ +I +  +F+   +  +   +  Y  IKRN+Y TKR   R+ED GI C C
Sbjct: 17  QIDNVFDQLINKIEH-ADFDTRPFLKQLNVLGRYEPIKRNVYCTKR---RIEDYGISCHC 72

Query: 71  TASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVA 130
             SPGSS VC RDCHCGML S CSS C+C ++C NK FQ+RP+KK KL++TEKCG G+VA
Sbjct: 73  KPSPGSSVVCGRDCHCGMLFSCCSSQCECDDTCTNKSFQHRPLKKTKLIKTEKCGHGLVA 132

Query: 131 DEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRY 190
           +++IK+GEFVIEYVGEVIDD+TCE RLW MK L +T+FYLCE++ +MVIDAT KGN+SR+
Sbjct: 133 EDEIKKGEFVIEYVGEVIDDRTCENRLWTMKRLLDTDFYLCEVSSNMVIDATNKGNRSRF 192

Query: 191 INHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
           INHSC PNT MQKW +DGETR+GIFA RDIK GE LTYDY++     + + +C
Sbjct: 193 INHSCEPNTAMQKWTVDGETRVGIFALRDIKIGEELTYDYKFVQFGAAQVCHC 245


>gi|414589296|tpg|DAA39867.1| TPA: putative histone-lysine N-methyltransferase family protein
           [Zea mays]
          Length = 343

 Score =  275 bits (703), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 138/235 (58%), Positives = 175/235 (74%), Gaps = 7/235 (2%)

Query: 12  RIGHAFNKLLKQIGNPVEFELPDWFIKPKAIP-YVFIKRNIYLTKRIKRRLEDDGIFCSC 70
           +I + F++L+ +I +  +F+   +  +   +  Y  IKRN+Y TKR   R+ED GI C C
Sbjct: 17  QIDNVFDQLINKIEH-ADFDTRPFLKQLNVLGRYEPIKRNVYCTKR---RIEDYGISCHC 72

Query: 71  TASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVA 130
             SPGSS VC RDCHCGML S CSS C+C ++C NK FQ+RP+KK KL++TEKCG G+VA
Sbjct: 73  KPSPGSSVVCGRDCHCGMLFSCCSSQCECDDTCTNKSFQHRPLKKTKLIKTEKCGHGLVA 132

Query: 131 DEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRY 190
           +++IK+GEFVIEYVGEVIDD+TCE RLW MK L +T+FYLCE++ +MVIDAT KGN+SR+
Sbjct: 133 EDEIKKGEFVIEYVGEVIDDRTCENRLWTMKRLLDTDFYLCEVSSNMVIDATNKGNRSRF 192

Query: 191 INHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQ--YEFLHDSLIAYC 243
           INHSC PNT MQKW +DGETR+GIFA RDIK GE LTYDY   Y F+       C
Sbjct: 193 INHSCEPNTAMQKWTVDGETRVGIFALRDIKIGEELTYDYNIMYRFVQFGAAQVC 247


>gi|162460550|ref|NP_001105653.1| LOC542662 [Zea mays]
 gi|24021802|gb|AAN41254.1| SET domain protein 110 [Zea mays]
 gi|195652527|gb|ACG45731.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4
           lysine-20specific [Zea mays]
          Length = 342

 Score =  267 bits (683), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 134/234 (57%), Positives = 170/234 (72%), Gaps = 5/234 (2%)

Query: 12  RIGHAFNKLLKQIGNPVEFELPDWFIKPKAI--PYVFIKRNIYLTKRIKRRLEDDGIFCS 69
           +I + F++L+ +I        P  F+K   +   Y  IKRN+Y TKR    +ED GI C 
Sbjct: 18  QIENVFDQLISKIEQADPDFDPRPFLKQLNVLGRYEPIKRNVYFTKRY---IEDYGISCH 74

Query: 70  CTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIV 129
           C  SPGSS VC RDC+C ML S CSS C+C  +C NK FQ+RP+ K KL++TEKCG G+V
Sbjct: 75  CKPSPGSSVVCGRDCYCSMLFSCCSSQCECDIACTNKSFQHRPLTKTKLIKTEKCGHGLV 134

Query: 130 ADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSR 189
           A+++IK+GEFVIEYVGEVIDD+TCE RLW MK L +T+FYLCE++ +MVIDAT KGN SR
Sbjct: 135 AEDEIKKGEFVIEYVGEVIDDRTCENRLWTMKRLDDTDFYLCEVSSNMVIDATNKGNLSR 194

Query: 190 YINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
           +INHSC PNT MQKW +DGETR+GIFA RDIK GE LTYDY++     + + +C
Sbjct: 195 FINHSCEPNTAMQKWTVDGETRVGIFALRDIKIGEELTYDYKFVQFGAAQVCHC 248


>gi|414884958|tpg|DAA60972.1| TPA: putative histone-lysine N-methyltransferase family protein
           isoform 1 [Zea mays]
 gi|414884959|tpg|DAA60973.1| TPA: putative histone-lysine N-methyltransferase family protein
           isoform 2 [Zea mays]
          Length = 337

 Score =  267 bits (683), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 134/234 (57%), Positives = 170/234 (72%), Gaps = 5/234 (2%)

Query: 12  RIGHAFNKLLKQIGNPVEFELPDWFIKPKAI--PYVFIKRNIYLTKRIKRRLEDDGIFCS 69
           +I + F++L+ +I        P  F+K   +   Y  IKRN+Y TKR    +ED GI C 
Sbjct: 18  QIENVFDQLISKIEQADPDFDPRPFLKQLNVLGRYEPIKRNVYFTKRY---IEDYGISCH 74

Query: 70  CTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIV 129
           C  SPGSS VC RDC+C ML S CSS C+C  +C NK FQ+RP+ K KL++TEKCG G+V
Sbjct: 75  CKPSPGSSVVCGRDCYCSMLFSCCSSQCECDIACTNKSFQHRPLTKTKLIKTEKCGHGLV 134

Query: 130 ADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSR 189
           A+++IK+GEFVIEYVGEVIDD+TCE RLW MK L +T+FYLCE++ +MVIDAT KGN SR
Sbjct: 135 AEDEIKKGEFVIEYVGEVIDDRTCENRLWTMKRLDDTDFYLCEVSSNMVIDATNKGNLSR 194

Query: 190 YINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
           +INHSC PNT MQKW +DGETR+GIFA RDIK GE LTYDY++     + + +C
Sbjct: 195 FINHSCEPNTAMQKWTVDGETRVGIFALRDIKIGEELTYDYKFVQFGAAQVCHC 248


>gi|302825340|ref|XP_002994293.1| hypothetical protein SELMODRAFT_432224 [Selaginella moellendorffii]
 gi|300137824|gb|EFJ04637.1| hypothetical protein SELMODRAFT_432224 [Selaginella moellendorffii]
          Length = 820

 Score =  245 bits (625), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 113/209 (54%), Positives = 150/209 (71%), Gaps = 7/209 (3%)

Query: 27  PVEFELPDWFIK---PKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRD 83
           P +F LP  +IK    ++ PY+ I+RN+YLTK+  +  +DDG+ CSC  S     +C  D
Sbjct: 90  PCDFRLPSSWIKRLDQESPPYIHIRRNVYLTKKTPKVSKDDGMQCSCKGSK----ICSSD 145

Query: 84  CHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEY 143
           C C +L +SCSS C C   C N PFQ R  ++++L +TE CG G+ ADE+I+RG+F+IEY
Sbjct: 146 CICRLLYTSCSSSCACEGKCENLPFQKRDGRRLRLKETENCGWGLFADENIERGDFLIEY 205

Query: 144 VGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQK 203
           +GEVIDD+TCEERLW +K  GE NFYLCE+  D VIDAT+KGN SR+INHSC PN +++K
Sbjct: 206 IGEVIDDKTCEERLWDLKERGENNFYLCEVGHDKVIDATFKGNMSRFINHSCNPNAQLRK 265

Query: 204 WIIDGETRIGIFATRDIKKGENLTYDYQY 232
           W  DGE RIG+FA   I KG+ +TYDY+Y
Sbjct: 266 WQCDGELRIGVFAVSRILKGQEITYDYKY 294


>gi|302795285|ref|XP_002979406.1| hypothetical protein SELMODRAFT_110353 [Selaginella moellendorffii]
 gi|300153174|gb|EFJ19814.1| hypothetical protein SELMODRAFT_110353 [Selaginella moellendorffii]
          Length = 274

 Score =  238 bits (606), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 112/207 (54%), Positives = 149/207 (71%), Gaps = 7/207 (3%)

Query: 29  EFELPDWFIK---PKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCH 85
           +F LP  +IK    ++ PY+ I+RN+YLTK+  +  +DDG+ CSC  S     +C  DC 
Sbjct: 1   DFRLPSSWIKRLDQESPPYIHIRRNVYLTKKTPKVSKDDGMQCSCKGSK----ICSSDCI 56

Query: 86  CGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVG 145
           C +L +SCSS C C   C N PFQ R  ++++L +TE CG G+ ADE+I+RG+F+IEY+G
Sbjct: 57  CRLLYTSCSSSCACEGKCENLPFQKRDGRRLRLKETENCGWGLFADENIERGDFLIEYIG 116

Query: 146 EVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWI 205
           EVIDD+TCEERLW +K  GE NFYLCE+  D VIDAT+KGN SR+INHSC PN +++KW 
Sbjct: 117 EVIDDKTCEERLWDLKERGENNFYLCEVGHDKVIDATFKGNMSRFINHSCDPNAQLRKWQ 176

Query: 206 IDGETRIGIFATRDIKKGENLTYDYQY 232
            DGE RIG+FA   I KG+ +TYDY+Y
Sbjct: 177 CDGELRIGVFAVSRILKGQEITYDYKY 203


>gi|168031756|ref|XP_001768386.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680311|gb|EDQ66748.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 615

 Score =  223 bits (568), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 103/148 (69%), Positives = 122/148 (82%)

Query: 85  HCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYV 144
           H  ML  +CSS C CG+SC N PFQ  P  KMK V+TE+CG G+VADEDIK G F++EYV
Sbjct: 295 HVRMLFMTCSSNCGCGDSCTNLPFQKLPGSKMKAVKTERCGWGLVADEDIKAGSFLVEYV 354

Query: 145 GEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKW 204
           GEVIDDQTCEERLW MK  GE NFY+CEI+R+MVIDAT+KGN SR+INHSC PN+E+QKW
Sbjct: 355 GEVIDDQTCEERLWAMKKQGEMNFYMCEISREMVIDATFKGNLSRFINHSCQPNSELQKW 414

Query: 205 IIDGETRIGIFATRDIKKGENLTYDYQY 232
            IDGETRIG+FA  DIK+G+ +TYDYQ+
Sbjct: 415 DIDGETRIGVFAITDIKRGDFVTYDYQF 442


>gi|225445763|ref|XP_002272781.1| PREDICTED: histone-lysine N-methyltransferase ASHR3 [Vitis
           vinifera]
 gi|297743704|emb|CBI36587.3| unnamed protein product [Vitis vinifera]
          Length = 494

 Score =  211 bits (538), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 105/206 (50%), Positives = 138/206 (66%), Gaps = 8/206 (3%)

Query: 38  KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGC 97
           K +  PYV I+RN+YL K+ KR    DGI C+       S VC  +C C +   SCS  C
Sbjct: 260 KMEPTPYVHIRRNVYLVKK-KRDDAADGIGCT-----NCSSVCSENCVCRVQCISCSKSC 313

Query: 98  KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
            C  +C N+PF  R  KK+K+V+TE CG G+ A E I +G+FVIEY+GEVIDD  CE RL
Sbjct: 314 HCSENCTNRPF--RKEKKIKIVKTELCGWGVDAAESINKGDFVIEYIGEVIDDALCERRL 371

Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
           W MK  G+ NFY+CEI +D  IDAT+KGN SR++NHSC PN +++KW ++GETR+G+FA 
Sbjct: 372 WDMKDRGDQNFYMCEIRKDFTIDATFKGNASRFLNHSCDPNCKLEKWQVEGETRVGVFAA 431

Query: 218 RDIKKGENLTYDYQYEFLHDSLIAYC 243
           R IK GE LTYDY++      +  +C
Sbjct: 432 RSIKAGEPLTYDYRFVRFGPEVKCHC 457


>gi|255576669|ref|XP_002529224.1| set domain protein, putative [Ricinus communis]
 gi|223531342|gb|EEF33180.1| set domain protein, putative [Ricinus communis]
          Length = 156

 Score =  211 bits (536), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 104/123 (84%), Positives = 109/123 (88%), Gaps = 2/123 (1%)

Query: 1   MPAAKKN--SDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIK 58
           MP A KN   D++RI  AFNKLLKQIGNPVEFELP+WF K K  PY FIKRNIYLTKRIK
Sbjct: 6   MPVATKNFDCDHNRINIAFNKLLKQIGNPVEFELPEWFNKCKPTPYTFIKRNIYLTKRIK 65

Query: 59  RRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKL 118
           RRLEDDGIFCSC  SPGSSGVCDRDCHCGMLLSSCSSGCKCG+SCLNKPFQ+RPVKKMKL
Sbjct: 66  RRLEDDGIFCSCGPSPGSSGVCDRDCHCGMLLSSCSSGCKCGDSCLNKPFQHRPVKKMKL 125

Query: 119 VQT 121
           VQT
Sbjct: 126 VQT 128


>gi|242065740|ref|XP_002454159.1| hypothetical protein SORBIDRAFT_04g025725 [Sorghum bicolor]
 gi|241933990|gb|EES07135.1| hypothetical protein SORBIDRAFT_04g025725 [Sorghum bicolor]
          Length = 328

 Score =  210 bits (534), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 100/204 (49%), Positives = 142/204 (69%), Gaps = 6/204 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKC 99
           K  PY  I+RN+YL K+ +  +  D    +C A       C  DC    + +SCS  C+C
Sbjct: 30  KPPPYTSIRRNVYLIKKKRTSVRVDIGCTNCRAD----STCKEDCEFRGISTSCSKNCRC 85

Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
            + C N+PF  R  KK+K+V+T++CG G VA E ++RG+FVIEYVGEVIDD TCE+RLW 
Sbjct: 86  SDLCTNRPF--RKDKKIKIVKTKRCGWGAVALEPLERGDFVIEYVGEVIDDATCEQRLWD 143

Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
           +++ G+ NFY+CEI++D  IDAT+KGN SR++NHSC PN +++KW +DGETR+G+FA+R 
Sbjct: 144 IRYRGDKNFYMCEISKDFTIDATFKGNVSRFLNHSCEPNCKLEKWQVDGETRVGVFASRS 203

Query: 220 IKKGENLTYDYQYEFLHDSLIAYC 243
           IK GE LTYDY++    + +  +C
Sbjct: 204 IKVGEPLTYDYRFVHFGEKVKCHC 227


>gi|357142828|ref|XP_003572707.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           ASHR3-like [Brachypodium distachyon]
          Length = 466

 Score =  207 bits (528), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 101/204 (49%), Positives = 140/204 (68%), Gaps = 6/204 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKC 99
           K   Y+ I+RN+YL   +K++  D      CT     S VC  DC C  L  SCS  C C
Sbjct: 162 KPPSYIHIRRNVYL---VKKKRADSSAETGCTNCRADS-VCKDDCECRGLSMSCSKSCHC 217

Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
            + C NKPF  R  KK+K+V++E CG G +A E +++ +F+IEYVGEVIDD TCE+RLW+
Sbjct: 218 SDLCTNKPF--RKDKKIKIVKSEGCGWGAIALEPLEKCDFIIEYVGEVIDDATCEQRLWE 275

Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
           MK  G+ NFY+CEI++D  IDAT+KGN SR++NHSC PN +++KW +DGETR+G+FA+R 
Sbjct: 276 MKRRGDKNFYMCEISKDYTIDATFKGNTSRFLNHSCDPNCKLEKWQVDGETRVGVFASRF 335

Query: 220 IKKGENLTYDYQYEFLHDSLIAYC 243
           I+ GE LTYDY++    + +  +C
Sbjct: 336 IEVGEPLTYDYRFVHFGEKVKCHC 359


>gi|18417683|ref|NP_567859.1| histone-lysine N-methyltransferase ASHR3 [Arabidopsis thaliana]
 gi|75164864|sp|Q949T8.1|ASHR3_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR3; AltName:
           Full=ASH1-related protein 3; AltName: Full=Protein SET
           DOMAIN GROUP 4; AltName: Full=Protein stamen loss
 gi|15292921|gb|AAK92831.1| unknown protein [Arabidopsis thaliana]
 gi|20465681|gb|AAM20309.1| unknown protein [Arabidopsis thaliana]
 gi|56201422|dbj|BAD72877.1| stamen loss [Arabidopsis thaliana]
 gi|332660421|gb|AEE85821.1| histone-lysine N-methyltransferase ASHR3 [Arabidopsis thaliana]
          Length = 497

 Score =  207 bits (527), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 102/205 (49%), Positives = 137/205 (66%), Gaps = 8/205 (3%)

Query: 28  VEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG 87
           ++    D  +K     YV I+RNIYL K+ KR   +DG+ C+    P     CDR C C 
Sbjct: 248 IDLAWKDSVVKEDPPSYVHIRRNIYLVKK-KRDNANDGVGCT-NCGPN----CDRSCVCR 301

Query: 88  MLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEV 147
           +   SCS GC C  SC N+PF  R  KK+K+V+TE CG G+ A E I + +F++EY+GEV
Sbjct: 302 VQCISCSKGCSCPESCGNRPF--RKEKKIKIVKTEHCGWGVEAAESINKEDFIVEYIGEV 359

Query: 148 IDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIID 207
           I D  CE+RLW MKH G  +FY+CEI +D  IDAT+KGN SR++NHSC PN  ++KW ++
Sbjct: 360 ISDAQCEQRLWDMKHKGMKDFYMCEIQKDFTIDATFKGNASRFLNHSCNPNCVLEKWQVE 419

Query: 208 GETRIGIFATRDIKKGENLTYDYQY 232
           GETR+G+FA R I+ GE LTYDY++
Sbjct: 420 GETRVGVFAARQIEAGEPLTYDYRF 444


>gi|326522849|dbj|BAJ88470.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 394

 Score =  205 bits (522), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 101/204 (49%), Positives = 139/204 (68%), Gaps = 6/204 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKC 99
           K   Y  I+RN+YL   +K++  D      CT    S  VC  DC C  L  SCS  C C
Sbjct: 94  KPPSYTHIRRNVYL---VKKKRADSSAETGCTNC-KSDSVCKDDCECRGLSMSCSKSCHC 149

Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
            + C NKPF  R  KK+K+V++E CG G VA E +++G+F+IEYVGEVI+D TCE+RLW 
Sbjct: 150 SDLCSNKPF--RKDKKIKIVKSEGCGWGAVALEPLEKGDFIIEYVGEVINDATCEQRLWD 207

Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
           MK  G+ NFY+CEI++D  IDAT+KGN SR++NHSC PN +++KW ++GETR+G+FA+R 
Sbjct: 208 MKRRGDKNFYMCEISKDFTIDATFKGNTSRFLNHSCDPNCKLEKWQVEGETRVGVFASRI 267

Query: 220 IKKGENLTYDYQYEFLHDSLIAYC 243
           I+ GE LTYDY++    + +  +C
Sbjct: 268 IEVGEPLTYDYRFVHFGEKVKCHC 291


>gi|297802948|ref|XP_002869358.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297315194|gb|EFH45617.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 497

 Score =  205 bits (522), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 98/205 (47%), Positives = 136/205 (66%), Gaps = 8/205 (3%)

Query: 28  VEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG 87
           ++    D  +K    PYV I+RNIY  K+ KR   +DG+ C+          C R C C 
Sbjct: 248 IDLTWKDSVVKDDLPPYVHIRRNIYFVKK-KRDNANDGVGCT-----NCGPTCCRSCVCR 301

Query: 88  MLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEV 147
           +   SCS GC+C  +C N+PF  R  KK+++V+TE CG G+ A E I + +F++EY+GEV
Sbjct: 302 VQCISCSKGCRCPETCGNRPF--RKEKKIRIVKTEHCGWGVEAAESINKEDFIVEYIGEV 359

Query: 148 IDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIID 207
           I D  CE+RLW MKH G  +FY+CEI +D  IDAT+KGN SR++NHSC PN  ++KW ++
Sbjct: 360 ISDAQCEQRLWDMKHKGMKDFYMCEIQKDFTIDATFKGNASRFLNHSCSPNCVLEKWQVE 419

Query: 208 GETRIGIFATRDIKKGENLTYDYQY 232
           GETR+G+FA R I+ GE LTYDY++
Sbjct: 420 GETRVGVFAARQIEAGEPLTYDYRF 444


>gi|449457959|ref|XP_004146715.1| PREDICTED: histone-lysine N-methyltransferase ASHR3-like [Cucumis
           sativus]
          Length = 502

 Score =  205 bits (521), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 99/190 (52%), Positives = 130/190 (68%), Gaps = 7/190 (3%)

Query: 43  PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNS 102
           PY  I+RNIYL K+ +     DG+ C+          C  DC C +   SCS  C C ++
Sbjct: 270 PYTHIRRNIYLVKKKRDTGVADGLGCT-----NCKTECSDDCVCRVQCISCSRACHCRDT 324

Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
           C N+PF  R  KK+K+V+TE CG G+ A E I +GEFVIEY+GEVIDD  CE+RLW MK+
Sbjct: 325 CTNRPF--RKEKKVKIVKTELCGWGVEAAESIGKGEFVIEYIGEVIDDALCEQRLWDMKY 382

Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
            G  NFY+CEI +D  IDAT+KGN SR++NHSC PN  ++KW ++GETR+G+FA R I+ 
Sbjct: 383 KGMKNFYMCEIRKDFTIDATFKGNASRFLNHSCDPNCSLEKWQVEGETRVGVFAARSIEV 442

Query: 223 GENLTYDYQY 232
           GE LTYDY++
Sbjct: 443 GEPLTYDYRF 452


>gi|449530608|ref|XP_004172286.1| PREDICTED: histone-lysine N-methyltransferase ASHR3-like, partial
           [Cucumis sativus]
          Length = 285

 Score =  204 bits (520), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 99/190 (52%), Positives = 131/190 (68%), Gaps = 7/190 (3%)

Query: 43  PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNS 102
           PY  I+RNIYL K+ +     DG+ C+   +      C  DC C +   SCS  C C ++
Sbjct: 53  PYTHIRRNIYLVKKKRDTGVADGLGCTNCKTE-----CSDDCVCRVQCISCSRACHCRDT 107

Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
           C N+PF  R  KK+K+V+TE CG G+ A E I +GEFVIEY+GEVIDD  CE+RLW MK+
Sbjct: 108 CTNRPF--RKEKKVKIVKTELCGWGVEAAESIGKGEFVIEYIGEVIDDALCEQRLWDMKY 165

Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
            G  NFY+CEI +D  IDAT+KGN SR++NHSC PN  ++KW ++GETR+G+FA R I+ 
Sbjct: 166 KGMKNFYMCEIRKDFTIDATFKGNASRFLNHSCDPNCSLEKWQVEGETRVGVFAARSIEV 225

Query: 223 GENLTYDYQY 232
           GE LTYDY++
Sbjct: 226 GEPLTYDYRF 235


>gi|326487338|dbj|BAJ89653.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 340

 Score =  204 bits (519), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 101/204 (49%), Positives = 138/204 (67%), Gaps = 6/204 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKC 99
           K   Y  I+RN+YL   +K++  D      CT     S VC  DC C  L  SCS  C C
Sbjct: 40  KPPSYTHIRRNVYL---VKKKRADSSAETGCTNCKSDS-VCKDDCECRGLSMSCSKSCHC 95

Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
            + C NKPF  R  KK+K+V++E CG G VA E +++G+F+IEYVGEVI+D TCE+RLW 
Sbjct: 96  SDLCSNKPF--RKDKKIKIVKSEGCGWGAVALEPLEKGDFIIEYVGEVINDATCEQRLWD 153

Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
           MK  G+ NFY+CEI++D  IDAT+KGN SR++NHSC PN +++KW ++GETR G+FA+R 
Sbjct: 154 MKRRGDKNFYMCEISKDFTIDATFKGNTSRFLNHSCDPNCKLEKWQVEGETRAGVFASRI 213

Query: 220 IKKGENLTYDYQYEFLHDSLIAYC 243
           I+ GE LTYDY++    + +  +C
Sbjct: 214 IEVGEPLTYDYRFVHFGEKVKCHC 237


>gi|284434727|gb|ADB85424.1| putative retrotransposon protein [Phyllostachys edulis]
          Length = 1946

 Score =  202 bits (513), Expect = 1e-49,   Method: Composition-based stats.
 Identities = 88/125 (70%), Positives = 104/125 (83%)

Query: 107 PFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGET 166
           P  + P     L  TEKCG G+VA+++IK+GEFVIEYVGEVIDD+TCEERLWKMK   +T
Sbjct: 95  PLTSNPFYHRSLTLTEKCGFGLVAEDEIKKGEFVIEYVGEVIDDRTCEERLWKMKRQRDT 154

Query: 167 NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENL 226
           NFYLCE++ +MVIDAT KGN SR+INHSC PNTEMQKW +DGETR+GIFA RDIK+GE L
Sbjct: 155 NFYLCEVSSNMVIDATNKGNMSRFINHSCEPNTEMQKWTVDGETRVGIFALRDIKEGEEL 214

Query: 227 TYDYQ 231
           TYDY+
Sbjct: 215 TYDYK 219


>gi|222623224|gb|EEE57356.1| hypothetical protein OsJ_07493 [Oryza sativa Japonica Group]
          Length = 384

 Score =  201 bits (512), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 101/226 (44%), Positives = 140/226 (61%), Gaps = 15/226 (6%)

Query: 25  GNPVEFELPDWFIKPKAI-------PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
           G P E  + +W  K  A+          F+       KR   R E     CS      + 
Sbjct: 55  GRPFEEYVKEWKAKKAALGVPAGRCELPFLTGTPKAKKRPDSRAEAGCTNCS------AD 108

Query: 78  GVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRG 137
             C  DC C  L  SCS  C C + C NKPF  R  KK+K V+T++CG G ++ E +++G
Sbjct: 109 STCKDDCECRGLYMSCSKNCHCSDMCTNKPF--RKDKKIKAVKTKRCGWGAISLEPLEKG 166

Query: 138 EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
           +F+IEYVGEVI+D TCE+RLW MK  G+ NFY+CEI++D  IDAT+KGN SR++NHSC P
Sbjct: 167 DFIIEYVGEVINDATCEQRLWDMKRRGDKNFYMCEISKDFTIDATFKGNTSRFLNHSCDP 226

Query: 198 NTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
           N +++KW +DGETR+G+FA+R I+ GE+LTYDY++    + +  YC
Sbjct: 227 NCKLEKWQVDGETRVGVFASRSIQVGEHLTYDYRFVHFGEKVKCYC 272


>gi|224143888|ref|XP_002325110.1| SET domain protein [Populus trichocarpa]
 gi|222866544|gb|EEF03675.1| SET domain protein [Populus trichocarpa]
          Length = 516

 Score =  200 bits (509), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 98/201 (48%), Positives = 135/201 (67%), Gaps = 8/201 (3%)

Query: 43  PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNS 102
           PYV I+RN+YL K+ KR   D  + C+       S  C  +C C +   SCS  C+C  +
Sbjct: 285 PYVHIRRNVYLVKK-KRDDSDGDVGCT-----NCSSTCCENCVCRVQCISCSKACRCPET 338

Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
           C N+PF  R  KK+K+V+TE CG G+ A E + +G+F+IEY+GEVIDD+ CE+RLW MK+
Sbjct: 339 CTNRPF--RKEKKIKIVKTEFCGWGVEAAEPLNKGDFIIEYIGEVIDDKLCEQRLWDMKY 396

Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
            G  NFY+CEI +D  IDAT+KGN SR++NHSC PN  ++KW ++GETR+G+FA   I+ 
Sbjct: 397 KGVQNFYMCEIRKDFTIDATFKGNSSRFLNHSCKPNCILEKWDVEGETRVGVFAAGSIRV 456

Query: 223 GENLTYDYQYEFLHDSLIAYC 243
           GE LTYDY++      +  YC
Sbjct: 457 GEPLTYDYRFVRFGPEVKCYC 477


>gi|356514491|ref|XP_003525939.1| PREDICTED: histone-lysine N-methyltransferase ASHR3-like [Glycine
           max]
          Length = 459

 Score =  196 bits (498), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 102/190 (53%), Positives = 132/190 (69%), Gaps = 6/190 (3%)

Query: 43  PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNS 102
           PYV I+RNIYL K+ KR   DDG   +   S  S+  C  DC C +   SCS  C+C  +
Sbjct: 224 PYVHIRRNIYLVKK-KRSDADDG---AGCTSCSSTSTCSDDCVCRVQCISCSKACRCSEN 279

Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
           C N+PF  R  KK+K+V+TE CG G+ A E I +G F+IEY+GEVIDD  CE+RLW MK+
Sbjct: 280 CNNRPF--RKEKKIKIVKTELCGWGVEAAETIDKGGFIIEYIGEVIDDALCEKRLWDMKY 337

Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
            G  NFY+CEI +D  IDAT+KGN SR++NHSC PN  ++KW +DGETR+G+FA   I+ 
Sbjct: 338 RGVQNFYMCEIRKDFTIDATFKGNTSRFLNHSCDPNCVLEKWQVDGETRVGVFAACSIEA 397

Query: 223 GENLTYDYQY 232
           GE LTYDY++
Sbjct: 398 GEPLTYDYRF 407


>gi|238013758|gb|ACR37914.1| unknown [Zea mays]
          Length = 252

 Score =  189 bits (480), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 84/152 (55%), Positives = 118/152 (77%), Gaps = 2/152 (1%)

Query: 92  SCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQ 151
           SCS  C+C + C N+PF  R  KK+K+V+T++CG G VA E ++RG+FVIEYVGEVIDD 
Sbjct: 2   SCSKNCRCSDLCTNRPF--RKDKKIKIVKTKRCGWGAVALEPLERGDFVIEYVGEVIDDA 59

Query: 152 TCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETR 211
           TCE+RLW ++  G+ NFY+CEI++D  IDAT+KGN SR++NHSC PN +++KW +DGETR
Sbjct: 60  TCEQRLWDIRRRGDKNFYMCEISKDFTIDATFKGNVSRFLNHSCEPNCKLEKWQVDGETR 119

Query: 212 IGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
           +G+FA+R I+ GE LTYDY++    + +  +C
Sbjct: 120 VGVFASRSIEVGEPLTYDYRFVHFGEKVKCHC 151


>gi|297599580|ref|NP_001047405.2| Os02g0611300 [Oryza sativa Japonica Group]
 gi|255671080|dbj|BAF09319.2| Os02g0611300 [Oryza sativa Japonica Group]
          Length = 263

 Score =  189 bits (479), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 83/152 (54%), Positives = 117/152 (76%), Gaps = 2/152 (1%)

Query: 92  SCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQ 151
           SCS  C C + C NKPF  R  KK+K V+T++CG G ++ E +++G+F+IEYVGEVI+D 
Sbjct: 2   SCSKNCHCSDMCTNKPF--RKDKKIKAVKTKRCGWGAISLEPLEKGDFIIEYVGEVINDA 59

Query: 152 TCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETR 211
           TCE+RLW MK  G+ NFY+CEI++D  IDAT+KGN SR++NHSC PN +++KW +DGETR
Sbjct: 60  TCEQRLWDMKRRGDKNFYMCEISKDFTIDATFKGNTSRFLNHSCDPNCKLEKWQVDGETR 119

Query: 212 IGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
           +G+FA+R I+ GE+LTYDY++    + +  YC
Sbjct: 120 VGVFASRSIQVGEHLTYDYRFVHFGEKVKCYC 151


>gi|218191149|gb|EEC73576.1| hypothetical protein OsI_08033 [Oryza sativa Indica Group]
          Length = 163

 Score =  187 bits (474), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 84/153 (54%), Positives = 118/153 (77%), Gaps = 2/153 (1%)

Query: 89  LLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVI 148
           L  SCS  C C + C NKPF  R  KK+K V+T++CG G ++ E +++G+F+IEYVGEVI
Sbjct: 4   LYMSCSKNCHCSDMCTNKPF--RKDKKIKAVKTKRCGWGAISLEPLEKGDFIIEYVGEVI 61

Query: 149 DDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDG 208
           +D TCE+RLW MK  G+ NFY+CEI++D  IDAT+KGN SR++NHSC PN +++KW +DG
Sbjct: 62  NDATCEQRLWDMKRRGDKNFYMCEISKDFTIDATFKGNTSRFLNHSCDPNCKLEKWQVDG 121

Query: 209 ETRIGIFATRDIKKGENLTYDYQYEFLHDSLIA 241
           ETR+G+FA+R I+ GE+LTYDY+Y    + L++
Sbjct: 122 ETRVGVFASRSIQVGEHLTYDYRYSSCIEKLLS 154


>gi|255574954|ref|XP_002528383.1| set domain protein, putative [Ricinus communis]
 gi|223532171|gb|EEF33976.1| set domain protein, putative [Ricinus communis]
          Length = 500

 Score =  186 bits (473), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 97/190 (51%), Positives = 131/190 (68%), Gaps = 8/190 (4%)

Query: 43  PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNS 102
           PYV I+RN+YL K+ +  +++D    SC++S     VC   C       SCS  C C  +
Sbjct: 267 PYVHIRRNVYLVKKKRDDVDNDVGCTSCSSSCCEDCVCRVQC------ISCSRACHCSEN 320

Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
           C N+PF  R  KK+K+V+TE CG G+   E I +G+F+IEY+GEVIDD  CE+RLW MK+
Sbjct: 321 CTNRPF--RKEKKIKIVKTEFCGWGVETVEPINKGDFIIEYIGEVIDDAVCEQRLWDMKY 378

Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
            G  NFY+CEI +D  IDAT+KGN SR++NHSC PN  ++KW ++GETR+G+FA R IK 
Sbjct: 379 KGVQNFYMCEIRKDFTIDATFKGNSSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKV 438

Query: 223 GENLTYDYQY 232
           GE LTYDY++
Sbjct: 439 GEPLTYDYRF 448


>gi|357141465|ref|XP_003572234.1| PREDICTED: histone-lysine N-methyltransferase ASHR3-like
           [Brachypodium distachyon]
          Length = 347

 Score =  184 bits (466), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 91/208 (43%), Positives = 139/208 (66%), Gaps = 8/208 (3%)

Query: 26  NPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCH 85
           +P++ E  +   KP    YV +KR+I     +  + + D I   CT        C   C 
Sbjct: 128 DPIKTEELESLTKPPT--YVLLKRSILC---MIHKCDGDAIEGGCTDC-DPPLACKTMCS 181

Query: 86  CGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVG 145
           C  +  SCS  CKC N C N+PF+    K++++V+T+ CG G+VA E I++G+FVIE+VG
Sbjct: 182 CRSVWISCSRACKCSNECTNRPFRRE--KRIEVVKTQHCGWGVVALESIQKGDFVIEFVG 239

Query: 146 EVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWI 205
           EVIDD TCEERL  MK  G+ NFY+C++N++ VIDAT++GN  R++NHSC PN +++KW 
Sbjct: 240 EVIDDVTCEERLEDMKRRGDQNFYMCKVNKNFVIDATFRGNACRFLNHSCEPNCQLEKWQ 299

Query: 206 IDGETRIGIFATRDIKKGENLTYDYQYE 233
           ++G+TR+G+FA++ I+ G+ LTY Y+++
Sbjct: 300 VNGKTRLGVFASQAIEVGKPLTYSYRFK 327


>gi|147765700|emb|CAN77905.1| hypothetical protein VITISV_024391 [Vitis vinifera]
          Length = 290

 Score =  182 bits (463), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 90/134 (67%), Positives = 100/134 (74%), Gaps = 17/134 (12%)

Query: 99  CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
           C            P+   K++ TEKCG+GIVADEDIK+GEFVIEYVGEVIDD+TCE+RLW
Sbjct: 56  CAPPSYTAKLDGMPMHLSKVMLTEKCGSGIVADEDIKQGEFVIEYVGEVIDDKTCEDRLW 115

Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
           KMKHLGETNFYLCEINRDMVIDATYKGNK                  IDGETRIGIFATR
Sbjct: 116 KMKHLGETNFYLCEINRDMVIDATYKGNKR-----------------IDGETRIGIFATR 158

Query: 219 DIKKGENLTYDYQY 232
           DIK+GE+LTYDYQ+
Sbjct: 159 DIKRGEHLTYDYQF 172


>gi|2980780|emb|CAA18207.1| putative protein [Arabidopsis thaliana]
 gi|7269987|emb|CAB79804.1| putative protein [Arabidopsis thaliana]
          Length = 477

 Score =  175 bits (444), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 77/137 (56%), Positives = 103/137 (75%), Gaps = 2/137 (1%)

Query: 96  GCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEE 155
           GC C  SC N+PF  R  KK+K+V+TE CG G+ A E I + +F++EY+GEVI D  CE+
Sbjct: 290 GCSCPESCGNRPF--RKEKKIKIVKTEHCGWGVEAAESINKEDFIVEYIGEVISDAQCEQ 347

Query: 156 RLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIF 215
           RLW MKH G  +FY+CEI +D  IDAT+KGN SR++NHSC PN  ++KW ++GETR+G+F
Sbjct: 348 RLWDMKHKGMKDFYMCEIQKDFTIDATFKGNASRFLNHSCNPNCVLEKWQVEGETRVGVF 407

Query: 216 ATRDIKKGENLTYDYQY 232
           A R I+ GE LTYDY++
Sbjct: 408 AARQIEAGEPLTYDYRF 424


>gi|168025972|ref|XP_001765507.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683357|gb|EDQ69768.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 993

 Score =  171 bits (433), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 83/145 (57%), Positives = 102/145 (70%), Gaps = 3/145 (2%)

Query: 89  LLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVI 148
           L  SCS  CKC + C NKPF  R  K++K+ +T  CG G      IK+ EFVIEY GEVI
Sbjct: 792 LSMSCSKDCKCSDKCCNKPF--RKDKRLKVSKTAHCGWGAFTSVAIKKDEFVIEYTGEVI 849

Query: 149 DDQTCEERLWKMKHLGET-NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIID 207
           DD  CE+RLW+MK      NFY+CEI +D +IDAT KGN SRY+NHSC PN  ++KW +D
Sbjct: 850 DDAMCEKRLWEMKGRRSICNFYMCEIAKDFIIDATRKGNASRYLNHSCQPNCRLEKWRVD 909

Query: 208 GETRIGIFATRDIKKGENLTYDYQY 232
           GETR+G+FA R+I  GE LTYDY+Y
Sbjct: 910 GETRVGVFAGRNIIAGEELTYDYKY 934


>gi|212721872|ref|NP_001132822.1| uncharacterized protein LOC100194312 [Zea mays]
 gi|194695492|gb|ACF81830.1| unknown [Zea mays]
 gi|413923000|gb|AFW62932.1| putative SET-domain containing protein family [Zea mays]
          Length = 418

 Score =  165 bits (418), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 74/142 (52%), Positives = 106/142 (74%), Gaps = 1/142 (0%)

Query: 103 CLNKPFQNRPVKKMKLVQ-TEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
           C+ +   N  V+  +  Q T++CG G VA E ++RG+FVIEYVGEVIDD TCE+RLW ++
Sbjct: 176 CVAEDTSNFTVESFRCPQHTKRCGWGAVALEPLERGDFVIEYVGEVIDDATCEQRLWDIR 235

Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
             G+ NFY+CEI++D  IDAT+KGN SR++NHSC PN +++KW +DGETR+G+FA+R I+
Sbjct: 236 RRGDKNFYMCEISKDFTIDATFKGNVSRFLNHSCEPNCKLEKWQVDGETRVGVFASRSIE 295

Query: 222 KGENLTYDYQYEFLHDSLIAYC 243
            GE LTYDY++    + +  +C
Sbjct: 296 VGEPLTYDYRFVHFGEKVKCHC 317


>gi|42407424|dbj|BAD10031.1| SET domain protein-like [Oryza sativa Japonica Group]
          Length = 437

 Score =  163 bits (412), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 72/133 (54%), Positives = 105/133 (78%), Gaps = 2/133 (1%)

Query: 101 NSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKM 160
           + C NKPF+ +  KK+++V+T+ CG G  A E I++ +FVIE+VGEVIDD+TCEERL  M
Sbjct: 278 HECTNKPFRRQ--KKIEIVKTQYCGWGSRALEAIEKDDFVIEFVGEVIDDETCEERLEDM 335

Query: 161 KHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDI 220
           +  G+ NFY+C++ +D VIDAT+KGN  R+ NHSC PN ++QKW ++G+TR+G+FA++ I
Sbjct: 336 RRRGDKNFYMCKVKKDFVIDATFKGNDCRFFNHSCEPNCQLQKWQVNGKTRLGVFASKAI 395

Query: 221 KKGENLTYDYQYE 233
           + GE LTYDY++E
Sbjct: 396 EVGEPLTYDYRFE 408


>gi|145353759|ref|XP_001421172.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|145357147|ref|XP_001422783.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581408|gb|ABO99465.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144583027|gb|ABP01142.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 503

 Score =  162 bits (409), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 84/208 (40%), Positives = 120/208 (57%), Gaps = 10/208 (4%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRL-EDDGIFCSCTASP--GSSGV----CDRDCHCGMLLSS 92
           K  P+  I R++++++    +L + +   C C   P  G S      C ++C    L  S
Sbjct: 199 KPPPFERIHRSVFVSRPPPVKLHKSETAVCDCHPPPSRGDSETIRDGCGQECLNRKLRFS 258

Query: 93  CSS-GCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQ 151
           C S  C CG++C N+P    P  K K+++TE  G G+   E ++ G F++EY GE++D+ 
Sbjct: 259 CDSRTCPCGDACSNRPLSQLPAPKTKIIRTENRGWGLTLQEPVRAGTFIVEYAGEILDEH 318

Query: 152 TCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWI--IDGE 209
            C ERLW  K  GE NFYL EI+ + VIDA +KG+ +R+IN SC PN E Q+W+     E
Sbjct: 319 ECAERLWYDKQSGEENFYLMEISANYVIDAKFKGSIARFINSSCHPNCETQRWVDASTNE 378

Query: 210 TRIGIFATRDIKKGENLTYDYQYEFLHD 237
           TR+GIFAT DI  G  LTYDY +    D
Sbjct: 379 TRVGIFATEDIASGTELTYDYNFAHFGD 406


>gi|440801495|gb|ELR22513.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
          Length = 981

 Score =  159 bits (401), Expect = 1e-36,   Method: Composition-based stats.
 Identities = 78/197 (39%), Positives = 114/197 (57%), Gaps = 2/197 (1%)

Query: 38  KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SG 96
           KP  IP+ +IK+N+YL + ++    D+   C+C    G    C  +C   +L   C    
Sbjct: 332 KPPPIPFSYIKKNMYLHRSMRHWSSDEVPVCNCRLVSGKKA-CGENCINRVLNIECKLKH 390

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C CG +C N+ FQ R   K+    T K G G+ A E I +G F+IEYVGEVI    C++R
Sbjct: 391 CPCGTNCSNRQFQLRKYAKIDRFLTGKKGWGLRAREKIPKGTFIIEYVGEVISTDMCQDR 450

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +   + +G  ++Y   ++    IDA+ KGN +R+INHSC PN +  KW +D E R+GIFA
Sbjct: 451 MKYYEEMGLEHYYFLTLDGSECIDASQKGNLARFINHSCNPNAKTHKWTVDKEIRVGIFA 510

Query: 217 TRDIKKGENLTYDYQYE 233
             DI  G+ +T+DYQ+E
Sbjct: 511 EEDIPVGQEITFDYQFE 527


>gi|168044865|ref|XP_001774900.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673794|gb|EDQ60312.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1980

 Score =  157 bits (397), Expect = 4e-36,   Method: Composition-based stats.
 Identities = 80/191 (41%), Positives = 112/191 (58%), Gaps = 3/191 (1%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTA-SPGSSGVCDRDCHCGMLLSSC-SSGCKCGN 101
            +  IKRNI+  +  K +  D+ + C C    PG  G C  DC   ML   C    C CG 
Sbjct: 832  WTLIKRNIFKHRSQKHQDADEAMVCQCAPPKPGEVG-CGEDCLNRMLNVECLPQQCPCGP 890

Query: 102  SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
             C N+ FQ R    ++L +  K G G+ A E+I RG F+IEYVGEV+D  + E R  +  
Sbjct: 891  FCTNQQFQKRLYANVELFRCGKKGHGLRALENIPRGTFIIEYVGEVLDMPSFEARQKEYS 950

Query: 162  HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
               + +FY   ++ + +IDA  KGN  R+INHSC PN + +KW++DGE  IG+FA RD+K
Sbjct: 951  MNSQKHFYFMTLSANEIIDACSKGNLGRFINHSCEPNCQTEKWMVDGEVCIGLFAIRDVK 1010

Query: 222  KGENLTYDYQY 232
            KGE +T+DY +
Sbjct: 1011 KGEEVTFDYNF 1021


>gi|145355325|ref|XP_001421914.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582153|gb|ABP00208.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 860

 Score =  155 bits (392), Expect = 1e-35,   Method: Composition-based stats.
 Identities = 75/186 (40%), Positives = 113/186 (60%), Gaps = 3/186 (1%)

Query: 49  RNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKP 107
           RNI+  +  +   EDD + C+C  +P S   C  DC   ++LS C    C CG++C N+ 
Sbjct: 114 RNIFAHRAPRTPSEDDAMICAC--APESGAGCGSDCLNRLVLSECDPAHCPCGSACGNQR 171

Query: 108 FQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN 167
                 +   + +T K G G+ A E +  GEFV+EY GEV+ ++  +ER  + +  G ++
Sbjct: 172 MSRGESRATTVRRTGKKGHGLFAAERVGAGEFVLEYCGEVLHEEAYKERKRRYQDEGRSH 231

Query: 168 FYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLT 227
           +Y   ++    IDAT +GN+ R++NHSC PN E QKW++ GE  IGIFATRDI++GE LT
Sbjct: 232 YYFMTLSSSETIDATIRGNEGRFLNHSCAPNCETQKWMVRGELCIGIFATRDIEEGEELT 291

Query: 228 YDYQYE 233
            DY++E
Sbjct: 292 IDYKFE 297


>gi|390342260|ref|XP_003725626.1| PREDICTED: uncharacterized protein LOC578079 isoform 1
            [Strongylocentrotus purpuratus]
          Length = 3023

 Score =  154 bits (389), Expect = 3e-35,   Method: Composition-based stats.
 Identities = 80/212 (37%), Positives = 117/212 (55%), Gaps = 8/212 (3%)

Query: 30   FELPDWFI--KPKAIPYVFIKRNIYLTKRIKRRL--EDDGIFCSCTASPGSS----GVCD 81
             +  +W I  + K   +V +  N YLT+R + +   E   + C C  S          C 
Sbjct: 1571 LQYAEWGIVERSKMPRFVELNENYYLTERRRSKAGKEARRMVCCCATSREERRRGIQACG 1630

Query: 82   RDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVI 141
             DC   +L+  C S C CG+ C N+ FQ R   ++ +  TE+ G G+ A E++K  EFV+
Sbjct: 1631 EDCLNRVLMLECGSRCPCGDYCTNRRFQKRENARVGVFYTEEKGHGLKAKEELKDNEFVM 1690

Query: 142  EYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
            EYVGEV++    + R  +       +FY   +  D +IDAT KGN SR++NHSC PN E 
Sbjct: 1691 EYVGEVLNFHEFKHRAKQYSKDKNLHFYFMALKSDEIIDATEKGNVSRFMNHSCDPNCET 1750

Query: 202  QKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
            QKW ++G+ R+G F  R +K GE LT+DYQ+E
Sbjct: 1751 QKWTVNGQLRVGFFTKRQVKPGEELTFDYQFE 1782


>gi|390342258|ref|XP_783359.3| PREDICTED: uncharacterized protein LOC578079 isoform 2
            [Strongylocentrotus purpuratus]
          Length = 3024

 Score =  154 bits (389), Expect = 3e-35,   Method: Composition-based stats.
 Identities = 80/212 (37%), Positives = 117/212 (55%), Gaps = 8/212 (3%)

Query: 30   FELPDWFI--KPKAIPYVFIKRNIYLTKRIKRRL--EDDGIFCSCTASPGSS----GVCD 81
             +  +W I  + K   +V +  N YLT+R + +   E   + C C  S          C 
Sbjct: 1571 LQYAEWGIVERSKMPRFVELNENYYLTERRRSKAGKEARRMVCCCATSREERRRGIQACG 1630

Query: 82   RDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVI 141
             DC   +L+  C S C CG+ C N+ FQ R   ++ +  TE+ G G+ A E++K  EFV+
Sbjct: 1631 EDCLNRVLMLECGSRCPCGDYCTNRRFQKRENARVGVFYTEEKGHGLKAKEELKDNEFVM 1690

Query: 142  EYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
            EYVGEV++    + R  +       +FY   +  D +IDAT KGN SR++NHSC PN E 
Sbjct: 1691 EYVGEVLNFHEFKHRAKQYSKDKNLHFYFMALKSDEIIDATEKGNVSRFMNHSCDPNCET 1750

Query: 202  QKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
            QKW ++G+ R+G F  R +K GE LT+DYQ+E
Sbjct: 1751 QKWTVNGQLRVGFFTKRQVKPGEELTFDYQFE 1782


>gi|320169023|gb|EFW45922.1| ASHH3 [Capsaspora owczarzaki ATCC 30864]
          Length = 885

 Score =  154 bits (388), Expect = 4e-35,   Method: Composition-based stats.
 Identities = 80/214 (37%), Positives = 123/214 (57%), Gaps = 12/214 (5%)

Query: 38  KPKAIPYVFIKRNIYLTKRIKRRLEDDGIF---CSCTASPGS---SGVCDRDCHCGMLLS 91
           +P+  P+  I  + Y+    K+    D  F   C C+  P +   +  C ++C   ML++
Sbjct: 120 EPRDKPFQVITHSEYINLSAKK----DPKFKFECECSFDPATDDPATACGKNCLNRMLMA 175

Query: 92  SCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDD 150
            CS   C CG  C N+ FQNR    M++ +TEK G G++  ED+  G+F+IEYVG+V+ +
Sbjct: 176 ECSPKRCPCGTYCTNQRFQNRQYPAMEVFRTEKKGNGLMVLEDLAPGQFLIEYVGDVVHN 235

Query: 151 QTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGET 210
           +  ++R          +FY   ++ D VIDAT +G+ SR+INHSC PN E QKW++D   
Sbjct: 236 REFKKRTKSYHERQYDHFYFMTLSSDEVIDATVRGSISRFINHSCEPNCETQKWVVDRRI 295

Query: 211 RIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           R+GIFA + IK G  +T+DY++E F  +    YC
Sbjct: 296 RVGIFAKKAIKAGTEITFDYKFERFSDEGQACYC 329


>gi|222640644|gb|EEE68776.1| hypothetical protein OsJ_27487 [Oryza sativa Japonica Group]
          Length = 405

 Score =  152 bits (384), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 66/118 (55%), Positives = 92/118 (77%)

Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
           L+QT+ CG G  A E I++ +FVIE+VGEVIDD+TCEERL  M+  G+ NFY+C++ +D 
Sbjct: 281 LLQTQYCGWGSRALEAIEKDDFVIEFVGEVIDDETCEERLEDMRRRGDKNFYMCKVKKDF 340

Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
           VIDAT+KGN  R+ NHSC PN ++QKW ++G+TR+G+FA++ I+ GE LTYDY +  L
Sbjct: 341 VIDATFKGNDCRFFNHSCEPNCQLQKWQVNGKTRLGVFASKAIEVGEPLTYDYSFRLL 398


>gi|414870328|tpg|DAA48885.1| TPA: putative SET-domain containing protein family [Zea mays]
          Length = 266

 Score =  152 bits (384), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 69/137 (50%), Positives = 100/137 (72%), Gaps = 2/137 (1%)

Query: 96  GCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEE 155
           GC C   C NKPF  R  KK+K+V+T +CG G +A E I + +FVIE+VGEVIDD  CE+
Sbjct: 111 GCPCSVKCSNKPF--RREKKIKIVKTRQCGWGAIALETIGKDDFVIEFVGEVIDDAMCED 168

Query: 156 RLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIF 215
           RL  M+   + NFY+C++++D VID T+KGN  R+ NHSC PN  ++KW ++G+TR+G+F
Sbjct: 169 RLQDMRQRRDQNFYMCKVDKDFVIDPTFKGNACRFFNHSCQPNCRLEKWQVNGKTRLGVF 228

Query: 216 ATRDIKKGENLTYDYQY 232
           A++ I+ G  LTY+Y++
Sbjct: 229 ASQTIEVGMPLTYNYRF 245


>gi|218201221|gb|EEC83648.1| hypothetical protein OsI_29393 [Oryza sativa Indica Group]
          Length = 451

 Score =  152 bits (384), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 65/118 (55%), Positives = 92/118 (77%)

Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
           L+QT+ CG G  A E I++ +FVIE+VGEVIDD+TCEERL  M+  G+ NFY+C++ +D 
Sbjct: 327 LLQTQYCGWGSRALEAIEKDDFVIEFVGEVIDDETCEERLEDMRRRGDKNFYMCKVKKDF 386

Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
           VIDAT+KGN  R+ NHSC PN ++QKW ++G+TR+G+FA++ I+ GE LT+DY +  L
Sbjct: 387 VIDATFKGNDCRFFNHSCEPNCQLQKWQVNGKTRLGVFASKAIEVGEPLTFDYSFRLL 444


>gi|405952170|gb|EKC20012.1| Histone-lysine N-methyltransferase SETD2 [Crassostrea gigas]
          Length = 1451

 Score =  152 bits (383), Expect = 2e-34,   Method: Composition-based stats.
 Identities = 77/201 (38%), Positives = 113/201 (56%), Gaps = 6/201 (2%)

Query: 39  PKAIPYVFIKRNIYLTKRIKRRLEDD--GIFCSCTASPGSSGV----CDRDCHCGMLLSS 92
           PK  P+  I+ NIYL +R K +   D   + C C+ S     +    C  DC   ML   
Sbjct: 403 PKKPPFEPIEDNIYLCERKKNKKMKDVRRMVCDCSTSKEDRDMGYEACGEDCLNRMLYIE 462

Query: 93  CSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
           C + C CG  C NK FQ +    ++   T+  G G+ A   ++ G+FV+EYVGEV+D + 
Sbjct: 463 CGNRCPCGEYCTNKRFQKKQYADVEAFVTDWKGMGLRATAALQPGDFVMEYVGEVLDYKQ 522

Query: 153 CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
            + R+ +   +G+ + Y   +N D VIDA+YKGN SRY+NHSC PN E QKW ++G  R+
Sbjct: 523 FKSRVKQQAKMGQEHHYFMALNSDEVIDASYKGNVSRYMNHSCDPNCETQKWTVNGVLRV 582

Query: 213 GIFATRDIKKGENLTYDYQYE 233
           G F  + ++    L +DYQ+E
Sbjct: 583 GFFVKKAVEPLTELNFDYQFE 603


>gi|189237403|ref|XP_973596.2| PREDICTED: similar to AGAP011688-PA [Tribolium castaneum]
 gi|270007628|gb|EFA04076.1| hypothetical protein TcasGA2_TC014310 [Tribolium castaneum]
          Length = 1569

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 79/195 (40%), Positives = 114/195 (58%), Gaps = 7/195 (3%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASP-----GSSGVCDRDCHCGMLLSSCSSGCK 98
           +V +K N+YLT R+  + E   + C C  +P     G  G C  DC   +L+  C   C 
Sbjct: 500 FVHLKENLYLTDRMSCK-EAKKMTCDCFLTPEEIERGELG-CGEDCLNRLLMIECGGLCP 557

Query: 99  CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
            G+ C NK FQ      +++ +TEK G G+ A  +I  GEF++EYVGEV+D +  + R  
Sbjct: 558 VGDRCTNKKFQKSQFAPVEVFKTEKKGLGLRAAANIPYGEFILEYVGEVLDPEEFDNRAD 617

Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
              +    ++Y   +  D +IDAT KGN SR+INHSC PN E QKW ++GE RIG F+TR
Sbjct: 618 DYSNDKNKHYYFMSLRADAIIDATMKGNISRFINHSCDPNAETQKWTVNGELRIGFFSTR 677

Query: 219 DIKKGENLTYDYQYE 233
            I  GE +T+DY+++
Sbjct: 678 TILAGEEITFDYRFQ 692


>gi|168009924|ref|XP_001757655.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691349|gb|EDQ77712.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1715

 Score =  151 bits (382), Expect = 2e-34,   Method: Composition-based stats.
 Identities = 78/191 (40%), Positives = 111/191 (58%), Gaps = 3/191 (1%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTA-SPGSSGVCDRDCHCGMLLSSC-SSGCKCGN 101
           +  IKRNI+  +  K +  D+ + C C    PG  G C  DC   +L   C    C CG 
Sbjct: 670 WTLIKRNIFKHRNQKHQDADEAMVCQCAPPKPGEVG-CGEDCLNRILNVECLPQQCPCGP 728

Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
            C N+ FQ R    ++L +  K G G+ A E+I RG F+IEYVGEV+D  + E R  +  
Sbjct: 729 LCTNQQFQKRTYANVELFRCGKKGHGLRALENIPRGTFIIEYVGEVLDMPSFEARQKEYS 788

Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
              + +FY   ++ + +IDA  KGN  R+INHSC PN + +KW++DGE  IG+FA RDIK
Sbjct: 789 MNSQKHFYFMTLSANEIIDACNKGNLGRFINHSCEPNCQTEKWMVDGEVCIGLFAIRDIK 848

Query: 222 KGENLTYDYQY 232
           + E +T+DY +
Sbjct: 849 EREEVTFDYNF 859


>gi|158301050|ref|XP_001238385.2| AGAP011688-PA [Anopheles gambiae str. PEST]
 gi|157013454|gb|EAU75883.2| AGAP011688-PA [Anopheles gambiae str. PEST]
          Length = 2404

 Score =  151 bits (381), Expect = 3e-34,   Method: Composition-based stats.
 Identities = 83/217 (38%), Positives = 119/217 (54%), Gaps = 10/217 (4%)

Query: 25   GNPVEFELPDWFIK---PKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTAS-----PGS 76
            G+    EL + +++    K   +  I+ NIY + RI  R E   + C C  +      G 
Sbjct: 1185 GDSANVELSEEYLRDMEEKLSRFETIRENIYHSDRIVSR-EAKKMTCDCFLTHEEIERGE 1243

Query: 77   SGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKR 136
             G C  DC   +L+  C S C  G+ C N+ FQ +     ++ +TEK G GI A   I  
Sbjct: 1244 HG-CGEDCLNRLLMIECGSRCTVGDRCTNRRFQRQEYAHCQVFRTEKKGFGIQASSAIAP 1302

Query: 137  GEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCC 196
            GEF++EYVGEV++    +ER          ++Y   +  D +IDAT KGN SR+INHSC 
Sbjct: 1303 GEFIMEYVGEVLNSAQFDERAEAYSREKNKHYYFMALRSDGIIDATTKGNISRFINHSCD 1362

Query: 197  PNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
            PN E QKW ++GE RIG F+T+ I  GE +T+DYQ++
Sbjct: 1363 PNAETQKWTVNGELRIGFFSTKYILPGEEITFDYQFQ 1399


>gi|427779581|gb|JAA55242.1| Putative histone-lysine n-methyltransferase setd2 [Rhipicephalus
           pulchellus]
          Length = 2038

 Score =  150 bits (380), Expect = 3e-34,   Method: Composition-based stats.
 Identities = 80/203 (39%), Positives = 115/203 (56%), Gaps = 12/203 (5%)

Query: 40  KAIPYVFIKRNIYLTKRIK-------RRLEDDGIFCSCTASPGSSGV--CDRDCHCGMLL 90
           K++ Y  I+ N+YL +R K       RR+  D   CS T      G+  C+ DC   +L+
Sbjct: 789 KSMDYEDIEENLYLFERRKTKSKKEVRRMICD---CSLTKDEKDRGIMGCEEDCLNRLLM 845

Query: 91  SSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDD 150
             C S C  G +C N+ FQ +   K++   TEK G G+   E +  G FV+EYVGEV+  
Sbjct: 846 IECGSRCPNGENCSNRRFQKKSYIKVEKFMTEKKGWGLRTLETVSSGTFVMEYVGEVLTP 905

Query: 151 QTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGET 210
           +   +R+ +       ++Y   +  D +IDAT KGN SR+INHSC PN E QKW ++GE 
Sbjct: 906 EDFRKRVKQYARDNNQHYYFMALRADEIIDATQKGNVSRFINHSCDPNCETQKWTVNGEL 965

Query: 211 RIGIFATRDIKKGENLTYDYQYE 233
           RIG F  R ++ GE LT+DYQ++
Sbjct: 966 RIGFFTRRPLRAGEELTFDYQFQ 988


>gi|301614673|ref|XP_002936809.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Xenopus
            (Silurana) tropicalis]
          Length = 1298

 Score =  150 bits (378), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 83/202 (41%), Positives = 120/202 (59%), Gaps = 10/202 (4%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P S   C  D +C   ML+  C    
Sbjct: 931  KPPPYKHIKVNKPYGK-VQVYTADISEIPKCNCKPSSEKPCGFDSECLNRMLMYECHPQV 989

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R   + K+++TE  G G++A  DIK+GEFV EY+GE+ID++ C   
Sbjct: 990  CPAGDRCQNQCFNKRQYPETKIIKTEGKGWGLIATRDIKKGEFVNEYIGELIDEEEC--- 1046

Query: 157  LWKMKHLGE---TNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIG 213
            +++++H  E   T+FY+  I++D +IDA  KGN SR++NHSC PN E QKW ++G+TR+G
Sbjct: 1047 MYRIRHAQENDITHFYMLTIDKDRIIDAGPKGNFSRFMNHSCQPNCETQKWSVNGDTRVG 1106

Query: 214  IFATRDIKKGENLTYDYQYEFL 235
            +FA RDI  GE LT++Y  + L
Sbjct: 1107 LFAVRDIPAGEELTFNYNLDCL 1128


>gi|412985970|emb|CCO17170.1| predicted protein [Bathycoccus prasinos]
          Length = 1058

 Score =  150 bits (378), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 70/188 (37%), Positives = 115/188 (61%), Gaps = 1/188 (0%)

Query: 47  IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLN 105
           + RNI+  +  ++  EDD + C+C   P +   C+ +C   ++L+ C    C C  +C N
Sbjct: 167 VSRNIFKHREPRKLTEDDIMVCNCAIDPETGSGCEEECVNRLVLTECDPKFCPCSTACKN 226

Query: 106 KPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGE 165
           + F  +  K++++ ++ K G G+ ++E+IK GEFV+EYVGEV+ ++  EER  +      
Sbjct: 227 QRFSRKKFKELEVKRSSKKGHGLFSEENIKPGEFVLEYVGEVLHEEAYEERKRQYAREKR 286

Query: 166 TNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGEN 225
            +FY   ++    IDAT +G   R++NHSC PN E QKW++ GE  IG+FA R+I+ GE 
Sbjct: 287 RHFYFMTLSSSETIDATERGGVGRFLNHSCDPNCETQKWMVKGELCIGVFALREIQPGEE 346

Query: 226 LTYDYQYE 233
           +T DY++E
Sbjct: 347 ITIDYKFE 354


>gi|297602734|ref|NP_001052807.2| Os04g0429100 [Oryza sativa Japonica Group]
 gi|255675465|dbj|BAF14721.2| Os04g0429100, partial [Oryza sativa Japonica Group]
          Length = 612

 Score =  149 bits (377), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 81/204 (39%), Positives = 120/204 (58%), Gaps = 12/204 (5%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG------MLLSSCSSG- 96
           Y+ I+ N +L +R KR+ E+D   C C  +     + D D  CG      +  + C+ G 
Sbjct: 108 YIHIETNDFLHRRHKRQKEEDIAVCECQYN-----LLDPDSACGDRCLNVLTSTECTPGY 162

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C CG  C N+ FQ       +LV+TE  G G++ADE+I  G+FV+EY GEVI  +  + R
Sbjct: 163 CLCGVYCKNQRFQKSQYAATRLVKTEGRGWGLLADENIMAGQFVMEYCGEVISWKEAKRR 222

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
               ++ G T+ Y+  +N D  IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA
Sbjct: 223 SQAYENQGLTDAYIIYLNADESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFA 282

Query: 217 TRDIKKGENLTYDYQYEFLHDSLI 240
            +DI  G  L+YDY +E+   +++
Sbjct: 283 KQDIPIGTELSYDYNFEWFGGAMV 306


>gi|432879768|ref|XP_004073538.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific-like [Oryzias latipes]
          Length = 2321

 Score =  149 bits (376), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 81/196 (41%), Positives = 115/196 (58%), Gaps = 2/196 (1%)

Query: 40   KAIPYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-C 97
            K  PY  IK N  + K +I      +   C+C AS  +    D +C   ML+  C    C
Sbjct: 1789 KPPPYRHIKVNRPIGKVQIITADLSEVPRCNCKASDENPCGADSECINRMLMYECHPQVC 1848

Query: 98   KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
              G  C N+ F  R    +++ +T  CG G+ A  DIK+G FV EYVGEVID++ C  R+
Sbjct: 1849 AAGERCQNQAFTKRQYTTVEIYRTLSCGWGLRAVSDIKKGAFVSEYVGEVIDEEECRARI 1908

Query: 158  WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
               +     NFY+  +++D VIDA  KGN++R++NHSC PN E QKW ++G+TR+G+FA 
Sbjct: 1909 RHAQEHDICNFYMLTLDKDRVIDAGPKGNQARFMNHSCQPNCETQKWTVNGDTRVGLFAL 1968

Query: 218  RDIKKGENLTYDYQYE 233
            +DI KGE LT++Y  E
Sbjct: 1969 QDIAKGEELTFNYNLE 1984


>gi|255078218|ref|XP_002502689.1| set domain protein [Micromonas sp. RCC299]
 gi|226517954|gb|ACO63947.1| set domain protein [Micromonas sp. RCC299]
          Length = 1065

 Score =  149 bits (376), Expect = 1e-33,   Method: Composition-based stats.
 Identities = 71/195 (36%), Positives = 109/195 (55%), Gaps = 1/195 (0%)

Query: 47  IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLN 105
           I RNI+  +  K   EDD + C+C   P +   C  DC    +L  C    C CG+ C N
Sbjct: 104 IHRNIFTHREPKSVTEDDVLICACVPDPETGKGCGPDCINRDVLVECDPAFCPCGDGCQN 163

Query: 106 KPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGE 165
           + FQ +   K+ + +T + G G+   + +K+G+F+IEY+GEV+ +     R  +    G 
Sbjct: 164 QKFQKKDYAKLDIQRTGRKGHGLFTKQALKKGQFIIEYIGEVLHEDEYRSRKARYDDEGR 223

Query: 166 TNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGEN 225
            ++Y   ++    IDA  +GN  R++NHSC PN E QKW+++GE  IGI+A  DI  G+ 
Sbjct: 224 RHYYFMTLSSSETIDAAERGNAGRFLNHSCDPNCETQKWMVNGELCIGIYALTDIDAGDE 283

Query: 226 LTYDYQYEFLHDSLI 240
           LT+DY +E   D+ I
Sbjct: 284 LTFDYNFERYGDNPI 298


>gi|241753587|ref|XP_002401135.1| huntingtin interacting protein, putative [Ixodes scapularis]
 gi|215508354|gb|EEC17808.1| huntingtin interacting protein, putative [Ixodes scapularis]
          Length = 1594

 Score =  149 bits (376), Expect = 1e-33,   Method: Composition-based stats.
 Identities = 79/212 (37%), Positives = 119/212 (56%), Gaps = 12/212 (5%)

Query: 40  KAIPYVFIKRNIY-----LTKRIKRRLEDDGIFCSCTASPGSSGV--CDRDCHCGMLLSS 92
           +++ Y  I+ N+Y       K ++R + D    CS T      G+  C+ DC   +L+  
Sbjct: 541 RSLAYEDIEENLYRKKSKCKKEVRRMICD----CSLTKDERDRGIMGCEEDCLNRLLMIE 596

Query: 93  CSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
           C S C  G+SC N+ FQ +   K++   TEK G G+   E +  G FV+EYVGEV+  + 
Sbjct: 597 CGSRCPNGDSCSNRRFQKKSYIKVEKFLTEKKGWGLRTVETLASGAFVMEYVGEVLTPED 656

Query: 153 CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
             +R+ +       ++Y   +  D +IDAT KGN SR+INHSC PN E QKW ++GE RI
Sbjct: 657 FRKRVKQYARDNHQHYYFMALRSDEIIDATQKGNVSRFINHSCDPNCETQKWTVNGELRI 716

Query: 213 GIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           G F  R ++ GE LT+DYQ++ +  ++   YC
Sbjct: 717 GFFTRRPLRAGEELTFDYQFQRYGKEAQKCYC 748


>gi|198418893|ref|XP_002124393.1| PREDICTED: similar to SET domain containing 2 [Ciona intestinalis]
          Length = 2228

 Score =  149 bits (376), Expect = 1e-33,   Method: Composition-based stats.
 Identities = 78/200 (39%), Positives = 113/200 (56%), Gaps = 4/200 (2%)

Query: 47   IKRNIYLTKRIKRRLEDD--GIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCL 104
            I  NIYL ++ K R+  +   + C C  +   +  C  DC   +L+  CS+ C  G  C 
Sbjct: 995  ISDNIYLCEKKKSRVRKEIRRMVCECDNNEDGTP-CGSDCLNRLLMIECSARCPLGEQCQ 1053

Query: 105  NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
            NK FQ +     ++ QT+  G GI A E++  G  V+EY GEV+D Q    R        
Sbjct: 1054 NKRFQRKQYVPTEVFQTKWKGWGIRATENLSPGMLVMEYCGEVLDLQEFGRRSLLYSRGN 1113

Query: 165  ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
            + +FY   +++D +IDAT KGN SR+INHSC PN E QKW ++G  R+G F  RDI KGE
Sbjct: 1114 QQHFYFMALSQDEIIDATTKGNTSRFINHSCDPNCETQKWTVNGRLRVGFFTMRDINKGE 1173

Query: 225  NLTYDYQYE-FLHDSLIAYC 243
             +T+DYQ++ +  ++   YC
Sbjct: 1174 EITFDYQFQRYGKEAQACYC 1193


>gi|302141761|emb|CBI18964.3| unnamed protein product [Vitis vinifera]
          Length = 1958

 Score =  149 bits (376), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 79/190 (41%), Positives = 111/190 (58%), Gaps = 1/190 (0%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
            ++ I+ N++L +  + +  D+ + C C         C  +C   ML   C  G C CG+ 
Sbjct: 1194 WMLIRSNLFLHRSRRTQTIDEVMVCHCKRPVEGRFGCGDECLNRMLNIECVQGTCPCGDL 1253

Query: 103  CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
            C N+ FQ R   K+K  +  K G G+   +DI +G+F+IEYVGEV+D QT E R  +   
Sbjct: 1254 CSNQQFQKRGYAKLKWFKCGKKGYGLQLQQDISQGQFLIEYVGEVLDLQTYEARQKEYAS 1313

Query: 163  LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
             G  +FY   +N   VIDA  KGN  R+INHSC PN   +KW+++GE  IG+FA RDIKK
Sbjct: 1314 RGHKHFYFMTLNGSEVIDACAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKK 1373

Query: 223  GENLTYDYQY 232
            GE +T+DY Y
Sbjct: 1374 GEEVTFDYNY 1383


>gi|312378119|gb|EFR24776.1| hypothetical protein AND_10404 [Anopheles darlingi]
          Length = 2632

 Score =  149 bits (375), Expect = 1e-33,   Method: Composition-based stats.
 Identities = 78/192 (40%), Positives = 108/192 (56%), Gaps = 7/192 (3%)

Query: 47   IKRNIYLTKRIKRRLEDDGIFCSCTAS-----PGSSGVCDRDCHCGMLLSSCSSGCKCGN 101
            I+ NIY + RI  R E   + C C  +      G  G C  DC   +L+  C S C  G 
Sbjct: 1391 IRENIYYSDRIVSR-EAKKMTCDCFLTHEDIERGEMG-CGEDCLNRLLMIECGSRCTVGE 1448

Query: 102  SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
             C N+ FQ +     ++ +TEK G GI A   I  GEF++EYVGEV++    ++R     
Sbjct: 1449 RCTNRRFQRQEYAHCQVFRTEKKGFGIQASAPIAPGEFIMEYVGEVLNGSQFDQRAEAYS 1508

Query: 162  HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
                 ++Y   +  D +IDAT KGN SR+INHSC PN E QKW ++GE RIG F+T+ I 
Sbjct: 1509 RDKNKHYYFMALRSDGIIDATTKGNISRFINHSCDPNAETQKWTVNGELRIGFFSTKYIL 1568

Query: 222  KGENLTYDYQYE 233
             GE +T+DYQ++
Sbjct: 1569 PGEEITFDYQFQ 1580


>gi|332028801|gb|EGI68830.1| Putative histone-lysine N-methyltransferase NSD2 [Acromyrmex
            echinatior]
          Length = 1304

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 82/192 (42%), Positives = 110/192 (57%), Gaps = 2/192 (1%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CK 98
            K  PYV +K N  +   +K    +  + C C     +    + DC   +L+  CS G C 
Sbjct: 874  KPPPYVKLKVNKPVGN-VKPTEVESIVACECDPEWDNPCAPNTDCLNRILMVECSPGICP 932

Query: 99   CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
             G  C+N+ F  R    M+   T   G G+   EDIK G+FVIEYVGE+IDD   + RL 
Sbjct: 933  AGAKCMNQSFVLRQYPAMEPFHTMGRGWGLRTLEDIKTGQFVIEYVGEIIDDAEYKRRLH 992

Query: 159  KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
            + K L   NFY   I+ +  IDA  KGN SR++NHSC PN E QKW ++G+TRIG+FA R
Sbjct: 993  RKKELKNENFYFLTIDNNRTIDAEPKGNLSRFMNHSCAPNCETQKWTVNGDTRIGLFALR 1052

Query: 219  DIKKGENLTYDY 230
            DI+ GE LT++Y
Sbjct: 1053 DIESGEELTFNY 1064


>gi|116309320|emb|CAH66406.1| OSIGBa0093L02.2 [Oryza sativa Indica Group]
          Length = 519

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 81/204 (39%), Positives = 120/204 (58%), Gaps = 12/204 (5%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG------MLLSSCSSG- 96
           Y+ I+ N +L +R KR+ E+D   C C  +     + D D  CG      +  + C+ G 
Sbjct: 15  YIHIETNDFLHRRHKRQKEEDIAVCECQYN-----LLDPDSACGDRCLNVLTSTECTPGY 69

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C CG  C N+ FQ       +LV+TE  G G++ADE+I  G+FV+EY GEVI  +  + R
Sbjct: 70  CLCGVYCKNQRFQKSQYAATRLVKTEGRGWGLLADENIMAGQFVMEYCGEVISWKEAKRR 129

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
               ++ G T+ Y+  +N D  IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA
Sbjct: 130 SQAYENQGLTDAYIIYLNADESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFA 189

Query: 217 TRDIKKGENLTYDYQYEFLHDSLI 240
            +DI  G  L+YDY +E+   +++
Sbjct: 190 KQDIPIGTELSYDYNFEWFGGAMV 213


>gi|240254387|ref|NP_177854.6| histone-lysine N-methyltransferase SETD2 [Arabidopsis thaliana]
 gi|157734196|gb|ABV68921.1| SDG8 [Arabidopsis thaliana]
 gi|332197839|gb|AEE35960.1| histone-lysine N-methyltransferase SETD2 [Arabidopsis thaliana]
          Length = 1805

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 78/190 (41%), Positives = 112/190 (58%), Gaps = 1/190 (0%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
            +  IK N +L +  K +  D+ + C C  SP     C  +C   ML   C  G C  G+ 
Sbjct: 955  FKAIKTNQFLHRNRKSQTIDEIMVCHCKPSPDGRLGCGEECLNRMLNIECLQGTCPAGDL 1014

Query: 103  CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
            C N+ FQ R   K +  Q+ K G G+   ED++ G+F+IEYVGEV+D Q+ E R  +   
Sbjct: 1015 CSNQQFQKRKYVKFERFQSGKKGYGLRLLEDVREGQFLIEYVGEVLDMQSYETRQKEYAF 1074

Query: 163  LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
             G+ +FY   +N + VIDA  KGN  R+INHSC PN   +KW+++GE  +GIF+ +D+KK
Sbjct: 1075 KGQKHFYFMTLNGNEVIDAGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKK 1134

Query: 223  GENLTYDYQY 232
            G+ LT+DY Y
Sbjct: 1135 GQELTFDYNY 1144


>gi|3540208|gb|AAC34358.1| Hypothetical protein [Arabidopsis thaliana]
          Length = 1767

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 78/190 (41%), Positives = 112/190 (58%), Gaps = 1/190 (0%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
            +  IK N +L +  K +  D+ + C C  SP     C  +C   ML   C  G C  G+ 
Sbjct: 955  FKAIKTNQFLHRNRKSQTIDEIMVCHCKPSPDGRLGCGEECLNRMLNIECLQGTCPAGDL 1014

Query: 103  CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
            C N+ FQ R   K +  Q+ K G G+   ED++ G+F+IEYVGEV+D Q+ E R  +   
Sbjct: 1015 CSNQQFQKRKYVKFERFQSGKKGYGLRLLEDVREGQFLIEYVGEVLDMQSYETRQKEYAF 1074

Query: 163  LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
             G+ +FY   +N + VIDA  KGN  R+INHSC PN   +KW+++GE  +GIF+ +D+KK
Sbjct: 1075 KGQKHFYFMTLNGNEVIDAGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKK 1134

Query: 223  GENLTYDYQY 232
            G+ LT+DY Y
Sbjct: 1135 GQELTFDYNY 1144


>gi|328725324|ref|XP_003248429.1| PREDICTED: hypothetical protein LOC100165877 isoform 2
           [Acyrthosiphon pisum]
          Length = 1384

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 93/250 (37%), Positives = 135/250 (54%), Gaps = 32/250 (12%)

Query: 18  NKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKR----------------IKRRL 61
           N+LL+   N +EF+      K KA  +  +  N+Y   R                +K+R+
Sbjct: 393 NQLLEARINDIEFDE-----KLKATNFTLVDENVYRPNRQTLKSYPPSEIPGLRKLKQRI 447

Query: 62  EDDGI--FCSCTASP-----GSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVK 114
               I  +C CT S      G++G CD  C   +L   C  GC     C NK FQN+  K
Sbjct: 448 ISGDINDYCECTLSKEDIIEGNTG-CDDRCLNRLLKVECGLGCSLKRYCTNKQFQNKQFK 506

Query: 115 KMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEIN 174
           K  +++T+  G GI A EDI +G  + EYVGEVID      RL K ++      Y+ ++N
Sbjct: 507 KTNIIKTDNKGYGICAVEDIPKGALISEYVGEVIDYNEMCNRLTKKEYKNLN--YMVQLN 564

Query: 175 RDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEF 234
            D +ID+T KGN +R+INHSC PN+  +KW + G++RIG F+TR I+KGE +T+DY ++ 
Sbjct: 565 PDEIIDSTSKGNVTRFINHSCDPNSVGEKWHVLGQSRIGFFSTRHIEKGEEITFDYSFQI 624

Query: 235 LHD-SLIAYC 243
             D + I YC
Sbjct: 625 FGDGAQICYC 634


>gi|222628880|gb|EEE61012.1| hypothetical protein OsJ_14832 [Oryza sativa Japonica Group]
          Length = 518

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 81/204 (39%), Positives = 120/204 (58%), Gaps = 12/204 (5%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG------MLLSSCSSG- 96
           Y+ I+ N +L +R KR+ E+D   C C  +     + D D  CG      +  + C+ G 
Sbjct: 14  YIHIETNDFLHRRHKRQKEEDIAVCECQYN-----LLDPDSACGDRCLNVLTSTECTPGY 68

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C CG  C N+ FQ       +LV+TE  G G++ADE+I  G+FV+EY GEVI  +  + R
Sbjct: 69  CLCGVYCKNQRFQKSQYAATRLVKTEGRGWGLLADENIMAGQFVMEYCGEVISWKEAKRR 128

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
               ++ G T+ Y+  +N D  IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA
Sbjct: 129 SQAYENQGLTDAYIIYLNADESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFA 188

Query: 217 TRDIKKGENLTYDYQYEFLHDSLI 240
            +DI  G  L+YDY +E+   +++
Sbjct: 189 KQDIPIGTELSYDYNFEWFGGAMV 212


>gi|255078696|ref|XP_002502928.1| set domain protein [Micromonas sp. RCC299]
 gi|226518194|gb|ACO64186.1| set domain protein [Micromonas sp. RCC299]
          Length = 377

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 75/156 (48%), Positives = 96/156 (61%), Gaps = 3/156 (1%)

Query: 80  CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGE 138
           C  +C     L++C    C CG SC N+PF      K   + TE  G G+   E +K G 
Sbjct: 50  CGAECFNRTCLTTCDPRVCPCGPSCSNRPFHQLKSPKTDTLLTENRGWGLFLAEPVKAGT 109

Query: 139 FVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPN 198
           F++EYVGE++D+ T E+RLW+ K  GE NFYL E+  +  IDA YKGN SR+IN SC PN
Sbjct: 110 FIVEYVGEILDEHTTEKRLWEDKKRGEDNFYLMEVMPNQCIDARYKGNLSRFINSSCHPN 169

Query: 199 TEMQKW--IIDGETRIGIFATRDIKKGENLTYDYQY 232
            E QKW     GETR+GIFA +DI +G  LTYDY +
Sbjct: 170 CETQKWQDSATGETRVGIFAIQDIPEGTELTYDYNF 205


>gi|328725322|ref|XP_001947915.2| PREDICTED: hypothetical protein LOC100165877 isoform 3
           [Acyrthosiphon pisum]
          Length = 1568

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 93/250 (37%), Positives = 135/250 (54%), Gaps = 32/250 (12%)

Query: 18  NKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKR----------------IKRRL 61
           N+LL+   N +EF+      K KA  +  +  N+Y   R                +K+R+
Sbjct: 393 NQLLEARINDIEFDE-----KLKATNFTLVDENVYRPNRQTLKSYPPSEIPGLRKLKQRI 447

Query: 62  EDDGI--FCSCTASP-----GSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVK 114
               I  +C CT S      G++G CD  C   +L   C  GC     C NK FQN+  K
Sbjct: 448 ISGDINDYCECTLSKEDIIEGNTG-CDDRCLNRLLKVECGLGCSLKRYCTNKQFQNKQFK 506

Query: 115 KMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEIN 174
           K  +++T+  G GI A EDI +G  + EYVGEVID      RL K ++      Y+ ++N
Sbjct: 507 KTNIIKTDNKGYGICAVEDIPKGALISEYVGEVIDYNEMCNRLTKKEYKNLN--YMVQLN 564

Query: 175 RDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEF 234
            D +ID+T KGN +R+INHSC PN+  +KW + G++RIG F+TR I+KGE +T+DY ++ 
Sbjct: 565 PDEIIDSTSKGNVTRFINHSCDPNSVGEKWHVLGQSRIGFFSTRHIEKGEEITFDYSFQI 624

Query: 235 LHD-SLIAYC 243
             D + I YC
Sbjct: 625 FGDGAQICYC 634


>gi|145327721|ref|NP_001077836.1| histone-lysine N-methyltransferase SETD2 [Arabidopsis thaliana]
 gi|332197840|gb|AEE35961.1| histone-lysine N-methyltransferase SETD2 [Arabidopsis thaliana]
          Length = 1501

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 78/190 (41%), Positives = 112/190 (58%), Gaps = 1/190 (0%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
            +  IK N +L +  K +  D+ + C C  SP     C  +C   ML   C  G C  G+ 
Sbjct: 955  FKAIKTNQFLHRNRKSQTIDEIMVCHCKPSPDGRLGCGEECLNRMLNIECLQGTCPAGDL 1014

Query: 103  CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
            C N+ FQ R   K +  Q+ K G G+   ED++ G+F+IEYVGEV+D Q+ E R  +   
Sbjct: 1015 CSNQQFQKRKYVKFERFQSGKKGYGLRLLEDVREGQFLIEYVGEVLDMQSYETRQKEYAF 1074

Query: 163  LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
             G+ +FY   +N + VIDA  KGN  R+INHSC PN   +KW+++GE  +GIF+ +D+KK
Sbjct: 1075 KGQKHFYFMTLNGNEVIDAGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKK 1134

Query: 223  GENLTYDYQY 232
            G+ LT+DY Y
Sbjct: 1135 GQELTFDYNY 1144


>gi|94707110|sp|Q2LAE1.1|ASHH2_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHH2; AltName:
            Full=ASH1 homolog 2; AltName: Full=H3-K4-HMTase; AltName:
            Full=Histone H3-K36 methyltransferase 8;
            Short=H3-K36-HMTase 8; AltName: Full=Protein EARLY
            FLOWERING IN SHORT DAYS; AltName: Full=Protein SET DOMAIN
            GROUP 8
 gi|85036158|gb|ABC69038.1| SDG8 [Arabidopsis thaliana]
          Length = 1759

 Score =  148 bits (373), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 78/190 (41%), Positives = 112/190 (58%), Gaps = 1/190 (0%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
            +  IK N +L +  K +  D+ + C C  SP     C  +C   ML   C  G C  G+ 
Sbjct: 955  FKAIKTNQFLHRNRKSQTIDEIMVCHCKPSPDGRLGCGEECLNRMLNIECLQGTCPAGDL 1014

Query: 103  CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
            C N+ FQ R   K +  Q+ K G G+   ED++ G+F+IEYVGEV+D Q+ E R  +   
Sbjct: 1015 CSNQQFQKRKYVKFERFQSGKKGYGLRLLEDVREGQFLIEYVGEVLDMQSYETRQKEYAF 1074

Query: 163  LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
             G+ +FY   +N + VIDA  KGN  R+INHSC PN   +KW+++GE  +GIF+ +D+KK
Sbjct: 1075 KGQKHFYFMTLNGNEVIDAGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKK 1134

Query: 223  GENLTYDYQY 232
            G+ LT+DY Y
Sbjct: 1135 GQELTFDYNY 1144


>gi|218194855|gb|EEC77282.1| hypothetical protein OsI_15918 [Oryza sativa Indica Group]
          Length = 472

 Score =  147 bits (372), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 81/204 (39%), Positives = 120/204 (58%), Gaps = 12/204 (5%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG------MLLSSCSSG- 96
           Y+ I+ N +L +R KR+ E+D   C C  +     + D D  CG      +  + C+ G 
Sbjct: 9   YIHIETNDFLHRRHKRQKEEDIAVCECQYN-----LLDPDSACGDRCLNVLTSTECTPGY 63

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C CG  C N+ FQ       +LV+TE  G G++ADE+I  G+FV+EY GEVI  +  + R
Sbjct: 64  CLCGVYCKNQRFQKSQYAATRLVKTEGRGWGLLADENIMAGQFVMEYCGEVISWKEAKRR 123

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
               ++ G T+ Y+  +N D  IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA
Sbjct: 124 SQAYENQGLTDAYIIYLNADESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFA 183

Query: 217 TRDIKKGENLTYDYQYEFLHDSLI 240
            +DI  G  L+YDY +E+   +++
Sbjct: 184 KQDIPIGTELSYDYNFEWFGGAMV 207


>gi|348527268|ref|XP_003451141.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Oreochromis
            niloticus]
          Length = 1605

 Score =  147 bits (371), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 79/194 (40%), Positives = 112/194 (57%), Gaps = 4/194 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY FIK N  + K ++  + D      C   P     C  +  C   ML   C    
Sbjct: 1229 KPPPYKFIKSNKPVGK-VQVHVADLSEIQRCNCKPADEHPCSLESQCLNRMLQYECHPQV 1287

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+SC N+ F  R   + ++++TE  G G+  ++ +K+G+FV EYVGEVID + C++R
Sbjct: 1288 CPAGDSCENQCFSKRLYAETEVIKTEGRGWGLRTNQALKKGDFVTEYVGEVIDSEECQQR 1347

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D VIDA  KGN SR+INHSC PN E QKW ++G+ RIGIFA
Sbjct: 1348 IKRAHENHVTNFYMLTLTKDRVIDAGPKGNSSRFINHSCSPNCETQKWTVNGDVRIGIFA 1407

Query: 217  TRDIKKGENLTYDY 230
              DI+ G  LT++Y
Sbjct: 1408 LCDIEAGTELTFNY 1421


>gi|449665927|ref|XP_002164851.2| PREDICTED: histone-lysine N-methyltransferase NSD2-like [Hydra
           magnipapillata]
          Length = 1214

 Score =  147 bits (371), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 89/221 (40%), Positives = 124/221 (56%), Gaps = 5/221 (2%)

Query: 16  AFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPG 75
           AF +L KQ     E        KP+  P+  IK N  ++    R + D   +  C  S  
Sbjct: 755 AFAELEKQRAEVSEILGRKITNKPQ--PFKLIKSNKPVS--CIRNILDQSEWPVCECSKE 810

Query: 76  SSGVCDRDCHCGMLLSSCSS-GCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDI 134
           +    D +C   MLL  C++  C  GN C N+  Q    KK    + E  G G++AD DI
Sbjct: 811 TFCSSDSECLNRMLLFECNAKTCPAGNLCQNQQIQKNESKKCHPFKCEGRGWGLMADTDI 870

Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
           K+GEFVIEYVGE+ID++TC  R+ +       ++Y   I++D +IDA  KGN SR++NHS
Sbjct: 871 KQGEFVIEYVGELIDEETCHRRVREYHEKDIFDYYFLTIDKDNIIDAYPKGNMSRFMNHS 930

Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
           C PN E QKW ++GE R+ +FATRDIK GE L ++Y  + L
Sbjct: 931 CNPNCETQKWTVNGEIRVALFATRDIKMGEELCFNYNLDSL 971


>gi|159473232|ref|XP_001694743.1| histone methyltransferase [Chlamydomonas reinhardtii]
 gi|158276555|gb|EDP02327.1| histone methyltransferase [Chlamydomonas reinhardtii]
          Length = 1105

 Score =  147 bits (370), Expect = 6e-33,   Method: Composition-based stats.
 Identities = 81/211 (38%), Positives = 115/211 (54%), Gaps = 4/211 (1%)

Query: 37  IKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS 94
           +K + IP V+  +K N+   ++ K + EDD + C C         C  DC   ML   C 
Sbjct: 119 LKRRRIPAVWQLLKDNVLTHRKRKVQDEDDIMICHCKPVWRGGDGCGPDCINRMLCIECV 178

Query: 95  SG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTC 153
            G C   + C N+ F  R    +++ +    G G+ A EDIK G+F+IEY+GEV+++   
Sbjct: 179 PGFCPSEDKCTNQMFSKRMYANLEIRRAGAKGFGLFALEDIKAGQFIIEYIGEVLEEDEY 238

Query: 154 EERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIG 213
           + R      +G+ ++Y   I    VIDA  KGN SR+INHSC PN E QKW++ GE  IG
Sbjct: 239 QRRKEYYMSVGQRHYYFMNIGNGEVIDACRKGNISRFINHSCEPNCETQKWLVHGELAIG 298

Query: 214 IFATRDIKKGENLTYDYQYEFLHDS-LIAYC 243
           +FA RDI K   LT+DY +E   D  +  YC
Sbjct: 299 LFAVRDIPKDTELTFDYNFERYGDKPMRCYC 329


>gi|321468162|gb|EFX79148.1| hypothetical protein DAPPUDRAFT_319776 [Daphnia pulex]
          Length = 1408

 Score =  146 bits (369), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 78/205 (38%), Positives = 114/205 (55%), Gaps = 11/205 (5%)

Query: 36  FIKPKAIPYVFIKRNIYLTKR-----IKRRLEDDGIFCSCTASPGSSGV--CDRDCHCGM 88
            I+ +   +  ++ N YLT R     +KR L D    CS T    + G   C  DC   +
Sbjct: 386 IIQERLKSFEIVEENQYLTSRKTSKEVKRMLCD----CSLTKEEIARGELGCGEDCINRL 441

Query: 89  LLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVI 148
           L+  C   C+    C NK FQ R   K+++  TEK G G+ A +D+  G+F+IEYVGEVI
Sbjct: 442 LMIECGPRCQLAARCTNKRFQKRQYGKIEVFNTEKKGVGLRALQDMDPGDFIIEYVGEVI 501

Query: 149 DDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDG 208
           D +    R          ++Y   +  D +IDAT +GN SR+INHSC PN E QKW ++G
Sbjct: 502 DPREFHRRAKDYAREKNKHYYFMALKSDAIIDATQQGNVSRFINHSCDPNAETQKWTVNG 561

Query: 209 ETRIGIFATRDIKKGENLTYDYQYE 233
           + R+G FA + +K G+ +T+DYQ++
Sbjct: 562 DLRVGFFARKSLKSGDEVTFDYQFQ 586


>gi|195478285|ref|XP_002100470.1| GE17076 [Drosophila yakuba]
 gi|194187994|gb|EDX01578.1| GE17076 [Drosophila yakuba]
          Length = 2397

 Score =  146 bits (369), Expect = 7e-33,   Method: Composition-based stats.
 Identities = 81/205 (39%), Positives = 110/205 (53%), Gaps = 6/205 (2%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSC--TASPGSSG--VCDRDCHCGMLLSSCSSGCKC 99
            +  +K N Y   R +   E+  + C C  T    + G   C   C   ML+  C   C  
Sbjct: 1355 FQLLKENYYRCAR-QVSQENAEMQCDCFLTGDEEAQGHLSCGAGCINRMLMIECGPLCTN 1413

Query: 100  GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
            G  C NK FQ       ++ +TEK G GI A+  I  GEF++EYVGEVID +  E R   
Sbjct: 1414 GARCTNKRFQQHQCWPCRVFRTEKKGCGITAELQIPPGEFIMEYVGEVIDSEEFERRQHL 1473

Query: 160  MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
                   ++Y   +  + +IDAT KGN SRYINHSC PN E QKW ++GE RIG F+ + 
Sbjct: 1474 YSKDRNRHYYFMALRGEAIIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKP 1533

Query: 220  IKKGENLTYDYQYE-FLHDSLIAYC 243
            I+ GE +T+DYQY+ +  D+   YC
Sbjct: 1534 IQPGEEITFDYQYQRYGRDAQRCYC 1558


>gi|194766778|ref|XP_001965501.1| GF22528 [Drosophila ananassae]
 gi|190619492|gb|EDV35016.1| GF22528 [Drosophila ananassae]
          Length = 2414

 Score =  146 bits (368), Expect = 8e-33,   Method: Composition-based stats.
 Identities = 80/202 (39%), Positives = 111/202 (54%), Gaps = 6/202 (2%)

Query: 47   IKRNIYLTKRIKRRLEDDGIFCSC--TASPGSSG--VCDRDCHCGMLLSSCSSGCKCGNS 102
            ++ N Y   R +   E+  + C C  T    + G   C   C   ML+  C   C  G+ 
Sbjct: 1354 LRENYYRCAR-QVSQENAEMQCDCFLTGDEEAQGHLCCGSGCINRMLMIECGPLCSNGDR 1412

Query: 103  CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
            C NK FQ       ++ +TEK G GI A+  I  GEF++EYVGEVID +  E R      
Sbjct: 1413 CTNKRFQQHQCWPCRVFRTEKKGCGITAELQIPPGEFIMEYVGEVIDSEEFERRQHLYSR 1472

Query: 163  LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
              + ++Y   +  + +IDAT KGN SRYINHSC PN E QKW ++GE RIG F+ + I+ 
Sbjct: 1473 DRKRHYYFMALRGEAIIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKTIQP 1532

Query: 223  GENLTYDYQYE-FLHDSLIAYC 243
            GE +T+DYQY+ +  D+   YC
Sbjct: 1533 GEEITFDYQYQRYGRDAQRCYC 1554


>gi|224084984|ref|XP_002307459.1| SET domain protein [Populus trichocarpa]
 gi|222856908|gb|EEE94455.1| SET domain protein [Populus trichocarpa]
          Length = 594

 Score =  146 bits (368), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 81/194 (41%), Positives = 106/194 (54%), Gaps = 1/194 (0%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CK 98
           K   +  I  N +L +  K +  D+ + C C A       C  +C   ML   C  G C 
Sbjct: 86  KEYEFTRITTNQFLHRSRKTQTIDEIMVCYCKAPVAGRLGCGDECLNRMLNIECVQGTCP 145

Query: 99  CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
           CG+ C N+ FQ R   KM   +  K G G+  DEDI RG+F+IEYVGEV+D    E R  
Sbjct: 146 CGDHCSNQQFQKRNYAKMTWERCGKKGFGLRLDEDISRGQFLIEYVGEVLDVHAYEARQK 205

Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
                G  +FY   ++   VIDA  KGN  R+INHSC PN   +KW+++GE  IG+FA R
Sbjct: 206 DYASKGHKHFYFMTLDGSEVIDACAKGNLGRFINHSCDPNCRTEKWVVNGEICIGLFALR 265

Query: 219 DIKKGENLTYDYQY 232
           DIK GE +T+DY Y
Sbjct: 266 DIKMGEEVTFDYNY 279


>gi|332026544|gb|EGI66662.1| Histone-lysine N-methyltransferase SETD2 [Acromyrmex echinatior]
          Length = 1841

 Score =  146 bits (368), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 82/216 (37%), Positives = 118/216 (54%), Gaps = 7/216 (3%)

Query: 23  QIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASP-----GSS 77
           +I  P+  EL D  ++ +   +  +  N+YLT+R   + E   + C C  +      G  
Sbjct: 778 KISLPMVLELEDREMEERLSQFEHLYENLYLTERYTNK-ETKRMVCDCFLTEEEIERGEL 836

Query: 78  GVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRG 137
           G C  DC   +L+  C   C  G+ C NK FQN    K ++ +TEK G G+ A  DI  G
Sbjct: 837 G-CGEDCLNRLLMIECGPRCVVGDRCTNKRFQNCQYAKCEVFRTEKKGFGLRAVVDIMAG 895

Query: 138 EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
           EF++EYVGEV+D +    R  +       ++Y   +  D +IDAT KGN SR+INHSC P
Sbjct: 896 EFIMEYVGEVVDPKDFRRRAKEYSKDKNRHYYFMALKSDQIIDATMKGNISRFINHSCDP 955

Query: 198 NTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           N E QKW ++GE RIG F  + I  GE +T+DY ++
Sbjct: 956 NAETQKWTVNGELRIGFFNKKFIAAGEEITFDYHFQ 991


>gi|350421470|ref|XP_003492853.1| PREDICTED: hypothetical protein LOC100746901 [Bombus impatiens]
          Length = 1777

 Score =  145 bits (367), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 79/212 (37%), Positives = 119/212 (56%), Gaps = 7/212 (3%)

Query: 27  PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASP-----GSSGVCD 81
           P+  E+ D  ++ +   + +++ N+YLT+R   + E   + C C  +      G  G C 
Sbjct: 731 PMVPEIKDREMEERLSQFEYLRENLYLTERYTNK-ETKRMVCDCFLTEEEIERGELG-CG 788

Query: 82  RDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVI 141
            DC   +L+  C   C  G+ C NK FQN    K ++ +TEK G G+ A  D+  GEF++
Sbjct: 789 EDCLNRLLMIECGPRCVVGDRCTNKRFQNCEYAKCEVFRTEKKGFGLRAMVDLLAGEFIM 848

Query: 142 EYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
           EYVGEV+D +    R  +       ++Y   +  D +IDAT KGN SR+INHSC PN+E 
Sbjct: 849 EYVGEVVDPKDFRRRAKEYSKDKNKHYYFMALKSDQIIDATLKGNVSRFINHSCDPNSET 908

Query: 202 QKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           QKW ++GE RIG F  + I  GE +T+DY ++
Sbjct: 909 QKWTVNGELRIGFFNKKFIAAGEEITFDYHFQ 940


>gi|47225482|emb|CAG11965.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1625

 Score =  145 bits (366), Expect = 1e-32,   Method: Composition-based stats.
 Identities = 77/205 (37%), Positives = 115/205 (56%), Gaps = 9/205 (4%)

Query: 38  KPKAIPYVF--IKRNIYLTKRIKRRLEDD--GIFCSCTASPG---SSGV--CDRDCHCGM 88
           K K +P  F  I+ N+YLT+R K +   D   + C C   P    S G   C  DC   +
Sbjct: 207 KSKKMPPYFDLIEENLYLTERKKSKSHRDIKRMQCECPVLPREERSKGAMACGEDCLNRL 266

Query: 89  LLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVI 148
           L+  CSS C+ G  C N+ FQ R   +  ++ TE  G G+ A +D+    FV+EY GEV+
Sbjct: 267 LMIECSSRCQNGAYCSNRRFQMRQHAEFDVILTENKGWGLRAAKDLPSNTFVLEYCGEVL 326

Query: 149 DDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDG 208
           D +  + R+ +       ++Y   +  + +IDAT KGN SR++NHSC PN E QKW ++G
Sbjct: 327 DHKEFKTRVKEYARNKNIHYYFMSLKNNEIIDATLKGNLSRFMNHSCEPNCETQKWTVNG 386

Query: 209 ETRIGIFATRDIKKGENLTYDYQYE 233
           + R+G F T+ +  G  LT+DYQ++
Sbjct: 387 QLRVGFFTTKAVTAGTELTFDYQFQ 411


>gi|167526044|ref|XP_001747356.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774191|gb|EDQ87823.1| predicted protein [Monosiga brevicollis MX1]
          Length = 734

 Score =  145 bits (366), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 86/224 (38%), Positives = 118/224 (52%), Gaps = 21/224 (9%)

Query: 17  FNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS 76
             +L +Q    V +   DW      I Y  I+  IY  K  K  LE+D + C C      
Sbjct: 28  LQQLREQHKKAVHYTANDW------IEYEQIEAPIYKAKMHK--LEED-MECEC-----Q 73

Query: 77  SGVCDRDCHCG------MLLSSCS-SGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIV 129
               D D +CG      +L+  C+ + C CGN C N+  Q +   ++++ +TEK G G+ 
Sbjct: 74  PNAADEDSYCGSNCLNRLLMVECNVARCPCGNKCRNRRLQKQQHARVEIFKTEKKGWGLR 133

Query: 130 ADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSR 189
           A E I++G+F+ EY GEV D     ER  +    G  ++Y   ++ D VIDAT KG  SR
Sbjct: 134 ALEPIRKGDFIYEYCGEVFDQAVFRERQLEYAQEGRFHYYFMSLSADTVIDATRKGAVSR 193

Query: 190 YINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           +INHSC PN E QKW + G  RIG F  RDI   E +T+DYQYE
Sbjct: 194 FINHSCDPNAETQKWTVGGVLRIGFFCIRDIAVNEEITFDYQYE 237


>gi|194895514|ref|XP_001978270.1| GG17783 [Drosophila erecta]
 gi|190649919|gb|EDV47197.1| GG17783 [Drosophila erecta]
          Length = 2384

 Score =  145 bits (366), Expect = 1e-32,   Method: Composition-based stats.
 Identities = 81/205 (39%), Positives = 110/205 (53%), Gaps = 6/205 (2%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSC--TASPGSSG--VCDRDCHCGMLLSSCSSGCKC 99
            +  +K N Y   R +   E+  + C C  T    + G   C   C   ML+  C   C  
Sbjct: 1342 FQLLKENYYRCAR-QVSQENAEMQCDCFLTGDEEAQGHLSCGAGCINRMLMIECGPLCTN 1400

Query: 100  GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
            G  C NK FQ       ++ +TEK G GI A+  I  GEF++EYVGEVID +  E R   
Sbjct: 1401 GARCTNKRFQQHQCWPCRVFRTEKKGCGITAELQIPPGEFIMEYVGEVIDSEEFERRQHL 1460

Query: 160  MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
                   ++Y   +  + +IDAT KGN SRYINHSC PN E QKW ++GE RIG F+ + 
Sbjct: 1461 YSKDRNRHYYFMALRGEAIIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKP 1520

Query: 220  IKKGENLTYDYQYE-FLHDSLIAYC 243
            I+ GE +T+DYQY+ +  D+   YC
Sbjct: 1521 IQPGEEITFDYQYQRYGRDAQRCYC 1545


>gi|340726897|ref|XP_003401788.1| PREDICTED: hypothetical protein LOC100652142 [Bombus terrestris]
          Length = 1777

 Score =  145 bits (366), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 79/212 (37%), Positives = 119/212 (56%), Gaps = 7/212 (3%)

Query: 27  PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASP-----GSSGVCD 81
           P+  E+ D  ++ +   + +++ N+YLT+R   + E   + C C  +      G  G C 
Sbjct: 731 PMVPEIKDREMEERLSQFEYLRENLYLTERYTNK-ETKRMVCDCFLTEEEIERGELG-CG 788

Query: 82  RDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVI 141
            DC   +L+  C   C  G+ C NK FQN    K ++ +TEK G G+ A  D+  GEF++
Sbjct: 789 EDCLNRLLMIECGPRCVVGDRCTNKRFQNCEYAKCEVFRTEKKGFGLRAMVDLLAGEFIM 848

Query: 142 EYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
           EYVGEV+D +    R  +       ++Y   +  D +IDAT KGN SR+INHSC PN+E 
Sbjct: 849 EYVGEVVDPKDFRRRAKEYSKDKNKHYYFMALKSDQIIDATLKGNVSRFINHSCDPNSET 908

Query: 202 QKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           QKW ++GE RIG F  + I  GE +T+DY ++
Sbjct: 909 QKWTVNGELRIGFFNKKFIAAGEEITFDYHFQ 940


>gi|302801428|ref|XP_002982470.1| hypothetical protein SELMODRAFT_421873 [Selaginella moellendorffii]
 gi|300149569|gb|EFJ16223.1| hypothetical protein SELMODRAFT_421873 [Selaginella moellendorffii]
          Length = 1285

 Score =  145 bits (366), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 78/204 (38%), Positives = 112/204 (54%), Gaps = 1/204 (0%)

Query: 41  AIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKC 99
            + +  I+ NI+  +  K   +DD + C C      S  C  DC   M+   CS   C C
Sbjct: 501 GVSWTQIRHNIFQHRHQKTLDDDDTLICLCKPPKDGSPGCGEDCLNRMVNVECSPDTCPC 560

Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
           G  C N+ F  R    + LV+  K G G+ A E+I +G FVIEYVGEV+D ++ E R  +
Sbjct: 561 GERCSNQQFGKREYSNVALVRCGKKGFGLKALENIAKGSFVIEYVGEVLDSRSFELRQKE 620

Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
                + +FY   +N   VIDA  KGN  R+INHSC PN + +KW ++GE  IG+FA RD
Sbjct: 621 YARQRQKHFYFMTLNSSEVIDACRKGNLGRFINHSCEPNCQTEKWCVNGEICIGLFAIRD 680

Query: 220 IKKGENLTYDYQYEFLHDSLIAYC 243
           + K E +T++Y +E L+ +    C
Sbjct: 681 VAKNEEITFNYNFERLYGAAAKKC 704


>gi|224063022|ref|XP_002300966.1| SET domain protein [Populus trichocarpa]
 gi|222842692|gb|EEE80239.1| SET domain protein [Populus trichocarpa]
          Length = 605

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 82/195 (42%), Positives = 109/195 (55%), Gaps = 2/195 (1%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS-GVCDRDCHCGMLLSSCSSG-C 97
           K   +  I  N +L +  K +  D+ + C C A  G   G C  +C   ML   C  G C
Sbjct: 51  KEYEFTRITTNQFLHRTRKTQTIDEIMVCYCKAPVGGRLGGCGDECLNRMLNIECVQGTC 110

Query: 98  KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
            CG+ C N+ FQ     KM   +  K G G+  +EDI RG+F+IEYVGEV+D    E R 
Sbjct: 111 PCGDLCSNQQFQKHNYAKMTWDRCGKKGFGLRLEEDITRGQFLIEYVGEVLDVHAYEARQ 170

Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
            +    G  +FY   ++   VIDA  KGN  R+INHSC PN   +KW+++GE  IG+FA 
Sbjct: 171 KEYASKGHKHFYFMTLDGSEVIDACVKGNLGRFINHSCDPNCRTEKWVVNGEICIGLFAL 230

Query: 218 RDIKKGENLTYDYQY 232
           RDIKKGE +T+DY Y
Sbjct: 231 RDIKKGEEVTFDYNY 245


>gi|383860108|ref|XP_003705533.1| PREDICTED: uncharacterized protein LOC100883855 [Megachile
           rotundata]
          Length = 1766

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 79/212 (37%), Positives = 118/212 (55%), Gaps = 7/212 (3%)

Query: 27  PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASP-----GSSGVCD 81
           P+  E+ D  ++ +   + ++  N+YLT+R   + E   + C C  +      G  G C 
Sbjct: 723 PMVPEIKDREMEERLSQFEYLSENLYLTERYTNK-ETKRMVCDCFLTEEEIERGELG-CG 780

Query: 82  RDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVI 141
            DC   +L+  C   C  G+ C NK FQN    K ++ +TEK G G+ A  D+  GEF++
Sbjct: 781 EDCLNRLLMIECGPRCVVGDRCTNKRFQNCEYAKCEVFRTEKKGFGLRAMADMLAGEFIM 840

Query: 142 EYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
           EYVGEV+D +    R  +       ++Y   +  D +IDAT KGN SR+INHSC PN+E 
Sbjct: 841 EYVGEVVDPKDFRRRAKEYSKDKNRHYYFMALKSDQIIDATMKGNVSRFINHSCDPNSET 900

Query: 202 QKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           QKW ++GE RIG F  + I  GE +T+DY ++
Sbjct: 901 QKWTVNGELRIGFFNKKFIAAGEEITFDYHFQ 932


>gi|242073096|ref|XP_002446484.1| hypothetical protein SORBIDRAFT_06g016720 [Sorghum bicolor]
 gi|241937667|gb|EES10812.1| hypothetical protein SORBIDRAFT_06g016720 [Sorghum bicolor]
          Length = 521

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 81/198 (40%), Positives = 115/198 (58%), Gaps = 12/198 (6%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG------MLLSSCSSG- 96
           Y+ I RN +  ++ KR+ E+D   C C        + D +  CG      +  + C+ G 
Sbjct: 13  YIHIDRNDFSYRKHKRQTEEDIAVCECQYD-----ILDPESPCGDRCLNLLTNTECTPGY 67

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C+CG  C N+ FQ     + +LV+TE  G G+VADE+I  G+FVIEY GEVI  +  + R
Sbjct: 68  CRCGVYCKNQRFQKCQYARTRLVRTEGRGWGLVADENIMAGQFVIEYCGEVISWKESKRR 127

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
               +  G  + Y+  +N D  IDAT KGN +R+INHSC PN E +KW + GE R+GIFA
Sbjct: 128 AQAYETQGLKDAYIIYLNADESIDATRKGNFARFINHSCQPNCETRKWNVLGEVRVGIFA 187

Query: 217 TRDIKKGENLTYDYQYEF 234
            +DI  G  L+YDY +E+
Sbjct: 188 KQDIPFGTELSYDYNFEW 205


>gi|359078405|ref|XP_002697155.2| PREDICTED: histone-lysine N-methyltransferase SETD2 [Bos taurus]
          Length = 1448

 Score =  145 bits (365), Expect = 2e-32,   Method: Composition-based stats.
 Identities = 75/207 (36%), Positives = 115/207 (55%), Gaps = 9/207 (4%)

Query: 36  FIKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHC 86
           + K   +P  F  I+ N+YLT+R K +   D     C  +P S          C  DC  
Sbjct: 386 YAKQGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLN 445

Query: 87  GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
            +L+  CSS C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GE
Sbjct: 446 RLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 505

Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
           V+D +  + R+ +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 506 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 565

Query: 207 DGETRIGIFATRDIKKGENLTYDYQYE 233
           +G+ R+G F T+ +  G  LT+DYQ++
Sbjct: 566 NGQLRVGFFTTKLVPSGSELTFDYQFQ 592


>gi|380019005|ref|XP_003693408.1| PREDICTED: uncharacterized protein LOC100869667 [Apis florea]
          Length = 1392

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 80/216 (37%), Positives = 120/216 (55%), Gaps = 7/216 (3%)

Query: 23  QIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASP-----GSS 77
           +I  P+  E+ D  ++ +   +  ++ N+YLT+R   + E   + C C  +      G  
Sbjct: 344 RISLPMVSEIEDREMEERLSQFENLRENLYLTERYTNK-ETKRMVCDCFLTEEEIERGEL 402

Query: 78  GVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRG 137
           G C  DC   +L+  C   C  G+ C NK FQN    K ++ +TEK G G+ A  D+  G
Sbjct: 403 G-CGEDCLNRLLMIECGPRCVVGDRCTNKRFQNCEYAKCEVFRTEKKGFGLRAMVDLLAG 461

Query: 138 EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
           EF++EYVGEV+D +    R  +       ++Y   +  D +IDAT KGN SR+INHSC P
Sbjct: 462 EFIMEYVGEVVDPKDFRRRAKEYSKDKNKHYYFMALKSDQIIDATMKGNVSRFINHSCDP 521

Query: 198 NTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           N+E QKW ++GE RIG F  + I  GE +T+DY ++
Sbjct: 522 NSETQKWTVNGELRIGFFNKKFIAAGEEITFDYHFQ 557


>gi|449492020|ref|XP_004174653.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            SETD2 [Taeniopygia guttata]
          Length = 2489

 Score =  144 bits (364), Expect = 3e-32,   Method: Composition-based stats.
 Identities = 76/207 (36%), Positives = 117/207 (56%), Gaps = 9/207 (4%)

Query: 36   FIKPKAIPYVF--IKRNIYLTKRIKRRLEDD--GIFCSC---TASPGSSG--VCDRDCHC 86
            F K   +P  F  I+ N+YLT+R K +   D   + C C   +    + G   C  DC  
Sbjct: 1447 FAKQGKMPCYFDLIEENVYLTERKKNKSHRDIKRMLCECPPLSKEERAQGEVACGEDCLN 1506

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
             +L+  CSS C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GE
Sbjct: 1507 RLLMIECSSRCPNGDYCSNRRFQKKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1566

Query: 147  VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
            V+D +  + R+ +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1567 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1626

Query: 207  DGETRIGIFATRDIKKGENLTYDYQYE 233
            +G+ R+G F T+ +  G  LT+DYQ++
Sbjct: 1627 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1653


>gi|195012609|ref|XP_001983710.1| GH16034 [Drosophila grimshawi]
 gi|193897192|gb|EDV96058.1| GH16034 [Drosophila grimshawi]
          Length = 2059

 Score =  144 bits (364), Expect = 3e-32,   Method: Composition-based stats.
 Identities = 81/206 (39%), Positives = 108/206 (52%), Gaps = 7/206 (3%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSC-----TASPGSSGVCDRDCHCGMLLSSCSSGCK 98
            +  ++ N Y   R   R E+  + C C       S G    C   C   ML+  C   C 
Sbjct: 1084 FQMLRDNYYRCARQVSR-ENAEMQCDCFVTGDEESLGQMRCCGDGCINRMLMIECGPLCT 1142

Query: 99   CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
             G  C NK FQ       ++ +TEK G GI A+  +  GEF++EYVGEVID +  E R  
Sbjct: 1143 NGERCTNKRFQQHQGWPCRVFRTEKKGCGITAELQMPSGEFIMEYVGEVIDSEEFERRQH 1202

Query: 159  KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
                    ++Y   +  D +IDAT KGN SRYINHSC PN E QKW ++GE RIG F+ +
Sbjct: 1203 LYSEDRNRHYYFMALRSDSIIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSLK 1262

Query: 219  DIKKGENLTYDYQYE-FLHDSLIAYC 243
             I  GE +T+DYQY+ +  D+   YC
Sbjct: 1263 TIMPGEEITFDYQYQRYGRDAQRCYC 1288


>gi|322788177|gb|EFZ13959.1| hypothetical protein SINV_06678 [Solenopsis invicta]
          Length = 1093

 Score =  144 bits (364), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 73/149 (48%), Positives = 95/149 (63%), Gaps = 1/149 (0%)

Query: 83   DCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVI 141
            DC   +L+  CS G C   + C+N+ F  R    M+   TE  G G+ A EDI  G+FVI
Sbjct: 858  DCLNRILMVECSPGICPAEDKCMNQSFVLRQYPAMEPFHTEGRGWGLRALEDIAAGKFVI 917

Query: 142  EYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
            EYVGE+IDD   + RL + K L   NFY   I+ +  IDA  KGN SR++NHSC PN E 
Sbjct: 918  EYVGEIIDDAEYKRRLHRKKELKNENFYFLTIDNNRTIDAEPKGNLSRFMNHSCAPNCET 977

Query: 202  QKWIIDGETRIGIFATRDIKKGENLTYDY 230
            QKW ++G+TRIG+FA RDI+ GE LT++Y
Sbjct: 978  QKWTVNGDTRIGLFALRDIESGEELTFNY 1006


>gi|345480373|ref|XP_001606723.2| PREDICTED: hypothetical protein LOC100123115 [Nasonia vitripennis]
          Length = 1746

 Score =  144 bits (364), Expect = 3e-32,   Method: Composition-based stats.
 Identities = 74/192 (38%), Positives = 110/192 (57%), Gaps = 7/192 (3%)

Query: 47  IKRNIYLTKRIKRRLEDDGIFCSCTASP-----GSSGVCDRDCHCGMLLSSCSSGCKCGN 101
           +K N+YLT+R   + E   + C C  +      G  G C  DC   +L+  C S C  G+
Sbjct: 775 LKENLYLTERFTSK-ETKRMVCECFLTEEEFQRGELG-CGEDCLNRLLMIECGSRCVVGD 832

Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
            C NK FQN      ++ +TEK G G+ A  +++ G+F++EYVGEV+D +   +R  +  
Sbjct: 833 RCTNKRFQNCEYANCEVFRTEKKGFGLRATTNLEAGDFIMEYVGEVLDPKDFRKRAKEYS 892

Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
                ++Y   +  D +IDAT KGN SR+INHSC PN E QKW ++GE RIG F  + + 
Sbjct: 893 KDKNRHYYFMALKSDQIIDATMKGNISRFINHSCDPNAETQKWTVNGELRIGFFNKKFVA 952

Query: 222 KGENLTYDYQYE 233
            GE +T+DY ++
Sbjct: 953 AGEEITFDYHFQ 964


>gi|328790605|ref|XP_003251435.1| PREDICTED: hypothetical protein LOC100578450 [Apis mellifera]
          Length = 1394

 Score =  144 bits (363), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 80/216 (37%), Positives = 120/216 (55%), Gaps = 7/216 (3%)

Query: 23  QIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASP-----GSS 77
           +I  P+  E+ D  ++ +   +  ++ N+YLT+R   + E   + C C  +      G  
Sbjct: 344 RISLPMVSEIEDREMEERLSQFENLRENLYLTERYTNK-ETKRMVCDCFLTEEEIERGEL 402

Query: 78  GVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRG 137
           G C  DC   +L+  C   C  G+ C NK FQN    K ++ +TEK G G+ A  D+  G
Sbjct: 403 G-CGEDCLNRLLMIECGPRCVVGDRCTNKRFQNCEYAKCEVFRTEKKGFGLRAMVDLLAG 461

Query: 138 EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
           EF++EYVGEV+D +    R  +       ++Y   +  D +IDAT KGN SR+INHSC P
Sbjct: 462 EFIMEYVGEVVDPKDFRRRAKEYSKDKNKHYYFMALKSDQIIDATMKGNVSRFINHSCDP 521

Query: 198 NTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           N+E QKW ++GE RIG F  + I  GE +T+DY ++
Sbjct: 522 NSETQKWTVNGELRIGFFNKKFIAAGEEITFDYHFQ 557


>gi|242061944|ref|XP_002452261.1| hypothetical protein SORBIDRAFT_04g022620 [Sorghum bicolor]
 gi|241932092|gb|EES05237.1| hypothetical protein SORBIDRAFT_04g022620 [Sorghum bicolor]
          Length = 1840

 Score =  144 bits (363), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 75/194 (38%), Positives = 107/194 (55%), Gaps = 1/194 (0%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSS-GCK 98
            K   +  ++ N++L +  + +  D+ + C+C         C   C   ML   C+   C 
Sbjct: 1056 KPPSWTLVRSNLFLHRNRRTQSIDESMVCNCKPPQDGRMGCRDGCLNRMLNIECAKRTCP 1115

Query: 99   CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
            CG  C N+ FQ R   K++   + K G G+   ED+  G F+IEYVGEV+D  + E R  
Sbjct: 1116 CGEQCSNQKFQRRSYAKLRWFYSGKKGYGLQLQEDVTEGRFLIEYVGEVLDITSYESRQR 1175

Query: 159  KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
                 G+ +FY   +N   VIDA  KGN  R+INHSC PN   +KW+++GE  IGIF+ R
Sbjct: 1176 YYASKGQKHFYFMALNGGEVIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFSLR 1235

Query: 219  DIKKGENLTYDYQY 232
            +IKKGE LT+DY Y
Sbjct: 1236 NIKKGEELTFDYNY 1249


>gi|195126250|ref|XP_002007587.1| GI12297 [Drosophila mojavensis]
 gi|193919196|gb|EDW18063.1| GI12297 [Drosophila mojavensis]
          Length = 1972

 Score =  144 bits (363), Expect = 3e-32,   Method: Composition-based stats.
 Identities = 80/212 (37%), Positives = 113/212 (53%), Gaps = 4/212 (1%)

Query: 36   FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGV---CDRDCHCGMLLSS 92
            F++ + +    I R+ Y     +   E+  + C C  +    G    C   C   ML+  
Sbjct: 968  FLRQEGLNTFQILRDNYYRCARQVSKENAEMQCDCFVTGDEEGQGMRCGDGCINRMLMIE 1027

Query: 93   CSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
            C   C  G+ C NK FQ       ++ +TEK G GI A+  I+ GEF++EYVGEVID + 
Sbjct: 1028 CGPLCSYGDRCTNKRFQQHQGWPCRVFRTEKKGCGITAELQIQPGEFIMEYVGEVIDSEE 1087

Query: 153  CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
             E R          ++Y   +  + +IDAT KGN SRYINHSC PN E QKW ++GE RI
Sbjct: 1088 FERRQHLYSEDRNRHYYFMALRGEAIIDATTKGNISRYINHSCDPNAETQKWTVNGELRI 1147

Query: 213  GIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
            G F+ + I  GE +T+DYQY+ +  D+   YC
Sbjct: 1148 GFFSVKTIMPGEEITFDYQYQRYGRDAQRCYC 1179


>gi|198467361|ref|XP_001354372.2| GA14357 [Drosophila pseudoobscura pseudoobscura]
 gi|198149208|gb|EAL31425.2| GA14357 [Drosophila pseudoobscura pseudoobscura]
          Length = 2918

 Score =  144 bits (363), Expect = 3e-32,   Method: Composition-based stats.
 Identities = 79/202 (39%), Positives = 111/202 (54%), Gaps = 6/202 (2%)

Query: 47   IKRNIYLTKRIKRRLEDDGIFCSC--TASPGSSG--VCDRDCHCGMLLSSCSSGCKCGNS 102
            ++ N Y   R +   E+  + C C  T    + G   C   C   ML+  C   C  G+ 
Sbjct: 1841 LRENYYRCAR-QVSQENAEMQCDCFLTGDEEAQGHLCCGAGCINRMLMIECGPLCTNGDR 1899

Query: 103  CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
            C NK FQ       ++ +TEK G GI A+  I  GEF++EYVGEVID +  E R  +   
Sbjct: 1900 CTNKRFQLHQCWPCRVFRTEKKGCGITAELQIPAGEFIMEYVGEVIDSEEFERRQHRYSK 1959

Query: 163  LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
                ++Y   +  + +IDAT +GN SRYINHSC PN E QKW ++GE RIG F+ ++I  
Sbjct: 1960 DRNRHYYFMALRGEAIIDATMRGNISRYINHSCDPNAETQKWTVNGELRIGFFSLKNILP 2019

Query: 223  GENLTYDYQYE-FLHDSLIAYC 243
            GE +T+DYQY+ +  D+   YC
Sbjct: 2020 GEEITFDYQYQRYGRDAQRCYC 2041


>gi|354484245|ref|XP_003504300.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Cricetulus
            griseus]
 gi|344236054|gb|EGV92157.1| Histone-lysine N-methyltransferase SETD2 [Cricetulus griseus]
          Length = 2412

 Score =  144 bits (363), Expect = 3e-32,   Method: Composition-based stats.
 Identities = 76/207 (36%), Positives = 115/207 (55%), Gaps = 9/207 (4%)

Query: 36   FIKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHC 86
            F K   +P  F  I+ N+YLT+R K +   D     C  +P S          C  DC  
Sbjct: 1312 FAKLGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEVACGEDCLN 1371

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
             +L+  CSS C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GE
Sbjct: 1372 RLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1431

Query: 147  VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
            V+D +  + R+ +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1432 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1491

Query: 207  DGETRIGIFATRDIKKGENLTYDYQYE 233
            +G+ R+G F T+ +  G  LT+DYQ++
Sbjct: 1492 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1518


>gi|197927225|ref|NP_001074809.2| SET domain containing 2 [Mus musculus]
          Length = 2537

 Score =  144 bits (363), Expect = 3e-32,   Method: Composition-based stats.
 Identities = 76/207 (36%), Positives = 115/207 (55%), Gaps = 9/207 (4%)

Query: 36   FIKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHC 86
            F K   +P  F  I+ N+YLT+R K +   D     C  +P S          C  DC  
Sbjct: 1437 FAKLGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEVACGEDCLN 1496

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
             +L+  CSS C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GE
Sbjct: 1497 RLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1556

Query: 147  VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
            V+D +  + R+ +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1557 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1616

Query: 207  DGETRIGIFATRDIKKGENLTYDYQYE 233
            +G+ R+G F T+ +  G  LT+DYQ++
Sbjct: 1617 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1643


>gi|432092361|gb|ELK24976.1| Histone-lysine N-methyltransferase SETD2 [Myotis davidii]
          Length = 2865

 Score =  144 bits (363), Expect = 4e-32,   Method: Composition-based stats.
 Identities = 72/195 (36%), Positives = 110/195 (56%), Gaps = 7/195 (3%)

Query: 46   FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSGCK 98
             I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS C 
Sbjct: 1778 LIEENVYLTERKKNKSHRDIKRMQCECAPLSKEERAQGEIACGEDCLNRLLMIECSSRCP 1837

Query: 99   CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
             G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R+ 
Sbjct: 1838 NGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKARVK 1897

Query: 159  KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
            +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F T+
Sbjct: 1898 EYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTK 1957

Query: 219  DIKKGENLTYDYQYE 233
             +  G  LT+DYQ++
Sbjct: 1958 LVPSGSELTFDYQFQ 1972


>gi|281343603|gb|EFB19187.1| hypothetical protein PANDA_000629 [Ailuropoda melanoleuca]
          Length = 2535

 Score =  144 bits (363), Expect = 4e-32,   Method: Composition-based stats.
 Identities = 75/207 (36%), Positives = 115/207 (55%), Gaps = 9/207 (4%)

Query: 36   FIKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHC 86
            + K   +P  F  I+ N+YLT+R K +   D     C  +P S          C  DC  
Sbjct: 1434 YAKQGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLN 1493

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
             +L+  CSS C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GE
Sbjct: 1494 RLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1553

Query: 147  VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
            V+D +  + R+ +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1554 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1613

Query: 207  DGETRIGIFATRDIKKGENLTYDYQYE 233
            +G+ R+G F T+ +  G  LT+DYQ++
Sbjct: 1614 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1640


>gi|301754075|ref|XP_002912890.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Ailuropoda
            melanoleuca]
          Length = 2549

 Score =  144 bits (363), Expect = 4e-32,   Method: Composition-based stats.
 Identities = 75/207 (36%), Positives = 115/207 (55%), Gaps = 9/207 (4%)

Query: 36   FIKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHC 86
            + K   +P  F  I+ N+YLT+R K +   D     C  +P S          C  DC  
Sbjct: 1448 YAKQGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLN 1507

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
             +L+  CSS C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GE
Sbjct: 1508 RLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1567

Query: 147  VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
            V+D +  + R+ +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1568 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1627

Query: 207  DGETRIGIFATRDIKKGENLTYDYQYE 233
            +G+ R+G F T+ +  G  LT+DYQ++
Sbjct: 1628 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1654


>gi|395516140|ref|XP_003762252.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Sarcophilus
            harrisii]
          Length = 2570

 Score =  144 bits (362), Expect = 4e-32,   Method: Composition-based stats.
 Identities = 72/195 (36%), Positives = 110/195 (56%), Gaps = 7/195 (3%)

Query: 46   FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSGCK 98
             I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS C 
Sbjct: 1477 LIEENVYLTERKKNKSHRDIKRMQCECTPVSKDDRAQGEIACGEDCLNRLLMIECSSRCP 1536

Query: 99   CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
             G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R+ 
Sbjct: 1537 NGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKARVK 1596

Query: 159  KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
            +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F T+
Sbjct: 1597 EYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTK 1656

Query: 219  DIKKGENLTYDYQYE 233
             +  G  LT+DYQ++
Sbjct: 1657 LVPSGSELTFDYQFQ 1671


>gi|195352880|ref|XP_002042939.1| GM11634 [Drosophila sechellia]
 gi|194126986|gb|EDW49029.1| GM11634 [Drosophila sechellia]
          Length = 1965

 Score =  144 bits (362), Expect = 4e-32,   Method: Composition-based stats.
 Identities = 82/205 (40%), Positives = 109/205 (53%), Gaps = 6/205 (2%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSC--TASPGSSG--VCDRDCHCGMLLSSCSSGCKC 99
            +  +K N Y   R +   E+  + C C  T    + G   C   C   ML+  C   C  
Sbjct: 1194 FQLLKENFYRCAR-QVSQENAEMQCDCFLTGDEEAQGHLSCGAGCINRMLMIECGPLCSN 1252

Query: 100  GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
            G  C NK FQ       ++ +TEK G GI A+  I  GEF++EYVGEVID +  E R   
Sbjct: 1253 GARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFIMEYVGEVIDSEEFERRQHL 1312

Query: 160  MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
                   ++Y   +  + VIDAT KGN SRYINHSC PN E QKW ++GE RIG F+ + 
Sbjct: 1313 YSKDRNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKP 1372

Query: 220  IKKGENLTYDYQY-EFLHDSLIAYC 243
            I+ GE +T+DYQY  +  D+   YC
Sbjct: 1373 IQPGEEITFDYQYLRYGRDAQRCYC 1397


>gi|281360813|ref|NP_001162740.1| Set2, isoform B [Drosophila melanogaster]
 gi|118582047|sp|Q9VYD1.2|C1716_DROME RecName: Full=Probable histone-lysine N-methyltransferase CG1716
 gi|92109778|gb|ABE73213.1| LD27386p [Drosophila melanogaster]
 gi|272506087|gb|ACZ95275.1| Set2, isoform B [Drosophila melanogaster]
          Length = 2313

 Score =  144 bits (362), Expect = 4e-32,   Method: Composition-based stats.
 Identities = 82/205 (40%), Positives = 109/205 (53%), Gaps = 6/205 (2%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSC--TASPGSSG--VCDRDCHCGMLLSSCSSGCKC 99
            +  +K N Y   R +   E+  + C C  T    + G   C   C   ML+  C   C  
Sbjct: 1289 FQLLKENFYRCAR-QVSQENAEMQCDCFLTGDEEAQGHLSCGAGCINRMLMIECGPLCSN 1347

Query: 100  GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
            G  C NK FQ       ++ +TEK G GI A+  I  GEF++EYVGEVID +  E R   
Sbjct: 1348 GARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFIMEYVGEVIDSEEFERRQHL 1407

Query: 160  MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
                   ++Y   +  + VIDAT KGN SRYINHSC PN E QKW ++GE RIG F+ + 
Sbjct: 1408 YSKDRNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKP 1467

Query: 220  IKKGENLTYDYQY-EFLHDSLIAYC 243
            I+ GE +T+DYQY  +  D+   YC
Sbjct: 1468 IQPGEEITFDYQYLRYGRDAQRCYC 1492


>gi|24641786|ref|NP_572888.2| Set2, isoform A [Drosophila melanogaster]
 gi|22832197|gb|AAF48273.2| Set2, isoform A [Drosophila melanogaster]
          Length = 2362

 Score =  144 bits (362), Expect = 4e-32,   Method: Composition-based stats.
 Identities = 82/205 (40%), Positives = 109/205 (53%), Gaps = 6/205 (2%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSC--TASPGSSG--VCDRDCHCGMLLSSCSSGCKC 99
            +  +K N Y   R +   E+  + C C  T    + G   C   C   ML+  C   C  
Sbjct: 1338 FQLLKENFYRCAR-QVSQENAEMQCDCFLTGDEEAQGHLSCGAGCINRMLMIECGPLCSN 1396

Query: 100  GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
            G  C NK FQ       ++ +TEK G GI A+  I  GEF++EYVGEVID +  E R   
Sbjct: 1397 GARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFIMEYVGEVIDSEEFERRQHL 1456

Query: 160  MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
                   ++Y   +  + VIDAT KGN SRYINHSC PN E QKW ++GE RIG F+ + 
Sbjct: 1457 YSKDRNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKP 1516

Query: 220  IKKGENLTYDYQY-EFLHDSLIAYC 243
            I+ GE +T+DYQY  +  D+   YC
Sbjct: 1517 IQPGEEITFDYQYLRYGRDAQRCYC 1541


>gi|334333796|ref|XP_001375978.2| PREDICTED: histone-lysine N-methyltransferase SETD2 [Monodelphis
            domestica]
          Length = 2592

 Score =  144 bits (362), Expect = 4e-32,   Method: Composition-based stats.
 Identities = 72/195 (36%), Positives = 110/195 (56%), Gaps = 7/195 (3%)

Query: 46   FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSGCK 98
             I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS C 
Sbjct: 1498 LIEENVYLTERKKNKSHRDIKRMQCECTPVSKDDRAQGEIACGEDCLNRLLMIECSSRCP 1557

Query: 99   CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
             G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R+ 
Sbjct: 1558 NGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKARVK 1617

Query: 159  KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
            +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F T+
Sbjct: 1618 EYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTK 1677

Query: 219  DIKKGENLTYDYQYE 233
             +  G  LT+DYQ++
Sbjct: 1678 LVPSGSELTFDYQFQ 1692


>gi|196013861|ref|XP_002116791.1| hypothetical protein TRIADDRAFT_31338 [Trichoplax adhaerens]
 gi|190580769|gb|EDV20850.1| hypothetical protein TRIADDRAFT_31338 [Trichoplax adhaerens]
          Length = 725

 Score =  144 bits (362), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 77/195 (39%), Positives = 116/195 (59%), Gaps = 6/195 (3%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSC-SSG 96
           K   Y  IK N  +   I+  L +   +  C+  P     C    +C   ML+  C SS 
Sbjct: 369 KPPSYKHIKMNKLVDALIRPNLTE---YTPCSCKPTDEAPCGSSSNCINRMLMCECNSSM 425

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G+ C N+ FQ     K ++ ++ +CG G+ + EDI  G  V+EYVGE+++++TC +R
Sbjct: 426 CPAGDKCQNQRFQKLEYAKSEIFKSNQCGWGLKSAEDIYAGTLVVEYVGELLNEKTCYQR 485

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +   +  GE NFY+  I++D++IDA  KGN +R++NHSC PN E  KW ++G T IG+FA
Sbjct: 486 IKMAQSKGEKNFYMLNIDKDVIIDAGQKGNLARFMNHSCQPNCETHKWTVNGLTCIGLFA 545

Query: 217 TRDIKKGENLTYDYQ 231
             DIK+GE LT+DY+
Sbjct: 546 IDDIKQGEELTFDYR 560


>gi|308810955|ref|XP_003082786.1| Histone H3 (Lys4) methyltransferase complex, subunit SET1 and
           related methyltransferases (ISS) [Ostreococcus tauri]
 gi|116061255|emb|CAL56643.1| Histone H3 (Lys4) methyltransferase complex, subunit SET1 and
           related methyltransferases (ISS) [Ostreococcus tauri]
          Length = 501

 Score =  144 bits (362), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 80/218 (36%), Positives = 114/218 (52%), Gaps = 9/218 (4%)

Query: 24  IGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRD 83
           +G P  +E  +        P V +  +   T   K+ L D G       +P     C  +
Sbjct: 191 MGKPAAYERINRSTFVSVPPPVKMHISESATCECKQPLRDGG------GTPTLRNGCGPE 244

Query: 84  CHCGMLLSSCSS-GCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIE 142
           C    L  SC S  C CG++C N+P    P  K K+++TE  G G++  E +  G F++E
Sbjct: 245 CINRKLRFSCDSRTCPCGDACSNRPLSQLPTPKTKIIRTENRGWGLMLQEPVTAGTFIVE 304

Query: 143 YVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQ 202
           Y GE++++     RLW  K  GE NFYL EI+ + VIDA +KG+ +R+IN  C PN E Q
Sbjct: 305 YTGEILNEADVANRLWLDKQEGEENFYLMEISTNYVIDAKFKGSLARFINSGCHPNCETQ 364

Query: 203 KWI--IDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
           +W+     ETR+GIFA  DI  G  LTYDY +    D+
Sbjct: 365 RWVDASTNETRVGIFAIEDIPAGTELTYDYHFAHFGDA 402


>gi|440891718|gb|ELR45266.1| Histone-lysine N-methyltransferase SETD2, partial [Bos grunniens
            mutus]
          Length = 2533

 Score =  144 bits (362), Expect = 4e-32,   Method: Composition-based stats.
 Identities = 75/207 (36%), Positives = 115/207 (55%), Gaps = 9/207 (4%)

Query: 36   FIKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHC 86
            + K   +P  F  I+ N+YLT+R K +   D     C  +P S          C  DC  
Sbjct: 1432 YAKQGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLN 1491

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
             +L+  CSS C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GE
Sbjct: 1492 RLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1551

Query: 147  VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
            V+D +  + R+ +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1552 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1611

Query: 207  DGETRIGIFATRDIKKGENLTYDYQYE 233
            +G+ R+G F T+ +  G  LT+DYQ++
Sbjct: 1612 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1638


>gi|119914792|ref|XP_589886.3| PREDICTED: histone-lysine N-methyltransferase SETD2 [Bos taurus]
          Length = 2547

 Score =  144 bits (362), Expect = 4e-32,   Method: Composition-based stats.
 Identities = 75/207 (36%), Positives = 115/207 (55%), Gaps = 9/207 (4%)

Query: 36   FIKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHC 86
            + K   +P  F  I+ N+YLT+R K +   D     C  +P S          C  DC  
Sbjct: 1446 YAKQGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLN 1505

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
             +L+  CSS C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GE
Sbjct: 1506 RLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1565

Query: 147  VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
            V+D +  + R+ +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1566 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1625

Query: 207  DGETRIGIFATRDIKKGENLTYDYQYE 233
            +G+ R+G F T+ +  G  LT+DYQ++
Sbjct: 1626 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1652


>gi|296474690|tpg|DAA16805.1| TPA: Wolf-Hirschhorn syndrome candidate 1 protein-like [Bos taurus]
          Length = 2547

 Score =  144 bits (362), Expect = 4e-32,   Method: Composition-based stats.
 Identities = 75/207 (36%), Positives = 115/207 (55%), Gaps = 9/207 (4%)

Query: 36   FIKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHC 86
            + K   +P  F  I+ N+YLT+R K +   D     C  +P S          C  DC  
Sbjct: 1446 YAKQGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLN 1505

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
             +L+  CSS C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GE
Sbjct: 1506 RLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1565

Query: 147  VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
            V+D +  + R+ +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1566 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1625

Query: 207  DGETRIGIFATRDIKKGENLTYDYQYE 233
            +G+ R+G F T+ +  G  LT+DYQ++
Sbjct: 1626 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1652


>gi|195566590|ref|XP_002106863.1| GD17127 [Drosophila simulans]
 gi|194204255|gb|EDX17831.1| GD17127 [Drosophila simulans]
          Length = 2246

 Score =  144 bits (362), Expect = 4e-32,   Method: Composition-based stats.
 Identities = 82/205 (40%), Positives = 109/205 (53%), Gaps = 6/205 (2%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSC--TASPGSSG--VCDRDCHCGMLLSSCSSGCKC 99
            +  +K N Y   R +   E+  + C C  T    + G   C   C   ML+  C   C  
Sbjct: 1289 FQLLKENFYRCAR-QVSQENAEMQCDCFLTGDEEAQGHLSCGAGCINRMLMIECGPLCSN 1347

Query: 100  GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
            G  C NK FQ       ++ +TEK G GI A+  I  GEF++EYVGEVID +  E R   
Sbjct: 1348 GARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFIMEYVGEVIDSEEFERRQHL 1407

Query: 160  MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
                   ++Y   +  + VIDAT KGN SRYINHSC PN E QKW ++GE RIG F+ + 
Sbjct: 1408 YSKDRNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKP 1467

Query: 220  IKKGENLTYDYQY-EFLHDSLIAYC 243
            I+ GE +T+DYQY  +  D+   YC
Sbjct: 1468 IQPGEEITFDYQYLRYGRDAQRCYC 1492


>gi|73985747|ref|XP_864158.1| PREDICTED: histone-lysine N-methyltransferase SETD2 isoform 11 [Canis
            lupus familiaris]
          Length = 2562

 Score =  144 bits (362), Expect = 4e-32,   Method: Composition-based stats.
 Identities = 72/195 (36%), Positives = 110/195 (56%), Gaps = 7/195 (3%)

Query: 46   FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSGCK 98
             I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS C 
Sbjct: 1473 LIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSRCP 1532

Query: 99   CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
             G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R+ 
Sbjct: 1533 NGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKARVK 1592

Query: 159  KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
            +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F T+
Sbjct: 1593 EYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTK 1652

Query: 219  DIKKGENLTYDYQYE 233
             +  G  LT+DYQ++
Sbjct: 1653 LVPSGSELTFDYQFQ 1667


>gi|195448204|ref|XP_002071555.1| GK25076 [Drosophila willistoni]
 gi|194167640|gb|EDW82541.1| GK25076 [Drosophila willistoni]
          Length = 2217

 Score =  144 bits (362), Expect = 5e-32,   Method: Composition-based stats.
 Identities = 81/202 (40%), Positives = 109/202 (53%), Gaps = 6/202 (2%)

Query: 47   IKRNIYLTKRIKRRLEDDGIFCSC--TASPGSSG--VCDRDCHCGMLLSSCSSGCKCGNS 102
            +K N Y   R +   E+  + C C  T    + G   C   C   ML+  C   C  G+ 
Sbjct: 1088 LKENYYRCAR-QVSQENAEMQCDCFLTGDEEAQGHLCCGARCINRMLMIECGPLCTNGDR 1146

Query: 103  CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
            C NK FQ       ++ +TEK G GI A+  I  GEF++EYVGEVID +  E R      
Sbjct: 1147 CTNKRFQQHQCWPCRVFRTEKKGCGITAELQIPPGEFIMEYVGEVIDAEEFERRQHLYSK 1206

Query: 163  LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
                ++Y   +  + +IDAT KGN SRYINHSC PN E QKW ++GE RIG F+ + I  
Sbjct: 1207 DRNRHYYFMALRGEAIIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKTILP 1266

Query: 223  GENLTYDYQYE-FLHDSLIAYC 243
            GE +T+DYQY+ +  D+   YC
Sbjct: 1267 GEEITFDYQYQRYGRDAQRCYC 1288


>gi|410914004|ref|XP_003970478.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
           lysine-20 specific-like [Takifugu rubripes]
          Length = 1169

 Score =  143 bits (361), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 77/199 (38%), Positives = 115/199 (57%), Gaps = 2/199 (1%)

Query: 38  KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG- 96
           KP    ++ + R I   + I   L +    C+C AS  +    D +C   ML+  C    
Sbjct: 701 KPPPYRHIKVNRAIGKVQFITADLSEIP-RCNCKASDENPCGVDSECINRMLMYECHPQV 759

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G  C N+ F  R    +++ +T  CG G+ A  DIK+G FVIEYVGEVID++ C  R
Sbjct: 760 CAAGERCQNQAFTKREYTPVEIYRTLSCGWGLRAVSDIKKGAFVIEYVGEVIDEEECRAR 819

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +   +     NFY+  +++D +IDA  KGN++R++NH C PN E QKW ++G+TR+G+FA
Sbjct: 820 IKHAQENDIFNFYMLTLDKDRIIDAGPKGNQARFMNHCCQPNCETQKWTVNGDTRVGLFA 879

Query: 217 TRDIKKGENLTYDYQYEFL 235
            +DI KG+ L ++Y  E L
Sbjct: 880 LQDIPKGKELNFNYNLECL 898


>gi|348582642|ref|XP_003477085.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Cavia
            porcellus]
          Length = 2565

 Score =  143 bits (361), Expect = 5e-32,   Method: Composition-based stats.
 Identities = 75/207 (36%), Positives = 115/207 (55%), Gaps = 9/207 (4%)

Query: 36   FIKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHC 86
            + K   +P  F  I+ N+YLT+R K +   D     C  +P S          C  DC  
Sbjct: 1464 YAKQGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEVACGEDCLN 1523

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
             +L+  CSS C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GE
Sbjct: 1524 RLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1583

Query: 147  VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
            V+D +  + R+ +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1584 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1643

Query: 207  DGETRIGIFATRDIKKGENLTYDYQYE 233
            +G+ R+G F T+ +  G  LT+DYQ++
Sbjct: 1644 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1670


>gi|297842509|ref|XP_002889136.1| hypothetical protein ARALYDRAFT_476894 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297334977|gb|EFH65395.1| hypothetical protein ARALYDRAFT_476894 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 1766

 Score =  143 bits (361), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 76/190 (40%), Positives = 111/190 (58%), Gaps = 1/190 (0%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
            +  IK N +L +  K +  D+ + C C   P     C  +C   ML   C  G C  G+ 
Sbjct: 955  FKAIKTNQFLHRNRKSQTIDEIMVCHCKPPPDGRLGCGEECLNRMLNIECLQGTCPAGDL 1014

Query: 103  CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
            C N+ FQ R   K +  Q+ K G G+   ED++ G+F+IEYVGEV+D Q+ + R  +   
Sbjct: 1015 CSNQQFQKRKYVKFERFQSGKKGYGLRLLEDVREGQFLIEYVGEVLDMQSYDTRQKEYAC 1074

Query: 163  LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
             G+ +FY   +N + VIDA  KGN  R+INHSC PN   +KW+++GE  +GIF+ +D+KK
Sbjct: 1075 KGQKHFYFMTLNGNEVIDAGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMKDLKK 1134

Query: 223  GENLTYDYQY 232
            G+ LT+DY Y
Sbjct: 1135 GQELTFDYNY 1144


>gi|195376627|ref|XP_002047094.1| GJ13235 [Drosophila virilis]
 gi|194154252|gb|EDW69436.1| GJ13235 [Drosophila virilis]
          Length = 2005

 Score =  143 bits (361), Expect = 5e-32,   Method: Composition-based stats.
 Identities = 76/187 (40%), Positives = 102/187 (54%), Gaps = 5/187 (2%)

Query: 62   EDDGIFCSCTASPGSSGV----CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMK 117
            E+  + C C  +    G+    C   C   ML+  C   C  G  C NK FQ       +
Sbjct: 1017 ENAEMQCDCFVTGDEEGLGQLRCGDGCINRMLMIECGPLCTYGERCTNKRFQQHQGWPCR 1076

Query: 118  LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
            + +TEK G GI A+  I  GEF++EYVGEVID +  E R          ++Y   +  + 
Sbjct: 1077 VFRTEKKGCGITAELQIPPGEFIMEYVGEVIDSEEFERRQHLYSEDRNRHYYFMALRGEA 1136

Query: 178  VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLH 236
            +IDAT KGN SRYINHSC PN E QKW ++GE RIG F+ + I  GE +T+DYQY+ +  
Sbjct: 1137 IIDATTKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKTILPGEEITFDYQYQRYGR 1196

Query: 237  DSLIAYC 243
            D+   YC
Sbjct: 1197 DAQRCYC 1203


>gi|301122693|ref|XP_002909073.1| histone-lysine N-methyltransferase, putative [Phytophthora
           infestans T30-4]
 gi|262099835|gb|EEY57887.1| histone-lysine N-methyltransferase, putative [Phytophthora
           infestans T30-4]
          Length = 751

 Score =  143 bits (361), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 74/168 (44%), Positives = 101/168 (60%), Gaps = 4/168 (2%)

Query: 80  CDRDCHCGMLLSSCS-SGCKCGNS--CLNKPFQNRPVKKMKLVQTEK-CGAGIVADEDIK 135
           C   C+  ML  SCS   C   +   C N+  + R +K M++       G G++ +EDI 
Sbjct: 164 CGEGCYNRMLFISCSDETCSAPDPSVCSNRAIKRRQLKSMRVEYIPGGPGFGLITNEDIN 223

Query: 136 RGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSC 195
            GEFVIEYVGEVIDD+ CE R+   +  GE NFY+ E+ +++VIDA Y+ N SR+INHSC
Sbjct: 224 AGEFVIEYVGEVIDDKECERRMITYRDNGEVNFYMMELEKNIVIDAKYRSNDSRFINHSC 283

Query: 196 CPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
            PN+  QKW +DG  RIGIFA R+I   E +T DY +    ++    C
Sbjct: 284 DPNSVTQKWNVDGMQRIGIFARRNIAPNEEITIDYNFSHFGEAADCRC 331


>gi|380812066|gb|AFE77908.1| histone-lysine N-methyltransferase SETD2 [Macaca mulatta]
          Length = 2565

 Score =  143 bits (361), Expect = 6e-32,   Method: Composition-based stats.
 Identities = 72/195 (36%), Positives = 110/195 (56%), Gaps = 7/195 (3%)

Query: 46   FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSGCK 98
             I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS C 
Sbjct: 1476 LIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSRCP 1535

Query: 99   CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
             G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R+ 
Sbjct: 1536 NGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKARVK 1595

Query: 159  KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
            +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F T+
Sbjct: 1596 EYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTK 1655

Query: 219  DIKKGENLTYDYQYE 233
             +  G  LT+DYQ++
Sbjct: 1656 LVPSGSELTFDYQFQ 1670


>gi|356507632|ref|XP_003522568.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like [Glycine
            max]
          Length = 2081

 Score =  143 bits (361), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 78/190 (41%), Positives = 109/190 (57%), Gaps = 1/190 (0%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
            +  I  N +L +  K +  D+ + C C  S G    C  +C   +L   C  G C CG+ 
Sbjct: 1154 FTNILTNEFLHRSHKTQTIDEIMVCHCKPSQGGKLGCGDECLNRILNIECVQGTCPCGDR 1213

Query: 103  CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
            C N+ FQ      +K  +  K G G+ A ED+ +G+F+IEYVGEV+D QT E R  +   
Sbjct: 1214 CSNQQFQKHKYASLKWFKCGKKGYGLKAIEDVAQGQFLIEYVGEVLDMQTYEARQREYAL 1273

Query: 163  LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
             G  +FY   +N   VIDA+ KGN  R+INHSC PN   +KW+++GE  IG+FA R++KK
Sbjct: 1274 KGHRHFYFMTLNGSEVIDASAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRNVKK 1333

Query: 223  GENLTYDYQY 232
             E LT+DY Y
Sbjct: 1334 DEELTFDYNY 1343


>gi|109040979|ref|XP_001113652.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like isoform 2
            [Macaca mulatta]
          Length = 2550

 Score =  143 bits (361), Expect = 6e-32,   Method: Composition-based stats.
 Identities = 72/195 (36%), Positives = 110/195 (56%), Gaps = 7/195 (3%)

Query: 46   FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSGCK 98
             I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS C 
Sbjct: 1461 LIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSRCP 1520

Query: 99   CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
             G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R+ 
Sbjct: 1521 NGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKARVK 1580

Query: 159  KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
            +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F T+
Sbjct: 1581 EYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTK 1640

Query: 219  DIKKGENLTYDYQYE 233
             +  G  LT+DYQ++
Sbjct: 1641 LVPSGSELTFDYQFQ 1655


>gi|402860278|ref|XP_003894560.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Papio anubis]
          Length = 2521

 Score =  143 bits (361), Expect = 6e-32,   Method: Composition-based stats.
 Identities = 72/195 (36%), Positives = 110/195 (56%), Gaps = 7/195 (3%)

Query: 46   FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSGCK 98
             I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS C 
Sbjct: 1432 LIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSRCP 1491

Query: 99   CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
             G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R+ 
Sbjct: 1492 NGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKARVK 1551

Query: 159  KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
            +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F T+
Sbjct: 1552 EYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTK 1611

Query: 219  DIKKGENLTYDYQYE 233
             +  G  LT+DYQ++
Sbjct: 1612 LVPSGSELTFDYQFQ 1626


>gi|157824020|ref|NP_001101659.1| histone-lysine N-methyltransferase SETD2 [Rattus norvegicus]
 gi|149018436|gb|EDL77077.1| kinesin family member 9 (predicted) [Rattus norvegicus]
          Length = 2294

 Score =  143 bits (361), Expect = 6e-32,   Method: Composition-based stats.
 Identities = 76/207 (36%), Positives = 115/207 (55%), Gaps = 9/207 (4%)

Query: 36   FIKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHC 86
            F K   +P  F  I+ N+YLT+R K +   D     C  +P S          C  DC  
Sbjct: 1194 FAKLGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEVACGEDCLN 1253

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
             +L+  CSS C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GE
Sbjct: 1254 RLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1313

Query: 147  VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
            V+D +  + R+ +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1314 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1373

Query: 207  DGETRIGIFATRDIKKGENLTYDYQYE 233
            +G+ R+G F T+ +  G  LT+DYQ++
Sbjct: 1374 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1400


>gi|15150415|gb|AAK84931.1| SD01656p [Drosophila melanogaster]
          Length = 1443

 Score =  143 bits (361), Expect = 6e-32,   Method: Composition-based stats.
 Identities = 82/205 (40%), Positives = 109/205 (53%), Gaps = 6/205 (2%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSC--TASPGSSG--VCDRDCHCGMLLSSCSSGCKC 99
           +  +K N Y   R +   E+  + C C  T    + G   C   C   ML+  C   C  
Sbjct: 419 FQLLKENFYRCAR-QVSQENAEMQCDCFLTGDEEAQGHLSCGAGCINRMLMIECGPLCSN 477

Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
           G  C NK FQ       ++ +TEK G GI A+  I  GEF++EYVGEVID +  E R   
Sbjct: 478 GARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFIMEYVGEVIDSEEFERRQHL 537

Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
                  ++Y   +  + VIDAT KGN SRYINHSC PN E QKW ++GE RIG F+ + 
Sbjct: 538 YSKDRNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKP 597

Query: 220 IKKGENLTYDYQY-EFLHDSLIAYC 243
           I+ GE +T+DYQY  +  D+   YC
Sbjct: 598 IQPGEEITFDYQYLRYGRDAQRCYC 622


>gi|363729887|ref|XP_418510.3| PREDICTED: histone-lysine N-methyltransferase SETD2 [Gallus gallus]
          Length = 2554

 Score =  143 bits (361), Expect = 6e-32,   Method: Composition-based stats.
 Identities = 76/207 (36%), Positives = 117/207 (56%), Gaps = 9/207 (4%)

Query: 36   FIKPKAIPYVF--IKRNIYLTKRIKRRLEDD--GIFCSC---TASPGSSG--VCDRDCHC 86
            F K   +P  F  I+ N+YLT+R K +   D   + C C   +    + G   C  DC  
Sbjct: 1461 FAKQGKMPCYFDLIEENVYLTERKKSKSHRDIKRMLCECPPLSKEERAQGEVACGEDCLN 1520

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
             +L+  CSS C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GE
Sbjct: 1521 RLLMIECSSRCPNGDYCSNRRFQKKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1580

Query: 147  VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
            V+D +  + R+ +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1581 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1640

Query: 207  DGETRIGIFATRDIKKGENLTYDYQYE 233
            +G+ R+G F T+ +  G  LT+DYQ++
Sbjct: 1641 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1667


>gi|426340342|ref|XP_004034089.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Gorilla gorilla
            gorilla]
          Length = 2564

 Score =  143 bits (361), Expect = 6e-32,   Method: Composition-based stats.
 Identities = 72/195 (36%), Positives = 110/195 (56%), Gaps = 7/195 (3%)

Query: 46   FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSGCK 98
             I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS C 
Sbjct: 1475 LIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSRCP 1534

Query: 99   CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
             G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R+ 
Sbjct: 1535 NGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKARVK 1594

Query: 159  KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
            +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F T+
Sbjct: 1595 EYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTK 1654

Query: 219  DIKKGENLTYDYQYE 233
             +  G  LT+DYQ++
Sbjct: 1655 LVPSGSELTFDYQFQ 1669


>gi|115446669|ref|NP_001047114.1| Os02g0554000 [Oryza sativa Japonica Group]
 gi|50725771|dbj|BAD33302.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
 gi|113536645|dbj|BAF09028.1| Os02g0554000 [Oryza sativa Japonica Group]
          Length = 637

 Score =  143 bits (360), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 75/190 (39%), Positives = 104/190 (54%), Gaps = 1/190 (0%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSS-GCKCGNS 102
           +  ++ N++L +  + +  D+ + C+C         C   C   +L   C+   C CG  
Sbjct: 124 FALLRSNLFLHRNRRTQSIDESMVCNCKPPHDDRMGCRDGCLNRILNIECTKRTCPCGEH 183

Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
           C N+ FQ R   K+    T K G G+   ED+  G F+IEYVGEV+D    E R      
Sbjct: 184 CSNQQFQRRTYAKLGKFHTGKKGYGLQLKEDVSEGRFLIEYVGEVLDITAYESRQRYYAS 243

Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
            G+ +FY   +N   VIDA  KGN  R+INHSC PN   +KW+++GE  IGIFA R+IKK
Sbjct: 244 KGQKHFYFMALNGGEVIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFAMRNIKK 303

Query: 223 GENLTYDYQY 232
           GE LT+DY Y
Sbjct: 304 GEELTFDYNY 313


>gi|357163489|ref|XP_003579748.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like
           [Brachypodium distachyon]
          Length = 517

 Score =  143 bits (360), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 80/202 (39%), Positives = 117/202 (57%), Gaps = 6/202 (2%)

Query: 43  PYVFIKRNIYLTKRIKRRLEDDGIFCSCT---ASPGSSGVCDRDCHCGMLLSSCSSG-CK 98
           PY+ I+ N +L +R KR+ E+D   C C      P S+  C   C      + C+ G C+
Sbjct: 10  PYIHIETNDFLHRRHKRQKEEDIAVCECQYNLMDPDSA--CGERCWNVSTNTECTPGYCR 67

Query: 99  CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
           CG  C N+ FQ     + +LV+TE  G G++A+E+I  G+FVIEY GEVI  +  + R  
Sbjct: 68  CGVYCKNQRFQKCQYARTRLVKTEGRGWGLLAEENIMAGQFVIEYCGEVISWKEAKRRSQ 127

Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
             +  G    Y+  +N    IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA +
Sbjct: 128 AYEDQGLMEAYIIYLNTAESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFAKQ 187

Query: 219 DIKKGENLTYDYQYEFLHDSLI 240
           DI  G  L+YDY +E+   +++
Sbjct: 188 DIPIGMELSYDYNFEWFGGAIV 209


>gi|302798461|ref|XP_002980990.1| hypothetical protein SELMODRAFT_3415 [Selaginella moellendorffii]
 gi|300151044|gb|EFJ17691.1| hypothetical protein SELMODRAFT_3415 [Selaginella moellendorffii]
          Length = 242

 Score =  143 bits (360), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 78/201 (38%), Positives = 111/201 (55%), Gaps = 1/201 (0%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS 102
           +  I+ NI+  +  K   +DD + C C      S  C  DC   M+   CS   C CG  
Sbjct: 1   WTQIRHNIFQHRHQKTLDDDDTLICLCKPPKDGSPGCGEDCLNRMVNVECSPDTCPCGER 60

Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
           C N+ F  R    + LV+  K G G+ A E+I +G FVIEYVGEV+D ++ E R  +   
Sbjct: 61  CSNQQFGKREYSNVALVRCGKKGFGLKALENIAKGSFVIEYVGEVLDSRSFELRQKEYAR 120

Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
             + +FY   +N   VIDA  KGN  R+INHSC PN + +KW ++GE  IG+FA RD+ K
Sbjct: 121 QRQKHFYFMTLNSSEVIDACRKGNLGRFINHSCEPNCQTEKWCVNGEICIGLFAIRDVAK 180

Query: 223 GENLTYDYQYEFLHDSLIAYC 243
            E +T++Y +E L+ +    C
Sbjct: 181 NEEITFNYNFERLYGAAAKKC 201


>gi|327289513|ref|XP_003229469.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Anolis
            carolinensis]
          Length = 2579

 Score =  143 bits (360), Expect = 7e-32,   Method: Composition-based stats.
 Identities = 77/207 (37%), Positives = 115/207 (55%), Gaps = 9/207 (4%)

Query: 36   FIKPKAIPYVF--IKRNIYLTKRIKRRLEDD--GIFCSC-TASPGSSG----VCDRDCHC 86
            F K   +P  F  I+ N+YLT+R K +   D   + C C T S          C  DC  
Sbjct: 1481 FAKHGRMPCYFDLIEENVYLTERKKNKSHRDIKRMLCECPTLSKDERAQGEVACGEDCLN 1540

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
             +L+  CSS C  G  C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GE
Sbjct: 1541 RLLMIECSSRCPNGEHCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1600

Query: 147  VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
            V+D +  + R+ +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1601 VLDHKEFKTRVKEYARSKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1660

Query: 207  DGETRIGIFATRDIKKGENLTYDYQYE 233
            +G+ R+G F T+ +  G  LT+DYQ++
Sbjct: 1661 NGQLRVGFFTTKMVPSGSELTFDYQFQ 1687


>gi|345493934|ref|XP_001600694.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Nasonia
            vitripennis]
          Length = 1382

 Score =  143 bits (360), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 81/192 (42%), Positives = 109/192 (56%), Gaps = 2/192 (1%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CK 98
            K  PYV +K N  +   +K    D  + C C  +       D DC   +L+  CS   C 
Sbjct: 932  KPPPYVKLKVNKPVG-NVKVPEVDSMVACDCNPNQPYPCSPDSDCLNRILMIECSPDTCP 990

Query: 99   CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
                C N+ F  R    MK   TE+ G G+V+ E IK G+F+IEYVGEVID+   + RL 
Sbjct: 991  ASTKCQNQLFVQRKYPAMKPAHTEERGWGLVSLEPIKHGQFIIEYVGEVIDEAEYKLRLQ 1050

Query: 159  KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
            + K     N+Y   I+   +IDA  KGN SR++NHSC PN E QKW ++G+TRIG+FA R
Sbjct: 1051 QKKERKNENYYFLTIDNSRMIDAEPKGNLSRFMNHSCQPNCETQKWKVNGDTRIGLFALR 1110

Query: 219  DIKKGENLTYDY 230
            DI+ GE LT++Y
Sbjct: 1111 DIEPGEELTFNY 1122


>gi|412991390|emb|CCO16235.1| unnamed protein product [Bathycoccus prasinos]
          Length = 825

 Score =  143 bits (360), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 70/156 (44%), Positives = 96/156 (61%), Gaps = 3/156 (1%)

Query: 80  CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGE 138
           C  DC       +C +  C CG+ C N+PFQ+ P  K+K+  TE  G G+   +D+  G+
Sbjct: 461 CGHDCVNRETRYTCDTRVCPCGDDCSNRPFQHLPQPKVKVQLTENRGYGLFLQQDVFEGD 520

Query: 139 FVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPN 198
           F++EY+GE++D++ C  RL   K   E NFYL EI    +IDA + GN +R+IN SC PN
Sbjct: 521 FIVEYMGEIVDEEECTRRLLACKGKNEPNFYLMEITPSQIIDARFCGNNARFINSSCHPN 580

Query: 199 TEMQKWI--IDGETRIGIFATRDIKKGENLTYDYQY 232
            E Q+W+     ETR+GIFAT DIK G  LTYDY +
Sbjct: 581 CETQRWVDASTNETRVGIFATEDIKSGTELTYDYNF 616


>gi|410905477|ref|XP_003966218.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Takifugu
            rubripes]
          Length = 1950

 Score =  142 bits (359), Expect = 9e-32,   Method: Composition-based stats.
 Identities = 77/205 (37%), Positives = 114/205 (55%), Gaps = 9/205 (4%)

Query: 38   KPKAIPYVF--IKRNIYLTKRIKRRLEDD--GIFCSCTASPG---SSGV--CDRDCHCGM 88
            K K +P  F  I+ N+YLT+R K +   D   + C C   P    S G   C  DC   +
Sbjct: 837  KSKKMPPYFDLIEENLYLTERKKSKSHRDIKRMQCECPVLPREERSKGALACGEDCLNRL 896

Query: 89   LLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVI 148
            L+  CSS C+ G  C N+ FQ R      ++ TE  G G+ A +D+    FV+EY GEV+
Sbjct: 897  LMIECSSRCQNGAYCSNRRFQMRQHADFDVILTEDKGWGLRAAKDLPSNTFVLEYCGEVL 956

Query: 149  DDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDG 208
            D +  + R+ +       ++Y   +  + +IDAT KGN SR++NHSC PN E QKW ++G
Sbjct: 957  DHKEFKTRVKEYARNKNIHYYFMALKNNEIIDATLKGNLSRFMNHSCEPNCETQKWTVNG 1016

Query: 209  ETRIGIFATRDIKKGENLTYDYQYE 233
            + R+G F T+ +  G  LT+DYQ++
Sbjct: 1017 QLRVGFFTTKAVTAGTELTFDYQFQ 1041


>gi|195171947|ref|XP_002026763.1| GL27000 [Drosophila persimilis]
 gi|194111702|gb|EDW33745.1| GL27000 [Drosophila persimilis]
          Length = 944

 Score =  142 bits (359), Expect = 9e-32,   Method: Composition-based stats.
 Identities = 71/165 (43%), Positives = 97/165 (58%), Gaps = 1/165 (0%)

Query: 80  CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
           C   C   ML+  C   C  G+ C NK FQ       ++ +TEK G GI A+  I  GEF
Sbjct: 19  CGAGCINRMLMIECGPLCTNGDRCTNKRFQLHQCWPCRVFRTEKKGCGITAELQIPAGEF 78

Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
           ++EYVGEVID +  E R  +       ++Y   +  + +IDAT +GN SRYINHSC PN 
Sbjct: 79  IMEYVGEVIDSEEFERRQHRYSKDRNRHYYFMALRGEAIIDATMRGNISRYINHSCDPNA 138

Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           E QKW ++GE RIG F+ ++I  GE +T+DYQY+ +  D+   YC
Sbjct: 139 ETQKWTVNGELRIGFFSLKNILPGEEITFDYQYQRYGRDAQRCYC 183


>gi|327265653|ref|XP_003217622.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific-like [Anolis carolinensis]
          Length = 2106

 Score =  142 bits (359), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 111/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++    D      C   P     C  D  C   MLL  C    
Sbjct: 1297 KPPPYKHIKVNRPIGK-VQIFTADLSEIPRCNCKPADENPCGLDSECINRMLLYECHPAV 1355

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  GN C N+ F  R   ++++ +T   G G+ A  DIK+GEFV EYVGE+ID++ C  R
Sbjct: 1356 CPAGNRCQNQCFTKRQYPEVEIFRTLARGWGLQAKRDIKKGEFVNEYVGELIDEEECRAR 1415

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +   +    TNFY+  +++D +IDA  KGN +R++NH C PN E QKW ++G+TR+G+FA
Sbjct: 1416 IRHAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 1475

Query: 217  TRDIKKGENLTYDYQYEFL 235
              ++K G  LT++Y  E L
Sbjct: 1476 ITNVKAGTELTFNYNLECL 1494


>gi|345493936|ref|XP_003427184.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2 [Nasonia
            vitripennis]
          Length = 1317

 Score =  142 bits (359), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 81/192 (42%), Positives = 109/192 (56%), Gaps = 2/192 (1%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CK 98
            K  PYV +K N  +   +K    D  + C C  +       D DC   +L+  CS   C 
Sbjct: 867  KPPPYVKLKVNKPVG-NVKVPEVDSMVACDCNPNQPYPCSPDSDCLNRILMIECSPDTCP 925

Query: 99   CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
                C N+ F  R    MK   TE+ G G+V+ E IK G+F+IEYVGEVID+   + RL 
Sbjct: 926  ASTKCQNQLFVQRKYPAMKPAHTEERGWGLVSLEPIKHGQFIIEYVGEVIDEAEYKLRLQ 985

Query: 159  KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
            + K     N+Y   I+   +IDA  KGN SR++NHSC PN E QKW ++G+TRIG+FA R
Sbjct: 986  QKKERKNENYYFLTIDNSRMIDAEPKGNLSRFMNHSCQPNCETQKWKVNGDTRIGLFALR 1045

Query: 219  DIKKGENLTYDY 230
            DI+ GE LT++Y
Sbjct: 1046 DIEPGEELTFNY 1057


>gi|52545752|emb|CAH56331.1| hypothetical protein [Homo sapiens]
          Length = 881

 Score =  142 bits (359), Expect = 1e-31,   Method: Composition-based stats.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
           C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 80  CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 139

Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
           G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 140 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 199

Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
            +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 200 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 248


>gi|291223879|ref|XP_002731935.1| PREDICTED: Wolf-Hirschhorn syndrome candidate 1 protein-like
            [Saccoglossus kowalevskii]
          Length = 1787

 Score =  142 bits (359), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 81/200 (40%), Positives = 117/200 (58%), Gaps = 4/200 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLE-DDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-C 97
            K  P+ FIK N  + K + ++ +      C C A   +    D DC   MLL  C S  C
Sbjct: 1247 KPAPFKFIKTNKPVGKVVMQQCDISQCTPCECKADMKNPCGPDSDCLNRMLLIECHSQVC 1306

Query: 98   KCGNSCLNKPFQNRPVKKMKLVQTEKCGAG--IVADEDIKRGEFVIEYVGEVIDDQTCEE 155
              G++C N+ FQ     +    +T++ G G  +   +DIK+G+FV EYVGE++D++TC E
Sbjct: 1307 PAGDNCQNQRFQKMQYPETIPFRTDEKGRGWGLKTTQDIKKGDFVHEYVGELVDEETCRE 1366

Query: 156  RLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIF 215
            R+ K + L   NFY+  I++D VIDA  KGN +R++NHSC PN E  KW I  +TR+G+F
Sbjct: 1367 RIKKCQQLDIDNFYMLTIDKDHVIDAGPKGNLARFMNHSCDPNCETMKWTILPDTRVGLF 1426

Query: 216  ATRDIKKGENLTYDYQYEFL 235
            A RDI  G  LT++Y  + L
Sbjct: 1427 AKRDITAGSELTFNYNLDCL 1446


>gi|410923178|ref|XP_003975059.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Takifugu
            rubripes]
          Length = 1499

 Score =  142 bits (358), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 78/194 (40%), Positives = 109/194 (56%), Gaps = 4/194 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY FIK N  + K ++  + D      C   P     C     C   ML   C    
Sbjct: 1121 KPPPYKFIKSNKPVGK-VQMHVADLSEIQRCNCRPTDEHPCGLHSQCLNRMLQYECHPQV 1179

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G++C N+ F  R   + ++V+T   G G+ A++ +K+GEFV EYVGEVID + C++R
Sbjct: 1180 CPAGDNCENQGFTKRLYAETEVVKTADRGWGLKANQPLKKGEFVTEYVGEVIDAEECQQR 1239

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D VIDA  KGN SR+INHSC PN E QKW ++G+  IG+FA
Sbjct: 1240 IKRAHENHMTNFYMLTLTKDRVIDAAQKGNLSRFINHSCSPNCETQKWTVNGDVHIGLFA 1299

Query: 217  TRDIKKGENLTYDY 230
              DI  G  LT++Y
Sbjct: 1300 LCDIDAGTELTFNY 1313


>gi|292621054|ref|XP_683890.4| PREDICTED: hypothetical protein LOC556086 [Danio rerio]
          Length = 2055

 Score =  142 bits (358), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 80/198 (40%), Positives = 115/198 (58%), Gaps = 2/198 (1%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGI-FCSCTASPGSSGVCDRDCHCGMLLSSCSSG-C 97
            K  PY  IK N  + K +    +   I  C+C A+  +    D +C   MLL  C S  C
Sbjct: 1549 KPPPYKHIKVNKQIGKVLIITADLSEIPRCNCKATDENPCGIDSECINRMLLYECHSQVC 1608

Query: 98   KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
              G  C N+ F  R   ++++ +T   G G+ +  DIK+G FV EYVGEVID++ C  R+
Sbjct: 1609 PAGERCQNQSFTKRQYTEVEIFRTLSRGWGLRSISDIKKGAFVNEYVGEVIDEEECRSRI 1668

Query: 158  WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
               +     NFY+  +++D +IDA  KGN+SR++NHSC PN E QKW ++G+TR+G+FA 
Sbjct: 1669 KNAQDNDICNFYMLTLDKDRIIDAGPKGNESRFMNHSCQPNCETQKWTVNGDTRVGLFAL 1728

Query: 218  RDIKKGENLTYDYQYEFL 235
             DI KG  LT++Y  E L
Sbjct: 1729 EDIPKGVELTFNYNLECL 1746


>gi|170039780|ref|XP_001847701.1| huntingtin interacting protein [Culex quinquefasciatus]
 gi|167863380|gb|EDS26763.1| huntingtin interacting protein [Culex quinquefasciatus]
          Length = 2379

 Score =  142 bits (358), Expect = 1e-31,   Method: Composition-based stats.
 Identities = 80/216 (37%), Positives = 116/216 (53%), Gaps = 10/216 (4%)

Query: 26   NPVEFELPDWFIK---PKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASP-----GSS 77
            N    EL D +++    K   +  I+ N+Y   ++  + E   + C C  +      G  
Sbjct: 1236 NVTNIELSDEYLRDMEEKLSRFETIRENMYHCDKVISK-EAKKMMCDCFLTEEEKDRGEL 1294

Query: 78   GVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRG 137
            G C  DC   +L+  C S C  G  C NK FQ       ++ +TEK G GI A  +I  G
Sbjct: 1295 G-CGEDCLNKLLMIECGSRCVVGERCTNKRFQKLEYANTQVFRTEKKGFGIQASTEIAPG 1353

Query: 138  EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
            EF++EYVGEV++ +  +ER          ++Y   +  D +IDAT KGN SR+INHSC  
Sbjct: 1354 EFIMEYVGEVLNSEQFDERAELYSKEKNKHYYFMALRSDAIIDATTKGNISRFINHSCDA 1413

Query: 198  NTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
            N E QKW ++GE RIG F+T+ I  GE +T+DYQ++
Sbjct: 1414 NAETQKWTVNGELRIGFFSTKYILPGEEVTFDYQFQ 1449


>gi|444706655|gb|ELW47981.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Tupaia chinensis]
          Length = 2687

 Score =  142 bits (358), Expect = 1e-31,   Method: Composition-based stats.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1887 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1946

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1947 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2006

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 2007 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2055


>gi|76666643|ref|XP_613048.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
            [Bos taurus]
 gi|297476142|ref|XP_002688498.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Bos
            taurus]
 gi|296486298|tpg|DAA28411.1| TPA: Wolf-Hirschhorn syndrome candidate 1 [Bos taurus]
          Length = 1365

 Score =  142 bits (357), Expect = 1e-31,   Method: Composition-based stats.
 Identities = 79/196 (40%), Positives = 110/196 (56%), Gaps = 4/196 (2%)

Query: 43   PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG-CKC 99
            PY  IK N    K ++    D      C   P     C  D  C   ML+  C    C  
Sbjct: 990  PYKHIKVNKPYGK-VQVHTADISEIPKCNCKPTDENPCGSDSQCLNRMLMFECHPQVCPA 1048

Query: 100  GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
            G SC N+ F  R   + K+V+T+  G G+VA  DI++GEFV EYVGE+ID++ C  R+ +
Sbjct: 1049 GESCQNQCFTKRQYPETKIVRTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMARIKR 1108

Query: 160  MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
                  T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA  D
Sbjct: 1109 AHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCD 1168

Query: 220  IKKGENLTYDYQYEFL 235
            I  G  LT++Y  + L
Sbjct: 1169 IPAGTELTFNYNLDCL 1184


>gi|432873648|ref|XP_004072321.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Oryzias
            latipes]
          Length = 1597

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 77/194 (39%), Positives = 111/194 (57%), Gaps = 4/194 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++  + D      C   P     C  D  C   ML   C    
Sbjct: 1222 KPPPYKVIKSNKPVGK-VQMHVADLSEIPRCNCKPVDEHPCSIDSQCLNRMLQYECHPQV 1280

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G++C N+ F  R   + ++V+TE  G G+  ++ +++G+FV EYVGEVID + C++R
Sbjct: 1281 CPAGDNCENQNFTKRLYAETEVVKTEGRGWGLQTNQALRKGDFVAEYVGEVIDSEECQQR 1340

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D VIDA  KGN +R+INHSC PN E QKW ++G+ RIGIFA
Sbjct: 1341 IKRAHENHVTNFYMLTLTKDRVIDAGPKGNSARFINHSCNPNCETQKWTVNGDVRIGIFA 1400

Query: 217  TRDIKKGENLTYDY 230
              DI+ G  LT++Y
Sbjct: 1401 LCDIEAGTELTFNY 1414


>gi|326921432|ref|XP_003206963.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            SETD2-like [Meleagris gallopavo]
          Length = 2147

 Score =  142 bits (357), Expect = 2e-31,   Method: Composition-based stats.
 Identities = 76/207 (36%), Positives = 117/207 (56%), Gaps = 9/207 (4%)

Query: 36   FIKPKAIPYVF--IKRNIYLTKRIKRRLEDD--GIFCSC---TASPGSSG--VCDRDCHC 86
            F K   +P  F  I+ N+YLT+R K +   D   + C C   +    + G   C  DC  
Sbjct: 1268 FAKQGKMPCYFDLIEENVYLTERKKSKSHRDIKRMLCECPPLSKEERAQGEVACGEDCLN 1327

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
             +L+  CSS C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GE
Sbjct: 1328 RLLMIECSSRCPNGDFCSNRRFQKKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1387

Query: 147  VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
            V+D +  + R+ +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1388 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1447

Query: 207  DGETRIGIFATRDIKKGENLTYDYQYE 233
            +G+ R+G F T+ +  G  LT+DYQ++
Sbjct: 1448 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1474


>gi|384253874|gb|EIE27348.1| hypothetical protein COCSUDRAFT_55364 [Coccomyxa subellipsoidea
           C-169]
          Length = 1591

 Score =  142 bits (357), Expect = 2e-31,   Method: Composition-based stats.
 Identities = 71/163 (43%), Positives = 94/163 (57%), Gaps = 4/163 (2%)

Query: 74  PGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADE 132
           P  +G C  +C   +    C    C CG+ C N+PF   P  K +L  TE  G G+ A E
Sbjct: 477 PERTG-CGENCLNRLSYIHCDPKQCPCGDYCSNRPFHLLPQPKTELFLTEDRGWGVKATE 535

Query: 133 DIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYIN 192
            I RG F++EY GEVI++  C  R+ + K  G  +FY+ E+   ++IDA  KGN +R+IN
Sbjct: 536 HIPRGTFIVEYAGEVIEEHECRRRMAQAKVTGLQHFYMMELAPGLIIDARVKGNMARFIN 595

Query: 193 HSCCPNTEMQKW--IIDGETRIGIFATRDIKKGENLTYDYQYE 233
            SC PN E QKW     GE RIGIFA  DI+ G  L YDYQ++
Sbjct: 596 SSCAPNCESQKWHDAATGEIRIGIFAADDIEPGTELAYDYQFQ 638


>gi|413937237|gb|AFW71788.1| hypothetical protein ZEAMMB73_686749 [Zea mays]
          Length = 1815

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 76/194 (39%), Positives = 105/194 (54%), Gaps = 1/194 (0%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC-SSGCK 98
            K   +  ++ N++L +  + +  D+ + C+C         C   C   ML   C    C 
Sbjct: 993  KPPSWTLVRSNLFLHRNRRTQSIDESMVCNCKPPQDGRMGCRDGCLNRMLNIECVKRTCP 1052

Query: 99   CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
            CG  C N+ FQ R   K++   + K G G+   ED+  G F+IEYVGEV+D  + E R  
Sbjct: 1053 CGEQCSNQKFQRRSYAKLRWFHSGKKGYGLQLQEDVTEGRFLIEYVGEVLDITSYESRQR 1112

Query: 159  KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
                 G+ +FY   +N   VIDA  KGN  R+INHSC PN   +KW+++GE  IGIFA R
Sbjct: 1113 YYACKGQKHFYFMALNGGEVIDACTKGNLGRFINHSCSPNCCTEKWMVNGEVCIGIFALR 1172

Query: 219  DIKKGENLTYDYQY 232
             IKKGE LT+DY Y
Sbjct: 1173 SIKKGEELTFDYNY 1186


>gi|241998002|ref|XP_002433644.1| set domain protein, putative [Ixodes scapularis]
 gi|215495403|gb|EEC05044.1| set domain protein, putative [Ixodes scapularis]
          Length = 729

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 75/196 (38%), Positives = 112/196 (57%), Gaps = 6/196 (3%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGI-FCSCTASPGSSGVCDRDCHCGMLLSSCSSG-C 97
           K  P+  I+ N  + +     L+   +  C CTA       C  DC   +LL  C    C
Sbjct: 500 KPAPFRMIRVNRPVGQVAVPALDLRTVSVCVCTAQ----DPCQADCLNRLLLYECRPDLC 555

Query: 98  KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
             G  C N+ F  R   ++ +++ E  G G+  D+ +  G+FV+EYVGE+I++Q CE RL
Sbjct: 556 PAGEHCKNQHFLRREYAQVTVIRAEGRGWGLRTDQALTAGDFVMEYVGEIINEQECERRL 615

Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
            ++     +NFY   ++RD +IDA  +GN SR++NHSC PN E QKW ++G+TR+GIFA 
Sbjct: 616 SRLHLEHSSNFYFLTLDRDRIIDAGPRGNLSRFMNHSCDPNCETQKWTVNGDTRVGIFAI 675

Query: 218 RDIKKGENLTYDYQYE 233
           RDI  G  LT++Y  +
Sbjct: 676 RDIAPGTELTFNYNLD 691


>gi|348535504|ref|XP_003455240.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Oreochromis niloticus]
          Length = 2122

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 73/197 (37%), Positives = 114/197 (57%), Gaps = 2/197 (1%)

Query: 38   KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG- 96
            KP    ++ + R I   + I   L  +   C+C A   +    D +C   ML+  C    
Sbjct: 1570 KPPPYRHIRVNRPIGKVQIITADL-SEVPRCNCKALDENPCGIDSECINRMLMYECHPQV 1628

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R    +++ +T  CG G+    DI++G F+ EYVGEVID++ C  R
Sbjct: 1629 CAAGDRCQNQSFTKRQYTPVEIFRTLSCGWGLRGLSDIRKGAFISEYVGEVIDEEECRAR 1688

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +   +     NFY+  +++D +IDA  KGN++R++NHSC PN E QKW ++G+TR+G+FA
Sbjct: 1689 IRHAQENDICNFYMLTLDKDRIIDAGPKGNQARFMNHSCQPNCETQKWTVNGDTRVGLFA 1748

Query: 217  TRDIKKGENLTYDYQYE 233
             +D+ KGE LT++Y  E
Sbjct: 1749 LQDVPKGEELTFNYNLE 1765


>gi|432901504|ref|XP_004076868.1| PREDICTED: uncharacterized protein LOC101161079 [Oryzias latipes]
          Length = 2214

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 78/198 (39%), Positives = 115/198 (58%), Gaps = 2/198 (1%)

Query: 40   KAIPYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-C 97
            K  PY  IK N  + K +I      +   C+C A+  S    D +C   MLL  C+   C
Sbjct: 1749 KPPPYKHIKVNRPIGKVQIFTADLSEVPRCNCKATDESPCGSDSECINRMLLYECNPQVC 1808

Query: 98   KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
              G  CLN+ F  R   ++++ +T   G G+    DIK+G+FV EYVGEVID++ C  R+
Sbjct: 1809 PAGEKCLNQAFTKRQYSQVEIFRTLSRGWGLRCVHDIKKGQFVSEYVGEVIDEEECRSRI 1868

Query: 158  WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
               +     NFY+  +++D +IDA  KGN++R++NH C PN E QKW ++G+TR+G+FA 
Sbjct: 1869 RHAQENDICNFYMLTLDKDRIIDAGPKGNEARFMNHCCQPNCETQKWTVNGDTRVGLFAL 1928

Query: 218  RDIKKGENLTYDYQYEFL 235
             D+K G  LT++Y  E L
Sbjct: 1929 VDVKAGTELTFNYNLECL 1946


>gi|410898830|ref|XP_003962900.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
            [Takifugu rubripes]
          Length = 1329

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 79/207 (38%), Positives = 113/207 (54%), Gaps = 4/207 (1%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY FIK N  + K ++    D      C   P     C  + +C   ML   C    
Sbjct: 947  KPPPYKFIKVNKPVGK-VQVYAADVSEIPKCNCKPSDERPCGFESECLNRMLQYECHPQV 1005

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R     K+++T   G G++   DIK+GEFV EY+GE+ID++ C  R
Sbjct: 1006 CPSGERCGNQDFTQRLYPDTKIIKTPGKGWGLITLRDIKKGEFVNEYIGELIDEEECRAR 1065

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +   +    TNFY+  I++D +IDA  KGN SR++NHSC PN E QKW ++G+TR+G+FA
Sbjct: 1066 IKYAQENNVTNFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFA 1125

Query: 217  TRDIKKGENLTYDYQYEFLHDSLIAYC 243
              D+  G  LT++Y  + L +   A C
Sbjct: 1126 ICDVPAGTELTFNYNLDCLGNEKTACC 1152


>gi|426232375|ref|XP_004010202.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
            N-methyltransferase NSD2 [Ovis aries]
          Length = 1273

 Score =  141 bits (356), Expect = 2e-31,   Method: Composition-based stats.
 Identities = 78/196 (39%), Positives = 110/196 (56%), Gaps = 4/196 (2%)

Query: 43   PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG-CKC 99
            PY  IK N    K ++    D      C   P     C  +  C   ML+  C    C  
Sbjct: 936  PYKHIKVNKPYGK-VQVHTADISEIPKCNCKPTDENPCGSESQCLNRMLMFECHPQVCPA 994

Query: 100  GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
            G SC N+ F  R   + K+V+T+  G G+VA  DI++GEFV EYVGE+ID++ C  R+ +
Sbjct: 995  GESCQNQCFTKRQYPETKIVRTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMARIKR 1054

Query: 160  MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
                  T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA  D
Sbjct: 1055 AHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCD 1114

Query: 220  IKKGENLTYDYQYEFL 235
            I  G  LT++Y  + L
Sbjct: 1115 IPAGTELTFNYNLDCL 1130


>gi|47226564|emb|CAG08580.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1404

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 79/194 (40%), Positives = 110/194 (56%), Gaps = 4/194 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY FIK N  + K ++  + D      C   P     C     C   ML   C    
Sbjct: 1059 KPPPYKFIKSNKPVGK-VQMHIADLSEVQRCNCRPTDEHPCGLQSQCLNRMLQYECHPQV 1117

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G++C N+ F  R   + ++V+T   G G+ A++ IK+GEFVIEYVGEVID + C++R
Sbjct: 1118 CPAGDNCENQCFTKRLYAETEVVKTADRGWGLKANQPIKKGEFVIEYVGEVIDAEECQQR 1177

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D VIDA  KGN SR+INHSC PN E QKW ++G+  IG+FA
Sbjct: 1178 IKRAHENHMTNFYMLTLTKDRVIDAGQKGNLSRFINHSCSPNCETQKWTVNGDVHIGLFA 1237

Query: 217  TRDIKKGENLTYDY 230
              DI+    LT++Y
Sbjct: 1238 LCDIETDTELTFNY 1251


>gi|413937236|gb|AFW71787.1| hypothetical protein ZEAMMB73_686749 [Zea mays]
          Length = 1756

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 76/194 (39%), Positives = 105/194 (54%), Gaps = 1/194 (0%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC-SSGCK 98
            K   +  ++ N++L +  + +  D+ + C+C         C   C   ML   C    C 
Sbjct: 934  KPPSWTLVRSNLFLHRNRRTQSIDESMVCNCKPPQDGRMGCRDGCLNRMLNIECVKRTCP 993

Query: 99   CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
            CG  C N+ FQ R   K++   + K G G+   ED+  G F+IEYVGEV+D  + E R  
Sbjct: 994  CGEQCSNQKFQRRSYAKLRWFHSGKKGYGLQLQEDVTEGRFLIEYVGEVLDITSYESRQR 1053

Query: 159  KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
                 G+ +FY   +N   VIDA  KGN  R+INHSC PN   +KW+++GE  IGIFA R
Sbjct: 1054 YYACKGQKHFYFMALNGGEVIDACTKGNLGRFINHSCSPNCCTEKWMVNGEVCIGIFALR 1113

Query: 219  DIKKGENLTYDYQY 232
             IKKGE LT+DY Y
Sbjct: 1114 SIKKGEELTFDYNY 1127


>gi|426229361|ref|XP_004008759.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Ovis aries]
          Length = 2698

 Score =  141 bits (355), Expect = 2e-31,   Method: Composition-based stats.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1898 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 1957

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1958 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2017

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 2018 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2066


>gi|440898362|gb|ELR49876.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Bos grunniens mutus]
          Length = 2698

 Score =  141 bits (355), Expect = 2e-31,   Method: Composition-based stats.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1898 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 1957

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1958 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2017

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 2018 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2066


>gi|10438794|dbj|BAB15346.1| unnamed protein product [Homo sapiens]
          Length = 1069

 Score =  141 bits (355), Expect = 2e-31,   Method: Composition-based stats.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
           C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 268 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 327

Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
           G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 328 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 387

Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
            +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 388 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 436


>gi|291227185|ref|XP_002733567.1| PREDICTED: HSPC069-like [Saccoglossus kowalevskii]
          Length = 2376

 Score =  141 bits (355), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 79/196 (40%), Positives = 110/196 (56%), Gaps = 6/196 (3%)

Query: 44   YVFIKRNIYLTKR--IKRRLEDDGIFCSCT--ASPGSSGV--CDRDCHCGMLLSSCSSGC 97
            Y  I  N+YLT+R   K R E   + C C+  A     G+  C  DC   +L+  C+S C
Sbjct: 1089 YEDIAENVYLTERKKSKARKEIKRMQCDCSTCAEDRDMGILACGDDCLNRLLMIECTSRC 1148

Query: 98   KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
             CG+ C N+ FQ R   K+++ +T   G G+    +I  G+FV+EYVGEV++    + R 
Sbjct: 1149 PCGDYCTNRSFQRRENAKVEIFKTPWKGFGLRTCAEIPEGKFVLEYVGEVLNYSEFKSRT 1208

Query: 158  WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
                     ++Y   +  D +IDAT KGN SR+INHSC PN E QKW ++G  R+G F  
Sbjct: 1209 KHYNKDNRKHYYFMALTSDEIIDATKKGNVSRFINHSCDPNCETQKWTVNGHIRVGFFTK 1268

Query: 218  RDIKKGENLTYDYQYE 233
            R I  GE LT+DYQ+E
Sbjct: 1269 RAIPAGEELTFDYQFE 1284


>gi|119895257|ref|XP_592234.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific, partial [Bos taurus]
          Length = 2389

 Score =  141 bits (355), Expect = 3e-31,   Method: Composition-based stats.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1589 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 1648

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1649 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1708

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 1709 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1757


>gi|157112020|ref|XP_001657377.1| huntingtin interacting protein [Aedes aegypti]
 gi|108878208|gb|EAT42433.1| AAEL006013-PA [Aedes aegypti]
          Length = 2367

 Score =  141 bits (355), Expect = 3e-31,   Method: Composition-based stats.
 Identities = 75/192 (39%), Positives = 107/192 (55%), Gaps = 7/192 (3%)

Query: 47   IKRNIYLTKRIKRRLEDDGIFCSCTASP-----GSSGVCDRDCHCGMLLSSCSSGCKCGN 101
            I+ N+Y   ++  + E   + C C  +      G  G C  DC   +L+  C S C  G 
Sbjct: 1202 IRENMYHCDKVISK-EAKKMNCDCFLTTEEIDRGELG-CGEDCLNRLLMIECGSRCTIGE 1259

Query: 102  SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
             C NK FQ       ++ +TEK G GI A  +I  G+F++EYVGEV++ +  +ER     
Sbjct: 1260 RCTNKRFQKLEYANCQVFRTEKKGFGIQASTEIVPGDFIMEYVGEVLNSEQFDERAELYS 1319

Query: 162  HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
                 ++Y   +  D +IDAT KGN SR+INHSC PN E QKW ++GE RIG F T+ I 
Sbjct: 1320 KEKNQHYYFMALRSDAIIDATTKGNISRFINHSCDPNAETQKWTVNGELRIGFFCTKYIM 1379

Query: 222  KGENLTYDYQYE 233
             GE +T+DYQ++
Sbjct: 1380 PGEEITFDYQFQ 1391


>gi|149039889|gb|EDL94005.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
            CRA_b [Rattus norvegicus]
          Length = 2586

 Score =  141 bits (355), Expect = 3e-31,   Method: Composition-based stats.
 Identities = 70/169 (41%), Positives = 103/169 (60%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R    +++ +T + G 
Sbjct: 1790 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPDVEIFRTLQRGW 1849

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1850 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1909

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 1910 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1958


>gi|354471955|ref|XP_003498206.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Cricetulus griseus]
          Length = 2690

 Score =  141 bits (355), Expect = 3e-31,   Method: Composition-based stats.
 Identities = 70/169 (41%), Positives = 103/169 (60%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R    +++ +T + G 
Sbjct: 1898 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPDVEIFRTLQRGW 1957

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1958 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2017

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 2018 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2066


>gi|221058949|ref|XP_002260120.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
            knowlesi strain H]
 gi|193810193|emb|CAQ41387.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
            knowlesi strain H]
          Length = 2872

 Score =  140 bits (354), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 82/216 (37%), Positives = 119/216 (55%), Gaps = 21/216 (9%)

Query: 42   IPYVFIKRNIYLTKRIKRRL--EDDGIFCSCTASPGSSGVCD-RDCHCGMLLSSCSSGCK 98
            + + +I +N+YL  + K  L  + D   C C       G CD   C+  +    CS   +
Sbjct: 2371 LNFEYISKNVYLNDQNKNLLACKSDDYRCLC------QGECDPYSCYNSLSKIQCSKN-R 2423

Query: 99   CG-------NSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQ 151
            C          C N+PF+   +K +++ QTE+ G G+ +  DIK GE + EYVGEV+  +
Sbjct: 2424 CNLPIQIQDKKCFNRPFRQSAIKDLEIRQTERTGYGVFSKRDIKNGELICEYVGEVLAKK 2483

Query: 152  TCEERLW----KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIID 207
              EER+     + K     N+Y  +INRD+ ID+  KGN SR++NHSC PN+  QKWI+ 
Sbjct: 2484 EFEERVEAYQEESKKTNMYNWYSIQINRDVHIDSRKKGNISRFVNHSCSPNSVSQKWIVR 2543

Query: 208  GETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
            G  RIGIFA RDI  GE +TY+Y Y F+ ++    C
Sbjct: 2544 GFYRIGIFAQRDIPAGEEITYNYSYNFVFNNFECLC 2579


>gi|344240382|gb|EGV96485.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Cricetulus griseus]
          Length = 2318

 Score =  140 bits (354), Expect = 3e-31,   Method: Composition-based stats.
 Identities = 70/169 (41%), Positives = 103/169 (60%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R    +++ +T + G 
Sbjct: 1526 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPDVEIFRTLQRGW 1585

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1586 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1645

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 1646 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1694


>gi|432114829|gb|ELK36567.1| Putative histone-lysine N-methyltransferase NSD2 [Myotis davidii]
          Length = 1037

 Score =  140 bits (354), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 79/199 (39%), Positives = 111/199 (55%), Gaps = 4/199 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
           K  PY  IK N    K ++    D      C   P     C  D  C   ML+  C    
Sbjct: 601 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDDSPCGLDSECLNRMLMFECHPQV 659

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G +C N+ F  R   + K+V+T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 660 CPAGEACQNQSFSKRQYPETKIVRTDGKGWGLVATRDIRKGEFVNEYVGELIDEEECMAR 719

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +   +    T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 720 IKHAQENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 779

Query: 217 TRDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  + L
Sbjct: 780 VCDIPAGTELTFNYNLDCL 798


>gi|348675982|gb|EGZ15800.1| hypothetical protein PHYSODRAFT_263017 [Phytophthora sojae]
          Length = 823

 Score =  140 bits (354), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 72/167 (43%), Positives = 100/167 (59%), Gaps = 3/167 (1%)

Query: 80  CDRDCHCGMLLSSCS-SGCKCGN--SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKR 136
           C   C+  ML  SCS   C   +   C N+  + R +K +++      G G+VA+E I  
Sbjct: 198 CGEGCYNRMLFISCSDETCSAPDLSMCSNRAIKRRELKSVRVEYIPGPGFGLVANEKINA 257

Query: 137 GEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCC 196
           GEF+IEYVGEVIDD  CE R+ + +  GE NFY+ E+ +++VIDA Y+ N SR+INH C 
Sbjct: 258 GEFIIEYVGEVIDDIECERRMIQYRDNGEVNFYMMELEKNIVIDAKYRSNDSRFINHCCD 317

Query: 197 PNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
           PN+  QKW +DG  RIGIFA R+I   E +T DY +    ++    C
Sbjct: 318 PNSVTQKWNVDGMQRIGIFARRNIAPDEEITIDYNFSHFGEAADCKC 364


>gi|291387888|ref|XP_002710468.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 1
            [Oryctolagus cuniculus]
          Length = 2700

 Score =  140 bits (354), Expect = 4e-31,   Method: Composition-based stats.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1898 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 1957

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1958 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2017

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 2018 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2066


>gi|383864320|ref|XP_003707627.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific-like [Megachile rotundata]
          Length = 1302

 Score =  140 bits (354), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 82/192 (42%), Positives = 109/192 (56%), Gaps = 2/192 (1%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CK 98
            K  PYV +K N  +   +K    +  + C C A   +      DC   +LL  CS G C 
Sbjct: 834  KPPPYVKLKVNKPVGN-VKPAEVESIVACDCDAEWNNPCAPGTDCLNRILLVECSPGICP 892

Query: 99   CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
             G  C N+ F  R    M+   T   G G+ + E IK G+FVIEYVGEVID+   + RL 
Sbjct: 893  AGPKCNNQAFVRRQYPAMEPFHTVARGWGLRSLEFIKAGQFVIEYVGEVIDEAEYKRRLH 952

Query: 159  KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
            + K L   NFY   I+ + +IDA  KGN SR++NHSC PN E QKW ++G+TRIG+FA  
Sbjct: 953  RKKELKNENFYFLTIDNNRMIDAEPKGNLSRFMNHSCSPNCETQKWTVNGDTRIGLFALC 1012

Query: 219  DIKKGENLTYDY 230
            DI+ GE LT++Y
Sbjct: 1013 DIEPGEELTFNY 1024


>gi|359067302|ref|XP_002689078.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
           lysine-20 specific [Bos taurus]
          Length = 1470

 Score =  140 bits (354), Expect = 4e-31,   Method: Composition-based stats.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
           C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 670 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 729

Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
           G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 730 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 789

Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
            +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 790 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 838


>gi|332216412|ref|XP_003257344.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Nomascus
            leucogenys]
          Length = 2499

 Score =  140 bits (354), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
            +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 1458 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1517

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R
Sbjct: 1518 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1577

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 1578 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1637

Query: 217  TRDIKKGENLTYDYQYE 233
            T+ +  G  LT+DYQ++
Sbjct: 1638 TKLVPSGSELTFDYQFQ 1654


>gi|296225059|ref|XP_002758501.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Callithrix
            jacchus]
          Length = 2510

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
            +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 1419 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1478

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R
Sbjct: 1479 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1538

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 1539 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1598

Query: 217  TRDIKKGENLTYDYQYE 233
            T+ +  G  LT+DYQ++
Sbjct: 1599 TKLVPSGSELTFDYQFQ 1615


>gi|410220670|gb|JAA07554.1| SET domain containing 2 [Pan troglodytes]
 gi|410261336|gb|JAA18634.1| SET domain containing 2 [Pan troglodytes]
 gi|410295964|gb|JAA26582.1| SET domain containing 2 [Pan troglodytes]
 gi|410339683|gb|JAA38788.1| SET domain containing 2 [Pan troglodytes]
          Length = 2564

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
            +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 1473 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1532

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R
Sbjct: 1533 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1592

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 1593 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1652

Query: 217  TRDIKKGENLTYDYQYE 233
            T+ +  G  LT+DYQ++
Sbjct: 1653 TKLVPSGSELTFDYQFQ 1669


>gi|297671474|ref|XP_002813857.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Pongo abelii]
          Length = 2563

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
            +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 1472 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1531

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R
Sbjct: 1532 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1591

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 1592 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1651

Query: 217  TRDIKKGENLTYDYQYE 233
            T+ +  G  LT+DYQ++
Sbjct: 1652 TKLVPSGSELTFDYQFQ 1668


>gi|397495290|ref|XP_003818492.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Pan paniscus]
          Length = 2564

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
            +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 1473 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1532

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R
Sbjct: 1533 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1592

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 1593 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1652

Query: 217  TRDIKKGENLTYDYQYE 233
            T+ +  G  LT+DYQ++
Sbjct: 1653 TKLVPSGSELTFDYQFQ 1669


>gi|114586572|ref|XP_516423.2| PREDICTED: histone-lysine N-methyltransferase SETD2 isoform 3 [Pan
            troglodytes]
          Length = 2549

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
            +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 1458 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1517

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R
Sbjct: 1518 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1577

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 1578 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1637

Query: 217  TRDIKKGENLTYDYQYE 233
            T+ +  G  LT+DYQ++
Sbjct: 1638 TKLVPSGSELTFDYQFQ 1654


>gi|195130337|ref|XP_002009608.1| GI15146 [Drosophila mojavensis]
 gi|193908058|gb|EDW06925.1| GI15146 [Drosophila mojavensis]
          Length = 1885

 Score =  140 bits (353), Expect = 4e-31,   Method: Composition-based stats.
 Identities = 71/165 (43%), Positives = 96/165 (58%), Gaps = 1/165 (0%)

Query: 80   CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
            C   C   ML+  C   C  G+ C NK FQ       ++ +T+K G GI A+  I  GEF
Sbjct: 948  CGAGCINRMLMIECGPLCTYGDRCTNKRFQQHQGWPCRVFRTKKKGCGITAEMLIPPGEF 1007

Query: 140  VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
            ++EYVGEVID +  E R      +   ++Y   +  + +IDAT KGN SRYINHSC PN 
Sbjct: 1008 IMEYVGEVIDSEEFERRQHHYSQIRNRHYYFMALRGEAIIDATVKGNISRYINHSCDPNA 1067

Query: 200  EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
            E QKW ++GE RIG F+ + I  GE +T+DYQY+ +  D+   YC
Sbjct: 1068 ETQKWTVNGELRIGFFSVKTILPGEEITFDYQYQRYGRDAQRCYC 1112


>gi|148677064|gb|EDL09011.1| mCG15806 [Mus musculus]
          Length = 2034

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 75/207 (36%), Positives = 115/207 (55%), Gaps = 9/207 (4%)

Query: 36   FIKPKAIP--YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHC 86
            F K   +P  +  I+ N+YLT+R K +   D     C  +P S          C  DC  
Sbjct: 934  FAKLGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEVACGEDCLN 993

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
             +L+  CSS C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GE
Sbjct: 994  RLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1053

Query: 147  VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
            V+D +  + R+ +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1054 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1113

Query: 207  DGETRIGIFATRDIKKGENLTYDYQYE 233
            +G+ R+G F T+ +  G  LT+DYQ++
Sbjct: 1114 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1140


>gi|405966105|gb|EKC31425.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
           specific [Crassostrea gigas]
          Length = 1079

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 113/199 (56%), Gaps = 4/199 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSC-SSG 96
           K  P+ F+K NI +   +  + +   I   C   P S   C  D DC   ML+  C  S 
Sbjct: 635 KPAPFKFVKNNIPVGSVVIPKPDLTSIP-RCECDPNSEAPCSSDTDCLNRMLMYECHPSV 693

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C+CG  C N+ FQ R        +TE  G G+  + DIK+G+FV EYVGE+ID++  + R
Sbjct: 694 CQCGEKCHNQRFQRREYPDCTPFKTEGRGWGLRTNVDIKKGQFVHEYVGELIDEEEVKRR 753

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           + +      +N+Y+  ++++ VIDA  KGN SR++NHSC PN E QKW  +G+ R+G+FA
Sbjct: 754 IDESHENNISNYYMLTLDKNRVIDAGPKGNLSRFMNHSCAPNCETQKWTANGDVRVGLFA 813

Query: 217 TRDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  E L
Sbjct: 814 IYDIPAGTELTFNYNLECL 832


>gi|403268536|ref|XP_003926329.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Saimiri
            boliviensis boliviensis]
          Length = 2057

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
            +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 966  FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1025

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R
Sbjct: 1026 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1085

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 1086 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1145

Query: 217  TRDIKKGENLTYDYQYE 233
            T+ +  G  LT+DYQ++
Sbjct: 1146 TKLVPSGSELTFDYQFQ 1162


>gi|119585214|gb|EAW64810.1| SET domain containing 2, isoform CRA_f [Homo sapiens]
          Length = 2342

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
            +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 1251 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1310

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R
Sbjct: 1311 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1370

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 1371 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1430

Query: 217  TRDIKKGENLTYDYQYE 233
            T+ +  G  LT+DYQ++
Sbjct: 1431 TKLVPSGSELTFDYQFQ 1447


>gi|20521978|dbj|BAB21823.2| KIAA1732 protein [Homo sapiens]
          Length = 1915

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
            +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 824  FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 883

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R
Sbjct: 884  CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 943

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 944  VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1003

Query: 217  TRDIKKGENLTYDYQYE 233
            T+ +  G  LT+DYQ++
Sbjct: 1004 TKLVPSGSELTFDYQFQ 1020


>gi|128485462|ref|NP_001076020.1| probable histone-lysine N-methyltransferase NSD2 [Danio rerio]
          Length = 1461

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 77/199 (38%), Positives = 112/199 (56%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  P+ +IK N     R++    D      C   P +   C  +  C   MLL  C    
Sbjct: 1078 KPPPFKYIKVN-KPCGRVQVYTADISEIPKCNCKPSTERPCSFESECLNRMLLYECHPQV 1136

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T   G G+++  DIK+GEFV EYVGE+ID++ C  R
Sbjct: 1137 CPAGERCQNQDFTKRLYPETKIIRTAGKGWGLISLRDIKKGEFVNEYVGELIDEEECRSR 1196

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +   +    T+FY+  I++D +IDA  KGN SR++NHSC PN E QKW ++G+TR+G+FA
Sbjct: 1197 IRHAQENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFA 1256

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1257 VCDIPAGTELTFNYNLDCL 1275


>gi|338714932|ref|XP_001495700.3| PREDICTED: histone-lysine N-methyltransferase SETD2 [Equus caballus]
          Length = 2064

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
            +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 973  FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1032

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R
Sbjct: 1033 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1092

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 1093 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1152

Query: 217  TRDIKKGENLTYDYQYE 233
            T+ +  G  LT+DYQ++
Sbjct: 1153 TKLVPSGSELTFDYQFQ 1169


>gi|296485540|tpg|DAA27655.1| TPA: nuclear receptor binding SET domain protein 1 [Bos taurus]
          Length = 1275

 Score =  140 bits (353), Expect = 5e-31,   Method: Composition-based stats.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
           C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 473 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 532

Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
           G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 533 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 592

Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
            +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 593 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 641


>gi|357619110|gb|EHJ71815.1| putative huntingtin interacting protein [Danaus plexippus]
          Length = 225

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 78/194 (40%), Positives = 106/194 (54%), Gaps = 5/194 (2%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSC--TASPGSSG--VCDRDCHCGMLLSSCSSGCKC 99
           Y  +  N YL +R+  R E   + C C  T      G   C  DC   +L+  C+S C  
Sbjct: 4   YQHLDENEYLCERMVSR-ETKKMICDCFMTKEELERGELACGEDCLNRLLMIECNSRCPV 62

Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
           G  C N+ FQ +    +K+   +K G G+ A  DI  GEF++EYVGEV+D     +R   
Sbjct: 63  GERCTNRRFQKKENGPLKVFYADKKGCGVEATTDITNGEFLMEYVGEVLDYDQFYKRAQA 122

Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
                  + Y   +  D VIDAT KGN SR+INHSC PN E QKW ++GE RIG F+ R+
Sbjct: 123 YSDDNNLHHYFMSLKGDTVIDATLKGNISRFINHSCEPNAETQKWTVNGELRIGFFSKRE 182

Query: 220 IKKGENLTYDYQYE 233
           I  GE +T+DYQ++
Sbjct: 183 ISAGEEITFDYQFQ 196


>gi|348534024|ref|XP_003454503.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Oreochromis
            niloticus]
          Length = 2253

 Score =  140 bits (353), Expect = 5e-31,   Method: Composition-based stats.
 Identities = 76/207 (36%), Positives = 116/207 (56%), Gaps = 9/207 (4%)

Query: 36   FIKPKAIP--YVFIKRNIYLTKRIKRRLEDD--GIFCSCTASPG---SSGV--CDRDCHC 86
            F K K +P  +  I+ N+YLT+R K +   D   + C C   P    + GV  C  DC  
Sbjct: 968  FAKSKKMPPYFYLIEDNMYLTERKKSKSHRDIKRMQCECPVLPREDRARGVLACGDDCLN 1027

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
             +L+  CSS C  G  C N+ FQ +     +++ TE  G G+ A +D+    FV+EY GE
Sbjct: 1028 RLLMIECSSRCLNGAYCSNRRFQMKQHADFEVILTEDKGWGLRAAKDLAPNTFVLEYCGE 1087

Query: 147  VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
            V+D +  + R+ +       ++Y   +  + +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1088 VLDHKEFKTRVKEYARNKNIHYYFMSLKNNEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1147

Query: 207  DGETRIGIFATRDIKKGENLTYDYQYE 233
            +G+ R+G F T+ +  G  LT+DYQ++
Sbjct: 1148 NGQLRVGFFTTKAVTAGTELTFDYQFQ 1174


>gi|60688116|gb|AAH90954.1| SETD2 protein [Homo sapiens]
          Length = 1845

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
           +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 754 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 813

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R
Sbjct: 814 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 873

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 874 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 933

Query: 217 TRDIKKGENLTYDYQYE 233
           T+ +  G  LT+DYQ++
Sbjct: 934 TKLVPSGSELTFDYQFQ 950


>gi|213624868|gb|AAI71696.1| Wolf-Hirschhorn syndrome candidate 1 [Danio rerio]
          Length = 1461

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 77/199 (38%), Positives = 112/199 (56%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  P+ +IK N     R++    D      C   P +   C  +  C   MLL  C    
Sbjct: 1078 KPPPFKYIKVN-KPCGRVQVYTADISEIPKCNCKPSTERPCSFESECLNRMLLYECHPQV 1136

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T   G G+++  DIK+GEFV EYVGE+ID++ C  R
Sbjct: 1137 CPAGERCQNQDFTKRLYPETKIIRTAGKGWGLISLRDIKKGEFVNEYVGELIDEEECRSR 1196

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +   +    T+FY+  I++D +IDA  KGN SR++NHSC PN E QKW ++G+TR+G+FA
Sbjct: 1197 IRHAQENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFA 1256

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1257 VCDIPAGTELTFNYNLDCL 1275


>gi|109658484|gb|AAI17163.1| SET domain containing 2 [Homo sapiens]
 gi|109658962|gb|AAI17165.1| SET domain containing 2 [Homo sapiens]
          Length = 2061

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
            +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 970  FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1029

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R
Sbjct: 1030 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1089

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 1090 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1149

Query: 217  TRDIKKGENLTYDYQYE 233
            T+ +  G  LT+DYQ++
Sbjct: 1150 TKLVPSGSELTFDYQFQ 1166


>gi|12697196|emb|CAC28349.1| huntingtin interacting protein 1 [Homo sapiens]
 gi|50512435|gb|AAT77612.1| HSPC069 isoform a [Homo sapiens]
          Length = 2061

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
            +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 970  FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1029

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R
Sbjct: 1030 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1089

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 1090 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1149

Query: 217  TRDIKKGENLTYDYQYE 233
            T+ +  G  LT+DYQ++
Sbjct: 1150 TKLVPSGSELTFDYQFQ 1166


>gi|340378403|ref|XP_003387717.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like
           [Amphimedon queenslandica]
          Length = 862

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 77/204 (37%), Positives = 110/204 (53%), Gaps = 11/204 (5%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDG-----IFCSCTASPGS---SGVCDRDCHCGMLLSSCSS 95
           + FI  N+Y+ +R   R   DG     + C+C   P +   S  C  +C   +L+  C S
Sbjct: 48  FDFITENVYIVERGTTR---DGKRSKRMMCTCQFDPETDNHSEACGENCLNRLLMIECGS 104

Query: 96  GCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEE 155
            C CG  C NK F       +++++TE  G G+ A  DI R  FV+EY GEV   +  E 
Sbjct: 105 RCPCGEYCTNKRFTRSSYANVEVIKTEMKGWGLKATCDISRYSFVMEYCGEVCSLEEFER 164

Query: 156 RLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIF 215
           R    +     ++Y   +  D ++DAT KGN SR+INHSC PN E QKW ++G  R+G F
Sbjct: 165 RRNIYEKESRRHYYFMSLKTDEILDATRKGNLSRFINHSCEPNCETQKWTVNGRLRVGFF 224

Query: 216 ATRDIKKGENLTYDYQYEFLHDSL 239
           A R I  GE LT+DYQ++   +S+
Sbjct: 225 ALRHIPAGEELTFDYQFQRFGESV 248


>gi|197313748|ref|NP_054878.5| histone-lysine N-methyltransferase SETD2 [Homo sapiens]
 gi|296452963|sp|Q9BYW2.3|SETD2_HUMAN RecName: Full=Histone-lysine N-methyltransferase SETD2; AltName:
            Full=HIF-1; AltName: Full=Huntingtin yeast partner B;
            AltName: Full=Huntingtin-interacting protein 1;
            Short=HIP-1; AltName: Full=Huntingtin-interacting protein
            B; AltName: Full=Lysine N-methyltransferase 3A; AltName:
            Full=SET domain-containing protein 2; Short=hSET2;
            AltName: Full=p231HBP
          Length = 2564

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
            +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 1473 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1532

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R
Sbjct: 1533 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1592

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 1593 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1652

Query: 217  TRDIKKGENLTYDYQYE 233
            T+ +  G  LT+DYQ++
Sbjct: 1653 TKLVPSGSELTFDYQFQ 1669


>gi|326674803|ref|XP_003200208.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Danio
           rerio]
          Length = 1428

 Score =  140 bits (353), Expect = 5e-31,   Method: Composition-based stats.
 Identities = 79/221 (35%), Positives = 122/221 (55%), Gaps = 10/221 (4%)

Query: 21  LKQIGNPVEFELPDWFIKPKAIPYV-FIKRNIYLTKRIKRRLEDD--GIFCSC---TASP 74
           LK+  +P  F   D   + K  PY   I+ N+YLT+R K +   D   + C C   +   
Sbjct: 482 LKEFSDP--FVWRDKAKQKKMPPYFDLIEENLYLTERKKNKSHRDIKRMQCECAILSKEE 539

Query: 75  GSSGV--CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADE 132
            + G+  C  DC   +L+  CSS C  G  C N+ FQ +     +++ TE  G G+ A +
Sbjct: 540 RARGILACGEDCLNRLLMIECSSRCLNGAYCSNRRFQMKQHADYEVILTESKGWGLRAAK 599

Query: 133 DIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYIN 192
           D++   FV+EY GEV+D +  + R+ +       ++Y   +  + +IDAT KGN SR++N
Sbjct: 600 DLQPNTFVLEYCGEVLDHREFKARVKEYARNKNIHYYFMALKNNEIIDATLKGNCSRFMN 659

Query: 193 HSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           HSC PN E QKW ++G+ RIG F T+ +  G  LT+DYQ++
Sbjct: 660 HSCEPNCETQKWTVNGQLRIGFFTTKAVTAGTELTFDYQFQ 700


>gi|431905124|gb|ELK10179.1| Histone-lysine N-methyltransferase SETD2 [Pteropus alecto]
          Length = 2482

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
            +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 1391 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1450

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R
Sbjct: 1451 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1510

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 1511 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1570

Query: 217  TRDIKKGENLTYDYQYE 233
            T+ +  G  LT+DYQ++
Sbjct: 1571 TKLVPSGSELTFDYQFQ 1587


>gi|410951014|ref|XP_003982197.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Felis catus]
          Length = 2064

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
            +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 973  FDLIEENVYLTERKKNKSHRDIKRMQCECAPLSKDERAQGEIACGEDCLNRLLMIECSSR 1032

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R
Sbjct: 1033 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1092

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 1093 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1152

Query: 217  TRDIKKGENLTYDYQYE 233
            T+ +  G  LT+DYQ++
Sbjct: 1153 TKLVPSGSELTFDYQFQ 1169


>gi|417406999|gb|JAA50136.1| Putative clathrin coat binding protein/huntingtin [Desmodus rotundus]
          Length = 2557

 Score =  140 bits (352), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
            +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 1466 FDLIEENVYLTERKKNKSHRDIKRMQCECAPLSKDERAQGEIACGEDCLNRLLMIECSSR 1525

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R
Sbjct: 1526 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1585

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 1586 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1645

Query: 217  TRDIKKGENLTYDYQYE 233
            T+ +  G  LT+DYQ++
Sbjct: 1646 TKLVPSGSELTFDYQFQ 1662


>gi|344276291|ref|XP_003409942.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Loxodonta
            africana]
          Length = 2551

 Score =  140 bits (352), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 110/197 (55%), Gaps = 7/197 (3%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
            +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 1459 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1518

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ FQ +    ++++ TEK G G+ A  D+    FV+EY GEV+D +  + R
Sbjct: 1519 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAARDLPSNTFVLEYCGEVLDHKEFKAR 1578

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 1579 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1638

Query: 217  TRDIKKGENLTYDYQYE 233
            T+ +  G  LT+DYQ++
Sbjct: 1639 TKLVPSGSELTFDYQFQ 1655


>gi|355559685|gb|EHH16413.1| hypothetical protein EGK_11693 [Macaca mulatta]
          Length = 2343

 Score =  140 bits (352), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
            +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 1252 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1311

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R
Sbjct: 1312 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1371

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 1372 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1431

Query: 217  TRDIKKGENLTYDYQYE 233
            T+ +  G  LT+DYQ++
Sbjct: 1432 TKLVPSGSELTFDYQFQ 1448


>gi|6841376|gb|AAF29041.1|AF161554_1 HSPC069 [Homo sapiens]
          Length = 591

 Score =  140 bits (352), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
           +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 23  FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDKELKGEIACGEDCLNRLLMIECSSR 82

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R
Sbjct: 83  CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 142

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 143 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 202

Query: 217 TRDIKKGENLTYDYQYE 233
           T+ +  G  LT+DYQ++
Sbjct: 203 TKLVPSGSELTFDYQFQ 219


>gi|50512437|gb|AAT77613.1| HSPC069 isoform b [Homo sapiens]
          Length = 1211

 Score =  140 bits (352), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
            +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 970  FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1029

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R
Sbjct: 1030 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1089

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 1090 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1149

Query: 217  TRDIKKGENLTYDYQYE 233
            T+ +  G  LT+DYQ++
Sbjct: 1150 TKLVPSGSELTFDYQFQ 1166


>gi|119585209|gb|EAW64805.1| SET domain containing 2, isoform CRA_a [Homo sapiens]
          Length = 1538

 Score =  140 bits (352), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
            +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 970  FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1029

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R
Sbjct: 1030 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1089

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 1090 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1149

Query: 217  TRDIKKGENLTYDYQYE 233
            T+ +  G  LT+DYQ++
Sbjct: 1150 TKLVPSGSELTFDYQFQ 1166


>gi|86278478|gb|ABC88477.1| Wolf-Hirschhorn syndrome candidate 1 protein [Danio rerio]
          Length = 1366

 Score =  140 bits (352), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 77/199 (38%), Positives = 112/199 (56%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  P+ +IK N     R++    D      C   P +   C  +  C   MLL  C    
Sbjct: 983  KPPPFKYIKVN-KPCGRVQVYTADISEIPKCNCKPSTERPCSFESECLNRMLLYECHPQV 1041

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T   G G+++  DIK+GEFV EYVGE+ID++ C  R
Sbjct: 1042 CPAGERCQNQDFTKRLYPETKIIRTAGKGWGLISLRDIKKGEFVNEYVGELIDEEECRSR 1101

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +   +    T+FY+  I++D +IDA  KGN SR++NHSC PN E QKW ++G+TR+G+FA
Sbjct: 1102 IRHAQENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFA 1161

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1162 VCDIPAGTELTFNYNLDCL 1180


>gi|148709229|gb|EDL41175.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_a [Mus
            musculus]
          Length = 2588

 Score =  140 bits (352), Expect = 6e-31,   Method: Composition-based stats.
 Identities = 70/169 (41%), Positives = 103/169 (60%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R    +++ +T + G 
Sbjct: 1793 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGVRCQNQCFSKRQYPDVEIFRTLQRGW 1852

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1853 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1912

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 1913 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1961


>gi|68565655|sp|O88491.1|NSD1_MOUSE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific; AltName: Full=H3-K36-HMTase; AltName:
            Full=H4-K20-HMTase; AltName: Full=Nuclear
            receptor-binding SET domain-containing protein 1;
            Short=NR-binding SET domain-containing protein
 gi|3329465|gb|AAC40182.1| NSD1 protein [Mus musculus]
          Length = 2588

 Score =  140 bits (352), Expect = 6e-31,   Method: Composition-based stats.
 Identities = 70/169 (41%), Positives = 103/169 (60%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R    +++ +T + G 
Sbjct: 1793 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGVRCQNQCFSKRQYPDVEIFRTLQRGW 1852

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1853 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1912

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 1913 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1961


>gi|350420879|ref|XP_003492658.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           isoform 1 [Bombus impatiens]
          Length = 1230

 Score =  140 bits (352), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 75/166 (45%), Positives = 99/166 (59%), Gaps = 1/166 (0%)

Query: 66  IFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKC 124
           + C C A   +      DC   +LL  CS G C  G  C N+ F  R    M+   T   
Sbjct: 786 VACDCDAEWENPCAPGTDCLNRILLVECSPGICPAGAKCKNQAFVQRQYPAMEPFHTVGR 845

Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
           G G+ + E IK G+FVIEYVGEVID+   + RL + K L   NFY   I+ + +IDA  K
Sbjct: 846 GWGLRSLEHIKAGQFVIEYVGEVIDEAEYKRRLHRKKELKNENFYFLTIDNNRMIDAEPK 905

Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
           GN SR++NHSC PN E QKW ++G+TRIG+FA  DI++GE LT++Y
Sbjct: 906 GNLSRFMNHSCSPNCETQKWTVNGDTRIGLFALCDIERGEELTFNY 951


>gi|356518575|ref|XP_003527954.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like [Glycine
            max]
          Length = 2037

 Score =  140 bits (352), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 77/190 (40%), Positives = 107/190 (56%), Gaps = 1/190 (0%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
            +  I  N +L +  K +  D+ + C C  S      C  +C   +L   C  G C CG+ 
Sbjct: 1110 FTHILTNEFLHRSHKTQTIDEIMVCHCKPSQEGKLGCGDECLNRILNIECVQGTCPCGDR 1169

Query: 103  CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
            C N+ FQ      +K  +  K G G+ A E++ +G+F+IEYVGEV+D Q  E R  +   
Sbjct: 1170 CSNQQFQKHKYASLKWFKCGKKGYGLKAIENVAQGQFLIEYVGEVLDMQAYEARQREYAL 1229

Query: 163  LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
             G  +FY   +N   VIDA+ KGN  R+INHSC PN   +KW+++GE  IG+FA RDIKK
Sbjct: 1230 KGHRHFYFMTLNGSEVIDASAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKK 1289

Query: 223  GENLTYDYQY 232
             E LT+DY Y
Sbjct: 1290 DEELTFDYNY 1299


>gi|443722431|gb|ELU11300.1| hypothetical protein CAPTEDRAFT_160470, partial [Capitella teleta]
          Length = 282

 Score =  140 bits (352), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 74/171 (43%), Positives = 93/171 (54%), Gaps = 7/171 (4%)

Query: 70  CTASPGSS-------GVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTE 122
           C  SP S          C  DC   ML+  C S C CG+ C NK FQ R   K +  + E
Sbjct: 3   CDCSPPSQEDISMGLAFCGEDCLNRMLMIECGSRCPCGDMCTNKRFQRRHYAKTEPFRAE 62

Query: 123 KCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDAT 182
             G G+ A  D+  G FV+EYVGEV+D      R  +      T+ Y   +N D +IDAT
Sbjct: 63  VKGWGLRATSDLSSGVFVMEYVGEVLDYPNFRLRCKQYAEDNHTHHYFMALNGDEIIDAT 122

Query: 183 YKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
            KGN SR+INHSC PN E QKW ++G+ R+G F  R I  G  LT+DYQ+E
Sbjct: 123 QKGNTSRFINHSCDPNCETQKWTVNGQLRVGFFTLRSIPAGTELTFDYQFE 173


>gi|350420881|ref|XP_003492659.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           isoform 2 [Bombus impatiens]
          Length = 1239

 Score =  140 bits (352), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 75/166 (45%), Positives = 99/166 (59%), Gaps = 1/166 (0%)

Query: 66  IFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKC 124
           + C C A   +      DC   +LL  CS G C  G  C N+ F  R    M+   T   
Sbjct: 795 VACDCDAEWENPCAPGTDCLNRILLVECSPGICPAGAKCKNQAFVQRQYPAMEPFHTVGR 854

Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
           G G+ + E IK G+FVIEYVGEVID+   + RL + K L   NFY   I+ + +IDA  K
Sbjct: 855 GWGLRSLEHIKAGQFVIEYVGEVIDEAEYKRRLHRKKELKNENFYFLTIDNNRMIDAEPK 914

Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
           GN SR++NHSC PN E QKW ++G+TRIG+FA  DI++GE LT++Y
Sbjct: 915 GNLSRFMNHSCSPNCETQKWTVNGDTRIGLFALCDIERGEELTFNY 960


>gi|328865276|gb|EGG13662.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 1418

 Score =  140 bits (352), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 83/202 (41%), Positives = 113/202 (55%), Gaps = 8/202 (3%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
            + F   NIYL ++ +   +++ I C+C A       C   C        C SG C  G+ 
Sbjct: 991  FKFTGSNIYLERKKRTEEQENIIMCNCPAE----SPCLESCLNRKSYFECHSGYCIHGDR 1046

Query: 103  CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
            C N+ FQ +   ++     EK G G+ A ++I    F+IEY GEVI  Q C ER+ + + 
Sbjct: 1047 CRNQRFQKQQYCRILPFSAEKKGWGLKAVDNIGAKTFIIEYCGEVISKQKCLERMTESE- 1105

Query: 163  LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
              E  FY   ++R   +DA+ KGN +R+INHSC PN E QKW +DGE RIGIFA RDIK+
Sbjct: 1106 -SEKYFYFLTLDRLECLDASRKGNLARFINHSCDPNCETQKWNVDGEVRIGIFAIRDIKR 1164

Query: 223  GENLTYDYQYE-FLHDSLIAYC 243
            GE LT+DY YE F     + YC
Sbjct: 1165 GEELTFDYNYERFGTSKQVCYC 1186


>gi|195392836|ref|XP_002055060.1| GJ19006 [Drosophila virilis]
 gi|194149570|gb|EDW65261.1| GJ19006 [Drosophila virilis]
          Length = 2101

 Score =  140 bits (352), Expect = 7e-31,   Method: Composition-based stats.
 Identities = 72/165 (43%), Positives = 94/165 (56%), Gaps = 1/165 (0%)

Query: 80   CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
            C   C   ML+  C   C  G  C NK FQ       ++ +TEK G GI A+  I  GEF
Sbjct: 1028 CGAGCINRMLMIECGPLCTNGERCTNKRFQLHQCWPCRVFRTEKKGCGITAELQIPPGEF 1087

Query: 140  VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
            ++EYVGEVID +  E R          ++Y   +  + +IDAT KGN SRYINHSC PN 
Sbjct: 1088 IMEYVGEVIDSEEFERRQHIYSRDRNRHYYFMALRGEAIIDATAKGNISRYINHSCDPNA 1147

Query: 200  EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
            E QKW ++GE RIG F+ + I  GE +T+DYQY+ +  D+   YC
Sbjct: 1148 ETQKWTVNGELRIGFFSVKTIMPGEEITFDYQYQRYGRDAQRCYC 1192


>gi|119585211|gb|EAW64807.1| SET domain containing 2, isoform CRA_c [Homo sapiens]
          Length = 1819

 Score =  140 bits (352), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
            +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 1251 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1310

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R
Sbjct: 1311 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1370

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 1371 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1430

Query: 217  TRDIKKGENLTYDYQYE 233
            T+ +  G  LT+DYQ++
Sbjct: 1431 TKLVPSGSELTFDYQFQ 1447


>gi|355746723|gb|EHH51337.1| hypothetical protein EGM_10693 [Macaca fascicularis]
          Length = 2343

 Score =  140 bits (352), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 110/197 (55%), Gaps = 7/197 (3%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
            +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 1252 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1311

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ FQ +    ++++ TEK G G+ A  D+    FV+EY GEV+D +  + R
Sbjct: 1312 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAARDLPSNTFVLEYCGEVLDHKEFKAR 1371

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 1372 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1431

Query: 217  TRDIKKGENLTYDYQYE 233
            T+ +  G  LT+DYQ++
Sbjct: 1432 TKLVPSGSELTFDYQFQ 1448


>gi|340372263|ref|XP_003384664.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           [Amphimedon queenslandica]
          Length = 1171

 Score =  139 bits (351), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 75/193 (38%), Positives = 109/193 (56%), Gaps = 5/193 (2%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSS-GCKCGNS 102
           +V IKRN YLT R     E +   C+ ++  G S     +C    + + CS+  C  G+ 
Sbjct: 709 FVHIKRNKYLTSRPPLETETEPCKCTESSPCGPSS----NCINRAIFTECSTENCPAGDK 764

Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
           C N+          +   T   G G+     +  G+FVIEYVGE++D    +ERL K + 
Sbjct: 765 CQNQRMLRNESVPTQTFYTGNRGWGLKTMRSLSPGDFVIEYVGEIVDMAAVQERLKKTQE 824

Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
              ++FY   + R+++IDA  K N +R+INHSC PN E QKW ++GETRIGIFA +DIK+
Sbjct: 825 ASVSSFYFLTLERNLIIDARVKSNHARFINHSCDPNCETQKWTVNGETRIGIFAIKDIKE 884

Query: 223 GENLTYDYQYEFL 235
              LT+DYQ++ L
Sbjct: 885 DTELTFDYQFDCL 897


>gi|242017983|ref|XP_002429463.1| set domain protein, putative [Pediculus humanus corporis]
 gi|212514395|gb|EEB16725.1| set domain protein, putative [Pediculus humanus corporis]
          Length = 2269

 Score =  139 bits (350), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 73/197 (37%), Positives = 113/197 (57%), Gaps = 2/197 (1%)

Query: 39   PKAIPYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG- 96
            P+  P+V IK N  + K ++    + +   C C  +  S      DC   MLL  C+   
Sbjct: 1622 PRPPPFVKIKTNRPVGKVQLGSGYQSEPQTCDCDPNYESPCGPGSDCLNRMLLVECNPNY 1681

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F+ R    ++  +TE+ G G+ +  DI +G FVIEYVGEV+DD+  + R
Sbjct: 1682 CNAGTRCKNQSFEKREYPPLEPYRTERRGWGLRSTVDIPKGSFVIEYVGEVVDDEEFKRR 1741

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + + +   + N+Y   I++D +IDA  KGN +R++NHSC PN E QKW ++ +TR+G+FA
Sbjct: 1742 MKRKQETMDNNYYFLTIDKDRIIDAGPKGNLARFMNHSCSPNCETQKWTVNMDTRVGLFA 1801

Query: 217  TRDIKKGENLTYDYQYE 233
              DI  G  LT++Y  +
Sbjct: 1802 LEDIPAGTELTFNYNLD 1818


>gi|66828443|ref|XP_647576.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|60475584|gb|EAL73519.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 898

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 81/211 (38%), Positives = 118/211 (55%), Gaps = 6/211 (2%)

Query: 25  GNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGI-FCSCTASPGSSGVCDRD 83
            NP   ++     K +   + FI RN ++ +  K   E D I  C+C+ S GS  VC  D
Sbjct: 528 ANPDYSDIAVLIQKSRNKKFGFISRNFFIERTEKILYEIDDIDICNCSKSSGS--VCGDD 585

Query: 84  CHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIE 142
           C        C+   C+ G  C N+ FQ +    +K   T K G G++A+EDI+  +F++E
Sbjct: 586 CLNRESYVECNIEHCELGKKCTNQRFQRKQYSNIKPAFTGKKGWGLIANEDIEEKQFIME 645

Query: 143 YVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQ 202
           Y GEVI  QTC  R+ + ++  E  FY   ++    +DA+ +GN +R++NHSC PN E Q
Sbjct: 646 YCGEVISKQTCLRRMKEAEN--EKFFYFLTLDSKECLDASKRGNLARFMNHSCDPNCETQ 703

Query: 203 KWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           KW + GE +IGIFA + I KG  LT+DY YE
Sbjct: 704 KWTVGGEVKIGIFAIKPIPKGTELTFDYNYE 734


>gi|302839691|ref|XP_002951402.1| histone H3 Lys 36 methyltransferase/ASH1 [Volvox carteri f.
            nagariensis]
 gi|300263377|gb|EFJ47578.1| histone H3 Lys 36 methyltransferase/ASH1 [Volvox carteri f.
            nagariensis]
          Length = 2345

 Score =  139 bits (350), Expect = 1e-30,   Method: Composition-based stats.
 Identities = 84/214 (39%), Positives = 115/214 (53%), Gaps = 16/214 (7%)

Query: 30   FELP-DWFIK--PKAIPYVFIKRNIYLTKRIKRRLEDDGI-FCSCT----ASPG--SSGV 79
            F LP DW +   P+  P   I+RN++L++   +RL  D I  C+C     A+PG      
Sbjct: 1478 FALPRDWLVTRPPRYEP---IRRNVWLSRNKPKRLPKDEINICACRPPPFAAPGELQRMG 1534

Query: 80   CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGE 138
            C ++C   +    C +  C CG  C N+        K ++  TE  G G+   E + +G 
Sbjct: 1535 CAQNCLNRLSSVLCDAKLCPCGELCSNRSLHLLRQPKTEVFLTENRGWGVRTMEPLSKGA 1594

Query: 139  FVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPN 198
            F+IEY GEVIDD+    R+   +  GE +FY+ E+   + IDA  KGN +R IN SC PN
Sbjct: 1595 FIIEYAGEVIDDRELGRRMEHARMNGEPHFYIMELAAGLYIDARRKGNIARLINSSCDPN 1654

Query: 199  TEMQKW--IIDGETRIGIFATRDIKKGENLTYDY 230
             E QKW     GE R+GIFA+RDI  GE L YDY
Sbjct: 1655 CETQKWHDASTGEIRVGIFASRDIPPGEELVYDY 1688


>gi|157278865|gb|AAI15212.1| Whsc1 protein [Danio rerio]
          Length = 486

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 77/199 (38%), Positives = 112/199 (56%), Gaps = 4/199 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
           K  P+ +IK N     R++    D      C   P +   C  +  C   MLL  C    
Sbjct: 103 KPPPFKYIKVN-KPCGRVQVYTADISEIPKCNCKPSTERPCSFESECLNRMLLYECHPQV 161

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G  C N+ F  R   + K+++T   G G+++  DIK+GEFV EYVGE+ID++ C  R
Sbjct: 162 CPAGERCQNQDFTKRLYPETKIIRTAGKGWGLISLRDIKKGEFVNEYVGELIDEEECRSR 221

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +   +    T+FY+  I++D +IDA  KGN SR++NHSC PN E QKW ++G+TR+G+FA
Sbjct: 222 IRHAQENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFA 281

Query: 217 TRDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  + L
Sbjct: 282 VCDIPAGTELTFNYNLDCL 300


>gi|242019388|ref|XP_002430143.1| histone-lysine N-methyltransferase SUVR5, putative [Pediculus
           humanus corporis]
 gi|212515234|gb|EEB17405.1| histone-lysine N-methyltransferase SUVR5, putative [Pediculus
           humanus corporis]
          Length = 1448

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 68/154 (44%), Positives = 90/154 (58%)

Query: 80  CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
           C  DC   ML   C S C  G+ C NK FQ       K+ +TEK G G+ A+ED+    F
Sbjct: 528 CGEDCLNRMLFLECGSRCALGDRCTNKRFQKLQYANCKIFKTEKKGFGLRAEEDLSGNTF 587

Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
           ++EYVGEV++ +    R+         +FY   +  D VIDAT KGN SR+INHSC PN 
Sbjct: 588 IMEYVGEVVNQKEFGRRVKMYAKENNKHFYFMALKGDAVIDATNKGNISRFINHSCDPNA 647

Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           E QKW I+GE R+G F  R +  GE +T+DYQ++
Sbjct: 648 ETQKWTINGELRVGFFTRRFVAAGEEITFDYQFQ 681


>gi|395505173|ref|XP_003756919.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Sarcophilus harrisii]
          Length = 2717

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 71/169 (42%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C AS  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1898 CNCKASDDNPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1957

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1958 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2017

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 2018 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2066


>gi|357149500|ref|XP_003575133.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like
           [Brachypodium distachyon]
          Length = 1022

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 77/196 (39%), Positives = 108/196 (55%), Gaps = 5/196 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSS-GCK 98
           +A  +  +K N +L +  + +  D+ + C+C         C   C   ML   C+   C 
Sbjct: 171 RAPSWTAVKTNSFLHRNRRTQSIDESMVCNCKPPQDGRMGCRDGCLNRMLNIECTKRTCP 230

Query: 99  CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT--CEER 156
           CG  C N+ FQ R   K+    + K G G+   E++  G F+IEYVGEV+D     C +R
Sbjct: 231 CGEYCSNQQFQRRSYAKISWFCSGKKGFGLQLKEEVTEGRFLIEYVGEVLDITAYECRQR 290

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +  K  G+ +FY   +N   VIDA  KGN  R+INHSC PN   +KW+++GE  IGIFA
Sbjct: 291 YYASK--GQKHFYFMALNGGEVIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFA 348

Query: 217 TRDIKKGENLTYDYQY 232
            R+IKKGE LT+DY Y
Sbjct: 349 MRNIKKGEELTFDYNY 364


>gi|449474840|ref|XP_002193971.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
           lysine-20 specific [Taeniopygia guttata]
          Length = 1651

 Score =  139 bits (349), Expect = 2e-30,   Method: Composition-based stats.
 Identities = 77/199 (38%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSS-G 96
           K  PY  IK N  + K ++    D      C   P     C  D  C   MLL  C    
Sbjct: 595 KPPPYKHIKVNRPVGK-VQIFTADLSEIPRCNCKPTDENPCGLDSECINRMLLYECHPLV 653

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G  C N+ F  R   ++++ +T   G G+ A  DI++GEFV EYVGE+ID++ C  R
Sbjct: 654 CPAGERCQNQCFSKRQYPEVQIFRTLARGWGLQAKTDIRKGEFVNEYVGELIDEEECRAR 713

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +   +    TNFY+  +++D +IDA  KGN +R++NH C PN E QKW ++G+TR+G+FA
Sbjct: 714 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWCVNGDTRVGLFA 773

Query: 217 TRDIKKGENLTYDYQYEFL 235
             +IK G  LT++Y  E L
Sbjct: 774 LVNIKAGTELTFNYNLECL 792


>gi|356576073|ref|XP_003556159.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Glycine
           max]
          Length = 480

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 77/194 (39%), Positives = 110/194 (56%), Gaps = 2/194 (1%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCT-ASPGSSGVCDRDCHCGMLLSSCSSG-CKCGN 101
           Y+ + +N +  +R K++ E+D   C C   +      C   C   +  + C+ G C CG 
Sbjct: 11  YIHMNQNEFCIRRHKKQKEEDIAICECKYDADDPDNACGDSCLNVLTSTECTPGYCPCGV 70

Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
            C N+ FQ     K KL +TE  G G++ADEDIK G+FVIEY GEVI  +  + R    +
Sbjct: 71  LCKNQKFQKCEYAKTKLFKTEGRGWGLLADEDIKAGQFVIEYCGEVISWKEAKRRSQAYE 130

Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
           + G  + ++  +N    IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA  DI 
Sbjct: 131 NQGLKDAFIICLNASESIDATRKGSLARFINHSCQPNCETRKWNVLGEIRVGIFAKHDIP 190

Query: 222 KGENLTYDYQYEFL 235
            G  L YDY +E+ 
Sbjct: 191 IGNELAYDYNFEWF 204


>gi|68611259|emb|CAD41011.3| OSJNBa0042L16.10 [Oryza sativa Japonica Group]
          Length = 1153

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 81/222 (36%), Positives = 121/222 (54%), Gaps = 30/222 (13%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG------MLLSSCSSG- 96
           Y+ I+ N +L +R KR+ E+D   C C  +     + D D  CG      +  + C+ G 
Sbjct: 194 YIHIETNDFLHRRHKRQKEEDIAVCECQYN-----LLDPDSACGDRCLNVLTSTECTPGY 248

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDI------------------KRGE 138
           C CG  C N+ FQ       +LV+TE  G G++ADE+I                  + G+
Sbjct: 249 CLCGVYCKNQRFQKSQYAATRLVKTEGRGWGLLADENIMVTEFTLILWSANVVKYIQAGQ 308

Query: 139 FVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPN 198
           FV+EY GEVI  +  + R    ++ G T+ Y+  +N D  IDAT KG+ +R+INHSC PN
Sbjct: 309 FVMEYCGEVISWKEAKRRSQAYENQGLTDAYIIYLNADESIDATKKGSLARFINHSCQPN 368

Query: 199 TEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLI 240
            E +KW + GE R+GIFA +DI  G  L+YDY +E+   +++
Sbjct: 369 CETRKWNVLGEVRVGIFAKQDIPIGTELSYDYNFEWFGGAMV 410


>gi|326488341|dbj|BAJ93839.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 1070

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 75/194 (38%), Positives = 106/194 (54%), Gaps = 1/194 (0%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSS-GCK 98
           +A  +  ++ N YL +  + +  D+ + C+C         C   C   ML   C+   C 
Sbjct: 269 RAPSWTNLRSNAYLHRNRRNQSIDESMVCNCKHPQDGRMGCRDGCLNRMLNIECAKRTCP 328

Query: 99  CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
           C   C N+ FQ R   K++   + K G G+   E++  G F+IEYVGEV+D  T E R  
Sbjct: 329 CEEQCSNQQFQRRNYAKIEWFHSGKKGYGLQLLEEVSEGRFLIEYVGEVLDITTYESRQR 388

Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
                G+ +FY   ++   VIDA  KGN  R+INHSC PN   +KW+++GE  IGIFA R
Sbjct: 389 DYASKGKKHFYFMALDGGEVIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFAMR 448

Query: 219 DIKKGENLTYDYQY 232
           +IKKGE LT+DY Y
Sbjct: 449 NIKKGEELTFDYNY 462


>gi|350587283|ref|XP_003128857.3| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Sus
            scrofa]
          Length = 1338

 Score =  138 bits (348), Expect = 2e-30,   Method: Composition-based stats.
 Identities = 69/169 (40%), Positives = 103/169 (60%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C  +  +    D +C   ML+  C    C  G  C N+ F  R   + K+++T+  G 
Sbjct: 989  CNCKPTDENPCGLDSECLNRMLMFECHPQVCPAGEFCQNQCFTKRQYPETKIIKTDGKGW 1048

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+VA  DI++GEFV EYVGE+ID++ C  R+ + +    T FY+  I++D +IDA  KGN
Sbjct: 1049 GLVAKRDIRKGEFVNEYVGELIDEEECMARIRRAQEHDITRFYMLTIDKDRIIDAGPKGN 1108

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             SR++NHSC PN E  KW ++G+TR+G+FA  DI  G  LT++Y  + L
Sbjct: 1109 YSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCL 1157


>gi|403072167|pdb|4FMU|A Chain A, Crystal Structure Of Methyltransferase Domain Of Human Set
           Domain- Containing Protein 2 Compound: Pr-Snf
 gi|407944022|pdb|4H12|A Chain A, The Crystal Structure Of Methyltransferase Domain Of Human
           Set Domain- Containing Protein 2 In Complex With
           S-Adenosyl-L-Homocysteine
          Length = 278

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
           +  I+ N+YLT+R K +   D     C  +P S          C  DC   +L+  CSS 
Sbjct: 40  FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 99

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GEV+D +  + R
Sbjct: 100 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 159

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           + +       ++Y   +  D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F 
Sbjct: 160 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 219

Query: 217 TRDIKKGENLTYDYQYE 233
           T+ +  G  LT+DYQ++
Sbjct: 220 TKLVPSGSELTFDYQFQ 236


>gi|126303359|ref|XP_001372863.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Monodelphis
            domestica]
          Length = 1435

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 82/222 (36%), Positives = 118/222 (53%), Gaps = 6/222 (2%)

Query: 17   FNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS 76
            F++L  Q  N    EL     KP   PY  IK N  + K ++ ++ D      C   P  
Sbjct: 1046 FHELKAQRENKEALELERNSRKPP--PYKHIKANKVIGK-VQIQVADLSEIPRCNCKPAD 1102

Query: 77   SGVCDRDCHC--GMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADED 133
               C  +  C   ML   C    C  G+ C N+ F  R     ++++TE+ G G+     
Sbjct: 1103 ENPCGLESECLNRMLQYECHPQVCPAGDRCQNQCFTKRLYPDAEVIKTERRGWGLRTKRS 1162

Query: 134  IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
            IK+GEFV EYVGE+ID++ C  R+ +      TNFY+  + +D +IDA  KGN SR++NH
Sbjct: 1163 IKKGEFVNEYVGELIDEEECRLRIKRAHENSITNFYMLTVTKDRIIDAGPKGNYSRFMNH 1222

Query: 194  SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
            SC PN E QKW ++G+ R+G+FA  DI  G  LT++Y  + L
Sbjct: 1223 SCNPNCETQKWTVNGDIRVGLFALCDIPAGVELTFNYNLDCL 1264


>gi|334311241|ref|XP_003339591.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
            H3 lysine-36 and H4 lysine-20 specific-like [Monodelphis
            domestica]
          Length = 2705

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 71/169 (42%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C AS  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1897 CNCKASDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1956

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1957 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2016

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 2017 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2065


>gi|363739108|ref|XP_414538.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Gallus gallus]
          Length = 2412

 Score =  138 bits (348), Expect = 2e-30,   Method: Composition-based stats.
 Identities = 77/199 (38%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSS-G 96
            K  PY  IK N  + K ++    D      C   P     C  D  C   MLL  C    
Sbjct: 1581 KPPPYKHIKVNRPVGK-VQIFTADLSEIPRCNCKPTDENPCGLDSECINRMLLYECHPLV 1639

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   ++++ +T   G G+ A  DI++GEFV EYVGE+ID++ C  R
Sbjct: 1640 CPAGERCQNQCFSKRQYPEVQIFRTLARGWGLQAKTDIRKGEFVNEYVGELIDEEECRAR 1699

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +   +    TNFY+  +++D +IDA  KGN +R++NH C PN E QKW ++G+TR+G+FA
Sbjct: 1700 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWCVNGDTRVGLFA 1759

Query: 217  TRDIKKGENLTYDYQYEFL 235
              +IK G  LT++Y  E L
Sbjct: 1760 IVNIKAGTELTFNYNLECL 1778


>gi|157126101|ref|XP_001654536.1| set domain protein [Aedes aegypti]
 gi|108873380|gb|EAT37605.1| AAEL010414-PA [Aedes aegypti]
          Length = 1480

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 74/195 (37%), Positives = 106/195 (54%), Gaps = 1/195 (0%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSS-GCKCGNS 102
            +V IK N Y+      + E DG  C C A+       D +C    L+  C+   C  G  
Sbjct: 1149 FVKIKSNRYVPPLKAPKDEMDGNVCVCKATDSDPCGPDSNCINRALMVECNPKSCPAGEL 1208

Query: 103  CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
            C N+ F+ R    +   +  + G G+VA EDI++G+FVIEYVGEVI ++  E RL     
Sbjct: 1209 CQNQCFEKRQYPSLAARRIPQKGWGLVAQEDIRQGQFVIEYVGEVISNEELERRLQHKVA 1268

Query: 163  LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
              + N+Y   ++ ++ IDA  KGN +R+INHSC PN E   W + G   +G+FA  DIK 
Sbjct: 1269 QKDENYYFLTVDSELTIDAGPKGNLARFINHSCEPNCETMLWTVGGAQSVGLFAIMDIKA 1328

Query: 223  GENLTYDYQYEFLHD 237
            GE LT++Y +E   D
Sbjct: 1329 GEELTFNYNFESKSD 1343


>gi|444511191|gb|ELV09829.1| Histone-lysine N-methyltransferase NSD3 [Tupaia chinensis]
          Length = 1235

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 80/222 (36%), Positives = 117/222 (52%), Gaps = 6/222 (2%)

Query: 17   FNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS 76
            F +L  Q  +    E+     KP   PY  IK N  + K ++ ++ D      C   P  
Sbjct: 846  FQELKAQRESKEALEMEKHSRKPP--PYKHIKANKVIGK-VQIQVADLAEIPRCNCKPAD 902

Query: 77   SGVCDRDCHC--GMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADED 133
               C  +  C   ML   C    C  G+ C N+ F  R     ++++TE+ G G+     
Sbjct: 903  ENPCGLESECLNRMLQYECHPQVCPAGDRCQNQCFTKRLYPDAEVIKTERRGWGLRTKRS 962

Query: 134  IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
            IK+GEFV EYVGE+ID++ C  R+ +      TNFY+  + +D +IDA  KGN SR++NH
Sbjct: 963  IKKGEFVNEYVGELIDEEECRLRIQRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNH 1022

Query: 194  SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
            SC PN E QKW ++G+ R+G+FA  DI  G  LT++Y  + L
Sbjct: 1023 SCNPNCETQKWTVNGDVRVGLFALCDIPAGMELTFNYNLDCL 1064


>gi|94732456|emb|CAK03662.1| novel protein similar to vertebrate Wolf-Hirschhorn syndrome
           candidate 1 (WHSC1) [Danio rerio]
          Length = 728

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 77/199 (38%), Positives = 112/199 (56%), Gaps = 4/199 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
           K  P+ +IK N     R++    D      C   P +   C  +  C   MLL  C    
Sbjct: 345 KPPPFKYIKVN-KPCGRVQVYTADISEIPKCNCKPSTERPCSFESECLNRMLLYECHPQV 403

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G  C N+ F  R   + K+++T   G G+++  DIK+GEFV EYVGE+ID++ C  R
Sbjct: 404 CPAGERCQNQDFTKRLYPETKIIRTAGKGWGLISLRDIKKGEFVNEYVGELIDEEECRSR 463

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +   +    T+FY+  I++D +IDA  KGN SR++NHSC PN E QKW ++G+TR+G+FA
Sbjct: 464 IRHAQENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFA 523

Query: 217 TRDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  + L
Sbjct: 524 VCDIPAGTELTFNYNLDCL 542


>gi|417406466|gb|JAA49891.1| Putative histone-lysine n-methyltransferase nsd3-like isoform 3
            [Desmodus rotundus]
          Length = 1438

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 76/199 (38%), Positives = 108/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  D  C   ML   C    
Sbjct: 1070 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLDSECLNRMLQYECHPQV 1128

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1129 CPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1188

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1249 LCDIPAGMELTFNYNLDCL 1267


>gi|384250559|gb|EIE24038.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
           C-169]
          Length = 295

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 72/195 (36%), Positives = 104/195 (53%), Gaps = 1/195 (0%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
           +  I+ NIY  +  K + EDD + C C         C  DC   ML   C  G C CG  
Sbjct: 11  WQLIRENIYTHRAPKVQDEDDVMLCHCRLPNDGGPGCGPDCLNRMLNMECVPGYCPCGER 70

Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
           C N+ F  R   K++  +    G G+ A +D+  G+F++EY+GEV++++    R    + 
Sbjct: 71  CSNQQFSKRQYAKLEKRRAGAKGFGLFATQDLVAGQFIVEYIGEVLEEEEYLRRKDYYQE 130

Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
            G+ ++Y   I    VIDA  KG   R+INHSC PN E QKW++ GE  IG++A +DI  
Sbjct: 131 SGQRHYYFMNIGNGEVIDAARKGALGRFINHSCNPNCETQKWVVRGELAIGLYALKDIPA 190

Query: 223 GENLTYDYQYEFLHD 237
           G  LT+DY +E   D
Sbjct: 191 GVELTFDYNFERYGD 205


>gi|302846429|ref|XP_002954751.1| histone H3 methyltransferase [Volvox carteri f. nagariensis]
 gi|300259934|gb|EFJ44157.1| histone H3 methyltransferase [Volvox carteri f. nagariensis]
          Length = 261

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 81/210 (38%), Positives = 113/210 (53%), Gaps = 4/210 (1%)

Query: 38  KPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSS 95
           K + IP V+  +K N+   ++ K + EDD + C C         C  DC   ML   C  
Sbjct: 1   KRRRIPAVWQLLKDNVLAHRKRKVQDEDDIMICHCKPPWRGGDGCGPDCINRMLCIECVP 60

Query: 96  G-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCE 154
           G C     C N+ F  R    +++ +    G G+ A EDIK G+F+IEY+GEV+++   +
Sbjct: 61  GFCPSEEKCTNQMFSKRMYANLEIRRAGAKGFGLFALEDIKAGQFIIEYIGEVLEEDEYQ 120

Query: 155 ERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGI 214
            R      +G+ ++Y   I    VIDA  KGN SR+INHSC PN E QKW++ GE  IG+
Sbjct: 121 RRKEYYMSVGQRHYYFMNIGNGEVIDACRKGNISRFINHSCEPNCETQKWLVRGELAIGL 180

Query: 215 FATRDIKKGENLTYDYQYEFLHDS-LIAYC 243
           FA RDI K   LT+DY +E   D  +  YC
Sbjct: 181 FAVRDIPKDTELTFDYNFERYGDKPMRCYC 210


>gi|224082490|ref|XP_002306713.1| SET domain protein [Populus trichocarpa]
 gi|222856162|gb|EEE93709.1| SET domain protein [Populus trichocarpa]
          Length = 495

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 76/195 (38%), Positives = 116/195 (59%), Gaps = 6/195 (3%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTAS---PGSSGVCDRDCHCGMLLSSCSSG-CKC 99
           Y  I+RN ++ ++ K++ E+D   C C  +   P S+  C   C   +  + C+ G C C
Sbjct: 11  YEHIERNDFIYRKHKKQKEEDIAICECKFNGDDPDSA--CGERCLNLLTSTECTPGYCPC 68

Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
           G  C N+ FQ     K +L +TE  G G++ADE+IK G+F+IEY GEVI  +  ++R   
Sbjct: 69  GVYCKNQRFQKFEYAKTQLFKTEGRGWGLLADEEIKAGQFIIEYCGEVISWKEAKKRSQV 128

Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
            ++ G  + ++  +N    IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA ++
Sbjct: 129 YENQGLKDAFIISLNSTESIDATKKGSLARFINHSCQPNCETRKWTVLGEIRVGIFAKQN 188

Query: 220 IKKGENLTYDYQYEF 234
           I  G  L YDY +E+
Sbjct: 189 ISIGTELAYDYNFEW 203


>gi|326928449|ref|XP_003210391.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific-like, partial [Meleagris gallopavo]
          Length = 2336

 Score =  137 bits (346), Expect = 3e-30,   Method: Composition-based stats.
 Identities = 77/199 (38%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSS-G 96
            K  PY  IK N  + K ++    D      C   P     C  D  C   MLL  C    
Sbjct: 1569 KPPPYKHIKVNRPVGK-VQIFTADLSEIPRCNCKPTDENPCGLDSECINRMLLYECHPLV 1627

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   ++++ +T   G G+ A  DI++GEFV EYVGE+ID++ C  R
Sbjct: 1628 CPAGERCQNQCFSKRQYPEVQIFRTLARGWGLQAKTDIRKGEFVNEYVGELIDEEECRAR 1687

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +   +    TNFY+  +++D +IDA  KGN +R++NH C PN E QKW ++G+TR+G+FA
Sbjct: 1688 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWCVNGDTRVGLFA 1747

Query: 217  TRDIKKGENLTYDYQYEFL 235
              +IK G  LT++Y  E L
Sbjct: 1748 IVNIKAGTELTFNYNLECL 1766


>gi|255575537|ref|XP_002528669.1| set domain protein, putative [Ricinus communis]
 gi|223531892|gb|EEF33708.1| set domain protein, putative [Ricinus communis]
          Length = 495

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 79/205 (38%), Positives = 119/205 (58%), Gaps = 7/205 (3%)

Query: 34  DWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSC---TASPGSSGVCDRDCHCGMLL 90
           D +I   A+ Y  I++N +  ++ +++ E+D   C C    + P S+  C   C   +  
Sbjct: 2   DQWIDALAL-YEHIQQNDFSYRKHRKQKEEDIAICECRFDASDPESA--CGERCLNVLTS 58

Query: 91  SSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVID 149
           + C+ G C+CG  C N+ FQ     K +L +TE  G G++ADEDIK G+F+IEY GEVI 
Sbjct: 59  TECTPGYCRCGIFCKNQRFQKCEYFKTRLFKTEGRGWGLLADEDIKAGQFIIEYCGEVIS 118

Query: 150 DQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGE 209
            +  + R    +  G  + ++  +N    IDAT KG+ +R+INHSC PN E +KW + GE
Sbjct: 119 WKEAKRRSQAYERQGLKDAFIISLNSSESIDATRKGSLARFINHSCQPNCETRKWNVLGE 178

Query: 210 TRIGIFATRDIKKGENLTYDYQYEF 234
            R+GIFA +DI  G  L YDY +E+
Sbjct: 179 IRVGIFAKQDISIGTELAYDYNFEW 203


>gi|118572948|sp|Q6P2L6.2|NSD3_MOUSE RecName: Full=Histone-lysine N-methyltransferase NSD3; AltName:
            Full=Nuclear SET domain-containing protein 3; AltName:
            Full=Wolf-Hirschhorn syndrome candidate 1-like protein 1
            homolog; Short=WHSC1-like protein 1
          Length = 1439

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 80/222 (36%), Positives = 118/222 (53%), Gaps = 6/222 (2%)

Query: 17   FNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS 76
            F +L  Q  +    E+     KP   PY  IK N  + K ++ ++ D      C   PG 
Sbjct: 1051 FQELKAQRESKEALEMERTSRKPP--PYKHIKANKVIGK-VQVQVADLSEIPRCNCKPGD 1107

Query: 77   SGVCDRDCHCGMLLSS--CSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADED 133
               C  +  C   +S   C    C  G+ C N+ F  R     ++++TE+ G G+     
Sbjct: 1108 ENPCGLESQCLNRMSQYECHPQVCPAGDRCQNQCFTKRLYPDAEVIKTERRGWGLRTKRS 1167

Query: 134  IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
            IK+GEFV EYVGE+ID++ C  R+ +      TNFY+  + +D +IDA  KGN SR++NH
Sbjct: 1168 IKKGEFVNEYVGELIDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNH 1227

Query: 194  SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
            SC PN E QKW ++G+ R+G+FA  DI  G  LT++Y  + L
Sbjct: 1228 SCNPNCETQKWTVNGDVRVGLFALCDIPAGMELTFNYNLDCL 1269


>gi|124486903|ref|NP_001074738.1| histone-lysine N-methyltransferase NSD3 isoform 2 [Mus musculus]
 gi|189442807|gb|AAI67226.1| Wolf-Hirschhorn syndrome candidate 1-like 1 (human) [synthetic
            construct]
          Length = 1446

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 80/222 (36%), Positives = 118/222 (53%), Gaps = 6/222 (2%)

Query: 17   FNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS 76
            F +L  Q  +    E+     KP   PY  IK N  + K ++ ++ D      C   PG 
Sbjct: 1058 FQELKAQRESKEALEMERTSRKPP--PYKHIKANKVIGK-VQVQVADLSEIPRCNCKPGD 1114

Query: 77   SGVCDRDCHCGMLLSS--CSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADED 133
               C  +  C   +S   C    C  G+ C N+ F  R     ++++TE+ G G+     
Sbjct: 1115 ENPCGLESQCLNRMSQYECHPQVCPAGDRCQNQCFTKRLYPDAEVIKTERRGWGLRTKRS 1174

Query: 134  IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
            IK+GEFV EYVGE+ID++ C  R+ +      TNFY+  + +D +IDA  KGN SR++NH
Sbjct: 1175 IKKGEFVNEYVGELIDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNH 1234

Query: 194  SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
            SC PN E QKW ++G+ R+G+FA  DI  G  LT++Y  + L
Sbjct: 1235 SCNPNCETQKWTVNGDVRVGLFALCDIPAGMELTFNYNLDCL 1276


>gi|348574862|ref|XP_003473209.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
            H3 lysine-36 and H4 lysine-20 specific-like [Cavia
            porcellus]
          Length = 2509

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1708 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1767

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1768 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1827

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 1828 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1876


>gi|395543169|ref|XP_003773493.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
            N-methyltransferase NSD2 [Sarcophilus harrisii]
          Length = 1464

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 79/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 988  KPPPYKHIKVNKPCGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMYECHPQV 1046

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DIK+GEFV EYVGE+ID++ C  R
Sbjct: 1047 CPAGEQCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIKKGEFVNEYVGELIDEEECMAR 1106

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1107 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1166

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1167 VCDIPAGTELTFNYNLDCL 1185


>gi|355750457|gb|EHH54795.1| hypothetical protein EGM_15701 [Macaca fascicularis]
          Length = 2695

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1894 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1953

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1954 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2013

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 2014 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2062


>gi|380815580|gb|AFE79664.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific isoform b [Macaca mulatta]
 gi|383420749|gb|AFH33588.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific isoform b [Macaca mulatta]
          Length = 2695

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1894 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1953

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1954 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2013

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 2014 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2062


>gi|73953273|ref|XP_865778.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific isoform 5 [Canis lupus familiaris]
          Length = 2698

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1896 CNCKATDDNPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 1955

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1956 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2015

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 2016 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2064


>gi|19923586|ref|NP_071900.2| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific isoform b [Homo sapiens]
 gi|32469769|sp|Q96L73.1|NSD1_HUMAN RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific; AltName: Full=Androgen receptor
            coactivator 267 kDa protein; AltName: Full=Androgen
            receptor-associated protein of 267 kDa; AltName:
            Full=H3-K36-HMTase; AltName: Full=H4-K20-HMTase; AltName:
            Full=Lysine N-methyltransferase 3B; AltName: Full=Nuclear
            receptor-binding SET domain-containing protein 1;
            Short=NR-binding SET domain-containing protein
 gi|17530097|gb|AAL40694.1|AF395588_1 putative nuclear protein NSD1 [Homo sapiens]
 gi|16751269|gb|AAL06645.1| androgen receptor associated coregulator 267-b [Homo sapiens]
 gi|119605438|gb|EAW85032.1| nuclear receptor binding SET domain protein 1, isoform CRA_b [Homo
            sapiens]
          Length = 2696

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1895 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1954

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1955 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2014

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 2015 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2063


>gi|355691890|gb|EHH27075.1| hypothetical protein EGK_17188 [Macaca mulatta]
          Length = 2695

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1894 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1953

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1954 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2013

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 2014 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2062


>gi|410216828|gb|JAA05633.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
 gi|410260118|gb|JAA18025.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
          Length = 2428

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1627 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1686

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1687 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1746

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 1747 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1795


>gi|441595720|ref|XP_004087266.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
            H3 lysine-36 and H4 lysine-20 specific [Nomascus
            leucogenys]
          Length = 2697

 Score =  137 bits (345), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1896 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1955

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1956 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2015

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 2016 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2064


>gi|410949106|ref|XP_003981265.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Felis catus]
          Length = 2432

 Score =  137 bits (345), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1630 CNCKATDDNPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 1689

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1690 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1749

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 1750 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1798


>gi|402873563|ref|XP_003900641.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific, partial [Papio anubis]
          Length = 2343

 Score =  137 bits (345), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1539 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1598

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1599 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1658

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 1659 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1707


>gi|296193510|ref|XP_002806650.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
            H3 lysine-36 and H4 lysine-20 specific [Callithrix
            jacchus]
          Length = 2692

 Score =  137 bits (345), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1895 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1954

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1955 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2014

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 2015 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2063


>gi|410303854|gb|JAA30527.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
 gi|410341931|gb|JAA39912.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
          Length = 2428

 Score =  137 bits (345), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1627 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1686

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1687 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1746

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 1747 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1795


>gi|114603589|ref|XP_527132.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific isoform 8 [Pan troglodytes]
 gi|397470588|ref|XP_003806901.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Pan paniscus]
 gi|410303856|gb|JAA30528.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
 gi|410341933|gb|JAA39913.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
          Length = 2697

 Score =  137 bits (345), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1896 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1955

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1956 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2015

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 2016 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2064


>gi|51849607|dbj|BAD42330.1| hypothetical protein [Nannochloris bacillaris]
          Length = 334

 Score =  137 bits (345), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 71/198 (35%), Positives = 107/198 (54%), Gaps = 4/198 (2%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGV---CDRDCHCGMLLSSCSSG-CKC 99
           +  I +NIY+ +  K+  ED+ + C C    G+      C  +C   ML   C +  C C
Sbjct: 61  WQLIAKNIYMHRERKQLDEDEVMICQCKPIWGTDTTTIGCGENCLNRMLNIECVAKYCPC 120

Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
           G  C N+ F  R   K+++ +    G G+ A ED+K G+F++EYVGEV++++    R   
Sbjct: 121 GERCTNRGFSKRAYAKLEIRRAGAKGFGLFAAEDVKAGQFIVEYVGEVLEEEEYARRKEF 180

Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
               G+ ++Y   +    VIDA  +G   R+INHSC PN E QKW++ GE  IG+FA  D
Sbjct: 181 YIATGQRHYYFMNVGNGEVIDAARRGGLGRFINHSCEPNCETQKWVVRGELAIGLFALED 240

Query: 220 IKKGENLTYDYQYEFLHD 237
           +  G  LT+DY +E   D
Sbjct: 241 VPAGSVLTFDYNFERYGD 258


>gi|301785552|ref|XP_002928188.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific-like [Ailuropoda melanoleuca]
 gi|281342107|gb|EFB17691.1| hypothetical protein PANDA_018107 [Ailuropoda melanoleuca]
          Length = 2699

 Score =  137 bits (345), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1899 CNCKATDDNPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 1958

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1959 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2018

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 2019 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2067


>gi|410216830|gb|JAA05634.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
 gi|410260120|gb|JAA18026.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
          Length = 2697

 Score =  137 bits (345), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1896 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1955

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1956 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2015

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 2016 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2064


>gi|351708443|gb|EHB11362.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Heterocephalus glaber]
          Length = 2698

 Score =  137 bits (345), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1895 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1954

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1955 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2014

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 2015 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2063


>gi|148700883|gb|EDL32830.1| mCG14519 [Mus musculus]
          Length = 1381

 Score =  137 bits (345), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 80/222 (36%), Positives = 118/222 (53%), Gaps = 6/222 (2%)

Query: 17   FNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS 76
            F +L  Q  +    E+     KP   PY  IK N  + K ++ ++ D      C   PG 
Sbjct: 993  FQELKAQRESKEALEMERTSRKPP--PYKHIKANKVIGK-VQVQVADLSEIPRCNCKPGD 1049

Query: 77   SGVCDRDCHCGMLLSS--CSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADED 133
               C  +  C   +S   C    C  G+ C N+ F  R     ++++TE+ G G+     
Sbjct: 1050 ENPCGLESQCLNRMSQYECHPQVCPAGDRCQNQCFTKRLYPDAEVIKTERRGWGLRTKRS 1109

Query: 134  IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
            IK+GEFV EYVGE+ID++ C  R+ +      TNFY+  + +D +IDA  KGN SR++NH
Sbjct: 1110 IKKGEFVNEYVGELIDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNH 1169

Query: 194  SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
            SC PN E QKW ++G+ R+G+FA  DI  G  LT++Y  + L
Sbjct: 1170 SCNPNCETQKWTVNGDVRVGLFALCDIPAGMELTFNYNLDCL 1211


>gi|403287002|ref|XP_003934751.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Saimiri
            boliviensis boliviensis]
          Length = 1368

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 79/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 987  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1045

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGEVID++ C  R
Sbjct: 1046 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGEVIDEEECMAR 1105

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1106 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184


>gi|380815578|gb|AFE79663.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific isoform a [Macaca mulatta]
 gi|383420747|gb|AFH33587.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific isoform a [Macaca mulatta]
          Length = 2426

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1625 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1684

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1685 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1744

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 1745 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1793


>gi|403294386|ref|XP_003938171.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Saimiri
            boliviensis boliviensis]
          Length = 1438

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 76/205 (37%), Positives = 111/205 (54%), Gaps = 4/205 (1%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1070 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1128

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1129 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1188

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248

Query: 217  TRDIKKGENLTYDYQYEFLHDSLIA 241
              DI  G  LT++Y  + L +   A
Sbjct: 1249 LCDIPAGMELTFNYNLDCLGNGRTA 1273


>gi|296197020|ref|XP_002746091.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
            [Callithrix jacchus]
          Length = 1365

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 79/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 987  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1045

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGEVID++ C  R
Sbjct: 1046 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGEVIDEEECMAR 1105

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1106 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184


>gi|403290056|ref|XP_003936149.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Saimiri boliviensis boliviensis]
          Length = 2697

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1896 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1955

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1956 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2015

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 2016 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2064


>gi|390461098|ref|XP_003732596.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 2
            [Callithrix jacchus]
          Length = 1400

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 79/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 1022 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1080

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGEVID++ C  R
Sbjct: 1081 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGEVIDEEECMAR 1140

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1141 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1200

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1201 VCDIPAGTELTFNYNLDCL 1219


>gi|187956219|gb|AAI50629.1| Nuclear receptor binding SET domain protein 1 [Homo sapiens]
          Length = 2427

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1626 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1685

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1686 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1745

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 1746 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1794


>gi|27477095|ref|NP_758859.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific isoform a [Homo sapiens]
 gi|16755530|gb|AAL27991.1|AF380302_1 androgen receptor-associated coregulator 267-a [Homo sapiens]
 gi|119605437|gb|EAW85031.1| nuclear receptor binding SET domain protein 1, isoform CRA_a [Homo
            sapiens]
          Length = 2427

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1626 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1685

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1686 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1745

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 1746 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1794


>gi|119605439|gb|EAW85033.1| nuclear receptor binding SET domain protein 1, isoform CRA_c [Homo
            sapiens]
          Length = 2593

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1792 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1851

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1852 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1911

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 1912 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1960


>gi|126332220|ref|XP_001374612.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2
            [Monodelphis domestica]
          Length = 1366

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 79/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 988  KPPPYKHIKVNKPCGK-VQIYTADVSEIPKCNCKPTDENPCGFDSECLNRMLMYECHPQV 1046

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DIK+GEFV EYVGE+ID++ C  R
Sbjct: 1047 CPAGEQCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIKKGEFVNEYVGELIDEEECMAR 1106

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1107 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1166

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1167 VCDIPAGTELTFNYNLDCL 1185


>gi|356535766|ref|XP_003536414.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Glycine
           max]
 gi|34529091|dbj|BAC85636.1| unnamed protein product [Homo sapiens]
          Length = 480

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 77/194 (39%), Positives = 109/194 (56%), Gaps = 2/194 (1%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCT-ASPGSSGVCDRDCHCGMLLSSCSSG-CKCGN 101
           Y+ I +N +  +R K++ E+D   C C   +      C   C   +  + C+ G C C  
Sbjct: 11  YIHINQNEFCIRRHKKQKEEDIAICECKYDADDPDNACGDSCLNVLTSTECTPGYCHCDI 70

Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
            C N+ FQ     K KL +TE  G G++ADEDIK G+FVIEY GEVI  +  + R    +
Sbjct: 71  LCKNQKFQKCEYAKTKLFKTEGRGWGLLADEDIKAGQFVIEYCGEVISWKEAKRRSQAYE 130

Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
           + G  + ++  +N    IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA  DI 
Sbjct: 131 NQGLKDAFIIFLNVSESIDATRKGSLARFINHSCQPNCETRKWNVLGEIRVGIFAKHDIP 190

Query: 222 KGENLTYDYQYEFL 235
            G  L YDY +E+ 
Sbjct: 191 IGTELAYDYNFEWF 204


>gi|395861196|ref|XP_003802879.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific-like [Otolemur garnettii]
          Length = 2410

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1611 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1670

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1671 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1730

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 1731 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1779


>gi|340718068|ref|XP_003397494.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
           H3 lysine-36 and H4 lysine-20 specific-like [Bombus
           terrestris]
          Length = 1238

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 75/166 (45%), Positives = 98/166 (59%), Gaps = 1/166 (0%)

Query: 66  IFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKC 124
           + C C A   +      DC   +LL  CS G C  G  C N+ F  R    M+   T   
Sbjct: 794 VACDCDAEWENPCAPGTDCLNRILLVECSPGICPAGAKCKNQAFVQRQYPAMEPFHTVGR 853

Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
           G G+ + E IK G+FVIEYVGEVID+   + RL + K L   NFY   I+ + +IDA  K
Sbjct: 854 GWGLRSLEHIKAGQFVIEYVGEVIDEAEYKRRLHRKKELKNENFYFLTIDNNRMIDAEPK 913

Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
           GN SR++NHSC PN E QKW ++G+TRIG+FA  DI+ GE LT++Y
Sbjct: 914 GNLSRFMNHSCSPNCETQKWTVNGDTRIGLFALCDIECGEELTFNY 959


>gi|326438002|gb|EGD83572.1| hypothetical protein PTSG_04178 [Salpingoeca sp. ATCC 50818]
          Length = 1398

 Score =  137 bits (345), Expect = 4e-30,   Method: Composition-based stats.
 Identities = 74/238 (31%), Positives = 117/238 (49%), Gaps = 16/238 (6%)

Query: 21  LKQIGNPVEFELPDWFIKPK-----------AIPYVFIKRNIYLTKRIKR---RLEDDGI 66
           L  +  P  F LP  +++               P+  I+ N+++ K  ++   +   +  
Sbjct: 274 LNLLCQPEPFRLPASYVQASRQGTLAITEEACAPFRKIRVNVFVDKETEQFMAKHRQEVS 333

Query: 67  FCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRP-VKKMKLVQTEKCG 125
            C C   P     C  DC   ++ S C   C CG  C NK FQ R     +K   T   G
Sbjct: 334 RCECIPPPDGGPGCTHDCLNRLMYSECGPDCPCGRQCTNKRFQQRAWCTAIKRAPTPGKG 393

Query: 126 AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKG 185
            G+ A   I +G FVIEY GE++      +R   + +    +F+   ++R +VIDA   G
Sbjct: 394 YGVFATAYIPKGTFVIEYTGEIMTSSAFTQRANTL-YRARKHFHCLNLDRGLVIDAGQAG 452

Query: 186 NKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
           +++R+INHSC PN  ++KW ++G  R G+FA+RDIK  E L+YDY +   ++ L+  C
Sbjct: 453 SEARFINHSCDPNCHIEKWNVNGHWRAGVFASRDIKGDEELSYDYNFHNFNEKLVCRC 510


>gi|149726051|ref|XP_001502479.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Equus caballus]
          Length = 2700

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1898 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 1957

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1958 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2017

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 2018 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2066


>gi|417407050|gb|JAA50158.1| Putative histone-lysine n-methyltransferase h3 lysine-36 and h4
            lysine-20 specific [Desmodus rotundus]
          Length = 2699

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1898 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 1957

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1958 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2017

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 2018 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2066


>gi|410958014|ref|XP_003985618.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Felis
            catus]
          Length = 1300

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 81/202 (40%), Positives = 113/202 (55%), Gaps = 10/202 (4%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 922  KPPPYKHIKVNKPYGK-VQIHTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 980

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 981  CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1040

Query: 157  LWKMKHLGE---TNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIG 213
            +   KH  E   T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G
Sbjct: 1041 I---KHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVG 1097

Query: 214  IFATRDIKKGENLTYDYQYEFL 235
            +FA  DI  G  LT++Y  + L
Sbjct: 1098 LFAVCDIPAGTELTFNYNLDCL 1119


>gi|350580826|ref|XP_003123715.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific, partial [Sus scrofa]
          Length = 2392

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1590 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 1649

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1650 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1709

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 1710 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1758


>gi|16549858|dbj|BAB70868.1| unnamed protein product [Homo sapiens]
          Length = 1059

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 858  CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 917

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 918  GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 977

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 978  YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1026


>gi|291387890|ref|XP_002710469.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 2
            [Oryctolagus cuniculus]
          Length = 2431

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1629 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 1688

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1689 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1748

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 1749 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1797


>gi|432099958|gb|ELK28852.1| Histone-lysine N-methyltransferase NSD3 [Myotis davidii]
          Length = 1641

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1273 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1331

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1332 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1391

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1392 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1451

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1452 LCDIPAGMELTFNYNLDCL 1470


>gi|389585110|dbj|GAB67841.1| hypothetical protein PCYB_124070 [Plasmodium cynomolgi strain B]
          Length = 2974

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 79/216 (36%), Positives = 120/216 (55%), Gaps = 21/216 (9%)

Query: 42   IPYVFIKRNIYLTKRIKRRL--EDDGIFCSCTASPGSSGVCDR-DCHCGMLLSSCSSGCK 98
            + + +I +N+YL  + K  L  + D   C C       G CD   C+  +    CS   +
Sbjct: 2473 LNFEYISKNVYLNDQNKNLLACKSDDYRCLC------QGECDPFSCYNSLSKIQCSRN-R 2525

Query: 99   CG-------NSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQ 151
            C          C N+PF++  +K++++ QTE+ G G+    DIK GE + EYVGEV+  +
Sbjct: 2526 CNLPIQVQDKKCFNRPFKHSAIKELEIKQTERTGFGVFCKRDIKNGELICEYVGEVLGKK 2585

Query: 152  TCEERLWKMKHLGET----NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIID 207
              EER+   +   +     N+Y  +INRD+ ID+  KG+ SR++NHSC PN+  QKWI+ 
Sbjct: 2586 EFEERMEAYQEESKKTDMYNWYSIQINRDVHIDSRRKGSISRFVNHSCSPNSVSQKWIVR 2645

Query: 208  GETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
            G  RIGIFA +DI  GE +TY+Y Y F+ ++    C
Sbjct: 2646 GFYRIGIFAQQDIPAGEEITYNYSYNFVFNNFECLC 2681


>gi|426359420|ref|XP_004046973.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Gorilla gorilla
            gorilla]
          Length = 1397

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1029 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1087

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1088 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1147

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1148 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1207

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1208 LCDIPAGMELTFNYNLDCL 1226


>gi|92098122|gb|AAI15007.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Homo sapiens]
          Length = 1437

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1069 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1127

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1128 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1187

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1188 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1247

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1248 LCDIPAGMELTFNYNLDCL 1266


>gi|157821603|ref|NP_001099560.1| histone-lysine N-methyltransferase NSD3 [Rattus norvegicus]
 gi|149057818|gb|EDM09061.1| Wolf-Hirschhorn syndrome candidate 1-like 1 (predicted) [Rattus
            norvegicus]
          Length = 1396

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 80/222 (36%), Positives = 118/222 (53%), Gaps = 6/222 (2%)

Query: 17   FNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS 76
            F +L  Q  +    E+     KP   PY  IK N  + K ++ ++ D      C   PG 
Sbjct: 1008 FQELKAQRESKEALEIEKTSRKPP--PYKHIKANKVIGK-VQIQVADLSEIPRCNCKPGD 1064

Query: 77   SGVCDRDCHCGMLLSS--CSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADED 133
               C  +  C   +S   C    C  G+ C N+ F  R     ++++TE+ G G+     
Sbjct: 1065 ENPCGLESECLNRMSQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRS 1124

Query: 134  IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
            IK+GEFV EYVGE+ID++ C  R+ +      TNFY+  + +D +IDA  KGN SR++NH
Sbjct: 1125 IKKGEFVNEYVGELIDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNH 1184

Query: 194  SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
            SC PN E QKW ++G+ R+G+FA  DI  G  LT++Y  + L
Sbjct: 1185 SCNPNCETQKWTVNGDVRVGLFALCDIPAGMELTFNYNLDCL 1226


>gi|355708046|gb|AES03147.1| nuclear receptor binding SET domain protein 1 [Mustela putorius
           furo]
          Length = 261

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
           C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 71  CNCKATDDNPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 130

Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
           G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 131 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 190

Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
            +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 191 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 239


>gi|344265319|ref|XP_003404732.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Loxodonta africana]
          Length = 2702

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1899 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 1958

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1959 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2018

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 2019 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2067


>gi|12697312|emb|CAC28350.1| putative chromatin modulator [Homo sapiens]
          Length = 1437

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1069 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1127

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1128 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1187

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1188 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1247

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1248 LCDIPAGMELTFNYNLDCL 1266


>gi|380814048|gb|AFE78898.1| histone-lysine N-methyltransferase NSD3 isoform long [Macaca mulatta]
 gi|383419477|gb|AFH32952.1| histone-lysine N-methyltransferase NSD3 isoform long [Macaca mulatta]
          Length = 1438

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1070 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1128

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1129 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1188

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1249 LCDIPAGMELTFNYNLDCL 1267


>gi|395739588|ref|XP_002819060.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Pongo
            abelii]
          Length = 1438

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1070 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1128

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1129 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1188

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1249 LCDIPAGMELTFNYNLDCL 1267


>gi|12697314|emb|CAC28351.1| Putative Chromatin modulator [Homo sapiens]
          Length = 1388

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1020 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1078

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1079 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1138

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1139 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1198

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1199 LCDIPAGMELTFNYNLDCL 1217


>gi|431902251|gb|ELK08752.1| Histone-lysine N-methyltransferase NSD3 [Pteropus alecto]
          Length = 1322

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1029 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1087

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1088 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1147

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1148 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1207

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1208 LCDIPAGMELTFNYNLDCL 1226


>gi|359493197|ref|XP_003634539.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Vitis
           vinifera]
          Length = 413

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 79/195 (40%), Positives = 106/195 (54%), Gaps = 6/195 (3%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSC---TASPGSSGVCDRDCHCGMLLSSCSSG-CKC 99
           Y+ I +N +  +   ++ EDD   C C   T  P S+  C   C   +    C+   C C
Sbjct: 41  YIHINKNDFSYREHIKQEEDDITICECKYNTNDPDSA--CGERCLNVLTSIECTPHYCPC 98

Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
              C N+ FQ     K KL +TE  G G++A+EDIK G F+IEY GEVI      ER   
Sbjct: 99  SVHCKNQRFQKHEYAKTKLFRTEGRGWGLLANEDIKAGRFIIEYCGEVISWNEARERSLA 158

Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
               G  + Y+  +N    IDAT  G+++R+INHSC PN E +KW + GE RIGIFA RD
Sbjct: 159 YASQGINDAYIISLNARECIDATKSGSQARFINHSCEPNCETRKWSVLGEVRIGIFAMRD 218

Query: 220 IKKGENLTYDYQYEF 234
           I  G  LTYDY +++
Sbjct: 219 ISIGTELTYDYNFQW 233


>gi|13699811|ref|NP_075447.1| histone-lysine N-methyltransferase NSD3 isoform long [Homo sapiens]
 gi|74761342|sp|Q9BZ95.1|NSD3_HUMAN RecName: Full=Histone-lysine N-methyltransferase NSD3; AltName:
            Full=Nuclear SET domain-containing protein 3; AltName:
            Full=Protein whistle; AltName: Full=WHSC1-like 1 isoform
            9 with methyltransferase activity to lysine; AltName:
            Full=Wolf-Hirschhorn syndrome candidate 1-like protein 1;
            Short=WHSC1-like protein 1
 gi|12642817|gb|AAK00355.1| putative protein WHSC1L1l [Homo sapiens]
 gi|75517229|gb|AAI01718.1| WHSC1L1 protein, isoform long [Homo sapiens]
 gi|109731704|gb|AAI13470.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Homo sapiens]
 gi|119583726|gb|EAW63322.1| Wolf-Hirschhorn syndrome candidate 1-like 1, isoform CRA_c [Homo
            sapiens]
 gi|168277388|dbj|BAG10672.1| histone-lysine N-methyltransferase NSD3 [synthetic construct]
          Length = 1437

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1069 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1127

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1128 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1187

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1188 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1247

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1248 LCDIPAGMELTFNYNLDCL 1266


>gi|397521377|ref|XP_003830773.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 3 [Pan
            paniscus]
          Length = 1388

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1020 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1078

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1079 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1138

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1139 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1198

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1199 LCDIPAGMELTFNYNLDCL 1217


>gi|114619717|ref|XP_001170799.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 4 [Pan
            troglodytes]
          Length = 1388

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1020 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1078

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1079 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1138

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1139 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1198

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1199 LCDIPAGMELTFNYNLDCL 1217


>gi|348520760|ref|XP_003447895.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           [Oreochromis niloticus]
          Length = 1167

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 77/199 (38%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
           K  PY FIK N  + K ++    D      C   P     C  + +C   ML   C    
Sbjct: 779 KPPPYKFIKVNKPVGK-VQVYTADISEIPKCNCKPSGERPCGFESECLNRMLQYECHPQV 837

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G  C N+ F  R   + K+++T   G G++   DIK+GEFV EY+GE+ID++ C  R
Sbjct: 838 CPSGERCCNQDFTKRLYPETKIIKTPGKGWGLICLRDIKKGEFVNEYIGELIDEEECRAR 897

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +        T+FY+  I++D +IDA  KGN SR++NHSC PN E QKW ++G+TR+G+FA
Sbjct: 898 IKYAHENNITDFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFA 957

Query: 217 TRDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  + L
Sbjct: 958 VCDIPAGTELTFNYNLDCL 976


>gi|296222023|ref|XP_002757009.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 3
            [Callithrix jacchus]
          Length = 1389

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1021 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1079

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1080 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1139

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1140 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1199

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1200 LCDIPAGMELTFNYNLDCL 1218


>gi|410307858|gb|JAA32529.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
          Length = 1437

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1069 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1127

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1128 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1187

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1188 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1247

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1248 LCDIPAGMELTFNYNLDCL 1266


>gi|402878017|ref|XP_003902703.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Papio anubis]
          Length = 1438

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1070 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1128

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1129 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1188

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1249 LCDIPAGMELTFNYNLDCL 1267


>gi|397521373|ref|XP_003830771.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Pan
            paniscus]
          Length = 1437

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1069 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1127

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1128 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1187

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1188 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1247

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1248 LCDIPAGMELTFNYNLDCL 1266


>gi|296222021|ref|XP_002757008.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
            [Callithrix jacchus]
          Length = 1438

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1070 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1128

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1129 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1188

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1249 LCDIPAGMELTFNYNLDCL 1267


>gi|224050217|ref|XP_002195834.1| PREDICTED: histone-lysine N-methyltransferase NSD2 [Taeniopygia
            guttata]
          Length = 1339

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 79/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 991  KPPPYKHIKVNKPCGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMYECHPQV 1049

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DIK+GEFV EYVGE+ID++ C  R
Sbjct: 1050 CPAGERCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIKKGEFVNEYVGELIDEEECMAR 1109

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1110 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1169

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1170 VCDIPAGTELTFNYNLDCL 1188


>gi|114619713|ref|XP_519713.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 6 [Pan
            troglodytes]
 gi|410226716|gb|JAA10577.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
 gi|410266866|gb|JAA21399.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
 gi|410353855|gb|JAA43531.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
          Length = 1437

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1069 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1127

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1128 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1187

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1188 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1247

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1248 LCDIPAGMELTFNYNLDCL 1266


>gi|327284319|ref|XP_003226886.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Anolis
            carolinensis]
          Length = 1438

 Score =  137 bits (344), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 74/199 (37%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1070 KPPPYKHIKSNKVVGK-VQIQVADLSEIPRCNCKPSDENPCGLESECLNRMLQYECHPQV 1128

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + ++++T++ G G+    +IK+GEFV EYVGE+ID++ C  R
Sbjct: 1129 CPAGEGCQNQCFTKRLYPEAEIIKTDRRGWGLRTKRNIKKGEFVNEYVGELIDEEECRLR 1188

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCHPNCETQKWTVNGDVRVGLFA 1248

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1249 VCDIPAGMELTFNYNLDCL 1267


>gi|149634094|ref|XP_001506476.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
            [Ornithorhynchus anatinus]
          Length = 1437

 Score =  137 bits (344), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 81/222 (36%), Positives = 116/222 (52%), Gaps = 6/222 (2%)

Query: 17   FNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS 76
            F +L  Q  +    EL     KP   PY  IK N  + K ++ ++ D      C   P  
Sbjct: 1048 FQELKAQRESKEALELERNSRKPP--PYKHIKANKVIGK-VQIQVADLSEIPRCNCKPAD 1104

Query: 77   SGVCDRDCHC--GMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADED 133
               C  +  C   ML   C    C  G  C N+ F  R     ++++TE+ G G+     
Sbjct: 1105 ENPCGLESECLNRMLQYECHPQVCPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRS 1164

Query: 134  IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
            IK+GEFV EYVGE+ID++ C  R+ +      TNFY+  + +D +IDA  KGN SR++NH
Sbjct: 1165 IKKGEFVNEYVGELIDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNH 1224

Query: 194  SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
            SC PN E QKW ++G+ R+G+FA  DI  G  LT++Y  + L
Sbjct: 1225 SCNPNCETQKWTVNGDVRVGLFALCDIPAGMELTFNYNLDCL 1266


>gi|344281620|ref|XP_003412576.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
            [Loxodonta africana]
          Length = 1438

 Score =  137 bits (344), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1070 KPPPYKHIKANKVVGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1128

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1129 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1188

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1249 LCDIPAGMELTFNYNLDCL 1267


>gi|291409090|ref|XP_002720827.1| PREDICTED: WHSC1L1 protein [Oryctolagus cuniculus]
          Length = 1435

 Score =  136 bits (343), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1067 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESQCLNRMLQYECHPQV 1125

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1126 CPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1185

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1186 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1245

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1246 LCDIPAGMELTFNYNLDCL 1264


>gi|348527922|ref|XP_003451468.1| PREDICTED: hypothetical protein LOC100692734 [Oreochromis niloticus]
          Length = 2421

 Score =  136 bits (343), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 77/198 (38%), Positives = 112/198 (56%), Gaps = 2/198 (1%)

Query: 40   KAIPYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-C 97
            K  PY  IK N  + K +I      +   C+C  +  S    D +C   MLL  C    C
Sbjct: 1924 KPPPYKHIKVNRPIGKVQIFTADLSEIPRCNCKKTDDSPCGMDSECINRMLLYECHPQVC 1983

Query: 98   KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
                 CLN+ F  R   ++++ +T   G G+    DIK+G+FV EYVGEVID++ C  R+
Sbjct: 1984 PAAERCLNQAFTKRQYSQVEIFRTLSRGWGLRCVHDIKKGQFVSEYVGEVIDEEECRSRI 2043

Query: 158  WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
               +     NFY+  +++D +IDA  KGN++R++NHSC PN E QKW ++G+TR+G+FA 
Sbjct: 2044 RHAQDNNICNFYMLTLDKDRIIDAGPKGNEARFMNHSCQPNCETQKWTVNGDTRVGLFAL 2103

Query: 218  RDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  E L
Sbjct: 2104 IDIAAGTELTFNYNLECL 2121


>gi|410914796|ref|XP_003970873.1| PREDICTED: uncharacterized protein LOC101068764 [Takifugu rubripes]
          Length = 2363

 Score =  136 bits (343), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 77/198 (38%), Positives = 112/198 (56%), Gaps = 2/198 (1%)

Query: 40   KAIPYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-C 97
            K  PY  IK N  + K +I      +   C+C A+  S    + +C   MLL  C    C
Sbjct: 1872 KPPPYKHIKVNRPIGKVQIFTADLSEIPRCNCKATDESPCGMESECINRMLLYECHPQVC 1931

Query: 98   KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
              G  CLN+ F  R   ++++ +T   G G+    DIK+G+FV EYVGEVID++ C  R+
Sbjct: 1932 PAGERCLNQAFTKRQYSQVEIFRTLSRGWGLRCVHDIKKGQFVSEYVGEVIDEEECRSRI 1991

Query: 158  WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
               +     NFY+  +++D +IDA  KGN++R++NH C PN E QKW + G+TR+G+FA 
Sbjct: 1992 RHAQENDICNFYMLTLDKDRIIDAGPKGNEARFMNHCCQPNCETQKWTVSGDTRVGLFAL 2051

Query: 218  RDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  E L
Sbjct: 2052 VDIPAGTELTFNYNLECL 2069


>gi|347972366|ref|XP_316738.5| AGAP004656-PA [Anopheles gambiae str. PEST]
 gi|333469400|gb|EAA11974.5| AGAP004656-PA [Anopheles gambiae str. PEST]
          Length = 1259

 Score =  136 bits (343), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 114/206 (55%), Gaps = 13/206 (6%)

Query: 40   KAIP-YVFIKRNIYLTK----RIKRRLEDDGIFCS-CTASPGSSGVCDRDCHC---GMLL 90
            KA+P Y  IK N Y+       + R+L  DG+  S C   P     C     C    +++
Sbjct: 941  KAVPVYKKIKTNRYIPPLKPPSVNRQL--DGVEDSVCRCQPSDDDPCGPTSACLNRAIMM 998

Query: 91   SSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDD 150
               S  C    SC N+ F  R    +++      G G+VA ED+K G+FVIEYVGEVI+ 
Sbjct: 999  ECSSKTCPAKESCSNQRFTKRIYPALEVRFFSDKGFGLVALEDLKSGQFVIEYVGEVINS 1058

Query: 151  QTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGET 210
            +  + R+  M+   ETN+Y   +  D+ IDA  KGN SR+INHSC PN E QKW I GET
Sbjct: 1059 EEFDRRVMMMQAAKETNYYFLTVEPDLTIDAGPKGNVSRFINHSCEPNCETQKWTI-GET 1117

Query: 211  R-IGIFATRDIKKGENLTYDYQYEFL 235
            R IG+FA +DI  GE LT++Y  E L
Sbjct: 1118 RVIGLFAIKDINAGEELTFNYNLESL 1143


>gi|295424164|ref|NP_001074571.2| histone-lysine N-methyltransferase NSD2 isoform 1 [Mus musculus]
          Length = 1366

 Score =  136 bits (343), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D  C   ML+  C    
Sbjct: 988  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGSDSECLNRMLMFECHPQV 1046

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 1047 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1106

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1107 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1166

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1167 VCDIPAGTELTFNYNLDCL 1185


>gi|148705490|gb|EDL37437.1| mCG16344 [Mus musculus]
          Length = 1298

 Score =  136 bits (343), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D  C   ML+  C    
Sbjct: 920  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGSDSECLNRMLMFECHPQV 978

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 979  CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1038

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1039 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1098

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1099 VCDIPAGTELTFNYNLDCL 1117


>gi|344244292|gb|EGW00396.1| putative histone-lysine N-methyltransferase NSD2 [Cricetulus griseus]
          Length = 1344

 Score =  136 bits (343), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D  C   ML+  C    
Sbjct: 966  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGSDSECLNRMLMFECHPQV 1024

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 1025 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1084

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1085 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1144

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1145 VCDIPAGTELTFNYNLDCL 1163


>gi|351699801|gb|EHB02720.1| Histone-lysine N-methyltransferase NSD3 [Heterocephalus glaber]
          Length = 1439

 Score =  136 bits (343), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++  + D      C   P     C  +  C   ML   C    
Sbjct: 1071 KPPPYKHIKANKVIGK-VQIHVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1129

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1130 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1189

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1190 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1249

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1250 LCDIPAGMELTFNYNLDCL 1268


>gi|350593412|ref|XP_003483678.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            NSD3-like [Sus scrofa]
          Length = 1438

 Score =  136 bits (343), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1070 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1128

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1129 CPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1188

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1249 LCDIPAGMELTFNYNLDCL 1267


>gi|157822347|ref|NP_001100807.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Rattus norvegicus]
 gi|149039888|gb|EDL94004.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
            CRA_a [Rattus norvegicus]
          Length = 2381

 Score =  136 bits (343), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 103/169 (60%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R    +++ +T + G 
Sbjct: 1585 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPDVEIFRTLQRGW 1644

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1645 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1704

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 1705 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1753


>gi|162318272|gb|AAI56161.1| Wolf-Hirschhorn syndrome candidate 1 (human) [synthetic construct]
 gi|162318442|gb|AAI56968.1| Wolf-Hirschhorn syndrome candidate 1 (human) [synthetic construct]
          Length = 1346

 Score =  136 bits (343), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D  C   ML+  C    
Sbjct: 968  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGSDSECLNRMLMFECHPQV 1026

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 1027 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1086

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1087 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1146

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1147 VCDIPAGTELTFNYNLDCL 1165


>gi|295424166|ref|NP_780440.2| histone-lysine N-methyltransferase NSD2 isoform 2 [Mus musculus]
 gi|118572947|sp|Q8BVE8.2|NSD2_MOUSE RecName: Full=Histone-lysine N-methyltransferase NSD2; AltName:
            Full=Multiple myeloma SET domain-containing protein;
            Short=MMSET; AltName: Full=Nuclear SET domain-containing
            protein 2; Short=NSD2; AltName: Full=Wolf-Hirschhorn
            syndrome candidate 1 protein homolog; Short=WHSC1
          Length = 1365

 Score =  136 bits (343), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D  C   ML+  C    
Sbjct: 987  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGSDSECLNRMLMFECHPQV 1045

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 1046 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1105

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1106 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184


>gi|348554221|ref|XP_003462924.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Cavia
            porcellus]
          Length = 1438

 Score =  136 bits (343), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++  + D      C   P     C  +  C   ML   C    
Sbjct: 1070 KPPPYKHIKANKVIGK-VQIHVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1128

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1129 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1188

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1249 LCDIPAGMELTFNYNLDCL 1267


>gi|297676794|ref|XP_002816309.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific isoform 1 [Pongo abelii]
          Length = 2697

 Score =  136 bits (343), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1896 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1955

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1956 GLRTKIDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2015

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 2016 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2064


>gi|395847335|ref|XP_003796334.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
            [Otolemur garnettii]
          Length = 1438

 Score =  136 bits (343), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1070 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1128

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1129 CPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1188

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1249 LCDIPAGMELTFNYNLDCL 1267


>gi|315364634|pdb|3OOI|A Chain A, Crystal Structure Of Human Histone-Lysine
           N-Methyltransferase Nsd1 Set Domain In Complex With
           S-Adenosyl-L-Methionine
          Length = 232

 Score =  136 bits (343), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
           C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 45  CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 104

Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
           G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 105 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 164

Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
            +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 165 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 213


>gi|354483938|ref|XP_003504149.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
            [Cricetulus griseus]
          Length = 1365

 Score =  136 bits (343), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D  C   ML+  C    
Sbjct: 987  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGSDSECLNRMLMFECHPQV 1045

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 1046 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1105

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1106 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184


>gi|426256406|ref|XP_004021831.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2 [Ovis
            aries]
          Length = 1439

 Score =  136 bits (343), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1071 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1129

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1130 CPAGERCQNQCFTKRLYPDAEVIRTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1189

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1190 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1249

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1250 LCDIPAGMELTFNYNLDCL 1268


>gi|395507430|ref|XP_003758027.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
            [Sarcophilus harrisii]
          Length = 1389

 Score =  136 bits (343), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 81/222 (36%), Positives = 117/222 (52%), Gaps = 6/222 (2%)

Query: 17   FNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS 76
            F +L  Q  +    EL     KP   PY  IK N  + K ++ ++ D      C   P  
Sbjct: 1000 FQELKAQRESKEALELERNSRKPP--PYKHIKANKVIGK-VQIQVADLSEIPRCNCKPAD 1056

Query: 77   SGVCDRDCHC--GMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADED 133
               C  +  C   ML   C    C  G+ C N+ F  R     ++++TE+ G G+     
Sbjct: 1057 ENPCGLESECLNRMLQYECHPQVCPAGDRCQNQCFTKRLYPDAEVIKTERRGWGLRTKRS 1116

Query: 134  IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
            IK+GEFV EYVGE+ID++ C  R+ +      TNFY+  + +D +IDA  KGN SR++NH
Sbjct: 1117 IKKGEFVNEYVGELIDEEECRLRIKRAHENSITNFYMLTVTKDRIIDAGPKGNYSRFMNH 1176

Query: 194  SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
            SC PN E QKW ++G+ R+G+FA  DI  G  LT++Y  + L
Sbjct: 1177 SCNPNCETQKWTVNGDVRVGLFALCDIPAGVELTFNYNLDCL 1218


>gi|432952957|ref|XP_004085262.1| PREDICTED: histone-lysine N-methyltransferase NSD2-like, partial
            [Oryzias latipes]
          Length = 1167

 Score =  136 bits (342), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY FIK N     R++    D      C   P     C  + +C   ML   C    
Sbjct: 881  KPPPYKFIKVNRPYG-RVQVYTADVSEIPKCNCKPTDERPCGFESECLNRMLQYECHPQV 939

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T   G G+VA  DIK+G+FV EY+GE+ID++ C  R
Sbjct: 940  CPSGERCCNQDFTKRLYPETKIIKTPGKGWGLVALRDIKKGKFVNEYIGELIDEEECRAR 999

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        TNFY+  I++D +IDA  KGN SR++NHSC PN E QKW ++G+TR+G+FA
Sbjct: 1000 IKYAHENNITNFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFA 1059

Query: 217  TRDIKKGENLTYDYQYEFL 235
               I  G  LT++Y  + L
Sbjct: 1060 VCHIPAGTELTFNYNLDCL 1078


>gi|395736540|ref|XP_003776772.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific isoform 2 [Pongo abelii]
          Length = 2594

 Score =  136 bits (342), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1793 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1852

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1853 GLRTKIDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1912

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 1913 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1961


>gi|395507428|ref|XP_003758026.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
            [Sarcophilus harrisii]
          Length = 1437

 Score =  136 bits (342), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 81/222 (36%), Positives = 117/222 (52%), Gaps = 6/222 (2%)

Query: 17   FNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS 76
            F +L  Q  +    EL     KP   PY  IK N  + K ++ ++ D      C   P  
Sbjct: 1048 FQELKAQRESKEALELERNSRKPP--PYKHIKANKVIGK-VQIQVADLSEIPRCNCKPAD 1104

Query: 77   SGVCDRDCHC--GMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADED 133
               C  +  C   ML   C    C  G+ C N+ F  R     ++++TE+ G G+     
Sbjct: 1105 ENPCGLESECLNRMLQYECHPQVCPAGDRCQNQCFTKRLYPDAEVIKTERRGWGLRTKRS 1164

Query: 134  IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
            IK+GEFV EYVGE+ID++ C  R+ +      TNFY+  + +D +IDA  KGN SR++NH
Sbjct: 1165 IKKGEFVNEYVGELIDEEECRLRIKRAHENSITNFYMLTVTKDRIIDAGPKGNYSRFMNH 1224

Query: 194  SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
            SC PN E QKW ++G+ R+G+FA  DI  G  LT++Y  + L
Sbjct: 1225 SCNPNCETQKWTVNGDVRVGLFALCDIPAGVELTFNYNLDCL 1266


>gi|358419376|ref|XP_003584219.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Bos
            taurus]
          Length = 1439

 Score =  136 bits (342), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1071 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1129

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1130 CPAGERCQNQCFTKRLYPDAEVIRTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1189

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1190 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1249

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1250 LCDIPAGMELTFNYNLDCL 1268


>gi|440907576|gb|ELR57709.1| Histone-lysine N-methyltransferase NSD3 [Bos grunniens mutus]
          Length = 1446

 Score =  136 bits (342), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1080 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1138

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1139 CPAGERCQNQCFTKRLYPDAEVIRTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1198

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1199 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1258

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1259 LCDIPAGMELTFNYNLDCL 1277


>gi|449490008|ref|XP_004176439.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            ASH1L-like [Taeniopygia guttata]
          Length = 2968

 Score =  136 bits (342), Expect = 8e-30,   Method: Composition-based stats.
 Identities = 80/227 (35%), Positives = 123/227 (54%), Gaps = 18/227 (7%)

Query: 19   KLLKQIGNPVEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFC 68
            K L+Q    ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C
Sbjct: 2041 KYLRQ--KRIDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTC 2096

Query: 69   SCTASPGSSGV-CDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRP-VKKMKLVQTEKCG 125
            +C      SG  C  DC   M+ + CS + C CG  C N+  Q    V+ ++  + E+ G
Sbjct: 2097 NCKKPEDDSGKGCVEDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKG 2156

Query: 126  AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKG 185
             GI   E +K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   G
Sbjct: 2157 WGIRTKEPLKAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMG 2215

Query: 186  NKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            N++R+INHSC PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2216 NEARFINHSCNPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2262


>gi|357443507|ref|XP_003592031.1| hypothetical protein MTR_1g098000 [Medicago truncatula]
 gi|355481079|gb|AES62282.1| hypothetical protein MTR_1g098000 [Medicago truncatula]
          Length = 479

 Score =  136 bits (342), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 76/194 (39%), Positives = 107/194 (55%), Gaps = 2/194 (1%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS-SGVCDRDCHCGMLLSSCSSG-CKCGN 101
           Y+ I +N +  +R K++ E+D   C C          C   C   +  + C+ G C C  
Sbjct: 12  YIHINQNDFFMRRHKKQKEEDIAICECRYDEDDPDSACGDGCLNVLTSTECTPGFCPCDI 71

Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
            C N+ FQ     K KL +TE  G G++ADE+IK G+FVIEY GEVI  +  + R    +
Sbjct: 72  HCKNQKFQKCEYAKTKLFKTEGRGWGLLADEEIKAGQFVIEYCGEVISCKEAKRRSHTYE 131

Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
             G  + ++  +N    IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA  DI 
Sbjct: 132 IQGLKDAFIISLNASESIDATRKGSLARFINHSCQPNCETRKWNVMGEIRVGIFALEDIP 191

Query: 222 KGENLTYDYQYEFL 235
            G  L YDY +E+ 
Sbjct: 192 IGTELAYDYNFEWF 205


>gi|300796853|ref|NP_001178481.1| probable histone-lysine N-methyltransferase NSD2 [Rattus norvegicus]
          Length = 1346

 Score =  136 bits (342), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D  C   ML+  C    
Sbjct: 968  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGSDSECLNRMLMFECHPQV 1026

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 1027 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1086

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1087 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1146

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1147 VCDIPAGTELTFNYNLDCL 1165


>gi|149047443|gb|EDM00113.1| similar to Wolf-Hirschhorn syndrome candidate 1 protein isoform 3
            (predicted) [Rattus norvegicus]
          Length = 1298

 Score =  136 bits (342), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D  C   ML+  C    
Sbjct: 920  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGSDSECLNRMLMFECHPQV 978

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 979  CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1038

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1039 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1098

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1099 VCDIPAGTELTFNYNLDCL 1117


>gi|395847337|ref|XP_003796335.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
            [Otolemur garnettii]
          Length = 1389

 Score =  136 bits (342), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1021 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1079

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1080 CPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1139

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1140 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1199

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1200 LCDIPAGMELTFNYNLDCL 1218


>gi|355557406|gb|EHH14186.1| Putative histone-lysine N-methyltransferase NSD2 [Macaca mulatta]
          Length = 1365

 Score =  136 bits (342), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 987  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1045

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 1046 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1105

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1106 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184


>gi|224080887|ref|XP_002197925.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Taeniopygia
            guttata]
          Length = 1435

 Score =  136 bits (342), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1067 KPPPYKHIKSNKVIGK-VQIQVADLSEIPRCNCKPSDENPCGLESECLNRMLQYECHPQV 1125

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1126 CPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1185

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1186 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1245

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1246 LCDIPAGMELTFNYNLDCL 1264


>gi|449270866|gb|EMC81514.1| Histone-lysine N-methyltransferase NSD3 [Columba livia]
          Length = 1440

 Score =  136 bits (342), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1072 KPPPYKHIKSNKVIGK-VQIQVADLSEIPRCNCKPSDENPCGLESECLNRMLQYECHPQV 1130

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1131 CPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1190

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1191 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1250

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1251 LCDIPAGMELTFNYNLDCL 1269


>gi|383421363|gb|AFH33895.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
            mulatta]
 gi|384949270|gb|AFI38240.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
            mulatta]
 gi|387540940|gb|AFJ71097.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
            mulatta]
          Length = 1365

 Score =  136 bits (342), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 987  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1045

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 1046 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1105

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1106 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184


>gi|355744804|gb|EHH49429.1| Putative histone-lysine N-methyltransferase NSD2 [Macaca
            fascicularis]
          Length = 1365

 Score =  136 bits (342), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 987  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1045

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 1046 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1105

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1106 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184


>gi|170050731|ref|XP_001861443.1| set domain protein [Culex quinquefasciatus]
 gi|167872245|gb|EDS35628.1| set domain protein [Culex quinquefasciatus]
          Length = 1181

 Score =  136 bits (342), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 76/210 (36%), Positives = 112/210 (53%), Gaps = 10/210 (4%)

Query: 34   DWFIKPKAIPYVFIKRNIY---LTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC---G 87
            D   KP    YV IK N Y   L  R   R E++   C C   P  +  C  D +C    
Sbjct: 869  DLSFKPPM--YVKIKSNKYVAPLRGRNAARDEEEDSICEC--KPSDTDPCGLDSNCINRA 924

Query: 88   MLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEV 147
            +L+      C  G+SC N+ F+ +    +   +    G G+VA EDI +G+FVIEYVGEV
Sbjct: 925  LLVECNPKTCPAGDSCQNQCFERKQYPALAAKRIPNKGWGLVAQEDIHQGQFVIEYVGEV 984

Query: 148  IDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIID 207
            I+ +    R+ + +   + N+Y   ++ ++ IDA  KGN +R+INHSC PN E   W + 
Sbjct: 985  INGEELARRIKQKQEQKDENYYFLTVDSELTIDAGPKGNLARFINHSCEPNCETLLWKVG 1044

Query: 208  GETRIGIFATRDIKKGENLTYDYQYEFLHD 237
            G   +G+FA +D+K GE LT++Y +E   D
Sbjct: 1045 GSQSVGLFALKDLKAGEELTFNYNFETFGD 1074


>gi|297672976|ref|XP_002814554.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
            N-methyltransferase NSD2 [Pongo abelii]
          Length = 1365

 Score =  136 bits (342), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 987  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1045

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 1046 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1105

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1106 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184


>gi|426343599|ref|XP_004038381.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
            [Gorilla gorilla gorilla]
          Length = 1365

 Score =  136 bits (342), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 987  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1045

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 1046 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1105

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1106 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184


>gi|19913348|ref|NP_579877.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
 gi|19913350|ref|NP_579878.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
 gi|19913358|ref|NP_579890.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
 gi|109633019|ref|NP_001035889.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
 gi|74706096|sp|O96028.1|NSD2_HUMAN RecName: Full=Histone-lysine N-methyltransferase NSD2; AltName:
            Full=Multiple myeloma SET domain-containing protein;
            Short=MMSET; AltName: Full=Nuclear SET domain-containing
            protein 2; Short=NSD2; AltName: Full=Protein trithorax-5;
            AltName: Full=Wolf-Hirschhorn syndrome candidate 1
            protein; Short=WHSC1
 gi|3249713|gb|AAC24150.1| MMSET type II [Homo sapiens]
 gi|4378019|gb|AAD19343.1| putative WHSC1 protein [Homo sapiens]
 gi|4521954|gb|AAD21770.1| putative WHSC1 protein [Homo sapiens]
 gi|4521955|gb|AAD21771.1| putative WHSC1 protein [Homo sapiens]
 gi|5123789|emb|CAB45386.1| TRX5 protein [Homo sapiens]
 gi|6683809|gb|AAF23370.1| MMSET type II [Homo sapiens]
 gi|119602958|gb|EAW82552.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
 gi|119602959|gb|EAW82553.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
 gi|119602962|gb|EAW82556.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
 gi|168273154|dbj|BAG10416.1| histone-lysine N-methyltransferase NSD2 [synthetic construct]
 gi|187252511|gb|AAI66668.1| Wolf-Hirschhorn syndrome candidate 1 [synthetic construct]
          Length = 1365

 Score =  136 bits (342), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 987  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1045

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 1046 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1105

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1106 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184


>gi|326432726|gb|EGD78296.1| hypothetical protein PTSG_09362 [Salpingoeca sp. ATCC 50818]
          Length = 1279

 Score =  136 bits (342), Expect = 1e-29,   Method: Composition-based stats.
 Identities = 78/199 (39%), Positives = 106/199 (53%), Gaps = 10/199 (5%)

Query: 38   KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SG 96
            KP A   V  + N+Y   R +   ++      C  SP +   CD  C   ++   C    
Sbjct: 995  KPPAFQRV--RTNVYTIPRKRTPGQE------CCCSPATD-TCDDSCLNRIVHCECDPKT 1045

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C   + C N+ FQ R   K+    T+  G G+ A EDI  G+FVIEYVGE+ID   C  R
Sbjct: 1046 CPVKDKCQNRRFQRRQYPKLIPFLTQSKGWGLKAGEDIAEGQFVIEYVGEIIDATECRRR 1105

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            L   +   + +FY+  ++    +DA  K N +R+INHSC PN E QKW + GETR+GIFA
Sbjct: 1106 LAASQAANDHSFYILSLSGSSFVDARNKANLARFINHSCGPNCETQKWNVLGETRVGIFA 1165

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI KG  LT+DYQ + L
Sbjct: 1166 KEDIPKGTELTFDYQLDSL 1184


>gi|114592860|ref|XP_001146084.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 6
            [Pan troglodytes]
 gi|114592864|ref|XP_001146248.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 7
            [Pan troglodytes]
 gi|114592866|ref|XP_001146323.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 8
            [Pan troglodytes]
 gi|114592870|ref|XP_001146473.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
            10 [Pan troglodytes]
 gi|397483594|ref|XP_003812984.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Pan
            paniscus]
 gi|410227780|gb|JAA11109.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
 gi|410259494|gb|JAA17713.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
 gi|410299310|gb|JAA28255.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
 gi|410334709|gb|JAA36301.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
          Length = 1365

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 987  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1045

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 1046 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1105

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1106 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184


>gi|124513208|ref|XP_001349960.1| SET domain protein, putative [Plasmodium falciparum 3D7]
 gi|23615377|emb|CAD52368.1| SET domain protein, putative [Plasmodium falciparum 3D7]
          Length = 2548

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 84/215 (39%), Positives = 119/215 (55%), Gaps = 19/215 (8%)

Query: 42   IPYVFIKRNIYLTKRIKRRL--EDDGIFCSCTASPGSSGVCD-RDCHCGMLLSSCS-SGC 97
            + + +I +NIYL  + K  L  + D   C C       G C+   C+  +    CS S C
Sbjct: 2044 LKFEYISKNIYLNDKNKNLLACKSDDYKCLC------QGECNLYTCYNSLSNIQCSKSRC 2097

Query: 98   KC-----GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
                      C N+PF+   VK +++ +TEK G G+    DIK GE + EYVGEV+  + 
Sbjct: 2098 NLPEKIQDRKCFNRPFRKSFVKDLEIKKTEKTGYGVFCKRDIKNGELICEYVGEVLGKRE 2157

Query: 153  CEERLW----KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDG 208
             E+RL     + K     N+Y+ +IN+D+ ID+  KG+ SR+INHSC PN+  QKWI+ G
Sbjct: 2158 FEKRLEVYQEESKKTDMYNWYIIQINKDVYIDSGKKGSISRFINHSCSPNSVSQKWIVRG 2217

Query: 209  ETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
              RIGIFA RDI  GE +TY+Y Y FL ++    C
Sbjct: 2218 FYRIGIFALRDIPSGEEITYNYSYNFLFNNFECLC 2252


>gi|357443463|ref|XP_003592009.1| Histone-lysine N-methyltransferase NSD3 [Medicago truncatula]
 gi|355481057|gb|AES62260.1| Histone-lysine N-methyltransferase NSD3 [Medicago truncatula]
          Length = 616

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 78/196 (39%), Positives = 110/196 (56%), Gaps = 6/196 (3%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSC---TASPGSSGVCDRDCHCGMLLSSCSSG-CKC 99
           Y+ I +N +  +R K++ E+D   C C      P S+  C   C   +  + C+ G C C
Sbjct: 149 YIHINQNDFFMRRHKKQKEEDIAICECRYDEDDPDSA--CGDGCLNVLTSTECTPGFCPC 206

Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
              C N+ FQ     K KL +TE  G G++ADE+IK G+FVIEY GEVI  +  + R   
Sbjct: 207 DIHCKNQKFQKCEYAKTKLFKTEGRGWGLLADEEIKAGQFVIEYCGEVISCKEAKRRSHT 266

Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
            +  G  + ++  +N    IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA  D
Sbjct: 267 YEIQGLKDAFIISLNASESIDATRKGSLARFINHSCQPNCETRKWNVMGEIRVGIFALED 326

Query: 220 IKKGENLTYDYQYEFL 235
           I  G  L YDY +E+ 
Sbjct: 327 IPIGTELAYDYNFEWF 342


>gi|296081208|emb|CBI18234.3| unnamed protein product [Vitis vinifera]
          Length = 862

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 76/193 (39%), Positives = 106/193 (54%), Gaps = 2/193 (1%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS-SGVCDRDCHCGMLLSSCSSG-CKCGN 101
           Y+ I RN +  ++  ++ E+D   C C          C   C   +  + C+ G C+CG 
Sbjct: 426 YIHINRNDFSYRKHIKQQEEDIAICECKYDANDPDSACGEACLNVLTSTECTPGYCRCGL 485

Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
            C N+ FQ     K KL +TE  G G++ADE+IK G FVIEY GEVI  +    R     
Sbjct: 486 FCKNQRFQKCEYAKTKLFRTEGRGWGLLADENIKAGRFVIEYCGEVISWKEARGRSQVYA 545

Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
            LG  + ++  +N    IDAT KG+  R+INHSC PN E +KW + GE R+GIFA +DI 
Sbjct: 546 SLGLKDAFIISLNGSECIDATKKGSLGRFINHSCQPNCETRKWTVLGEVRVGIFAKQDIS 605

Query: 222 KGENLTYDYQYEF 234
            G  L Y+Y +E+
Sbjct: 606 IGTELAYNYNFEW 618



 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 78/188 (41%), Positives = 103/188 (54%), Gaps = 7/188 (3%)

Query: 51  IYLTKRIKRRLEDDGIFCSC---TASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNK 106
           +++   IK+  EDD   C C   T  P S+  C   C   +    C+   C C   C N+
Sbjct: 11  MFMHLHIKQE-EDDITICECKYNTNDPDSA--CGERCLNVLTSIECTPHYCPCSVHCKNQ 67

Query: 107 PFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGET 166
            FQ     K KL +TE  G G++A+EDIK G F+IEY GEVI      ER       G  
Sbjct: 68  RFQKHEYAKTKLFRTEGRGWGLLANEDIKAGRFIIEYCGEVISWNEARERSLAYASQGIN 127

Query: 167 NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENL 226
           + Y+  +N    IDAT  G+++R+INHSC PN E +KW + GE RIGIFA RDI  G  L
Sbjct: 128 DAYIISLNARECIDATKSGSQARFINHSCEPNCETRKWSVLGEVRIGIFAMRDISIGTEL 187

Query: 227 TYDYQYEF 234
           TYDY +++
Sbjct: 188 TYDYNFQW 195


>gi|148709230|gb|EDL41176.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_b [Mus
            musculus]
          Length = 2382

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 103/169 (60%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R    +++ +T + G 
Sbjct: 1587 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGVRCQNQCFSKRQYPDVEIFRTLQRGW 1646

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1647 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1706

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 1707 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1755


>gi|260941922|ref|XP_002615127.1| hypothetical protein CLUG_05142 [Clavispora lusitaniae ATCC 42720]
 gi|238851550|gb|EEQ41014.1| hypothetical protein CLUG_05142 [Clavispora lusitaniae ATCC 42720]
          Length = 717

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/194 (37%), Positives = 108/194 (55%), Gaps = 7/194 (3%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTAS----PGSSGVCDRDCHCGMLLSS--CSSG- 96
           +V +    YLTK++    + + + C C          +  C  D +C    +S  C +G 
Sbjct: 24  FVPLAECTYLTKQLGGSGQKETMTCDCVEEWNEGKAQNLACGEDSNCINRATSVECVNGS 83

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C CG +C N+ FQ +    +K++QTEK G G+VA  D+  G FV EY+GEVID++T   R
Sbjct: 84  CTCGQNCQNQRFQKKQYSSVKVIQTEKKGYGLVAQADVPEGSFVYEYIGEVIDEKTFRAR 143

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +         +FY   + +D  IDAT KG+ +R+ NHSC PN  + KW++  + R+GIFA
Sbjct: 144 MLDYDRRNLRHFYFMMLTKDAFIDATEKGSLARFCNHSCSPNAYVDKWVVGDKLRMGIFA 203

Query: 217 TRDIKKGENLTYDY 230
            R I  GE +T+DY
Sbjct: 204 RRRILAGEEITFDY 217


>gi|147816102|emb|CAN75267.1| hypothetical protein VITISV_028162 [Vitis vinifera]
          Length = 460

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 80/188 (42%), Positives = 104/188 (55%), Gaps = 7/188 (3%)

Query: 51  IYLTKRIKRRLEDDGIFCSC---TASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNK 106
           I+L + IK+  EDD   C C   T  P S+  C   C   +    C+   C C   C N+
Sbjct: 96  IFLIRHIKQE-EDDITICECKYNTNDPDSA--CGERCLNVLTSIECTPHYCPCSVHCKNQ 152

Query: 107 PFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGET 166
            FQ     K KL +TE  G G++A+EDIK G F+IEY GEVI      ER       G  
Sbjct: 153 RFQKHEYAKTKLFRTEGRGWGLLANEDIKAGRFIIEYCGEVISWNEARERSLAYASQGIN 212

Query: 167 NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENL 226
           + Y+  +N    IDAT  G+++R+INHSC PN E +KW + GE RIGIFA RDI  G  L
Sbjct: 213 DAYIISLNARECIDATKSGSQARFINHSCEPNCETRKWSVLGEVRIGIFAMRDISIGTEL 272

Query: 227 TYDYQYEF 234
           TYDY +++
Sbjct: 273 TYDYNFQW 280


>gi|301762334|ref|XP_002916587.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
            N-methyltransferase NSD2-like [Ailuropoda melanoleuca]
          Length = 1364

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 986  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1044

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 1045 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1104

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1105 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1164

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1165 VCDIPAGTELTFNYNLDCL 1183


>gi|328781326|ref|XP_003249962.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           [Apis mellifera]
          Length = 1218

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 74/166 (44%), Positives = 96/166 (57%), Gaps = 1/166 (0%)

Query: 66  IFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKC 124
           + C C     +      DC   +LL  CS G C  G  C N+ F  R    M+   T   
Sbjct: 778 VACDCDPEWENPCAPGTDCLNRILLVECSPGICPAGPKCNNQAFVRRQYPAMEPFHTIGR 837

Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
           G G+ + E IK G+FVIEYVGEVID+   + RL + K L   NFY   I+ +  IDA  K
Sbjct: 838 GWGLRSLEHIKAGQFVIEYVGEVIDEAEYKRRLHRKKELKNENFYFLTIDNNRTIDAEPK 897

Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
           GN SR++NHSC PN E QKW ++G+TRIG+FA  DI+ GE LT++Y
Sbjct: 898 GNLSRFMNHSCSPNCETQKWTVNGDTRIGLFALCDIEPGEELTFNY 943


>gi|359493199|ref|XP_003634540.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Vitis
           vinifera]
          Length = 515

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 76/193 (39%), Positives = 106/193 (54%), Gaps = 2/193 (1%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS-SGVCDRDCHCGMLLSSCSSG-CKCGN 101
           Y+ I RN +  ++  ++ E+D   C C          C   C   +  + C+ G C+CG 
Sbjct: 25  YIHINRNDFSYRKHIKQQEEDIAICECKYDANDPDSACGEACLNVLTSTECTPGYCRCGL 84

Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
            C N+ FQ     K KL +TE  G G++ADE+IK G FVIEY GEVI  +    R     
Sbjct: 85  FCKNQRFQKCEYAKTKLFRTEGRGWGLLADENIKAGRFVIEYCGEVISWKEARGRSQVYA 144

Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
            LG  + ++  +N    IDAT KG+  R+INHSC PN E +KW + GE R+GIFA +DI 
Sbjct: 145 SLGLKDAFIISLNGSECIDATKKGSLGRFINHSCQPNCETRKWTVLGEVRVGIFAKQDIS 204

Query: 222 KGENLTYDYQYEF 234
            G  L Y+Y +E+
Sbjct: 205 IGTELAYNYNFEW 217


>gi|375294191|ref|NP_001069063.2| histone-lysine N-methyltransferase NSD3 [Bos taurus]
 gi|359080579|ref|XP_003588019.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Bos
            taurus]
          Length = 1439

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 76/208 (36%), Positives = 111/208 (53%), Gaps = 6/208 (2%)

Query: 31   ELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GM 88
            E+P     P   PY  +K N  + K ++ ++ D      C   P     C  +  C   M
Sbjct: 1064 EVPQSLRDPP--PYQHLKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRM 1120

Query: 89   LLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEV 147
            L   C    C  G  C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+
Sbjct: 1121 LQYECHPQVCPAGERCQNQCFTKRLYPDAEVIRTERRGWGLRTKRSIKKGEFVNEYVGEL 1180

Query: 148  IDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIID 207
            ID++ C  R+ +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++
Sbjct: 1181 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1240

Query: 208  GETRIGIFATRDIKKGENLTYDYQYEFL 235
            G+ R+G+FA  DI  G  LT++Y  + L
Sbjct: 1241 GDVRVGLFALCDIPAGMELTFNYNLDCL 1268


>gi|118918400|ref|NP_032765.3| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Mus musculus]
          Length = 2691

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 103/169 (60%), Gaps = 1/169 (0%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R    +++ +T + G 
Sbjct: 1896 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGVRCQNQCFSKRQYPDVEIFRTLQRGW 1955

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+   +    TNFY+  +++D +IDA  KGN
Sbjct: 1956 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2015

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 2016 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2064


>gi|281339990|gb|EFB15574.1| hypothetical protein PANDA_004672 [Ailuropoda melanoleuca]
          Length = 1363

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 985  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1043

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 1044 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1103

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1104 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1163

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1164 VCDIPAGTELTFNYNLDCL 1182


>gi|348571625|ref|XP_003471596.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
            [Cavia porcellus]
          Length = 1366

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 987  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1045

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 1046 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1105

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1106 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184


>gi|348571627|ref|XP_003471597.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 2
            [Cavia porcellus]
          Length = 1367

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 988  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1046

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 1047 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1106

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1107 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1166

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1167 VCDIPAGTELTFNYNLDCL 1185


>gi|345798392|ref|XP_536224.3| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
            N-methyltransferase NSD2 [Canis lupus familiaris]
          Length = 1364

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 986  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1044

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 1045 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1104

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1105 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1164

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1165 VCDIPAGTELTFNYNLDCL 1183


>gi|449446403|ref|XP_004140961.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Cucumis
           sativus]
          Length = 497

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 76/193 (39%), Positives = 110/193 (56%), Gaps = 2/193 (1%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS-GVCDRDCHCGMLLSSCSSG-CKCGN 101
           Y  IK+N +  ++ K++ E+D   C C      +   C   C   +  + C+ G C  G 
Sbjct: 11  YKDIKQNEFCYRKHKKQKEEDIAVCECKFDDNDNDSACGEGCLNVLTSTECTPGHCPSGV 70

Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
            C N+ FQ     K KL +TE  G G++ADE+IK G+F+IEY GEVI  +  + R    +
Sbjct: 71  HCRNQRFQKCEYAKTKLFKTEGRGWGLLADENIKNGQFIIEYCGEVISWKEAKRRSHTYE 130

Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
           + G  + Y+  +N    IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA +DI 
Sbjct: 131 NQGLKDAYIISLNASESIDATRKGSLARFINHSCFPNCETRKWNVLGEIRVGIFAKQDIS 190

Query: 222 KGENLTYDYQYEF 234
            G  L YDY +E+
Sbjct: 191 IGTELAYDYNFEW 203


>gi|118101386|ref|XP_001232891.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Gallus
            gallus]
          Length = 1436

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 74/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1068 KPPPYKHIKSNKVIGK-VQIQVADLSEIPRCNCKPSDENPCGLESECLNRMLQYECHPQV 1126

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R     ++++T++ G G+    +IK+GEFV EYVGE+ID++ C  R
Sbjct: 1127 CPAGERCQNQCFTKRLYPDAEIIKTDRRGWGLRTKRNIKKGEFVNEYVGELIDEEECRLR 1186

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1187 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1246

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1247 LCDIPAGMELTFNYNLDCL 1265


>gi|326932813|ref|XP_003212507.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like isoform 1
            [Meleagris gallopavo]
          Length = 1436

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 74/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1068 KPPPYKHIKSNKVIGK-VQIQVADLSEIPRCNCKPSDENPCGLESECLNRMLQYECHPQV 1126

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R     ++++T++ G G+    +IK+GEFV EYVGE+ID++ C  R
Sbjct: 1127 CPAGERCQNQCFTKRLYPDAEIIKTDRRGWGLRTKRNIKKGEFVNEYVGELIDEEECRLR 1186

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1187 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1246

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1247 LCDIPAGMELTFNYNLDCL 1265


>gi|37360238|dbj|BAC98097.1| mKIAA1090 protein [Mus musculus]
          Length = 857

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
           K  PY  IK N    K ++    D      C   P     C  D  C   ML+  C    
Sbjct: 479 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGSDSECLNRMLMFECHPQV 537

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 538 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 597

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 598 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 657

Query: 217 TRDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  + L
Sbjct: 658 VCDIPAGTELTFNYNLDCL 676


>gi|28204960|gb|AAH46473.1| Whsc1 protein, partial [Mus musculus]
          Length = 851

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
           K  PY  IK N    K ++    D      C   P     C  D  C   ML+  C    
Sbjct: 473 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGSDSECLNRMLMFECHPQV 531

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 532 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 591

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 592 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 651

Query: 217 TRDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  + L
Sbjct: 652 VCDIPAGTELTFNYNLDCL 670


>gi|156099290|ref|XP_001615647.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148804521|gb|EDL45920.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 3021

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 78/216 (36%), Positives = 119/216 (55%), Gaps = 21/216 (9%)

Query: 42   IPYVFIKRNIYLTKRIKRRL--EDDGIFCSCTASPGSSGVCDR-DCHCGMLLSSCSSGCK 98
            + + +I +N+Y+  + K  L  + D   C C       G CD   C+  +    CS   +
Sbjct: 2520 LNFEYISKNVYVNDQNKNLLSCKSDDYRCLC------QGECDPFSCYNSLSKIQCSKN-R 2572

Query: 99   CG-------NSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQ 151
            C          C N+PF++  +K +++ QTE+ G G+    DIK GE + EYVGEV+  +
Sbjct: 2573 CNLPIQIQDKKCFNRPFKHSAIKDLEIKQTERTGYGVFCKRDIKNGELICEYVGEVLGKK 2632

Query: 152  TCEERLWKMKHLGET----NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIID 207
              EER+   +   +     N+Y  +INRD+ ID+  KG+ SR++NHSC PN+  QKWI+ 
Sbjct: 2633 EFEERMEAYQEESKKTDMYNWYSIQINRDVHIDSRRKGSISRFVNHSCSPNSVSQKWIVR 2692

Query: 208  GETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
            G  RIGIFA +DI  GE +TY+Y Y F+ ++    C
Sbjct: 2693 GFYRIGIFAQQDIPAGEEITYNYSYNFVFNNFECLC 2728


>gi|149756942|ref|XP_001488967.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
            [Equus caballus]
          Length = 1365

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 987  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1045

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 1046 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1105

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1106 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184


>gi|326919530|ref|XP_003206033.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
            [Meleagris gallopavo]
          Length = 1348

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 971  KPPPYKHIKVNKPCGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMYECHPQV 1029

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + ++++T+  G G+VA  DIK+GEFV EYVGE+ID++ C  R
Sbjct: 1030 CPAGERCQNQCFTKREYPETEIIKTDGKGWGLVAKRDIKKGEFVNEYVGELIDEEECMAR 1089

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1090 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1149

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1150 VCDIPAGTELTFNYNLDCL 1168


>gi|351704076|gb|EHB06995.1| Putative histone-lysine N-methyltransferase NSD2 [Heterocephalus
            glaber]
          Length = 1372

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 77/196 (39%), Positives = 109/196 (55%), Gaps = 4/196 (2%)

Query: 43   PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG-CKC 99
            PY  IK N    K ++    D      C   P     C  D +C   ML+  C    C  
Sbjct: 996  PYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQVCPA 1054

Query: 100  GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
            G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R+  
Sbjct: 1055 GEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMARIKY 1114

Query: 160  MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
                  T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA  D
Sbjct: 1115 AHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCD 1174

Query: 220  IKKGENLTYDYQYEFL 235
            I  G  LT++Y  + L
Sbjct: 1175 IPAGTELTFNYNLDCL 1190


>gi|326932815|ref|XP_003212508.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like isoform 2
            [Meleagris gallopavo]
          Length = 1386

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 74/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1018 KPPPYKHIKSNKVIGK-VQIQVADLSEIPRCNCKPSDENPCGLESECLNRMLQYECHPQV 1076

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R     ++++T++ G G+    +IK+GEFV EYVGE+ID++ C  R
Sbjct: 1077 CPAGERCQNQCFTKRLYPDAEIIKTDRRGWGLRTKRNIKKGEFVNEYVGELIDEEECRLR 1136

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1137 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1196

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1197 LCDIPAGMELTFNYNLDCL 1215


>gi|118101388|ref|XP_424390.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2 [Gallus
            gallus]
          Length = 1386

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 74/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1018 KPPPYKHIKSNKVIGK-VQIQVADLSEIPRCNCKPSDENPCGLESECLNRMLQYECHPQV 1076

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R     ++++T++ G G+    +IK+GEFV EYVGE+ID++ C  R
Sbjct: 1077 CPAGERCQNQCFTKRLYPDAEIIKTDRRGWGLRTKRNIKKGEFVNEYVGELIDEEECRLR 1136

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1137 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1196

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1197 LCDIPAGMELTFNYNLDCL 1215


>gi|431897323|gb|ELK06585.1| Putative histone-lysine N-methyltransferase NSD2 [Pteropus alecto]
          Length = 502

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
           K  PY  IK N    K ++    D      C   P     C  D  C   ML+  C    
Sbjct: 57  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 115

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID+  C  R
Sbjct: 116 CPAGEFCQNQCFTKRQYPETKIIRTDGKGWGLVAKRDIRKGEFVNEYVGELIDEDECMAR 175

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           + +      T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 176 IKRAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 235

Query: 217 TRDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  + L
Sbjct: 236 VCDIPAGTELTFNYNLDCL 254


>gi|118090799|ref|XP_420839.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Gallus
            gallus]
          Length = 1369

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 992  KPPPYKHIKVNKPCGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMYECHPQV 1050

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + ++++T+  G G+VA  DIK+GEFV EYVGE+ID++ C  R
Sbjct: 1051 CPAGERCQNQCFTKREYPETEIIKTDGKGWGLVAKRDIKKGEFVNEYVGELIDEEECMAR 1110

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1111 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1170

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1171 VCDIPAGTELTFNYNLDCL 1189


>gi|402852477|ref|XP_003890948.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           [Papio anubis]
          Length = 1013

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
           K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 635 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 693

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 694 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 753

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 754 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 813

Query: 217 TRDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  + L
Sbjct: 814 VCDIPAGTELTFNYNLDCL 832


>gi|26347387|dbj|BAC37342.1| unnamed protein product [Mus musculus]
          Length = 601

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
           K  PY  IK N    K ++    D      C   P     C  D  C   ML+  C    
Sbjct: 223 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGSDSECLNRMLMFECHPQV 281

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 282 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 341

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 342 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 401

Query: 217 TRDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  + L
Sbjct: 402 VCDIPAGTELTFNYNLDCL 420


>gi|313226807|emb|CBY21952.1| unnamed protein product [Oikopleura dioica]
          Length = 216

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 66/153 (43%), Positives = 93/153 (60%)

Query: 80  CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
           C  +C    L+  CSS C+ GN C NK FQ R     K+   +  G G+ A   I RG+F
Sbjct: 16  CGEECLNRALMVECSSRCRTGNRCTNKNFQKRKYAPAKIFWAKGKGHGLKATAFIPRGKF 75

Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
           +IEY+GEV+  +  ++R  +    G+ + Y  E++R   IDA +KG  SR+INHSC PN+
Sbjct: 76  IIEYLGEVVSAKEFKKRSHEYARSGKQHHYFMELSRQATIDAYHKGAISRFINHSCEPNS 135

Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           E QKW ++G  RIG FA RDI+  E +T+DYQ+
Sbjct: 136 ETQKWTVNGLLRIGFFAIRDIQPEEEITFDYQF 168


>gi|190349638|gb|ACE75882.1| multiple-myeloma-related WHSC1/MMSET isoform RE-IIBP [Homo sapiens]
          Length = 704

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
           K  PY  IK N    K ++    D      C  +P     C  D  C   ML+  C    
Sbjct: 326 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCTPTDENPCGSDSECLNRMLMFECHPQV 384

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 385 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 444

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 445 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 504

Query: 217 TRDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  + L
Sbjct: 505 VCDIPAGTELTFNYNLDCL 523


>gi|342209884|gb|AEL16989.1| ASH1-like protein [Phaseolus vulgaris]
          Length = 481

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 78/212 (36%), Positives = 113/212 (53%), Gaps = 11/212 (5%)

Query: 26  NPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS-GVCDRDC 84
           +P   ELP          Y+ I +N +  +R K++ E+D   C C      +   C   C
Sbjct: 2   DPFTEELPQ---------YIHINQNEFFMRRQKKQKEEDIAICECKYDANDTDSACGDSC 52

Query: 85  HCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEY 143
              +  + C+ G C C   C N+ FQ     K KL +TE  G G++A ED+K G+FVIEY
Sbjct: 53  LNVLTSTECTPGYCPCDILCKNQKFQKCEYAKTKLFKTEGRGWGLLAGEDLKAGQFVIEY 112

Query: 144 VGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQK 203
            GEVI  +  + R    ++ G  + ++  +N    IDAT KG+ +R+INHSC PN E +K
Sbjct: 113 CGEVISWKEAKRRSQAYENQGLKDAFIICLNASESIDATRKGSLARFINHSCRPNCETRK 172

Query: 204 WIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
           W + GE R+GIFA  D+  G  L YDY +E+ 
Sbjct: 173 WNVLGEIRVGIFAKHDVPIGTELAYDYNFEWF 204


>gi|432909264|ref|XP_004078147.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Oryzias
           latipes]
          Length = 1665

 Score =  135 bits (339), Expect = 2e-29,   Method: Composition-based stats.
 Identities = 73/207 (35%), Positives = 115/207 (55%), Gaps = 9/207 (4%)

Query: 36  FIKPKAIPYVF--IKRNIYLTKRIKRRLEDD--GIFCSC---TASPGSSGV--CDRDCHC 86
           F + K +P  F  I+ N+YLT+R K +   D   + C C   +      GV  C  DC  
Sbjct: 544 FARSKKMPPYFDLIEENLYLTERKKSKSHRDIKRMQCECPVLSREERGKGVMACGEDCLN 603

Query: 87  GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
            +L+  CSS C  G  C N+ FQ +     +++ T+  G G+ A +++    FV+EY GE
Sbjct: 604 RLLMIECSSRCLNGAYCSNRRFQMKQHADFEVILTDDKGWGLRAAKEMAPNTFVLEYCGE 663

Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
           V+D +  + R+ +       ++Y   +  + +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 664 VLDHKEFKTRVKEYARNKNIHYYFMSLKNNEIIDATLKGNCSRFMNHSCEPNCETQKWTV 723

Query: 207 DGETRIGIFATRDIKKGENLTYDYQYE 233
           +G+ R+G F T+ +  G  LT+DYQ++
Sbjct: 724 NGQLRVGFFTTKAVAAGTELTFDYQFQ 750


>gi|31418293|gb|AAH53454.1| Whsc1 protein, partial [Mus musculus]
          Length = 558

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
           K  PY  IK N    K ++    D      C   P     C  D  C   ML+  C    
Sbjct: 180 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGSDSECLNRMLMFECHPQV 238

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 239 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 298

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 299 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 358

Query: 217 TRDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  + L
Sbjct: 359 VCDIPAGTELTFNYNLDCL 377


>gi|384484414|gb|EIE76594.1| hypothetical protein RO3G_01298 [Rhizopus delemar RA 99-880]
          Length = 291

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 81/218 (37%), Positives = 119/218 (54%), Gaps = 16/218 (7%)

Query: 29  EFELPDWF-----------IKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
           EFELP +            +K K   +  I+ N+++ ++ K    D+   C C    G +
Sbjct: 49  EFELPPFIWQERELNLIKSLKDKRPVFTKIRSNLFVERKPKL-CPDERNVCHCKEPTGEN 107

Query: 78  GV-CDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVK-KMKLVQTEKCGAGIVADEDI 134
            V C  DC   +L   C    C CG  C N  FQ +  K K+++ +T++ G G+   EDI
Sbjct: 108 EVGCGDDCINRLLFYECDPKHCPCGPKCSNNRFQKKQRKYKLEVFKTDQRGWGLRTLEDI 167

Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
           K+G FVIEY GE+I  + CEER+    ++ E NFY  E ++  VIDA  KG ++R+INHS
Sbjct: 168 KKGAFVIEYRGEIISQKLCEERMC-TDYVNENNFYFLEYSKGEVIDACTKGTEARFINHS 226

Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           C PN  ++KW   GE   G+FA++DI     L+YDY +
Sbjct: 227 CDPNCHIEKWSYRGEAHFGVFASKDIPAYSELSYDYNF 264


>gi|354472091|ref|XP_003498274.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
            [Cricetulus griseus]
          Length = 1436

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 79/222 (35%), Positives = 116/222 (52%), Gaps = 6/222 (2%)

Query: 17   FNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS 76
            F +L  Q  +    E+     KP   PY  IK N  + K +  ++ D      C   P  
Sbjct: 1048 FQELKAQRESKEALEIEKTSRKPP--PYKHIKANKVIGK-VPIQVADLSEIPRCNCKPAD 1104

Query: 77   SGVCDRDCHCGMLLSS--CSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADED 133
               C  +  C   +S   C    C  G+ C N+ F  R     ++++TE+ G G+     
Sbjct: 1105 ENPCGLESECLNRMSQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERKGWGLRTKRS 1164

Query: 134  IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
            IK+GEFV EYVGE+ID++ C  R+ +      TNFY+  + +D +IDA  KGN SR++NH
Sbjct: 1165 IKKGEFVNEYVGELIDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNH 1224

Query: 194  SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
            SC PN E QKW ++G+ R+G+FA  DI  G  LT++Y  + L
Sbjct: 1225 SCNPNCETQKWTVNGDVRVGLFAICDIPAGMELTFNYNLDCL 1266


>gi|195503632|ref|XP_002098733.1| GE10528 [Drosophila yakuba]
 gi|194184834|gb|EDW98445.1| GE10528 [Drosophila yakuba]
          Length = 1441

 Score =  134 bits (338), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 76/205 (37%), Positives = 114/205 (55%), Gaps = 7/205 (3%)

Query: 43   PYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG-CK 98
            PYV IK N  +   R  + LED     +C   PG    C  D  C   ML + C+   CK
Sbjct: 1166 PYVKIKTNKAVPPVRFTQNLED---LSACYCQPGDDHPCGPDSGCINRMLFNECNPEFCK 1222

Query: 99   CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
             GN C N+ F+ R   ++++V   + G G+V  E I  G+FVIEYVGEVI+    + R+ 
Sbjct: 1223 AGNRCENQMFEQRKSPRLEVVYMNERGFGLVNREPIAEGDFVIEYVGEVINHAEFQRRME 1282

Query: 159  KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
            + +   + N+Y   + +D +IDA  KGN +R++NHSC PN E QKW ++   R+G+FA +
Sbjct: 1283 QKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGLFAIK 1342

Query: 219  DIKKGENLTYDYQYEFLHDSLIAYC 243
            DI     LT++Y ++ L ++    C
Sbjct: 1343 DIPVNTELTFNYLWDDLMNNSKKAC 1367


>gi|354472093|ref|XP_003498275.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
            [Cricetulus griseus]
          Length = 1387

 Score =  134 bits (338), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 79/222 (35%), Positives = 116/222 (52%), Gaps = 6/222 (2%)

Query: 17   FNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS 76
            F +L  Q  +    E+     KP   PY  IK N  + K +  ++ D      C   P  
Sbjct: 999  FQELKAQRESKEALEIEKTSRKPP--PYKHIKANKVIGK-VPIQVADLSEIPRCNCKPAD 1055

Query: 77   SGVCDRDCHCGMLLSS--CSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADED 133
               C  +  C   +S   C    C  G+ C N+ F  R     ++++TE+ G G+     
Sbjct: 1056 ENPCGLESECLNRMSQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERKGWGLRTKRS 1115

Query: 134  IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
            IK+GEFV EYVGE+ID++ C  R+ +      TNFY+  + +D +IDA  KGN SR++NH
Sbjct: 1116 IKKGEFVNEYVGELIDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNH 1175

Query: 194  SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
            SC PN E QKW ++G+ R+G+FA  DI  G  LT++Y  + L
Sbjct: 1176 SCNPNCETQKWTVNGDVRVGLFAICDIPAGMELTFNYNLDCL 1217


>gi|281346901|gb|EFB22485.1| hypothetical protein PANDA_005493 [Ailuropoda melanoleuca]
          Length = 926

 Score =  134 bits (338), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
           K  PY  IK N  + K ++  + D      C   P     C  +  C   ML   C    
Sbjct: 558 KPPPYKHIKANKVIGK-VQIHVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 616

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G  C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C+ R
Sbjct: 617 CPAGERCQNQCFTKRLYPDAEVIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECKLR 676

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 677 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 736

Query: 217 TRDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  + L
Sbjct: 737 LCDIPAGMELTFNYNLDCL 755


>gi|441664377|ref|XP_003279042.2| PREDICTED: histone-lysine N-methyltransferase NSD2-like [Nomascus
           leucogenys]
          Length = 780

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
           K  PY  IK N    K ++    D      C   P     C  D  C   ML+  C    
Sbjct: 482 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 540

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 541 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 600

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 601 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 660

Query: 217 TRDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  + L
Sbjct: 661 VCDIPAGTELTFNYNLDCL 679


>gi|156230137|gb|AAI52413.1| WHSC1 protein [Homo sapiens]
          Length = 713

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
           K  PY  IK N    K ++    D      C   P     C  D  C   ML+  C    
Sbjct: 335 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 393

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 394 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 453

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 454 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 513

Query: 217 TRDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  + L
Sbjct: 514 VCDIPAGTELTFNYNLDCL 532


>gi|297282129|ref|XP_002802212.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           [Macaca mulatta]
          Length = 713

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
           K  PY  IK N    K ++    D      C   P     C  D  C   ML+  C    
Sbjct: 335 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 393

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 394 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 453

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 454 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 513

Query: 217 TRDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  + L
Sbjct: 514 VCDIPAGTELTFNYNLDCL 532


>gi|40789042|dbj|BAA83042.2| KIAA1090 protein [Homo sapiens]
          Length = 715

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
           K  PY  IK N    K ++    D      C   P     C  D  C   ML+  C    
Sbjct: 337 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 395

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 396 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 455

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 456 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 515

Query: 217 TRDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  + L
Sbjct: 516 VCDIPAGTELTFNYNLDCL 534


>gi|313227685|emb|CBY22833.1| unnamed protein product [Oikopleura dioica]
          Length = 1179

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 74/189 (39%), Positives = 107/189 (56%), Gaps = 4/189 (2%)

Query: 55   KRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVK 114
            KRI+  L +  I C C         CD +C    L + CSS C  G+ C N+ F  R   
Sbjct: 936  KRIEANLAETEI-CDCHKR---DNPCDEECTNRQLQNECSSKCLNGDKCQNRRFTKRHYP 991

Query: 115  KMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEIN 174
             +   QT+  G G+ A + I +G F+IEY+GE+I       RL +   +G TN+Y+ E++
Sbjct: 992  PLLKFQTDWGGNGLKAKKLIPKGSFIIEYIGEIISHDESRIRLEESAKIGVTNYYILELD 1051

Query: 175  RDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEF 234
               +IDA  +GN +R+INHSC PN  +  WI+ G+TRIGIF+ RDI++GE LT++YQ + 
Sbjct: 1052 NLRMIDAGPRGNIARFINHSCDPNCGIDPWIVQGDTRIGIFSKRDIQEGEELTFNYQLQQ 1111

Query: 235  LHDSLIAYC 243
              D     C
Sbjct: 1112 SSDEGKTKC 1120


>gi|119602957|gb|EAW82551.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_d [Homo sapiens]
          Length = 742

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
           K  PY  IK N    K ++    D      C   P     C  D  C   ML+  C    
Sbjct: 364 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 422

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 423 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 482

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 483 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 542

Query: 217 TRDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  + L
Sbjct: 543 VCDIPAGTELTFNYNLDCL 561


>gi|12642795|gb|AAK00344.1|AF330040_1 IL-5 promoter REII-region-binding protein [Homo sapiens]
 gi|119602961|gb|EAW82555.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_g [Homo sapiens]
 gi|133777178|gb|AAH94825.2| Wolf-Hirschhorn syndrome candidate 1 [Homo sapiens]
          Length = 584

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
           K  PY  IK N    K ++    D      C   P     C  D  C   ML+  C    
Sbjct: 206 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 264

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 265 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 324

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 325 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 384

Query: 217 TRDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  + L
Sbjct: 385 VCDIPAGTELTFNYNLDCL 403


>gi|326671180|ref|XP_694414.5| PREDICTED: histone-lysine N-methyltransferase NSD3 [Danio rerio]
          Length = 1562

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 73/199 (36%), Positives = 107/199 (53%), Gaps = 4/199 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            +  PY  IK N    K ++  + D      C   P     C +D  C   ML   C    
Sbjct: 1182 RPPPYKLIKSNKPFGK-VQLHVADLSEIPRCNCKPTDERPCSQDSQCLNRMLQYECHPQV 1240

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++T   G G+   +D+K+G+FV+EYVGE+ID + C++R
Sbjct: 1241 CPAGDRCHNQCFSKRLYPDTEVIKTTGRGWGLKTKQDLKKGDFVMEYVGELIDSEECKQR 1300

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        TNFY+  + +D VIDA  KGN SR++NHSC PN E QKW ++G+ RIG+F 
Sbjct: 1301 IRTANENHVTNFYMLTLTKDRVIDAGPKGNLSRFMNHSCSPNCETQKWTVNGDVRIGLFT 1360

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI     LT++Y  + L
Sbjct: 1361 LCDISADTELTFNYNLDCL 1379


>gi|355729163|gb|AES09785.1| Wolf-Hirschhorn syndrome candidate 1 [Mustela putorius furo]
          Length = 409

 Score =  134 bits (337), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
           K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 32  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 90

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 91  CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 150

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 151 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 210

Query: 217 TRDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  + L
Sbjct: 211 VCDIPAGTELTFNYNLDCL 229


>gi|328715143|ref|XP_001944096.2| PREDICTED: hypothetical protein LOC100167890 [Acyrthosiphon pisum]
          Length = 2332

 Score =  134 bits (337), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 69/164 (42%), Positives = 97/164 (59%), Gaps = 2/164 (1%)

Query: 80   CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
            C  +C   +L   C   C   + C NK FQN+  K+ K+++T   G G+ A EDI  G F
Sbjct: 862  CATECLNRLLNIECGKACVLKSLCTNKQFQNKQFKRTKIIKTADKGYGVFALEDIPSGTF 921

Query: 140  VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
            V EY+GEVID   CE  +   K L + N Y+ ++  D +IDAT KGN +R+INHSC PN 
Sbjct: 922  VDEYMGEVIDQ--CEMIIRMKKILYKNNNYMVQLKHDEIIDATRKGNITRFINHSCEPNC 979

Query: 200  EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
              +KW + GE+R+G F+   I+KGE +T+DY +E   D+    C
Sbjct: 980  VAEKWNVLGESRMGFFSKELIRKGEEITFDYSFEIFGDAAQQKC 1023


>gi|156390320|ref|XP_001635219.1| predicted protein [Nematostella vectensis]
 gi|156222310|gb|EDO43156.1| predicted protein [Nematostella vectensis]
          Length = 690

 Score =  134 bits (337), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 78/192 (40%), Positives = 110/192 (57%), Gaps = 5/192 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC-SSGCK 98
           K   Y  IK N   T   +R + D      C  +P  +   D +C   ML   C +S C 
Sbjct: 305 KPPQYKHIKTNKCTT--AQRIILDPSEMPVCECTPDQACGQDANCLNLMLQFECVASRCP 362

Query: 99  CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
            G+ C N+ FQ R     +  +    G G+ + + IK+G FVIEYVGE+IDD TC ER+ 
Sbjct: 363 AGDKCQNQRFQKRQYVDCEPFRAHSRGWGLRSKQAIKKGTFVIEYVGELIDDATCRERVK 422

Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
           K     +TN+Y+  I++D +IDA   GN SR++NHSC PN E QKW ++GE R+G+F +R
Sbjct: 423 KGD--DDTNYYMLTIDKDCIIDAGPMGNLSRFMNHSCYPNCETQKWTVNGEVRVGLFTSR 480

Query: 219 DIKKGENLTYDY 230
           D++  E LT+DY
Sbjct: 481 DVESQEELTFDY 492


>gi|15488420|gb|AAL01111.1|AF408060_1 ASH1-like protein 3 [Arabidopsis thaliana]
          Length = 125

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 81/125 (64%), Positives = 99/125 (79%), Gaps = 2/125 (1%)

Query: 11  SRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSC 70
           + +G  F+KLL QIG   EFELP+W  K K  PY+FI+RNIYLT+++KRR+EDDGIFCSC
Sbjct: 1   NHLGQVFDKLLNQIGESEEFELPEWLNKGKPTPYIFIRRNIYLTEKVKRRVEDDGIFCSC 60

Query: 71  TASPGSS--GVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGI 128
           ++S   S   VC  +CHCGML SSCSS CKCG+ C NKPFQ R VKKMKL+QTEKCG+GI
Sbjct: 61  SSSSPGSSSTVCGSNCHCGMLFSSCSSSCKCGSECNNKPFQQRHVKKMKLIQTEKCGSGI 120

Query: 129 VADED 133
           VA+E+
Sbjct: 121 VAEEE 125


>gi|380797995|gb|AFE70873.1| putative histone-lysine N-methyltransferase NSD2 isoform 1, partial
           [Macaca mulatta]
          Length = 421

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
           K  PY  IK N    K ++    D      C   P     C  D  C   ML+  C    
Sbjct: 43  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 101

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 102 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 161

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 162 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 221

Query: 217 TRDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  + L
Sbjct: 222 VCDIPAGTELTFNYNLDCL 240


>gi|355729169|gb|AES09787.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Mustela putorius furo]
          Length = 596

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 76/205 (37%), Positives = 108/205 (52%), Gaps = 10/205 (4%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG--------MLLS 91
           K  PY  IK N  + K ++  + D      C   P        +  CG        ML  
Sbjct: 223 KPPPYKHIKANKVIGK-VQIHVADLSEIPRCNCKPADENSPADENPCGLESECLNRMLQY 281

Query: 92  SCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDD 150
            C    C  G  C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID+
Sbjct: 282 ECHPQVCPAGERCQNQCFTKRLYPDAEVIKTERRGWGLRTKRSIKKGEFVNEYVGELIDE 341

Query: 151 QTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGET 210
           + C  R+ +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ 
Sbjct: 342 EECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDV 401

Query: 211 RIGIFATRDIKKGENLTYDYQYEFL 235
           R+G+FA RDI  G  LT++Y  + L
Sbjct: 402 RVGLFALRDIPAGMELTFNYNLDCL 426


>gi|162463380|ref|NP_001105665.1| SET domain-containing protein SET102 [Zea mays]
 gi|22121720|gb|AAM89289.1| SET domain-containing protein SET102 [Zea mays]
 gi|414587223|tpg|DAA37794.1| TPA: SET domain-containing protein SET102 [Zea mays]
          Length = 513

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 77/208 (37%), Positives = 115/208 (55%), Gaps = 12/208 (5%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG------MLLSSC 93
           +A  Y+ I  N +  ++ K + E+D   C C        + D +  CG      +  + C
Sbjct: 9   EAPSYIHIDSNAFSYRKHKSQTEEDIAVCECQYD-----ILDPESPCGDRCLNLLTNTEC 63

Query: 94  SSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
           + G C+CG  C N+ FQ     + +LV+T   G G+VADE+I  G+FVIEY GEVI  + 
Sbjct: 64  TPGYCRCGVYCKNQRFQKCQYARTRLVRTGGRGWGLVADENIMAGQFVIEYCGEVISWKE 123

Query: 153 CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
            + R    +     + Y+  +N D  IDAT KGN +R+INHSC PN E +KW + GE R+
Sbjct: 124 AKRRAQAYETQCLKDAYIIYLNADESIDATRKGNLARFINHSCQPNCETRKWNVLGEVRV 183

Query: 213 GIFATRDIKKGENLTYDYQYEFLHDSLI 240
           GIFA ++I  G  L+YDY +E+    ++
Sbjct: 184 GIFAKQNIPFGTELSYDYNFEWYGGVMV 211


>gi|270014006|gb|EFA10454.1| hypothetical protein TcasGA2_TC012700 [Tribolium castaneum]
          Length = 1740

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 79/208 (37%), Positives = 117/208 (56%), Gaps = 6/208 (2%)

Query: 29   EFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHC 86
            +FE  +  +KP   PYV IK N  +   ++    +     SC   P     C  D DC  
Sbjct: 1275 KFEEKNSGMKPP--PYVRIKVNKPVG-NVRVFDGNTSNTTSCDCDPNQPHPCGPDSDCLN 1331

Query: 87   GMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVG 145
             +LL+ C+   C  G+ C N+ F+ R    +   +T   G G+     I++G+FVIEYVG
Sbjct: 1332 RLLLTECNPDVCPAGDRCNNQCFEKREYPPLVPHRTLYRGWGLKTLAPIRKGQFVIEYVG 1391

Query: 146  EVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWI 205
            E+ID+Q  + R+ KM    E N+Y   I++D ++DA  KGN +R++NHSC PN E QKW 
Sbjct: 1392 EMIDEQEYQRRVQKMHEQKEENYYFLTIDKDRMLDAGPKGNVARFMNHSCDPNCETQKWT 1451

Query: 206  IDGETRIGIFATRDIKKGENLTYDYQYE 233
            ++G+TR+G+FA  DI  G  LT++Y  E
Sbjct: 1452 VNGDTRVGLFANCDIPAGTELTFNYNLE 1479


>gi|91090902|ref|XP_973711.1| PREDICTED: similar to NSD1 [Tribolium castaneum]
          Length = 1795

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 79/208 (37%), Positives = 117/208 (56%), Gaps = 6/208 (2%)

Query: 29   EFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHC 86
            +FE  +  +KP   PYV IK N  +   ++    +     SC   P     C  D DC  
Sbjct: 1330 KFEEKNSGMKPP--PYVRIKVNKPVG-NVRVFDGNTSNTTSCDCDPNQPHPCGPDSDCLN 1386

Query: 87   GMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVG 145
             +LL+ C+   C  G+ C N+ F+ R    +   +T   G G+     I++G+FVIEYVG
Sbjct: 1387 RLLLTECNPDVCPAGDRCNNQCFEKREYPPLVPHRTLYRGWGLKTLAPIRKGQFVIEYVG 1446

Query: 146  EVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWI 205
            E+ID+Q  + R+ KM    E N+Y   I++D ++DA  KGN +R++NHSC PN E QKW 
Sbjct: 1447 EMIDEQEYQRRVQKMHEQKEENYYFLTIDKDRMLDAGPKGNVARFMNHSCDPNCETQKWT 1506

Query: 206  IDGETRIGIFATRDIKKGENLTYDYQYE 233
            ++G+TR+G+FA  DI  G  LT++Y  E
Sbjct: 1507 VNGDTRVGLFANCDIPAGTELTFNYNLE 1534


>gi|301615056|ref|XP_002936997.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Xenopus (Silurana) tropicalis]
          Length = 2440

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 79/198 (39%), Positives = 113/198 (57%), Gaps = 2/198 (1%)

Query: 40   KAIPYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC-SSGC 97
            K  PY  IK N  + K +I      +   C+C A+  +    D +C   MLL  C  S C
Sbjct: 1685 KPPPYKHIKVNRPVGKVQILTADLSEIPRCNCKATDENPCGQDSECINRMLLYECHPSVC 1744

Query: 98   KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
              G  C N+ F  R   ++++ +T   G G+    DIK+GEFV EYVGE+ID++ C  R+
Sbjct: 1745 PAGERCQNQAFSKRQYPEVEIFRTLSRGWGLRCRTDIKKGEFVNEYVGEMIDEEECRARI 1804

Query: 158  WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
               +    TNFY+  +++D VIDA  KGN +R++NH C PN E QKW ++G+TR+G+FA 
Sbjct: 1805 RYAQEQDITNFYMLTLDKDRVIDAGPKGNFARFMNHCCQPNCETQKWTVNGDTRVGLFAL 1864

Query: 218  RDIKKGENLTYDYQYEFL 235
             DIK    LT++Y  E L
Sbjct: 1865 CDIKAXVELTFNYNLECL 1882


>gi|307111585|gb|EFN59819.1| hypothetical protein CHLNCDRAFT_18588, partial [Chlorella
           variabilis]
          Length = 380

 Score =  134 bits (336), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 74/203 (36%), Positives = 110/203 (54%), Gaps = 9/203 (4%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCT---ASPGSSGVCDRDCHCGMLLSSC-SSGCKC 99
           +  I  N+Y  +  K + EDD + C C    A+ G++  C  +C   ML   C    C  
Sbjct: 99  WQLITSNLYTHRERKEQDEDDIMICQCKKIWATDGTTVGCGAECLNRMLNIECVEEYCPS 158

Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER--- 156
           G+ C N+ F  R   K+ + +    G G+ A ED+K G+F+IEY+GEV++++    R   
Sbjct: 159 GHRCSNQMFTKREYSKLDVKRAGAKGFGLFAAEDMKAGQFLIEYLGEVLEEEEYHRRQGA 218

Query: 157 LWKMKHL--GETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGI 214
            WK   +  G+ ++Y   +    VIDA+ +GN  R+INHSC PN E QKW++ GE  IG+
Sbjct: 219 AWKEYFIETGQRHYYFMNVGNGEVIDASRRGNLGRFINHSCEPNCETQKWVVHGELAIGL 278

Query: 215 FATRDIKKGENLTYDYQYEFLHD 237
           F   DI  G  LT+DY +E   D
Sbjct: 279 FTLEDISAGTELTFDYNFERYGD 301


>gi|444722051|gb|ELW62755.1| putative histone-lysine N-methyltransferase NSD2 [Tupaia chinensis]
          Length = 1421

 Score =  133 bits (335), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 70/169 (41%), Positives = 100/169 (59%), Gaps = 3/169 (1%)

Query: 70   CTASPGSSGVC--DRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C   P     C  D +C   ML+  C    C  G  C N+ F  R   + K+++T+  G 
Sbjct: 859  CNCKPTDENPCGFDSECLNRMLMFECHPQVCPAGEYCQNQCFTKRQYPETKIIKTDGKGW 918

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+VA  DI++GEFV EYVGE+ID++ C  R+        T+FY+  I++D +IDA  KGN
Sbjct: 919  GLVAKRDIRKGEFVNEYVGELIDEEECMARIKYAHENDITHFYMLTIDKDRIIDAGPKGN 978

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             SR++NHSC PN E  KW ++G+TR+G+FA  DI  G  LT++Y  + L
Sbjct: 979  YSRFMNHSCQPNCETLKWTVNGDTRVGLFALCDIPAGTELTFNYNLDCL 1027


>gi|15213542|gb|AAK92049.1|AF322907_1 NSD1 [Homo sapiens]
          Length = 2596

 Score =  133 bits (335), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 71/172 (41%), Positives = 107/172 (62%), Gaps = 7/172 (4%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1792 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1851

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGE---TNFYLCEINRDMVIDATY 183
            G+   EDI++GEFV EYVGE+ID++ C   + ++KH  E   T+FY+  I++D +IDA  
Sbjct: 1852 GLRTQEDIRKGEFVNEYVGELIDEEEC---MARIKHAHENDITHFYMLTIDKDRIIDAGP 1908

Query: 184  KGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
            KGN SR++NHSC PN E  KW ++G+TR+G+FA  DI  G  LT++Y  + L
Sbjct: 1909 KGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCL 1960


>gi|194907101|ref|XP_001981487.1| GG12082 [Drosophila erecta]
 gi|190656125|gb|EDV53357.1| GG12082 [Drosophila erecta]
          Length = 1441

 Score =  133 bits (335), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 76/211 (36%), Positives = 116/211 (54%), Gaps = 7/211 (3%)

Query: 37   IKPKAIPYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSC 93
            +K    PYV IK N  +   R  + LED     +C   PG    C  D  C   ML + C
Sbjct: 1160 VKLHPPPYVKIKTNKAVPPVRFTQNLED---LSACYCQPGDEHPCGPDSGCLNRMLFNEC 1216

Query: 94   SSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
            +   CK G+ C N+ F+ R   ++++V   + G G+V  E I  G+FVIEYVGEVI+   
Sbjct: 1217 NPEFCKAGSRCENQMFEQRKSPRLEVVYMNERGFGLVNREPIAEGDFVIEYVGEVINHAE 1276

Query: 153  CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
             + R+ + +   + N+Y   + +D +IDA  KGN +R++NHSC PN E QKW ++   R+
Sbjct: 1277 FQRRMEQKQRGRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRV 1336

Query: 213  GIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
            G+FA +DI     LT++Y ++ L ++    C
Sbjct: 1337 GLFAIKDIPANTELTFNYLWDDLMNNSKKAC 1367


>gi|198432159|ref|XP_002123225.1| PREDICTED: similar to Wolf-Hirschhorn syndrome candidate 1 protein,
           partial [Ciona intestinalis]
          Length = 752

 Score =  133 bits (335), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 66/156 (42%), Positives = 97/156 (62%), Gaps = 1/156 (0%)

Query: 81  DRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
           + +C   ML+  C S  C  G  C N+ FQ R     ++ +T   G  + A + IK+GEF
Sbjct: 247 ESECLNRMLMYECHSDLCPAGEKCQNQRFQKREYPSSEVFKTSWGGWALRAKDLIKKGEF 306

Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
           V EYVGE++D + C  R+        TNFY+  I++D +IDA  KGN SR++NHSC PN 
Sbjct: 307 VSEYVGELVDSEECMRRIEDAHKNNVTNFYMLTIDKDRIIDAGPKGNYSRFMNHSCDPNC 366

Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
           E QKW+++G+TR+G+FA R+I+ GE L ++Y  + L
Sbjct: 367 ETQKWMVNGDTRVGLFALREIQDGEELMFNYNLDCL 402


>gi|320166309|gb|EFW43208.1| ASHH4 [Capsaspora owczarzaki ATCC 30864]
          Length = 453

 Score =  133 bits (335), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 72/194 (37%), Positives = 110/194 (56%), Gaps = 9/194 (4%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
           +  I++N Y+  +  + + D+   C+C AS      C   C   ++   C  G C+  + 
Sbjct: 153 FTRIRKNAYIDVKRPKLIADNE--CNCKAS----APCTESCLNRLMFVECVRGSCRMEDR 206

Query: 103 CLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
           C N+ FQ     K +K+ QT   G G+   + I  G+FV+EYVGEV+ D   E R+W   
Sbjct: 207 CQNRNFQRHNWTKNLKVFQTPNAGYGLRCTDPIAPGQFVMEYVGEVVSDAERERRMWG-P 265

Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
           + G  N Y  E+ + ++IDA  KG  +R+INHSC PN  ++KW ++GE R+GIFA+R I 
Sbjct: 266 YAGNPNHYFLELEKGVLIDACSKGCDARFINHSCDPNCHVEKWNVNGEFRVGIFASRAIA 325

Query: 222 KGENLTYDYQYEFL 235
             E L+YDY++E L
Sbjct: 326 PNEELSYDYRFETL 339


>gi|167526088|ref|XP_001747378.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774213|gb|EDQ87845.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1048

 Score =  133 bits (335), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 76/196 (38%), Positives = 109/196 (55%), Gaps = 6/196 (3%)

Query: 44  YVFIKRNIYLTKRIK--RRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGC-KCG 100
           +  IKRN+YL    K   R E+  I C C A+   + +    C   M+   C + C   G
Sbjct: 694 FKLIKRNVYLPPAAKPPHRKEEQEI-CRCDAALNCADM--HSCLNRMMYIECDAKCCNNG 750

Query: 101 NSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKM 160
            +C N+ FQ R   K+   +TE  G G+   +D++ G+ VIEYVGEVID   C  R+ + 
Sbjct: 751 KNCRNQRFQRREYPKLIPFKTEHRGWGLRLGQDVEEGDLVIEYVGEVIDGAECRRRIDQY 810

Query: 161 KHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDI 220
           +    ++FY+  +  D  +DA  K N +R+INHSC PN   QKW + GETR+GIFA R +
Sbjct: 811 EERNTSSFYILSLGSDTFVDAREKANMARFINHSCDPNCVTQKWNVLGETRVGIFAKRAL 870

Query: 221 KKGENLTYDYQYEFLH 236
            KG  LT+DY  + L+
Sbjct: 871 AKGTELTFDYMLDCLN 886


>gi|367027906|ref|XP_003663237.1| methyltransferase-like protein [Myceliophthora thermophila ATCC
           42464]
 gi|347010506|gb|AEO57992.1| methyltransferase-like protein [Myceliophthora thermophila ATCC
           42464]
          Length = 950

 Score =  133 bits (334), Expect = 7e-29,   Method: Composition-based stats.
 Identities = 71/188 (37%), Positives = 103/188 (54%), Gaps = 6/188 (3%)

Query: 62  EDDGIFCSCTAS--PGSSGVC--DRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKM 116
           + D + C C      G +  C  D DC        C SG C CG  C N+ FQ +   K+
Sbjct: 120 DHDALDCDCAEDWHDGQNHACGEDSDCINRATKIECVSGDCNCGEGCQNQRFQRKQYAKV 179

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            +++TEK G G+ AD D++  +FV EYVGEVI++ T   R+ K    G  +FY   + ++
Sbjct: 180 SVIKTEKKGFGLRADTDLQANDFVYEYVGEVINEPTFRSRMVKYDREGIKHFYFMSLTKN 239

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
             +DAT KGN  R+ NHSC PN  + KW++  + R+GIFA R I+ GE L ++Y  + + 
Sbjct: 240 EFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFAGRAIRAGEELVFNYNVDRYG 299

Query: 236 HDSLIAYC 243
            D    YC
Sbjct: 300 ADPQPCYC 307


>gi|344238567|gb|EGV94670.1| Histone-lysine N-methyltransferase NSD3 [Cricetulus griseus]
          Length = 620

 Score =  133 bits (334), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 74/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSS--CSSG- 96
           K  PY  IK N  + K +  ++ D      C   P     C  +  C   +S   C    
Sbjct: 253 KPPPYKHIKANKVIGK-VPIQVADLSEIPRCNCKPADENPCGLESECLNRMSQYECHPQV 311

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 312 CPAGDRCQNQCFTKRLYPDAEIIKTERKGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 371

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           + +      TNFY+  + +D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 372 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 431

Query: 217 TRDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  + L
Sbjct: 432 ICDIPAGMELTFNYNLDCL 450


>gi|47221386|emb|CAF97304.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 722

 Score =  133 bits (334), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 68/169 (40%), Positives = 100/169 (59%), Gaps = 1/169 (0%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
           C+C A+  S    + +C   MLL  C    C  G  CLN+ F  R   ++++ +T   G 
Sbjct: 396 CNCKATDESPCGMESECINRMLLYECHPQVCPAGERCLNQAFTKRQYSQVEIFRTLSRGW 455

Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
           G+    DIK+G FV EYVGEVID++ C  R+   +     NFY+  +++D +IDA  KGN
Sbjct: 456 GLRCVHDIKKGHFVSEYVGEVIDEEECRARIRHAQENDICNFYMLTLDKDRIIDAGPKGN 515

Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
           ++R++NH C PN E QKW + G+TR+G+FA  D+  G  LT++Y  E L
Sbjct: 516 EARFMNHCCQPNCETQKWTVSGDTRVGLFALVDVPAGTELTFNYNLECL 564


>gi|156391978|ref|XP_001635826.1| predicted protein [Nematostella vectensis]
 gi|156222924|gb|EDO43763.1| predicted protein [Nematostella vectensis]
          Length = 348

 Score =  132 bits (333), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 73/186 (39%), Positives = 105/186 (56%), Gaps = 3/186 (1%)

Query: 51  IYLTKRIKR-RLEDDGIFCSCTASPGSSGV--CDRDCHCGMLLSSCSSGCKCGNSCLNKP 107
           I L+ R KR   E   + C C   P +     C  DC   +L+  C+  C CG+ C N+ 
Sbjct: 11  ILLSSRKKRINKEVRKMTCECYPEPDNPDFVGCGEDCLNRLLMIECNHRCPCGDLCTNRR 70

Query: 108 FQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN 167
           FQ     K+++ +TEK G G+   ED+++ +FVIEY GEV++ +  + R  +       +
Sbjct: 71  FQEGCKIKVEVFKTEKKGWGVKTLEDLEQNQFVIEYCGEVMNYRDFQSRAQRYDRQKRRH 130

Query: 168 FYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLT 227
           +Y   +  D +IDAT KG+ SR+INHSC PN   QKW ++G  RIG F  R IK GE LT
Sbjct: 131 YYFMTLRADEIIDATLKGSISRFINHSCEPNCVTQKWTVNGLLRIGFFTLRTIKAGEELT 190

Query: 228 YDYQYE 233
           +DYQ +
Sbjct: 191 FDYQLQ 196


>gi|344241969|gb|EGV98072.1| putative histone-lysine N-methyltransferase ASH1L [Cricetulus
            griseus]
          Length = 1546

 Score =  132 bits (332), Expect = 1e-28,   Method: Composition-based stats.
 Identities = 78/227 (34%), Positives = 122/227 (53%), Gaps = 18/227 (7%)

Query: 19   KLLKQIGNPVEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFC 68
            K L+Q    ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C
Sbjct: 1132 KYLRQ--KRIDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTC 1187

Query: 69   SCTASPGSSGV-CDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRP-VKKMKLVQTEKCG 125
            +C      +   C  DC   M+ + CS + C CG  C N+  Q    V+ ++  + E+ G
Sbjct: 1188 NCKKPDDDTRKGCGDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKG 1247

Query: 126  AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKG 185
             GI   E +K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   G
Sbjct: 1248 WGIRTKEPLKAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMG 1306

Query: 186  NKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            N++R+INHSC PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 1307 NEARFINHSCDPNCEMQKWSVNGVYRIGLYALKDVLAGTELTYDYNF 1353


>gi|348530060|ref|XP_003452529.1| PREDICTED: hypothetical protein LOC100707110 [Oreochromis niloticus]
          Length = 2876

 Score =  131 bits (329), Expect = 3e-28,   Method: Composition-based stats.
 Identities = 79/226 (34%), Positives = 118/226 (52%), Gaps = 16/226 (7%)

Query: 19   KLLKQIGNPVEFELPD---WFIK-------PKAIPYVFIKRNIYLTKRIKRRLEDDGIFC 68
            K L+Q    ++F+LP    W  K       P    Y  I+ N+Y+  +     E     C
Sbjct: 1913 KYLRQ--KRIDFQLPYDILWLWKHDQLHKRPDVPLYKKIRSNVYVDVKPLSGYETTTCNC 1970

Query: 69   SCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRP-VKKMKLVQTEKCGA 126
                     G C  +C   M  + CS S C CG+ C N+  Q    V+ ++  +TE  G 
Sbjct: 1971 RLPEDSTEKG-CMDECLNRMSFAECSPSTCPCGDQCDNQHIQRHEWVQCLERFRTEGKGW 2029

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            GI   E ++ G+F+IEY+GEV+ +Q    R+ + ++   +  Y   ++  MVID+   GN
Sbjct: 2030 GIRTKESLRSGQFIIEYLGEVVSEQEFRSRMME-QYFSHSGHYCLNLDSGMVIDSYRMGN 2088

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            ++R+INHSC PN EMQKW ++G  RIG+FA +DI  G  LTYDY +
Sbjct: 2089 EARFINHSCEPNCEMQKWSVNGVYRIGLFALKDISSGTELTYDYNF 2134


>gi|342319484|gb|EGU11432.1| Histone-lysine N-methyltransferase, H3 lysine-36 specific
           [Rhodotorula glutinis ATCC 204091]
          Length = 844

 Score =  131 bits (329), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 74/189 (39%), Positives = 103/189 (54%), Gaps = 9/189 (4%)

Query: 64  DGIFCSC-----TASPGSSGVCDRDCHC--GMLLSSCSSG-CKCGNSCLNKPFQNRPVKK 115
           D   C C     T  P +   C    +C   M+   C  G C+CG  C N+ FQ R    
Sbjct: 235 DASRCDCSWNTSTREPDADEACGEHSNCVNRMMQIECLMGDCRCGRHCQNQRFQKRQYAP 294

Query: 116 MKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINR 175
           +++VQTEK G G+ A  DI    FV EYVGEVI  Q  + ++ +  + G  +FY   ++R
Sbjct: 295 IEIVQTEKKGFGVRAGADISADTFVYEYVGEVIGPQPFQRKMKEYANEGIKHFYFMALDR 354

Query: 176 DMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-F 234
           D+ IDAT KG K R++NHSC PN  + KW +  + R+GIF  RDIK  E LT++Y  + +
Sbjct: 355 DVFIDATKKGGKGRFLNHSCNPNCVVAKWTVGRKMRMGIFTKRDIKMHEELTFNYNVDRY 414

Query: 235 LHDSLIAYC 243
            H +   YC
Sbjct: 415 GHVAQECYC 423


>gi|238883879|gb|EEQ47517.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 846

 Score =  130 bits (328), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 61/136 (44%), Positives = 92/136 (67%), Gaps = 4/136 (2%)

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C CG+ C N+ FQNR   K+K++QTE  G G++A++DI+  +F+ EY+GEVID+ +  +R
Sbjct: 125 CLCGDDCQNQRFQNRQYSKVKVIQTELKGYGLIAEQDIEENQFIYEYIGEVIDEISFRQR 184

Query: 157 L--WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGI 214
           +  + ++HL   +FY   ++ D  IDAT KG+  R+INHSC PN  + KW +    R+GI
Sbjct: 185 MIEYDLRHL--KHFYFMMLSNDSFIDATEKGSLGRFINHSCNPNAFVDKWHVGDRLRMGI 242

Query: 215 FATRDIKKGENLTYDY 230
           FA R I +GE +T+DY
Sbjct: 243 FAKRKISRGEEITFDY 258


>gi|281206847|gb|EFA81031.1| SET domain-containing protein [Polysphondylium pallidum PN500]
          Length = 1363

 Score =  130 bits (328), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 90/246 (36%), Positives = 127/246 (51%), Gaps = 16/246 (6%)

Query: 6    KNSDNSRIGHAFNKLLKQIGNPVE-----FELPDWFIKPKAIPYVFIKRNIYLTKRIKRR 60
            KN DNS +     + +K I + +E      ++     K +   + + K     + R KR 
Sbjct: 874  KNEDNSIMNLPEPQAIKMIEDNIENGASYSQIAVLIQKARGKSFKYTKS----SNRRKRT 929

Query: 61   LEDDGI-FCSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKKMKL 118
            L++D I  C+C      +G C  +C        C  S CK G  C N+ FQ +    +  
Sbjct: 930  LQEDDIAICNCDII--DNGDCGPECLNRRSFIECEPSNCKLGKHCKNQRFQRQEYALIAP 987

Query: 119  VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
               +K G G+ A E I   +FVIEY GEVI      +R+ +    GE  FY   ++   V
Sbjct: 988  FNAKKKGWGLKAKEKISAHQFVIEYCGEVITRAQSMDRMREAD--GEKYFYFLTLDSKEV 1045

Query: 179  IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
            +DA+ KGN +R+INHSC PN E QKW +DGETRIGIFA +DI+ G  LT+DY YE +  S
Sbjct: 1046 LDASRKGNLARFINHSCDPNCETQKWSVDGETRIGIFALKDIEAGTELTFDYNYERVGSS 1105

Query: 239  LIA-YC 243
              + YC
Sbjct: 1106 KQSCYC 1111


>gi|384499018|gb|EIE89509.1| hypothetical protein RO3G_14220 [Rhizopus delemar RA 99-880]
          Length = 962

 Score =  130 bits (328), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 74/210 (35%), Positives = 112/210 (53%), Gaps = 13/210 (6%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG--------MLLSSCSS 95
           YV +  N+Y      + + ++ + C C        V D D  CG        M+   C++
Sbjct: 90  YVNLIENVYCGSATGKTIAEESMPCECKYL---QEVDDLDAACGDDNYCINRMMFMECTA 146

Query: 96  -GCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCE 154
             C CG  C N+ FQ R   ++ +++TEK G G+ A  D+    F++EY+GEVI +Q   
Sbjct: 147 EDCPCGRYCRNRRFQLRQFARVDVIRTEKKGFGLRALTDLPTNSFIMEYIGEVIPNQEFI 206

Query: 155 ERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGI 214
            R  + +  G  ++Y   +  D +IDAT KG  +R+INHSC PN   QKW++    RIGI
Sbjct: 207 RRTKEYEASGLEHYYFMTLKTDEIIDATKKGCLARFINHSCNPNCVTQKWVVGKNMRIGI 266

Query: 215 FATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           F  R IK GE LT+DY++E +   + + YC
Sbjct: 267 FTNRGIKAGEELTFDYKFERYGAQAQVCYC 296


>gi|68483319|ref|XP_714401.1| likely histone lysine methyltransferase Set2p [Candida albicans
           SC5314]
 gi|68483410|ref|XP_714361.1| likely histone lysine methyltransferase Set2p [Candida albicans
           SC5314]
 gi|74585304|sp|Q59XV0.1|SET2_CANAL RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
           specific; AltName: Full=SET domain-containing protein 2
 gi|46435920|gb|EAK95292.1| likely  histone lysine methyltransferase Set2p [Candida albicans
           SC5314]
 gi|46435962|gb|EAK95333.1| likely  histone lysine methyltransferase Set2p [Candida albicans
           SC5314]
          Length = 844

 Score =  130 bits (328), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 61/136 (44%), Positives = 92/136 (67%), Gaps = 4/136 (2%)

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C CG+ C N+ FQNR   K+K++QTE  G G++A++DI+  +F+ EY+GEVID+ +  +R
Sbjct: 125 CLCGDDCQNQRFQNRQYSKVKVIQTELKGYGLIAEQDIEENQFIYEYIGEVIDEISFRQR 184

Query: 157 L--WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGI 214
           +  + ++HL   +FY   ++ D  IDAT KG+  R+INHSC PN  + KW +    R+GI
Sbjct: 185 MIEYDLRHL--KHFYFMMLSNDSFIDATEKGSLGRFINHSCNPNAFVDKWHVGDRLRMGI 242

Query: 215 FATRDIKKGENLTYDY 230
           FA R I +GE +T+DY
Sbjct: 243 FAKRKISRGEEITFDY 258


>gi|384500869|gb|EIE91360.1| hypothetical protein RO3G_16071 [Rhizopus delemar RA 99-880]
          Length = 883

 Score =  130 bits (328), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 71/196 (36%), Positives = 104/196 (53%), Gaps = 6/196 (3%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPG---SSGVCDRDCHC---GMLLSSCSSGC 97
           Y  I  NIY+     R + ++ + C C   P     S  C  D  C    M +   +  C
Sbjct: 173 YENIGENIYIGSATGRSMAEESMPCECKYDPDLDDPSEACGDDNACINRMMFMECIAQDC 232

Query: 98  KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
            CG  C N+ FQ     ++ +++TEK G G+ A  D+    F++EY+GEVI       R 
Sbjct: 233 PCGRLCRNRRFQLGQYARVDVIRTEKKGYGLRALTDLSSNSFIMEYIGEVITQNEFLHRT 292

Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
            +    G  ++Y   +  D +IDAT KG  +R++NHSC PN   QKW+I  + RIGIF +
Sbjct: 293 REYDAQGFKHYYFMTLKNDEIIDATRKGCLARFMNHSCRPNCVTQKWVIGKKMRIGIFTS 352

Query: 218 RDIKKGENLTYDYQYE 233
           R+IK GE LT+DY++E
Sbjct: 353 RNIKAGEELTFDYKFE 368


>gi|357627347|gb|EHJ77076.1| hypothetical protein KGM_14526 [Danaus plexippus]
          Length = 1912

 Score =  130 bits (328), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 78/245 (31%), Positives = 127/245 (51%), Gaps = 16/245 (6%)

Query: 3    AAKKNSDNSRIGHAFNKLLK---------QIGNPVEFELPDWFIKPKAIP--YVFIKRNI 51
            + K +S  S+I  AF   ++         ++  P E E  D  I    +P  YV +K N 
Sbjct: 1540 SGKVSSQKSKIDAAFTMAMEHAQRACSILKMAAPNEEESSD--IASSLLPPHYVKLKVNK 1597

Query: 52   YLTKRIKRRLE-DDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSGCKCGNSCLNKPF 108
                   ++++ ++     C   P     C     C   MLL+ C   C+ G+ C N+ F
Sbjct: 1598 PCGSLCGKKIDLEESSLTQCECDPNDVDPCGPYTQCLNRMLLTECGPTCRAGDRCNNRAF 1657

Query: 109  QNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNF 168
            + R   ++   +T   G G+   +D++ G+FVIEYVGE+ID++    R+ +   + + NF
Sbjct: 1658 EKRLYPRLGPYRTPHRGWGLRTMQDLRAGQFVIEYVGELIDEEEFRRRMNRKHEVRDENF 1717

Query: 169  YLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTY 228
            Y   ++++ +IDA  KGN +R++NHSC PN E QKW + G+ R+G+FA RDI     LT+
Sbjct: 1718 YFLTLDKERMIDAGPKGNLARFMNHSCEPNCETQKWTVLGDVRVGLFALRDIPANSELTF 1777

Query: 229  DYQYE 233
            +Y  E
Sbjct: 1778 NYNLE 1782


>gi|322799945|gb|EFZ21071.1| hypothetical protein SINV_04653 [Solenopsis invicta]
          Length = 1817

 Score =  130 bits (326), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 76/216 (35%), Positives = 114/216 (52%), Gaps = 15/216 (6%)

Query: 23  QIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASP-----GSS 77
           +I  P+  E  D  ++ +   +  ++ N+YLT+R   + E   + C C  +      G  
Sbjct: 768 KISLPMVLETEDREMEERLSQFEHLQENLYLTERYTNK-ETKRMVCDCFLTEEEIERGEL 826

Query: 78  GVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRG 137
           G C  DC   +L+  C   C  G+ C N+ FQN    K ++ +TEK G G+ A  +I  G
Sbjct: 827 G-CGEDCLNRLLMIECGPRCVVGDRCTNRRFQNCDYAKCEVFRTEKKGFGLRAVVNITVG 885

Query: 138 EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
           EF++EYVGEV+D +    R  +       ++Y   +  D +IDAT KGN SR+INHSC P
Sbjct: 886 EFIMEYVGEVVDPKDFRRRAKEYSKDKNRHYYFMALKSDQIIDATMKGNISRFINHSCDP 945

Query: 198 NTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           N E QKW ++GE RI          GE +T+DY ++
Sbjct: 946 NAETQKWTVNGELRIAA--------GEEITFDYHFQ 973


>gi|198476699|ref|XP_002132425.1| GA25456 [Drosophila pseudoobscura pseudoobscura]
 gi|198137811|gb|EDY69827.1| GA25456 [Drosophila pseudoobscura pseudoobscura]
          Length = 483

 Score =  130 bits (326), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 65/154 (42%), Positives = 87/154 (56%)

Query: 80  CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
           C   C   ML+  C   C  G  C NK FQ       ++  TEK G GI A+  I  GEF
Sbjct: 137 CGSGCINRMLMIECGPLCSNGERCTNKRFQLNQCWPCRVFHTEKKGCGITAELAIPAGEF 196

Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
           ++EYVGEVID    E R  +       ++Y   +    +IDAT  GN SR++NHSC PN 
Sbjct: 197 IMEYVGEVIDSAEFERRQHRYAEGRNRHYYFMALRGGAIIDATMGGNISRFMNHSCDPNA 256

Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           E QKW ++GE RIG+F+ + I  GE +T+DY+Y+
Sbjct: 257 ETQKWTVNGELRIGLFSVKTIMPGEEITFDYRYQ 290


>gi|328768890|gb|EGF78935.1| hypothetical protein BATDEDRAFT_90118 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 1361

 Score =  130 bits (326), Expect = 6e-28,   Method: Composition-based stats.
 Identities = 80/227 (35%), Positives = 119/227 (52%), Gaps = 12/227 (5%)

Query: 18  NKLLKQIGNPVEFELPDWF-----IKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSC-- 70
           N+ ++ +  PV  +   W+     +K     YV I+ NIY   R   ++  D + C+C  
Sbjct: 135 NESVQALSGPVCGQNGQWYENESVLKEVTEAYVHIESNIY-RGRTNGKVSSDFMQCNCEY 193

Query: 71  -TASPGSSGVC--DRDCHCGMLLSSCSS-GCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            ++   S   C  D DC    L   CS+  C  GN+C N+ FQ      +++ +    G 
Sbjct: 194 NSSRDPSWMACGEDSDCINRQLSLECSAEDCPTGNACQNRRFQLCQYSPIQVARAGSKGF 253

Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
           GI A E+I  G F+IEY GEVI      +R+ +  +    +FY   + +D  IDA+ KGN
Sbjct: 254 GIYARENIAGGAFIIEYCGEVIPASLFGKRITEHSNNSAQHFYFMSLKKDEYIDASKKGN 313

Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
            SRY+NHSC PN  +QKW++    RIG+FA R I K   LT+DY++E
Sbjct: 314 LSRYLNHSCDPNCSLQKWLVGDTIRIGLFALRAIPKNAELTFDYKFE 360


>gi|344302485|gb|EGW32759.1| hypothetical protein SPAPADRAFT_54781 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 718

 Score =  130 bits (326), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 62/155 (40%), Positives = 91/155 (58%), Gaps = 3/155 (1%)

Query: 79  VCDRDCHCGMLLSSC---SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIK 135
            C  D HC   ++S    +  C CGN C N+ FQ R    + + QTE  G G+ AD +I 
Sbjct: 75  ACGEDSHCINRVTSVECMNRHCLCGNDCQNQRFQKREYAAVSVFQTELKGYGLRADLEIG 134

Query: 136 RGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSC 195
            G+F+ EY GEVID+ T  +R+ +       +FY   +  D  IDAT +G+ +R++NHSC
Sbjct: 135 EGQFIYEYTGEVIDEATFRQRMVEYDQKSFKHFYFMMLKSDSFIDATVRGSLARFVNHSC 194

Query: 196 CPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
            PN  + KW++  + R+GIFA R I +GE +T+DY
Sbjct: 195 NPNAYVDKWVVGDKLRMGIFAKRKIARGEEITFDY 229


>gi|297842431|ref|XP_002889097.1| hypothetical protein ARALYDRAFT_316589 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297334938|gb|EFH65356.1| hypothetical protein ARALYDRAFT_316589 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 525

 Score =  130 bits (326), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 75/193 (38%), Positives = 105/193 (54%), Gaps = 2/193 (1%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS-SGVCDRDCHCGMLLSSCSSG-CKCGN 101
           Y  I +N +  ++ K++ E+D   C C    G     C   C   +  + C+ G C CG 
Sbjct: 17  YEHIYQNDFSYRKHKKQKEEDISICECKFDFGDPDSACGERCLNVITNTECTPGYCPCGV 76

Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
            C N+ FQ     K KL++ E  G G+VA EDIK G+F+IEY GEVI  +  + R    +
Sbjct: 77  YCKNQKFQKCEYAKTKLIKCEGRGWGLVALEDIKAGQFIIEYCGEVISWKEAKRRAQTYE 136

Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
             G  + Y+  +N    IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA   I 
Sbjct: 137 THGVKDAYIISLNASEAIDATKKGSLARFINHSCRPNCETRKWNVLGEVRVGIFAKESIS 196

Query: 222 KGENLTYDYQYEF 234
               L YDY +E+
Sbjct: 197 PRTELAYDYNFEW 209


>gi|195155589|ref|XP_002018686.1| GL25818 [Drosophila persimilis]
 gi|194114839|gb|EDW36882.1| GL25818 [Drosophila persimilis]
          Length = 476

 Score =  130 bits (326), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 65/154 (42%), Positives = 88/154 (57%)

Query: 80  CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
           C   C   ML+  C   C  G  C NK FQ       ++ +TEK G GI A+  I  GEF
Sbjct: 137 CGSGCINRMLMIECGPLCSNGERCTNKRFQLNQCWPCRVFRTEKKGCGISAELAIPAGEF 196

Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
           ++EYVGEVID    E R  +       ++Y   +    +IDAT  GN SR++NHSC PN 
Sbjct: 197 IMEYVGEVIDSAEFERRQHRYAEGRNRHYYFMALRGGAIIDATMGGNISRFMNHSCDPNA 256

Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           E QKW ++GE RIG+F+ + I  GE +T+DY+Y+
Sbjct: 257 ETQKWTVNGELRIGLFSVKSIMPGEEITFDYRYQ 290


>gi|451846131|gb|EMD59442.1| hypothetical protein COCSADRAFT_258710 [Cochliobolus sativus
           ND90Pr]
          Length = 923

 Score =  130 bits (326), Expect = 7e-28,   Method: Composition-based stats.
 Identities = 62/164 (37%), Positives = 92/164 (56%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG  C N+ FQ +    + +++TEK G G+ A++D+  GEFV
Sbjct: 175 DSDCINRATKMECVGDCGCGRRCQNQRFQRKQYANVAVIKTEKKGFGLRANKDMAPGEFV 234

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GEVID++T   R+ +    G  +FY   + +   +DAT KGN  R+ NHSC PN  
Sbjct: 235 FEYIGEVIDERTFRRRMGQYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCF 294

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIF  R +K GE L ++Y  + +  D    YC
Sbjct: 295 VDKWVVGDKLRMGIFVERQVKAGEELVFNYNVDRYGADPQPCYC 338


>gi|359493195|ref|XP_003634538.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Vitis
           vinifera]
          Length = 413

 Score =  130 bits (326), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 81/236 (34%), Positives = 117/236 (49%), Gaps = 12/236 (5%)

Query: 2   PAAKKNSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIP-YVFIKRNIYLTKRIKRR 60
           PAA++  +  R+  +   +  Q  +  + E          +P ++ I RN +  ++  ++
Sbjct: 8   PAARRTLNRGRVASSLCHIDPQGQSDQQIE---------GVPEFIHINRNDFSYRKHIKQ 58

Query: 61  LEDDGIFCSCT-ASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKL 118
            E+D   C C   +      C   C   +    C+   C C   C N+ FQ R   K KL
Sbjct: 59  KENDIAICECKYKANDPDSACGERCWNVLTSIECTPRYCPCSIHCKNQRFQKREYAKTKL 118

Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
            + E  G G++A E+IK GEFV+EY GEVI       R       G  + Y+  +N    
Sbjct: 119 FRAEGRGWGLLATENIKAGEFVMEYCGEVISRTEARGRSQVYVSQGLKDVYIIPLNAREC 178

Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEF 234
           IDAT KGN +R+INHSC PN E  KW + GE R+GIFA R+I  G  LTY Y +E+
Sbjct: 179 IDATKKGNLARFINHSCQPNCETMKWSVLGEDRVGIFALRNISVGTELTYSYNFEW 234


>gi|451994892|gb|EMD87361.1| hypothetical protein COCHEDRAFT_1144880 [Cochliobolus
           heterostrophus C5]
          Length = 923

 Score =  129 bits (325), Expect = 7e-28,   Method: Composition-based stats.
 Identities = 75/224 (33%), Positives = 112/224 (50%), Gaps = 10/224 (4%)

Query: 27  PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTAS--PGS--SGVC-- 80
           P+  +LPD     K   +  +    Y  K I     D  + C C     P +  +  C  
Sbjct: 118 PLYLDLPDSTEAAKET-FTVLTECTYANKTIGT--TDPALECDCQEEYDPATRKNHACGE 174

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG  C N+ FQ +    + +++TEK G G+ A++D+  GEFV
Sbjct: 175 DSDCINRATKMECVGDCGCGRRCQNQRFQRKQYANVAVIKTEKKGFGLRANKDMAPGEFV 234

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GEVID++T   R+ +    G  +FY   + +   +DAT KGN  R+ NHSC PN  
Sbjct: 235 FEYIGEVIDERTFRRRMGQYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCF 294

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIF  R +K GE L ++Y  + +  D    YC
Sbjct: 295 VDKWVVGDKLRMGIFVERQVKAGEELVFNYNVDRYGADPQPCYC 338


>gi|300124007|emb|CBK25278.2| unnamed protein product [Blastocystis hominis]
          Length = 223

 Score =  129 bits (325), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 66/146 (45%), Positives = 87/146 (59%)

Query: 95  SGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCE 154
           S C CG+ C N+ F+    +K+ L   E+ G G+  D  I RG F+IEYVGEVI  +  E
Sbjct: 45  STCPCGDDCTNQMFRRHEYQKLVLFYDEQKGWGVRTDVFIPRGTFIIEYVGEVISQKESE 104

Query: 155 ERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGI 214
            R    K  G+ + Y   +  D +IDAT KGN SR+INHSC PN E+QKW       +GI
Sbjct: 105 YRRQVGKRQGQMHMYYMSLAPDQLIDATDKGNASRFINHSCDPNCEIQKWATSSTYSVGI 164

Query: 215 FATRDIKKGENLTYDYQYEFLHDSLI 240
           FA RDI  GE +T+DYQ+E + +  I
Sbjct: 165 FAIRDIIPGEEITFDYQFERIGNGAI 190


>gi|391338508|ref|XP_003743600.1| PREDICTED: probable histone-lysine N-methyltransferase Mes-4-like
           [Metaseiulus occidentalis]
          Length = 890

 Score =  129 bits (325), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 74/208 (35%), Positives = 113/208 (54%), Gaps = 6/208 (2%)

Query: 29  EFELPDWFIKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC 86
           EF+L    +  K  P  F  I  N  ++   ++R   D   CSC  S   S   D  C  
Sbjct: 654 EFQLNLQAVHSKKKPPKFRKISSNKMVSASCRKRRLHDVDPCSCKVS---SPCVDFTCTN 710

Query: 87  GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQT-EKCGAGIVADEDIKRGEFVIEYVG 145
             +   CS  C     C N+ FQ R  +  K+  T +  G G+ A ED+K+ +FVIEY+G
Sbjct: 711 RAVYVECSDDCAQREKCKNRRFQERAYRATKVFLTPDGRGWGLQAVEDLKKDDFVIEYIG 770

Query: 146 EVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWI 205
           E+ID + C+ RL ++  +G+ N Y  E + + +IDA   GN++R+INHSC PN   +KW 
Sbjct: 771 EIIDSKECDRRLSQLHAIGDPNVYFLEADANYIIDAGPAGNEARFINHSCNPNLRTEKWT 830

Query: 206 IDGETRIGIFATRDIKKGENLTYDYQYE 233
           ++ +  +G+FA R+I  GE LT++Y  +
Sbjct: 831 VNDKLCVGLFAKRNIAAGEELTFNYNMQ 858


>gi|50553372|ref|XP_504097.1| YALI0E18260p [Yarrowia lipolytica]
 gi|74633562|sp|Q6C5G5.1|SET2_YARLI RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
           specific; AltName: Full=SET domain-containing protein 2
 gi|49649966|emb|CAG79692.1| YALI0E18260p [Yarrowia lipolytica CLIB122]
          Length = 768

 Score =  129 bits (325), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 70/179 (39%), Positives = 100/179 (55%), Gaps = 5/179 (2%)

Query: 66  IFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCG 125
           + C C   P +   CD D  C   L+S     +C   C NK FQ +    + ++ TEK G
Sbjct: 48  MACDCKPGPTA---CDEDSGCINRLTSIEC-VRCCKGCQNKRFQGKKYASVDVISTEKKG 103

Query: 126 AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKG 185
            G+ A +DI  GEFV EYVGEVID+ T +ER       G  +FY   + +   IDAT KG
Sbjct: 104 FGLRATKDIAAGEFVYEYVGEVIDEPTFKERTAIYTTQGVKHFYFMMLQKGEFIDATAKG 163

Query: 186 NKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
              R+ NHSC PN  ++KW++    R+GIFA+R I++GE +T+DY  + +  ++   YC
Sbjct: 164 GLGRFCNHSCAPNGHVEKWVVGKRLRMGIFASRHIQRGEEVTFDYNVDRYGAEAQACYC 222


>gi|395857586|ref|XP_003801172.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
            N-methyltransferase NSD2 [Otolemur garnettii]
          Length = 1371

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 76/195 (38%), Positives = 106/195 (54%), Gaps = 4/195 (2%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 987  KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1045

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGE+ID++ C  R
Sbjct: 1046 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1105

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            +        T+FY+  I++D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1106 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165

Query: 217  TRDIKKGENLTYDYQ 231
              DI  G  L   Y+
Sbjct: 1166 VCDIPAGTALGGLYE 1180


>gi|195352984|ref|XP_002042990.1| GM16309 [Drosophila sechellia]
 gi|194127055|gb|EDW49098.1| GM16309 [Drosophila sechellia]
          Length = 1418

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 76/211 (36%), Positives = 115/211 (54%), Gaps = 7/211 (3%)

Query: 37   IKPKAIPYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSC 93
            IK    PYV IK N  +   R  + LED     +C   P     C  +  C   ML + C
Sbjct: 1147 IKLHPPPYVKIKTNKAVPPLRFSQNLED---LSTCNCLPVDEHPCGPEAGCLNRMLFNEC 1203

Query: 94   SSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
            +   CK G+ C N+ F+ R   ++++V   + G G+V  E I  G+FVIEYVGEVI+   
Sbjct: 1204 NPEYCKAGSLCENRMFEQRKSPRLEVVYMNERGFGLVNREPIAAGDFVIEYVGEVINHAE 1263

Query: 153  CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
             + R+ + +   + N+Y   + +D +IDA  KGN +R++NHSC PN E QKW ++   R+
Sbjct: 1264 FQRRMEQKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRV 1323

Query: 213  GIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
            GIFA +DI     LT++Y ++ L ++    C
Sbjct: 1324 GIFAIKDIPVNTELTFNYLWDDLMNNSKKAC 1354


>gi|326933478|ref|XP_003212830.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L-like
            [Meleagris gallopavo]
          Length = 2974

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 76/218 (34%), Positives = 118/218 (54%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2054 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDEN 2111

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
            G  C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2112 GKGCMEDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2171

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2172 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2230

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2231 CNPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2268


>gi|363742848|ref|XP_422858.3| PREDICTED: probable histone-lysine N-methyltransferase ASH1L [Gallus
            gallus]
          Length = 2954

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 76/218 (34%), Positives = 118/218 (54%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2034 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDEN 2091

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
            G  C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2092 GKGCMEDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2151

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2152 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2210

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2211 CNPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2248


>gi|432881031|ref|XP_004073771.1| PREDICTED: histone-lysine N-methyltransferase ASH1L-like [Oryzias
            latipes]
          Length = 2798

 Score =  128 bits (322), Expect = 2e-27,   Method: Composition-based stats.
 Identities = 80/227 (35%), Positives = 120/227 (52%), Gaps = 18/227 (7%)

Query: 19   KLLKQIGNPVEFELP---------DWFIKPKAIP-YVFIKRNIYLTKRIKRRLEDDGIFC 68
            K L+Q    ++F+LP         D   K   +P Y  I+ N+Y+   +K     +   C
Sbjct: 1850 KYLRQ--KRIDFQLPYDILWLWKHDQLSKRPDVPLYKKIRSNVYVD--VKPFSGYETTTC 1905

Query: 69   SCTASPGSSGV-CDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRP-VKKMKLVQTEKCG 125
            SC     S    C  DC   M  + CS S C CG  C N+  Q    V+ ++  + E  G
Sbjct: 1906 SCKPPENSDDKGCLDDCLNRMSYAECSPSTCPCGEQCDNQHIQRHEWVQCLERFRAEGKG 1965

Query: 126  AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKG 185
             GI   E ++ G+F+IEY+GEV+ +Q    R+ + ++   +  Y   ++  MVID+   G
Sbjct: 1966 WGIRTKEPLRAGQFIIEYLGEVVSEQEFRSRMME-QYFSHSGHYCLNLDSGMVIDSYRMG 2024

Query: 186  NKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            N++R+INHSC PN EMQKW ++G  RIG+FA +D+  G  LTYDY +
Sbjct: 2025 NEARFINHSCDPNCEMQKWSVNGVYRIGLFALKDVSSGTELTYDYNF 2071


>gi|326676505|ref|XP_692254.4| PREDICTED: probable histone-lysine N-methyltransferase ASH1L [Danio
            rerio]
          Length = 2933

 Score =  128 bits (322), Expect = 2e-27,   Method: Composition-based stats.
 Identities = 78/227 (34%), Positives = 122/227 (53%), Gaps = 18/227 (7%)

Query: 19   KLLKQIGNPVEFELP---------DWFIKPKAIP-YVFIKRNIYLTKRIKRRLEDDGIFC 68
            K L+Q    ++F+LP         D   K   +P Y  I+ N+Y+   +K     +   C
Sbjct: 1957 KYLRQ--KRIDFQLPYDILWLWKHDQLYKRPDVPLYKKIRSNVYVD--VKPLSGYEATTC 2012

Query: 69   SCTASPGSSGV-CDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRP-VKKMKLVQTEKCG 125
            +C     SS   C  DC   M+ + CS S C C + C N+  Q    V+ ++  + E  G
Sbjct: 2013 NCRLPDDSSEKGCQDDCLNRMIYAECSPSTCPCSDQCDNQRIQKHEWVQCLERFRAEGKG 2072

Query: 126  AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKG 185
             GI   + ++ G+F+IEY+GEV+ +Q    R+ + ++   +  Y   ++  MVID+   G
Sbjct: 2073 WGIRTKQPLRAGQFIIEYLGEVVSEQEFRSRMME-QYFSHSGHYCLNLDSGMVIDSYRMG 2131

Query: 186  NKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            N++R++NHSC PN EMQKW ++G  RIG+FA +DI  G  LTYDY +
Sbjct: 2132 NEARFVNHSCEPNCEMQKWSVNGVYRIGLFALKDINSGTELTYDYNF 2178


>gi|281338719|gb|EFB14303.1| hypothetical protein PANDA_018255 [Ailuropoda melanoleuca]
          Length = 2981

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 76/218 (34%), Positives = 118/218 (54%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2060 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPEDDT 2117

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
            G  C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2118 GKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2177

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2178 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2236

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2237 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2274


>gi|73960946|ref|XP_537251.2| PREDICTED: probable histone-lysine N-methyltransferase ASH1L isoform
            1 [Canis lupus familiaris]
          Length = 2965

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 76/218 (34%), Positives = 118/218 (54%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2044 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPEDDT 2101

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
            G  C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2102 GKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2161

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2162 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2220

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2221 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2258


>gi|390355937|ref|XP_003728662.1| PREDICTED: uncharacterized protein LOC579712 isoform 2
            [Strongylocentrotus purpuratus]
          Length = 3111

 Score =  128 bits (321), Expect = 2e-27,   Method: Composition-based stats.
 Identities = 80/221 (36%), Positives = 116/221 (52%), Gaps = 24/221 (10%)

Query: 29   EFELP-----DWF-----IKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTA--SPGS 76
            +FELP      W       + +A  +  I+ NIY    +K  ++ + + CSC    +P  
Sbjct: 2203 DFELPYDIWWQWVHNKLPSRTQAPKFKKIRNNIYFD--LKPTIQAEVVRCSCKRPYNPEE 2260

Query: 77   SGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADEDI 134
             G C  DC   M+   CSS  C CG+ C N+  Q       ++   TE  G G+   + I
Sbjct: 2261 KG-CGEDCLNRMIQHECSSASCPCGDQCANQVIQRHNWSPGLRRFMTENRGWGVRTLQPI 2319

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKM---KHLGETNFYLCEINRDMVIDATYKGNKSRYI 191
            +   F+IEY+GEVI      + LWK     +  + + Y   ++  MVID    GN+ R++
Sbjct: 2320 RHSSFIIEYLGEVIS----VKELWKRALDDYQYQKHHYCLNLDGGMVIDGYRYGNEGRFV 2375

Query: 192  NHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            NHSC PN EMQKW+++G  RIG+FA RDI+ GE LTYDY +
Sbjct: 2376 NHSCNPNCEMQKWMVNGLYRIGMFALRDIQPGEELTYDYNF 2416


>gi|452822434|gb|EME29453.1| chondroitin-glucuronate 5-epimerase [Galdieria sulphuraria]
          Length = 1135

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 73/196 (37%), Positives = 110/196 (56%), Gaps = 7/196 (3%)

Query: 40  KAIPYVFIKRNIYL---TKRIKRRLEDDGIF-CSCTASPGSSGVCDRDCHCGMLLSSCSS 95
           K  P+V +KRN Y+      I+ + E D +  C C++        D  C   +  + C  
Sbjct: 736 KPAPFVRLKRNEYICDYIGPIRGKGEMDKVMICHCSSVADLPCCMDSSCLNRVSFTECHP 795

Query: 96  G-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCE 154
             C+ G+ C N+ FQ     ++KL Q  + G G+ A E + +G F+IEY GEVID +  E
Sbjct: 796 EYCRTGSKCQNQRFQKCEYARVKLFQAGERGWGLKAAEFLPKGTFIIEYQGEVIDTEEYE 855

Query: 155 ERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGI 214
            R  K ++ GE +FY   ++ D +IDA+ K N +R+INHSC PN   +KW + GE  +GI
Sbjct: 856 RR--KRRYAGERHFYFMSLDSDHMIDASRKSNMARFINHSCQPNCHTEKWTVLGEPCVGI 913

Query: 215 FATRDIKKGENLTYDY 230
           FA++DI+ G  L +DY
Sbjct: 914 FASQDIEAGTELVFDY 929


>gi|390355935|ref|XP_003728661.1| PREDICTED: uncharacterized protein LOC579712 isoform 1
            [Strongylocentrotus purpuratus]
          Length = 3164

 Score =  128 bits (321), Expect = 2e-27,   Method: Composition-based stats.
 Identities = 80/221 (36%), Positives = 116/221 (52%), Gaps = 24/221 (10%)

Query: 29   EFELP-----DWF-----IKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTA--SPGS 76
            +FELP      W       + +A  +  I+ NIY    +K  ++ + + CSC    +P  
Sbjct: 2256 DFELPYDIWWQWVHNKLPSRTQAPKFKKIRNNIYFD--LKPTIQAEVVRCSCKRPYNPEE 2313

Query: 77   SGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADEDI 134
             G C  DC   M+   CSS  C CG+ C N+  Q       ++   TE  G G+   + I
Sbjct: 2314 KG-CGEDCLNRMIQHECSSASCPCGDQCANQVIQRHNWSPGLRRFMTENRGWGVRTLQPI 2372

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKM---KHLGETNFYLCEINRDMVIDATYKGNKSRYI 191
            +   F+IEY+GEVI      + LWK     +  + + Y   ++  MVID    GN+ R++
Sbjct: 2373 RHSSFIIEYLGEVIS----VKELWKRALDDYQYQKHHYCLNLDGGMVIDGYRYGNEGRFV 2428

Query: 192  NHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            NHSC PN EMQKW+++G  RIG+FA RDI+ GE LTYDY +
Sbjct: 2429 NHSCNPNCEMQKWMVNGLYRIGMFALRDIQPGEELTYDYNF 2469


>gi|62531333|gb|AAH93421.1| Whsc1 protein [Danio rerio]
          Length = 320

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 60/133 (45%), Positives = 89/133 (66%)

Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
           C N+ F  R   + K+++T   G G+++  DIK+GEFV EYVGE+ID++ C  R+   + 
Sbjct: 2   CQNQDFTKRLYPETKIIRTAGKGWGLISLRDIKKGEFVNEYVGELIDEEECRSRIRNAQE 61

Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
              T+FY+  I++D +IDA  KGN SR++NHSC PN E QKW ++G+TR+G+FA  DI  
Sbjct: 62  NDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPA 121

Query: 223 GENLTYDYQYEFL 235
           G  LT++Y  + L
Sbjct: 122 GTELTFNYNLDCL 134


>gi|301785832|ref|XP_002928328.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L-like
            [Ailuropoda melanoleuca]
          Length = 2965

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 76/218 (34%), Positives = 118/218 (54%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2044 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPEDDT 2101

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
            G  C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2102 GKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2161

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2162 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2220

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2221 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2258


>gi|255716246|ref|XP_002554404.1| KLTH0F04510p [Lachancea thermotolerans]
 gi|238935787|emb|CAR23967.1| KLTH0F04510p [Lachancea thermotolerans CBS 6340]
          Length = 686

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 72/193 (37%), Positives = 103/193 (53%), Gaps = 6/193 (3%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTA--SPGSSGVC--DRDCHCGMLLSSCSSGC-- 97
           +  +K+  Y +K +      D I C C      G +  C  D DC   + L  C +    
Sbjct: 36  FTELKKCSYASKELGNSNSHDFIECDCYEEFQDGVNHACGEDSDCINRLTLIECVNDLCG 95

Query: 98  KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
            CGN C N+ FQ R    + + QT+K G G+ A +DI+  EF+ EY+GEVI +    +R+
Sbjct: 96  SCGNDCRNQRFQKREYADIAVFQTQKKGYGVRAQKDIEAHEFIYEYIGEVIAEDEFRDRM 155

Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
               H G  +FY   +     IDAT KG+ +R+ NHSC PN  + KW + G+ R+GIFA 
Sbjct: 156 IDYDHRGLKHFYFMMLQTGEFIDATVKGSLARFCNHSCNPNAYVNKWAVAGKLRMGIFAN 215

Query: 218 RDIKKGENLTYDY 230
           R I KGE +T+DY
Sbjct: 216 RKILKGEEITFDY 228


>gi|443897443|dbj|GAC74784.1| clathrin coat binding protein [Pseudozyma antarctica T-34]
          Length = 925

 Score =  128 bits (321), Expect = 3e-27,   Method: Composition-based stats.
 Identities = 76/204 (37%), Positives = 109/204 (53%), Gaps = 7/204 (3%)

Query: 47  IKRNIYLTKRIKRRLE--DDGIFCSCT-ASPGSSGVCDRDCHC--GMLLSSCSSG-CKCG 100
           I  N Y  K++ R     DD + C C  A+  +   C  +  C   M    CSS  C+ G
Sbjct: 117 ITHNDYHDKKLGRPPGKFDDYMLCDCNPATENTDLACTDESGCINRMTQIECSSSKCRWG 176

Query: 101 NSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKM 160
             C N+ F  R    + +VQTEK G G+ A  DI +  FV EYVGEV++  T  +R+   
Sbjct: 177 KLCRNQRFHRRQYVYVDIVQTEKKGFGLRAAHDIPKESFVYEYVGEVMNQHTFLDRMQLY 236

Query: 161 KHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDI 220
           +  G  +FY   +  +  +DAT KG K R+INHSC PN  + KW +    R+GIFA R+I
Sbjct: 237 RTEGIRHFYFMMLQPNEYLDATKKGGKGRFINHSCNPNCSVSKWQVGKHLRMGIFAKRNI 296

Query: 221 KKGENLTYDYQYE-FLHDSLIAYC 243
            KGE LT++Y  + + +D+   +C
Sbjct: 297 AKGEELTFNYNVDRYGNDAQECFC 320


>gi|344286471|ref|XP_003414981.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L
            [Loxodonta africana]
          Length = 2917

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 76/218 (34%), Positives = 118/218 (54%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 1996 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDA 2053

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
            G  C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2054 GKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2113

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2114 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2172

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2173 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2210


>gi|414587221|tpg|DAA37792.1| TPA: hypothetical protein ZEAMMB73_251567 [Zea mays]
          Length = 503

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 74/192 (38%), Positives = 107/192 (55%), Gaps = 12/192 (6%)

Query: 56  RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG------MLLSSCSSG-CKCGNSCLNKPF 108
           R K + E+D   C C        + D +  CG      +  + C+ G C+CG  C N+ F
Sbjct: 15  RHKSQTEEDIAVCECQYD-----ILDPESPCGDRCLNLLTNTECTPGYCRCGVYCKNQRF 69

Query: 109 QNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNF 168
           Q     + +LV+T   G G+VADE+I  G+FVIEY GEVI  +  + R    +     + 
Sbjct: 70  QKCQYARTRLVRTGGRGWGLVADENIMAGQFVIEYCGEVISWKEAKRRAQAYETQCLKDA 129

Query: 169 YLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTY 228
           Y+  +N D  IDAT KGN +R+INHSC PN E +KW + GE R+GIFA ++I  G  L+Y
Sbjct: 130 YIIYLNADESIDATRKGNLARFINHSCQPNCETRKWNVLGEVRVGIFAKQNIPFGTELSY 189

Query: 229 DYQYEFLHDSLI 240
           DY +E+    ++
Sbjct: 190 DYNFEWYGGVMV 201


>gi|390355933|ref|XP_784903.3| PREDICTED: uncharacterized protein LOC579712 isoform 3
            [Strongylocentrotus purpuratus]
          Length = 3326

 Score =  127 bits (320), Expect = 3e-27,   Method: Composition-based stats.
 Identities = 80/221 (36%), Positives = 116/221 (52%), Gaps = 24/221 (10%)

Query: 29   EFELP-----DWF-----IKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTA--SPGS 76
            +FELP      W       + +A  +  I+ NIY    +K  ++ + + CSC    +P  
Sbjct: 2435 DFELPYDIWWQWVHNKLPSRTQAPKFKKIRNNIYFD--LKPTIQAEVVRCSCKRPYNPEE 2492

Query: 77   SGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADEDI 134
             G C  DC   M+   CSS  C CG+ C N+  Q       ++   TE  G G+   + I
Sbjct: 2493 KG-CGEDCLNRMIQHECSSASCPCGDQCANQVIQRHNWSPGLRRFMTENRGWGVRTLQPI 2551

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKM---KHLGETNFYLCEINRDMVIDATYKGNKSRYI 191
            +   F+IEY+GEVI      + LWK     +  + + Y   ++  MVID    GN+ R++
Sbjct: 2552 RHSSFIIEYLGEVIS----VKELWKRALDDYQYQKHHYCLNLDGGMVIDGYRYGNEGRFV 2607

Query: 192  NHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            NHSC PN EMQKW+++G  RIG+FA RDI+ GE LTYDY +
Sbjct: 2608 NHSCNPNCEMQKWMVNGLYRIGMFALRDIQPGEELTYDYNF 2648


>gi|47222897|emb|CAF99053.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 768

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 77/226 (34%), Positives = 115/226 (50%), Gaps = 29/226 (12%)

Query: 38  KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG- 96
           KP    ++ + R I   + I   L +    C+C AS  +    D +C   ML+  C    
Sbjct: 366 KPPPYRHIKVNRAIGKVQFITADLSEI-PRCNCKASDENPCGVDSECINRMLMYECHPQV 424

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKR-------------------- 136
           C  G  C N+ F  R    +++ +T  CG G+ A  DIK+                    
Sbjct: 425 CAAGERCQNQAFSKRQYTPVEIYRTLSCGWGLRAVSDIKKVRRGDGDGEQLQGVPGVLMT 484

Query: 137 -------GEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSR 189
                  G FVIEYVGEVID++ C  R+   +     NFY+  +++D +IDA  KGN++R
Sbjct: 485 SSDATSQGAFVIEYVGEVIDEEECRARIKHAQENDIFNFYMLTLDKDRIIDAGPKGNQAR 544

Query: 190 YINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
           ++NH C PN E QKW ++G+TR+G+FA +DI KG+ L ++Y  E L
Sbjct: 545 FMNHCCQPNCETQKWTVNGDTRVGLFALQDIPKGKELNFNYNLECL 590


>gi|395532131|ref|XP_003768125.1| PREDICTED: histone-lysine N-methyltransferase ASH1L isoform 2
            [Sarcophilus harrisii]
          Length = 2974

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 80/233 (34%), Positives = 123/233 (52%), Gaps = 18/233 (7%)

Query: 13   IGHAFNKLLKQIGNPVEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLE 62
            + H   K L+Q    ++F+LP      W     + KP    Y  I+ N+Y+   +K    
Sbjct: 2041 VFHDSGKYLRQ--KRIDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSG 2096

Query: 63   DDGIFCSCT-ASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLV 119
             +   C+C     G    C  DC   M+ + CS   C CG  C N+  Q    V+ ++  
Sbjct: 2097 YEATTCNCKKPDDGDRKGCMDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERF 2156

Query: 120  QTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVI 179
            + E+ G GI   E +K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVI
Sbjct: 2157 RAEEKGWGIRTKEPLKAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVI 2215

Query: 180  DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            D+   GN++R+INHSC PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2216 DSYRMGNEARFINHSCNPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2268


>gi|195453973|ref|XP_002074027.1| GK14418 [Drosophila willistoni]
 gi|194170112|gb|EDW85013.1| GK14418 [Drosophila willistoni]
          Length = 1420

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 73/197 (37%), Positives = 111/197 (56%), Gaps = 7/197 (3%)

Query: 43   PYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG-CK 98
            PYV IK N  +T  R+   LED  +   C   PG    C  +  C   ML + C+   C+
Sbjct: 1124 PYVKIKANKPVTPVRLVSNLEDLSV---CDCQPGLEHPCGPESGCLNRMLFNECNPEYCR 1180

Query: 99   CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
             G+ C N+ F+ R   ++ +V     G G+V  E I  G+FVIEYVGEVI+    ++R+ 
Sbjct: 1181 AGDRCENQMFEMRKSPRLDVVYMNDRGFGLVCREAIAEGDFVIEYVGEVINHAEFQKRML 1240

Query: 159  KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
            + +   + N+Y   + +D +IDA  KGN +R++NHSC PN E QKW ++   R+G+FA +
Sbjct: 1241 QKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWSVNCIHRVGLFAIK 1300

Query: 219  DIKKGENLTYDYQYEFL 235
            DI     LT++Y ++ L
Sbjct: 1301 DIPANTELTFNYLWDDL 1317


>gi|442621474|ref|NP_001263029.1| Mes-4, isoform B [Drosophila melanogaster]
 gi|440217972|gb|AGB96409.1| Mes-4, isoform B [Drosophila melanogaster]
          Length = 1423

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 74/205 (36%), Positives = 113/205 (55%), Gaps = 7/205 (3%)

Query: 43   PYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG-CK 98
            PYV IK N  +   R  + LED     +C   P     C  +  C   ML + C+   CK
Sbjct: 1158 PYVKIKTNKAVPPLRFSQNLED---LSTCNCLPVDEHPCGPEAGCLNRMLFNECNPEYCK 1214

Query: 99   CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
             G+ C N+ F+ R   ++++V   + G G+V  E I  G+FVIEYVGEVI+    + R+ 
Sbjct: 1215 AGSLCENRMFEQRKSPRLEVVYMNERGFGLVNREPIAVGDFVIEYVGEVINHAEFQRRME 1274

Query: 159  KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
            + +   + N+Y   + +D +IDA  KGN +R++NHSC PN E QKW ++   R+GIFA +
Sbjct: 1275 QKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIK 1334

Query: 219  DIKKGENLTYDYQYEFLHDSLIAYC 243
            DI     LT++Y ++ L ++    C
Sbjct: 1335 DIPVNSELTFNYLWDDLMNNSKKAC 1359


>gi|209880293|ref|XP_002141586.1| SET domain-containing protein [Cryptosporidium muris RN66]
 gi|209557192|gb|EEA07237.1| SET domain-containing protein [Cryptosporidium muris RN66]
          Length = 517

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 79/190 (41%), Positives = 108/190 (56%), Gaps = 13/190 (6%)

Query: 49  RNIYLTKR-IKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG----CKCGNS- 102
           RN Y  KR + +RL  D   CSCT S      CD+ C    +   C+      CK G+  
Sbjct: 52  RNEYCFKRYVNKRLAQDLGRCSCTKS------CDQYCTNKSIYIECNKDTCQICKQGSKD 105

Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
           C N  F  R  K + L  T+  G G+   EDIK GEFVIEYVGE+I+ +  EER  KM  
Sbjct: 106 CGNCRFTLRQYKLVTLFDTKSKGIGLRTAEDIKLGEFVIEYVGEIINLKQLEERK-KMTS 164

Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
               + Y+  +  +  IDATYKGN +R+INHSC PN   QKW +  + ++G+F+ +DIK 
Sbjct: 165 KLTKHIYVFSLGNETYIDATYKGNLARFINHSCEPNLVAQKWFVGSDIKVGLFSLKDIKA 224

Query: 223 GENLTYDYQY 232
           G+ LT+DY++
Sbjct: 225 GDELTFDYRF 234


>gi|431892339|gb|ELK02779.1| Putative histone-lysine N-methyltransferase ASH1L [Pteropus alecto]
          Length = 1291

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 76/217 (35%), Positives = 114/217 (52%), Gaps = 14/217 (6%)

Query: 28  VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
           ++F+LP      W     + KP    Y  I+ N+Y+  +     E     C         
Sbjct: 374 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVDVKPLSGYEAPTCNCKKPDDAARK 433

Query: 78  GVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIK 135
           G  D DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +K
Sbjct: 434 GCVD-DCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLK 492

Query: 136 RGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSC 195
            G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHSC
Sbjct: 493 AGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHSC 551

Query: 196 CPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            PN EMQKW ++G  RIG++A RD+  G  LTYDY +
Sbjct: 552 DPNCEMQKWSVNGVYRIGLYALRDMPAGTELTYDYNF 588


>gi|430813766|emb|CCJ28924.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 809

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 66/185 (35%), Positives = 108/185 (58%), Gaps = 4/185 (2%)

Query: 50  NIYLTKRIKRRLEDDGIFCSCTAS--PGSSGVCDRDCHCGMLLSS--CSSGCKCGNSCLN 105
           NIY +K +    +D+ + C C +      +  C ++  C   ++S  C+  C CG  C N
Sbjct: 139 NIYHSKDLGESGQDEIMTCECKSEWDGTKNHACGKNSDCINRMTSVECTDDCNCGEDCQN 198

Query: 106 KPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGE 165
           + F+ +    + +++T+K G GI A+ D++ G+F+ EYVGEVID++   +R+    +   
Sbjct: 199 RRFKLKQYSNIDVIKTKKKGYGIRANSDMESGQFIYEYVGEVIDERKFRKRMKIYANEDI 258

Query: 166 TNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGEN 225
            +FY   + R   IDAT KG  +R++NHSC PN  + KW++  + R+GIF  R+I KGE 
Sbjct: 259 KHFYFMMLQRGEYIDATKKGGLARFLNHSCSPNCYVDKWVVGTKLRMGIFCKRNILKGEE 318

Query: 226 LTYDY 230
           LT+DY
Sbjct: 319 LTFDY 323


>gi|390461100|ref|XP_003732597.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 3
            [Callithrix jacchus]
          Length = 1389

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 108/199 (54%), Gaps = 15/199 (7%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 1022 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1080

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R   + K+++T+  G G+VA  DI++GEFV EYVGEVID++ C   
Sbjct: 1081 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGEVIDEEEC--- 1137

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + ++KH  E        N+D +IDA  KGN SR++NHSC PN E  KW ++G+TR+G+FA
Sbjct: 1138 MARIKHAHE--------NQDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1189

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1190 VCDIPAGTELTFNYNLDCL 1208


>gi|24650756|ref|NP_733239.1| Mes-4, isoform A [Drosophila melanogaster]
 gi|29427833|sp|Q8MT36.2|MES4_DROME RecName: Full=Probable histone-lysine N-methyltransferase Mes-4;
            AltName: Full=Maternal-effect sterile 4 homolog
 gi|23172478|gb|AAF56762.2| Mes-4, isoform A [Drosophila melanogaster]
 gi|94400569|gb|ABF17912.1| FI01019p [Drosophila melanogaster]
          Length = 1427

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 74/205 (36%), Positives = 113/205 (55%), Gaps = 7/205 (3%)

Query: 43   PYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG-CK 98
            PYV IK N  +   R  + LED     +C   P     C  +  C   ML + C+   CK
Sbjct: 1162 PYVKIKTNKAVPPLRFSQNLED---LSTCNCLPVDEHPCGPEAGCLNRMLFNECNPEYCK 1218

Query: 99   CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
             G+ C N+ F+ R   ++++V   + G G+V  E I  G+FVIEYVGEVI+    + R+ 
Sbjct: 1219 AGSLCENRMFEQRKSPRLEVVYMNERGFGLVNREPIAVGDFVIEYVGEVINHAEFQRRME 1278

Query: 159  KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
            + +   + N+Y   + +D +IDA  KGN +R++NHSC PN E QKW ++   R+GIFA +
Sbjct: 1279 QKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIK 1338

Query: 219  DIKKGENLTYDYQYEFLHDSLIAYC 243
            DI     LT++Y ++ L ++    C
Sbjct: 1339 DIPVNSELTFNYLWDDLMNNSKKAC 1363


>gi|383854492|ref|XP_003702755.1| PREDICTED: uncharacterized protein LOC100877317 [Megachile rotundata]
          Length = 2187

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 78/237 (32%), Positives = 119/237 (50%), Gaps = 19/237 (8%)

Query: 19   KLLKQIGNPVEFELPDWFIKPKA-------IP---YVFIKRNIYLTKRIKRRLEDDGIFC 68
            KLL+Q   P +     W++  ++       +P   Y  I+ N+Y    +K     +   C
Sbjct: 1300 KLLRQRKMPFQLPYDLWWLHTQSRLPGRESVPSWNYKKIRTNVYYD--VKPTTLYEAQAC 1357

Query: 69   SCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGA 126
             C    G    C  DC   M+ S CS   C CG+ C N+  Q       ++   TE  G 
Sbjct: 1358 ECKPESG----CGDDCINRMIFSECSPQLCPCGDKCENQKIQKHEWAPGLQKFMTEDKGW 1413

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+   + IK G F++EYVGEV+ ++  + R+   ++  +T+ Y  +++  +VID    G 
Sbjct: 1414 GVRTQQSIKSGVFILEYVGEVVSEREFKSRMA-TRYANDTHHYCLQLDGGLVIDGHRMGG 1472

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
              R++NHSC PN EMQKW + G  R+ +FA+RDIK GE LTYDY +   + S    C
Sbjct: 1473 DGRFVNHSCEPNCEMQKWSVHGLPRMALFASRDIKPGEELTYDYNFALFNPSEGQEC 1529


>gi|6143888|gb|AAF04434.1|AC010718_3 unknown protein; 29143-26659 [Arabidopsis thaliana]
          Length = 528

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 73/193 (37%), Positives = 106/193 (54%), Gaps = 2/193 (1%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS-SGVCDRDCHCGMLLSSCSSG-CKCGN 101
           Y  I +N +  ++ K++ E+D   C C    G     C   C   +  + C+ G C CG 
Sbjct: 17  YEHIYQNDFSYRKHKKQKEEDISICECKFDFGDPDSACGERCLNVITNTECTPGYCPCGV 76

Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
            C N+ FQ     K KL++ E  G G+VA E+IK G+F++EY GEVI  +  ++R    +
Sbjct: 77  YCKNQKFQKCEYAKTKLIKCEGRGWGLVALEEIKAGQFIMEYCGEVISWKEAKKRAQTYE 136

Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
             G  + Y+  +N    IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA   I 
Sbjct: 137 THGVKDAYIISLNASEAIDATKKGSLARFINHSCRPNCETRKWNVLGEVRVGIFAKESIS 196

Query: 222 KGENLTYDYQYEF 234
               L YDY +E+
Sbjct: 197 PRTELAYDYNFEW 209


>gi|195574451|ref|XP_002105202.1| GD18047 [Drosophila simulans]
 gi|194201129|gb|EDX14705.1| GD18047 [Drosophila simulans]
          Length = 567

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 76/211 (36%), Positives = 115/211 (54%), Gaps = 7/211 (3%)

Query: 37  IKPKAIPYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSC 93
           IK    PYV IK N  +   R  + LED     +C   P     C  +  C   ML + C
Sbjct: 296 IKLHPPPYVKIKTNKAVPPLRFSQNLED---LSTCNCLPVDEHPCGPEAGCLNRMLFNEC 352

Query: 94  SSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
           +   CK G+ C N+ F+ R   ++++V   + G G+V  E I  G+FVIEYVGEVI+   
Sbjct: 353 NPDYCKAGSLCENRMFEQRKSPRLEVVYMNERGFGLVNREPIAAGDFVIEYVGEVINHAE 412

Query: 153 CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
            + R+ + +   + N+Y   + +D +IDA  KGN +R++NHSC PN E QKW ++   R+
Sbjct: 413 FQRRMEQKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRV 472

Query: 213 GIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
           GIFA +DI     LT++Y ++ L ++    C
Sbjct: 473 GIFAIKDIPVNTELTFNYLWDDLMNNSKKAC 503


>gi|126307634|ref|XP_001366993.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L
            [Monodelphis domestica]
          Length = 2968

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 76/218 (34%), Positives = 118/218 (54%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCT-ASPGS 76
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C     G+
Sbjct: 2048 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDGN 2105

Query: 77   SGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2106 RKGCMDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2165

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2166 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2224

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2225 CNPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2262


>gi|395532129|ref|XP_003768124.1| PREDICTED: histone-lysine N-methyltransferase ASH1L isoform 1
            [Sarcophilus harrisii]
          Length = 2969

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 76/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCT-ASPGS 76
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C     G 
Sbjct: 2049 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDGD 2106

Query: 77   SGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2107 RKGCMDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2166

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2167 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2225

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2226 CNPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2263


>gi|21392158|gb|AAM48433.1| RE61305p [Drosophila melanogaster]
          Length = 1016

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 74/205 (36%), Positives = 113/205 (55%), Gaps = 7/205 (3%)

Query: 43  PYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG-CK 98
           PYV IK N  +   R  + LED     +C   P     C  +  C   ML + C+   CK
Sbjct: 751 PYVKIKTNKAVPPLRFSQNLED---LSTCNCLPVDEHPCGPEAGCLNRMLFNECNPEYCK 807

Query: 99  CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
            G+ C N+ F+ R   ++++V   + G G+V  E I  G+FVIEYVGEVI+    + R+ 
Sbjct: 808 AGSLCENRMFEQRKSPRLEVVYMNERGFGLVNREPIAVGDFVIEYVGEVINHAEFQRRME 867

Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
           + +   + N+Y   + +D +IDA  KGN +R++NHSC PN E QKW ++   R+GIFA +
Sbjct: 868 QKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIK 927

Query: 219 DIKKGENLTYDYQYEFLHDSLIAYC 243
           DI     LT++Y ++ L ++    C
Sbjct: 928 DIPVNSELTFNYLWDDLMNNSKKAC 952


>gi|189195894|ref|XP_001934285.1| histone-lysine N-methyltransferase [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187980164|gb|EDU46790.1| histone-lysine N-methyltransferase [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 948

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 72/224 (32%), Positives = 117/224 (52%), Gaps = 10/224 (4%)

Query: 27  PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCT----ASPGSSGVCDR 82
           P+  +LPD     K   +V +    Y  K I     D  + C C     ++  ++  CD 
Sbjct: 118 PLYLDLPDSTEAAKET-FVVLPECTYANKTIG--TTDPALECDCQEEYDSATKTNHACDE 174

Query: 83  DCHCGMLLS--SCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D  C    +   C   C CG +C N+ FQ +    + +++TEK G G+ A++++  G+FV
Sbjct: 175 DSDCINRATKMECVGDCSCGRTCQNQRFQRKQYADVTVIKTEKKGFGLRANKNMVPGDFV 234

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GEVID++T   R+ +    G  +FY   + +   +DAT KGN  R+ NHSC PN  
Sbjct: 235 FEYIGEVIDERTFRRRMGQYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCF 294

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIF  R+++ GE L ++Y  + +  D    YC
Sbjct: 295 VDKWVVGDKLRMGIFVERNVRAGEELVFNYNVDRYGADPQPCYC 338


>gi|294658433|ref|XP_460767.2| DEHA2F09350p [Debaryomyces hansenii CBS767]
 gi|218511782|sp|Q6BM04.2|SET2_DEBHA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
           specific; AltName: Full=SET domain-containing protein 2
 gi|202953126|emb|CAG89108.2| DEHA2F09350p [Debaryomyces hansenii CBS767]
          Length = 731

 Score =  127 bits (318), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 61/151 (40%), Positives = 93/151 (61%), Gaps = 1/151 (0%)

Query: 81  DRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
           D DC   +    CS+  C CGN C N+ FQ +    + ++QTE  G G+ A+EDI    F
Sbjct: 76  DSDCINRVTSVECSNKFCTCGNDCQNQRFQKKQYANVTVIQTELKGYGLRANEDISESSF 135

Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
           + EY+GEVID+++  +R+         +FY   + +D  IDAT KG+ +R+ NHSC PN 
Sbjct: 136 IYEYIGEVIDEESFRKRMIDYDTKKLIHFYFMMLKKDSFIDATMKGSLARFCNHSCNPNA 195

Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
            + KW++  + R+GIF+ R+I+KGE +T+DY
Sbjct: 196 YVDKWVVGEKLRMGIFSKRNIQKGEEITFDY 226


>gi|22330671|ref|NP_177797.2| histone-lysine N-methyltransferase ASHH1 [Arabidopsis thaliana]
 gi|42572135|ref|NP_974158.1| histone-lysine N-methyltransferase ASHH1 [Arabidopsis thaliana]
 gi|75243465|sp|Q84WW6.1|ASHH1_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHH1; AltName:
           Full=ASH1 homolog 1; AltName: Full=Protein SET DOMAIN
           GROUP 26
 gi|25054844|gb|AAN71912.1| unknown protein [Arabidopsis thaliana]
 gi|225898088|dbj|BAH30376.1| hypothetical protein [Arabidopsis thaliana]
 gi|332197758|gb|AEE35879.1| histone-lysine N-methyltransferase ASHH1 [Arabidopsis thaliana]
 gi|332197759|gb|AEE35880.1| histone-lysine N-methyltransferase ASHH1 [Arabidopsis thaliana]
          Length = 492

 Score =  127 bits (318), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 73/193 (37%), Positives = 106/193 (54%), Gaps = 2/193 (1%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS-SGVCDRDCHCGMLLSSCSSG-CKCGN 101
           Y  I +N +  ++ K++ E+D   C C    G     C   C   +  + C+ G C CG 
Sbjct: 17  YEHIYQNDFSYRKHKKQKEEDISICECKFDFGDPDSACGERCLNVITNTECTPGYCPCGV 76

Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
            C N+ FQ     K KL++ E  G G+VA E+IK G+F++EY GEVI  +  ++R    +
Sbjct: 77  YCKNQKFQKCEYAKTKLIKCEGRGWGLVALEEIKAGQFIMEYCGEVISWKEAKKRAQTYE 136

Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
             G  + Y+  +N    IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA   I 
Sbjct: 137 THGVKDAYIISLNASEAIDATKKGSLARFINHSCRPNCETRKWNVLGEVRVGIFAKESIS 196

Query: 222 KGENLTYDYQYEF 234
               L YDY +E+
Sbjct: 197 PRTELAYDYNFEW 209


>gi|393236319|gb|EJD43868.1| hypothetical protein AURDEDRAFT_114444 [Auricularia delicata
           TFB-10046 SS5]
          Length = 882

 Score =  127 bits (318), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 75/213 (35%), Positives = 115/213 (53%), Gaps = 12/213 (5%)

Query: 36  FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS---GVCDRDCHCGMLLSS 92
           FI+ KA  Y    +N+ L +   R +E+ G  C C+  P        C  DC   +    
Sbjct: 70  FIELKACTY----QNMQLGR--SRNIEE-GFACECSYDPEEDLPDAACGSDCINKLTQIE 122

Query: 93  C-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQ 151
           C    C+CG  C N+ F  R   ++ +VQTEK G G+ A  D+K+ +F+ EYVGEV+++ 
Sbjct: 123 CLEDECRCGVHCQNQRFAKRQYAQIHIVQTEKKGFGLRAATDLKKDDFIYEYVGEVVNNT 182

Query: 152 TCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETR 211
              +R+ +    G  +FY   + ++  IDAT KG   R+ NHSC PN  + KW +    R
Sbjct: 183 QFMKRMREYADEGIRHFYFMMLQKEEFIDATKKGGIGRFANHSCNPNCYVAKWTVGKRIR 242

Query: 212 IGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           +GIFA R++K GE LT++Y  + + H++   YC
Sbjct: 243 MGIFAQRNVKAGEELTFNYNVDRYGHEAQTCYC 275


>gi|189534132|ref|XP_001923087.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
           lysine-20 specific [Danio rerio]
          Length = 773

 Score =  126 bits (317), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 74/198 (37%), Positives = 109/198 (55%), Gaps = 2/198 (1%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGI-FCSCTASPGSSGVCDRDCHCGMLLSSCSSG-C 97
           K  PY  IK N    K +    +   I  C+C A+  +    D +C   MLL  C    C
Sbjct: 319 KPPPYKQIKVNKPFGKVLIISADLSEIPRCNCKATDENPCGMDSECINRMLLYECHPQVC 378

Query: 98  KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
             G  C N+ F  R   +++  +T   G G+    DIK+G F+ EYVGEVID++ C  R+
Sbjct: 379 PAGERCQNQCFIKRQYCQVETFRTLSRGWGLRCVHDIKKGGFISEYVGEVIDEEECRARI 438

Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
              +     NFY+  +++D +IDA  KGN++R++NH C PN E QKW ++G+TR+G+F+ 
Sbjct: 439 KHAQENNIGNFYMLTLDKDRIIDAGPKGNEARFMNHCCQPNCETQKWTVNGDTRVGLFSL 498

Query: 218 RDIKKGENLTYDYQYEFL 235
            DI  G  LT++Y  E L
Sbjct: 499 TDIPAGTELTFNYNLECL 516


>gi|327286108|ref|XP_003227773.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L-like
            [Anolis carolinensis]
          Length = 2957

 Score =  126 bits (317), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 76/218 (34%), Positives = 118/218 (54%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C     S+
Sbjct: 2036 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPEDSN 2093

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2094 KKGCVDDCLNRMIFAECSPNTCPCGEHCCNQRIQRHEWVQCLERFRAEEKGWGIRTKESL 2153

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2154 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2212

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2213 CNPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2250


>gi|241952364|ref|XP_002418904.1| SET domain-containing protein, putative; histone methyltransferase,
           putative; histone-lysine N-methyltransferase, putative
           [Candida dubliniensis CD36]
 gi|223642243|emb|CAX44211.1| SET domain-containing protein, putative [Candida dubliniensis CD36]
          Length = 837

 Score =  126 bits (317), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 60/137 (43%), Positives = 91/137 (66%), Gaps = 4/137 (2%)

Query: 96  GCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEE 155
            C CG+ C N+ FQN    K+K++QTE  G G++A++DI+  +F+ EY+GEVID+ +  +
Sbjct: 124 NCLCGDDCQNQRFQNCQYSKVKVIQTELKGYGLIAEQDIEENQFIYEYIGEVIDESSFRQ 183

Query: 156 RL--WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIG 213
           R+  + ++HL   +FY   ++ D  IDAT KG+  R+INHSC PN  + KW +    R+G
Sbjct: 184 RMIEYDLRHL--KHFYFMMLSNDSFIDATQKGSLGRFINHSCNPNAFVDKWHVGDRLRMG 241

Query: 214 IFATRDIKKGENLTYDY 230
           IFA R I +GE +T+DY
Sbjct: 242 IFAKRKILRGEEITFDY 258


>gi|347966112|ref|XP_321588.5| AGAP001535-PA [Anopheles gambiae str. PEST]
 gi|333470205|gb|EAA00844.6| AGAP001535-PA [Anopheles gambiae str. PEST]
          Length = 3613

 Score =  126 bits (317), Expect = 7e-27,   Method: Composition-based stats.
 Identities = 71/214 (33%), Positives = 108/214 (50%), Gaps = 15/214 (7%)

Query: 27   PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC 86
            P    +P W        Y  I+ N+Y   +      D+   C+C    G    C  DC  
Sbjct: 2682 PGRNSVPSW-------NYRKIRTNVYYDVKANPS-TDNNTQCNCKPDSG----CQDDCLN 2729

Query: 87   GMLLSSC-SSGCKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRGEFVIEYV 144
             M+ + C    C CG+ C N   Q       ++   TE+ G GI + E I +G F++EY+
Sbjct: 2730 RMVYTECVPEQCPCGDRCRNTCIQRHEYAPGLERFMTEEKGWGIRSRERISKGTFIMEYL 2789

Query: 145  GEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKW 204
            GEV+ ++  +ER+  M +L +T+ Y   ++  +VID    G+  R++NHSC PN EMQKW
Sbjct: 2790 GEVVTEREFKERMRTM-YLNDTHHYCLNLDGGLVIDGHRMGSDCRFVNHSCAPNCEMQKW 2848

Query: 205  IIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             ++G  R+ +FA RDI   E L YDY +   + S
Sbjct: 2849 SVNGLFRMALFAMRDIPPNEELCYDYNFSLFNPS 2882


>gi|414587222|tpg|DAA37793.1| TPA: hypothetical protein ZEAMMB73_251567 [Zea mays]
          Length = 489

 Score =  126 bits (317), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 73/190 (38%), Positives = 106/190 (55%), Gaps = 12/190 (6%)

Query: 58  KRRLEDDGIFCSCTASPGSSGVCDRDCHCG------MLLSSCSSG-CKCGNSCLNKPFQN 110
           K + E+D   C C        + D +  CG      +  + C+ G C+CG  C N+ FQ 
Sbjct: 3   KSQTEEDIAVCECQYD-----ILDPESPCGDRCLNLLTNTECTPGYCRCGVYCKNQRFQK 57

Query: 111 RPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYL 170
               + +LV+T   G G+VADE+I  G+FVIEY GEVI  +  + R    +     + Y+
Sbjct: 58  CQYARTRLVRTGGRGWGLVADENIMAGQFVIEYCGEVISWKEAKRRAQAYETQCLKDAYI 117

Query: 171 CEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
             +N D  IDAT KGN +R+INHSC PN E +KW + GE R+GIFA ++I  G  L+YDY
Sbjct: 118 IYLNADESIDATRKGNLARFINHSCQPNCETRKWNVLGEVRVGIFAKQNIPFGTELSYDY 177

Query: 231 QYEFLHDSLI 240
            +E+    ++
Sbjct: 178 NFEWYGGVMV 187


>gi|401625193|gb|EJS43214.1| set2p [Saccharomyces arboricola H-6]
          Length = 736

 Score =  126 bits (317), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 70/186 (37%), Positives = 101/186 (54%), Gaps = 6/186 (3%)

Query: 51  IYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSG--CKCGNSCL 104
           IY  KRI     +D + C C    S G +  CD D  C   + L  C +     CGN C 
Sbjct: 51  IYANKRIGTFKNNDFMECDCYEEFSDGVNHACDEDSDCINRLTLIECVNDLCSSCGNDCQ 110

Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
           N+ FQ +    + + +T+  G G+ A+EDI+  +F+ EY GEVI++    ERL       
Sbjct: 111 NQRFQKKQYSPIAIFKTKYKGYGVRAEEDIEANQFIYEYKGEVIEEMEFRERLIDYDQRH 170

Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
             +FY   +     IDAT KG+ +R+ NHSC PN  + KW++ G+ R+GIFA + I KGE
Sbjct: 171 FKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKGKLRMGIFAQKKILKGE 230

Query: 225 NLTYDY 230
            +T+DY
Sbjct: 231 EITFDY 236


>gi|194746360|ref|XP_001955648.1| GF16138 [Drosophila ananassae]
 gi|190628685|gb|EDV44209.1| GF16138 [Drosophila ananassae]
          Length = 1460

 Score =  126 bits (317), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 73/205 (35%), Positives = 112/205 (54%), Gaps = 7/205 (3%)

Query: 43   PYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG-CK 98
            PY  IK N  +   R  + LE+     +C   P     C  +  C   ML + C+   C+
Sbjct: 1171 PYAKIKANKAVAPVRFMQNLEE---LSACDCLPEHDHPCGPESGCLNRMLYNECNPDFCR 1227

Query: 99   CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
             G+ C N+ F+ R   ++++V   + G G+V  E I  G FVIEYVGEVI+    +ERL 
Sbjct: 1228 AGDRCENRMFEQRISPRLEVVYMNERGFGLVCREPIAEGTFVIEYVGEVINHAEFQERLI 1287

Query: 159  KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
            + +   + N+Y   + +D +IDA  KGN +R++NHSC PN E QKW ++   R+GIFA +
Sbjct: 1288 QKQRNRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCVHRVGIFAIK 1347

Query: 219  DIKKGENLTYDYQYEFLHDSLIAYC 243
            DI     LT++Y ++ L ++    C
Sbjct: 1348 DIPANTELTFNYLWDDLMNNSKKAC 1372


>gi|328785970|ref|XP_001122948.2| PREDICTED: hypothetical protein LOC727238 [Apis mellifera]
          Length = 2172

 Score =  126 bits (316), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 78/237 (32%), Positives = 118/237 (49%), Gaps = 19/237 (8%)

Query: 19   KLLKQIGNPVEFELPDWFIKPKA-------IP---YVFIKRNIYLTKRIKRRLEDDGIFC 68
            KLL+Q   P +     W++  ++       +P   Y  I+ N+Y    +K     +   C
Sbjct: 1285 KLLRQRKMPFQLPYDLWWLHTQSRLPSRESVPSWNYKKIRTNVYYD--VKPTTLYEAQAC 1342

Query: 69   SCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGA 126
             C    G    C  DC   M+ S CS   C CG+ C N+  Q       ++   TE  G 
Sbjct: 1343 ECKPESG----CGDDCINRMVFSECSPQLCPCGDKCENQKIQKHEWAPGLQKFMTEDKGW 1398

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+   + IK G F++EYVGEV+ ++  + R+   ++  +T+ Y   ++  +VID    G 
Sbjct: 1399 GVRTQQSIKSGVFILEYVGEVVSEREFKSRMA-TRYANDTHHYCLHLDGGLVIDGHRMGG 1457

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
              R++NHSC PN EMQKW + G  R+ +FA+RDIK GE LTYDY +   + S    C
Sbjct: 1458 DGRFVNHSCEPNCEMQKWSVHGLPRMALFASRDIKPGEELTYDYNFALFNPSEGQEC 1514


>gi|303319123|ref|XP_003069561.1| SET domain containing protein [Coccidioides posadasii C735 delta
           SOWgp]
 gi|240109247|gb|EER27416.1| SET domain containing protein [Coccidioides posadasii C735 delta
           SOWgp]
          Length = 1011

 Score =  126 bits (316), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 65/164 (39%), Positives = 94/164 (57%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG+SC N+ FQ R   K+ +++TEK G G+ AD D++  EF+
Sbjct: 170 DSDCINRATKMECFGDCGCGDSCQNQRFQRREYAKVSVIKTEKKGYGLRADCDLRPNEFI 229

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GEVI++     R+ +    G  +FY   +N+   +DAT KGN  R+ NHSC PN  
Sbjct: 230 FEYIGEVINEPQFRRRMIQYDEEGIKHFYFMSLNKGEFVDATKKGNLGRFCNHSCNPNCY 289

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA R IK GE L ++Y  + +  D    YC
Sbjct: 290 VDKWVVGEKLRMGIFAERYIKAGEELVFNYNVDRYGADPQPCYC 333


>gi|119182346|ref|XP_001242314.1| hypothetical protein CIMG_06210 [Coccidioides immitis RS]
          Length = 1003

 Score =  126 bits (316), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 65/164 (39%), Positives = 94/164 (57%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG+SC N+ FQ R   K+ +++TEK G G+ AD D++  EF+
Sbjct: 162 DSDCINRATKMECFGDCGCGDSCQNQRFQRREYAKVSVIKTEKKGYGLRADCDLRPNEFI 221

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GEVI++     R+ +    G  +FY   +N+   +DAT KGN  R+ NHSC PN  
Sbjct: 222 FEYIGEVINEPQFRRRMIQYDEEGIKHFYFMSLNKGEFVDATKKGNLGRFCNHSCNPNCY 281

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA R IK GE L ++Y  + +  D    YC
Sbjct: 282 VDKWVVGEKLRMGIFAERYIKAGEELVFNYNVDRYGADPQPCYC 325


>gi|390360513|ref|XP_785219.3| PREDICTED: histone-lysine N-methyltransferase NSD3-like
            [Strongylocentrotus purpuratus]
          Length = 1736

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 74/201 (36%), Positives = 109/201 (54%), Gaps = 7/201 (3%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFC-SCTASPGSSGVC--DRDCHCGMLLSSCSSG 96
            K  P+ FIK N  +   +      D   C +C   P     C  D DC   +LL  C   
Sbjct: 1156 KPAPFKFIKTNRPVGNVVMPAF--DITQCQACECRPDMENPCGPDSDCLNRILLIECHPQ 1213

Query: 97   -CKCGNS-CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCE 154
             C      C N+ FQ R       ++    G G+VA  DIK+G+FV EYVGE++D++ C 
Sbjct: 1214 ICPAKEEKCQNQRFQKRAYPDSCQMKVSHRGWGLVAMVDIKKGDFVNEYVGELVDEEECR 1273

Query: 155  ERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGI 214
             R+ +      T+FY   +++D +IDA  KGN SR++NHSC PN E QKW ++G+TR+G+
Sbjct: 1274 RRIKQAHEENITDFYFLTLDKDRIIDAGPKGNLSRFMNHSCQPNCETQKWTVNGDTRVGL 1333

Query: 215  FATRDIKKGENLTYDYQYEFL 235
            FA R+I  G  ++++Y  + L
Sbjct: 1334 FAIRNIAAGNEISFNYNLDCL 1354


>gi|320041043|gb|EFW22976.1| histone-lysine N-methyltransferase [Coccidioides posadasii str.
           Silveira]
          Length = 972

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 65/164 (39%), Positives = 94/164 (57%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG+SC N+ FQ R   K+ +++TEK G G+ AD D++  EF+
Sbjct: 131 DSDCINRATKMECFGDCGCGDSCQNQRFQRREYAKVSVIKTEKKGYGLRADCDLRPNEFI 190

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GEVI++     R+ +    G  +FY   +N+   +DAT KGN  R+ NHSC PN  
Sbjct: 191 FEYIGEVINEPQFRRRMIQYDEEGIKHFYFMSLNKGEFVDATKKGNLGRFCNHSCNPNCY 250

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA R IK GE L ++Y  + +  D    YC
Sbjct: 251 VDKWVVGEKLRMGIFAERYIKAGEELVFNYNVDRYGADPQPCYC 294


>gi|442570025|sp|Q1DU03.2|SET2_COCIM RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
           specific; AltName: Full=SET domain-containing protein 2
 gi|392865212|gb|EAS30985.2| histone-lysine N-methyltransferase, H3 lysine-36 specific
           [Coccidioides immitis RS]
          Length = 1011

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 65/164 (39%), Positives = 94/164 (57%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG+SC N+ FQ R   K+ +++TEK G G+ AD D++  EF+
Sbjct: 170 DSDCINRATKMECFGDCGCGDSCQNQRFQRREYAKVSVIKTEKKGYGLRADCDLRPNEFI 229

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GEVI++     R+ +    G  +FY   +N+   +DAT KGN  R+ NHSC PN  
Sbjct: 230 FEYIGEVINEPQFRRRMIQYDEEGIKHFYFMSLNKGEFVDATKKGNLGRFCNHSCNPNCY 289

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA R IK GE L ++Y  + +  D    YC
Sbjct: 290 VDKWVVGEKLRMGIFAERYIKAGEELVFNYNVDRYGADPQPCYC 333


>gi|226291164|gb|EEH46592.1| histone-lysine N-methyltransferase [Paracoccidioides brasiliensis
           Pb18]
          Length = 980

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 64/172 (37%), Positives = 96/172 (55%), Gaps = 3/172 (1%)

Query: 75  GSSGVC--DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADE 132
           G +G C  D DC        C   C CG+ C N+ FQ R    + +++TEK G G+ AD 
Sbjct: 163 GRNGACGEDSDCINRATKMECVGDCGCGDECQNQRFQRRQYANVTVIKTEKKGYGLRADS 222

Query: 133 DIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYIN 192
           D++  +F+ EY+GEVI++ +  +R+      G  +FY   + +   +DAT KGN  R+ N
Sbjct: 223 DLRPHQFIFEYIGEVINEASFRKRMIAYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCN 282

Query: 193 HSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           HSC PN  + KW++  + R+GIFA R IK GE L ++Y  + +  D    YC
Sbjct: 283 HSCNPNCYVDKWVVGEKLRMGIFAERHIKAGEELVFNYNVDRYGADPQPCYC 334


>gi|388858360|emb|CCF48089.1| related to SET2-Histone methyltransferase [Ustilago hordei]
          Length = 985

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 69/172 (40%), Positives = 96/172 (55%), Gaps = 4/172 (2%)

Query: 63  DDGIFCSCTASPGSSGVC---DRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKL 118
           DD + C C     S  +    +  C   M    CSS  C+ G  C N+ F  R    + +
Sbjct: 165 DDYMLCDCNPHSESEDLACTDESGCINRMTQIECSSSKCRWGKLCRNQRFHRRQYVNVDI 224

Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
           VQTEK G G+ A +DI +  FV EYVGEV++  T  ER+ + +  G  +FY   +  +  
Sbjct: 225 VQTEKKGFGLRAAQDIPKETFVYEYVGEVMNQPTFLERMQQYRQEGIRHFYFMMLQPNEY 284

Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
           +DAT KG K R+INHSC PN  + KW +    R+GIFA R+I+KGE LT++Y
Sbjct: 285 LDATKKGGKGRFINHSCNPNCSVSKWQVGKHLRMGIFAKRNIQKGEELTFNY 336


>gi|340959559|gb|EGS20740.1| hypothetical protein CTHT_0025760 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 939

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 67/186 (36%), Positives = 99/186 (53%), Gaps = 6/186 (3%)

Query: 64  DGIFCSCTAS--PGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKL 118
           D + C C      G +  C  D  C      +      C CG  C N+ FQ +   K+ +
Sbjct: 125 DALDCDCAEDWRDGKNHACGEDSDCINRATKIECVIGDCNCGEGCQNQRFQRKQYAKVSV 184

Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
           ++TEK G G+ AD D++  +F+ EYVGEVI++ T   R+ K    G  +FY   + ++  
Sbjct: 185 IKTEKKGYGLRADTDLQPNDFIYEYVGEVINEPTFRSRMLKYDKEGIKHFYFMSLTKNEF 244

Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHD 237
           +DAT KGN  R+ NHSC PN  + KW++  + R+GIFA R IK GE L ++Y  + +  D
Sbjct: 245 VDATKKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFAARYIKAGEELVFNYNVDRYGAD 304

Query: 238 SLIAYC 243
               YC
Sbjct: 305 PQPCYC 310


>gi|340724009|ref|XP_003400378.1| PREDICTED: hypothetical protein LOC100647786 [Bombus terrestris]
          Length = 2172

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 78/237 (32%), Positives = 117/237 (49%), Gaps = 19/237 (8%)

Query: 19   KLLKQIGNPVEFELPDWFIKPKA-------IP---YVFIKRNIYLTKRIKRRLEDDGIFC 68
            KLL+Q   P +     W++  ++       +P   Y  I+ N+Y    +K     +   C
Sbjct: 1285 KLLRQRKMPFQLPYDLWWLHTQSRLPGRESVPSWNYKKIRTNVYYD--VKPTTLYEAQAC 1342

Query: 69   SCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGA 126
             C    G    C  DC   M+ S CS   C CG  C N+  Q       ++   TE  G 
Sbjct: 1343 ECKPESG----CGDDCINRMVFSECSPQLCPCGEKCENQKIQKHEWAPGLQKFMTEDKGW 1398

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+   + IK G F++EYVGEV+ ++  + R+   ++  +T+ Y   ++  +VID    G 
Sbjct: 1399 GVRTQQSIKSGVFILEYVGEVVSEREFKSRMA-TRYANDTHHYCLHLDGGLVIDGHRMGG 1457

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
              R++NHSC PN EMQKW + G  R+ +FA+RDIK GE LTYDY +   + S    C
Sbjct: 1458 DGRFVNHSCEPNCEMQKWSVHGLPRMALFASRDIKPGEELTYDYNFALFNPSEGQEC 1514


>gi|30704948|gb|AAH52194.1| Ash1l protein, partial [Mus musculus]
          Length = 963

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28  VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
           ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 43  IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 100

Query: 78  GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
              C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 101 RKGCGDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 160

Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
           K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 161 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 219

Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 220 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 257


>gi|195108992|ref|XP_001999076.1| GI23270 [Drosophila mojavensis]
 gi|193915670|gb|EDW14537.1| GI23270 [Drosophila mojavensis]
          Length = 1433

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 72/196 (36%), Positives = 106/196 (54%), Gaps = 5/196 (2%)

Query: 43   PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG-CKC 99
            PYV IK N  +        E D   C C   PGS+  C  +  C   ML   C    C  
Sbjct: 1139 PYVKIKTNKPVPPVRFVYNEQDLNICDC--KPGSAHPCGPESGCLNRMLYHECHPDYCHA 1196

Query: 100  GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
            G  C N+ F+ R   ++++V   + G G+V  E IK G+F+IEYVGEVI+    + R+ +
Sbjct: 1197 GTRCENQLFELRKSPRLEVVYMNERGFGLVCREPIKEGDFIIEYVGEVINHAEFQRRMAQ 1256

Query: 160  MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
                 + NFY   + +D +IDA  KGN +R++NHSC PN   QKW ++   R+G+FA +D
Sbjct: 1257 KTKNRDENFYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCATQKWTVNCNNRVGLFAIKD 1316

Query: 220  IKKGENLTYDYQYEFL 235
            I +   LT++Y ++ L
Sbjct: 1317 IPENTELTFNYLWDDL 1332


>gi|13442965|gb|AAK26242.1|AF247132_1 putative chromatin remodeling factor [Mus musculus]
          Length = 2669

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 1749 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 1806

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 1807 RKGCGDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 1866

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 1867 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 1925

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 1926 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 1963


>gi|395845197|ref|XP_003795328.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Otolemur
            garnettii]
          Length = 2961

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 76/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      S
Sbjct: 2040 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDS 2097

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2098 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2157

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2158 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2216

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2217 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2254


>gi|350427843|ref|XP_003494899.1| PREDICTED: hypothetical protein LOC100746434 [Bombus impatiens]
          Length = 2217

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 78/237 (32%), Positives = 117/237 (49%), Gaps = 19/237 (8%)

Query: 19   KLLKQIGNPVEFELPDWFIKPKA-------IP---YVFIKRNIYLTKRIKRRLEDDGIFC 68
            KLL+Q   P +     W++  ++       +P   Y  I+ N+Y    +K     +   C
Sbjct: 1330 KLLRQRKMPFQLPYDLWWLHTQSRLPGRESVPSWNYKKIRTNVYYD--VKPTTLYEAQAC 1387

Query: 69   SCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGA 126
             C    G    C  DC   M+ S CS   C CG  C N+  Q       ++   TE  G 
Sbjct: 1388 ECKPESG----CGDDCINRMVFSECSPQLCPCGEKCENQKIQKHEWAPGLQKFMTEDKGW 1443

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+   + IK G F++EYVGEV+ ++  + R+   ++  +T+ Y   ++  +VID    G 
Sbjct: 1444 GVRTQQSIKSGVFILEYVGEVVSEREFKSRMA-TRYANDTHHYCLHLDGGLVIDGHRMGG 1502

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
              R++NHSC PN EMQKW + G  R+ +FA+RDIK GE LTYDY +   + S    C
Sbjct: 1503 DGRFVNHSCEPNCEMQKWSVHGLPRMALFASRDIKPGEELTYDYNFALFNPSEGQEC 1559


>gi|328716964|ref|XP_001950932.2| PREDICTED: hypothetical protein LOC100161826, partial [Acyrthosiphon
            pisum]
          Length = 1530

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 68/161 (42%), Positives = 91/161 (56%), Gaps = 7/161 (4%)

Query: 74   PGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADE 132
            PGS      DC   ML+  C    C  G+ C N+ F+      M    T+  G G+   E
Sbjct: 1113 PGS------DCINRMLMFECEPKVCPAGDKCNNQRFEKTLYPAMVPFLTKGRGWGLKTLE 1166

Query: 133  DIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYIN 192
            DIK G FVIEYVG+VID++  + R  +M    E N+Y   I+    IDA  KGN SR++N
Sbjct: 1167 DIKEGSFVIEYVGDVIDEEEFQRRCLEMHQRNEQNYYFLTIDNSRTIDAGPKGNLSRFMN 1226

Query: 193  HSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
            HSC PN   QKW ++G+TRIG+FA +DI  G  L +DY+ +
Sbjct: 1227 HSCEPNCVTQKWTVNGDTRIGLFALQDIPTGTELVFDYRLQ 1267


>gi|225679434|gb|EEH17718.1| histone-lysine N-methyltransferase [Paracoccidioides brasiliensis
           Pb03]
          Length = 976

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 64/172 (37%), Positives = 96/172 (55%), Gaps = 3/172 (1%)

Query: 75  GSSGVC--DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADE 132
           G +G C  D DC        C   C CG+ C N+ FQ R    + +++TEK G G+ AD 
Sbjct: 163 GKNGACGEDSDCINRATKMECVGDCGCGDECQNQRFQRRQYANVTVIKTEKKGYGLRADS 222

Query: 133 DIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYIN 192
           D++  +F+ EY+GEVI++ +  +R+      G  +FY   + +   +DAT KGN  R+ N
Sbjct: 223 DLRPHQFIFEYIGEVINEASFRKRMIAYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCN 282

Query: 193 HSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           HSC PN  + KW++  + R+GIFA R IK GE L ++Y  + +  D    YC
Sbjct: 283 HSCNPNCYVDKWVVGEKLRMGIFAERHIKAGEELVFNYNVDRYGADPQPCYC 334


>gi|295665754|ref|XP_002793428.1| histone-lysine N-methyltransferase [Paracoccidioides sp. 'lutzii'
           Pb01]
 gi|226278342|gb|EEH33908.1| histone-lysine N-methyltransferase [Paracoccidioides sp. 'lutzii'
           Pb01]
          Length = 1001

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 64/172 (37%), Positives = 96/172 (55%), Gaps = 3/172 (1%)

Query: 75  GSSGVC--DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADE 132
           G +G C  D DC        C   C CG+ C N+ FQ R    + +++TEK G G+ AD 
Sbjct: 164 GKNGACGEDSDCINRATKMECVGDCGCGDECQNQRFQRRQYANVTVIKTEKKGYGLRADS 223

Query: 133 DIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYIN 192
           D++  +F+ EY+GEVI++ +  +R+      G  +FY   + +   +DAT KGN  R+ N
Sbjct: 224 DLRPHQFIFEYIGEVINEASFRKRMIAYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCN 283

Query: 193 HSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           HSC PN  + KW++  + R+GIFA R IK GE L ++Y  + +  D    YC
Sbjct: 284 HSCNPNCYVDKWVVGEKLRMGIFAERHIKAGEELVFNYNVDRYGADPQPCYC 335


>gi|73622271|ref|NP_619620.3| histone-lysine N-methyltransferase ASH1L [Mus musculus]
 gi|341940590|sp|Q99MY8.3|ASH1L_MOUSE RecName: Full=Histone-lysine N-methyltransferase ASH1L; AltName:
            Full=ASH1-like protein; AltName: Full=Absent small and
            homeotic disks protein 1 homolog
          Length = 2958

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2038 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2095

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2096 RKGCGDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2155

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2156 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2214

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2215 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2252


>gi|291397821|ref|XP_002715465.1| PREDICTED: absent, small, or homeotic 1-like [Oryctolagus cuniculus]
          Length = 2961

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 118/218 (54%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCT-ASPGS 76
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2040 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2097

Query: 77   SGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG+ C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2098 RKGCVDDCLNRMIFAECSPNTCPCGDQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2157

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2158 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2216

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2217 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2254


>gi|417407091|gb|JAA50172.1| Putative histone-lysine n-methyltransferase ash1l isoform 1 [Desmodus
            rotundus]
          Length = 2962

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 118/218 (54%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2041 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2098

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG+ C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2099 RKGCVDDCLNRMIFAECSPNTCPCGDQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2158

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2159 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2217

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2218 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2255


>gi|402856517|ref|XP_003892835.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Papio anubis]
          Length = 1277

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28  VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
           ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 356 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 413

Query: 78  GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
              C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 414 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 473

Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
           K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 474 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 532

Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 533 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 570


>gi|354478852|ref|XP_003501628.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
            N-methyltransferase ASH1L-like [Cricetulus griseus]
          Length = 2962

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2039 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2096

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2097 RKGCGDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2156

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2157 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2215

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2216 CDPNCEMQKWSVNGVYRIGLYALKDVLAGTELTYDYNF 2253


>gi|417407083|gb|JAA50168.1| Putative histone-lysine n-methyltransferase ash1l isoform 1 [Desmodus
            rotundus]
          Length = 2832

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 118/218 (54%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2041 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2098

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG+ C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2099 RKGCVDDCLNRMIFAECSPNTCPCGDQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2158

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2159 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2217

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2218 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2255


>gi|320583119|gb|EFW97335.1| Histone methyltransferase [Ogataea parapolymorpha DL-1]
          Length = 942

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 66/175 (37%), Positives = 99/175 (56%), Gaps = 7/175 (4%)

Query: 63  DDGIFCSCTASPGSSGV-----CDRDCHCGMLLSSCSSG--CKCGNSCLNKPFQNRPVKK 115
           D+ + C C    G+ GV      D DC   +    C  G    CG+ C N+ FQN  +  
Sbjct: 57  DEFMACDCMEDVGADGVNHACGPDSDCINRLTNIECVDGQCSGCGDKCQNQRFQNNEIAP 116

Query: 116 MKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINR 175
           + + +TE  G G+ ADEDI    F++EY GE++D++  +ER  +  + G  +FY   I  
Sbjct: 117 VSIFKTEHKGYGMRADEDIYPHTFIVEYKGEIVDNELYKERKEQYSNEGIKHFYFMMIQD 176

Query: 176 DMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
           + +IDAT K +  R+ NHSC PN  ++KW+++   R+GIFA R I KGE +T+DY
Sbjct: 177 NEIIDATKKASIGRFCNHSCDPNAYVEKWVVNKRFRMGIFAKRKIIKGEEITFDY 231


>gi|390596531|gb|EIN05933.1| hypothetical protein PUNSTDRAFT_54761 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 813

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 81/250 (32%), Positives = 128/250 (51%), Gaps = 11/250 (4%)

Query: 5   KKNSDNSRIGHAFNKLLKQI-GNPVEFELPDWFIKPKAIPYVF--IKRNIYLTKRIKRRL 61
           K N D+S  G + ++  +Q    P    + D  +  +A    F  +K N Y    + R  
Sbjct: 45  KTNRDSSTPGASSSRRQQQPQARPGPQLIGDLPVATEAAMATFEELKANHYQYGTLGRSR 104

Query: 62  ED-DGIFCSCT-----ASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVK 114
           E  +G+ C C        P  +     DC   +    C  G C+CG  C N+ F  R   
Sbjct: 105 EALEGMTCDCQYDHAEDEPWMACGEGSDCINRLTQVECLPGECRCGAHCRNQRFNRREYA 164

Query: 115 KMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEIN 174
            +++VQTEK G G+ A EDI++ +F+ EYVG+V+   + ++R+ +    G  +FY   + 
Sbjct: 165 PIEIVQTEKKGFGLRAREDIRKDQFIYEYVGDVVSHPSFKKRMREYAQEGIRHFYFMMLQ 224

Query: 175 RDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE- 233
           +D  IDAT +G   R+ NHSC PN  + KW +    R+GIFA+R I++ E LT++Y  + 
Sbjct: 225 KDEYIDATKRGGIGRFANHSCSPNCYVAKWTVGTHVRMGIFASRHIRQHEELTFNYNVDR 284

Query: 234 FLHDSLIAYC 243
           + HD+   YC
Sbjct: 285 YGHDAQPCYC 294


>gi|403293713|ref|XP_003937857.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Saimiri
            boliviensis boliviensis]
          Length = 2970

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 118/218 (54%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C     ++
Sbjct: 2049 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDNT 2106

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2107 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2166

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2167 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2225

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2226 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2263


>gi|47216786|emb|CAG03790.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1443

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 79/236 (33%), Positives = 112/236 (47%), Gaps = 33/236 (13%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
            K  PY FIK N  + K ++    D      C   P     C  + +C   ML   C    
Sbjct: 990  KPPPYKFIKVNKPVGK-VQVYAADISEIPKCNCKPSDERPCGFESECLNRMLQYECHPQV 1048

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKR-------------------- 136
            C  G  C N+ F  R     K+++T   G G++   DIK+                    
Sbjct: 1049 CPSGERCCNQDFTQRLYPDTKIIKTPGKGWGLITLRDIKKVSARRPGSPVPVFLPVGRRV 1108

Query: 137  ---------GEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNK 187
                     GEFV EY+GE+ID++ C  R+   +    TNFY+  I++D +IDA  KGN 
Sbjct: 1109 GTSWSDVTQGEFVNEYIGELIDEEECRARIKYAQENNITNFYMLTIDKDRIIDAGPKGNY 1168

Query: 188  SRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
            SR++NHSC PN E QKW ++G+TR+G+FA  DI  G  LT++Y  + L +     C
Sbjct: 1169 SRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEKTVCC 1224


>gi|401888851|gb|EJT52799.1| histone-lysine N-methyltransferase [Trichosporon asahii var. asahii
           CBS 2479]
          Length = 905

 Score =  125 bits (313), Expect = 2e-26,   Method: Composition-based stats.
 Identities = 63/164 (38%), Positives = 92/164 (56%), Gaps = 9/164 (5%)

Query: 81  DRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
           + DC   ++   C +G CK G  C N+    +    + +VQTEK G G+ A EDI  G  
Sbjct: 206 ESDCINRIMFIECLAGECKAGKRCRNQRLARKQYAPVDVVQTEKKGFGLRAREDIPAGAL 265

Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
           V EY+GEV+ ++T  +R+ +    G  +FY   + ++  IDAT KG   R+ NHSC PN 
Sbjct: 266 VYEYIGEVVVEKTFRKRMAEYADEGIKHFYFMMLQKEEYIDATKKGGIGRFANHSCNPNC 325

Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
           E+QKW++    R+GIF  RD+KK E +T++Y         I YC
Sbjct: 326 EVQKWVVGRRMRMGIFTKRDVKKDEEITFNYN--------ICYC 361


>gi|298706866|emb|CBJ25830.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 810

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 67/168 (39%), Positives = 94/168 (55%), Gaps = 11/168 (6%)

Query: 76  SSGVCDRDCHCGMLLSSCSSG---------CKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
           S GVC   C   ML + C  G         CK G  C N+  QNR   K+   +    G 
Sbjct: 491 SEGVCGDSCLNAMLRTECVCGNGIRAKYQNCKLGAGCGNRRLQNRVNAKVAPFREAGMGW 550

Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHL--GETNFYLCEINRDMVIDATYK 184
           G+    D+ +G  + EYVGEVID+   E R+ + + L   +  FY+ E+ + + IDA  K
Sbjct: 551 GLKVAVDVPKGSLIGEYVGEVIDEAMVEHRMAEQRRLRPNDGEFYIMELGQSLFIDAKEK 610

Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           GN  R INHSC PN ++Q W I G TR+GI+A +D+ KGE+L+YDY++
Sbjct: 611 GNLMRLINHSCNPNCDVQAWNIAGYTRLGIYAKKDLAKGESLSYDYKF 658


>gi|390476801|ref|XP_002760038.2| PREDICTED: histone-lysine N-methyltransferase ASH1L [Callithrix
            jacchus]
          Length = 2970

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/217 (34%), Positives = 114/217 (52%), Gaps = 14/217 (6%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+  +     E     C         
Sbjct: 2049 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVDVKPLSGYEATTCNCKKPDDDARK 2108

Query: 78   GVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIK 135
            G  D DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +K
Sbjct: 2109 GCVD-DCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLK 2167

Query: 136  RGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSC 195
             G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHSC
Sbjct: 2168 AGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHSC 2226

Query: 196  CPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
             PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2227 DPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2263


>gi|196001997|ref|XP_002110866.1| hypothetical protein TRIADDRAFT_54228 [Trichoplax adhaerens]
 gi|190586817|gb|EDV26870.1| hypothetical protein TRIADDRAFT_54228 [Trichoplax adhaerens]
          Length = 1004

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 70/186 (37%), Positives = 97/186 (52%), Gaps = 2/186 (1%)

Query: 50  NIYLTKRIKRRLEDDGIFCSCTASPGSSGV--CDRDCHCGMLLSSCSSGCKCGNSCLNKP 107
           N YL+ R  +      + C CT          C  +C   ML   C S C CG  C N+ 
Sbjct: 57  NTYLSGRKSKDKRIKNMTCECTYDEDDPNFIGCGENCLNRMLFIECGSKCSCGKFCTNRR 116

Query: 108 FQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN 167
           FQ     K+++ +TEK G G+   ED++  +FV+EY GEVID +  E R          +
Sbjct: 117 FQMAEYPKVEVFKTEKKGFGLRTLEDLEDNQFVLEYCGEVIDLREFERRKRDYAKKKIKH 176

Query: 168 FYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLT 227
           +Y   ++ + +IDA+ KG  SR+INHSC PN   QKW ++G  RIG F  R I     LT
Sbjct: 177 YYFMTLSPNEIIDASRKGTFSRFINHSCDPNCVTQKWTVNGMLRIGFFTLRKIPANTELT 236

Query: 228 YDYQYE 233
           +DYQ+E
Sbjct: 237 FDYQFE 242


>gi|426216789|ref|XP_004002640.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Ovis aries]
          Length = 2965

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2044 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2101

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2102 KKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2161

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2162 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2220

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2221 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2258


>gi|300795068|ref|NP_001179672.1| probable histone-lysine N-methyltransferase ASH1L [Bos taurus]
 gi|296489728|tpg|DAA31841.1| TPA: ash1 (absent, small, or homeotic)-like [Bos taurus]
          Length = 2965

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2044 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2101

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2102 KKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2161

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2162 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2220

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2221 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2258


>gi|440903623|gb|ELR54260.1| Putative histone-lysine N-methyltransferase ASH1L [Bos grunniens
            mutus]
          Length = 2965

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2044 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2101

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2102 KKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2161

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2162 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2220

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2221 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2258


>gi|116207846|ref|XP_001229732.1| hypothetical protein CHGG_03216 [Chaetomium globosum CBS 148.51]
 gi|121787978|sp|Q2H988.1|SET2_CHAGB RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
           specific; AltName: Full=SET domain-containing protein 2
 gi|88183813|gb|EAQ91281.1| hypothetical protein CHGG_03216 [Chaetomium globosum CBS 148.51]
          Length = 894

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 67/186 (36%), Positives = 97/186 (52%), Gaps = 6/186 (3%)

Query: 64  DGIFCSCTAS--PGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKL 118
           D + C C      G +  C  D  C      +   S  C CG  C N+ FQ +    + +
Sbjct: 73  DALDCDCAEEWHDGQNHACGEDSDCINRATKIECVSGDCNCGEGCENQRFQRKQYANVSV 132

Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
           ++TEK G G+  D D++  +FV EYVGEVI++ T   R  K    G  +FY   + +   
Sbjct: 133 IKTEKKGFGLRTDADLQANDFVFEYVGEVINEPTFRNRTVKYDKEGIKHFYFMSLTKSEF 192

Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHD 237
           +DAT KGN  R+ NHSC PN  + KW++  + R+GIFATR I+ GE L ++Y  + +  D
Sbjct: 193 VDATKKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFATRAIRAGEELVFNYNVDRYGAD 252

Query: 238 SLIAYC 243
               YC
Sbjct: 253 PQPCYC 258


>gi|417515828|gb|JAA53722.1| histone-lysine N-methyltransferase ASH1L [Sus scrofa]
          Length = 2951

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2030 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2087

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2088 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2147

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2148 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2206

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2207 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2244


>gi|156051364|ref|XP_001591643.1| hypothetical protein SS1G_07089 [Sclerotinia sclerotiorum 1980]
 gi|154704867|gb|EDO04606.1| hypothetical protein SS1G_07089 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 971

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 77/225 (34%), Positives = 119/225 (52%), Gaps = 10/225 (4%)

Query: 27  PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDC 84
           P+  +LPD   +  ++ +  IK  IY  K +    + D + C C    S G++  C  D 
Sbjct: 114 PLFNDLPDSTEEAASV-FQVIKDCIYGAKHMGAS-DHDALDCDCPEEYSDGNNHACGEDS 171

Query: 85  HCGMLLS--SCSSG---CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
            C   L+   C  G   C CG +C N+ FQ R   K+ +++T+K G G+ A+ D+   +F
Sbjct: 172 DCINRLTKMECGGGHKDCNCGVACQNQRFQRRQYAKVSVIKTDKKGYGLRANTDLHPDDF 231

Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
           + EY+GEVI++ T   R  +    G  +FY   + +   +DAT KGN  R+ NHSC PN 
Sbjct: 232 IFEYIGEVINEPTFRRRTIQYDQEGIKHFYFMSLTKHEFVDATKKGNLGRFCNHSCNPNC 291

Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
            + KW++  + R+GIFA R IK GE L ++Y  + +  D    YC
Sbjct: 292 YVDKWVVGEKLRMGIFAERAIKAGEELVFNYNVDRYGADPQPCYC 336


>gi|410226116|gb|JAA10277.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
 gi|410264036|gb|JAA19984.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
 gi|410264040|gb|JAA19986.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
 gi|410306368|gb|JAA31784.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
 gi|410355463|gb|JAA44335.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
          Length = 2964

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2043 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2100

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2101 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2160

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2161 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2219

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2220 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2257


>gi|410033849|ref|XP_003949641.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Pan troglodytes]
          Length = 2964

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2043 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2100

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2101 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2160

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2161 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2219

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2220 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2257


>gi|354548180|emb|CCE44916.1| hypothetical protein CPAR2_407180 [Candida parapsilosis]
          Length = 810

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 61/155 (39%), Positives = 92/155 (59%), Gaps = 3/155 (1%)

Query: 79  VCDRDCHCGMLLSS---CSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIK 135
            C  D +C   L+S    +  C CG+ C N+ FQ R    + + QTE  G G+ A++ I 
Sbjct: 71  ACGEDSNCINRLTSVECTNRHCLCGDDCQNQRFQKRQYADVSVFQTELKGYGLKANKPIS 130

Query: 136 RGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSC 195
            G+F+ EY+GEVID+    +R+ +       +FY   +  D  IDAT KG+ +R++NHSC
Sbjct: 131 EGQFIYEYIGEVIDEGAFRQRMIEYDMKNYKHFYFMMLKPDAFIDATEKGSLARFVNHSC 190

Query: 196 CPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
            PN  + KW++  + R+GIFA R+I KGE +T+DY
Sbjct: 191 NPNAFVDKWVVGDKLRMGIFAKRNIAKGEEITFDY 225


>gi|367049654|ref|XP_003655206.1| hypothetical protein THITE_2118632 [Thielavia terrestris NRRL 8126]
 gi|347002470|gb|AEO68870.1| hypothetical protein THITE_2118632 [Thielavia terrestris NRRL 8126]
          Length = 931

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 65/165 (39%), Positives = 95/165 (57%), Gaps = 2/165 (1%)

Query: 81  DRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
           D DC        C  G C CG+ C N+ FQ +   K+ +++TEK G G+ AD D++  +F
Sbjct: 138 DSDCINRATKIECVDGECNCGDGCQNQRFQRKQYAKVSVIKTEKKGFGLRADTDLQPNDF 197

Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
           V EYVGEVI++ T   R+ K    G  +FY   + ++  +DAT KGN  R+ NHSC PN 
Sbjct: 198 VYEYVGEVINEPTFRSRMLKYDKEGIKHFYFMSLTKNEFVDATKKGNLGRFCNHSCNPNC 257

Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
            + KW++  + R+GIFA R I+ GE L ++Y  + +  D    YC
Sbjct: 258 YVDKWVVGEKLRMGIFAGRRIRAGEELVFNYNVDRYGADPQPCYC 302


>gi|7739725|gb|AAF68983.1|AF257305_1 ASH1 [Homo sapiens]
          Length = 2969

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2048 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2105

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2106 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2165

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2166 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2224

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2225 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2262


>gi|397492363|ref|XP_003817092.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            ASH1L [Pan paniscus]
          Length = 2964

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2043 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2100

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2101 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2160

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2161 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2219

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2220 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2257


>gi|351696657|gb|EHA99575.1| Putative histone-lysine N-methyltransferase ASH1L [Heterocephalus
            glaber]
          Length = 2930

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2009 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2066

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2067 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2126

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2127 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2185

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2186 CDPNCEMQKWSVNGVYRIGLYALKDMTAGTELTYDYNF 2223


>gi|357443579|ref|XP_003592067.1| hypothetical protein MTR_1g098370 [Medicago truncatula]
 gi|355481115|gb|AES62318.1| hypothetical protein MTR_1g098370 [Medicago truncatula]
          Length = 491

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 73/182 (40%), Positives = 102/182 (56%), Gaps = 6/182 (3%)

Query: 58  KRRLEDDGIFCSC---TASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPV 113
           K++ E+D   C C      P S+  C   C   +  + C+ G C C   C N+ FQ    
Sbjct: 38  KKQKEEDIAICECRYDEDDPDSA--CGDGCLNVLTSTECTPGFCPCDIHCKNQKFQKCEY 95

Query: 114 KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
            K KL +TE  G G++ADE+IK G+FVIEY GEVI  +  + R    +  G  + ++  +
Sbjct: 96  AKTKLFKTEGRGWGLLADEEIKAGQFVIEYCGEVISCKEAKRRSHTYEIQGLKDAFIISL 155

Query: 174 NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           N    IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA  +I  G  L YDY +E
Sbjct: 156 NASESIDATRKGSLARFINHSCQPNCETRKWNVMGEIRVGIFALENIPIGTELAYDYNFE 215

Query: 234 FL 235
           + 
Sbjct: 216 WF 217


>gi|350583322|ref|XP_003125756.3| PREDICTED: probable histone-lysine N-methyltransferase ASH1L-like,
            partial [Sus scrofa]
          Length = 2824

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 1897 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 1954

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 1955 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2014

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2015 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2073

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2074 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2111


>gi|338724967|ref|XP_001499134.2| PREDICTED: probable histone-lysine N-methyltransferase ASH1L isoform
            1 [Equus caballus]
          Length = 2963

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2042 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2099

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2100 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2159

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2160 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2218

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2219 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2256


>gi|110349788|ref|NP_060959.2| histone-lysine N-methyltransferase ASH1L [Homo sapiens]
 gi|225000936|gb|AAI72595.1| Ash1 (absent, small, or homeotic)-like (Drosophila) [synthetic
            construct]
          Length = 2964

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2043 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2100

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2101 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2160

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2161 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2219

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2220 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2257


>gi|117949323|sp|Q9NR48.2|ASH1L_HUMAN RecName: Full=Histone-lysine N-methyltransferase ASH1L; AltName:
            Full=ASH1-like protein; Short=huASH1; AltName:
            Full=Absent small and homeotic disks protein 1 homolog;
            AltName: Full=Lysine N-methyltransferase 2H
          Length = 2969

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2048 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2105

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2106 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2165

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2166 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2224

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2225 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2262


>gi|392587279|gb|EIW76613.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
          Length = 793

 Score =  124 bits (312), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 68/187 (36%), Positives = 100/187 (53%), Gaps = 7/187 (3%)

Query: 64  DGIFCSCTASPGSSGVCD-----RDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMK 117
           +G+ C C   PG S   D      DC   +    C  G C+C N C N+ FQ +    ++
Sbjct: 66  EGMACDCQFEPGVSDPSDACGPHSDCINRLTEVECIPGECRCRNYCQNQRFQRKQYANIE 125

Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
           +VQTEK G G+ A  DI +  F+ EYVG+V+   +  +R+ +    G  +FY   +  + 
Sbjct: 126 VVQTEKKGFGLRAGSDIPKDAFIYEYVGDVVSHPSFLKRMREYAAEGIRHFYFMMLQSNE 185

Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLH 236
            IDAT +G K R+ NHSC PN  + KW +    R+GIFA R +K+ E LT++Y  + + H
Sbjct: 186 YIDATKRGGKGRFANHSCNPNCYVAKWTVGPHVRMGIFAKRAVKRNEELTFNYNVDRYGH 245

Query: 237 DSLIAYC 243
           D    YC
Sbjct: 246 DPQPCYC 252


>gi|119573453|gb|EAW53068.1| ash1 (absent, small, or homeotic)-like (Drosophila) [Homo sapiens]
          Length = 2969

 Score =  124 bits (312), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2048 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2105

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2106 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2165

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2166 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2224

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2225 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2262


>gi|320586545|gb|EFW99215.1| set and ww domain containing protein [Grosmannia clavigera kw1407]
          Length = 1051

 Score =  124 bits (312), Expect = 2e-26,   Method: Composition-based stats.
 Identities = 64/177 (36%), Positives = 94/177 (53%), Gaps = 5/177 (2%)

Query: 62  EDDGIFCSCTAS--PGSSGVC--DRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKM 116
           E D   C C A    G +  C  D DC        C  G C CG +C N+ FQ +    +
Sbjct: 119 EHDAFECDCAAEWHDGQNLACGEDSDCINRATKMECVRGDCNCGETCQNQRFQRKQYAAV 178

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            +++T K G G+ AD  ++  +FV EY+GEVI++     R+ +    G  +FY   + + 
Sbjct: 179 SVIKTAKKGFGLRADVALRANDFVYEYIGEVINEPAFRRRMLQYDEEGIRHFYFMSLTKH 238

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
             +DAT KGN  R+ NHSC PN  + KW++  + R+GIFA+R I  GE L +DY  +
Sbjct: 239 EFVDATRKGNLGRFCNHSCRPNCYVDKWVVGEKLRMGIFASRAIAAGEELVFDYNVD 295


>gi|332219957|ref|XP_003259124.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            ASH1L [Nomascus leucogenys]
          Length = 2892

 Score =  124 bits (312), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2043 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2100

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2101 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2160

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2161 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2219

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2220 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2257


>gi|426331996|ref|XP_004026979.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Gorilla gorilla
            gorilla]
          Length = 2776

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2043 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2100

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2101 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2160

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2161 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2219

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2220 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2257


>gi|357611223|gb|EHJ67375.1| hypothetical protein KGM_13830 [Danaus plexippus]
          Length = 1798

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 72/215 (33%), Positives = 114/215 (53%), Gaps = 17/215 (7%)

Query: 32   LPDWFIKPKAIPYVFIKRNIYLTKRIKRRLED-DGIFCSCTASPGSSGVCDRDCHCGMLL 90
            +P W        Y  I+ N+Y    +K   E+ + + C+C      S  C+ DC   ++ 
Sbjct: 886  VPSWN-------YKKIRTNVYYD--VKPSAEECESVACNC----APSSACNEDCINRLVY 932

Query: 91   SSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVI 148
            S CS   C CG+ C N+  Q    V  ++   TE  G G+   + I+ G+F++EYVGEV+
Sbjct: 933  SECSPQLCPCGDKCKNQRIQRHEWVPGLEKFMTENKGWGVRTKQMIRSGDFILEYVGEVV 992

Query: 149  DDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDG 208
             D+  +ER+   ++  +T+ Y   ++  +VID    G   R++NHSC PN EMQKW  +G
Sbjct: 993  SDKEFKERMA-TRYARDTHHYCLHLDGGLVIDGHRVGGDGRFVNHSCRPNCEMQKWTANG 1051

Query: 209  ETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
              R+ +FA RDI+  E LTYDY +   + ++   C
Sbjct: 1052 TFRMALFALRDIEPDEELTYDYNFSLFNPAVGQPC 1086


>gi|410986772|ref|XP_003999683.1| PREDICTED: histone-lysine N-methyltransferase ASH1L isoform 1 [Felis
            catus]
          Length = 2965

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCT-ASPGS 76
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2044 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2101

Query: 77   SGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2102 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2161

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2162 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2220

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2221 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2258


>gi|380814664|gb|AFE79206.1| putative histone-lysine N-methyltransferase ASH1L [Macaca mulatta]
 gi|383419979|gb|AFH33203.1| putative histone-lysine N-methyltransferase ASH1L [Macaca mulatta]
          Length = 2963

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2042 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2099

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2100 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2159

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2160 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2218

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2219 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2256


>gi|348579791|ref|XP_003475662.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
            N-methyltransferase ASH1L-like [Cavia porcellus]
          Length = 2964

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2043 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKRPDDET 2100

Query: 78   GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2101 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2160

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2161 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2219

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2220 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2257


>gi|410986774|ref|XP_003999684.1| PREDICTED: histone-lysine N-methyltransferase ASH1L isoform 2 [Felis
            catus]
          Length = 2974

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)

Query: 28   VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCT-ASPGS 76
            ++F+LP      W     + KP    Y  I+ N+Y+   +K     +   C+C      +
Sbjct: 2053 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2110

Query: 77   SGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +
Sbjct: 2111 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2170

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHS
Sbjct: 2171 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2229

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2230 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2267


>gi|258571704|ref|XP_002544655.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237904925|gb|EEP79326.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 1013

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 62/164 (37%), Positives = 91/164 (55%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG+SC N+ FQ R    + +++TEK G G+  D D++  +F+
Sbjct: 171 DSDCINRATKMECVGDCGCGDSCQNQRFQQREYANVSVIKTEKKGYGLRTDSDLRPNQFI 230

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GEVI++     R+      G  +FY   +N+   +DAT KGN  R+ NHSC PN  
Sbjct: 231 FEYIGEVINEPQFRRRMINYDEEGIKHFYFMSLNKGEFVDATKKGNLGRFCNHSCNPNCY 290

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA R IK GE L ++Y  + +  D    YC
Sbjct: 291 VDKWVVGEKLRMGIFAERHIKAGEELVFNYNVDRYGADPQPCYC 334


>gi|345486764|ref|XP_001606931.2| PREDICTED: histone-lysine N-methyltransferase ash1 [Nasonia
           vitripennis]
          Length = 1690

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 77/237 (32%), Positives = 115/237 (48%), Gaps = 19/237 (8%)

Query: 19  KLLKQIGNPVEFELPDWFIKPKA-------IP---YVFIKRNIYLTKRIKRRLEDDGIFC 68
           K L+Q   P +     W++   +       +P   Y  I+ N+Y    +K  +  +   C
Sbjct: 763 KFLRQREIPFQLPYDLWWLHTHSRLPGRDLVPSWNYRKIRSNVYYD--VKPTMHYEAQAC 820

Query: 69  SCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGA 126
            C    G    C  DC   M+ S CS   C CG  C N+  Q       ++   TE  G 
Sbjct: 821 ECKPDAG----CGDDCINRMVFSECSPQLCPCGERCKNQKIQKHDWAPGLQRFMTESKGW 876

Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
           G+   E I+ GEF++EYVGEV+ ++  + R+   ++  +T+ Y   ++  +VID    G 
Sbjct: 877 GVRTHEPIRTGEFILEYVGEVVSEREFKTRMA-TRYANDTHHYCLHLDGGLVIDGHRMGG 935

Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
             R++NHSC PN EMQKW + G  R+ +FA RDI  GE LTYDY +   + S    C
Sbjct: 936 DGRFVNHSCEPNCEMQKWSVHGLPRMALFALRDITAGEELTYDYNFALFNPSEGQEC 992


>gi|74140676|dbj|BAC28183.2| unnamed protein product [Mus musculus]
          Length = 418

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 71/200 (35%), Positives = 111/200 (55%), Gaps = 6/200 (3%)

Query: 36  FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCT-ASPGSSGVCDRDCHCGMLLSSCS 94
           + KP    Y  I+ N+Y+   +K     +   C+C      +   C  DC   M+ + CS
Sbjct: 9   YKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDTRKGCGDDCLNRMIFAECS 66

Query: 95  -SGCKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
            + C CG  C N+  Q    V+ ++  + E+ G GI   E +K G+F+IEY+GEV+ +Q 
Sbjct: 67  PNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEVVSEQE 126

Query: 153 CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
              R+ +  H   ++ Y   ++  MVID+   GN++R+INHSC PN EMQKW ++G  RI
Sbjct: 127 FRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRI 185

Query: 213 GIFATRDIKKGENLTYDYQY 232
           G++A +D+  G  LTYDY +
Sbjct: 186 GLYALKDMPAGTELTYDYNF 205


>gi|47219458|emb|CAG10822.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 2598

 Score =  124 bits (311), Expect = 4e-26,   Method: Composition-based stats.
 Identities = 70/200 (35%), Positives = 110/200 (55%), Gaps = 6/200 (3%)

Query: 36   FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSC-TASPGSSGVCDRDCHCGMLLSSCS 94
            + +P    Y  I+ N+Y+   +K     +   C+C T    +   C  DC   M  + CS
Sbjct: 1666 YKRPDVPLYKKIRSNVYVD--VKPLSGYETTTCNCRTPDDQTEKSCLDDCLNRMSFAECS 1723

Query: 95   -SGCKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
             S C C + C N+  Q    V+ ++  +TE  G GI   + ++ G+F+IEY+GEV+ +Q 
Sbjct: 1724 PSTCPCADQCDNQRIQRHEWVQCLERFRTEGKGWGIRTKQPLRAGQFIIEYLGEVVSEQE 1783

Query: 153  CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
               R+ + ++   +  Y   ++  MVID+   GN++R+INHSC PN EMQKW ++G  RI
Sbjct: 1784 FRSRMME-QYFSHSGNYCLNLDSGMVIDSYRMGNEARFINHSCEPNCEMQKWSVNGVYRI 1842

Query: 213  GIFATRDIKKGENLTYDYQY 232
            G+FA  +I  G  LTYDY +
Sbjct: 1843 GLFALGEIPSGTELTYDYNF 1862


>gi|327354316|gb|EGE83173.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ATCC
           18188]
          Length = 1159

 Score =  124 bits (311), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 66/183 (36%), Positives = 98/183 (53%), Gaps = 7/183 (3%)

Query: 68  CSCTAS----PGSSGVC--DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQT 121
           C CT         +G C  D DC        C   C CG+ C N+ FQ R    + +++T
Sbjct: 155 CDCTEEWDPVAAKNGACGEDSDCINRATKMECVGDCGCGDECENQRFQRRQYANVTVIKT 214

Query: 122 EKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDA 181
           EK G G+ AD D++  +F+ EY+GEVI++    +R+      G  +FY   +++   +DA
Sbjct: 215 EKKGYGLRADSDLRPNQFIFEYIGEVINEGNFRKRMRSYDEEGIKHFYFMSLSKGEFVDA 274

Query: 182 TYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLI 240
           T KGN  R+ NHSC PN  + KW++  + R+GIFA R IK GE L ++Y  + +  D   
Sbjct: 275 TKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFAERHIKAGEELVFNYNVDRYGADPQP 334

Query: 241 AYC 243
            YC
Sbjct: 335 CYC 337


>gi|444323962|ref|XP_004182621.1| hypothetical protein TBLA_0J01040 [Tetrapisispora blattae CBS 6284]
 gi|387515669|emb|CCH63102.1| hypothetical protein TBLA_0J01040 [Tetrapisispora blattae CBS 6284]
          Length = 745

 Score =  124 bits (311), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 65/169 (38%), Positives = 95/169 (56%), Gaps = 6/169 (3%)

Query: 68  CSCTA--SPGSSGVCDRDCHC---GMLLSSCSSGCK-CGNSCLNKPFQNRPVKKMKLVQT 121
           C C    S G +  CD +  C     L+   +S C  CG++C N+ FQ      + + QT
Sbjct: 44  CDCVENYSDGVNHSCDENSDCINRLTLIECVNSKCNYCGDNCQNQRFQKHEYSNISIFQT 103

Query: 122 EKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDA 181
           E  G G+ A+E+++  +F+ EY+GEVIDD    +R+      GE +FY   +     IDA
Sbjct: 104 EMKGFGVRANENLEINQFIYEYIGEVIDDDEFHKRMINYDQRGEKHFYFMMLKSGEFIDA 163

Query: 182 TYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
           T KGN  R+ NHSC PN  + KW++  + ++GIFA R I KGE +T+DY
Sbjct: 164 TEKGNLGRFCNHSCNPNAYVNKWLVGDKLKMGIFAKRPIDKGEEITFDY 212


>gi|330946631|ref|XP_003306794.1| hypothetical protein PTT_20031 [Pyrenophora teres f. teres 0-1]
 gi|311315535|gb|EFQ85104.1| hypothetical protein PTT_20031 [Pyrenophora teres f. teres 0-1]
          Length = 961

 Score =  124 bits (310), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 72/224 (32%), Positives = 115/224 (51%), Gaps = 10/224 (4%)

Query: 27  PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCT----ASPGSSGVC-- 80
           P+  +LPD     K   +V +    Y  K I     D  + C C     ++  ++  C  
Sbjct: 119 PLYLDLPDSTEAAKET-FVVLPECTYANKTIG--TTDPALECDCQEEYDSATKTNHACGE 175

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG  C N+ FQ +    + +++TEK G G+ A++++  G+FV
Sbjct: 176 DSDCINRATKMECVGDCSCGRRCQNQRFQRKQYADVTVIKTEKKGFGLRANKNMVPGDFV 235

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GEVID++T   R+ +    G  +FY   + +   +DAT KGN  R+ NHSC PN  
Sbjct: 236 FEYIGEVIDERTFRRRMGQYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCF 295

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIF  R+++ GE L ++Y  + +  D    YC
Sbjct: 296 VDKWVVGDKLRMGIFVERNVRAGEELVFNYNVDRYGADPQPCYC 339


>gi|239609047|gb|EEQ86034.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ER-3]
          Length = 897

 Score =  124 bits (310), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 66/183 (36%), Positives = 98/183 (53%), Gaps = 7/183 (3%)

Query: 68  CSCTAS----PGSSGVC--DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQT 121
           C CT         +G C  D DC        C   C CG+ C N+ FQ R    + +++T
Sbjct: 155 CDCTEEWDPVAAKNGACGEDSDCINRATKMECVGDCGCGDECENQRFQRRQYANVTVIKT 214

Query: 122 EKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDA 181
           EK G G+ AD D++  +F+ EY+GEVI++    +R+      G  +FY   +++   +DA
Sbjct: 215 EKKGYGLRADSDLRPNQFIFEYIGEVINEGNFRKRMRSYDEEGIKHFYFMSLSKGEFVDA 274

Query: 182 TYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLI 240
           T KGN  R+ NHSC PN  + KW++  + R+GIFA R IK GE L ++Y  + +  D   
Sbjct: 275 TKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFAERHIKAGEELVFNYNVDRYGADPQP 334

Query: 241 AYC 243
            YC
Sbjct: 335 CYC 337


>gi|71013616|ref|XP_758647.1| hypothetical protein UM02500.1 [Ustilago maydis 521]
 gi|74702635|sp|Q4PBL3.1|SET2_USTMA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
           specific; AltName: Full=SET domain-containing protein 2
 gi|46098305|gb|EAK83538.1| hypothetical protein UM02500.1 [Ustilago maydis 521]
          Length = 972

 Score =  124 bits (310), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 69/175 (39%), Positives = 100/175 (57%), Gaps = 4/175 (2%)

Query: 63  DDGIFCSCTASPGSSGVCDRD---CHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKKMKL 118
           DD + C CT + G+  +   D   C   M    CS S C+ G  C N+ F  R    + +
Sbjct: 191 DDYMICDCTPNSGNLDMACTDYSGCINRMTQIECSASKCRWGKQCRNQRFHRRQYVDVDI 250

Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
           VQTEK G G+ A +DI +  F+ EYVGEV++  T  +R+ + +  G  +FY   +  +  
Sbjct: 251 VQTEKKGFGLRACQDIPKETFIYEYVGEVMNQTTFLQRMQQYRIEGIRHFYFMMLQPNEY 310

Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           +DAT KG K R+INHSC PN  + KW +    R+GIFA R+I+KGE LT++Y  +
Sbjct: 311 LDATKKGGKGRFINHSCNPNCAVSKWQVGKHLRMGIFAKRNIQKGEELTFNYNVD 365


>gi|332021876|gb|EGI62212.1| Putative histone-lysine N-methyltransferase ASH1L [Acromyrmex
            echinatior]
          Length = 2178

 Score =  124 bits (310), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 78/237 (32%), Positives = 119/237 (50%), Gaps = 19/237 (8%)

Query: 19   KLLKQIGNPVEFELPDWFIKPKA-------IP---YVFIKRNIYLTKRIKRRLEDDGIFC 68
            KLL+Q   P + +    ++  ++       +P   Y  I+ NIY    +K     +   C
Sbjct: 1296 KLLRQRKTPFQLQYDLLWLHAQSRLSGRDLVPSWNYKKIRTNIYYD--VKPTTLYEAQAC 1353

Query: 69   SCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGA 126
             C    G    C  DC   M+ S CS   C CG+ C N+  Q       ++   TE  G 
Sbjct: 1354 ECKQESG----CGDDCINRMVFSECSPQLCPCGDKCENQKIQKHEWSPGLQRFMTEDKGW 1409

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+   + IK G+F++EYVGEV+ ++  + R+   ++  +T+ Y   ++  +VID    G 
Sbjct: 1410 GVRTQQAIKAGDFILEYVGEVVSEREFKSRMA-TRYANDTHHYCLHLDGGLVIDGHRMGG 1468

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
              R++NHSC PN EMQKW + G  R+ +FA+RDIK GE LTYDY +   + S    C
Sbjct: 1469 DGRFVNHSCEPNCEMQKWSVLGLPRMALFASRDIKPGEELTYDYNFALFNPSEGQQC 1525


>gi|261189306|ref|XP_002621064.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
           SLH14081]
 gi|239591641|gb|EEQ74222.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
           SLH14081]
          Length = 897

 Score =  124 bits (310), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 66/186 (35%), Positives = 99/186 (53%), Gaps = 7/186 (3%)

Query: 65  GIFCSCTAS----PGSSGVC--DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKL 118
            + C CT         +G C  D DC        C   C CG+ C N+ FQ R    + +
Sbjct: 152 AMECDCTEEWDPVAAKNGACGEDSDCINRATKMECVGDCGCGDECENQRFQRRQYANVTV 211

Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
           ++TEK G G+ AD D++  +F+ EY+GEVI++    +R+      G  +FY   +++   
Sbjct: 212 IKTEKKGYGLRADSDLRPNQFIFEYIGEVINEGNFRKRMRSYDEEGIKHFYFMSLSKGEF 271

Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHD 237
           +DAT KGN  R+ NHSC PN  + KW++  + R+GIFA R IK GE L ++Y  + +  D
Sbjct: 272 VDATKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFAERHIKAGEELVFNYNVDRYGAD 331

Query: 238 SLIAYC 243
               YC
Sbjct: 332 PQPCYC 337


>gi|440633543|gb|ELR03462.1| hypothetical protein GMDG_06195 [Geomyces destructans 20631-21]
          Length = 977

 Score =  123 bits (309), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 65/183 (35%), Positives = 100/183 (54%), Gaps = 4/183 (2%)

Query: 65  GIFCSCTASPGSSGVCDRDCHCGMLLSSCS---SGCKCGNSCLNKPFQNRPVKKMKLVQT 121
           G  CS     G +  C  D  C   L+        C CG +C N+ FQ +   K+ +++T
Sbjct: 149 GCDCSEDWRDGKNHACGEDTDCINRLTKIECMDDECNCGPNCQNQRFQRKEFSKVSVIKT 208

Query: 122 EKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDA 181
           EK G G+ AD D+   +F+ EY+GEV+++ T   R+ +    G  +FY   +++   +DA
Sbjct: 209 EKKGYGLRADTDLSAHDFIFEYIGEVVNEPTLRRRMRQYDEEGIKHFYFMSLSKSEFVDA 268

Query: 182 TYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLI 240
           T KGN  R+ NHSC PN  + KW++  + R+GIFA R+IK GE L ++Y  + +  D   
Sbjct: 269 TIKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFAERNIKVGEELVFNYNVDRYGADPQP 328

Query: 241 AYC 243
            YC
Sbjct: 329 CYC 331


>gi|296081207|emb|CBI18233.3| unnamed protein product [Vitis vinifera]
          Length = 370

 Score =  123 bits (309), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 73/186 (39%), Positives = 96/186 (51%), Gaps = 2/186 (1%)

Query: 51  IYLTKRIKRRLEDDGIFCSCT-ASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPF 108
           + L  R  ++ E+D   C C   +      C   C   +    C+   C C   C N+ F
Sbjct: 6   LLLCSRHIKQKENDIAICECKYKANDPDSACGERCWNVLTSIECTPRYCPCSIHCKNQRF 65

Query: 109 QNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNF 168
           Q R   K KL + E  G G++A E+IK GEFV+EY GEVI       R       G  + 
Sbjct: 66  QKREYAKTKLFRAEGRGWGLLATENIKAGEFVMEYCGEVISRTEARGRSQVYVSQGLKDV 125

Query: 169 YLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTY 228
           Y+  +N    IDAT KGN +R+INHSC PN E  KW + GE R+GIFA R+I  G  LTY
Sbjct: 126 YIIPLNARECIDATKKGNLARFINHSCQPNCETMKWSVLGEDRVGIFALRNISVGTELTY 185

Query: 229 DYQYEF 234
            Y +E+
Sbjct: 186 SYNFEW 191


>gi|323354369|gb|EGA86208.1| Set2p [Saccharomyces cerevisiae VL3]
          Length = 683

 Score =  123 bits (309), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 68/186 (36%), Positives = 100/186 (53%), Gaps = 6/186 (3%)

Query: 51  IYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSG--CKCGNSCL 104
           IY  KRI     +D + C C    S G +  CD D  C   + L  C +     CGN C 
Sbjct: 51  IYANKRIGTFKNNDFMECDCYEEFSDGVNHACDEDSDCINRLTLIECVNDLCSSCGNDCQ 110

Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
           N+ FQ +    + + +T+  G G+ A++DI+  +F+ EY GEVI++    +RL       
Sbjct: 111 NQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLIDYDQRH 170

Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
             +FY   +     IDAT KG+ +R+ NHSC PN  + KW++  + R+GIFA R I KGE
Sbjct: 171 FKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQRKILKGE 230

Query: 225 NLTYDY 230
            +T+DY
Sbjct: 231 EITFDY 236


>gi|448524136|ref|XP_003868932.1| Set2 protein [Candida orthopsilosis Co 90-125]
 gi|380353272|emb|CCG26028.1| Set2 protein [Candida orthopsilosis]
          Length = 806

 Score =  123 bits (309), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 61/155 (39%), Positives = 91/155 (58%), Gaps = 3/155 (1%)

Query: 79  VCDRDCHCGMLLSSC---SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIK 135
            C  D +C   L+S    +  C CG+ C N+ FQ R    + + QTE  G G+ A++ I 
Sbjct: 71  ACGEDSNCINRLTSVECTNRHCLCGDDCQNQRFQKRQYADVSVFQTELKGYGLKANKPIS 130

Query: 136 RGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSC 195
            G+F+ EY+GEVID+    +R+ +       +FY   +  D  IDAT KG+ +R++NHSC
Sbjct: 131 EGQFIYEYIGEVIDEGAFRQRMIEYDVKNYKHFYFMMLKPDAFIDATEKGSLARFVNHSC 190

Query: 196 CPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
            PN  + KW++  + R+GIFA R I KGE +T+DY
Sbjct: 191 NPNAFVDKWVVGDKLRMGIFAKRKIAKGEEITFDY 225


>gi|409043202|gb|EKM52685.1| hypothetical protein PHACADRAFT_211907 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 841

 Score =  123 bits (309), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 69/208 (33%), Positives = 111/208 (53%), Gaps = 8/208 (3%)

Query: 44  YVFIKRNIYLTKRIKRRLE-DDGIFCSCTASPG---SSGVCDRDCHCGMLLSSCS---SG 96
           ++ I  N Y  + + R  E  + + C C    G    S  C  D  C   L+        
Sbjct: 111 FIEIPDNHYQYQSLGRSREAGESMTCDCVYEHGVSDPSDACGHDSDCINRLTQVECLPED 170

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C+C + C N+ FQ +    +++VQTEK G G+ A +D+++  F+ EYVG+V++  +  +R
Sbjct: 171 CRCRSHCQNQRFQRKQHAPIEIVQTEKKGFGLRAGQDLRKDTFIYEYVGDVVNHPSLLKR 230

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           + +    G  +FY   + +D  IDAT +G   R+ NHSC PN  + KW +    R+GIFA
Sbjct: 231 MRQYGEEGIQHFYFMALQKDEFIDATKRGGIGRFANHSCNPNCYVAKWSVGKRVRMGIFA 290

Query: 217 TRDIKKGENLTYDYQYE-FLHDSLIAYC 243
            R+IKK E LT++Y  + + H+  + YC
Sbjct: 291 NRNIKKNEELTFNYNVDRYGHEPQMCYC 318


>gi|353234454|emb|CCA66479.1| related to SET2-Histone methyltransferase [Piriformospora indica
           DSM 11827]
          Length = 799

 Score =  123 bits (309), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 75/211 (35%), Positives = 108/211 (51%), Gaps = 27/211 (12%)

Query: 58  KRRLEDDGIFCSCTASPGSSGVCDR------DCHCGMLLSSC-SSGCKCGNSCLNKPFQN 110
           K ++++D + C C   PG     DR      DC   + L  C    C+CG  C N+ FQ 
Sbjct: 93  KPKIQEDAMGCECYYKPGDP--LDRACGPNSDCINRLTLVECVEEECRCGVFCQNQRFQK 150

Query: 111 RPVKKMKLVQTEKCGAGIVADEDIKR-----------------GEFVIEYVGEVIDDQTC 153
           R    + +V+TEK G G+ A   IKR                  EF+ EY+GE+I+++T 
Sbjct: 151 RQYANIHVVKTEKKGYGLRAASPIKRYLIFSSADASTPHMVSSDEFIYEYIGEIINERTL 210

Query: 154 EERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIG 213
           E R+      G  +FY   + +   IDAT KG   R+ NHSC PN  + +W++DG  R+G
Sbjct: 211 ERRMDNYGDEGIEHFYFMMLQKGEFIDATKKGGFGRFANHSCNPNCYVARWVVDGGLRMG 270

Query: 214 IFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           IFA RDI K E LT++Y  + + H +   YC
Sbjct: 271 IFAKRDILKDEELTFNYNADRYGHAAQPCYC 301


>gi|361129227|gb|EHL01139.1| putative Histone-lysine N-methyltransferase, H3 lysine-36 specific
           [Glarea lozoyensis 74030]
          Length = 839

 Score =  123 bits (309), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 60/148 (40%), Positives = 92/148 (62%), Gaps = 1/148 (0%)

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           CKCG++C N+ FQ +    + +++T+K G G+ A+ D+  G+FV EY+GEVI++ T  +R
Sbjct: 104 CKCGDTCQNQRFQRQDYADVSVIKTDKKGYGLRANTDLDAGDFVFEYIGEVINEPTFRKR 163

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
             +    G  +FY   + +D  +DAT KGN  R+ NHSC PN  + KWI+  + R+GIFA
Sbjct: 164 TIEYHKEGIKHFYFMSLGKDEFVDATKKGNLGRFCNHSCNPNCFVDKWIVKDKLRMGIFA 223

Query: 217 TRDIKKGENLTYDYQYE-FLHDSLIAYC 243
            R I+KGE L ++Y  + +  D    YC
Sbjct: 224 QRPIQKGEELVFNYNVDRYGADPQPCYC 251


>gi|1008361|emb|CAA89464.1| EZL1 [Saccharomyces cerevisiae]
          Length = 733

 Score =  123 bits (309), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 68/186 (36%), Positives = 100/186 (53%), Gaps = 6/186 (3%)

Query: 51  IYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSG--CKCGNSCL 104
           IY  KRI     +D + C C    S G +  CD D  C   + L  C +     CGN C 
Sbjct: 51  IYANKRIGTFKNNDFMECDCYEEFSDGVNHACDEDSDCINRLTLIECVNDLCSSCGNDCQ 110

Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
           N+ FQ +    + + +T+  G G+ A++DI+  +F+ EY GEVI++    +RL       
Sbjct: 111 NQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLIDYDQRH 170

Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
             +FY   +     IDAT KG+ +R+ NHSC PN  + KW++  + R+GIFA R I KGE
Sbjct: 171 FKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQRKILKGE 230

Query: 225 NLTYDY 230
            +T+DY
Sbjct: 231 EITFDY 236


>gi|330443605|ref|NP_012367.2| Set2p [Saccharomyces cerevisiae S288c]
 gi|347595709|sp|P46995.2|SET2_YEAST RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
           specific; AltName: Full=Lysine N-methyltransferase 3;
           AltName: Full=SET domain-containing protein 2
 gi|329138914|tpg|DAA08635.2| TPA: Set2p [Saccharomyces cerevisiae S288c]
          Length = 733

 Score =  123 bits (309), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 68/186 (36%), Positives = 100/186 (53%), Gaps = 6/186 (3%)

Query: 51  IYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSG--CKCGNSCL 104
           IY  KRI     +D + C C    S G +  CD D  C   + L  C +     CGN C 
Sbjct: 51  IYANKRIGTFKNNDFMECDCYEEFSDGVNHACDEDSDCINRLTLIECVNDLCSSCGNDCQ 110

Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
           N+ FQ +    + + +T+  G G+ A++DI+  +F+ EY GEVI++    +RL       
Sbjct: 111 NQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLIDYDQRH 170

Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
             +FY   +     IDAT KG+ +R+ NHSC PN  + KW++  + R+GIFA R I KGE
Sbjct: 171 FKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQRKILKGE 230

Query: 225 NLTYDY 230
            +T+DY
Sbjct: 231 EITFDY 236


>gi|349579035|dbj|GAA24198.1| K7_Set2p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 733

 Score =  123 bits (309), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 68/186 (36%), Positives = 100/186 (53%), Gaps = 6/186 (3%)

Query: 51  IYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSG--CKCGNSCL 104
           IY  KRI     +D + C C    S G +  CD D  C   + L  C +     CGN C 
Sbjct: 51  IYANKRIGTFKNNDFMECDCYEEFSDGVNHACDEDSDCINRLTLIECVNDLCSSCGNDCQ 110

Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
           N+ FQ +    + + +T+  G G+ A++DI+  +F+ EY GEVI++    +RL       
Sbjct: 111 NQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLIDYDQRH 170

Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
             +FY   +     IDAT KG+ +R+ NHSC PN  + KW++  + R+GIFA R I KGE
Sbjct: 171 FKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQRKILKGE 230

Query: 225 NLTYDY 230
            +T+DY
Sbjct: 231 EITFDY 236


>gi|151944961|gb|EDN63216.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
 gi|190409343|gb|EDV12608.1| SET domain protein 2 [Saccharomyces cerevisiae RM11-1a]
 gi|207344135|gb|EDZ71371.1| YJL168Cp-like protein [Saccharomyces cerevisiae AWRI1631]
 gi|256271359|gb|EEU06424.1| Set2p [Saccharomyces cerevisiae JAY291]
          Length = 733

 Score =  123 bits (309), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 68/186 (36%), Positives = 100/186 (53%), Gaps = 6/186 (3%)

Query: 51  IYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSG--CKCGNSCL 104
           IY  KRI     +D + C C    S G +  CD D  C   + L  C +     CGN C 
Sbjct: 51  IYANKRIGTFKNNDFMECDCYEEFSDGVNHACDEDSDCINRLTLIECVNDLCSSCGNDCQ 110

Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
           N+ FQ +    + + +T+  G G+ A++DI+  +F+ EY GEVI++    +RL       
Sbjct: 111 NQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLIDYDQRH 170

Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
             +FY   +     IDAT KG+ +R+ NHSC PN  + KW++  + R+GIFA R I KGE
Sbjct: 171 FKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQRKILKGE 230

Query: 225 NLTYDY 230
            +T+DY
Sbjct: 231 EITFDY 236


>gi|323332954|gb|EGA74356.1| Set2p [Saccharomyces cerevisiae AWRI796]
          Length = 537

 Score =  123 bits (308), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 68/186 (36%), Positives = 100/186 (53%), Gaps = 6/186 (3%)

Query: 51  IYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSG--CKCGNSCL 104
           IY  KRI     +D + C C    S G +  CD D  C   + L  C +     CGN C 
Sbjct: 51  IYANKRIGTFKNNDFMECDCYEEFSDGVNHACDEDSDCINRLTLIECVNDLCSSCGNDCQ 110

Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
           N+ FQ +    + + +T+  G G+ A++DI+  +F+ EY GEVI++    +RL       
Sbjct: 111 NQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLIDYDQRH 170

Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
             +FY   +     IDAT KG+ +R+ NHSC PN  + KW++  + R+GIFA R I KGE
Sbjct: 171 FKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQRKILKGE 230

Query: 225 NLTYDY 230
            +T+DY
Sbjct: 231 EITFDY 236


>gi|365764896|gb|EHN06414.1| Set2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 733

 Score =  123 bits (308), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 68/186 (36%), Positives = 100/186 (53%), Gaps = 6/186 (3%)

Query: 51  IYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSG--CKCGNSCL 104
           IY  KRI     +D + C C    S G +  CD D  C   + L  C +     CGN C 
Sbjct: 51  IYANKRIGTFKNNDFMECDCYEEFSDGVNHACDEDSDCINRLTLIECVNDLCSSCGNDCQ 110

Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
           N+ FQ +    + + +T+  G G+ A++DI+  +F+ EY GEVI++    +RL       
Sbjct: 111 NQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLIDYDQRH 170

Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
             +FY   +     IDAT KG+ +R+ NHSC PN  + KW++  + R+GIFA R I KGE
Sbjct: 171 FKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQRKILKGE 230

Query: 225 NLTYDY 230
            +T+DY
Sbjct: 231 EITFDY 236


>gi|198451130|ref|XP_001358254.2| GA18567 [Drosophila pseudoobscura pseudoobscura]
 gi|198131348|gb|EAL27392.2| GA18567 [Drosophila pseudoobscura pseudoobscura]
          Length = 1541

 Score =  123 bits (308), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 75/214 (35%), Positives = 114/214 (53%), Gaps = 11/214 (5%)

Query: 37   IKPKAIPYVFIKRN--IYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSS 92
            IK    PYV I+ N  +   K I    E      +C   P     C  + +C   ML + 
Sbjct: 1266 IKVNPPPYVKIRINKAVPPVKFITNSEE----HSTCDCRPEDEHPCGANSNCLNRMLFNE 1321

Query: 93   CSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQ 151
            C    C+CG+ C N+ F+ R   +M +V     G G+V  E I  G+F+IEYVGEVI+ +
Sbjct: 1322 CHPEYCRCGDRCENRMFETRKSPRMDVVYMNARGFGLVCREPIAEGDFIIEYVGEVINQE 1381

Query: 152  TCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETR 211
              + R+ + +   + NFY   + ++ +IDA  KGN +R++NHSC PN   QKW ++   R
Sbjct: 1382 EFQRRMLRKQKDRDENFYFLGVEKEFIIDAGPKGNLARFMNHSCEPNCTSQKWTVNCTNR 1441

Query: 212  IGIFATRDIKKGENLTYDYQY-EFLHDSLIA-YC 243
            +G+FA +DI     LT++Y + + L+D   A YC
Sbjct: 1442 VGLFAIQDIPAETELTFNYLWDDLLNDKKKACYC 1475


>gi|157818737|ref|NP_001101159.1| probable histone-lysine N-methyltransferase ASH1L [Rattus norvegicus]
 gi|149048100|gb|EDM00676.1| ash1 (absent, small, or homeotic)-like (Drosophila) (predicted)
            [Rattus norvegicus]
          Length = 2918

 Score =  123 bits (308), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 62/155 (40%), Positives = 93/155 (60%), Gaps = 3/155 (1%)

Query: 80   CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRG 137
            C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +K G
Sbjct: 2059 CGDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAG 2118

Query: 138  EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
            +F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHSC P
Sbjct: 2119 QFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDP 2177

Query: 198  NTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            N EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2178 NCEMQKWSVNGVYRIGLYALKDVPAGTELTYDYNF 2212


>gi|365982940|ref|XP_003668303.1| hypothetical protein NDAI_0B00260 [Naumovozyma dairenensis CBS 421]
 gi|343767070|emb|CCD23060.1| hypothetical protein NDAI_0B00260 [Naumovozyma dairenensis CBS 421]
          Length = 795

 Score =  123 bits (308), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 73/200 (36%), Positives = 107/200 (53%), Gaps = 18/200 (9%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS----SGVCDRDCHC--GMLLSSCSSGC 97
           ++ I+  IY +K++     +D I C C          +  CD D  C   + L  C +  
Sbjct: 27  FIPIESCIYSSKKLGNSKNNDFIECDCYEDKDPQIHINHACDEDSDCINRLTLIECVNNL 86

Query: 98  --KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEE 155
              CGN C N+ FQ +    + + +T+  G G+ A+E+I+ G+F+ EY+GEVID+    E
Sbjct: 87  CYSCGNDCQNQRFQKKQYSNVSIFKTKLKGYGVRANENIENGQFIYEYIGEVIDEIQFRE 146

Query: 156 R-----LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGET 210
           R     L K KH     FY   +     IDAT KG+ +R+ NHSC PN  + KW + G+ 
Sbjct: 147 RMIDYDLKKFKH-----FYFMMLQNGQFIDATIKGSLARFCNHSCSPNAYVNKWEVAGKL 201

Query: 211 RIGIFATRDIKKGENLTYDY 230
           R+GIFA R I KGE +T+DY
Sbjct: 202 RMGIFAKRKILKGEEITFDY 221


>gi|195143973|ref|XP_002012971.1| GL23881 [Drosophila persimilis]
 gi|194101914|gb|EDW23957.1| GL23881 [Drosophila persimilis]
          Length = 1466

 Score =  123 bits (308), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 68/200 (34%), Positives = 106/200 (53%), Gaps = 1/200 (0%)

Query: 37   IKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG 96
            IK    PYV IK N  +         ++   C C          + +C   ML + C   
Sbjct: 1191 IKVNPPPYVKIKINKAVPPVKFVTNSEEHSTCDCRPEDEHPCGANSNCLNRMLFNECHPE 1250

Query: 97   -CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEE 155
             C+CG+ C N+ F+ R   +M +V     G G+V  E I  G+F+IEYVGEVI+ +  + 
Sbjct: 1251 YCRCGDRCENRMFETRKSPRMDVVYMNARGFGLVCREPIAEGDFIIEYVGEVINQEEFQR 1310

Query: 156  RLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIF 215
            R+ + +   + NFY   + ++ +IDA  KGN +R++NHSC PN   QKW ++   R+G+F
Sbjct: 1311 RMLRKQKDRDENFYFLGVEKEFIIDAGPKGNLARFMNHSCEPNCTSQKWTVNCTHRVGLF 1370

Query: 216  ATRDIKKGENLTYDYQYEFL 235
            A +DI     LT++Y ++ L
Sbjct: 1371 AIQDIPAETELTFNYLWDDL 1390


>gi|156847005|ref|XP_001646388.1| hypothetical protein Kpol_2001p35 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156117064|gb|EDO18530.1| hypothetical protein Kpol_2001p35 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 693

 Score =  123 bits (308), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 68/185 (36%), Positives = 102/185 (55%), Gaps = 6/185 (3%)

Query: 52  YLTKRIKRRLEDDGIFCSCTAS--PGSSGVCDR--DCHCGMLLSSCSSGC--KCGNSCLN 105
           Y +K I      D + C C      G +  CD   DC   + L  C +G    CG++C N
Sbjct: 46  YASKYIGNAKSSDFMECDCYEEFEEGINHACDENSDCINRLTLIECVNGLCDTCGDNCQN 105

Query: 106 KPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGE 165
           + FQ +    + + +T+  G G+ A++DI+  +F+ EY GEVI++    ERL      G 
Sbjct: 106 QRFQKKEYADISVFKTKMKGYGVRANKDIEANDFIYEYKGEVIEEDEFRERLIDYDERGF 165

Query: 166 TNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGEN 225
            +FY   +     IDAT KG+ +R+ NHSC PN  + KW++ G+ ++GIFA RDI KGE 
Sbjct: 166 KHFYFMMLQNAEFIDATIKGSLARFCNHSCNPNAYVNKWVVAGKLKMGIFAKRDIYKGEE 225

Query: 226 LTYDY 230
           +T+DY
Sbjct: 226 ITFDY 230


>gi|344230287|gb|EGV62172.1| SET domain-containing protein [Candida tenuis ATCC 10573]
          Length = 721

 Score =  123 bits (308), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 67/186 (36%), Positives = 99/186 (53%), Gaps = 7/186 (3%)

Query: 52  YLTKRIKRRLEDDGIFCSC----TASPGSSGVCDRDCHCGMLLSS--CSSG-CKCGNSCL 104
           Y  K I    +++ + C C      +   +  C  D  C    +S  C +G C CG+ C 
Sbjct: 40  YQNKSIGDSGQNENMTCDCHEKVEETTNVNHACGDDSGCINRATSVECMAGACNCGDRCQ 99

Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
           N+ FQ      + + QTEK G G+ AD  +  G F+ EY+GEVID+ T   R+       
Sbjct: 100 NQRFQRCEYAAISVFQTEKKGYGVRADTALDEGSFIYEYIGEVIDEATFRRRMVDYDSRQ 159

Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
             +FY   + +D  IDAT KG  +R++NHSC PN  + KW++  + R+GIFA R I +GE
Sbjct: 160 LRHFYFMMLKKDAFIDATEKGALARFVNHSCSPNAYVDKWVVGDKLRMGIFAKRHIARGE 219

Query: 225 NLTYDY 230
            +T+DY
Sbjct: 220 EITFDY 225


>gi|195395005|ref|XP_002056127.1| GJ10771 [Drosophila virilis]
 gi|194142836|gb|EDW59239.1| GJ10771 [Drosophila virilis]
          Length = 1430

 Score =  122 bits (307), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 70/196 (35%), Positives = 105/196 (53%), Gaps = 5/196 (2%)

Query: 43   PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG-CKC 99
            PY+ IK N  +        E+D   C C   PGS   C  +  C   ML   C    C  
Sbjct: 1154 PYIKIKANKPVPPVRFVYNEEDLNICEC--KPGSEHPCGPESGCLNRMLYHECHPDYCHA 1211

Query: 100  GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
            G  C N+ F+ R   ++++V   + G G+V  E I  G+F+IEYVGEVI+    + R+ +
Sbjct: 1212 GRQCENQLFELRKSPRLEVVYMNERGFGLVCREPIAEGDFIIEYVGEVINHAEFQRRMAQ 1271

Query: 160  MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
                 + NFY   + +D +IDA  KGN +R++NHSC PN   QKW ++   R+G+FA +D
Sbjct: 1272 KTRDRDENFYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCATQKWTVNCINRVGLFAIKD 1331

Query: 220  IKKGENLTYDYQYEFL 235
            I +   LT++Y ++ L
Sbjct: 1332 IPENTELTFNYLWDDL 1347


>gi|358056897|dbj|GAA97247.1| hypothetical protein E5Q_03924 [Mixia osmundae IAM 14324]
          Length = 949

 Score =  122 bits (307), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 65/181 (35%), Positives = 98/181 (54%), Gaps = 5/181 (2%)

Query: 68  CSCTASPG---SSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEK 123
           C C    G    S  C  +C   +    C  G C+CG  C N+ FQ +    +++V T  
Sbjct: 219 CECDYEHGLDAPSSACGTNCINRLTQVECVPGSCRCGKYCTNRRFQRKQYANIQIVDTGM 278

Query: 124 CGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATY 183
            G G+ A ED+ +  FV EY+GEV+     ++R+      G  +FY   + R+  IDAT 
Sbjct: 279 KGFGVRAAEDMLKDAFVYEYIGEVVGAGQLQKRMKDYYEEGIEHFYFMALQREEFIDATK 338

Query: 184 KGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAY 242
           KGNK R++NHSC PN  + KW++  + R+GIF  R I+ GE LT++Y  + + H++   Y
Sbjct: 339 KGNKGRFLNHSCSPNCYVSKWVVGEKMRMGIFTKRKIQAGEELTFNYNVDRYGHEAQPCY 398

Query: 243 C 243
           C
Sbjct: 399 C 399


>gi|148683294|gb|EDL15241.1| ash1 (absent, small, or homeotic)-like (Drosophila) [Mus musculus]
          Length = 2918

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 62/155 (40%), Positives = 93/155 (60%), Gaps = 3/155 (1%)

Query: 80   CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRG 137
            C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +K G
Sbjct: 2059 CGDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAG 2118

Query: 138  EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
            +F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHSC P
Sbjct: 2119 QFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDP 2177

Query: 198  NTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            N EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2178 NCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2212


>gi|147846734|emb|CAN80636.1| hypothetical protein VITISV_017995 [Vitis vinifera]
          Length = 278

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 75/204 (36%), Positives = 102/204 (50%), Gaps = 4/204 (1%)

Query: 41  AIPYVFIK--RNIYLTKRIKRRLEDDGIFCSCTASPGS-SGVCDRDCHCGMLLSSCSSG- 96
            + Y F+    N+   +R  ++ E+D   C C          C   C   +    C+   
Sbjct: 37  VVDYSFVSFLDNLEGEERHIKQKENDIAICECKYKANDPDSACGERCWNVLTSIECTPRY 96

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C C   C N+ FQ R   K KL + E  G G++A E+IK GEFV+EY GEVI       R
Sbjct: 97  CPCSIHCKNQRFQKREYAKTKLFRAEGRGWGLLATENIKAGEFVMEYCGEVISRTEARGR 156

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
                  G  + Y+  +N    IDAT KGN +R+INHSC PN E  KW + GE R+GIFA
Sbjct: 157 SQVYVSQGLKDVYIIPLNARECIDATKKGNLARFINHSCQPNCETMKWSVLGEDRVGIFA 216

Query: 217 TRDIKKGENLTYDYQYEFLHDSLI 240
            R+I  G  LTY Y +E+   + +
Sbjct: 217 LRNISVGTELTYSYNFEWYSXAKV 240


>gi|154292496|ref|XP_001546822.1| hypothetical protein BC1G_14887 [Botryotinia fuckeliana B05.10]
 gi|347836524|emb|CCD51096.1| hypothetical protein [Botryotinia fuckeliana]
          Length = 953

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 76/225 (33%), Positives = 118/225 (52%), Gaps = 10/225 (4%)

Query: 27  PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDC 84
           P+  +LPD   +  ++ +  IK  IY  K +    + D + C C    S G +  C  D 
Sbjct: 114 PLFNDLPDSTEEAASV-FQVIKDCIYGAKHMGAS-DHDALDCDCPEEFSDGKNYACGEDS 171

Query: 85  HCGMLLS--SCSSG---CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
            C   L+   C  G   C CG  C N+ FQ +   K+ +++T+K G G+ A+ D++  +F
Sbjct: 172 DCINRLTKMECGGGHKDCNCGLDCQNQRFQRKQYAKVSVIKTDKKGYGLRANTDLQPDDF 231

Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
           + EY+GEVI++ T   R  +    G  +FY   + +   +DAT KGN  R+ NHSC PN 
Sbjct: 232 IFEYIGEVINEPTFRRRTVQYDQEGIKHFYFMSLTKHEFVDATKKGNLGRFCNHSCNPNC 291

Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
            + KW++  + R+GIFA R IK GE L ++Y  + +  D    YC
Sbjct: 292 YVDKWVVGEKLRMGIFAERAIKAGEELVFNYNVDRYGADPQPCYC 336


>gi|218190961|gb|EEC73388.1| hypothetical protein OsI_07633 [Oryza sativa Indica Group]
          Length = 1906

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 61/125 (48%), Positives = 76/125 (60%)

Query: 108  FQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN 167
            FQ R   K+    T K G G+   ED+  G F+IEYVGEV+D    E R       G+ +
Sbjct: 1302 FQRRTYAKLGKFHTGKKGYGLQLKEDVSEGRFLIEYVGEVLDITAYESRQRYYASKGQKH 1361

Query: 168  FYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLT 227
            FY   +N   VIDA  KGN  R+INHSC PN   +KW+++GE  IGIFA R+IKKGE LT
Sbjct: 1362 FYFMALNGGEVIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFAMRNIKKGEELT 1421

Query: 228  YDYQY 232
            +DY Y
Sbjct: 1422 FDYNY 1426


>gi|222623047|gb|EEE57179.1| hypothetical protein OsJ_07116 [Oryza sativa Japonica Group]
          Length = 1963

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 61/125 (48%), Positives = 76/125 (60%)

Query: 108  FQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN 167
            FQ R   K+    T K G G+   ED+  G F+IEYVGEV+D    E R       G+ +
Sbjct: 1287 FQRRTYAKLGKFHTGKKGYGLQLKEDVSEGRFLIEYVGEVLDITAYESRQRYYASKGQKH 1346

Query: 168  FYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLT 227
            FY   +N   VIDA  KGN  R+INHSC PN   +KW+++GE  IGIFA R+IKKGE LT
Sbjct: 1347 FYFMALNGGEVIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFAMRNIKKGEELT 1406

Query: 228  YDYQY 232
            +DY Y
Sbjct: 1407 FDYNY 1411


>gi|323337021|gb|EGA78277.1| Set2p [Saccharomyces cerevisiae Vin13]
          Length = 733

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 67/186 (36%), Positives = 100/186 (53%), Gaps = 6/186 (3%)

Query: 51  IYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSG--CKCGNSCL 104
           IY  KRI     +D + C C    S G +  CD D  C   + L  C +     CGN C 
Sbjct: 51  IYANKRIGTFKNNDFMECDCYEEFSDGVNHACDEDSDCINRLTLIECVNDLCSSCGNDCQ 110

Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
           N+ FQ +    + + +T+  G G+ A++DI+  +F+ EY GEVI++    +RL       
Sbjct: 111 NQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLIDYDQRH 170

Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
             +FY   +     IDAT KG+ +R+ NHSC PN  + KW++  + R+GIFA R + KGE
Sbjct: 171 FKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQRKLLKGE 230

Query: 225 NLTYDY 230
            +T+DY
Sbjct: 231 EITFDY 236


>gi|342872314|gb|EGU74695.1| hypothetical protein FOXB_14795 [Fusarium oxysporum Fo5176]
          Length = 919

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 74/240 (30%), Positives = 121/240 (50%), Gaps = 9/240 (3%)

Query: 11  SRIGHAFNKLLKQIGNPVEFE-LPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCS 69
           S++    ++ L     PV F+ LPD   +     +  I   +Y +K +    ++D + C 
Sbjct: 59  SKLSRKASQKLAASREPVLFDHLPDMTAESCNF-FQLIPDCLYGSKHLGS-TDNDALDCE 116

Query: 70  CTAS--PGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKC 124
           C      G +  C  D  C      +   +    C   C N+ FQ +    + +++TEK 
Sbjct: 117 CREEWHDGENIACGEDSDCINRATKMECSAEAGNCAGGCQNQRFQRKQYANVSVIKTEKK 176

Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
           G G+ AD D++  +FV EY+GEVI++ T   R+ +    G  +FY   +N+   +DAT K
Sbjct: 177 GFGLRADSDLQANDFVFEYIGEVINEPTFRRRMMQYDEEGIKHFYFMSLNKSEFVDATKK 236

Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           GN  R+ NHSC PN  + KW++  + R+GIFA+R I+ GE L ++Y  + +  D    YC
Sbjct: 237 GNYGRFCNHSCNPNCYVDKWVVGDKLRMGIFASRKIQSGEELVFNYNVDRYGADPQPCYC 296


>gi|392298598|gb|EIW09695.1| Set2p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 733

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 67/186 (36%), Positives = 100/186 (53%), Gaps = 6/186 (3%)

Query: 51  IYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSG--CKCGNSCL 104
           IY  KRI     +D + C C    S G +  CD D  C   + L  C +     CGN C 
Sbjct: 51  IYANKRIGTFKNNDFMECDCYEEFSDGVNHACDEDSDCINRLTLIECVNDLCSSCGNDCQ 110

Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
           N+ FQ +    + + +T+  G G+ A++DI+  +F+ EY GEVI++    +RL       
Sbjct: 111 NQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLIDYDQRH 170

Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
             +FY   +     IDAT KG+ +R+ NHSC PN  + KW++  + R+GIFA R + KGE
Sbjct: 171 FKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQRKLLKGE 230

Query: 225 NLTYDY 230
            +T+DY
Sbjct: 231 EITFDY 236


>gi|259147310|emb|CAY80562.1| Set2p [Saccharomyces cerevisiae EC1118]
          Length = 733

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 67/186 (36%), Positives = 100/186 (53%), Gaps = 6/186 (3%)

Query: 51  IYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSG--CKCGNSCL 104
           IY  KRI     +D + C C    S G +  CD D  C   + L  C +     CGN C 
Sbjct: 51  IYANKRIGTFKNNDFMECDCYEEFSDGVNHACDEDSDCINRLTLIECVNDLCSSCGNDCQ 110

Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
           N+ FQ +    + + +T+  G G+ A++DI+  +F+ EY GEVI++    +RL       
Sbjct: 111 NQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLIDYDQRH 170

Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
             +FY   +     IDAT KG+ +R+ NHSC PN  + KW++  + R+GIFA R + KGE
Sbjct: 171 FKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQRKLLKGE 230

Query: 225 NLTYDY 230
            +T+DY
Sbjct: 231 EITFDY 236


>gi|149239909|ref|XP_001525830.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146449953|gb|EDK44209.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 822

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 62/172 (36%), Positives = 96/172 (55%), Gaps = 7/172 (4%)

Query: 66  IFCSCTASPGSSG----VCDRDCHCGMLLSSC---SSGCKCGNSCLNKPFQNRPVKKMKL 118
           + C C     SS      C  D +C   ++S    +  C CG +C N+ FQ +    + +
Sbjct: 55  MTCDCEEDWDSSTEQNMACGEDSNCINRITSVECINRHCSCGENCQNQRFQKKQYADVSV 114

Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
            QTE  G G+ A+  ++ G+F+ EY+GEVID+ T  +++ +       +FY   +  D  
Sbjct: 115 FQTELKGYGLRANTQLREGDFIYEYIGEVIDEPTFRQKMIEYDLKQYKHFYFMMLKNDAF 174

Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
           IDAT KG+ +R++NHSC PN  + KW++    R+GIFA RDI  GE +T+DY
Sbjct: 175 IDATEKGSLARFVNHSCSPNAFVDKWVVADRLRMGIFAKRDIMAGEEITFDY 226


>gi|323348010|gb|EGA82268.1| Set2p [Saccharomyces cerevisiae Lalvin QA23]
          Length = 650

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 67/186 (36%), Positives = 100/186 (53%), Gaps = 6/186 (3%)

Query: 51  IYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSG--CKCGNSCL 104
           IY  KRI     +D + C C    S G +  CD D  C   + L  C +     CGN C 
Sbjct: 51  IYANKRIGTFKNNDFMECDCYEEFSDGVNHACDEDSDCINRLTLIECVNDLCSSCGNDCQ 110

Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
           N+ FQ +    + + +T+  G G+ A++DI+  +F+ EY GEVI++    +RL       
Sbjct: 111 NQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLIDYDQRH 170

Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
             +FY   +     IDAT KG+ +R+ NHSC PN  + KW++  + R+GIFA R + KGE
Sbjct: 171 FKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQRKLLKGE 230

Query: 225 NLTYDY 230
            +T+DY
Sbjct: 231 EITFDY 236


>gi|322798377|gb|EFZ20101.1| hypothetical protein SINV_07690 [Solenopsis invicta]
          Length = 2203

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 78/237 (32%), Positives = 115/237 (48%), Gaps = 19/237 (8%)

Query: 19   KLLKQIGNPVEFELPD-WFIKPKAIP---------YVFIKRNIYLTKRIKRRLEDDGIFC 68
            KLL+Q   P + +    W      +P         Y  I+ NIY    +K     +   C
Sbjct: 1310 KLLRQRKTPFQLQYDLLWLHAQSRLPGRDLVPSWNYKKIRTNIYYD--VKPTTLYEAQAC 1367

Query: 69   SCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGA 126
             C    G    C  DC   M+ S CS   C C + C N+  Q       ++   TE  G 
Sbjct: 1368 ECKPESG----CGDDCINRMVFSECSPQLCPCSDKCENQKIQKHEWSPGLQRFMTEDKGW 1423

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+   + IK G+F++EYVGEV+ ++  + R+   ++  +T+ Y   ++  +VID    G 
Sbjct: 1424 GVRTQQAIKSGDFILEYVGEVVSEREFKSRMA-TRYANDTHHYCLHLDGGLVIDGHRMGG 1482

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
              R++NHSC PN EMQKW + G  R+ +FA+RDIK GE LTYDY +   + S    C
Sbjct: 1483 DGRFVNHSCEPNCEMQKWSVLGLPRMALFASRDIKPGEELTYDYNFALFNPSEGQQC 1539


>gi|396478086|ref|XP_003840449.1| hypothetical protein LEMA_P101010.1 [Leptosphaeria maculans JN3]
 gi|312217021|emb|CBX96970.1| hypothetical protein LEMA_P101010.1 [Leptosphaeria maculans JN3]
          Length = 962

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 73/239 (30%), Positives = 117/239 (48%), Gaps = 10/239 (4%)

Query: 12  RIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCT 71
           ++  A ++ +     P+  +LPD     KA  +  +    Y  K I     D  + C C 
Sbjct: 129 KLSRAASQKIHARPPPLYLDLPDSTEAAKAT-FTVLPECTYANKNIG--TTDPALECDCQ 185

Query: 72  ASPGS----SGVC--DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCG 125
                    +  C  D DC        C   C CG  C N+ F  +    + +++T+K G
Sbjct: 186 TEWDPVLHVNNACGEDSDCINRATKMECVGDCTCGRKCQNQRFLRKQYADVTVIKTDKKG 245

Query: 126 AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKG 185
            G+ A++D+  G+FV EY+GEVID++T   R+ +  H G  +FY   + +   +DAT KG
Sbjct: 246 FGLRANKDMAPGDFVFEYIGEVIDEKTFRRRMLQYDHEGIKHFYFMSLTKGEFVDATKKG 305

Query: 186 NKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           N  R+ NHSC PN  + KW++  + R+GIF  R ++ GE L ++Y  + +  D    YC
Sbjct: 306 NLGRFCNHSCNPNCFVDKWVVGDKLRMGIFVERRVQAGEELVFNYNVDRYGADPQPCYC 364


>gi|150865134|ref|XP_001384227.2| hypothetical protein PICST_31672 [Scheffersomyces stipitis CBS
           6054]
 gi|149386390|gb|ABN66198.2| predicted protein [Scheffersomyces stipitis CBS 6054]
          Length = 732

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 69/186 (37%), Positives = 100/186 (53%), Gaps = 7/186 (3%)

Query: 52  YLTKRIKRRLEDDGIFCSCTASPGSSG----VCDRDCHCGMLLSS--CSSG-CKCGNSCL 104
           Y +K I    + + + C C     S       C  D +C   ++S  C +G C CG +C 
Sbjct: 51  YSSKLIGSSGQQEYMTCDCVEDWDSESQRNLACSDDSNCINRVTSVECINGHCGCGKNCQ 110

Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
           N+ FQ R    + + QTE  G G+ AD+ I  G F+ EY+GEVID+ +   R+       
Sbjct: 111 NQRFQKRQYASVSVFQTELKGYGLRADDVIPEGGFIYEYIGEVIDEASFRARMIDYDSKN 170

Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
             +FY   +  D  IDAT KG+ +R+ NHSC PN  + KW++  + R+GIFA R I KGE
Sbjct: 171 FKHFYFMMLKNDSFIDATIKGSLARFCNHSCSPNAFVDKWVVGDKLRMGIFAKRSISKGE 230

Query: 225 NLTYDY 230
            +T+DY
Sbjct: 231 EITFDY 236


>gi|170573421|ref|XP_001892464.1| SET domain containing protein [Brugia malayi]
 gi|158601976|gb|EDP38706.1| SET domain containing protein [Brugia malayi]
          Length = 1603

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 88/286 (30%), Positives = 136/286 (47%), Gaps = 55/286 (19%)

Query: 12   RIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRN-IYLTKRIKR-----RLEDDG 65
            +I   F  L   +G+   ++LP + +  K +    I+RN +  +KR K      R+ D  
Sbjct: 754  QISRQFANLELNLGDRKHWDLPWYRLNWKEVTRRLIERNRLEKSKREKNSSKTYRIGDKK 813

Query: 66   IFCS-------CTASPGSSGVCDRDCHCG-------------------MLLSSCSSGC-- 97
            +  S        +A+P SS VC+R                         +  SCSSGC  
Sbjct: 814  LPKSRKSDSFATSATPSSSKVCERSRRSSGDYIKLKQNVIVDAYPKIEQMQCSCSSGCCG 873

Query: 98   ------------KCGNSCLN-------KPFQNRPVKKMKLVQT-EKCGAGIVADEDIKRG 137
                        +CGNSC         + F+   V++++  QT   CG G+  D +I +G
Sbjct: 874  ESDECLNRVVLMECGNSCPRNALCTNKRLFRRECVERLRTFQTMNGCGIGVKTDVNIDKG 933

Query: 138  EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
            +F+ EY+GEV+  +T   R  +  +  + N Y   +    V+DA +KGN +R+INHSC P
Sbjct: 934  QFICEYIGEVVSMETFNIR-SRTDYRYQRNHYALNLCPGFVVDAYHKGNIARFINHSCAP 992

Query: 198  NTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
            N EMQ+W ++G  RIG+FA R I +GE LTYDY ++      +  C
Sbjct: 993  NCEMQRWSVNGHYRIGLFALRGIHEGEELTYDYNWDAFEFDDVTIC 1038


>gi|355558542|gb|EHH15322.1| hypothetical protein EGK_01394 [Macaca mulatta]
          Length = 2796

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 62/155 (40%), Positives = 93/155 (60%), Gaps = 3/155 (1%)

Query: 80   CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRG 137
            C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +K G
Sbjct: 2029 CVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAG 2088

Query: 138  EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
            +F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHSC P
Sbjct: 2089 QFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDP 2147

Query: 198  NTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            N EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2148 NCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2182


>gi|403167549|ref|XP_003327326.2| histone-lysine N-methyltransferase SETD2 [Puccinia graminis f. sp.
           tritici CRL 75-36-700-3]
 gi|375167080|gb|EFP82907.2| histone-lysine N-methyltransferase SETD2 [Puccinia graminis f. sp.
           tritici CRL 75-36-700-3]
          Length = 974

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 71/208 (34%), Positives = 109/208 (52%), Gaps = 9/208 (4%)

Query: 44  YVFIKRNIYLTKRI--KRRLEDDGIFCSCTASPGS--SGVCDRDCHCGMLLSS---CSSG 96
           +  I +N Y  K +   ++LED  + C CT +PG   S  C     C   L+        
Sbjct: 122 FTQIDQNWYQNKSLGTTKQLED-AMVCDCTWTPGDDPSWACSEHSGCINYLTQIECLQDQ 180

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C+C   C N+ FQ R    +++V T K G G+    D+ +  FV EY+GEVI  +   +R
Sbjct: 181 CRCREKCQNQRFQKRLYAPIEIVLTPKKGFGMRLQADVPKDTFVYEYLGEVIGVKALHKR 240

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           L      G  +FY  E+ +D  IDAT KG   R++NHSC PN  + KW++  + R+GIF 
Sbjct: 241 LKDYGQEGIKHFYFMELQKDQYIDATKKGGFGRFLNHSCNPNCYIGKWVVGRQLRMGIFT 300

Query: 217 TRDIKKGENLTYDYQYE-FLHDSLIAYC 243
            R ++ GE LT++Y  + + H++   +C
Sbjct: 301 KRAVRGGEELTFNYNVDRYGHEAQECFC 328


>gi|302677464|ref|XP_003028415.1| hypothetical protein SCHCODRAFT_70386 [Schizophyllum commune H4-8]
 gi|300102103|gb|EFI93512.1| hypothetical protein SCHCODRAFT_70386 [Schizophyllum commune H4-8]
          Length = 883

 Score =  122 bits (305), Expect = 2e-25,   Method: Composition-based stats.
 Identities = 71/211 (33%), Positives = 106/211 (50%), Gaps = 14/211 (6%)

Query: 47  IKRNIYLTKRI-KRRLEDDGIFCSCTASPGSSGVCDRDC--HCGMLLSSC---------- 93
           I  N Y   ++ K R   +G  C C    G      RD   H     S+C          
Sbjct: 66  IPENWYQYAKLGKSREAMEGSTCDCEYEEGEHAALGRDNLDHACGPRSNCINRLTQVECL 125

Query: 94  SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTC 153
              C C   C N+ FQ +    +++V+TEK G G+ A++DI+R  F+ EYVG+V++  + 
Sbjct: 126 EEECHCRQFCQNQRFQRKEYAPIEIVKTEKKGFGLRAEQDIQRDTFIYEYVGDVVNSPSF 185

Query: 154 EERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIG 213
           ++R+      G  +FY   + +D  IDAT  G   R+ NHSC PN  + KW +    R+G
Sbjct: 186 KKRMRDYAKEGIQHFYFMMLQKDEFIDATKSGGIGRFANHSCSPNCYVAKWTVGDRVRMG 245

Query: 214 IFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           IFA R+IKK E LT++Y  + + H +   YC
Sbjct: 246 IFAKRNIKKYEELTFNYNVDRYGHQAQTCYC 276


>gi|355745722|gb|EHH50347.1| hypothetical protein EGM_01160 [Macaca fascicularis]
          Length = 2904

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 62/155 (40%), Positives = 93/155 (60%), Gaps = 3/155 (1%)

Query: 80   CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRG 137
            C  DC   M+ + CS   C CG  C N+  Q    V+ ++  + E+ G GI   E +K G
Sbjct: 2105 CVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAG 2164

Query: 138  EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
            +F+IEY+GEV+ +Q    R+ +  H   ++ Y   ++  MVID+   GN++R+INHSC P
Sbjct: 2165 QFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDP 2223

Query: 198  NTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            N EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 2224 NCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2258


>gi|443709486|gb|ELU04158.1| hypothetical protein CAPTEDRAFT_93578, partial [Capitella teleta]
          Length = 1129

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 76/233 (32%), Positives = 120/233 (51%), Gaps = 11/233 (4%)

Query: 6    KNSDNSRIGHAFNKLLKQIG------NPVEFELPDWFIKPKAIPYVFIKRNIYL-TKRIK 58
            K S+++ +G +FN+ +K+        N ++ +L     + K   Y  IK N  + + +I 
Sbjct: 773  KASESNSLGKSFNQAVKEAALVNKMWNTIKEQLKGDIDQKKPSAYKHIKSNFPIGSVQIY 832

Query: 59   RRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMK 117
            +   +    C CTA   +      +C   + L  C    CK G  C N+ F  R      
Sbjct: 833  KSPSN---VCECTAQMENPCGEQGNCLNRLTLVECHPQMCKAGEKCQNQRFMKREYPDQS 889

Query: 118  LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
              +T   G G+  + DIK+G+FV EYVG++ID + C+ R+         NFY+  ++   
Sbjct: 890  AYRTGARGWGLKTNVDIKKGQFVNEYVGDLIDSEECQRRIKAAHDNNIVNFYMLTLDNRR 949

Query: 178  VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
            +IDA  KGN SR++NHSC PN   QKW ++G+ R+G+FA  DI  G  LT++Y
Sbjct: 950  IIDAGPKGNLSRFMNHSCDPNLVTQKWTVNGDVRVGLFAVCDIPAGTELTFNY 1002


>gi|302505469|ref|XP_003014441.1| hypothetical protein ARB_07003 [Arthroderma benhamiae CBS 112371]
 gi|291178262|gb|EFE34052.1| hypothetical protein ARB_07003 [Arthroderma benhamiae CBS 112371]
          Length = 973

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 94/164 (57%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG++C N+ FQ R    + +++TEK G G+ A+ D++  +F+
Sbjct: 163 DSDCINRATKMECVDNCGCGDNCQNQRFQKRQYADVTVIKTEKKGFGLRANTDLRPNDFI 222

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GEVI++    +R+ +    G  +FY   +N+   +DAT KGN  R+ NHSC PN  
Sbjct: 223 YEYIGEVINEPQFRKRMMQYDDEGIKHFYFMSLNKGEFVDATRKGNLGRFCNHSCNPNCY 282

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA + IK GE L ++Y  + +  D    YC
Sbjct: 283 IDKWVVGEKLRMGIFAEKHIKAGEELVFNYNVDRYGADPQPCYC 326


>gi|46122361|ref|XP_385734.1| hypothetical protein FG05558.1 [Gibberella zeae PH-1]
          Length = 1051

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 63/188 (33%), Positives = 100/188 (53%), Gaps = 6/188 (3%)

Query: 62  EDDGIFCSCTAS--PGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
           ++D + C C      G +  C  D  C      +   + G  C   C N+ FQ +    +
Sbjct: 249 DNDALDCECRDEWHDGKNLACGEDSDCINRATKMECSAEGGNCAGGCQNQRFQRKQYANV 308

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            +++TEK G G+ AD D++  +FV EY+GEVI++ T   R+ +    G  +FY   +N+ 
Sbjct: 309 SVIKTEKKGFGLRADSDLQPNDFVFEYIGEVINEPTFRRRMIQYDEEGIKHFYFMSLNKS 368

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
             +DAT KGN  R+ NHSC PN  + KW++  + R+GIF +R I+ GE L ++Y  + + 
Sbjct: 369 EFVDATKKGNYGRFCNHSCNPNCYVDKWVVGDKLRMGIFTSRKIQSGEELVFNYNVDRYG 428

Query: 236 HDSLIAYC 243
            D    YC
Sbjct: 429 ADPQPCYC 436


>gi|302664281|ref|XP_003023773.1| hypothetical protein TRV_02085 [Trichophyton verrucosum HKI 0517]
 gi|291187786|gb|EFE43155.1| hypothetical protein TRV_02085 [Trichophyton verrucosum HKI 0517]
          Length = 973

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 94/164 (57%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG++C N+ FQ R    + +++TEK G G+ A+ D++  +F+
Sbjct: 163 DSDCINRATKMECVDNCGCGDNCQNQRFQKRQYADVTVIKTEKKGFGLRANTDLRPNDFI 222

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GEVI++    +R+ +    G  +FY   +N+   +DAT KGN  R+ NHSC PN  
Sbjct: 223 YEYIGEVINEPQFRKRMMQYDDEGIKHFYFMSLNKGEFVDATRKGNLGRFCNHSCNPNCY 282

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA + IK GE L ++Y  + +  D    YC
Sbjct: 283 IDKWVVGEKLRMGIFAEKHIKAGEELVFNYNVDRYGADPQPCYC 326


>gi|290992999|ref|XP_002679121.1| set domain-containing protein [Naegleria gruberi]
 gi|284092736|gb|EFC46377.1| set domain-containing protein [Naegleria gruberi]
          Length = 153

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 55/126 (43%), Positives = 88/126 (69%), Gaps = 2/126 (1%)

Query: 108 FQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN 167
           FQ +  KK+++ +  + G G+ A E I +G F+IEYVGEV+D++  ++R  +  + GE +
Sbjct: 1   FQKKSWKKIEVKRAGEKGFGVFAKEKISKGTFIIEYVGEVLDEKIYKQR--QEAYDGERH 58

Query: 168 FYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLT 227
           +Y   +  + +IDA+ KGN +R+INHSC PN+ +QKW +  ++RIG+FA RDI+KGE +T
Sbjct: 59  YYFLSVGTNQIIDASKKGNNARFINHSCDPNSVLQKWTVGHQSRIGVFALRDIEKGEEVT 118

Query: 228 YDYQYE 233
           +DY  E
Sbjct: 119 FDYAME 124


>gi|410516927|sp|Q4IB50.2|SET2_GIBZE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
           specific; AltName: Full=SET domain-containing protein 2
          Length = 911

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 63/188 (33%), Positives = 100/188 (53%), Gaps = 6/188 (3%)

Query: 62  EDDGIFCSCTAS--PGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
           ++D + C C      G +  C  D  C      +   + G  C   C N+ FQ +    +
Sbjct: 109 DNDALDCECRDEWHDGKNLACGEDSDCINRATKMECSAEGGNCAGGCQNQRFQRKQYANV 168

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            +++TEK G G+ AD D++  +FV EY+GEVI++ T   R+ +    G  +FY   +N+ 
Sbjct: 169 SVIKTEKKGFGLRADSDLQPNDFVFEYIGEVINEPTFRRRMIQYDEEGIKHFYFMSLNKS 228

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
             +DAT KGN  R+ NHSC PN  + KW++  + R+GIF +R I+ GE L ++Y  + + 
Sbjct: 229 EFVDATKKGNYGRFCNHSCNPNCYVDKWVVGDKLRMGIFTSRKIQSGEELVFNYNVDRYG 288

Query: 236 HDSLIAYC 243
            D    YC
Sbjct: 289 ADPQPCYC 296


>gi|406697594|gb|EKD00852.1| histone-lysine N-methyltransferase [Trichosporon asahii var. asahii
           CBS 8904]
          Length = 843

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 60/149 (40%), Positives = 88/149 (59%), Gaps = 1/149 (0%)

Query: 83  DCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVI 141
           DC   ++   C +G CK G  C N+    +    + +VQTEK G G+ A EDI  G  V 
Sbjct: 208 DCINRIMFIECLAGECKAGKRCRNQRLARKQYAPVDVVQTEKKGFGLRAREDIPAGALVY 267

Query: 142 EYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
           EY+GEV+ ++T  +R+ +    G  +FY   + ++  IDAT KG   R+ NHSC PN E+
Sbjct: 268 EYIGEVVVEKTFRKRMAEYADEGIKHFYFMMLQKEEYIDATKKGGIGRFANHSCNPNCEV 327

Query: 202 QKWIIDGETRIGIFATRDIKKGENLTYDY 230
           QKW++    R+GIF  RD+KK E +T++Y
Sbjct: 328 QKWVVGRRMRMGIFTKRDVKKDEEITFNY 356


>gi|50302715|ref|XP_451294.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|74607741|sp|Q6CXP5.1|SET2_KLULA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
           specific; AltName: Full=SET domain-containing protein 2
 gi|49640425|emb|CAH02882.1| KLLA0A06600p [Kluyveromyces lactis]
          Length = 702

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 61/153 (39%), Positives = 90/153 (58%), Gaps = 3/153 (1%)

Query: 81  DRDCHCGMLLSSCSSG-CK--CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRG 137
           D DC     L  C +G CK  CG  C N+ FQ +    + + +TE+ G G+ A+ DI+  
Sbjct: 71  DSDCINRATLIECVNGLCKHSCGTDCQNQRFQKKAYADISVFKTERKGFGVRANSDIEPH 130

Query: 138 EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
            F+ EY+GEVI ++    R+ K   +G  +FY   +     IDAT KG  +R+ NHSC P
Sbjct: 131 NFIYEYIGEVIQEEEFRNRMVKYDQMGFKHFYFMMLQTGQFIDATLKGCIARFCNHSCNP 190

Query: 198 NTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
           N  + KW+++G+ ++GIFA R I KGE +T+DY
Sbjct: 191 NAYVNKWVVNGKLKMGIFANRHISKGEEVTFDY 223


>gi|408394323|gb|EKJ73531.1| hypothetical protein FPSE_06149 [Fusarium pseudograminearum CS3096]
          Length = 911

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 63/188 (33%), Positives = 100/188 (53%), Gaps = 6/188 (3%)

Query: 62  EDDGIFCSCTAS--PGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
           ++D + C C      G +  C  D  C      +   + G  C   C N+ FQ +    +
Sbjct: 109 DNDALDCECRDEWHDGKNLACGEDSDCINRATKMECSAEGGNCAGGCQNQRFQRKQYANV 168

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            +++TEK G G+ AD D++  +FV EY+GEVI++ T   R+ +    G  +FY   +N+ 
Sbjct: 169 SVIKTEKKGFGLRADSDLQPNDFVFEYIGEVINEPTFRRRMIQYDEEGIKHFYFMSLNKS 228

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
             +DAT KGN  R+ NHSC PN  + KW++  + R+GIF +R I+ GE L ++Y  + + 
Sbjct: 229 EFVDATKKGNYGRFCNHSCNPNCYVDKWVVGDKLRMGIFTSRKIQSGEELVFNYNVDRYG 288

Query: 236 HDSLIAYC 243
            D    YC
Sbjct: 289 ADPQPCYC 296


>gi|291225527|ref|XP_002732754.1| PREDICTED: Ash1l protein-like [Saccoglossus kowalevskii]
          Length = 2643

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 78/218 (35%), Positives = 112/218 (51%), Gaps = 16/218 (7%)

Query: 28   VEFELPD---WFIKPKAIP-------YVFIKRNIYLTKRIKRRLEDDGIFCSCT-ASPGS 76
            V+F LP    W      +P       Y  I+ NIY+  +    LE   + C+C   + G 
Sbjct: 1781 VDFSLPYDIWWLHDMDKLPPKNDDSKYKKIRSNIYVDVKPYSGLE--AVSCNCMKPTNGI 1838

Query: 77   SGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
               C  DC   M+ + CS   C CG+ C N   Q    +  ++ + TE  G GI   + I
Sbjct: 1839 DRGCGDDCINKMVFTECSPNLCPCGDHCSNMSIQKHEWITTLERLPTENRGWGIRTKDAI 1898

Query: 135  KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
            K G+F++EYVGEV+ +    +R  K  +    + Y   +N  MVID    G + R++NHS
Sbjct: 1899 KSGQFILEYVGEVVSENEFWQRAMK-NYQNRRHHYCLNLNSGMVIDGYRMGCEGRFVNHS 1957

Query: 195  CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            C PN EMQKW ++G  RIG+FA +DI+ G  LTYDY +
Sbjct: 1958 CEPNCEMQKWSVNGVYRIGLFALKDIQPGSELTYDYNF 1995


>gi|195377533|ref|XP_002047543.1| GJ11874 [Drosophila virilis]
 gi|194154701|gb|EDW69885.1| GJ11874 [Drosophila virilis]
          Length = 2343

 Score =  121 bits (304), Expect = 2e-25,   Method: Composition-based stats.
 Identities = 69/214 (32%), Positives = 112/214 (52%), Gaps = 12/214 (5%)

Query: 27   PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC 86
            P    +P W        Y  I+ N+Y           D   C+C   P +SG C  +C  
Sbjct: 1414 PTRHVVPSW-------NYRKIRTNVYAESVRPNLAGFDHPNCNC--KPQASGACLDNCLN 1464

Query: 87   GMLLSSCS-SGCKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADEDIKRGEFVIEYV 144
             M+ + C+ S C  G+ C N+  Q   V   ++   T   G G+     I +G +++EYV
Sbjct: 1465 RMVYTECAPSNCPAGDKCRNQKIQRHEVAPGVERFMTADKGWGVRTKLPIAKGTYILEYV 1524

Query: 145  GEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKW 204
            GEV+ ++  ++R+  + +L +T+ Y   ++  +VID    G+  R++NHSC PN EMQKW
Sbjct: 1525 GEVVTEREFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKW 1583

Query: 205  IIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             ++G +R+ +FA R I++GE LTYDY +   + S
Sbjct: 1584 SVNGLSRMVLFAKRAIEQGEELTYDYNFSLFNPS 1617


>gi|402217077|gb|EJT97159.1| SET domain-containing protein [Dacryopinax sp. DJM-731 SS1]
          Length = 711

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 70/208 (33%), Positives = 112/208 (53%), Gaps = 8/208 (3%)

Query: 44  YVFIKRNIYLTKRIKR-RLEDDGIFCSC-----TASPGSSGVCDRDCHCGMLLSSC-SSG 96
           ++ +  N Y    + R R +DD + C C     T  P ++     DC   +    C +  
Sbjct: 34  FIELSDNAYQYNSLGRSRQKDDCMECDCQFDYATDPPEAACGYGSDCINRLTQVECLADE 93

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C+C + C N+ FQ +    +++V T+K G G+ A  D++   F+ EY+GEVI+  T  +R
Sbjct: 94  CRCRSHCQNQRFQRKQYADIEIVSTDKKGYGLRAATDLQPDSFIYEYIGEVINQTTFLKR 153

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           + +    G  +FY   + R+  IDAT KG   R+ NHSC PN  + KW++    R+GIFA
Sbjct: 154 MHEYADEGIKHFYFMMLQREEYIDATKKGGIGRFANHSCNPNCYVAKWVVGPRVRMGIFA 213

Query: 217 TRDIKKGENLTYDYQYE-FLHDSLIAYC 243
            R+I +GE LT++Y  + + HD+   YC
Sbjct: 214 KREISRGEELTFNYNVDRYGHDAQPCYC 241


>gi|170587756|ref|XP_001898640.1| SET domain containing protein [Brugia malayi]
 gi|158593910|gb|EDP32504.1| SET domain containing protein [Brugia malayi]
          Length = 1449

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 76/219 (34%), Positives = 110/219 (50%), Gaps = 21/219 (9%)

Query: 20  LLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGV 79
           +LK  G PV++E  D  I       +    ++    ++ R       FC  T+   + G 
Sbjct: 550 MLKSAGEPVQYEYLDENI-------ILCDESLIKEAKVVR------CFCEPTSVEIAEGR 596

Query: 80  -CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGE 138
            C   C    L + C S C  G  C N+ F N+   K+++      G G+ A E ++ G 
Sbjct: 597 GCSSGCINRELYTECGSRCPSGAGCANRRFHNKQYAKVEVFNAGIKGWGLRAAEPLEPGR 656

Query: 139 FVIEYVGEVIDDQTCEERLWKMKHLGE----TNFYLCEINRDMVIDATYKGNKSRYINHS 194
           F+IEYVGEVID    EE + + +  G+     + YL  +    VIDAT KGN SR+INHS
Sbjct: 657 FIIEYVGEVID---AEEMIRRGRRYGKDPKHVHHYLMALKNGAVIDATAKGNVSRFINHS 713

Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           C PN E QKW ++ + R+G F  + I  GE + +DYQ E
Sbjct: 714 CDPNCESQKWTVNRQLRVGFFVIKPIALGEEIVFDYQLE 752


>gi|452846178|gb|EME48111.1| hypothetical protein DOTSEDRAFT_167709 [Dothistroma septosporum
           NZE10]
          Length = 963

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 60/168 (35%), Positives = 95/168 (56%), Gaps = 3/168 (1%)

Query: 79  VCDRDCHCGMLLS--SCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKR 136
            CD D  C    +   C   C CG+ C N+ FQ +    + +++TEK G G+ AD++++ 
Sbjct: 172 ACDEDSDCINRATKMECVGDCGCGSKCQNRRFQRKKYADVTVIKTEKKGYGLRADKELRP 231

Query: 137 GEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCC 196
           G+FV EY+GEVI +     R+ +    G  +FY   + +   +DAT KGN  R+ NHSC 
Sbjct: 232 GDFVYEYIGEVIGENVFRRRMQQYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCN 291

Query: 197 PNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           PN  + KW+++ + R+GIF  R+I+ GE L ++Y  + +  D    YC
Sbjct: 292 PNCYVDKWVVNDKLRMGIFVERNIQAGEELVFNYNVDRYGADPQPCYC 339


>gi|340515729|gb|EGR45981.1| predicted protein [Trichoderma reesei QM6a]
          Length = 933

 Score =  121 bits (304), Expect = 2e-25,   Method: Composition-based stats.
 Identities = 58/177 (32%), Positives = 95/177 (53%), Gaps = 5/177 (2%)

Query: 62  EDDGIFCSCTAS--PGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
           + D + C C      G +  C  D  C      +   +    CG  C N+ FQ +    +
Sbjct: 123 DSDALDCDCREEWRDGENFACGEDSDCINRATKMECSADSANCGGGCQNRRFQRKQYADV 182

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            +++TEK G G+ A+  ++  +F+ EY+GEVI++ T   R+ +    G  +FY   +N++
Sbjct: 183 TVIKTEKKGFGLRANSPLEPNDFIYEYIGEVINEPTFRRRMLQYDEEGIKHFYFMSLNKN 242

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
             +DAT KGN  R+ NHSC PN  + KW++  + R+GIFA R I+ GE L ++Y  +
Sbjct: 243 EFVDATKKGNLGRFCNHSCSPNCFVDKWVVGDKLRMGIFALRKIQAGEELVFNYNVD 299


>gi|296808927|ref|XP_002844802.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
 gi|238844285|gb|EEQ33947.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
          Length = 969

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 59/162 (36%), Positives = 93/162 (57%), Gaps = 1/162 (0%)

Query: 83  DCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIE 142
           DC        C   C CG++C N+ FQ R    + +++TEK G G+ A+ D++  +F+ E
Sbjct: 164 DCINRATKMECVQNCGCGDNCQNQRFQKRQYADVTVIKTEKKGFGLRANTDLQPNDFIFE 223

Query: 143 YVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQ 202
           Y+GEVI++    +R+ +    G  +FY   +N+   +DAT KGN  R+ NHSC PN  + 
Sbjct: 224 YIGEVINEPQFRKRMIQYDEEGIKHFYFMSLNKGEFVDATRKGNLGRFCNHSCNPNCYID 283

Query: 203 KWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           KW++  + R+GIFA R +K GE L ++Y  + +  D    YC
Sbjct: 284 KWVVGEKLRMGIFAERHVKAGEELVFNYNVDRYGADPQPCYC 325


>gi|219841942|gb|AAI43511.1| WHSC1L1 protein [Homo sapiens]
          Length = 1426

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 72/199 (36%), Positives = 103/199 (51%), Gaps = 15/199 (7%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1069 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1127

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1128 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1187

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +              N D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1188 IKRAHE-----------NSDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1236

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1237 LCDIPAGMELTFNYNLDCL 1255


>gi|397521375|ref|XP_003830772.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2 [Pan
            paniscus]
          Length = 1426

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 72/199 (36%), Positives = 103/199 (51%), Gaps = 15/199 (7%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1069 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1127

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1128 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1187

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +              N D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1188 IKRAHE-----------NSDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1236

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1237 LCDIPAGMELTFNYNLDCL 1255


>gi|255943807|ref|XP_002562671.1| Pc20g01100 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211587406|emb|CAP85439.1| Pc20g01100 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 688

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 59/164 (35%), Positives = 93/164 (56%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C+  C CG  C N+ FQ R    + +++TEK G G+ A++++  GE +
Sbjct: 145 DSDCINRATKIECAGECGCGPDCQNQRFQKRQFAPVSVIKTEKKGFGLRAEKNLDPGELI 204

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EYVGEV+ +Q   +R+ +    G  +FY   +N+   +DAT +GN  R+ NHSC PN  
Sbjct: 205 YEYVGEVVGEQQFRKRMRQYDEEGIKHFYFMSLNKGEFVDATKRGNLGRFCNHSCNPNCY 264

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA R ++ GE L ++Y  + +  D    YC
Sbjct: 265 VDKWVVGEKLRMGIFAERAVQAGEELVFNYNVDRYGADPQPCYC 308


>gi|301629157|ref|XP_002943714.1| PREDICTED: hypothetical protein LOC100496979 [Xenopus (Silurana)
           tropicalis]
          Length = 1666

 Score =  121 bits (303), Expect = 3e-25,   Method: Composition-based stats.
 Identities = 63/167 (37%), Positives = 94/167 (56%), Gaps = 4/167 (2%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRP-VKKMKLVQTEKCG 125
           C C  +    G C  DC   M+ + CS + C CG  C N+  Q    V+ ++  + E  G
Sbjct: 4   CKCPDNQSEKG-CVEDCLNRMIFAECSPNTCPCGEQCNNQCIQRHEWVQCLERFRAEGKG 62

Query: 126 AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKG 185
            GI   E +K  +F+IEY+GEV+ +     R  +  H   ++ Y   ++  MVID+   G
Sbjct: 63  WGIRTKEPLKASQFIIEYLGEVVSETEFRNRTIEQYH-NHSDHYCLSLDSGMVIDSYRMG 121

Query: 186 NKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           N++R+INHSC PN EMQKW ++G  RIG++A +D+  G  LTYDY +
Sbjct: 122 NEARFINHSCDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 168


>gi|332825882|ref|XP_003311721.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Pan troglodytes]
          Length = 1426

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 72/199 (36%), Positives = 103/199 (51%), Gaps = 15/199 (7%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1069 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1127

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1128 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1187

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +              N D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1188 IKRAHE-----------NSDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1236

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1237 LCDIPAGMELTFNYNLDCL 1255


>gi|296222019|ref|XP_002757007.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
            [Callithrix jacchus]
          Length = 1427

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 72/199 (36%), Positives = 103/199 (51%), Gaps = 15/199 (7%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1070 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1128

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1129 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1188

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +              N D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHE-----------NSDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1237

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1238 LCDIPAGMELTFNYNLDCL 1256


>gi|327300913|ref|XP_003235149.1| SET and WW domain-containing protein [Trichophyton rubrum CBS
           118892]
 gi|326462501|gb|EGD87954.1| SET and WW domain-containing protein [Trichophyton rubrum CBS
           118892]
          Length = 971

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 94/164 (57%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG++C N+ FQ R    + +++TEK G G+ A+ D++  +F+
Sbjct: 162 DSDCINRATKMECVDNCGCGDNCQNQRFQKRQYADVTVIKTEKKGFGLRANTDLRPNDFI 221

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GEVI++    +R+ +    G  +FY   +N+   +DAT KGN  R+ NHSC PN  
Sbjct: 222 FEYIGEVINEPQFRKRMMQYDDEGIKHFYFMSLNKGEFVDATRKGNLGRFCNHSCNPNCY 281

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA + IK GE L ++Y  + +  D    YC
Sbjct: 282 IDKWVVGEKLRMGIFAEKYIKAGEELVFNYNVDRYGADPQPCYC 325


>gi|315048637|ref|XP_003173693.1| histone-lysine N-methyltransferase [Arthroderma gypseum CBS 118893]
 gi|311341660|gb|EFR00863.1| histone-lysine N-methyltransferase [Arthroderma gypseum CBS 118893]
          Length = 967

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 65/188 (34%), Positives = 102/188 (54%), Gaps = 7/188 (3%)

Query: 63  DDGIFCSCT----ASPGSSGVC--DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
           D  + C C     +  G +  C  D DC        C   C CG++C N+ FQ R    +
Sbjct: 138 DHAMECDCAEEWNSKTGINEACGEDSDCINRATKMECVQNCGCGDNCQNQRFQKRQYADV 197

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            +++TEK G G+ A+ D++  +F+ EY+GEVI++    +R+ +    G  +FY   +N+ 
Sbjct: 198 TVIKTEKKGFGLRANADLRPNDFIFEYIGEVINEPQFRKRMIQYDDEGIKHFYFMSLNKG 257

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
             +DAT KGN  R+ NHSC PN  + KW++  + R+GIFA + IK GE L ++Y  + + 
Sbjct: 258 EFVDATRKGNLGRFCNHSCNPNCYIDKWVVGEKLRMGIFAEKYIKAGEELVFNYNVDRYG 317

Query: 236 HDSLIAYC 243
            D    YC
Sbjct: 318 ADPQPCYC 325


>gi|58269858|ref|XP_572085.1| histone-lysine N-methyltransferase [Cryptococcus neoformans var.
           neoformans JEC21]
 gi|338819405|sp|P0CO28.1|SET2_CRYNJ RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
           specific; AltName: Full=SET domain-containing protein 2
 gi|57228321|gb|AAW44778.1| histone-lysine N-methyltransferase, putative [Cryptococcus
           neoformans var. neoformans JEC21]
          Length = 834

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 69/205 (33%), Positives = 107/205 (52%), Gaps = 11/205 (5%)

Query: 45  VFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGV----CDRDCHCGMLLSSCSSG-CKC 99
           V+ +++I L+K       D+ + C C  +           D DC    L   C +G C+ 
Sbjct: 112 VYERKDIGLSKE-----NDEMMVCECVYNRHDPDADPCGPDSDCINRALYIECIAGECRA 166

Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
           G  C N+ F  R    + +V TEK G G+ A   I     + EY+GEV+ ++T  +R+ +
Sbjct: 167 GKHCHNQQFSKRQYANVDVVLTEKKGYGLRASSTIPANTLIYEYIGEVVAEKTFRKRMQQ 226

Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
               G  +FY   + ++  IDAT KG   R+ NHSC PN E+QKW++    R+GIF  RD
Sbjct: 227 YADEGIRHFYFMMLQKEEYIDATKKGGIGRFANHSCNPNCEVQKWVVGRRLRMGIFTKRD 286

Query: 220 IKKGENLTYDYQYE-FLHDSLIAYC 243
           + KGE +T++Y  + + HD+   YC
Sbjct: 287 VIKGEEITFNYNVDRYGHDAQTCYC 311


>gi|134113727|ref|XP_774448.1| hypothetical protein CNBG0940 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|338819404|sp|P0CO29.1|SET2_CRYNB RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
           specific; AltName: Full=SET domain-containing protein 2
 gi|50257086|gb|EAL19801.1| hypothetical protein CNBG0940 [Cryptococcus neoformans var.
           neoformans B-3501A]
          Length = 834

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 69/205 (33%), Positives = 107/205 (52%), Gaps = 11/205 (5%)

Query: 45  VFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGV----CDRDCHCGMLLSSCSSG-CKC 99
           V+ +++I L+K       D+ + C C  +           D DC    L   C +G C+ 
Sbjct: 112 VYERKDIGLSKE-----NDEMMVCECVYNRHDPDADPCGPDSDCINRALYIECIAGECRA 166

Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
           G  C N+ F  R    + +V TEK G G+ A   I     + EY+GEV+ ++T  +R+ +
Sbjct: 167 GKHCHNQQFSKRQYANVDVVLTEKKGYGLRASSTIPANTLIYEYIGEVVAEKTFRKRMQQ 226

Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
               G  +FY   + ++  IDAT KG   R+ NHSC PN E+QKW++    R+GIF  RD
Sbjct: 227 YADEGIRHFYFMMLQKEEYIDATKKGGIGRFANHSCNPNCEVQKWVVGRRLRMGIFTKRD 286

Query: 220 IKKGENLTYDYQYE-FLHDSLIAYC 243
           + KGE +T++Y  + + HD+   YC
Sbjct: 287 VIKGEEITFNYNVDRYGHDAQTCYC 311


>gi|321479504|gb|EFX90460.1| hypothetical protein DAPPUDRAFT_39415 [Daphnia pulex]
          Length = 799

 Score =  120 bits (302), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 67/202 (33%), Positives = 111/202 (54%), Gaps = 10/202 (4%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCT-----ASPGSSGVCDRDCHCGMLLSSCSSG-C 97
           Y  I+ N+Y    +K    +D   C+CT          +  C  DC   M+ + CSS  C
Sbjct: 95  YRKIRSNVYYD--VKPPFTNDAEACNCTLPQPDTEDAEAQCCGDDCLNRMVYTECSSQMC 152

Query: 98  KCGNSCLNKPFQ-NRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
             G+ C+N+  Q +R  + ++   T++ G G+    ++K G F++EYVGEV+ D+  +ER
Sbjct: 153 PVGDKCMNQRIQRHRWAQGLERFMTKEKGWGVRCRNELKTGVFILEYVGEVVSDKEFKER 212

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +  + ++ +T+ Y   ++  +VID    G   R++NHSC PN EMQKW ++G  R+ +FA
Sbjct: 213 MHTV-YVHDTHHYCLHLDGGLVIDGHRMGGDGRFVNHSCAPNCEMQKWSVNGLPRMALFA 271

Query: 217 TRDIKKGENLTYDYQYEFLHDS 238
            RDI   E L+YDY +   + +
Sbjct: 272 LRDISPLEELSYDYNFSLFNPA 293


>gi|254566589|ref|XP_002490405.1| Histone methyltransferase [Komagataella pastoris GS115]
 gi|238030201|emb|CAY68124.1| Histone methyltransferase [Komagataella pastoris GS115]
 gi|328350800|emb|CCA37200.1| histone-lysine N-methyltransferase SETD2 [Komagataella pastoris CBS
           7435]
          Length = 722

 Score =  120 bits (302), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 67/174 (38%), Positives = 96/174 (55%), Gaps = 7/174 (4%)

Query: 64  DGIFCSCTAS----PGSSGVCDRDCHCGMLLSS--CSSG-CKCGNSCLNKPFQNRPVKKM 116
           DG+ C C  +     G +  C     C   L+S  C SG C CG+ C N+ FQ +    +
Sbjct: 65  DGMSCDCEENWDTVSGINNACGEYSECINRLTSIECISGTCSCGDDCQNQRFQKKQYAPI 124

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            + +TEK G G+ A  DI++  F+ EY+GEVID+ T  +R     + G  +FY   + + 
Sbjct: 125 AVFETEKKGYGVRAQADIRQDAFIYEYLGEVIDESTFRKRKENYDNQGLEHFYFMMLQKG 184

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
             IDAT KG   R+ NHSC PN  + KW +  + R+GIFA R+I KGE + +DY
Sbjct: 185 EFIDATAKGGLGRFCNHSCRPNAYVDKWEVGNKLRMGIFAKREIYKGEEICFDY 238


>gi|312072804|ref|XP_003139232.1| hypothetical protein LOAG_03647 [Loa loa]
 gi|307765598|gb|EFO24832.1| hypothetical protein LOAG_03647 [Loa loa]
          Length = 1422

 Score =  120 bits (302), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 67/172 (38%), Positives = 93/172 (54%), Gaps = 8/172 (4%)

Query: 67  FCSCTASPGSSGV-CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCG 125
           FC  T +  + G  C   C    L + C S C  G  C N+ F N+   K+++      G
Sbjct: 554 FCEPTLAEIAEGRGCSSGCINRELYTECGSRCPSGVGCANRRFHNKQYAKVEVFNAGVKG 613

Query: 126 AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGE----TNFYLCEINRDMVIDA 181
            G+ A E ++ G F+IEY+GEVID    EE + + +  G+     + YL  +    VIDA
Sbjct: 614 WGLRAAEPLEPGRFIIEYIGEVID---AEEMIRRGRRYGKDPKHVHHYLMALKNGAVIDA 670

Query: 182 TYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           T KGN SR+INHSC PN E QKW +D + R+G F  + I  GE + +DYQ E
Sbjct: 671 TAKGNVSRFINHSCDPNCESQKWTVDRQLRVGFFVIKPIALGEEIVFDYQLE 722


>gi|74210080|dbj|BAE21322.1| unnamed protein product [Mus musculus]
          Length = 484

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 74/199 (37%), Positives = 105/199 (52%), Gaps = 15/199 (7%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSS--CSSG- 96
           K  PY  IK N  + K ++ ++ D      C   PG    C  +  C   +S   C    
Sbjct: 128 KPPPYKHIKANKVIGK-VQVQVADLSEIPRCNCKPGDENPCGLESQCLNRMSQYECHPQV 186

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 187 CPAGDRCQNQCFTKRLYPNAEVIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 246

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           + K  H           N D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 247 I-KRAHE----------NSDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 295

Query: 217 TRDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  + L
Sbjct: 296 LCDIPAGMELTFNYNLDCL 314


>gi|297295821|ref|XP_001094467.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific, partial [Macaca mulatta]
          Length = 2329

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 66/169 (39%), Positives = 97/169 (57%), Gaps = 12/169 (7%)

Query: 68   CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
            C+C A+  +    D +C   MLL  C  + C  G  C N+ F  R   ++++ +T + G 
Sbjct: 1539 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1598

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+    DIK+GEFV EYVGE+ID++ C  R+              + + D +IDA  KGN
Sbjct: 1599 GLRTKTDIKKGEFVNEYVGELIDEEECRARI-----------RYAQEHEDRIIDAGPKGN 1647

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
             +R++NH C PN E QKW ++G+TR+G+FA  DIK G  LT++Y  E L
Sbjct: 1648 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1696


>gi|389743273|gb|EIM84458.1| hypothetical protein STEHIDRAFT_82382 [Stereum hirsutum FP-91666
           SS1]
          Length = 1040

 Score =  120 bits (302), Expect = 4e-25,   Method: Composition-based stats.
 Identities = 65/188 (34%), Positives = 97/188 (51%), Gaps = 8/188 (4%)

Query: 64  DGIFCSCTASPGSSG----VCDRDCHCGMLLSSCS---SGCKCGNSCLNKPFQNRPVKKM 116
           + + C CT  P +       C  D  C   L+        C+C + C N+ FQ      +
Sbjct: 241 ESMTCDCTYDPSAGDDPDIACGHDSDCINRLTQVECMPDDCRCKSYCRNQRFQRLQYAPI 300

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            +VQTEK G G+ A  D+++  F+ EYVG+VI   T  +R+ +    G  +FY   + +D
Sbjct: 301 DIVQTEKKGFGLRAAADLRKDTFIYEYVGDVISHPTFVKRMREYAEEGIRHFYFMMLQKD 360

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
             IDAT +G   R+ NHSC PN  + KW +    R+GIFA R I+  E LT++Y  + + 
Sbjct: 361 EYIDATKRGGIGRFANHSCNPNCYVAKWTVGSRIRMGIFANRFIRANEELTFNYNVDRYG 420

Query: 236 HDSLIAYC 243
           HD+   YC
Sbjct: 421 HDAQPCYC 428


>gi|366997071|ref|XP_003678298.1| hypothetical protein NCAS_0I02880 [Naumovozyma castellii CBS 4309]
 gi|342304169|emb|CCC71956.1| hypothetical protein NCAS_0I02880 [Naumovozyma castellii CBS 4309]
          Length = 726

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 68/193 (35%), Positives = 106/193 (54%), Gaps = 6/193 (3%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSC--TASPGSSGVCDR--DCHCGMLLSSCSSGC-- 97
           +  +K+  Y +K++     +D I C C   ++ G +  CD   DC   + L  C +    
Sbjct: 30  FTELKKCTYSSKKLGNSKNNDFIECDCYEDSTNGVNHACDENSDCINRLTLIECVNDLCY 89

Query: 98  KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
            CGN C N+ FQ      + + +T+  G G+ A+ DI+  EF+ EY GEVID++   +R+
Sbjct: 90  SCGNDCQNQRFQKSQYADISIFKTKMKGYGVRANADIETNEFIYEYTGEVIDEEIFRDRM 149

Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
            +       +FY   +     IDAT KG+ +R+ NHSC PN  + KW + G+ R+GIFA+
Sbjct: 150 IEYDEKKFKHFYFMMLQNCEFIDATLKGSLARFCNHSCNPNAYVNKWEVAGKLRMGIFAS 209

Query: 218 RDIKKGENLTYDY 230
           R I KGE +T+DY
Sbjct: 210 RKIIKGEEITFDY 222


>gi|225562106|gb|EEH10386.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus G186AR]
          Length = 1000

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 92/164 (56%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG+ C N+ FQ R    + +++TEK G G+ A+ D++  +F+
Sbjct: 173 DSDCINRATKMECVGDCGCGDECENQRFQRRQYADVSVIKTEKKGYGLRAESDLRPNQFI 232

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GEVI++    +R+      G  +FY   +++   +DAT KGN  R+ NHSC PN  
Sbjct: 233 FEYIGEVINETLFRKRMLSYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCY 292

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA R IK GE L ++Y  + +  D    YC
Sbjct: 293 VDKWVVGEKLRMGIFAERHIKAGEELVFNYNVDRYGADPQPCYC 336


>gi|325091551|gb|EGC44861.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus H88]
          Length = 1000

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 92/164 (56%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG+ C N+ FQ R    + +++TEK G G+ A+ D++  +F+
Sbjct: 173 DSDCINRATKMECVGDCGCGDECENQRFQRRQYAHVSVIKTEKKGYGLRAESDLRPNQFI 232

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GEVI++    +R+      G  +FY   +++   +DAT KGN  R+ NHSC PN  
Sbjct: 233 FEYIGEVINETLFRKRMLSYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCY 292

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA R IK GE L ++Y  + +  D    YC
Sbjct: 293 VDKWVVGEKLRMGIFAERHIKAGEELVFNYNVDRYGADPQPCYC 336


>gi|410911836|ref|XP_003969396.1| PREDICTED: histone-lysine N-methyltransferase ASH1L-like [Takifugu
            rubripes]
          Length = 2782

 Score =  120 bits (302), Expect = 4e-25,   Method: Composition-based stats.
 Identities = 78/227 (34%), Positives = 120/227 (52%), Gaps = 18/227 (7%)

Query: 19   KLLKQIGNPVEFELP---------DWFIKPKAIP-YVFIKRNIYLTKRIKRRLEDDGIFC 68
            K L+Q    ++F+LP         D   K   +P Y  I+ N+Y+   +K     +   C
Sbjct: 1875 KYLRQ--KRIDFQLPYDILWLWKHDQLYKRPDVPLYKKIRSNVYVD--VKPLSGYETTTC 1930

Query: 69   SC-TASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRP-VKKMKLVQTEKCG 125
            +C T +      C  DC   M  + CS S C   + C N+  Q    V+ ++  +TE  G
Sbjct: 1931 NCRTPNDRIEKSCLDDCLNRMSFAECSPSTCPSADQCDNQHIQRHDWVQCLERFRTEGKG 1990

Query: 126  AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKG 185
             GI   E ++ G+F+IEY+GEV+ +Q    R+ + ++   +  Y   ++  MVID+   G
Sbjct: 1991 WGIRTKEPLRAGQFIIEYLGEVVSEQEFRSRMME-QYFSHSGNYCLNLDSGMVIDSYRMG 2049

Query: 186  NKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            N++R+INHSC PN EMQKW ++G  RIG+FA  +I  G  LTYDY +
Sbjct: 2050 NEARFINHSCEPNCEMQKWSVNGVYRIGLFALGEIPSGTELTYDYNF 2096


>gi|170029985|ref|XP_001842871.1| histone-lysine N-methyltransferase ash1 [Culex quinquefasciatus]
 gi|167865331|gb|EDS28714.1| histone-lysine N-methyltransferase ash1 [Culex quinquefasciatus]
          Length = 2119

 Score =  120 bits (302), Expect = 4e-25,   Method: Composition-based stats.
 Identities = 69/214 (32%), Positives = 107/214 (50%), Gaps = 16/214 (7%)

Query: 27   PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC 86
            P    +P W        Y  I+ N+Y    +K     D   C+C    G    C  DC  
Sbjct: 1723 PGRHSVPSW-------NYRKIRTNVYYD--VKPNPSSDHPQCNCKPDSG----CQDDCLN 1769

Query: 87   GMLLSSC-SSGCKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRGEFVIEYV 144
             ++   C    C CG+ C N   Q       ++   TE  G GI + E +K+G F++EY+
Sbjct: 1770 RLVYVECVPENCPCGDRCQNTKIQRHEYAPGLERFMTELKGWGIRSKEGVKKGSFIMEYL 1829

Query: 145  GEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKW 204
            GEV+ ++  +ER+ +  +L +T+ Y   +   +VID    G+  R++NHSC PN EMQKW
Sbjct: 1830 GEVVTEKEFKERM-RTIYLNDTHHYCLNLTGGLVIDGHRMGSDCRFVNHSCAPNCEMQKW 1888

Query: 205  IIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             ++G  R+ ++A+RDI   E L YDY +   + S
Sbjct: 1889 SVNGLFRMALYASRDIPPHEELCYDYNFSLFNPS 1922


>gi|326481316|gb|EGE05326.1| histone-lysine N-methyltransferase [Trichophyton equinum CBS
           127.97]
          Length = 970

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 94/164 (57%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG++C N+ FQ R    + +++TEK G G+ A+ D++  +F+
Sbjct: 163 DSDCINRATKMECVDNCGCGDNCQNQRFQKRQYADVTVIKTEKKGFGLRANTDLRPNDFI 222

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GEVI++    +R+ +    G  +FY   +N+   +DAT KGN  R+ NHSC PN  
Sbjct: 223 FEYIGEVINEPQFRKRMIQYDDEGIKHFYFMSLNKGEFVDATRKGNLGRFCNHSCNPNCY 282

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA + IK GE L ++Y  + +  D    YC
Sbjct: 283 IDKWVVGEKLRMGIFAEKYIKAGEELVFNYNVDRYGADPQPCYC 326


>gi|441621285|ref|XP_003269632.2| PREDICTED: histone-lysine N-methyltransferase NSD3 [Nomascus
           leucogenys]
          Length = 505

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 72/199 (36%), Positives = 103/199 (51%), Gaps = 15/199 (7%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
           K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 217 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 275

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G+ C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 276 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 335

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           + +              N D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 336 IKRAHE-----------NSDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 384

Query: 217 TRDIKKGENLTYDYQYEFL 235
             DI  G  LT++Y  + L
Sbjct: 385 LCDIPAGMELTFNYNLDCL 403


>gi|326468707|gb|EGD92716.1| histone-lysine N-methyltransferase [Trichophyton tonsurans CBS
           112818]
          Length = 970

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 94/164 (57%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG++C N+ FQ R    + +++TEK G G+ A+ D++  +F+
Sbjct: 163 DSDCINRATKMECVDNCGCGDNCQNQRFQKRQYADVTVIKTEKKGFGLRANTDLRPNDFI 222

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GEVI++    +R+ +    G  +FY   +N+   +DAT KGN  R+ NHSC PN  
Sbjct: 223 FEYIGEVINEPQFRKRMIQYDDEGIKHFYFMSLNKGEFVDATRKGNLGRFCNHSCNPNCY 282

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA + IK GE L ++Y  + +  D    YC
Sbjct: 283 IDKWVVGEKLRMGIFAEKYIKAGEELVFNYNVDRYGADPQPCYC 326


>gi|395847339|ref|XP_003796336.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 3
            [Otolemur garnettii]
          Length = 1427

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 72/199 (36%), Positives = 102/199 (51%), Gaps = 15/199 (7%)

Query: 40   KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
            K  PY  IK N  + K ++ ++ D      C   P     C  +  C   ML   C    
Sbjct: 1070 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1128

Query: 97   CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
            C  G  C N+ F  R     ++++TE+ G G+     IK+GEFV EYVGE+ID++ C  R
Sbjct: 1129 CPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1188

Query: 157  LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
            + +              N D +IDA  KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHE-----------NSDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1237

Query: 217  TRDIKKGENLTYDYQYEFL 235
              DI  G  LT++Y  + L
Sbjct: 1238 LCDIPAGMELTFNYNLDCL 1256


>gi|448089486|ref|XP_004196819.1| Piso0_004045 [Millerozyma farinosa CBS 7064]
 gi|448093762|ref|XP_004197850.1| Piso0_004045 [Millerozyma farinosa CBS 7064]
 gi|359378241|emb|CCE84500.1| Piso0_004045 [Millerozyma farinosa CBS 7064]
 gi|359379272|emb|CCE83469.1| Piso0_004045 [Millerozyma farinosa CBS 7064]
          Length = 744

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 61/155 (39%), Positives = 89/155 (57%), Gaps = 3/155 (1%)

Query: 79  VCDRDCHCGMLLSSCSSGCK---CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIK 135
            CD D  C   L+S     K   CG  C N+ FQ +    + + QTE  G G+ A++DI 
Sbjct: 72  ACDDDSDCINRLTSVECTNKYGTCGKDCQNQRFQKKQYASVAVFQTEMKGYGLRAEKDIP 131

Query: 136 RGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSC 195
              F+ EY GEVID++   +R+ +       +FY   + +D  IDAT KG+ +R+ NHSC
Sbjct: 132 DSSFIYEYTGEVIDEEAFRKRMVQYDEKNIKHFYFMMLKQDSFIDATRKGSLARFCNHSC 191

Query: 196 CPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
            PN  + KW++  + R+GIFA R IK+GE +T+DY
Sbjct: 192 NPNAYVDKWVVGEKLRMGIFAKRTIKRGEEITFDY 226


>gi|260800140|ref|XP_002594994.1| hypothetical protein BRAFLDRAFT_99284 [Branchiostoma floridae]
 gi|229280233|gb|EEN51005.1| hypothetical protein BRAFLDRAFT_99284 [Branchiostoma floridae]
          Length = 1541

 Score =  120 bits (301), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 70/202 (34%), Positives = 110/202 (54%), Gaps = 3/202 (1%)

Query: 37   IKPKAIPYVFIKRNIYL-TKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC-S 94
            I  K  P+  IK N  + + +I      +   C C  +  S    + DC    L+  C  
Sbjct: 1057 IGKKPPPFRMIKTNKPVGSVQIHTADPSEIQRCECKVTDESPCGPESDCLNRNLMIECHP 1116

Query: 95   SGCKCGNSCLNKPFQNRPVKKMKLVQT-EKCGAGIVADEDIKRGEFVIEYVGEVIDDQTC 153
            +GC  G  C N+ F  R    ++  +T +  G G+    D+K+ +FV EYVGE+ID++  
Sbjct: 1117 AGCPAGEKCQNQRFVKRQYPAVESFKTPDGRGWGLKTLVDVKKHDFVYEYVGELIDEEEV 1176

Query: 154  EERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIG 213
            + R+ K      TNFY+  ++++ +IDA  K N SR++NHSC PN E QKW+++G+ R+G
Sbjct: 1177 QRRIKKAHEDNVTNFYMLTLDKNRIIDAGPKANMSRFMNHSCQPNCETQKWMVNGDIRVG 1236

Query: 214  IFATRDIKKGENLTYDYQYEFL 235
            +FA  DI  G  LT++Y  + L
Sbjct: 1237 LFAMDDIPTGSELTFNYNLDCL 1258


>gi|164655887|ref|XP_001729072.1| hypothetical protein MGL_3860 [Malassezia globosa CBS 7966]
 gi|159102961|gb|EDP41858.1| hypothetical protein MGL_3860 [Malassezia globosa CBS 7966]
          Length = 707

 Score =  120 bits (301), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 62/148 (41%), Positives = 86/148 (58%), Gaps = 1/148 (0%)

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C+CG  C N+ FQ     K+ +V+T   G G+ A  D+ R EFV EY+GEVI+  T   R
Sbjct: 29  CRCGVHCANQRFQQHAYAKVDIVKTPDKGYGLRAQTDLGRDEFVYEYIGEVINHDTFMRR 88

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           + + K     +FY   + RD  IDAT +G +SR+INHSC PN  + KW +    R+GIFA
Sbjct: 89  MAQYKTEHIEHFYFMMLQRDEYIDATKRGGRSRFINHSCSPNCYVSKWHVGRHVRMGIFA 148

Query: 217 TRDIKKGENLTYDYQYE-FLHDSLIAYC 243
            R I  GE LT++Y  + + +D    YC
Sbjct: 149 KRRILAGEELTFNYNVDRYGNDPQPCYC 176


>gi|402594990|gb|EJW88916.1| SET domain-containing protein [Wuchereria bancrofti]
          Length = 1425

 Score =  120 bits (301), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 67/172 (38%), Positives = 94/172 (54%), Gaps = 8/172 (4%)

Query: 67  FCSCTASPGSSGV-CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCG 125
           FC  T++  + G  C   C    L + C S C  G  C N+ F N+   K+++      G
Sbjct: 557 FCEPTSAEIAEGRGCSSGCINRELYTECGSRCPSGAGCANRRFHNKQYAKVEVFNAGIKG 616

Query: 126 AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGE----TNFYLCEINRDMVIDA 181
            G+ A E ++ G F+IEYVGEVID    EE + + +  G+     + YL  +    VIDA
Sbjct: 617 WGLRAAEPLEPGRFIIEYVGEVID---AEEMIRRGRRYGKDPKHVHHYLMALKNGAVIDA 673

Query: 182 TYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           T KGN SR+INHSC PN E QKW ++ + R+G F  + I  GE + +DYQ E
Sbjct: 674 TAKGNVSRFINHSCDPNCESQKWTVNRQLRVGFFVIKPIALGEEIVFDYQLE 725


>gi|302903800|ref|XP_003048936.1| hypothetical protein NECHADRAFT_106311 [Nectria haematococca mpVI
           77-13-4]
 gi|256729870|gb|EEU43223.1| hypothetical protein NECHADRAFT_106311 [Nectria haematococca mpVI
           77-13-4]
          Length = 928

 Score =  120 bits (301), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 63/188 (33%), Positives = 99/188 (52%), Gaps = 6/188 (3%)

Query: 62  EDDGIFCSCTAS--PGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
           ++D + C C      G +  C  D  C      +   +    CG  C N+ FQ +    +
Sbjct: 119 DNDSLDCECREEWRDGENLACGEDSDCINRATKMECSAEAGNCGGGCQNQRFQRKQYANV 178

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            +++TEK G G+ AD D++  +FV EY+GEVI++ T   R+ +    G  +FY   +N+ 
Sbjct: 179 SVIKTEKKGFGLRADADLQPNDFVYEYIGEVINEPTFRRRMLQYDEEGIKHFYFMSLNKS 238

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
             +DAT KGN  R+ NHSC PN  + KW++  + R+GIF  R I+ GE L ++Y  + + 
Sbjct: 239 EFVDATKKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFTLRKIQSGEELVFNYNVDRYG 298

Query: 236 HDSLIAYC 243
            D    YC
Sbjct: 299 ADPQPCYC 306


>gi|195368593|ref|XP_002045796.1| GM13547 [Drosophila sechellia]
 gi|194134946|gb|EDW56462.1| GM13547 [Drosophila sechellia]
          Length = 419

 Score =  120 bits (301), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 71/188 (37%), Positives = 104/188 (55%), Gaps = 7/188 (3%)

Query: 37  IKPKAIPYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSC 93
           IK    PYV IK N  +   R  + LED     +C   P     C  +  C   ML + C
Sbjct: 221 IKLHPPPYVKIKTNKAVPPLRFSQNLED---LSTCNCLPVDEHPCGPEAGCLNRMLFNEC 277

Query: 94  SSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
           +   CK G+ C N+ F+ R   ++++V   + G G+V  E I  G+FVIEYVGEVI+   
Sbjct: 278 NPEYCKAGSLCENRMFEQRKSPRLEVVYMNERGFGLVNREPIAAGDFVIEYVGEVINHAE 337

Query: 153 CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
            + R+ + +   + N+Y   + +D +IDA  KGN +R++NHSC PN E QKW ++   R+
Sbjct: 338 FQRRMEQKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRV 397

Query: 213 GIFATRDI 220
           GIFA +DI
Sbjct: 398 GIFAIKDI 405


>gi|67903676|ref|XP_682094.1| hypothetical protein AN8825.2 [Aspergillus nidulans FGSC A4]
 gi|74592565|sp|Q5ASA5.1|SET2_EMENI RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
           specific; AltName: Full=SET domain-containing protein 2
 gi|40740923|gb|EAA60113.1| hypothetical protein AN8825.2 [Aspergillus nidulans FGSC A4]
 gi|259482964|tpe|CBF77940.1| TPA: Histone-lysine N-methyltransferase, H3 lysine-36 specific (EC
           2.1.1.43)(SET domain-containing protein 2)
           [Source:UniProtKB/Swiss-Prot;Acc:Q5ASA5] [Aspergillus
           nidulans FGSC A4]
          Length = 980

 Score =  120 bits (301), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 61/164 (37%), Positives = 92/164 (56%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG  C N+ FQ R    + +++TEK G G+ A+ED++  +F+
Sbjct: 208 DSDCINRATKIECMGDCGCGPDCQNQRFQRREYANVAVIKTEKKGYGLRAEEDLRPHQFI 267

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EYVGEVI++     R+ +    G  +FY   +++   +DAT KGN  R+ NHSC PN  
Sbjct: 268 FEYVGEVINEGPFHRRMRQYDAEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCY 327

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA R I+ GE L ++Y  + +  D    YC
Sbjct: 328 VDKWVVGEKLRMGIFAERHIQAGEELVFNYNVDRYGADPQPCYC 371


>gi|428183837|gb|EKX52694.1| hypothetical protein GUITHDRAFT_65244 [Guillardia theta CCMP2712]
          Length = 211

 Score =  120 bits (300), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 68/190 (35%), Positives = 104/190 (54%), Gaps = 6/190 (3%)

Query: 46  FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCL 104
            ++RN YL ++  +   DD   C+C    G     D+ C    +   C+ G C CG+SC 
Sbjct: 1   MLRRNTYLHRKYAKLSVDDIAVCNCLPIKGKVACADQTCALRQIYVECTPGFCPCGDSCQ 60

Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
           N+ FQ     + ++ + +K G  +   ED+++G FVIEY+GE+++ +  E R  K  +  
Sbjct: 61  NQRFQKCQYVRTEVKKVDKRGWALFTMEDVQQGTFVIEYMGEILNRRMYERR--KKAYAK 118

Query: 165 ETNFYLCEINRD---MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
           E + Y   +N      VIDA+ K +  R+INHSC PN    ++   GE  +GIFA RDI+
Sbjct: 119 EKHTYFMVLNTSPIFEVIDASRKSSMGRFINHSCDPNCHTHRYRSLGEVVVGIFAKRDIE 178

Query: 222 KGENLTYDYQ 231
           KGE +T DYQ
Sbjct: 179 KGEEITIDYQ 188


>gi|406867230|gb|EKD20268.1| histone H3 lysine 36 (K36) methyltransferase [Marssonina brunnea f.
           sp. 'multigermtubi' MB_m1]
          Length = 1038

 Score =  120 bits (300), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 75/223 (33%), Positives = 116/223 (52%), Gaps = 8/223 (3%)

Query: 27  PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTAS--PGSSGVC--DR 82
           P+  ++PD   +   + +  IK  IY +K +    E D + C C+     G +  C  D 
Sbjct: 119 PLFNDVPDATEEATGV-FQKIKDCIYGSKGMGSS-EHDALDCDCSEEWRDGKNHACGEDS 176

Query: 83  DCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVI 141
           DC        C  G C CG+ C N+ FQ +    + + +TEK G G+ A+ D+   +F+ 
Sbjct: 177 DCINRATKMECVGGECNCGSGCQNQRFQQKQYADVSVFKTEKKGYGLRANVDLDANDFIF 236

Query: 142 EYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
           EY+GEVI++ T   R  +    G  +FY   + +   +DAT KGN  R+ NHSC PN  +
Sbjct: 237 EYIGEVINEPTFRRRTVQYDQEGIKHFYFMSLTKHEFVDATKKGNLGRFCNHSCNPNCYV 296

Query: 202 QKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
            KW++  + R+GIFA R+IK GE L ++Y  + +  D    YC
Sbjct: 297 DKWVVGEKLRMGIFAERNIKAGEELVFNYNVDRYGADPQPCYC 339


>gi|157123795|ref|XP_001653916.1| set domain protein [Aedes aegypti]
 gi|157123797|ref|XP_001653917.1| set domain protein [Aedes aegypti]
 gi|108874200|gb|EAT38425.1| AAEL009666-PB, partial [Aedes aegypti]
 gi|108874201|gb|EAT38426.1| AAEL009666-PA, partial [Aedes aegypti]
          Length = 2091

 Score =  120 bits (300), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 67/197 (34%), Positives = 105/197 (53%), Gaps = 9/197 (4%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSS-GCKCGNS 102
            Y  I+ N+Y    +K     D   C+C    G    C  DC   ++   CS   C CG  
Sbjct: 1240 YRKIRTNVYYD--VKPNPSTDHPQCNCKPDSG----CQDDCLNRLVFVECSPENCPCGER 1293

Query: 103  CLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
            C N   Q       ++   TE+ G GI + E +++G F++EY+GEV+ ++  +ER+  + 
Sbjct: 1294 CKNTKIQRHEYAPGLERFMTEQKGWGIRSKEGVRKGLFIMEYLGEVVTEKEFKERMRTI- 1352

Query: 162  HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
            +L +T+ Y   +   +VID    G+  R++NHSC PN EMQKW ++G  R+ +FA+RDI 
Sbjct: 1353 YLNDTHHYCLNLTGGLVIDGHRMGSDCRFVNHSCAPNCEMQKWSVNGLFRMALFASRDIP 1412

Query: 222  KGENLTYDYQYEFLHDS 238
              E LTYDY +   + +
Sbjct: 1413 PYEELTYDYNFSLFNPT 1429


>gi|402087032|gb|EJT81930.1| histone-lysine N-methyltransferase [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 962

 Score =  120 bits (300), Expect = 6e-25,   Method: Composition-based stats.
 Identities = 61/153 (39%), Positives = 85/153 (55%), Gaps = 1/153 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C  G  CG  C N+ FQ +   K+ +++TEK G G+ AD D+   +FV
Sbjct: 146 DSDCINRHTKIECVGG-NCGEGCQNQRFQAKQYAKVSVIKTEKKGFGLRADTDLDANDFV 204

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GEVI ++    RL K       +FY   + R   +DAT KGN  R+ NHSC PN  
Sbjct: 205 FEYIGEVIGEELFRRRLMKYDDQRLEHFYFMSLTRTEYVDATKKGNLGRFCNHSCNPNCF 264

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           + KW++  + R+GIFA R IK GE L ++Y  +
Sbjct: 265 VDKWVVGDKLRMGIFALRAIKAGEELCFNYNVD 297


>gi|307167950|gb|EFN61316.1| Probable histone-lysine N-methyltransferase ASH1L [Camponotus
            floridanus]
          Length = 2136

 Score =  120 bits (300), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 73/214 (34%), Positives = 106/214 (49%), Gaps = 16/214 (7%)

Query: 32   LPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLS 91
            +P W        Y  I+ NIY    +K     +   C C    G    C  DC   M+ S
Sbjct: 1282 VPSWN-------YKKIRTNIYYD--VKPTTLYEAQACECKPESG----CGDDCINRMVFS 1328

Query: 92   SCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVID 149
             CS   C  G  C N+  Q       ++   TE  G G+   + IK G+F++EYVGEV+ 
Sbjct: 1329 ECSPQLCPSGEKCENQKIQKHEWSPGLQKFMTEDKGWGVRTQQSIKSGDFILEYVGEVVS 1388

Query: 150  DQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGE 209
            ++  + R+   ++  +T+ Y   ++  +VID    G   R++NHSC PN EMQKW + G 
Sbjct: 1389 EREFKSRMA-TRYANDTHHYCLHLDGGLVIDGHRMGGDGRFVNHSCEPNCEMQKWSVLGL 1447

Query: 210  TRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
             R+ +FA+RDIK GE LTYDY +   + S    C
Sbjct: 1448 PRMALFASRDIKPGEELTYDYNFALFNPSEGQQC 1481


>gi|402585708|gb|EJW79647.1| hypothetical protein WUBG_09444, partial [Wuchereria bancrofti]
          Length = 511

 Score =  120 bits (300), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 66/176 (37%), Positives = 100/176 (56%), Gaps = 4/176 (2%)

Query: 70  CTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKP-FQNRPVKKMKLVQT-EKCGAG 127
           C+ S G  G  D +C   ++L  C S C     C NK  F+   +++++  QT   CG G
Sbjct: 137 CSCSSGCCGESD-ECLNRVVLMECGSSCPRNAICTNKRLFRRECIERLRTFQTMNGCGIG 195

Query: 128 IVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNK 187
           +  D +I +G+F+ EY+GEV+  +T   R  +  +  + N Y   +    V+DA +KGN 
Sbjct: 196 VKTDVNIDKGQFICEYIGEVVSMETFNIRS-RTDYRYQRNHYALNLCPGFVVDAYHKGNI 254

Query: 188 SRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
           +R+INHSC PN EMQ+W ++G  RIG+FA R I +GE LTYDY ++      +  C
Sbjct: 255 ARFINHSCAPNCEMQRWSVNGHYRIGLFALRGIHEGEELTYDYNWDAFEFDDVTIC 310


>gi|169596857|ref|XP_001791852.1| hypothetical protein SNOG_01198 [Phaeosphaeria nodorum SN15]
 gi|111069727|gb|EAT90847.1| hypothetical protein SNOG_01198 [Phaeosphaeria nodorum SN15]
          Length = 940

 Score =  120 bits (300), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 73/240 (30%), Positives = 119/240 (49%), Gaps = 12/240 (5%)

Query: 12  RIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSC- 70
           ++  A ++ +K+   P+  +LPD     K   +  + +  Y  K I     D  + C C 
Sbjct: 102 KLSRAASRTVKR-APPLYLDLPDSTQDAKQT-FEVLPQCTYANKHIG--TTDPALECDCG 157

Query: 71  ---TASPGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKC 124
               A    +  C  D  C      +   + GC CG  C N+ F  +    + +++T K 
Sbjct: 158 EEWDAVEKVNNACGEDSDCINRATKMECVADGCSCGTKCQNQRFFRKQYADVTVIKTAKK 217

Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
           G G+ A++D+  G+FV EY+GEVID++T   R+ +    G  +FY   + +   +DAT K
Sbjct: 218 GFGLRANQDMFPGDFVFEYIGEVIDERTFRRRMIQYDEEGIKHFYFMSLTKGEFVDATKK 277

Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           GN  R+ NHSC PN  + KW++  + R+GIF  R +K GE L ++Y  + +  D    YC
Sbjct: 278 GNLGRFCNHSCNPNCFVDKWVVGDKLRMGIFVERKVKAGEELVFNYNVDRYGADPQPCYC 337


>gi|425766575|gb|EKV05181.1| Histone-lysine N-methyltransferase, H3 lysine-36 specific
           [Penicillium digitatum PHI26]
 gi|425781728|gb|EKV19675.1| Histone-lysine N-methyltransferase, H3 lysine-36 specific
           [Penicillium digitatum Pd1]
          Length = 887

 Score =  120 bits (300), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 58/164 (35%), Positives = 93/164 (56%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C+  C CG+ C N+ FQ +    + +++TEK G G+ A+ ++  GE +
Sbjct: 144 DSDCINRATKIECAGDCGCGSDCQNQRFQKQQFAPVSVIKTEKKGFGLRAERNLDPGELI 203

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EYVGEV+ +Q   +R+ +    G  +FY   +N+   +DAT +GN  R+ NHSC PN  
Sbjct: 204 YEYVGEVVGEQQFRKRMRQYDEEGIKHFYFMSLNKGEFVDATKRGNLGRFCNHSCNPNCY 263

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA R ++ GE L ++Y  + +  D    YC
Sbjct: 264 VDKWVVGEKLRMGIFAERAVQAGEELVFNYNVDRYGADPQPCYC 307


>gi|336387462|gb|EGO28607.1| hypothetical protein SERLADRAFT_359892 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 780

 Score =  120 bits (300), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 71/208 (34%), Positives = 104/208 (50%), Gaps = 8/208 (3%)

Query: 44  YVFIKRNIYLTKRIKRRLED-DGIFCSCTASPG---SSGVCDRDCHCGMLLSSCS---SG 96
           +  I  N Y    + R  E  + + C C   PG    S  C     C   L+        
Sbjct: 50  FTEIHENHYQYGTLGRSREALESMTCDCQFEPGVDRPSDACGHGSDCINRLTQVECLPDD 109

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C+C + C N+ FQ R    + +VQTEK G G+ A  D+ R  F+ EYVG+V+   +  +R
Sbjct: 110 CRCRSFCQNQRFQRREYANINIVQTEKKGYGLRAAADLHRDAFIYEYVGDVVSHPSFVKR 169

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           + +    G  +FY   + +D  IDAT +G   R+ NHSC PN  + KW I    R+GIFA
Sbjct: 170 MREYGEQGIRHFYFMMLQKDEFIDATKRGGMGRFANHSCNPNCYVAKWTIGNHVRMGIFA 229

Query: 217 TRDIKKGENLTYDYQYE-FLHDSLIAYC 243
            R+I K E LT++Y  + + H++   YC
Sbjct: 230 NRNIVKDEELTFNYNVDRYGHEAQPCYC 257


>gi|296418221|ref|XP_002838740.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295634701|emb|CAZ82931.1| unnamed protein product [Tuber melanosporum]
          Length = 892

 Score =  120 bits (300), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 71/187 (37%), Positives = 99/187 (52%), Gaps = 8/187 (4%)

Query: 51  IYLTKRIKRRLEDDGIFCSCTASPGSSGVCD------RDCHCGMLLSSCSSGCK-CGNSC 103
           IY TK I    +D G   SC       G C+       DC   M    C+   + CG  C
Sbjct: 120 IYSTKSIGDSGQD-GEVMSCECRTDWDGECNPACGDNSDCINRMTSMECTQDDRACGQDC 178

Query: 104 LNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHL 163
            N+ FQ R    + +++TEK G G+ A+ DI    F+ EYVGEVID+     R+ K    
Sbjct: 179 QNQRFQQRQYADVSVIKTEKKGFGLRANSDISINTFLYEYVGEVIDENKFRRRMEKYDSE 238

Query: 164 GETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKG 223
           G  +FY   + ++  IDAT KG  +R+ NHSC PN  + KW++  + R+GIFA R++K G
Sbjct: 239 GIKHFYFMSLGKNEFIDATKKGGLARFCNHSCNPNCFIDKWVVGEKLRMGIFAKRNVKAG 298

Query: 224 ENLTYDY 230
           E L +DY
Sbjct: 299 EELVFDY 305


>gi|145540026|ref|XP_001455703.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124423511|emb|CAK88306.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1371

 Score =  120 bits (300), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 78/228 (34%), Positives = 111/228 (48%), Gaps = 18/228 (7%)

Query: 25  GNPVEFE-LPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCT-----------A 72
           GNPV  + L D F       +  +K+N YL ++  R   D+ + CSC             
Sbjct: 32  GNPVRLQGLKDSFKY-----FTLLKKNQYLERKQYRVSRDETLVCSCIMCPEDQIQNRPQ 86

Query: 73  SPGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVAD 131
            P  S  C   C      + C    C C   C N+ FQ      +  ++    G G+ A 
Sbjct: 87  GPQYSYNCGERCLNRFTCTECDVELCPCAEQCKNRRFQKHDDACVYPLRCGGKGMGLFAG 146

Query: 132 EDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYI 191
           E I +G+F+++YVGE+    +   R    ++   T  YL ++N   VID T KGN +R+I
Sbjct: 147 ERILKGQFIMQYVGEIFQINSAFGRRRVQEYSKSTCTYLMKLNNQEVIDPTSKGNLARFI 206

Query: 192 NHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSL 239
           NHSC PN   +KW + GE  IGIFA RDI + E LT+DYQ++  H  L
Sbjct: 207 NHSCEPNCITEKWNVLGEVCIGIFAIRDINEDEELTFDYQFDVFHTPL 254


>gi|392579013|gb|EIW72140.1| hypothetical protein TREMEDRAFT_25563 [Tremella mesenterica DSM
           1558]
          Length = 869

 Score =  119 bits (299), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 70/194 (36%), Positives = 102/194 (52%), Gaps = 14/194 (7%)

Query: 60  RLEDDGIFCSCT-------ASP-GSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQN 110
           R +D+ + C C        A P GS      DC    +   C  G C+    C N+ F  
Sbjct: 139 REQDEMMVCDCVYDRDDPHAEPCGSQS----DCINRAIFIECLPGECRAREQCQNQRFGK 194

Query: 111 RPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYL 170
           R    + +V TE  G G+ A +DI  G  + EY+GEVI + T  +R+ +    G  +FY 
Sbjct: 195 RQYAPIDVVLTEMKGFGLRAADDIPEGSLIYEYIGEVIKETTFRKRMAQYADEGIKHFYF 254

Query: 171 CEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
             + ++  IDAT KG   R+ NHSC PN+E+QKW++    R+GIFA RDI K E +T++Y
Sbjct: 255 MMLQKEEYIDATKKGGIGRFANHSCNPNSEVQKWVVGRRLRMGIFAKRDIIKDEEITFNY 314

Query: 231 QYE-FLHDSLIAYC 243
             + + HD+   YC
Sbjct: 315 NVDRYGHDAQPCYC 328


>gi|363756006|ref|XP_003648219.1| hypothetical protein Ecym_8108 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356891419|gb|AET41402.1| Hypothetical protein Ecym_8108 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 682

 Score =  119 bits (299), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 68/193 (35%), Positives = 104/193 (53%), Gaps = 6/193 (3%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTAS--PGSSGVCDR--DCHCGMLLSSCSSG--C 97
           ++ ++++ Y  KR+      + + C C      G +  C    DC   + L  C +    
Sbjct: 34  FIELEKSTYAHKRLGNSPSHEFMECDCFEEYRDGKNHACGETSDCINRLTLIECVNELCT 93

Query: 98  KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
            CG++C N+ FQ R    + + QTEK G G+ A++DI+  EF+ EY+GEVI +    ER+
Sbjct: 94  SCGDNCQNQRFQGRQYADIAVFQTEKKGYGVRAEKDIEANEFIYEYIGEVISESEFRERM 153

Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
                 G  +FY   +     IDAT KG  +R+ NHSC PN  + KW + G+ ++GIFA 
Sbjct: 154 VDYDVRGYKHFYFMMLQTGEFIDATEKGCLARFCNHSCNPNAYVSKWDVAGKLKMGIFAN 213

Query: 218 RDIKKGENLTYDY 230
           R I KGE +T+DY
Sbjct: 214 RKIFKGEEITFDY 226


>gi|85086731|ref|XP_957740.1| histone H3 lysine 36 (K36) methyltransferase [Neurospora crassa
           OR74A]
 gi|74614418|sp|Q7RZU4.1|SET2_NEUCR RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
           specific; AltName: Full=SET domain-containing protein 2
 gi|28918835|gb|EAA28504.1| histone H3 lysine 36 (K36) methyltransferase [Neurospora crassa
           OR74A]
          Length = 954

 Score =  119 bits (299), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 62/165 (37%), Positives = 93/165 (56%), Gaps = 2/165 (1%)

Query: 81  DRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
           D DC        C  G C CG+ C N+ FQ +    + +++TEK G G+ A+ D++  +F
Sbjct: 148 DSDCINRATKMECVDGDCNCGSGCQNQRFQRKQYADVSVIKTEKKGFGLRANTDLQVNDF 207

Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
           + EY+GEVI++ T   R+ K    G  +FY   + +   +DAT KGN  R+ NHSC PN 
Sbjct: 208 IFEYIGEVINEPTFRSRMVKYDKEGIKHFYFMSLTKSEFVDATKKGNLGRFCNHSCDPNC 267

Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
            + KW++  + R+GIFA R IK GE L ++Y  + +  D    YC
Sbjct: 268 YVDKWVVGDKLRMGIFAGRAIKAGEELVFNYNVDRYGADPQPCYC 312


>gi|336374580|gb|EGO02917.1| hypothetical protein SERLA73DRAFT_47203 [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 776

 Score =  119 bits (299), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 71/208 (34%), Positives = 104/208 (50%), Gaps = 8/208 (3%)

Query: 44  YVFIKRNIYLTKRIKRRLED-DGIFCSCTASPG---SSGVCDRDCHCGMLLSSCS---SG 96
           +  I  N Y    + R  E  + + C C   PG    S  C     C   L+        
Sbjct: 50  FTEIHENHYQYGTLGRSREALESMTCDCQFEPGVDRPSDACGHGSDCINRLTQVECLPDD 109

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C+C + C N+ FQ R    + +VQTEK G G+ A  D+ R  F+ EYVG+V+   +  +R
Sbjct: 110 CRCRSFCQNQRFQRREYANINIVQTEKKGYGLRAAADLHRDAFIYEYVGDVVSHPSFVKR 169

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           + +    G  +FY   + +D  IDAT +G   R+ NHSC PN  + KW I    R+GIFA
Sbjct: 170 MREYGEQGIRHFYFMMLQKDEFIDATKRGGMGRFANHSCNPNCYVAKWTIGNHVRMGIFA 229

Query: 217 TRDIKKGENLTYDYQYE-FLHDSLIAYC 243
            R+I K E LT++Y  + + H++   YC
Sbjct: 230 NRNIVKDEELTFNYNVDRYGHEAQPCYC 257


>gi|336469811|gb|EGO57973.1| histone-lysine N-methyltransferase, H3 lysine-36 specific
           [Neurospora tetrasperma FGSC 2508]
 gi|350290514|gb|EGZ71728.1| histone-lysine N-methyltransferase, H3 lysine-36 specific, partial
           [Neurospora tetrasperma FGSC 2509]
          Length = 957

 Score =  119 bits (299), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 62/165 (37%), Positives = 93/165 (56%), Gaps = 2/165 (1%)

Query: 81  DRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
           D DC        C  G C CG+ C N+ FQ +    + +++TEK G G+ A+ D++  +F
Sbjct: 148 DSDCINRATKMECVDGDCNCGSGCQNQRFQRKQYADVSVIKTEKKGFGLRANTDLQVNDF 207

Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
           + EY+GEVI++ T   R+ K    G  +FY   + +   +DAT KGN  R+ NHSC PN 
Sbjct: 208 IFEYIGEVINEPTFRSRMVKYDKEGIKHFYFMSLTKSEFVDATKKGNLGRFCNHSCDPNC 267

Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
            + KW++  + R+GIFA R IK GE L ++Y  + +  D    YC
Sbjct: 268 YVDKWVVGDKLRMGIFAGRAIKAGEELVFNYNVDRYGADPQPCYC 312


>gi|213406581|ref|XP_002174062.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
           yFS275]
 gi|212002109|gb|EEB07769.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
           yFS275]
          Length = 779

 Score =  119 bits (299), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 64/185 (34%), Positives = 101/185 (54%), Gaps = 5/185 (2%)

Query: 51  IYLTKRIKRRLEDDGIFCSCTAS--PGSSGVCDRDCHCGMLLSS---CSSGCKCGNSCLN 105
           +Y    +    +++ + C C      G +  C    +C   ++S       C CG SC N
Sbjct: 90  LYANAHLGEPQQNEAMACDCKPEWVDGVNIACGHGSYCINRMTSIECTDENCYCGPSCQN 149

Query: 106 KPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGE 165
           + FQ +    + ++QTEK G G+ A+  + +G FV EY+GEVI +    +R+ +    G 
Sbjct: 150 QRFQKKMYADVDVIQTEKKGFGLRANSYLTKGTFVYEYIGEVIPEVRFRKRMREYDERGI 209

Query: 166 TNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGEN 225
            +FY   + +   IDAT KG+ +R+ NHSC PN  + KW++  + R+GIF  RDI+KGE 
Sbjct: 210 RHFYFMMLQKGEYIDATVKGSLARFCNHSCRPNCYVDKWVVGNKLRMGIFCKRDIQKGEE 269

Query: 226 LTYDY 230
           LT+DY
Sbjct: 270 LTFDY 274


>gi|170097553|ref|XP_001879996.1| histone methyltransferase [Laccaria bicolor S238N-H82]
 gi|164645399|gb|EDR09647.1| histone methyltransferase [Laccaria bicolor S238N-H82]
          Length = 880

 Score =  119 bits (299), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 60/148 (40%), Positives = 88/148 (59%), Gaps = 1/148 (0%)

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C+C + C N+ FQ R    + +V TEK G G+ A+ED+ R  F+ EYVG+V++  + ++R
Sbjct: 204 CRCRSYCQNQRFQKREYANIDIVLTEKKGFGLRAEEDLPRDTFIYEYVGDVVNPTSFKKR 263

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           +      G  +FY   + +D  IDAT  G   R+ NHSC PN  + KW I    R+GIFA
Sbjct: 264 MRDYALEGIQHFYFMMLQKDEFIDATKGGGIGRFANHSCSPNCYVAKWTIGHHVRMGIFA 323

Query: 217 TRDIKKGENLTYDYQYE-FLHDSLIAYC 243
            RDIK+ E LT++Y  + + H + I YC
Sbjct: 324 KRDIKQYEELTFNYNVDRYGHQAQICYC 351


>gi|119498557|ref|XP_001266036.1| SET and WW domain protein [Neosartorya fischeri NRRL 181]
 gi|119414200|gb|EAW24139.1| SET and WW domain protein [Neosartorya fischeri NRRL 181]
          Length = 967

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 91/164 (55%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG  C N+ FQ +    + +++TEK G G+ A+ D++  +F+
Sbjct: 181 DSDCINRATKIECVGDCSCGAECQNQRFQRKEYANVAVIKTEKKGFGLRAETDLRPHQFI 240

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EYVGEVI++     R+ +    G  +FY   ++R   +DAT KGN  R+ NHSC PN  
Sbjct: 241 FEYVGEVINEAQFRRRMRQYDEEGIKHFYFMSLSRGEFVDATKKGNLGRFCNHSCNPNCY 300

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA R I+ GE L ++Y  + +  D    YC
Sbjct: 301 VDKWVVGEKLRMGIFAERTIQAGEELVFNYNVDRYGADPQPCYC 344


>gi|452985541|gb|EME85297.1| hypothetical protein MYCFIDRAFT_83386 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 835

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 56/150 (37%), Positives = 85/150 (56%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG +C N+ FQ +    + ++QTEK G G+ A+ D++  +F+
Sbjct: 96  DSDCINRATKMECVGDCNCGIACQNQRFQRKQYANVSVIQTEKKGYGLRANTDLRANDFI 155

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GEVI +     ++ +    G  +FY   + +   +DAT KGN  R+ NHSC PN  
Sbjct: 156 FEYIGEVIGENVFRRKMQQYDEQGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCY 215

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDY 230
           + KW++  + R+GIFA R IK GE L ++Y
Sbjct: 216 VDKWVVGDKLRMGIFAERKIKAGEELVFNY 245


>gi|195020546|ref|XP_001985216.1| GH14625 [Drosophila grimshawi]
 gi|193898698|gb|EDV97564.1| GH14625 [Drosophila grimshawi]
          Length = 2406

 Score =  119 bits (298), Expect = 1e-24,   Method: Composition-based stats.
 Identities = 60/173 (34%), Positives = 101/173 (58%), Gaps = 5/173 (2%)

Query: 69   SCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKK--MKLVQTEKCG 125
            +C   P ++G C  +C   M+ + C+ S C  G+ C N+  Q   V     + + T+K G
Sbjct: 1519 NCNCKPQANGACLDNCLNRMVYTECAPSNCPAGDKCRNQKIQRHEVAPGVERFMTTDK-G 1577

Query: 126  AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKG 185
             G+     I +G +++EYVGEV+ ++  ++R+  + +L +T+ Y   ++  +VID    G
Sbjct: 1578 WGVRTKLPIAKGTYILEYVGEVVTEREFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMG 1636

Query: 186  NKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
            +  R++NHSC PN EMQKW ++G +R+ +FA R I+ GE LTYDY +   + S
Sbjct: 1637 SDCRFVNHSCEPNCEMQKWSVNGLSRMVLFAKRAIEDGEELTYDYNFSLFNPS 1689


>gi|343427717|emb|CBQ71244.1| related to SET2-Histone methyltransferase [Sporisorium reilianum
           SRZ2]
          Length = 971

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 68/177 (38%), Positives = 99/177 (55%), Gaps = 8/177 (4%)

Query: 63  DDGIFCSCTASPGSSGV---CDRDCHC--GMLLSSCSSG-CKCGNSCLNKPFQNRPVKKM 116
           DD + C C  +P +  +   C  +  C   M    CSS  C+ G  C N+ F  R    +
Sbjct: 184 DDFMICDC--NPNTENLDMACTDESGCINRMTQIECSSSKCRWGKQCRNQRFHRRQYVDV 241

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            +VQTEK G G+ A +DI +  FV EYVGEV++  T  +R+ + +  G  +FY   +  +
Sbjct: 242 DIVQTEKKGFGLRAAQDIPKESFVYEYVGEVMNQPTFLDRMQQYRVEGIRHFYFMMLQPN 301

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
             +DAT KG K R+INHSC PN  + KW +    R+GIFA R+I +GE LT++Y  +
Sbjct: 302 EYLDATKKGGKGRFINHSCNPNCSVSKWQVGKHIRMGIFAKRNILQGEELTFNYNVD 358


>gi|91077840|ref|XP_971447.1| PREDICTED: similar to set domain protein [Tribolium castaneum]
          Length = 1549

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 66/195 (33%), Positives = 104/195 (53%), Gaps = 10/195 (5%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSC 103
           Y  I+ N+Y  K      E     C+  A+      C  DC   ++L+ C +  KC N  
Sbjct: 754 YRKIRTNVYNVKTTTGACEPQTCNCTPLAN------CGDDCINRLVLAECPASHKCQNQK 807

Query: 104 LNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHL 163
           + +   +     ++   TE  G G+     IK GEF++EYVGEV+ DQ  +ER+  + ++
Sbjct: 808 IQR---HEWAPGLEKFMTENKGWGVRTKLPIKSGEFILEYVGEVVSDQEFKERMATI-YV 863

Query: 164 GETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKG 223
            +T+ Y   ++  +VID    G   R++NHSC PN EMQKW ++G+ R+ +FA RDI+  
Sbjct: 864 NDTHHYCLHLDGGLVIDGHRMGGDGRFVNHSCQPNCEMQKWSVNGQFRMALFALRDIESS 923

Query: 224 ENLTYDYQYEFLHDS 238
           E LTYDY +   + +
Sbjct: 924 EELTYDYNFSLFNPA 938


>gi|70998620|ref|XP_754032.1| SET and WW domain protein [Aspergillus fumigatus Af293]
 gi|74672926|sp|Q4WTT2.1|SET2_ASPFU RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
           specific; AltName: Full=SET domain-containing protein 2
 gi|66851668|gb|EAL91994.1| SET and WW domain protein [Aspergillus fumigatus Af293]
          Length = 966

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 91/164 (55%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG  C N+ FQ +    + +++TEK G G+ A+ D++  +F+
Sbjct: 181 DSDCINRATKIECVGDCSCGAECQNQRFQRKEYANVAVIKTEKKGFGLRAETDLRPHQFI 240

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EYVGEVI++     R+ +    G  +FY   ++R   +DAT KGN  R+ NHSC PN  
Sbjct: 241 FEYVGEVINEAQFRRRMRQYDEEGIKHFYFMSLSRGEFVDATKKGNLGRFCNHSCNPNCY 300

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA R I+ GE L ++Y  + +  D    YC
Sbjct: 301 VDKWVVGEKLRMGIFAERAIQAGEELVFNYNVDRYGADPQPCYC 344


>gi|358253063|dbj|GAA51760.1| histone-lysine N-methyltransferase NSD1/2 [Clonorchis sinensis]
          Length = 1596

 Score =  119 bits (298), Expect = 1e-24,   Method: Composition-based stats.
 Identities = 75/210 (35%), Positives = 111/210 (52%), Gaps = 23/210 (10%)

Query: 33   PDWFIKPKAIPYVFIKRNIYLTKRIKRRLED--DGIFCSCTASPG------SSGVCDRDC 84
            PD++ KP  + +      IY       RL D  +   C C  + G      SSG  +R+ 
Sbjct: 971  PDYY-KPIKVNWPLGSVRIY-------RLTDPSEAPRCECKPNSGGEPCGPSSGCINREL 1022

Query: 85   HCGMLLSSCSSGCKCGNSCLNKPFQNR--PVKKMKLVQTEKCGAGIVADEDIKRGEFVIE 142
            H   L S C +G    ++C N+ F  R  P ++     +E+ G G+     IK G FV E
Sbjct: 1023 HYECLPSVCPNG----DACQNQRFTKRLYPRQRPFWTGSER-GWGLKTLVPIKAGSFVNE 1077

Query: 143  YVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQ 202
            Y+G++ID++    RL        TN+Y+ +++   +IDA  KGN SR++NH C PN   Q
Sbjct: 1078 YIGDLIDEEEANRRLRFAHENNVTNYYMMKLDAQRIIDAGPKGNLSRFMNHCCDPNLNTQ 1137

Query: 203  KWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            KW ++G+ RIG+FA RDI  GE LT+DY +
Sbjct: 1138 KWTVNGDNRIGLFAVRDIAAGEELTFDYNF 1167


>gi|146419774|ref|XP_001485847.1| hypothetical protein PGUG_01518 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 707

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 62/189 (32%), Positives = 103/189 (54%), Gaps = 11/189 (5%)

Query: 51  IYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG---------MLLSSCSSGCKCGN 101
           IY  K++    +++ + C C      +    R+  CG           +   +  C CG+
Sbjct: 38  IYANKQLGHSGQNEQMTCDCHEKWDQA--TQRNLACGEHSECINRATSVECVNKSCGCGD 95

Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
            C N+ FQ +    + ++QTE  G G+ A+E+I    F+ EYVGEVI++++  +R+ +  
Sbjct: 96  DCQNQRFQKKEYANVSVIQTELKGYGLRANEEINESGFIYEYVGEVINEESFRKRMVEYD 155

Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
                +FY   + +D  IDAT KG+ +R+ NHSC PN  + KW++  + R+GIFA R I+
Sbjct: 156 EKKFPHFYFMMLKKDSFIDATIKGSLARFCNHSCSPNAYVDKWVVGDKLRMGIFAKRLIQ 215

Query: 222 KGENLTYDY 230
            GE +T+DY
Sbjct: 216 AGEEITFDY 224


>gi|159126234|gb|EDP51350.1| SET and WW domain protein [Aspergillus fumigatus A1163]
          Length = 966

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 91/164 (55%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG  C N+ FQ +    + +++TEK G G+ A+ D++  +F+
Sbjct: 181 DSDCINRATKIECVGDCSCGPECQNQRFQRKEYANVAVIKTEKKGFGLRAETDLRPHQFI 240

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EYVGEVI++     R+ +    G  +FY   ++R   +DAT KGN  R+ NHSC PN  
Sbjct: 241 FEYVGEVINEAQFRRRMRQYDEEGIKHFYFMSLSRGEFVDATKKGNLGRFCNHSCNPNCY 300

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA R I+ GE L ++Y  + +  D    YC
Sbjct: 301 VDKWVVGEKLRMGIFAERAIQAGEELVFNYNVDRYGADPQPCYC 344


>gi|270001477|gb|EEZ97924.1| hypothetical protein TcasGA2_TC000311 [Tribolium castaneum]
          Length = 1647

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 66/195 (33%), Positives = 104/195 (53%), Gaps = 10/195 (5%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSC 103
            Y  I+ N+Y  K      E     C+  A+      C  DC   ++L+ C +  KC N  
Sbjct: 852  YRKIRTNVYNVKTTTGACEPQTCNCTPLAN------CGDDCINRLVLAECPASHKCQNQK 905

Query: 104  LNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHL 163
            + +   +     ++   TE  G G+     IK GEF++EYVGEV+ DQ  +ER+  + ++
Sbjct: 906  IQR---HEWAPGLEKFMTENKGWGVRTKLPIKSGEFILEYVGEVVSDQEFKERMATI-YV 961

Query: 164  GETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKG 223
             +T+ Y   ++  +VID    G   R++NHSC PN EMQKW ++G+ R+ +FA RDI+  
Sbjct: 962  NDTHHYCLHLDGGLVIDGHRMGGDGRFVNHSCQPNCEMQKWSVNGQFRMALFALRDIESS 1021

Query: 224  ENLTYDYQYEFLHDS 238
            E LTYDY +   + +
Sbjct: 1022 EELTYDYNFSLFNPA 1036


>gi|427798749|gb|JAA64826.1| Putative histone-lysine n-methyltransferase nsd2, partial
           [Rhipicephalus pulchellus]
          Length = 755

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 68/152 (44%), Positives = 93/152 (61%), Gaps = 3/152 (1%)

Query: 70  CTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGI 128
           CT +P     C  +C   +LL  CS   C  G  C N+ FQ R   +  LV+T   G G+
Sbjct: 604 CTCTPQDP--CRANCLNRLLLYECSRDLCPAGPDCENQRFQKRTYARTSLVRTPGRGWGL 661

Query: 129 VADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKS 188
           +  + +  G+FV+EYVGE+ID++ CE RL ++      NFY   + +D +IDA  KGN S
Sbjct: 662 LTSQALAAGDFVMEYVGELIDEEECERRLEELHMENIQNFYFLTLEKDRIIDAGPKGNMS 721

Query: 189 RYINHSCCPNTEMQKWIIDGETRIGIFATRDI 220
           R+INHSC PN E QKW ++G+TR+GIFA RDI
Sbjct: 722 RFINHSCEPNCETQKWTVNGDTRVGIFAIRDI 753


>gi|254586457|ref|XP_002498796.1| ZYRO0G18766p [Zygosaccharomyces rouxii]
 gi|238941690|emb|CAR29863.1| ZYRO0G18766p [Zygosaccharomyces rouxii]
          Length = 680

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 66/190 (34%), Positives = 101/190 (53%), Gaps = 6/190 (3%)

Query: 47  IKRNIYLTKRIKRRLEDDGIFCSCTAS--PGSSGVCDR--DCHCGMLLSSCSSGC--KCG 100
           ++ N Y  KR+     ++ + C C      G +  CD   DC   + L  C +G    CG
Sbjct: 37  LEENHYANKRLGNSKHNEFMECDCYEEFEDGINRACDENSDCINRLTLIECVNGLCGSCG 96

Query: 101 NSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKM 160
             C N+ FQ +    + + QT+  G G+ A  DI+  +F+ EY+G+VI +    +RL   
Sbjct: 97  EDCQNQRFQRKQYADIAVFQTKLKGYGVRAQSDIEPHQFIYEYMGDVIPEDVFRDRLVDY 156

Query: 161 KHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDI 220
              G  +FY   +     IDAT KG+ +R+ NHSC PN  + KW++ G+ ++GIFA R I
Sbjct: 157 DQKGYQHFYFMMLQNGEFIDATVKGSLARFCNHSCNPNAYVNKWVVAGKLKMGIFAHRKI 216

Query: 221 KKGENLTYDY 230
            KGE +T+DY
Sbjct: 217 LKGEEITFDY 226


>gi|190345522|gb|EDK37420.2| hypothetical protein PGUG_01518 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 707

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 62/187 (33%), Positives = 103/187 (55%), Gaps = 7/187 (3%)

Query: 51  IYLTKRIKRRLEDDGIFCSC----TASPGSSGVCDRDCHCGMLLSSC---SSGCKCGNSC 103
           IY  K++    +++ + C C      +   +  C     C    +S    +  C CG+ C
Sbjct: 38  IYANKQLGHSGQNEQMTCDCHEKWDQATQRNLACGEHSECINRATSVECVNKSCGCGDDC 97

Query: 104 LNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHL 163
            N+ FQ +    + ++QTE  G G+ A+E+I    F+ EYVGEVI++++  +R+ +    
Sbjct: 98  QNQRFQKKEYANVSVIQTELKGYGLRANEEINESGFIYEYVGEVINEESFRKRMVEYDEK 157

Query: 164 GETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKG 223
              +FY   + +D  IDAT KG+ +R+ NHSC PN  + KW++  + R+GIFA R I+ G
Sbjct: 158 KFPHFYFMMLKKDSFIDATIKGSLARFCNHSCSPNAYVDKWVVGDKLRMGIFAKRLIQAG 217

Query: 224 ENLTYDY 230
           E +T+DY
Sbjct: 218 EEITFDY 224


>gi|345781638|ref|XP_003432154.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Canis
           lupus familiaris]
          Length = 742

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 75/228 (32%), Positives = 109/228 (47%), Gaps = 33/228 (14%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
           K  PY  IK N  + K ++  + D      C   P     C  +  C   ML   C    
Sbjct: 410 KPPPYKHIKANKVIGK-VQIHVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 468

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGI---------------------------- 128
           C  G  C N+ F  R     ++++TE+ G G+                            
Sbjct: 469 CPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKVAWGTTLGIWKNSVKQDTFV 528

Query: 129 -VADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNK 187
            + + D K+GEFV EYVGE+ID++ C  R+ +      TNFY+  + +D +IDA  KGN 
Sbjct: 529 SLLETDNKQGEFVNEYVGELIDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNY 588

Query: 188 SRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
           SR++NHSC PN E QKW ++G+ R+G+FA  DI  G  LT++Y  + L
Sbjct: 589 SRFMNHSCNPNCETQKWTVNGDIRVGLFALCDIPAGMELTFNYNLDCL 636


>gi|380492750|emb|CCF34378.1| SET domain-containing protein [Colletotrichum higginsianum]
          Length = 951

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 65/188 (34%), Positives = 104/188 (55%), Gaps = 6/188 (3%)

Query: 62  EDDGIFCSCTAS--PGSSGVC--DRDCHCGMLLSSCSSGC-KCGNSCLNKPFQNRPVKKM 116
           ++D + C C +    G +  C  D DC        C +G   CG+ C N+ FQ +    +
Sbjct: 119 DNDALDCDCRSEWHDGKNLSCGEDSDCINRATKMECVAGAGNCGDGCQNQRFQRKEYANV 178

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            +++TEK G G+ AD D++  +F+ EY+GEVI++ T   R+ +    G  +FY   + + 
Sbjct: 179 SVIKTEKKGFGLRADVDLQANDFIFEYIGEVINEPTFRRRMVQYDDEGIKHFYFMSLTKH 238

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
             +DAT KGN  R+ NHSC PN  + KW++  + R+GIF++R IK GE L ++Y  + + 
Sbjct: 239 EFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFSSRRIKAGEELVFNYNVDRYG 298

Query: 236 HDSLIAYC 243
            D    YC
Sbjct: 299 ADPQPCYC 306


>gi|366998103|ref|XP_003683788.1| hypothetical protein TPHA_0A02720 [Tetrapisispora phaffii CBS 4417]
 gi|357522083|emb|CCE61354.1| hypothetical protein TPHA_0A02720 [Tetrapisispora phaffii CBS 4417]
          Length = 707

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 66/185 (35%), Positives = 99/185 (53%), Gaps = 6/185 (3%)

Query: 52  YLTKRIKRRLEDDGIFCSCTAS--PGSSGVCDR--DCHCGMLLSSCSSGC--KCGNSCLN 105
           Y +K I     ++ + C C      G +  C    DC   + L  C +G    CG++C N
Sbjct: 46  YASKNIGNAKNNEFMECDCYEEFEDGRNKACSEYSDCINRLTLIECVNGLCNTCGDNCEN 105

Query: 106 KPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGE 165
           + FQ +    + + +TE  G G+ A+ D++  +F+ EY GEVI++    ERL +    G 
Sbjct: 106 QKFQKKEYADISIFKTELKGYGVRANSDLEENQFIYEYKGEVIEENEFRERLIEYDQRGF 165

Query: 166 TNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGEN 225
            +FY   +     IDAT KG  +R+ NHSC PN  + KW + G+ R+GIFA R I KGE 
Sbjct: 166 KHFYFMMLQSGEFIDATIKGALARFCNHSCNPNAYVNKWEVAGKLRMGIFAKRRISKGEE 225

Query: 226 LTYDY 230
           +T+DY
Sbjct: 226 ITFDY 230


>gi|321260843|ref|XP_003195141.1| histone-lysine N-methyltransferase [Cryptococcus gattii WM276]
 gi|317461614|gb|ADV23354.1| Histone-lysine N-methyltransferase, putative [Cryptococcus gattii
           WM276]
          Length = 833

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 68/205 (33%), Positives = 107/205 (52%), Gaps = 11/205 (5%)

Query: 45  VFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGV----CDRDCHCGMLLSSCSSG-CKC 99
           V+ +++I L+K       D+ + C C  +           D DC    L   C +G C+ 
Sbjct: 112 VYERKDIGLSKE-----NDEMMVCECVYNRHDPDTDPCGPDSDCINRALYIECIAGECRA 166

Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
           G  C N+ F  +    + +V TEK G G+ A   I     + EY+GEV+ ++T  +R+ +
Sbjct: 167 GKHCHNQQFSKKQYANVDVVLTEKKGYGLRASSIIPANTLIYEYIGEVVAEKTFRKRMQQ 226

Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
               G  +FY   + ++  IDAT KG   R+ NHSC PN E+QKW++    R+GIF  RD
Sbjct: 227 YADEGIRHFYFMMLQKEEYIDATKKGGIGRFANHSCNPNCEVQKWVVGRRLRMGIFTKRD 286

Query: 220 IKKGENLTYDYQYE-FLHDSLIAYC 243
           + KGE +T++Y  + + HD+   YC
Sbjct: 287 VVKGEEITFNYNVDRYGHDAQTCYC 311


>gi|405121647|gb|AFR96415.1| histone-lysine N-methyltransferase [Cryptococcus neoformans var.
           grubii H99]
          Length = 836

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 68/205 (33%), Positives = 106/205 (51%), Gaps = 11/205 (5%)

Query: 45  VFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGV----CDRDCHCGMLLSSCSSG-CKC 99
           V+ +++I L+K       D+ + C C  +           D DC    L   C  G C+ 
Sbjct: 112 VYERKDIGLSKE-----NDEMMVCECVYNRHDPDADPCGPDSDCINRALYIECIGGECRA 166

Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
           G  C N+ F  +    + +V TEK G G+ A   I     + EY+GEV+ ++T  +R+ +
Sbjct: 167 GKHCHNQQFSKKQYANVDVVLTEKKGYGLRASSTIPANTLIYEYIGEVVAEKTFRKRMQQ 226

Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
               G  +FY   + ++  IDAT KG   R+ NHSC PN E+QKW++    R+GIF  RD
Sbjct: 227 YADEGIRHFYFMMLQKEEYIDATKKGGIGRFANHSCNPNCEVQKWVVGRRLRMGIFTKRD 286

Query: 220 IKKGENLTYDYQYE-FLHDSLIAYC 243
           + KGE +T++Y  + + HD+   YC
Sbjct: 287 VIKGEEITFNYNVDRYGHDAQTCYC 311


>gi|336266160|ref|XP_003347849.1| hypothetical protein SMAC_06682 [Sordaria macrospora k-hell]
 gi|380091782|emb|CCC10510.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 907

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 62/165 (37%), Positives = 92/165 (55%), Gaps = 2/165 (1%)

Query: 81  DRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
           D DC        C  G C CG  C N+ FQ +    + +++TEK G G+ A+ D++  +F
Sbjct: 148 DSDCINRATKMECVDGDCNCGAGCQNQRFQRKQYADVSVIKTEKKGFGLRANTDLQVNDF 207

Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
           + EY+GEVI++ T   R+ K    G  +FY   + +   +DAT KGN  R+ NHSC PN 
Sbjct: 208 IFEYIGEVINEPTFRSRMIKYDKEGIKHFYFMSLTKSEFVDATKKGNLGRFCNHSCNPNC 267

Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
            + KW++  + R+GIFA R IK GE L ++Y  + +  D    YC
Sbjct: 268 YVDKWVVGDKLRMGIFAGRAIKAGEELVFNYNVDRYGADPQPCYC 312


>gi|134083984|emb|CAK49139.1| unnamed protein product [Aspergillus niger]
          Length = 734

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 91/164 (55%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG  C N+ FQ R    + +++TEK G G+ A+ D++  +F+
Sbjct: 187 DSDCINRATKIECMGDCGCGPECQNQRFQRREYAPVAVIKTEKKGFGLRAEADLRPHQFI 246

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EYVGEVI++     R+ +    G  +FY   +++   +DAT KGN  R+ NHSC PN  
Sbjct: 247 FEYVGEVINEGQFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCY 306

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA R I+ GE L ++Y  + +  D    YC
Sbjct: 307 VDKWVVGEKLRMGIFAERHIQAGEELVFNYNVDRYGADPQPCYC 350


>gi|121806929|sp|Q2UTN6.1|SET2_ASPOR RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
           specific; AltName: Full=SET domain-containing protein 2
 gi|83764935|dbj|BAE55079.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 965

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 59/164 (35%), Positives = 93/164 (56%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG  C N+ FQ +   ++ +++TEK G G+ A+ D++  +F+
Sbjct: 192 DSDCINRATKIECVGDCGCGPDCQNQRFQRKEYAQVAVIKTEKKGFGLRAEADLRPHQFI 251

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EYVGEVI++     R+ +    G  +FY   +++   +DAT +GN  R+ NHSC PN  
Sbjct: 252 YEYVGEVINEGQFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKRGNLGRFCNHSCNPNCY 311

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA RDI+ GE L ++Y  + +  D    YC
Sbjct: 312 VDKWVVGEKLRMGIFAERDIQAGEELVFNYNVDRYGADPQPCYC 355


>gi|389633989|ref|XP_003714647.1| histone-lysine N-methyltransferase [Magnaporthe oryzae 70-15]
 gi|351646980|gb|EHA54840.1| histone-lysine N-methyltransferase [Magnaporthe oryzae 70-15]
 gi|440474559|gb|ELQ43296.1| histone-lysine N-methyltransferase [Magnaporthe oryzae Y34]
 gi|440479738|gb|ELQ60486.1| histone-lysine N-methyltransferase [Magnaporthe oryzae P131]
          Length = 946

 Score =  119 bits (297), Expect = 2e-24,   Method: Composition-based stats.
 Identities = 59/153 (38%), Positives = 87/153 (56%), Gaps = 1/153 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC   +    C SG  CG+ C N+ FQ +    + +++TE  G G+ AD +++  +FV
Sbjct: 145 DSDCINRVTKIECVSG-NCGDGCQNQRFQRKQYANVSVIKTENKGYGLRADANLEPNDFV 203

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GEVI ++    RL K       +FY   + R   +DAT KGN  R+ NHSC PN  
Sbjct: 204 FEYIGEVIGEELFRSRLMKYDTQRLEHFYFMSLTRTEYVDATKKGNLGRFCNHSCNPNCY 263

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           + KW++  + R+GIFA R IK GE L ++Y  +
Sbjct: 264 VDKWVVGDKLRMGIFAMRAIKAGEELCFNYNVD 296


>gi|317455359|pdb|3OPE|A Chain A, Structural Basis Of Auto-Inhibitory Mechanism Of Histone
           Methyltransferase
 gi|317455360|pdb|3OPE|B Chain B, Structural Basis Of Auto-Inhibitory Mechanism Of Histone
           Methyltransferase
          Length = 222

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 69/191 (36%), Positives = 105/191 (54%), Gaps = 4/191 (2%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS 102
           Y  I+ N+Y+  +     E     C         G  D DC   M+ + CS + C CG  
Sbjct: 3   YKKIRSNVYVDVKPLSGYEATTCNCKKPDDDTRKGCVD-DCLNRMIFAECSPNTCPCGEQ 61

Query: 103 CLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
           C N+  Q    V+ ++  + E+ G GI   E +K G+F+IEY+GEV+ +Q    R+ +  
Sbjct: 62  CCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEVVSEQEFRNRMIEQY 121

Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
           H   ++ Y   ++  MVID+   GN++R+INHSC PN EMQKW ++G  RIG++A +D+ 
Sbjct: 122 H-NHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGLYALKDMP 180

Query: 222 KGENLTYDYQY 232
            G  LTYDY +
Sbjct: 181 AGTELTYDYNF 191


>gi|238503484|ref|XP_002382975.1| SET and WW domain protein [Aspergillus flavus NRRL3357]
 gi|220690446|gb|EED46795.1| SET and WW domain protein [Aspergillus flavus NRRL3357]
 gi|391863367|gb|EIT72678.1| clathrin coat binding protein [Aspergillus oryzae 3.042]
          Length = 947

 Score =  118 bits (296), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 59/164 (35%), Positives = 93/164 (56%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG  C N+ FQ +   ++ +++TEK G G+ A+ D++  +F+
Sbjct: 174 DSDCINRATKIECVGDCGCGPDCQNQRFQRKEYAQVAVIKTEKKGFGLRAEADLRPHQFI 233

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EYVGEVI++     R+ +    G  +FY   +++   +DAT +GN  R+ NHSC PN  
Sbjct: 234 YEYVGEVINEGQFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKRGNLGRFCNHSCNPNCY 293

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA RDI+ GE L ++Y  + +  D    YC
Sbjct: 294 VDKWVVGEKLRMGIFAERDIQAGEELVFNYNVDRYGADPQPCYC 337


>gi|121712644|ref|XP_001273933.1| SET and WW domain protein [Aspergillus clavatus NRRL 1]
 gi|119402086|gb|EAW12507.1| SET and WW domain protein [Aspergillus clavatus NRRL 1]
          Length = 968

 Score =  118 bits (296), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 63/174 (36%), Positives = 96/174 (55%), Gaps = 3/174 (1%)

Query: 73  SPGSSGVC--DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVA 130
           S G +  C  D DC        C   C CG +C N+ FQ R    + +++TEK G G+ A
Sbjct: 171 SAGKNLACGEDSDCINRATKIECVGDCSCGANCQNQRFQRREYANVAVIKTEKKGFGLRA 230

Query: 131 DEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRY 190
           + D++  +F+ EYVGEVI++     R+ +    G  +FY   +++   +DAT KGN  R+
Sbjct: 231 ETDLQPHQFIFEYVGEVINEGHFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKKGNLGRF 290

Query: 191 INHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
            NHSC PN  + KW++  + R+GIFA R I+ GE L ++Y  + +  D    YC
Sbjct: 291 CNHSCNPNCYVDKWVVGEKLRMGIFAERAIRAGEELVFNYNVDRYGADPQPCYC 344


>gi|410080658|ref|XP_003957909.1| hypothetical protein KAFR_0F01770 [Kazachstania africana CBS 2517]
 gi|372464496|emb|CCF58774.1| hypothetical protein KAFR_0F01770 [Kazachstania africana CBS 2517]
          Length = 711

 Score =  118 bits (296), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 53/132 (40%), Positives = 83/132 (62%)

Query: 99  CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
           CG +C N+ FQ +    + + QTE  G G+  + D++  EF+ EY+GEVID+++  +RL 
Sbjct: 99  CGENCQNQRFQKKQYADISVFQTEMKGYGVRCNTDLEANEFIYEYIGEVIDEESFRDRLI 158

Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
           K   +   +FY   +     IDAT KG   R+ NHSC PN  + KW+++G+ ++GIF+ R
Sbjct: 159 KYDEMHFKHFYFMMLQNGQFIDATRKGALGRFCNHSCNPNAYVNKWVVNGKLKMGIFSKR 218

Query: 219 DIKKGENLTYDY 230
            I+KGE +T+DY
Sbjct: 219 KIQKGEEITFDY 230


>gi|400596232|gb|EJP64008.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 900

 Score =  118 bits (296), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 61/188 (32%), Positives = 97/188 (51%), Gaps = 6/188 (3%)

Query: 62  EDDGIFCSCTA--SPGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
           E+D   C C      G +  C  D  C      +    +   CG  C N+ FQ +    +
Sbjct: 121 ENDAFDCDCREEWQDGVNMACGEDSDCINRATKMECSETAGNCGGGCQNQRFQRKHYADV 180

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            +++T+K G G+  D D+   +F+ EY+GEVI++ T   R+ +    G  +FY   +N++
Sbjct: 181 SVIKTDKKGYGLRTDADLAANDFIFEYIGEVINEATFRRRMLQYDQEGIKHFYFMSLNKN 240

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
             +DAT KGN  R+ NHSC PN  + KW++  + R+GIFA R +  GE L ++Y  + + 
Sbjct: 241 EFVDATRKGNLGRFCNHSCVPNCYVDKWVVGDKLRMGIFALRAVSAGEELVFNYNVDRYG 300

Query: 236 HDSLIAYC 243
            D    YC
Sbjct: 301 ADPQPCYC 308


>gi|453087448|gb|EMF15489.1| hypothetical protein SEPMUDRAFT_161660 [Mycosphaerella populorum
           SO2202]
          Length = 966

 Score =  118 bits (296), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 59/164 (35%), Positives = 92/164 (56%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG  C N+ FQ++    + +++TEK G G+ A+ +++  +F+
Sbjct: 173 DTDCINRATKMECVGDCGCGAGCQNRRFQHKQYANVSVIKTEKKGYGLRANTELQANDFI 232

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GEVI ++T   RL +    G  +FY   +++   +DAT KGN  R+ NHSC PN  
Sbjct: 233 FEYIGEVIGEKTFRNRLHQYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCY 292

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA R I  GE L ++Y  + +  D    YC
Sbjct: 293 VDKWVVGDKLRMGIFAERKIHAGEELVFNYNVDRYGADPQPCYC 336


>gi|403419215|emb|CCM05915.1| predicted protein [Fibroporia radiculosa]
          Length = 861

 Score =  118 bits (296), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 74/224 (33%), Positives = 112/224 (50%), Gaps = 19/224 (8%)

Query: 39  PKAIPYVF-----IKRNIYLTKRIKRRLED-DGIFCSCTASPGSSGVCD-----RDCHCG 87
           P+A P        I +N Y    + R  E  + + C C   PG     D      DC   
Sbjct: 111 PRAEPQALLTFEEIPQNHYQYGTLGRSRESLESMTCDCQYDPGMDDPDDACGHGSDCINR 170

Query: 88  MLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGE------FV 140
           +    C    C+C + C N+ FQ +    +++VQTEK G G+ A +D+ +G+      F+
Sbjct: 171 LTQVECLPDDCRCRSYCQNQRFQRKEYAPIEIVQTEKKGFGLRAAQDLHKGQPYYRDAFI 230

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+G+V+   +  +R+      G  +FY   + +D  IDAT +G   R+ NHSC PN  
Sbjct: 231 YEYLGDVVSQPSFLKRMRLYAEEGIRHFYFMMLQKDEFIDATKRGGIGRFANHSCNPNCY 290

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW +    R+GIFA RDIKK E LT++Y  + + HD+   YC
Sbjct: 291 VAKWTVGHHVRMGIFANRDIKKDEELTFNYNVDRYGHDAQPCYC 334


>gi|145479919|ref|XP_001425982.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124393054|emb|CAK58584.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1384

 Score =  118 bits (296), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 77/228 (33%), Positives = 110/228 (48%), Gaps = 18/228 (7%)

Query: 25  GNPVEFE-LPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCT-----------A 72
           GNP   + L D F       +  +K+N YL ++  R   D+ + CSC             
Sbjct: 32  GNPARLQGLKDSFK-----YFTLLKKNQYLERKQYRVSRDETLVCSCIMCPEDQIQSRPQ 86

Query: 73  SPGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVAD 131
            P  S  C   C      + C    C C   C N+ FQ      +  ++    G G+ A 
Sbjct: 87  GPQYSYNCGERCLNRFTCTECDVELCPCAEQCKNRRFQKHDDACVYPLRCGGKGMGLFAG 146

Query: 132 EDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYI 191
           E I +G+F+++YVGE+    +   R    ++   T  YL ++N   VID T KGN +R+I
Sbjct: 147 ERILKGQFIMQYVGEIFQINSAFGRRRVQEYSKSTCTYLMKLNNQEVIDPTSKGNLARFI 206

Query: 192 NHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSL 239
           NHSC PN   +KW + GE  IGIFA RDI + E LT+DYQ++  H  L
Sbjct: 207 NHSCEPNCITEKWNVLGEVCIGIFAIRDINEDEELTFDYQFDVFHTPL 254


>gi|317138698|ref|XP_001817081.2| histone-lysine N-methyltransferase, H3 lysine-36 specific
           [Aspergillus oryzae RIB40]
          Length = 849

 Score =  118 bits (296), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 59/164 (35%), Positives = 93/164 (56%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG  C N+ FQ +   ++ +++TEK G G+ A+ D++  +F+
Sbjct: 174 DSDCINRATKIECVGDCGCGPDCQNQRFQRKEYAQVAVIKTEKKGFGLRAEADLRPHQFI 233

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EYVGEVI++     R+ +    G  +FY   +++   +DAT +GN  R+ NHSC PN  
Sbjct: 234 YEYVGEVINEGQFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKRGNLGRFCNHSCNPNCY 293

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA RDI+ GE L ++Y  + +  D    YC
Sbjct: 294 VDKWVVGEKLRMGIFAERDIQAGEELVFNYNVDRYGADPQPCYC 337


>gi|290985403|ref|XP_002675415.1| predicted protein [Naegleria gruberi]
 gi|284089011|gb|EFC42671.1| predicted protein [Naegleria gruberi]
          Length = 438

 Score =  118 bits (295), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 72/210 (34%), Positives = 110/210 (52%), Gaps = 17/210 (8%)

Query: 29  EFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGM 88
           + +LPD F        V+I  N   T +   +   D   C CT     +G   + C   +
Sbjct: 186 KIKLPDDF--------VYIMSNERKTTKQSDKKHQD--HCHCT-----NGCIPKYCINYL 230

Query: 89  LLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADED-IKRGEFVIEYVGEV 147
               C + C CG++C N+ F      ++   Q++  G G+  ++D IK+G F+ EYVGEV
Sbjct: 231 SQIECPTDCNCGDNCQNRVFTKMAYPELICFQSDTKGIGVKCNQDVIKKGTFITEYVGEV 290

Query: 148 IDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIID 207
           I     E R  K  +    + Y   +N + +IDAT+ GN +R+INHSC PN   Q W ++
Sbjct: 291 ISVDKFETRT-KRSYKKSLHHYCMNMNENEIIDATWMGNIARFINHSCAPNARTQTWDVN 349

Query: 208 GETRIGIFATRDIKKGENLTYDYQYEFLHD 237
           G+ R+GIFA +DI KGE +TY+Y +   +D
Sbjct: 350 GQNRVGIFAIKDIVKGEEITYNYNFLIYND 379


>gi|118348872|ref|XP_001007909.1| SET domain containing protein [Tetrahymena thermophila]
 gi|89289676|gb|EAR87664.1| SET domain containing protein [Tetrahymena thermophila SB210]
          Length = 3527

 Score =  118 bits (295), Expect = 2e-24,   Method: Composition-based stats.
 Identities = 63/155 (40%), Positives = 89/155 (57%), Gaps = 1/155 (0%)

Query: 80  CDRDC-HCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGE 138
           C  +C +  +    C   C+CG  C N+ FQ      +  V T   G G+ A + I +G 
Sbjct: 168 CGENCWNRAVCTECCDLSCRCGELCQNRRFQKHQDACVYPVPTRGKGWGLCAGQFIPKGT 227

Query: 139 FVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPN 198
           F+I+Y GEV D  + E       +   T  YL +I+R+ VID TYKGN +R+INHSC PN
Sbjct: 228 FIIQYTGEVFDINSSEGIKRCKDYSRSTCTYLMKIDRNEVIDPTYKGNLARFINHSCDPN 287

Query: 199 TEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
              QKW + GE  IGIF+ +DI++ E LT+DYQ++
Sbjct: 288 CITQKWHVLGEICIGIFSIKDIQEDEELTFDYQFD 322


>gi|403214852|emb|CCK69352.1| hypothetical protein KNAG_0C02410 [Kazachstania naganishii CBS
           8797]
          Length = 708

 Score =  118 bits (295), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 58/153 (37%), Positives = 89/153 (58%), Gaps = 3/153 (1%)

Query: 81  DRDCHCGMLLSSCSSG---CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRG 137
           D DC   + L  C  G     CG +C N+ FQ +    + + QT+  G G++A EDI + 
Sbjct: 75  DSDCINRLTLIECVDGLCESTCGKNCQNQRFQRKQYADVMVFQTKMKGYGVLAKEDIDQH 134

Query: 138 EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
           +F+ EY+GEVID++   +R+         +FY   +     IDAT KG  +R+ NHSC P
Sbjct: 135 QFIYEYMGEVIDEEEFRDRMATYDEKKFKHFYFMMLQNGQFIDATMKGCLARFCNHSCSP 194

Query: 198 NTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
           N  + KW+++G+ ++GIFA+R I K E +T+DY
Sbjct: 195 NAYVNKWVVNGKLKMGIFASRKILKDEEITFDY 227


>gi|350634064|gb|EHA22428.1| hypothetical protein ASPNIDRAFT_51069 [Aspergillus niger ATCC 1015]
          Length = 962

 Score =  118 bits (295), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 91/164 (55%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG  C N+ FQ R    + +++TEK G G+ A+ D++  +F+
Sbjct: 187 DSDCINRATKIECMGDCGCGPECQNQRFQRREYAPVAVIKTEKKGFGLRAEADLRPHQFI 246

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EYVGEVI++     R+ +    G  +FY   +++   +DAT KGN  R+ NHSC PN  
Sbjct: 247 FEYVGEVINEGQFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCY 306

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA R I+ GE L ++Y  + +  D    YC
Sbjct: 307 VDKWVVGEKLRMGIFAERHIQAGEELVFNYNVDRYGADPQPCYC 350


>gi|358373289|dbj|GAA89888.1| SET and WW domain protein [Aspergillus kawachii IFO 4308]
          Length = 963

 Score =  117 bits (294), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 91/164 (55%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG  C N+ FQ R    + +++TEK G G+ A+ D++  +F+
Sbjct: 188 DSDCINRATKIECMGDCGCGPECQNQRFQRREYAPVAVIKTEKKGFGLRAEADLRPHQFI 247

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EYVGEVI++     R+ +    G  +FY   +++   +DAT KGN  R+ NHSC PN  
Sbjct: 248 FEYVGEVINEGQFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCY 307

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA R I+ GE L ++Y  + +  D    YC
Sbjct: 308 VDKWVVGEKLRMGIFAERHIQAGEELVFNYNVDRYGADPQPCYC 351


>gi|322697133|gb|EFY88916.1| histone H3 lysine 36 (K36) methyltransferase [Metarhizium acridum
           CQMa 102]
          Length = 895

 Score =  117 bits (294), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 59/174 (33%), Positives = 95/174 (54%), Gaps = 5/174 (2%)

Query: 62  EDDGIFCSCTA--SPGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
           E+D + C C      G +  C  D  C      +   +SG  CG  C N+ FQ +    +
Sbjct: 115 ENDALDCDCAEEFQDGENFACGEDSDCINRATKMECVASGSNCGGGCQNQRFQRKLWADV 174

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            +++T+K G G+  +  ++  +F+ EYVGEVI++ T   R+ +    G  +FY   +++ 
Sbjct: 175 AVIKTDKKGYGLRTESSLQPNDFIYEYVGEVINEPTFRRRMLQYDEEGIKHFYFMSLSKS 234

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
             +DAT KGN  R+ NHSC PN  + KW++  + R+GIFA R I+ GE L ++Y
Sbjct: 235 EFVDATRKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFALRKIQAGEELVFNY 288


>gi|324500453|gb|ADY40214.1| Histone-lysine N-methyltransferase lin-59 [Ascaris suum]
          Length = 1467

 Score =  117 bits (294), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 76/195 (38%), Positives = 109/195 (55%), Gaps = 10/195 (5%)

Query: 41  AIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCG 100
           A  Y+ IK N+ +      R+E   + CSC    GS G  D +C   M+   C++ C  G
Sbjct: 721 AKSYIKIKHNVIVDAYP--RIEQ--LQCSCQK--GSCGESD-ECLNRMVQMECNNSCGRG 773

Query: 101 NSCLNKP-FQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
             C NK  F+   V K+ L +T    G G+  D  +++G+FV EYVGEV+  +T + R  
Sbjct: 774 VHCSNKRIFRRECVDKLSLFETSNGRGLGVRTDVPLQKGQFVCEYVGEVVSMETFDAR-N 832

Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
              +    N Y   +    VIDA  KGN +R++NHSC PN EMQ+W ++G+ RIG+FA R
Sbjct: 833 AHSYRAFRNHYALNLCPGYVIDAYQKGNIARFVNHSCVPNCEMQRWSVNGQHRIGLFALR 892

Query: 219 DIKKGENLTYDYQYE 233
            + KGE LTYDY ++
Sbjct: 893 VVAKGEELTYDYNWD 907


>gi|317037088|ref|XP_001398411.2| histone-lysine N-methyltransferase, H3 lysine-36 specific
           [Aspergillus niger CBS 513.88]
          Length = 853

 Score =  117 bits (293), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 91/164 (55%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG  C N+ FQ R    + +++TEK G G+ A+ D++  +F+
Sbjct: 187 DSDCINRATKIECMGDCGCGPECQNQRFQRREYAPVAVIKTEKKGFGLRAEADLRPHQFI 246

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EYVGEVI++     R+ +    G  +FY   +++   +DAT KGN  R+ NHSC PN  
Sbjct: 247 FEYVGEVINEGQFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCY 306

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA R I+ GE L ++Y  + +  D    YC
Sbjct: 307 VDKWVVGEKLRMGIFAERHIQAGEELVFNYNVDRYGADPQPCYC 350


>gi|346321276|gb|EGX90876.1| histone H3 lysine 36 (K36) methyltransferase [Cordyceps militaris
           CM01]
          Length = 905

 Score =  117 bits (293), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 62/188 (32%), Positives = 98/188 (52%), Gaps = 6/188 (3%)

Query: 62  EDDGIFCSCTAS--PGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
           E+D   C C      G +  C  D  C      +    +   CG  C N+ FQ +    +
Sbjct: 122 ENDAFDCECRGDWHDGVNMACGEDSDCINRATKMECSETAGNCGGGCQNQRFQRKQYADV 181

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            +++TEK G G+  D ++   +F+ EY+GEVI++ T   R+ +    G  +FY   ++++
Sbjct: 182 CVIKTEKKGYGLRTDAELSAHDFIFEYIGEVINEATFRRRMLQYDQQGIKHFYFMSLSKN 241

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
             +DAT KGN  R+ NHSC PN  + KW+I  + R+GIFA R I+ GE L ++Y  + + 
Sbjct: 242 EFVDATRKGNLGRFCNHSCVPNCYVDKWVIGDKLRMGIFALRTIRAGEELVFNYNVDRYG 301

Query: 236 HDSLIAYC 243
            D    YC
Sbjct: 302 ADPQPCYC 309


>gi|212535752|ref|XP_002148032.1| SET and WW domain protein [Talaromyces marneffei ATCC 18224]
 gi|210070431|gb|EEA24521.1| SET and WW domain protein [Talaromyces marneffei ATCC 18224]
          Length = 900

 Score =  117 bits (293), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 59/164 (35%), Positives = 92/164 (56%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C + C CG  C N+ FQ +    + +++T K G G+ A+ DI+  +F+
Sbjct: 185 DSDCINRATRIECLNDCSCGQDCQNQRFQRKEYANVTVIKTAKKGFGLRAESDIQPHQFI 244

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GEVI++     R+ +    G  +FY   +N+   +DAT KGN +R+ NHSC PN  
Sbjct: 245 YEYIGEVINEANFRRRMIQYDKEGIKHFYFMSLNKGEFVDATKKGNLARFCNHSCNPNCY 304

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA R I+ GE L ++Y  + +  D    YC
Sbjct: 305 VDKWVVGEKLRMGIFAERYIQAGEELVFNYNVDRYGADPQPCYC 348


>gi|256084142|ref|XP_002578291.1| SET domain protein [Schistosoma mansoni]
          Length = 1746

 Score =  117 bits (292), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 71/207 (34%), Positives = 104/207 (50%), Gaps = 12/207 (5%)

Query: 33   PDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC---GML 89
            PD++ KP  + +      IY       RL D      C   P S   C    +C    + 
Sbjct: 950  PDYY-KPIKVNWPLGSVRIY-------RLTDPSEAPRCECKPNSEDPCGPSSNCINRELH 1001

Query: 90   LSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQT-EKCGAGIVADEDIKRGEFVIEYVGEVI 148
                 S C  G++C N+ F  R     +   T ++ G G+     I+ GEFV EY+G++I
Sbjct: 1002 YECLPSICPNGDACRNQRFTKRLYPPQRPFWTGDQRGWGLKTMIAIRAGEFVNEYIGDLI 1061

Query: 149  DDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDG 208
            D+     RL        TN+Y+ +++   +IDA  KGN SR++NHSC PN   QKW ++G
Sbjct: 1062 DEDEANRRLRFAHENNITNYYMMKLDSQRIIDAGPKGNLSRFMNHSCDPNLNTQKWTVNG 1121

Query: 209  ETRIGIFATRDIKKGENLTYDYQYEFL 235
            + RIG+FA RDI  GE LT++Y +  L
Sbjct: 1122 DNRIGLFAVRDISVGEELTFNYNFVAL 1148


>gi|393219563|gb|EJD05050.1| hypothetical protein FOMMEDRAFT_105290 [Fomitiporia mediterranea
           MF3/22]
          Length = 872

 Score =  117 bits (292), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 68/205 (33%), Positives = 105/205 (51%), Gaps = 8/205 (3%)

Query: 47  IKRNIYLTKRIKRRLED-DGIFCSCTASPGSSG---VCD--RDCHCGMLLSSCSSG-CKC 99
           ++ N Y    + R  E  + + C C   PG       C    DC   +    C  G C+C
Sbjct: 100 LEENYYQYSTLGRSREALESMTCDCQFVPGRDAPEMACGYGSDCINRLTQVECEEGDCRC 159

Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
              C N+  Q +    +++V TEK G G+ A  DI +  F+ EY+G+V+   +  +R+ +
Sbjct: 160 RGYCRNQRLQRKEYANVEIVLTEKKGFGLRAGSDITKDAFIYEYIGDVVSQPSFAKRMRE 219

Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
               G  +FY   + +D  IDAT +G K R+ NHSC PN  + KW +    R+GI+A R+
Sbjct: 220 YAEEGIRHFYFMMLQKDEFIDATKRGGKGRFANHSCNPNCYVAKWTVGDRVRMGIYAKRN 279

Query: 220 IKKGENLTYDYQYE-FLHDSLIAYC 243
           IKK E LT++Y  + + HD+   YC
Sbjct: 280 IKKDEELTFNYNVDRYGHDAQPCYC 304


>gi|353232109|emb|CCD79464.1| putative set domain protein [Schistosoma mansoni]
          Length = 1503

 Score =  117 bits (292), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 71/207 (34%), Positives = 104/207 (50%), Gaps = 12/207 (5%)

Query: 33   PDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC---GML 89
            PD++ KP  + +      IY       RL D      C   P S   C    +C    + 
Sbjct: 950  PDYY-KPIKVNWPLGSVRIY-------RLTDPSEAPRCECKPNSEDPCGPSSNCINRELH 1001

Query: 90   LSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQT-EKCGAGIVADEDIKRGEFVIEYVGEVI 148
                 S C  G++C N+ F  R     +   T ++ G G+     I+ GEFV EY+G++I
Sbjct: 1002 YECLPSICPNGDACRNQRFTKRLYPPQRPFWTGDQRGWGLKTMIAIRAGEFVNEYIGDLI 1061

Query: 149  DDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDG 208
            D+     RL        TN+Y+ +++   +IDA  KGN SR++NHSC PN   QKW ++G
Sbjct: 1062 DEDEANRRLRFAHENNITNYYMMKLDSQRIIDAGPKGNLSRFMNHSCDPNLNTQKWTVNG 1121

Query: 209  ETRIGIFATRDIKKGENLTYDYQYEFL 235
            + RIG+FA RDI  GE LT++Y +  L
Sbjct: 1122 DNRIGLFAVRDISVGEELTFNYNFVAL 1148


>gi|310793120|gb|EFQ28581.1| SET domain-containing protein [Glomerella graminicola M1.001]
          Length = 940

 Score =  116 bits (291), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 64/188 (34%), Positives = 104/188 (55%), Gaps = 6/188 (3%)

Query: 62  EDDGIFCSCTAS--PGSSGVC--DRDCHCGMLLSSCSSGC-KCGNSCLNKPFQNRPVKKM 116
           ++D + C C +    G +  C  D DC        C +G   CG+ C N+ FQ +    +
Sbjct: 112 DNDALDCDCRSEWHDGENLSCGEDSDCINRATKMECVAGAGNCGDGCQNQRFQRKQYANV 171

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            +++TEK G G+ A+ D++  +F+ EY+GEVI++ T   R+ +    G  +FY   + + 
Sbjct: 172 SVIKTEKKGFGLRANVDLQANDFIFEYIGEVINEPTFRRRMVQYDDEGIKHFYFMSLTKH 231

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
             +DAT KGN  R+ NHSC PN  + KW++  + R+GIF++R IK GE L ++Y  + + 
Sbjct: 232 EFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFSSRLIKAGEELVFNYNVDRYG 291

Query: 236 HDSLIAYC 243
            D    YC
Sbjct: 292 ADPQPCYC 299


>gi|406606115|emb|CCH42475.1| Histone-lysine N-methyltransferase [Wickerhamomyces ciferrii]
          Length = 815

 Score =  116 bits (291), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 61/174 (35%), Positives = 98/174 (56%), Gaps = 8/174 (4%)

Query: 65  GIFCSCTASPG-------SSGVCDRDC-HCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
           G F SC   P        +    D DC + G  +   +    CG  C N+ FQ +    +
Sbjct: 109 GEFMSCECHPDWDESNELNYACIDDDCINRGTKIECINGVSSCGEDCSNQRFQKKQYVDI 168

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            ++QTEK G G+ +  DI+   F+ EY+GEVID+++ ++R+ +   L   +FY   + + 
Sbjct: 169 NVIQTEKKGYGVRSQIDIEPDTFIYEYIGEVIDEKSFKKRMLEYDELNFKHFYFMMLQKG 228

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
             +DAT KG+ +R+ NHSC PN  ++KW++  + ++GIFA R I KGE +T+DY
Sbjct: 229 EFLDATRKGSLARFCNHSCNPNCYVEKWVVGEKLKMGIFAKRKILKGEEITFDY 282


>gi|195128035|ref|XP_002008472.1| GI13513 [Drosophila mojavensis]
 gi|193920081|gb|EDW18948.1| GI13513 [Drosophila mojavensis]
          Length = 2416

 Score =  116 bits (291), Expect = 7e-24,   Method: Composition-based stats.
 Identities = 67/214 (31%), Positives = 110/214 (51%), Gaps = 12/214 (5%)

Query: 27   PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC 86
            P    +P W        Y  I+ N+Y           D   C+C   P  +G C  +C  
Sbjct: 1468 PTRHVVPSW-------NYRKIRTNVYAESVRPNLAGFDHPNCNCKPQP--NGACLDNCLN 1518

Query: 87   GMLLSSC-SSGCKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADEDIKRGEFVIEYV 144
             M+ + C  + C  G+ C N+  Q   V   ++   T   G G+     I +G +++EYV
Sbjct: 1519 RMVYTECVPNNCPAGDKCRNQKIQRHEVAPGVERFMTADKGWGVRTKLPIAKGTYILEYV 1578

Query: 145  GEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKW 204
            GEV+ ++  ++R+  + +L +T+ Y   ++  +VID    G+  R++NHSC PN EMQKW
Sbjct: 1579 GEVVTEREFKQRMATI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKW 1637

Query: 205  IIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             ++G +R+ +FA R I++GE LTYDY +   + S
Sbjct: 1638 SVNGLSRMVLFAKRAIEEGEELTYDYNFSLFNPS 1671


>gi|242793892|ref|XP_002482258.1| SET and WW domain protein [Talaromyces stipitatus ATCC 10500]
 gi|218718846|gb|EED18266.1| SET and WW domain protein [Talaromyces stipitatus ATCC 10500]
          Length = 891

 Score =  116 bits (291), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 59/164 (35%), Positives = 92/164 (56%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C + C CG  C N+ FQ +    + +++T K G G+ A+ DI   +F+
Sbjct: 176 DSDCINRATRIECLNDCSCGQDCQNQRFQRKEYANVTVIKTAKKGFGLRAESDIHPHQFI 235

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GEVI++ +   R+ +    G  +FY   +N+   +DAT KGN +R+ NHSC PN  
Sbjct: 236 YEYIGEVINEVSFRRRMIQYDKEGIKHFYFMSLNKGEFVDATKKGNLARFCNHSCNPNCY 295

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA R I+ GE L ++Y  + +  D    YC
Sbjct: 296 VDKWVVGEKLRMGIFAERYIQAGEELVFNYNVDRYGADPQPCYC 339


>gi|388581385|gb|EIM21694.1| SET domain-containing protein [Wallemia sebi CBS 633.66]
          Length = 681

 Score =  116 bits (291), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 67/190 (35%), Positives = 101/190 (53%), Gaps = 5/190 (2%)

Query: 56  RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVK 114
           R K+ LE+   +C C  + G  G    DC   +    C    CK G  C N+ FQ +   
Sbjct: 30  RTKQNLEE--AYCDCYLTGGECG-NHSDCINRLTQVECLLDDCKTGPQCNNQRFQRKQWA 86

Query: 115 KMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEIN 174
            + +++TEK G G+ A+ D+ R  F+IEY+GEV+       R+      G  +FY   + 
Sbjct: 87  NIDIIKTEKKGYGLRANVDLDRDTFLIEYIGEVVTQTQFLRRMNTYSKEGIKHFYFMMLQ 146

Query: 175 RDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE- 233
            +  IDAT +GN  R+ NHSC PN  + KW++    ++GIF  R I+KGE LT++Y  + 
Sbjct: 147 NEEFIDATRRGNIGRFANHSCAPNCFVSKWVVGKYVKMGIFTKRKIEKGEELTFNYNVDR 206

Query: 234 FLHDSLIAYC 243
           + HD+   YC
Sbjct: 207 YGHDAQPCYC 216


>gi|328767162|gb|EGF77213.1| hypothetical protein BATDEDRAFT_91931 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 779

 Score =  116 bits (290), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 72/234 (30%), Positives = 110/234 (47%), Gaps = 30/234 (12%)

Query: 28  VEFELP-----------------DWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSC 70
           V+FELP                   F + +A P++ I++N+++ +R      D    C C
Sbjct: 469 VDFELPYDLQHFVKICNGVEGMRTLFNEEEASPFIHIQKNVWVDRRPLDVPFDFESICQC 528

Query: 71  TASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQ-NRPVKKMKLVQTEKCGAGIV 129
                 S  C+           C   C  G+ C N+ F+ N  ++K+++      G G+ 
Sbjct: 529 KLPKDGSPPCE-----------CGDNCPMGDCCTNRAFRRNEQIEKLRVFYAPNRGFGLY 577

Query: 130 ADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSR 189
            D  IK G  +IEY GE+I    C ER   + + G+ N Y  E    +V+D   KG  +R
Sbjct: 578 TDVPIKAGVLIIEYRGEIISTAKCIERNDTI-YSGQKNHYFLEYGNGLVLDGCRKGTIAR 636

Query: 190 YINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
           + NHSC PN  ++KW +  E R+GIFAT +I  G  LTYDY+++        YC
Sbjct: 637 FANHSCDPNCHVEKWYVGTEFRVGIFATNNISVGSELTYDYRFDSYGQMQPCYC 690


>gi|392563363|gb|EIW56542.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
          Length = 785

 Score =  116 bits (290), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 65/193 (33%), Positives = 100/193 (51%), Gaps = 7/193 (3%)

Query: 58  KRRLEDDGIFCSCTASPG---SSGVCDRDCHCGMLLSSCS---SGCKCGNSCLNKPFQNR 111
           K R   + + C C    G       C  D  C   L+        C+C + C N+ FQ +
Sbjct: 61  KSREALESMMCDCQYEHGIDDPDTACGHDSDCINRLTQVECLPEDCRCRSYCRNQRFQRK 120

Query: 112 PVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLC 171
               +++V+TE  G G+ A E+I +  F+ EYVG+V+   +  +R+ +    G  +FY  
Sbjct: 121 QYAPIEIVKTEMKGFGLRAGENIPKDAFIYEYVGDVVSQPSFLKRMRQYAEEGIRHFYFM 180

Query: 172 EINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQ 231
            + +D  IDAT +G   R+ NHSC PN  + KW +    R+GIFA R IK+ E LT++Y 
Sbjct: 181 MLQKDEYIDATKRGGIGRFANHSCNPNCYVAKWTVGDHVRMGIFANRTIKQHEELTFNYN 240

Query: 232 YE-FLHDSLIAYC 243
            + + HD+ I YC
Sbjct: 241 VDRYGHDAQICYC 253


>gi|324500837|gb|ADY40382.1| Histone-lysine N-methyltransferase SETD2 [Ascaris suum]
          Length = 950

 Score =  116 bits (290), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 71/198 (35%), Positives = 103/198 (52%), Gaps = 9/198 (4%)

Query: 43  PYVFIKRNIYL-TKRIKRRLEDDGIFCSCTASP-GSSGVCDRDCHCGMLLSSCSSGCKCG 100
           P++ I+ N+ L  + + +       FC  T       G C  +C   ML + C S C  G
Sbjct: 99  PFIAIEENVILCDESLIKEAAVVRCFCEPTEEELAVGGGCGENCINRMLYTECGSRCPSG 158

Query: 101 NSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKM 160
             C N+   N+   K+++      G G+ A E ++ G+FV+EYVGEVI  +    R+ + 
Sbjct: 159 ARCSNRRLHNKEYAKVEVFYAGVKGWGLRACEPLEPGQFVMEYVGEVISAEEMRRRVRRY 218

Query: 161 ----KHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
               KH+   + YL  +    VIDAT +GN SR+INHSC PN   +KW +D   RIG FA
Sbjct: 219 GRDPKHV---HHYLMALKNGAVIDATIRGNISRFINHSCDPNCRSEKWTVDRRVRIGFFA 275

Query: 217 TRDIKKGENLTYDYQYEF 234
           T+ +  GE L +DYQ E 
Sbjct: 276 TKKVAVGEELVFDYQLEL 293


>gi|198435574|ref|XP_002121834.1| PREDICTED: absent, small, or homeotic discs 1 homolog [Ciona
            intestinalis]
          Length = 2850

 Score =  116 bits (290), Expect = 9e-24,   Method: Composition-based stats.
 Identities = 76/235 (32%), Positives = 121/235 (51%), Gaps = 21/235 (8%)

Query: 15   HAFNKLLKQIGNPVEFELP--DWF------IKPK---AIPYVFIKRNIYLTKRIKRRLED 63
            HA N LLK   +  +F+LP   W+      I P    A  ++ I++N+Y+  +     E+
Sbjct: 2007 HAGNFLLK---SRKDFQLPFNIWWMYNRKLISPSQDLATQFIKIEKNVYVDSQPTCEQEE 2063

Query: 64   DGIFCSCTAS--PGSSGV--CDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPV-KKMK 117
                C   +     SS V  C ++C   ++   CS   C C + C N+  Q +   K ++
Sbjct: 2064 HVCVCQTLSDIHSLSSDVHGCGKECLNRLMYIECSPDTCPCQDKCANRCIQKQQWWKDLE 2123

Query: 118  LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
              +T   G G+  + DI  G+F++EYVGEV+ ++    R  +  +    + Y  ++    
Sbjct: 2124 RFRTNDRGWGVRTNSDIPEGQFLLEYVGEVVSEREFRRRTIE-NYNAHNDHYCVQLEAGT 2182

Query: 178  VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            VID     N+ R++NHSC PN EMQKW+++GE R+G+FA R I   E LTYDY +
Sbjct: 2183 VIDGYRLANEGRFVNHSCQPNCEMQKWVVNGEYRVGLFAKRPIVSSEELTYDYNF 2237


>gi|195037347|ref|XP_001990122.1| GH19166 [Drosophila grimshawi]
 gi|193894318|gb|EDV93184.1| GH19166 [Drosophila grimshawi]
          Length = 1434

 Score =  116 bits (290), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 67/196 (34%), Positives = 105/196 (53%), Gaps = 5/196 (2%)

Query: 43   PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG-CKC 99
            PY+ IK N  +        E+    C C   P S   C  +  C   +L   C+   C  
Sbjct: 1162 PYIKIKANKAVPPVRFVYSEESQNICVC--KPDSEHPCGPESGCLNRILYHECNPKYCNA 1219

Query: 100  GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
            G  C N+ F+ R   ++++V   + G G+V  E I  G+F+IEYVGEVI+    + R+ +
Sbjct: 1220 GKKCENQLFELRKSPRLEVVYMNERGFGLVCREAIAEGDFIIEYVGEVINHAEFQRRVAQ 1279

Query: 160  MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
              +  + NFY   + +D +IDA  KGN +R++NHSC PN   QKW ++   R+G+FA +D
Sbjct: 1280 KTNDRDENFYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCATQKWTVNCINRVGLFAIKD 1339

Query: 220  IKKGENLTYDYQYEFL 235
            I +   LT++Y ++ L
Sbjct: 1340 IPENTELTFNYLWDDL 1355


>gi|385301625|gb|EIF45804.1| putative histone lysine methyltransferase set2p [Dekkera
           bruxellensis AWRI1499]
          Length = 695

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 72/194 (37%), Positives = 102/194 (52%), Gaps = 8/194 (4%)

Query: 44  YVFIKRNIYLTKRIK--RRLEDDGIFCSCTAS--PGSSGVCDRDCHCGMLLSS--CSSG- 96
           +  +K N Y+  R+K      ++ + C CT     G +  C  D  C   L++  C  G 
Sbjct: 42  FQLVKHNEYID-RLKGVYSAANEVMTCDCTERRVAGVNVACGADSDCINRLTNVECVDGE 100

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C CG  C N+ FQ   +  + +  TE  G G+ A+ DI    F+IEY GEVID+Q  + R
Sbjct: 101 CSCGXRCQNQRFQRNQMADISIFLTEHKGYGMRANCDIPANTFIIEYKGEVIDEQAYKIR 160

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
                  G  +FY   I    +IDAT KG+  R+ NHSC PN   +KW+++   R+GIFA
Sbjct: 161 KEAYAKEGIKHFYFMMIQDGQIIDATKKGSLGRFCNHSCDPNAYXEKWVVNKRYRMGIFA 220

Query: 217 TRDIKKGENLTYDY 230
            R I KGE +T+DY
Sbjct: 221 KRKIPKGEEITFDY 234


>gi|322709340|gb|EFZ00916.1| histone H3 lysine 36 (K36) methyltransferase [Metarhizium
           anisopliae ARSEF 23]
          Length = 894

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 58/174 (33%), Positives = 95/174 (54%), Gaps = 5/174 (2%)

Query: 62  EDDGIFCSCTA--SPGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
           ++D + C C      G +  C  D  C      +   +SG  CG  C N+ FQ +    +
Sbjct: 115 DNDALDCDCAEEFQDGENFACGEDSDCINRATKMECVASGSNCGGGCQNQRFQRKLWADV 174

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            +++T+K G G+  +  ++  +F+ EYVGEVI++ T   R+ +    G  +FY   +++ 
Sbjct: 175 AVIKTDKKGYGLRTESPLQPNDFIYEYVGEVINEPTFRRRMLQYDEEGIKHFYFMSLSKS 234

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
             +DAT KGN  R+ NHSC PN  + KW++  + R+GIFA R I+ GE L ++Y
Sbjct: 235 EFVDATRKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFALRKIQAGEELVFNY 288


>gi|171686378|ref|XP_001908130.1| hypothetical protein [Podospora anserina S mat+]
 gi|170943150|emb|CAP68803.1| unnamed protein product [Podospora anserina S mat+]
          Length = 919

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 61/186 (32%), Positives = 97/186 (52%), Gaps = 6/186 (3%)

Query: 64  DGIFCSCTAS--PGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKL 118
           D + C CT       +  C  D  C      +      C CG  C N+ FQ +    + +
Sbjct: 122 DALDCDCTEEWRNDENHACGEDSDCINRATKIECVDGDCNCGPGCQNQRFQRKQYADVSV 181

Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
           ++T+K G G+ A+ +++  +F+ EY+GEVI++ T   R+ K    G  +FY   + +   
Sbjct: 182 IKTDKKGFGLRANRNLQPNDFIFEYIGEVINEPTFRNRMIKYDREGIKHFYFMSLTKSEF 241

Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHD 237
           +DAT KGN  R+ NHSC PN  + KW++  + R+GIFA R I+ GE L ++Y  + +  D
Sbjct: 242 VDATKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFAGRPIRAGEELVFNYNVDRYGAD 301

Query: 238 SLIAYC 243
               YC
Sbjct: 302 PQPCYC 307


>gi|346971461|gb|EGY14913.1| histone-lysine N-methyltransferase [Verticillium dahliae VdLs.17]
          Length = 913

 Score =  115 bits (288), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 61/188 (32%), Positives = 98/188 (52%), Gaps = 6/188 (3%)

Query: 62  EDDGIFCSCTA--SPGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
           ++D + C C      G +  CD D  C      +        CG+ C N+ FQ +    +
Sbjct: 112 DNDALDCDCREEWQNGENHSCDEDSDCINRATKMECVMGAGNCGDGCQNQRFQRKAYADV 171

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            +++TEK G G+ A+ D++  +F+ EY+GEVI++ T   R+ +    G  +FY   + + 
Sbjct: 172 SVIKTEKKGFGLRANVDMQANDFIFEYIGEVINEPTFRRRMVQYDDEGIKHFYFMSLTKH 231

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
             +DAT KGN  R+ NHSC PN  + KW++  + R+GIF  R I  GE L ++Y  + + 
Sbjct: 232 EFVDATKKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFTLRGIAAGEELVFNYNVDRYG 291

Query: 236 HDSLIAYC 243
            D    YC
Sbjct: 292 ADPQPCYC 299


>gi|397568484|gb|EJK46160.1| hypothetical protein THAOC_35187 [Thalassiosira oceanica]
          Length = 473

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 73/213 (34%), Positives = 109/213 (51%), Gaps = 10/213 (4%)

Query: 27  PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGV--CDRDC 84
           PV+F+  +   KP A  Y  +  N Y      RR    G  C C  S G  GV  C   C
Sbjct: 203 PVDFKE-EVHSKPPA--YAHVNSNRYDPDNRPRRSIFSGEKCRCKPS-GEDGVPSCGERC 258

Query: 85  HCGM-LLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEY 143
              +     C++ C  G +C N+    R   K ++ +    G G+++ + +K G+ VIEY
Sbjct: 259 DNRLNYFECCAANCDLGPNCGNRAMGRRRFAKCRVQREHGKGWGLISVDGVKSGDLVIEY 318

Query: 144 VGEVIDDQTCEERL--WKMKHLGETNFYLCEINR-DMVIDATYKGNKSRYINHSCCPNTE 200
            GEVID+ T E RL  W   H  + NFY+  + +    IDA +  N++R++NHSC PN  
Sbjct: 319 AGEVIDESTKESRLAAWTRDHPTDPNFYVMALGQAGWYIDARHVANQARFVNHSCDPNCR 378

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           +    + G  R+ I A RD++ GE L+YDYQ++
Sbjct: 379 LVPLNVAGHMRVAIVAVRDVRPGEFLSYDYQFD 411


>gi|380012561|ref|XP_003690348.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
           N-methyltransferase NSD2-like [Apis florea]
          Length = 1208

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 64/149 (42%), Positives = 82/149 (55%), Gaps = 1/149 (0%)

Query: 66  IFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKC 124
           + C C     +      DC   +LL  CS G C  G  C N+ F  R    M+   T   
Sbjct: 788 VACDCDPEWENPCAPGTDCLNRILLVECSPGICPAGPKCNNQAFVRRQYPAMEPFHTVGR 847

Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
           G G+ + E IK G+FVIEYVGEVID+   + RL + K L   NFY   I+ +  IDA  K
Sbjct: 848 GWGLRSLEHIKAGQFVIEYVGEVIDEAEYKRRLHRKKELKNENFYFLTIDNNRTIDAEPK 907

Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIG 213
           GN SR++NHSC PN E QKW ++G+TRI 
Sbjct: 908 GNLSRFMNHSCSPNCETQKWTVNGDTRIA 936


>gi|255725552|ref|XP_002547705.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240135596|gb|EER35150.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 782

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 58/155 (37%), Positives = 90/155 (58%), Gaps = 3/155 (1%)

Query: 79  VCDRDCHCGMLLSSC---SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIK 135
            C  D +C   ++S    +  C CG  C N+ FQ +   K+ + QTE  G G+ ++E I 
Sbjct: 109 ACGEDSNCINRITSVECINKHCSCGEDCENQRFQKKLYSKVSVFQTELKGYGLKSEEIIN 168

Query: 136 RGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSC 195
             +F+ EY+GEVI++    +R+ +       +FY   ++ D  IDAT KG  +R+ NHSC
Sbjct: 169 ENQFIYEYIGEVINETDFRKRMIEYDLENYKHFYFMMLDHDSFIDATKKGCLARFANHSC 228

Query: 196 CPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
            PN  + KW++  + R+GIFA R I+KGE +T+DY
Sbjct: 229 NPNAYVDKWVVGDKLRMGIFAKRRIEKGEEITFDY 263


>gi|345565974|gb|EGX48921.1| hypothetical protein AOL_s00079g142 [Arthrobotrys oligospora ATCC
           24927]
          Length = 1227

 Score =  115 bits (287), Expect = 2e-23,   Method: Composition-based stats.
 Identities = 69/232 (29%), Positives = 115/232 (49%), Gaps = 24/232 (10%)

Query: 30  FELPDWFIKPKAIPYVFIKRNI--------YLTKRIKRRLEDDGIFCSCTASPGSSGVCD 81
           F L  W + PK   +  +KRNI        Y  ++  R+       C+CT     SG C 
Sbjct: 259 FNLDPW-VYPKPPGWKSLKRNIFVGEAAELYKQQKNSRKNAYQSTMCNCTPE---SGGCG 314

Query: 82  RDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVK---------KMKLVQTEKCGAGIVADE 132
             C   ++   C         C N+PF++  ++          ++++ T+  G G+ A  
Sbjct: 315 PGCLNRVMYYECDKNNCSLKDCQNRPFRDLAIRVENDTWFDDGIEIILTQDKGYGLRACR 374

Query: 133 DIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYIN 192
                + ++EY+GE+I    CEERL       E+ +YL E +  ++IDAT +G+ +R++N
Sbjct: 375 SFGPNQIIVEYIGEIITQDECEERLHGPYKDNES-YYLMEFDNSLIIDAT-RGSLARFVN 432

Query: 193 HSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEFLHDSLIAYC 243
           HSC PN +M+KW++ G+ R+ +FA  + I+ GE LTYDY + +        C
Sbjct: 433 HSCSPNCKMEKWMVAGQPRMALFAGDEGIEVGEELTYDYNFSWFSGGSTQLC 484


>gi|256074584|ref|XP_002573604.1| huntingtin interacting protein-related [Schistosoma mansoni]
          Length = 1575

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 60/141 (42%), Positives = 82/141 (58%), Gaps = 1/141 (0%)

Query: 93  CSSGCKCGNSCLNKPFQNRPVKKMKLVQTEK-CGAGIVADEDIKRGEFVIEYVGEVIDDQ 151
           C   C  G+ C N+ FQ R     +     K  G G++A +++K+G FVIEYVGEVID  
Sbjct: 431 CGLHCAAGDFCSNRQFQMRLYAPTRPFYAGKDKGWGLMATDNVKKGSFVIEYVGEVIDFS 490

Query: 152 TCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETR 211
               R+ + + LG  + Y   +  D  IDA  KGN +R++NHSC PN   QKW ++GE R
Sbjct: 491 EFRRRIRRYERLGHAHHYFMAVESDRFIDAGSKGNWARFVNHSCEPNCVTQKWSVNGEIR 550

Query: 212 IGIFATRDIKKGENLTYDYQY 232
           IG FA  DI  G+ +T DYQ+
Sbjct: 551 IGFFAKEDIPSGQEVTIDYQF 571


>gi|312374857|gb|EFR22332.1| hypothetical protein AND_15429 [Anopheles darlingi]
          Length = 579

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 68/202 (33%), Positives = 101/202 (50%), Gaps = 18/202 (8%)

Query: 44  YVFIKRNIYL------TKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SG 96
           Y  I RN Y       ++R + + E     C+C+   G     +  C        C+   
Sbjct: 371 YTKIYRNRYTASLKESSRRSRIKEESMDSVCNCSMDDGDPCGANSSCINRFTFIECNPKT 430

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C     C N+ F  R    +++ + E  G G+V  ED+  G+F+IEYVGEVI+ +    R
Sbjct: 431 CPAKERCSNQRFAKRIYPDLEVRKFEDRGHGLVTKEDLTAGQFIIEYVGEVINKKEFGRR 490

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           L  M+   E ++Y  E++ +++IDA  KGN +R++NHSC PN E QKW +D    IG+FA
Sbjct: 491 LNYMQQQNEQHYYFLEVDSEIIIDAGPKGNLARFVNHSCEPNCETQKWTVDNMRVIGVFA 550

Query: 217 TRDIKKGENLTYDYQYEFLHDS 238
            +DIK             LHDS
Sbjct: 551 LKDIKA-----------MLHDS 561


>gi|210075797|ref|XP_002143056.1| YALI0D21684p [Yarrowia lipolytica]
 gi|199425837|emb|CAG81317.4| YALI0D21684p [Yarrowia lipolytica CLIB122]
          Length = 1284

 Score =  115 bits (287), Expect = 2e-23,   Method: Composition-based stats.
 Identities = 77/232 (33%), Positives = 116/232 (50%), Gaps = 28/232 (12%)

Query: 29  EFELPDWFI-------KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCD 81
           +F+LP W I       KPK+  +  IKRNI++     +      I C+C         CD
Sbjct: 152 DFQLP-WNIYCPTGNVKPKS--WKSIKRNIFIDVEPPKPSTLKSI-CACVDR------CD 201

Query: 82  RDCHCGMLLSSC-SSGCKCGN-SCLNKPF---QNRPVK----KMKLVQTEKCGAGIVADE 132
             C   ML   C SS C  G+  C N+ F   +NR  +     ++++ TE  G G+ A  
Sbjct: 202 ESCLNKMLQFECDSSNCPFGDVDCGNREFGLLKNRTSRFYHQAIEIIHTESKGYGLRACR 261

Query: 133 DIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYIN 192
               GE V EY G+VI     + RL   ++    N+Y   + +  VID+  +G+ +R++N
Sbjct: 262 SFAPGELVCEYTGDVISPSEIKRRL-DNEYADSDNYYFLNLEKGAVIDSGLRGSAARFVN 320

Query: 193 HSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEFLHDSLIAYC 243
           HSC PN EMQKW + G  R+G+FA  D ++ G+ LTYDY + +   +    C
Sbjct: 321 HSCMPNCEMQKWFVKGLPRVGLFAGEDGVQNGQELTYDYNFVWFEGAKAQTC 372


>gi|50285531|ref|XP_445194.1| hypothetical protein [Candida glabrata CBS 138]
 gi|74638008|sp|Q6FX50.1|SET2_CANGA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
           specific; AltName: Full=SET domain-containing protein 2
 gi|49524498|emb|CAG58098.1| unnamed protein product [Candida glabrata]
          Length = 716

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 63/174 (36%), Positives = 95/174 (54%), Gaps = 6/174 (3%)

Query: 63  DDGIFCSCTA--SPGSSGVC--DRDCHCGMLLSSCSSGC--KCGNSCLNKPFQNRPVKKM 116
           +D + C C      G +  C  D DC   + L  C +     CGN C N+ FQ +    +
Sbjct: 81  NDFMECDCYEDFQNGKNHACGEDSDCINRLTLIECVNDLCGTCGNDCANQRFQKKEYANI 140

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            + +T+  G G+ A+ DI+  +F+ EY GEVI+++   +RL         +FY   +   
Sbjct: 141 AVFKTKMKGYGVRAESDIEINDFIYEYKGEVIEEEEFRDRLVDYDQKKFRHFYFMMLQSG 200

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
             IDAT KG+ +R+ NHSC PN  + KW++ G+ R+GIFA R I KGE +T+DY
Sbjct: 201 EFIDATIKGSLARFCNHSCNPNAYVNKWVVAGKLRMGIFAKRKILKGEEITFDY 254


>gi|358381100|gb|EHK18776.1| hypothetical protein TRIVIDRAFT_57632 [Trichoderma virens Gv29-8]
          Length = 924

 Score =  114 bits (286), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 57/172 (33%), Positives = 92/172 (53%), Gaps = 5/172 (2%)

Query: 64  DGIFCSCTAS--PGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKL 118
           D + C C      G +  C  D  C      +   +    CG  C N+ FQ +    + +
Sbjct: 121 DALDCDCREEWRDGENFACGEDSDCINRATKMECSADAGNCGGGCQNRRFQRKQYADVTV 180

Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
           ++TEK G G+  +  ++  +F+ EY+GEVI++ T   R+ +    G  +FY   +N++  
Sbjct: 181 IKTEKKGFGLRTNSALEPNDFIYEYIGEVINEPTFRRRMLQYDEEGIKHFYFMSLNKNEF 240

Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
           +DAT KGN  R+ NHSC PN  + KW++  + R+GIFA R I+ GE L ++Y
Sbjct: 241 VDATKKGNLGRFCNHSCSPNCFVDKWVVGDKLRMGIFALRKIRAGEELVFNY 292


>gi|19115892|ref|NP_594980.1| histone lysine methyltransferase Set2 [Schizosaccharomyces pombe
           972h-]
 gi|74626626|sp|O14026.1|SET2_SCHPO RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
           specific; AltName: Full=Lysine N-methyltransferase 3;
           AltName: Full=SET domain-containing protein 2
 gi|2408044|emb|CAB16247.1| histone lysine methyltransferase Set2 [Schizosaccharomyces pombe]
          Length = 798

 Score =  114 bits (286), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 64/184 (34%), Positives = 99/184 (53%), Gaps = 5/184 (2%)

Query: 52  YLTKRIKRRLEDDGIFCSCTAS--PGSSGVCDRDCHCGMLLSS---CSSGCKCGNSCLNK 106
           Y +  I +  E++ + C C      G +  C    +C   ++S         CG SC N+
Sbjct: 113 YASANIGKPPENEAMICDCRPHWVDGVNVACGHGSNCINRMTSIECTDEDNVCGPSCQNQ 172

Query: 107 PFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGET 166
            FQ     K+ +  TEK G G+ AD ++ +  FV EY+GEVI +Q   +R+ +    G  
Sbjct: 173 RFQRHEFAKVDVFLTEKKGFGLRADANLPKDTFVYEYIGEVIPEQKFRKRMRQYDSEGIK 232

Query: 167 NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENL 226
           +FY   + +   IDAT +G+ +R+ NHSC PN  + KW++  + R+GIF  RDI +GE L
Sbjct: 233 HFYFMMLQKGEYIDATKRGSLARFCNHSCRPNCYVDKWMVGDKLRMGIFCKRDIIRGEEL 292

Query: 227 TYDY 230
           T+DY
Sbjct: 293 TFDY 296


>gi|302406220|ref|XP_003000946.1| histone-lysine N-methyltransferase [Verticillium albo-atrum
           VaMs.102]
 gi|261360204|gb|EEY22632.1| histone-lysine N-methyltransferase [Verticillium albo-atrum
           VaMs.102]
          Length = 936

 Score =  114 bits (286), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 63/188 (33%), Positives = 100/188 (53%), Gaps = 6/188 (3%)

Query: 62  EDDGIFCSCTA--SPGSSGVCDRDCHCGMLLS--SCSSGC-KCGNSCLNKPFQNRPVKKM 116
           ++D + C C      G +  CD D  C    +   C  G   CG+ C N+ FQ +    +
Sbjct: 112 DNDALDCDCREEWQNGENHSCDEDSDCINRATKMECVMGTGNCGDGCQNQRFQRKAYADV 171

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            +++TEK G G+ A+ D++  +F+ EY+GEVI++ T   R+ +    G  +FY   + + 
Sbjct: 172 SVIKTEKKGFGLRANVDMQANDFIFEYIGEVINEPTFRRRMVQYDDEGIKHFYFMSLTKH 231

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
             +DAT KGN  R+ NHSC PN  + KW++  + R+GIF  R I  GE L ++Y  + + 
Sbjct: 232 EFVDATKKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFTLRGIAAGEELVFNYNVDRYG 291

Query: 236 HDSLIAYC 243
            D    YC
Sbjct: 292 ADPQPCYC 299


>gi|398407533|ref|XP_003855232.1| histone methyltransferase, partial [Zymoseptoria tritici IPO323]
 gi|339475116|gb|EGP90208.1| histone methyltransferase [Zymoseptoria tritici IPO323]
          Length = 799

 Score =  114 bits (286), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 58/172 (33%), Positives = 91/172 (52%), Gaps = 6/172 (3%)

Query: 65  GIFCSCT----ASPGSSGVC--DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKL 118
            + C C+    A   S+  C  D DC        C + C CG+ C N  FQ +    + +
Sbjct: 68  ALECDCSEEWDAPSRSNHACGEDSDCINRATKMECVADCNCGSKCQNMRFQRKKYANVDV 127

Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
           ++TEK G G+    D++  +F+ EY+GEVI +     R+ +    G  +FY   + +   
Sbjct: 128 IKTEKKGYGLRTQTDLRPNDFIFEYIGEVIGENVFRRRMQQYDEDGIKHFYFMSLTKGEF 187

Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
           +DAT +GN  R+ NHSC PN  + KW++  + R+GIFA R I+ GE L ++Y
Sbjct: 188 VDATKRGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFAERAIQAGEELVFNY 239


>gi|336264616|ref|XP_003347084.1| hypothetical protein SMAC_05383 [Sordaria macrospora k-hell]
 gi|380093778|emb|CCC08742.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 1224

 Score =  114 bits (286), Expect = 3e-23,   Method: Composition-based stats.
 Identities = 67/213 (31%), Positives = 111/213 (52%), Gaps = 23/213 (10%)

Query: 38  KPKAIPYVFIKRNIYLTK-----RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSS 92
           +PK   Y  + +N ++ +     +     ED    C CT   G    C +DC   ++L  
Sbjct: 679 QPKPAAYRTMTKNRFIGQAAAIWKKTPHFEDFASKCVCTPEDG----CAQDCQNRVMLYE 734

Query: 93  C-SSGCKCGNS-CLNKPFQ---------NRPVKKMKLVQTEKCGAGIVADEDIKRGEFVI 141
           C  + C  G   C N+ FQ          R    +++ +TE  G G+ ++   +  + ++
Sbjct: 735 CDETNCNVGKEFCQNRAFQMLTERTKQGGRYRVGVEVFKTEDRGYGVRSNRCFEPHQIIM 794

Query: 142 EYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
           EY GE+I D+ CE R+   ++     +YL   +++M+IDAT  G+ +R++NHSC PN  M
Sbjct: 795 EYTGEIITDEECERRM-NEEYKNNECYYLMSFDQNMIIDAT-TGSIARFVNHSCSPNCRM 852

Query: 202 QKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
            KWI+ G+ R+ +FA  R I+ GE LTYDY ++
Sbjct: 853 IKWIVSGQPRMALFAGDRPIQTGEELTYDYNFD 885


>gi|350855153|emb|CCD58126.1| huntingtin interacting protein-related [Schistosoma mansoni]
          Length = 887

 Score =  114 bits (286), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 60/141 (42%), Positives = 82/141 (58%), Gaps = 1/141 (0%)

Query: 93  CSSGCKCGNSCLNKPFQNRPVKKMKLVQTEK-CGAGIVADEDIKRGEFVIEYVGEVIDDQ 151
           C   C  G+ C N+ FQ R     +     K  G G++A +++K+G FVIEYVGEVID  
Sbjct: 259 CGLHCAAGDFCSNRQFQMRLYAPTRPFYAGKDKGWGLMATDNVKKGSFVIEYVGEVIDFS 318

Query: 152 TCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETR 211
               R+ + + LG  + Y   +  D  IDA  KGN +R++NHSC PN   QKW ++GE R
Sbjct: 319 EFRRRIRRYERLGHAHHYFMAVESDRFIDAGSKGNWARFVNHSCEPNCVTQKWSVNGEIR 378

Query: 212 IGIFATRDIKKGENLTYDYQY 232
           IG FA  DI  G+ +T DYQ+
Sbjct: 379 IGFFAKEDIPSGQEVTIDYQF 399


>gi|328856222|gb|EGG05344.1| hypothetical protein MELLADRAFT_78094 [Melampsora larici-populina
           98AG31]
          Length = 1098

 Score =  114 bits (285), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 69/211 (32%), Positives = 108/211 (51%), Gaps = 25/211 (11%)

Query: 44  YVFIKRNIYLTKRI-KRRLEDDGIFCSCTASPGSS----------GVCDRD---CHCGML 89
           +  +++N Y  K + K ++ DD + C C+ S G S           + +RD     CG L
Sbjct: 190 FDLLEQNWYQFKGLGKTKMLDDMMVCECSWSLGKSCAIALIDPSHMIQERDDPNLACG-L 248

Query: 90  LSSC----------SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
            S C             C+C   C N+ FQ R    +++V TE+ G G+    D+    F
Sbjct: 249 NSGCINYLTQVECLKKECRCLQMCQNQRFQKRQYAPIEIVATERKGFGVRLKSDVPADSF 308

Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
           V EY+GEV+ ++  + R+ +    G  +FY  ++ R+  IDAT KG   R++NHSC PN 
Sbjct: 309 VYEYIGEVVGEKAFQRRIKEYAQEGLKHFYFMQLQREEYIDATKKGGLGRFLNHSCNPNC 368

Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
            + KW++    R+GIF  R +K GE LT++Y
Sbjct: 369 YIGKWVVGRHLRMGIFTKRAVKGGEELTFNY 399


>gi|195435682|ref|XP_002065808.1| GK20252 [Drosophila willistoni]
 gi|194161893|gb|EDW76794.1| GK20252 [Drosophila willistoni]
          Length = 2294

 Score =  114 bits (285), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 67/215 (31%), Positives = 114/215 (53%), Gaps = 18/215 (8%)

Query: 27   PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLED-DGIFCSCTASPGSSGVCDRDCH 85
            P+  ++P W        Y  I+ N+Y  + ++  L   D   C+C  S      C  +C 
Sbjct: 1378 PLRHQVPSWN-------YRKIRTNVY-AESVRPNLAGFDHPVCNCKES------CMDNCL 1423

Query: 86   CGMLLSSCS-SGCKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADEDIKRGEFVIEY 143
              M+ + CS + C  G+ C N+      V   ++   T   G G+     I +G +++EY
Sbjct: 1424 NRMVYTECSATNCPSGDKCHNQKIHRHEVAPGVERFMTADKGWGVRTKLPIAKGTYILEY 1483

Query: 144  VGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQK 203
            VGEV+ ++  ++R+  + +L +T+ Y   ++  +VID    G+  R++NHSC PN EMQK
Sbjct: 1484 VGEVVTEREFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQK 1542

Query: 204  WIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
            W ++G +R+ +FA R I++GE LTYDY +   + S
Sbjct: 1543 WSVNGLSRMVLFAKRQIRQGEELTYDYNFSLFNPS 1577


>gi|323451560|gb|EGB07437.1| hypothetical protein AURANDRAFT_27977 [Aureococcus anophagefferens]
          Length = 224

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 60/158 (37%), Positives = 92/158 (58%), Gaps = 5/158 (3%)

Query: 80  CDRDC-HCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGE 138
           CD DC +  M +    + C  G+ C N+ F  R   K+ + +  +CG G+ A E I  G 
Sbjct: 37  CDGDCLNRNMEIECDPATCPMGDKCQNRCFAARLGSKVSVEKAGRCGRGLFAREPIPEGA 96

Query: 139 FVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEIN----RDMVIDATYKGNKSRYINHS 194
           FV+E +GE+I ++  +ERL   +  G+ ++Y+   +    + +VIDAT KGN+ R+ NHS
Sbjct: 97  FVVEALGELISEEEAQERLATARANGDEHYYMLAASDAATKGLVIDATRKGNEFRWANHS 156

Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           C P+  ++KW    + R GI A R IK GE LTYDY++
Sbjct: 157 CDPSCRLEKWRCGSQDRYGIVALRSIKPGEELTYDYRW 194


>gi|321460437|gb|EFX71479.1| Mes-4-like protein [Daphnia pulex]
          Length = 706

 Score =  114 bits (284), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 66/180 (36%), Positives = 94/180 (52%), Gaps = 14/180 (7%)

Query: 72  ASPGSSGVCD-------------RDCHCGMLLSSCSSGCKCGNS-CLNKPFQNRPVKKMK 117
           +SP SS +CD              +C   +LL  C      G   C N+ FQ R    + 
Sbjct: 349 SSPYSSAICDCEVKEGQPTCGPDSECINRILLFECDPNVCAGRKLCQNQRFQQRLSPPLY 408

Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
              T   G G+ +  DIK+G+FV+EYVGE+ID+     RL   +   +  +Y   ++ + 
Sbjct: 409 PFNTGGKGWGLKSHADIKKGDFVVEYVGEIIDNNEFRRRLKAKQDAQDEAYYFLTLDNNR 468

Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHD 237
           +IDA   GN +R++NHSC PN E QKW + G+TRIG+FA  DI     LT++YQ E   D
Sbjct: 469 MIDAGPSGNFARFMNHSCQPNCETQKWTVLGDTRIGLFAVVDIPAHAELTFNYQLECAQD 528


>gi|302307921|ref|NP_984733.2| AEL128Cp [Ashbya gossypii ATCC 10895]
 gi|442570024|sp|Q757Y8.2|SET2_ASHGO RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
           specific; AltName: Full=SET domain-containing protein 2
 gi|299789232|gb|AAS52557.2| AEL128Cp [Ashbya gossypii ATCC 10895]
 gi|374107951|gb|AEY96858.1| FAEL128Cp [Ashbya gossypii FDAG1]
          Length = 684

 Score =  114 bits (284), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 65/193 (33%), Positives = 102/193 (52%), Gaps = 6/193 (3%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTAS--PGSSGVCDR--DCHCGMLLSSCSSG--C 97
           ++ +++  Y  KR+     ++ + C C        +  CD   DC   + L  C +    
Sbjct: 34  FIELEQCTYTHKRLGDSRSNEFMECDCFEDYKDEQNHACDENSDCINRLTLIECVNDLCT 93

Query: 98  KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
            CG+ C N+ FQ +    + + QTEK G G+ A+ DI+  EF+ EY+GEVI +    +R+
Sbjct: 94  SCGDDCQNQRFQKKEYADIAVFQTEKKGYGVRAERDIEANEFIYEYIGEVISEADFRDRM 153

Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
                 G  +FY   +     IDAT +G  +R+ NHSC PN  + KW + G+ ++GIFA 
Sbjct: 154 VDYDMRGFKHFYFMMLQAGEFIDATERGCLARFCNHSCNPNAYVSKWDVAGKLKMGIFAH 213

Query: 218 RDIKKGENLTYDY 230
           R I KGE +T+DY
Sbjct: 214 RKILKGEEITFDY 226


>gi|195496208|ref|XP_002095596.1| GE22484 [Drosophila yakuba]
 gi|194181697|gb|EDW95308.1| GE22484 [Drosophila yakuba]
          Length = 2215

 Score =  113 bits (283), Expect = 5e-23,   Method: Composition-based stats.
 Identities = 71/225 (31%), Positives = 113/225 (50%), Gaps = 17/225 (7%)

Query: 28   VEFELPD---WFIKPKAIP---------YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPG 75
            ++FELP    W      +P         Y  I+ N+Y           D   C+C     
Sbjct: 1281 MDFELPYDIWWAYTNSKLPTRNVVPSWNYRKIRTNVYAESVRPNLAGFDHPTCNCKNQGE 1340

Query: 76   SSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADED 133
             S  C  +C   M+ + CS S C  G  C N+  Q   V   ++   T   G G+     
Sbjct: 1341 KS--CLDNCLNRMVYTECSPSNCPAGEKCRNQKIQRHAVAPGVERFMTADKGWGVRTKLP 1398

Query: 134  IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
            I +G +++EYVGEV+ ++  ++R+  + +L +T+ Y   ++  +VID    G+  R++NH
Sbjct: 1399 IAKGTYILEYVGEVVTEKEFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNH 1457

Query: 194  SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
            SC PN EMQKW ++G +R+ +FA R I++GE LTYDY +   + S
Sbjct: 1458 SCEPNCEMQKWSVNGLSRMVLFAKRAIEEGEELTYDYNFSLFNPS 1502


>gi|429858497|gb|ELA33313.1| set and ww domain protein [Colletotrichum gloeosporioides Nara gc5]
          Length = 833

 Score =  113 bits (283), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 60/172 (34%), Positives = 95/172 (55%), Gaps = 5/172 (2%)

Query: 64  DGIFCSCTA--SPGSSGVC--DRDCHCGMLLSSCSSGC-KCGNSCLNKPFQNRPVKKMKL 118
           D + C C +    G +  C  D DC        C  G   CG+ C N+ FQ +    + +
Sbjct: 113 DALDCDCRSEWQDGKNYSCGEDSDCINRATRMECVVGTGNCGDGCQNQRFQRKQYANVSV 172

Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
           ++T+K G G+ A+ D++  +F+ EY+GEVI++ T   R+ +    G  +FY   + +   
Sbjct: 173 IKTDKKGFGLRANVDMQANDFIFEYIGEVINEPTFRRRMSQYDDEGIKHFYFMSLTKHEF 232

Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
           +DAT KGN  R+ NHSC PN  + KW++  + R+GIF +R IK GE L ++Y
Sbjct: 233 VDATKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFTSRKIKAGEELVFNY 284


>gi|194874189|ref|XP_001973356.1| GG13392 [Drosophila erecta]
 gi|190655139|gb|EDV52382.1| GG13392 [Drosophila erecta]
          Length = 2215

 Score =  113 bits (283), Expect = 6e-23,   Method: Composition-based stats.
 Identities = 71/225 (31%), Positives = 113/225 (50%), Gaps = 17/225 (7%)

Query: 28   VEFELPD---WFIKPKAIP---------YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPG 75
            ++FELP    W      +P         Y  I+ N+Y           D   C+C     
Sbjct: 1281 MDFELPYDIWWAYTNSKLPTRNVVPSWNYRKIRTNVYAESVRPNLAGFDHPTCNCKNQGE 1340

Query: 76   SSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADED 133
             S  C  +C   M+ + CS S C  G  C N+  Q   V   ++   T   G G+     
Sbjct: 1341 KS--CLDNCLNRMVYTECSPSNCPAGEKCRNQKIQRHAVAPGVERFMTADKGWGVRTKLP 1398

Query: 134  IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
            I +G +++EYVGEV+ ++  ++R+  + +L +T+ Y   ++  +VID    G+  R++NH
Sbjct: 1399 IAKGTYILEYVGEVVTEKEFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNH 1457

Query: 194  SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
            SC PN EMQKW ++G +R+ +FA R I++GE LTYDY +   + S
Sbjct: 1458 SCEPNCEMQKWSVNGLSRMVLFAKRAIEEGEELTYDYNFSLFNPS 1502


>gi|426193608|gb|EKV43541.1| hypothetical protein AGABI2DRAFT_76781 [Agaricus bisporus var.
           bisporus H97]
          Length = 851

 Score =  113 bits (283), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 58/163 (35%), Positives = 94/163 (57%), Gaps = 2/163 (1%)

Query: 83  DCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVI 141
           DC   +    C +G C+C N C N+  Q +    +++V TEK G G+ A+ED+ +  F+ 
Sbjct: 133 DCINRLTQVECLAGDCRCRNYCQNQRLQRKEYAPIEIVLTEKKGYGLRAEEDLPKDAFIY 192

Query: 142 EYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
           EYVG+V++  + ++R+ +    G  +FY   + +D  IDAT  G   R+ NHSC PN  +
Sbjct: 193 EYVGDVVNPISFKKRMREYGQEGIRHFYFMMLQKDEFIDATKNGGIGRFANHSCNPNCYV 252

Query: 202 QKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
            KW +  + R+GIFA R +KK E LT++Y  + + H +   +C
Sbjct: 253 AKWTVGDKVRMGIFAKRTVKKHEELTFNYNVDRYGHQAQTCFC 295


>gi|403333784|gb|EJY66014.1| hypothetical protein OXYTRI_13823 [Oxytricha trifallax]
          Length = 1936

 Score =  113 bits (283), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 72/208 (34%), Positives = 111/208 (53%), Gaps = 19/208 (9%)

Query: 44  YVFIKRNIYLTKR---IKRRLEDDGIFCSCT---------ASPGSSGVCDRDCHCGMLLS 91
           ++ IK+N YL ++   +  +     + C+C           +P  S  C   C     +S
Sbjct: 75  FIPIKKNKYLERKRYVLNEKEASSLLVCNCQKATYKKNEKTNPLQSFNCGERC-INRCVS 133

Query: 92  S--CSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEV-- 147
           +  C+  C  G  C N+ FQ      +   +TEK G G+ A E I +G F+++YVGEV  
Sbjct: 134 TECCAQTCPSGAFCKNRRFQLHQNAYVFPAKTEKKGYGLFAGEFIPKGTFIMQYVGEVFS 193

Query: 148 IDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIID 207
           +D    ++R+  +++   T  YL     + VID TY GN +R+INHSC PN E QKW + 
Sbjct: 194 VDTDLGQQRV--LEYRKSTCTYLMRTTNNEVIDPTYVGNVARFINHSCEPNCETQKWNVL 251

Query: 208 GETRIGIFATRDIKKGENLTYDYQYEFL 235
           GE  +GIF+ RDI + E L++DYQ++F 
Sbjct: 252 GEVCVGIFSLRDIHENEELSFDYQFDFF 279


>gi|409074525|gb|EKM74921.1| hypothetical protein AGABI1DRAFT_123473 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 881

 Score =  113 bits (283), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 58/163 (35%), Positives = 94/163 (57%), Gaps = 2/163 (1%)

Query: 83  DCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVI 141
           DC   +    C +G C+C N C N+  Q +    +++V TEK G G+ A+ED+ +  F+ 
Sbjct: 155 DCINRLTQVECLAGDCRCRNYCQNQRLQRKEYAPIEIVLTEKKGYGLRAEEDLPKDAFIY 214

Query: 142 EYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
           EYVG+V++  + ++R+ +    G  +FY   + +D  IDAT  G   R+ NHSC PN  +
Sbjct: 215 EYVGDVVNPISFKKRMREYGQEGIRHFYFMMLQKDEFIDATKNGGIGRFANHSCNPNCYV 274

Query: 202 QKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
            KW +  + R+GIFA R +KK E LT++Y  + + H +   +C
Sbjct: 275 AKWTVGDKVRMGIFAKRTVKKHEELTFNYNVDRYGHQAQTCFC 317


>gi|395325049|gb|EJF57478.1| hypothetical protein DICSQDRAFT_111705 [Dichomitus squalens
           LYAD-421 SS1]
          Length = 866

 Score =  113 bits (283), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 72/212 (33%), Positives = 106/212 (50%), Gaps = 16/212 (7%)

Query: 44  YVFIKRNIYLTKRI-KRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC--------- 93
           +V I  N Y    + K R   + + C C       GV D D  CG L S C         
Sbjct: 109 FVEIPNNHYQYGTLGKSREALESMTCDCQYE---HGVDDPDMACG-LSSDCINLMTQVEC 164

Query: 94  -SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
               C+C   C N+ FQ +    +++V+TE  G G+ A  DI +  F+ EYVG+V+   +
Sbjct: 165 LPDDCRCRGYCRNQRFQRKQYAPIEIVKTEMKGFGLRAAADIPKDTFIYEYVGDVVSQPS 224

Query: 153 CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
             +R+ +    G  +FY   + +D  IDAT +G   R+ NHSC PN  + KW +    R+
Sbjct: 225 FIKRMRQYAEEGIRHFYFMMLQKDEFIDATKRGGIGRFANHSCNPNCYVAKWTVGSHVRM 284

Query: 213 GIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           GIFA R IKK E LT++Y  + + H++   YC
Sbjct: 285 GIFANRVIKKDEELTFNYNVDRYGHEAQPCYC 316


>gi|323451944|gb|EGB07819.1| hypothetical protein AURANDRAFT_6525 [Aureococcus anophagefferens]
          Length = 160

 Score =  113 bits (283), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 53/140 (37%), Positives = 87/140 (62%), Gaps = 4/140 (2%)

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G+ C N+ F  +   K+ + + ++CG G+ A E I +  FV+E +GE+I ++  +ER
Sbjct: 8   CPMGDKCQNRCFAAKLGSKVSVEKADQCGWGLFAGEPIAKRAFVVEALGELISEEEAQER 67

Query: 157 LWKMKHLGETNFYLCEIN----RDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
           L   +  G+ ++Y+   +    + +VIDAT KGN++R+ NHSC P+  ++KW    + R 
Sbjct: 68  LATARANGDEDYYMLAASDAATKGLVIDATRKGNEARFANHSCDPSCRLEKWRCGSQDRY 127

Query: 213 GIFATRDIKKGENLTYDYQY 232
           GIFA R +K GE LTYDY++
Sbjct: 128 GIFALRSVKPGEQLTYDYRW 147


>gi|194751483|ref|XP_001958056.1| GF10724 [Drosophila ananassae]
 gi|190625338|gb|EDV40862.1| GF10724 [Drosophila ananassae]
          Length = 2257

 Score =  113 bits (283), Expect = 7e-23,   Method: Composition-based stats.
 Identities = 70/225 (31%), Positives = 111/225 (49%), Gaps = 17/225 (7%)

Query: 28   VEFELPD---WFIKPKAIP---------YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPG 75
            ++FELP    W      +P         Y  I+ N+Y           D   C+C     
Sbjct: 1310 IDFELPYDIWWAYTNSKLPTRHVVPSWNYRKIRTNVYAESVRPNLAGFDHPTCNCKNH-- 1367

Query: 76   SSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADED 133
                C  +C   M+ + CS S C     C N+  Q   V   ++   TE  G G+     
Sbjct: 1368 GEIACQDNCLNRMVYTECSPSNCPAAEKCRNQKIQRHEVAPGVERFMTEDKGWGVRTKLP 1427

Query: 134  IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
            I +G +++EYVGEV+ ++  ++R+  + +L +T+ Y   ++  +VID    G+  R++NH
Sbjct: 1428 IAKGTYILEYVGEVVTEREFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNH 1486

Query: 194  SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
            SC PN EMQKW ++G +R+ +FA R I +GE LTYDY +   + S
Sbjct: 1487 SCEPNCEMQKWSVNGLSRMVLFAKRAIAQGEELTYDYNFSLFNPS 1531


>gi|195354202|ref|XP_002043588.1| GM17444 [Drosophila sechellia]
 gi|194127756|gb|EDW49799.1| GM17444 [Drosophila sechellia]
          Length = 2218

 Score =  113 bits (283), Expect = 7e-23,   Method: Composition-based stats.
 Identities = 71/225 (31%), Positives = 113/225 (50%), Gaps = 17/225 (7%)

Query: 28   VEFELPD---WFIKPKAIP---------YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPG 75
            ++FELP    W      +P         Y  I+ N+Y           D   C+C     
Sbjct: 1283 MDFELPYDIWWAYTNSKLPTRNVVPSWNYRKIRTNVYAESVRPNLAGFDHPTCNCKDQGE 1342

Query: 76   SSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADED 133
             S  C  +C   M+ + CS S C  G  C N+  Q   V   ++   T   G G+     
Sbjct: 1343 KS--CLDNCLNRMVYTECSPSNCPAGEKCRNQKIQRHAVAPGVERFMTADKGWGVRTKLP 1400

Query: 134  IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
            I +G +++EYVGEV+ ++  ++R+  + +L +T+ Y   ++  +VID    G+  R++NH
Sbjct: 1401 IAKGTYILEYVGEVVTEKEFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNH 1459

Query: 194  SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
            SC PN EMQKW ++G +R+ +FA R I++GE LTYDY +   + S
Sbjct: 1460 SCEPNCEMQKWSVNGLSRMVLFAKRAIEEGEELTYDYNFSLFNPS 1504


>gi|386771427|ref|NP_524160.2| absent, small, or homeotic discs 1, isoform B [Drosophila
            melanogaster]
 gi|386771429|ref|NP_001246834.1| absent, small, or homeotic discs 1, isoform C [Drosophila
            melanogaster]
 gi|118572535|sp|Q9VW15.3|ASH1_DROME RecName: Full=Histone-lysine N-methyltransferase ash1; AltName:
            Full=Absent small and homeotic disks protein 1; AltName:
            Full=Lysine N-methyltransferase 2H
 gi|383292012|gb|AAF49140.3| absent, small, or homeotic discs 1, isoform B [Drosophila
            melanogaster]
 gi|383292013|gb|AFH04505.1| absent, small, or homeotic discs 1, isoform C [Drosophila
            melanogaster]
          Length = 2226

 Score =  113 bits (283), Expect = 7e-23,   Method: Composition-based stats.
 Identities = 71/225 (31%), Positives = 113/225 (50%), Gaps = 17/225 (7%)

Query: 28   VEFELPD---WFIKPKAIP---------YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPG 75
            ++FELP    W      +P         Y  I+ N+Y           D   C+C     
Sbjct: 1292 MDFELPYDIWWAYTNSKLPTRNVVPSWNYRKIRTNVYAESVRPNLAGFDHPTCNCKNQGE 1351

Query: 76   SSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADED 133
             S  C  +C   M+ + CS S C  G  C N+  Q   V   ++   T   G G+     
Sbjct: 1352 KS--CLDNCLNRMVYTECSPSNCPAGEKCRNQKIQRHAVAPGVERFMTADKGWGVRTKLP 1409

Query: 134  IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
            I +G +++EYVGEV+ ++  ++R+  + +L +T+ Y   ++  +VID    G+  R++NH
Sbjct: 1410 IAKGTYILEYVGEVVTEKEFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNH 1468

Query: 194  SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
            SC PN EMQKW ++G +R+ +FA R I++GE LTYDY +   + S
Sbjct: 1469 SCEPNCEMQKWSVNGLSRMVLFAKRAIEEGEELTYDYNFSLFNPS 1513


>gi|449300569|gb|EMC96581.1| hypothetical protein BAUCODRAFT_481348 [Baudoinia compniacensis
           UAMH 10762]
          Length = 992

 Score =  113 bits (283), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 53/150 (35%), Positives = 85/150 (56%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C++ C CG +C N+ F  +    + +++TEK G G+  + ++K  +F+
Sbjct: 177 DSDCINRATKMECAADCGCGAACQNQRFLRKQFADVTVIKTEKKGYGLRTNTNLKANDFI 236

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GEVI +     R+ +    G  +FY   + +   +DAT KGN  R+ NHSC PN  
Sbjct: 237 FEYIGEVIGENVFRRRMQQYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCY 296

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDY 230
           + KW++  + R+GIFA R I+ GE L ++Y
Sbjct: 297 VDKWVVGDKLRMGIFAERKIQAGEELVFNY 326


>gi|356495442|ref|XP_003516586.1| PREDICTED: histone-lysine N-methyltransferase SUVR5-like [Glycine
            max]
          Length = 1492

 Score =  113 bits (283), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 68/166 (40%), Positives = 93/166 (56%), Gaps = 19/166 (11%)

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
            G L+  C+  C+C  SC N+  QN    K+++ +TEK G  + A E I RG FV EY+GE
Sbjct: 1310 GYLVYECNHMCRCNKSCPNRVLQNGVRVKLEVFKTEKKGWAVRAGEAILRGTFVCEYIGE 1369

Query: 147  VIDDQTCEERLWKMKHLGETNFYLCEIN---RDM----------VIDATYKGNKSRYINH 193
            V+D Q   +R  + ++  E   YL +I+    DM          VIDAT  GN SR+INH
Sbjct: 1370 VLDVQEARDR--RKRYGAEHCSYLYDIDARVNDMGRLIEEQAQYVIDATKFGNVSRFINH 1427

Query: 194  SCCPNTEMQKWIIDG----ETRIGIFATRDIKKGENLTYDYQYEFL 235
            SC PN    + +++        IG +A+RDI  GE LTYDYQYE +
Sbjct: 1428 SCSPNLVNHQVLVESMDCERAHIGFYASRDIALGEELTYDYQYELM 1473


>gi|1335892|gb|AAB01100.1| ASH1 [Drosophila melanogaster]
          Length = 2210

 Score =  113 bits (282), Expect = 8e-23,   Method: Composition-based stats.
 Identities = 71/225 (31%), Positives = 113/225 (50%), Gaps = 17/225 (7%)

Query: 28   VEFELPD---WFIKPKAIP---------YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPG 75
            ++FELP    W      +P         Y  I+ N+Y           D   C+C     
Sbjct: 1276 MDFELPYDIWWAYTNSKLPTRNVVPSWNYRKIRTNVYAESVRPNLAGFDHPTCNCKNQGE 1335

Query: 76   SSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADED 133
             S  C  +C   M+ + CS S C  G  C N+  Q   V   ++   T   G G+     
Sbjct: 1336 KS--CLDNCLNRMVYTECSPSNCPAGEKCRNQKIQRHAVAPGVERFMTADKGWGVRTKLP 1393

Query: 134  IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
            I +G +++EYVGEV+ ++  ++R+  + +L +T+ Y   ++  +VID    G+  R++NH
Sbjct: 1394 IAKGTYILEYVGEVVTEKEFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNH 1452

Query: 194  SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
            SC PN EMQKW ++G +R+ +FA R I++GE LTYDY +   + S
Sbjct: 1453 SCEPNCEMQKWSVNGLSRMVLFAKRAIEEGEELTYDYNFSLFNPS 1497


>gi|367016577|ref|XP_003682787.1| hypothetical protein TDEL_0G02090 [Torulaspora delbrueckii]
 gi|359750450|emb|CCE93576.1| hypothetical protein TDEL_0G02090 [Torulaspora delbrueckii]
          Length = 687

 Score =  113 bits (282), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 70/193 (36%), Positives = 108/193 (55%), Gaps = 6/193 (3%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSGC-- 97
           +V +   +Y +KR+     ++ + C C    S G +  CD D  C   + L  C +G   
Sbjct: 35  FVSLNECLYASKRLGSAKNNEFMECDCFEDFSEGVNRACDEDSDCINRLTLIECVNGLCG 94

Query: 98  KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
            CG+ C N+ FQ R    + + QTE  G G+ A+ DI+  +F+ EY+GEVI+++   ERL
Sbjct: 95  SCGDDCQNQRFQKRQYADIAVFQTEMKGYGVRAETDIEAHQFIYEYMGEVIEEEEFRERL 154

Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
            +       +FY   +     IDAT KG+ +R+ NHSC PN  + KW++ G+ ++GIFA 
Sbjct: 155 VEYDQKKLKHFYFMMLQNGEFIDATMKGSLARFCNHSCNPNAYVNKWVVAGKLKMGIFAN 214

Query: 218 RDIKKGENLTYDY 230
           R I KGE +T+DY
Sbjct: 215 RKILKGEEITFDY 227


>gi|325180900|emb|CCA15310.1| histonelysine Nmethyltransferase putative [Albugo laibachii Nc14]
          Length = 1258

 Score =  113 bits (282), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 72/222 (32%), Positives = 110/222 (49%), Gaps = 26/222 (11%)

Query: 30  FELP-DWFIK--PKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC 86
           F +P D+ ++   +  PY  + RN+Y  K+++   ED    C CT        C   C  
Sbjct: 542 FRIPADYLVRVHTRPPPYRKLTRNLYTFKQLRPPFEDVPT-CVCTER------CGDGCIN 594

Query: 87  GMLLSSC--------------SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADE 132
            +  + C              +  C  G  C N+    R   + +L  +   G  + A E
Sbjct: 595 RLSFTECFGPAPTEESTRNNQTFNCHVGEHCGNRALHQRVYPRTELFHSFDKGFALRAKE 654

Query: 133 DIKRGEFVIEYVGEVIDDQTCEERLWK--MKHLGETNFYLCEINRDMVIDATYKGNKSRY 190
            I+ G+ V+EYVGEVI++Q  + RL +   KH  + N Y+ E+   + IDA ++G+ SR+
Sbjct: 655 SIQAGQLVMEYVGEVINEQEKDRRLEEHARKHPQDRNMYIMELGNQIYIDARFRGSVSRF 714

Query: 191 INHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           INHSC PN  + KW +    RI I A RDIK  E L+YDY++
Sbjct: 715 INHSCDPNCHLVKWRVCDLDRIAISALRDIKPEEELSYDYRF 756


>gi|299741036|ref|XP_001834167.2| histone-lysine N-methyltransferase [Coprinopsis cinerea
           okayama7#130]
 gi|298404525|gb|EAU87663.2| histone-lysine N-methyltransferase [Coprinopsis cinerea
           okayama7#130]
          Length = 826

 Score =  113 bits (282), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 55/148 (37%), Positives = 87/148 (58%), Gaps = 1/148 (0%)

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C+C + C N+ FQ R    +++V TEK G G+ A++DI +  F+ EYVG+V+   + ++R
Sbjct: 175 CRCRSHCQNQRFQKRQYANIEIVLTEKKGYGLRAEDDIPKDSFIYEYVGDVVSPNSFKKR 234

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
           + +    G  +FY   + +D  IDAT  G   R+ NHSC PN  + KW +    R+GIF+
Sbjct: 235 MREYAEEGIQHFYFMMLQKDEFIDATKSGGIGRFANHSCNPNCYVAKWTVGDHVRMGIFS 294

Query: 217 TRDIKKGENLTYDYQYE-FLHDSLIAYC 243
            R I+K E LT++Y  + + H +   YC
Sbjct: 295 KRLIRKHEELTFNYNVDRYGHQAQTCYC 322


>gi|358396308|gb|EHK45689.1| hypothetical protein TRIATDRAFT_241288 [Trichoderma atroviride IMI
           206040]
          Length = 931

 Score =  112 bits (281), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 57/174 (32%), Positives = 96/174 (55%), Gaps = 5/174 (2%)

Query: 62  EDDGIFCSCTAS--PGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
           ++D + C C      G +  C  +  C      +   ++   CG  C N+ FQ +    +
Sbjct: 119 DNDSLDCDCREEWRDGENFACGEESDCINRATKMECSANAGNCGGGCQNRRFQRKQYADV 178

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            +++TEK G G+ A+  ++  +F+ EY+GEVI++ T   R+ +    G  +FY   +N++
Sbjct: 179 TVIKTEKKGFGLRANAMLEPNDFIYEYIGEVINEPTFRRRMLQYDDEGIKHFYFMSLNKN 238

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
             +DAT KGN  R+ NHSC PN  + KW++  + R+GIFA R I+ GE L ++Y
Sbjct: 239 EFVDATKKGNLGRFCNHSCNPNCFVDKWVVGDKLRMGIFALRKIQAGEELVFNY 292


>gi|407926722|gb|EKG19683.1| WW/Rsp5/WWP [Macrophomina phaseolina MS6]
          Length = 972

 Score =  112 bits (281), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 59/164 (35%), Positives = 90/164 (54%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C   C CG  C N+ F  +    + + +T+K G G+ A+ D+K  +FV
Sbjct: 175 DSDCINRATKMECVGDCGCGPDCQNQRFLRKQYADVTVFRTDKKGYGLRANTDLKPNDFV 234

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GEVI+++    R+ +    G  +FY   +++   +DAT KGN  R+ NHSC PN  
Sbjct: 235 YEYIGEVINERAFRGRMVQYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCF 294

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA R IK GE L ++Y  + +  D    YC
Sbjct: 295 VDKWVVGEKLRMGIFAERHIKAGEELVFNYNVDRYGADPQPCYC 338


>gi|378732919|gb|EHY59378.1| histone-lysine N-methyltransferase SETD2 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 1034

 Score =  112 bits (281), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 59/164 (35%), Positives = 91/164 (55%), Gaps = 1/164 (0%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D DC        C+  C CG +C N+ F  R    + +++TEK G G+  + D++  +F+
Sbjct: 193 DSDCINRACKMECAKDCGCGPACQNQRFLRRLWANVSVIKTEKKGYGLRTNVDLRPHDFI 252

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EY+GE I + T  +R+ +    G  +FY   +++   IDAT KGN  R+ NHSC PN  
Sbjct: 253 FEYIGETIPENTFRKRMRQYDEEGIKHFYFMSLSKGEFIDATKKGNLGRFCNHSCNPNCY 312

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           + KW++  + R+GIFA R IK GE L ++Y  + +  D    YC
Sbjct: 313 VDKWVVGDKLRMGIFAERYIKAGEELVFNYNVDRYGADPQPCYC 356


>gi|242813250|ref|XP_002486129.1| histone-lysine N-methyltransferase (Ash1), putative [Talaromyces
           stipitatus ATCC 10500]
 gi|218714468|gb|EED13891.1| histone-lysine N-methyltransferase (Ash1), putative [Talaromyces
           stipitatus ATCC 10500]
          Length = 865

 Score =  112 bits (281), Expect = 1e-22,   Method: Composition-based stats.
 Identities = 64/210 (30%), Positives = 110/210 (52%), Gaps = 20/210 (9%)

Query: 38  KPKAIPYVFIKRNIYLTKRIKRRLEDDGI---FCSCTASPGSSGVCDRDCHCGMLLSSC- 93
           +PK   +  + RN+++        E+  I    C C+   G    CD DC    +   C 
Sbjct: 420 QPKPDEWKKVNRNVFVGDAASFWRENKKIELSTCLCSEETG----CDEDCQNRFMFYECD 475

Query: 94  SSGCKCGNSCLNKPFQNRPVKK---------MKLVQTEKCGAGIVADEDIKRGEFVIEYV 144
           SS C+ G +C N+ F+    +          +++++T   G G+ ++   +  + ++EY 
Sbjct: 476 SSNCRVGPNCGNRSFEELKQRTKAGGKYNVGVEVIKTADRGYGVRSNRTFEPNQVIVEYT 535

Query: 145 GEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKW 204
           GE+I    CE R+  +    E  +YL   +++M+IDAT +G+ +R++NHSC PN  M+KW
Sbjct: 536 GEIITQSECERRMRTVYKNNEC-YYLMYFDQNMIIDAT-RGSIARFVNHSCAPNCRMEKW 593

Query: 205 IIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
            + G+ R+ +FA  R I  GE LTYDY ++
Sbjct: 594 TVGGKPRMALFAGDRGIMTGEELTYDYNFD 623


>gi|260830013|ref|XP_002609956.1| hypothetical protein BRAFLDRAFT_124382 [Branchiostoma floridae]
 gi|229295318|gb|EEN65966.1| hypothetical protein BRAFLDRAFT_124382 [Branchiostoma floridae]
          Length = 902

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 76/230 (33%), Positives = 114/230 (49%), Gaps = 18/230 (7%)

Query: 16  AFNKLLKQIGNPVEFELPD---WFIKP-KAIP-------YVFIKRNIYLTKRIKRRLEDD 64
           A  K L+Q    ++F++P    W  K  K IP       Y  I+ NI++   +K     +
Sbjct: 6   AAGKYLRQ--KRIDFQMPYDIWWQYKHNKLIPKIDERTKYKKIRSNIFVD--VKPLSGCE 61

Query: 65  GIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK-MKLVQTE 122
            + C+C     +   C  DC   M    CS   C   + C N+  Q       ++ + T+
Sbjct: 62  PVVCTCVRPAQADKACQEDCLNRMSFMECSPNTCPYADQCANQRIQRHEWDPGLERIVTK 121

Query: 123 KCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDAT 182
             G G+ +   I +G F++EYVGEV+ +Q    R  ++ H    + Y   ++   VID  
Sbjct: 122 DRGYGVRSKTPIPQGNFILEYVGEVVSEQEFRRRTVEIYH-DHNHHYCLNLHSGAVIDGY 180

Query: 183 YKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
             G + R++NHSC PN EMQKW ++G  RIG+FA RDI  GE LTYDY +
Sbjct: 181 KYGCEGRFVNHSCEPNCEMQKWSVNGVYRIGLFALRDIPAGEELTYDYNF 230


>gi|93003038|tpd|FAA00102.1| TPA: zinc finger protein [Ciona intestinalis]
          Length = 883

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 76/235 (32%), Positives = 120/235 (51%), Gaps = 21/235 (8%)

Query: 15  HAFNKLLKQIGNPVEFELP--DWF------IKPK---AIPYVFIKRNIYLTKRIKRRLED 63
           HA N LLK   +   F+LP   W+      I P    A  ++ I++N+Y+  +     E+
Sbjct: 7   HAGNFLLKSRKD---FQLPFNIWWMYNRKLISPSQDLATQFIKIEKNVYVDSQPTCEQEE 63

Query: 64  DGIFCSCTAS--PGSSGV--CDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPV-KKMK 117
               C   +     SS V  C ++C   ++   CS   C C + C N+  Q +   K ++
Sbjct: 64  HVCVCQTLSDIHSLSSDVHGCGKECLNRLMYIECSPDTCPCQDKCANRCIQKQQWWKDLE 123

Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
             +T   G G+  + DI  G+F++EYVGEV+ ++    R  +  +    + Y  ++    
Sbjct: 124 RFRTNDRGWGVRTNSDIPEGQFLLEYVGEVVSEREFRRRTIE-NYNAHNDHYCVQLEAGT 182

Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           VID     N+ R++NHSC PN EMQKW+++GE R+G+FA R I   E LTYDY +
Sbjct: 183 VIDGYRLANEGRFVNHSCQPNCEMQKWVVNGEYRVGLFAKRPIVSSEELTYDYNF 237


>gi|299116101|emb|CBN74517.1| Possible Cdc2-related protein kinase [Ectocarpus siliculosus]
          Length = 1898

 Score =  112 bits (279), Expect = 2e-22,   Method: Composition-based stats.
 Identities = 52/172 (30%), Positives = 91/172 (52%), Gaps = 6/172 (3%)

Query: 68  CSCT---ASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKC 124
           C+CT     P +    D  CH  + +  C    +C   C N+  Q R   ++++ + +  
Sbjct: 225 CNCTPVGGDPNAPTCYDDQCHNYVTMQECRKD-QCHRGCRNQRIQKRENAQVEVFKADGK 283

Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
           G G+   E + +G+F+ EYVGE+I  +   +R+  +   G    Y+ ++  D  +DA  K
Sbjct: 284 GMGLKVVEPVSKGQFIAEYVGEIITRKELNKRM--ISSAGTRKLYMMQLGDDTYLDAKRK 341

Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLH 236
           G  +R++NHSC P   +++W   G+ R  +F+ R +K GE L++DYQ+E  H
Sbjct: 342 GGIARFVNHSCEPTCRLEQWTAMGQPRCAVFSLRAMKAGEELSFDYQWEAHH 393


>gi|426249978|ref|XP_004018720.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Ovis aries]
          Length = 1669

 Score =  112 bits (279), Expect = 2e-22,   Method: Composition-based stats.
 Identities = 66/183 (36%), Positives = 98/183 (53%), Gaps = 11/183 (6%)

Query: 36   FIKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHC 86
            + K   +P  F  I+ N+YLT+R K +   D     C  +P S          C  DC  
Sbjct: 1447 YAKQGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLN 1506

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
             +L+  CSS C  G+ C N+ FQ +    ++++ TEK G G+ A +D+    FV+EY GE
Sbjct: 1507 RLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1566

Query: 147  VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
            V+D +  + R+ +       ++Y   +  D +IDAT KGN SR++NHSC PN E QK  +
Sbjct: 1567 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQK--V 1624

Query: 207  DGE 209
            DGE
Sbjct: 1625 DGE 1627


>gi|440633321|gb|ELR03240.1| hypothetical protein GMDG_01223 [Geomyces destructans 20631-21]
          Length = 685

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 69/210 (32%), Positives = 113/210 (53%), Gaps = 19/210 (9%)

Query: 46  FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS-C 103
           FI     L K+ K    D    C C+ S G    C  DC+  M+L  C  G C  G   C
Sbjct: 452 FIGDAAALWKKTKH-FSDSDSRCICSPSTG----CGEDCYNRMMLYECDDGNCPLGAELC 506

Query: 104 LNKPFQNRPVKKMK---------LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCE 154
            N+ F +   ++ K         +++TE CG G+ A+   + G+ ++EY GE+I +  C+
Sbjct: 507 GNRAFADLHERRAKGGKYRVGVEVIKTEDCGYGVRANRCFQEGQIIVEYTGEIITEPECQ 566

Query: 155 ERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGI 214
            R+ +  +     +YL   +++M+IDAT +G+ +R++NHSC PN EM KWI+ G+  + +
Sbjct: 567 RRM-REDYKNNECYYLMLFDQNMIIDAT-RGSIARFVNHSCEPNCEMVKWIVGGKPHMAL 624

Query: 215 FATRD-IKKGENLTYDYQYEFLHDSLIAYC 243
           FA ++ I  GE LTYDY+++ +    +  C
Sbjct: 625 FAGKNPIMTGEELTYDYKFDPISTRNVQEC 654


>gi|348667102|gb|EGZ06928.1| hypothetical protein PHYSODRAFT_565808 [Phytophthora sojae]
          Length = 1540

 Score =  111 bits (278), Expect = 2e-22,   Method: Composition-based stats.
 Identities = 61/154 (39%), Positives = 88/154 (57%), Gaps = 5/154 (3%)

Query: 93  CSSGCKCGNSCLNKPFQN-RPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQ 151
           CS  C  G  C N+  Q+     +++  +TE  G G+   + I +   V EYVGE+ID +
Sbjct: 114 CSPSCAAGQYCRNQRLQHPERYPQLEPFKTEHKGYGVRTRQHIGQLSIVGEYVGEIIDQK 173

Query: 152 TCEERLWKM-KHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGET 210
               RL  + +H  E NFY   +   + IDA  KG+ +R++NHSC PN + +KW + GET
Sbjct: 174 ELARRLKSVPRH--ELNFYYLLLAPGVYIDARNKGSFTRFVNHSCEPNCKTEKWTVKGET 231

Query: 211 RIGIFATRDIKKGENLTYDYQYEFLHDSLI-AYC 243
           RI + A RDI+ GE LT+DYQ++ L    I  YC
Sbjct: 232 RIAVIALRDIEVGEELTFDYQWKALGSRQIKCYC 265


>gi|212544714|ref|XP_002152511.1| histone-lysine N-methyltransferase (Ash1), putative [Talaromyces
           marneffei ATCC 18224]
 gi|210065480|gb|EEA19574.1| histone-lysine N-methyltransferase (Ash1), putative [Talaromyces
           marneffei ATCC 18224]
          Length = 865

 Score =  111 bits (278), Expect = 2e-22,   Method: Composition-based stats.
 Identities = 64/210 (30%), Positives = 108/210 (51%), Gaps = 20/210 (9%)

Query: 38  KPKAIPYVFIKRNIYLTKRIKRRLEDDGI---FCSCTASPGSSGVCDRDCHCGMLLSSC- 93
           +PK   +    RN+++        E+  I    C C    G    CD DC    +   C 
Sbjct: 423 QPKPDEWKKTNRNVFVGDAASFWRENKKIELSTCLCEEETG----CDEDCQNRFMFYECD 478

Query: 94  SSGCKCGNSCLNKPFQNRPVKK---------MKLVQTEKCGAGIVADEDIKRGEFVIEYV 144
           SS C+ G +C N+ F+    +          +++++T   G G+ ++   +  + ++EY 
Sbjct: 479 SSNCRVGPNCGNRSFEELKQRTKAGGKYNIGVEVIKTADRGYGVRSNRTFEPNQIIVEYT 538

Query: 145 GEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKW 204
           GE+I    CE R+  +    E  +YL   +++M+IDAT +G+ +R++NHSC PN  M+KW
Sbjct: 539 GEIITQNECERRMRSVYKNNEC-YYLMYFDQNMIIDAT-RGSIARFVNHSCAPNCRMEKW 596

Query: 205 IIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
            + G+ R+ +FA  R I  GE LTYDY ++
Sbjct: 597 TVGGKPRMALFAGDRGIMTGEELTYDYNFD 626


>gi|449544167|gb|EMD35141.1| hypothetical protein CERSUDRAFT_54162 [Ceriporiopsis subvermispora
           B]
          Length = 788

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 72/214 (33%), Positives = 108/214 (50%), Gaps = 20/214 (9%)

Query: 44  YVFIKRNIYLTKRIKRRLED-DGIFCSCTASPGSSGVCDRDCHCG---------MLLSSC 93
           +V IK N Y    + R  E  + + C C       GV D +  CG           +   
Sbjct: 45  FVEIKGNHYQYGTLGRSREALESMTCDCQYE---HGVDDEEMACGHSSDCINRLTQVECL 101

Query: 94  SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTC 153
              C+C + C N+ FQ +    +++V+TEK G G+ A  D+++  F+ EYVG+V+   + 
Sbjct: 102 PDDCRCRSYCQNQRFQRKEYADIEVVKTEKKGFGLRAGADLRKDTFIYEYVGDVVSQPSF 161

Query: 154 EERLWKMKHLGETN---FYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGET 210
              L +M+   E N   FY   + +D  IDAT +G   R+ NHSC PN  + KW I    
Sbjct: 162 ---LKRMRQYAEENIRHFYFMMLQKDEFIDATKRGGIGRFANHSCNPNCYVAKWTIGEHV 218

Query: 211 RIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
           R+GIFA R IKK E LT++Y  + + +D+   YC
Sbjct: 219 RMGIFANRYIKKDEELTFNYNVDRYGNDAQPCYC 252


>gi|356540712|ref|XP_003538829.1| PREDICTED: histone-lysine N-methyltransferase SUVR5-like [Glycine
            max]
          Length = 1545

 Score =  111 bits (278), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 66/164 (40%), Positives = 87/164 (53%), Gaps = 15/164 (9%)

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
            G L+  C+  C+C  SC N+  QN    K+++ +TEK G  + A E I RG FV EY+GE
Sbjct: 1363 GYLVYECNHMCRCNKSCPNRVLQNGVRVKLEVFKTEKKGWAVRAGEAILRGTFVCEYIGE 1422

Query: 147  VIDDQTCEERLWKMKHLGETNFY-----------LCEINRDMVIDATYKGNKSRYINHSC 195
            V+D Q    R  +      + FY           L E     VID+T  GN SR+INHSC
Sbjct: 1423 VLDVQEARNRRKRYGTEHCSYFYDIDARVNDIGRLIEGQAQYVIDSTKFGNVSRFINHSC 1482

Query: 196  CPNTEMQKWIIDG----ETRIGIFATRDIKKGENLTYDYQYEFL 235
             PN    + I++        IG +A+RDI  GE LTYDYQYE +
Sbjct: 1483 SPNLVNHQVIVESMDCERAHIGFYASRDITLGEELTYDYQYELM 1526


>gi|299469716|emb|CBN76570.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 2317

 Score =  111 bits (277), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 65/193 (33%), Positives = 98/193 (50%), Gaps = 8/193 (4%)

Query: 47  IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSS-----GCK--C 99
           I  N+YL+K+   +   +G  C C      S  C   C        C+      G K  C
Sbjct: 391 ITSNVYLSKKPPAKDVGEGHRCGCVPPSDGSAGCTEACSNRATFEECTKRTCGLGGKGVC 450

Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
           GN  + +  Q    +K+ +      G G+VA   I  G  + EYVGEV+ +   + R   
Sbjct: 451 GNRKIQRYKQEYMRRKVCIKDVGPKGIGLVAKTRIPAGALIGEYVGEVLSEDDWQAR-QA 509

Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
           + H  E + ++ ++    VIDA+ KG+  R++NHSC PN E QKW+I G+ RIG+FAT  
Sbjct: 510 LLHANEKHKFVMDLGESEVIDASQKGSILRFVNHSCGPNAETQKWMIQGKRRIGLFATEV 569

Query: 220 IKKGENLTYDYQY 232
           I+KG  +T++Y Y
Sbjct: 570 IEKGVEVTFNYCY 582


>gi|406606267|emb|CCH42258.1| Histone-lysine N-methyltransferase [Wickerhamomyces ciferrii]
          Length = 1074

 Score =  111 bits (277), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 83/246 (33%), Positives = 122/246 (49%), Gaps = 30/246 (12%)

Query: 19  KLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNI----YLTKRIKRRLEDDGI-------F 67
           K LK +    +F+LP W I     P +  +RNI    +  KR +  LE+  I        
Sbjct: 138 KGLKVLNTVQDFKLP-WNI---YCPTIDGERNIVKWKHGLKRNQWLLENPFIAKKQVNEV 193

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNS-CLNKPFQN-----RPVKKM----K 117
           CSCT     S   +R       +    + CK G+  C N+ F +     R   K     +
Sbjct: 194 CSCTGVRCGSTCLNRSVQ----IECTPNNCKFGDQDCGNRAFADLMNAYRDHSKYAFGCE 249

Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
           ++QT+K G G+++      G  V+EY GEVI     E RL  +    ++ +Y   +  + 
Sbjct: 250 VLQTDKKGCGLLSIRSFNAGSLVVEYTGEVIHLDEVEHRLNTIYKESDS-YYFLGLEEEY 308

Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHD 237
           VIDA  KG+ +R+ NHSC PN EMQKW ++GE RIG+FA R I+ GE +TYDY +E+  +
Sbjct: 309 VIDAGQKGSVARFANHSCDPNAEMQKWYVNGEPRIGLFAKRSIEAGEEITYDYNFEWFEN 368

Query: 238 SLIAYC 243
                C
Sbjct: 369 GEPQKC 374


>gi|328709442|ref|XP_001950412.2| PREDICTED: hypothetical protein LOC100165448 isoform 1
           [Acyrthosiphon pisum]
 gi|328709444|ref|XP_003243962.1| PREDICTED: hypothetical protein LOC100165448 isoform 2
           [Acyrthosiphon pisum]
          Length = 1506

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 60/171 (35%), Positives = 94/171 (54%), Gaps = 4/171 (2%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCG 125
           C+C   P     C  DC   M+ + CS   C C   C N+  Q +     +    TE+ G
Sbjct: 734 CNCVV-PTEGKGCTDDCINRMIFAECSPELCPCKEKCSNQRLQTQQWAPGLVKFMTEEKG 792

Query: 126 AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKG 185
            GI   E++K GE ++EYVGEV+ +QT ++R+  + +  + + Y  +++   VID    G
Sbjct: 793 WGIKTTEEVKSGELLLEYVGEVVSEQTFKDRMTSI-YKNDVHHYCLKLDGGSVIDGHRMG 851

Query: 186 NKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLH 236
            ++R++NHSC PN EMQKW ++G  R+ +FA R I+  E L YDY +   +
Sbjct: 852 GEARFVNHSCEPNCEMQKWSVNGLFRMALFALRKIQHDEELCYDYNFSLFN 902


>gi|195173304|ref|XP_002027432.1| GL20883 [Drosophila persimilis]
 gi|194113284|gb|EDW35327.1| GL20883 [Drosophila persimilis]
          Length = 2266

 Score =  110 bits (275), Expect = 5e-22,   Method: Composition-based stats.
 Identities = 66/214 (30%), Positives = 106/214 (49%), Gaps = 12/214 (5%)

Query: 27   PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC 86
            P    +P W        Y  I+ N+Y           D   C+C         C  +C  
Sbjct: 1350 PTRHIVPSW-------NYRKIRTNVYAESVRPNLAGFDHPTCNCKNQ--GEKACLDNCLN 1400

Query: 87   GMLLSSCS-SGCKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADEDIKRGEFVIEYV 144
             M+ + CS S C     C N+  Q   V   ++   T   G G+     I +G +++EYV
Sbjct: 1401 RMVYTECSPSNCPAAEKCRNQKIQRHEVAPGVERFMTLDKGWGVRTKLPIAKGTYILEYV 1460

Query: 145  GEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKW 204
            GEV+ ++  ++R+  + +L +T+ Y   ++  +VID    G+  R++NHSC PN EMQKW
Sbjct: 1461 GEVVTEREFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKW 1519

Query: 205  IIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             ++G +R+ +FA R I++GE LTYDY +   + S
Sbjct: 1520 SVNGLSRMVLFAKRPIEQGEELTYDYNFSLFNPS 1553


>gi|198463835|ref|XP_001352956.2| GA21391 [Drosophila pseudoobscura pseudoobscura]
 gi|198151432|gb|EAL30457.2| GA21391 [Drosophila pseudoobscura pseudoobscura]
          Length = 2266

 Score =  110 bits (275), Expect = 5e-22,   Method: Composition-based stats.
 Identities = 66/214 (30%), Positives = 106/214 (49%), Gaps = 12/214 (5%)

Query: 27   PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC 86
            P    +P W        Y  I+ N+Y           D   C+C         C  +C  
Sbjct: 1350 PTRHIVPSW-------NYRKIRTNVYAESVRPNLAGFDHPTCNCKNQ--GEKACLDNCLN 1400

Query: 87   GMLLSSCS-SGCKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADEDIKRGEFVIEYV 144
             M+ + CS S C     C N+  Q   V   ++   T   G G+     I +G +++EYV
Sbjct: 1401 RMVYTECSPSNCPAAEKCRNQKIQRHEVAPGVERFMTLDKGWGVRTKLPIAKGTYILEYV 1460

Query: 145  GEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKW 204
            GEV+ ++  ++R+  + +L +T+ Y   ++  +VID    G+  R++NHSC PN EMQKW
Sbjct: 1461 GEVVTEREFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKW 1519

Query: 205  IIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             ++G +R+ +FA R I++GE LTYDY +   + S
Sbjct: 1520 SVNGLSRMVLFAKRPIEQGEELTYDYNFSLFNPS 1553


>gi|313221636|emb|CBY36121.1| unnamed protein product [Oikopleura dioica]
          Length = 207

 Score =  110 bits (275), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 54/132 (40%), Positives = 80/132 (60%), Gaps = 6/132 (4%)

Query: 112 PVKKMKLVQTEKCGAGIVADEDIK------RGEFVIEYVGEVIDDQTCEERLWKMKHLGE 165
           P+ K  L  T      I A+  +K      +G F+IEY+GE+I       RL +   +G 
Sbjct: 11  PIHKAPLSTTPSSSRQIGAETALKAKKLIPKGSFIIEYIGEIISHDESRIRLEESAKIGV 70

Query: 166 TNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGEN 225
           TN+Y+ E++   +IDA  +GN +R+INHSC PN  +  WI+ G+TRIGIF+ RDI++GE 
Sbjct: 71  TNYYILELDNLRMIDAGPRGNIARFINHSCDPNCGIDPWIVQGDTRIGIFSKRDIQEGEE 130

Query: 226 LTYDYQYEFLHD 237
           LT++YQ +   D
Sbjct: 131 LTFNYQLQQSSD 142


>gi|320592445|gb|EFX04875.1| histone-lysine n-methyltransferase [Grosmannia clavigera kw1407]
          Length = 1246

 Score =  110 bits (275), Expect = 6e-22,   Method: Composition-based stats.
 Identities = 71/213 (33%), Positives = 108/213 (50%), Gaps = 23/213 (10%)

Query: 38  KPKAIPYVFIKRNIYL--TKRIKRRLEDDGIFCS-CTASPGSSGVCDRDCHCGMLLSSCS 94
           +PK + Y    +N ++     I R+  D   F S C   P +   CD DC   ++L  C 
Sbjct: 528 QPKPVKYGRFSKNRFVGDAHAIWRKNPDFDDFSSKCVCKPETG--CDEDCQNRIMLYECD 585

Query: 95  S-GCKCG-NSCLNKPFQ---NRPVKK------MKLVQTEKCGAGIVADEDIKRGEFVIEY 143
              C  G   C N+ FQ    R   K      +++ +T   G GI A    K  + ++EY
Sbjct: 586 DMNCNVGPERCHNREFQRLAERTASKNPYHVGVEVFKTPDRGHGIRASRSFKPSQIIMEY 645

Query: 144 VGEVIDDQTCEERLWKMKHLGETN--FYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
           +GE+I ++  + R   M  L + N  +YL   ++ ++ID T  G+ +R++NHSC PN  M
Sbjct: 646 IGEIITEEESDRR---MNELYKNNACYYLMSFDQSLIIDGT-SGSIARFVNHSCSPNCRM 701

Query: 202 QKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
            KWI+ G+ RI +FA  R I  GE LTYDY ++
Sbjct: 702 IKWIVSGQPRIALFAGDRPIMTGEELTYDYNFD 734


>gi|405966542|gb|EKC31816.1| Putative histone-lysine N-methyltransferase ASH1L [Crassostrea gigas]
          Length = 2162

 Score =  110 bits (275), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 62/174 (35%), Positives = 94/174 (54%), Gaps = 6/174 (3%)

Query: 63   DDGIFCSCTAS--PGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRP-VKKMKL 118
            D+   C+C     P   G C  +C   M+ + C  S C C   CLN+ F     V  +++
Sbjct: 1346 DEAHPCTCKRPYDPEVKG-CGEECLNRMMYTECDISTCPCQEQCLNQRFHKHEWVSGLEV 1404

Query: 119  VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
            + T+  G GI   + I  G+F++EY+GEV+ +     R+ + ++  E + Y   ++   V
Sbjct: 1405 IVTKDRGYGIRTSDSISNGQFILEYLGEVVSEAEFRRRMTE-EYSQERHHYCLNLDSGAV 1463

Query: 179  IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            ID    GN  RY+NHSC PN EMQKW ++G  R+G+FA +DI     LTYDY +
Sbjct: 1464 IDGYRMGNIGRYVNHSCEPNCEMQKWNVNGVYRMGLFALKDISPNMELTYDYNF 1517


>gi|325187666|emb|CCA22203.1| histonelysine Nmethyltransferase putative [Albugo laibachii Nc14]
          Length = 1414

 Score =  110 bits (274), Expect = 6e-22,   Method: Composition-based stats.
 Identities = 60/152 (39%), Positives = 90/152 (59%), Gaps = 6/152 (3%)

Query: 93  CSSGCKCGNSCLNKPFQNRPVK--KMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDD 150
           CS+ C+    C NK  Q +P K   ++  +T K G GI   E+I++G  V EYVGE+ID 
Sbjct: 326 CSAACEAREFCQNKRLQ-QPEKFPSLEAFKTLKKGFGIRTKENIRQGTIVGEYVGEIIDQ 384

Query: 151 QTCEERLWKM-KHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGE 209
                RL  + +H  E NFY   +   + IDA  +G+ +R++NHSC PN + +KW ++G+
Sbjct: 385 NELNRRLCAIGRH--ELNFYYLLLRPGVYIDARNRGSLTRFVNHSCDPNCKTEKWTVEGD 442

Query: 210 TRIGIFATRDIKKGENLTYDYQYEFLHDSLIA 241
           TRI + A R I+ G  LT+DYQ++ L    +A
Sbjct: 443 TRIAVVALRHIEFGTELTFDYQWKSLGSRQLA 474


>gi|113470951|gb|ABI34877.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Danio rerio]
          Length = 129

 Score =  110 bits (274), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 49/104 (47%), Positives = 71/104 (68%)

Query: 132 EDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYI 191
           +D+K+G+FV+EYVGE+ID + C++R+        TNFY+  + +D VIDA  KGN SR++
Sbjct: 5   QDLKKGDFVMEYVGELIDSEECKQRIRTANENHVTNFYMLTLTKDRVIDAGPKGNLSRFM 64

Query: 192 NHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
           NHSC PN E QKW ++G+ RIG+F   DI     LT++Y  + L
Sbjct: 65  NHSCSPNCETQKWTVNGDVRIGLFTLCDISADTELTFNYNLDCL 108


>gi|380006423|gb|AFD29602.1| NSD-1 [Schmidtea mediterranea]
          Length = 914

 Score =  110 bits (274), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 66/184 (35%), Positives = 98/184 (53%), Gaps = 12/184 (6%)

Query: 57  IKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKC-------GNSCLNKPFQ 109
           ++R++  +   C+C +    S     DC  G    + +S C+C       G+ C N+ F 
Sbjct: 595 LRRKVVPETPHCNCISFDPKS-----DCSIGSGCFNVASKCECDPQICNLGDKCKNQNFV 649

Query: 110 NRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFY 169
            R   K         G G+ ADE IK  +FV EY+GE+I  +  E+R+        T+FY
Sbjct: 650 KRIYPKQYTFWANDRGWGLKADEFIKTKQFVNEYIGEIITMEESEKRILWANENNITDFY 709

Query: 170 LCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYD 229
             E++   +IDA    N SR+IN+SC PN   +KWI++ E RIG+FA RDI+KGE LT+ 
Sbjct: 710 FMELDNGRLIDARQFSNLSRFINNSCDPNLVAEKWIVNREHRIGLFALRDIQKGEELTFQ 769

Query: 230 YQYE 233
           Y  +
Sbjct: 770 YNLQ 773


>gi|428176462|gb|EKX45346.1| hypothetical protein GUITHDRAFT_61291, partial [Guillardia theta
           CCMP2712]
          Length = 215

 Score =  110 bits (274), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 64/193 (33%), Positives = 101/193 (52%), Gaps = 7/193 (3%)

Query: 47  IKRNIYL-TKRIKRRLEDDGIFCSCTASPG-SSGVCDRDCHCGM--LLSSCSSGCKCGNS 102
           I +N+Y+  +R   +  +    C C   PG     C    +C +  L   C   C C + 
Sbjct: 1   ISKNVYVGEERAALKSMESRSACQCIYVPGVPETACGEHSNCMLRDLYIECDYRCPCSSH 60

Query: 103 CLNKPFQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
           CLNK  Q R   K  +   +   G  +  DED+++G+ V+EY+GEVI  +    R+   +
Sbjct: 61  CLNKRLQKRQWAKCSIFLAKNGRGWALRNDEDLRQGQLVMEYIGEVISGEEVSRRM--EE 118

Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
           + G+ + Y+ ++N++  ID+T K N +R+INH C PN EMQKW +  +  + +FA   I 
Sbjct: 119 YAGKRHTYMLKLNQEEFIDSTRKANLARFINHCCEPNCEMQKWYVGNKQCVALFAKYFIP 178

Query: 222 KGENLTYDYQYEF 234
            G  LT+DY  EF
Sbjct: 179 SGSELTFDYDMEF 191


>gi|159113654|ref|XP_001707053.1| Histone methyltransferase HMT1 [Giardia lamblia ATCC 50803]
 gi|52857644|gb|AAU89075.1| histone methyltransferase HMT1 [Giardia intestinalis]
 gi|157435155|gb|EDO79379.1| Histone methyltransferase HMT1 [Giardia lamblia ATCC 50803]
          Length = 298

 Score =  110 bits (274), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 72/228 (31%), Positives = 107/228 (46%), Gaps = 43/228 (18%)

Query: 41  AIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCG 100
           ++ Y  +KRNIY+    KR       FC+CT   G+   C   C    +   C   C  G
Sbjct: 52  SLHYTHVKRNIYVG--CKRPSAARKTFCTCTCKEGTG--CGTSCELRKVHLECYKECCAG 107

Query: 101 NSCLNKPFQNRPV-----------------------------KKMKLVQTE-----KCGA 126
           + C +K F  RP+                             ++M+  +T      + G 
Sbjct: 108 SPC-SKQFIVRPLFGNSIDLPKDATTLHKDNSRLADCGNQRLQRMQYARTAVYPAGRKGY 166

Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
           G+ A   I+RG  V EY+GEVI  + C  R    K   + + Y   ++R++ IDA +KGN
Sbjct: 167 GLFALTSIQRGALVTEYIGEVITREECMRR----KKSAKGHLYFLALDRELYIDAAHKGN 222

Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEF 234
           +SR+INHSC PN E+Q W +  E R  I A R I   E L++DY+++F
Sbjct: 223 ESRFINHSCDPNCEVQLWYVGEEPRAAIVALRSIAPHEELSFDYKFDF 270


>gi|330792328|ref|XP_003284241.1| hypothetical protein DICPUDRAFT_27300 [Dictyostelium purpureum]
 gi|325085814|gb|EGC39214.1| hypothetical protein DICPUDRAFT_27300 [Dictyostelium purpureum]
          Length = 151

 Score =  110 bits (274), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 52/109 (47%), Positives = 74/109 (67%), Gaps = 2/109 (1%)

Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
           G G+++ E+I +G+FV+EY GEVI   TC  R+   ++  E  FY   +N    +DA+ +
Sbjct: 2   GWGLISCENINKGDFVMEYCGEVISKTTCLNRM--QENENEKFFYFLTLNSKECLDASRR 59

Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           GN +R+INHSC PN E QKWI+ GE +IGIF+ + I+KG  LT+DY YE
Sbjct: 60  GNLARFINHSCDPNCETQKWIVGGEVKIGIFSIKPIEKGTELTFDYNYE 108


>gi|320036571|gb|EFW18510.1| histone-lysine N-methyltransferase [Coccidioides posadasii str.
           Silveira]
          Length = 742

 Score =  110 bits (274), Expect = 7e-22,   Method: Composition-based stats.
 Identities = 62/178 (34%), Positives = 99/178 (55%), Gaps = 18/178 (10%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQ--NRPVKK-------M 116
           C+CT   G    CD +C    +   C  + CK G+  C N+PF    R  K        +
Sbjct: 346 CTCTPETG----CDENCQNRYMFYECDDTNCKLGSELCRNRPFSALRRRAKAGGKFNIGV 401

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
           ++++TE  G G+ ++      + ++EY GE++  + CE R+  +    E  +YL   +++
Sbjct: 402 EVIKTEDRGYGVRSNRSFDPNQIIVEYTGEILTQEECERRMRTVYKKNEC-YYLMYFDQN 460

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYE 233
           MVIDAT +G+ +R+INHSC PN  M+KW + G+ R+ +FA  D I  GE LTYDY ++
Sbjct: 461 MVIDAT-RGSIARFINHSCEPNCRMEKWTVAGKPRMALFAGEDGIMTGEELTYDYNFD 517


>gi|303313071|ref|XP_003066547.1| SET domain containing protein [Coccidioides posadasii C735 delta
           SOWgp]
 gi|240106209|gb|EER24402.1| SET domain containing protein [Coccidioides posadasii C735 delta
           SOWgp]
          Length = 742

 Score =  110 bits (274), Expect = 7e-22,   Method: Composition-based stats.
 Identities = 62/178 (34%), Positives = 99/178 (55%), Gaps = 18/178 (10%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQ--NRPVKK-------M 116
           C+CT   G    CD +C    +   C  + CK G+  C N+PF    R  K        +
Sbjct: 346 CTCTPETG----CDENCQNRYMFYECDDTNCKLGSELCRNRPFSALRRRAKAGGKFNIGV 401

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
           ++++TE  G G+ ++      + ++EY GE++  + CE R+  +    E  +YL   +++
Sbjct: 402 EVIKTEDRGYGVRSNRSFDPNQIIVEYTGEILTQEECERRMRTVYKKNEC-YYLMYFDQN 460

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYE 233
           MVIDAT +G+ +R+INHSC PN  M+KW + G+ R+ +FA  D I  GE LTYDY ++
Sbjct: 461 MVIDAT-RGSIARFINHSCEPNCRMEKWTVAGKPRMALFAGEDGIMTGEELTYDYNFD 517


>gi|119192128|ref|XP_001246670.1| hypothetical protein CIMG_00441 [Coccidioides immitis RS]
 gi|392864092|gb|EAS35106.2| histone-lysine N-methyltransferase [Coccidioides immitis RS]
          Length = 742

 Score =  110 bits (274), Expect = 7e-22,   Method: Composition-based stats.
 Identities = 62/178 (34%), Positives = 99/178 (55%), Gaps = 18/178 (10%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQ--NRPVKK-------M 116
           C+CT   G    CD +C    +   C  + CK G+  C N+PF    R  K        +
Sbjct: 346 CTCTPETG----CDENCQNRYMFYECDDTNCKLGSELCRNRPFSALRRRAKAGGKFNIGV 401

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
           ++++TE  G G+ ++      + ++EY GE++  + CE R+  +    E  +YL   +++
Sbjct: 402 EVIKTEDRGYGVRSNRSFDPNQIIVEYTGEILTQEECERRMRTVYKKNEC-YYLMYFDQN 460

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYE 233
           MVIDAT +G+ +R+INHSC PN  M+KW + G+ R+ +FA  D I  GE LTYDY ++
Sbjct: 461 MVIDAT-RGSIARFINHSCEPNCRMEKWTVAGKPRMALFAGEDGIMTGEELTYDYNFD 517


>gi|449017981|dbj|BAM81383.1| similar to histone methyltransferase Set2p [Cyanidioschyzon merolae
           strain 10D]
          Length = 1354

 Score =  110 bits (274), Expect = 7e-22,   Method: Composition-based stats.
 Identities = 68/199 (34%), Positives = 103/199 (51%), Gaps = 14/199 (7%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPG---SSGVCDRDCH---CGMLLSSC 93
           K  P++ +  N YL          D   C     PG   +S VC  +C+   C +  S  
Sbjct: 787 KPSPFIHLTANEYLVPPSVAPAHADVCQCIGACVPGVCLNSTVC-VECNPATCPVARSRS 845

Query: 94  SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVGEVIDDQT 152
           S+  +CGN      FQ +    ++L  +    G GI +   +K+G+F++EY+GEVI    
Sbjct: 846 STDPQCGN----MRFQRQAYAPVELFFSPNGRGCGIRSRAPLKKGDFIVEYMGEVIGPTE 901

Query: 153 CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
              R  K  H  E + Y   +++   +DA+ KG   R++NHSC PN   QKW++ G+ R+
Sbjct: 902 LARR--KRDHALERHVYFMTLDQSTFLDASRKGTWGRFLNHSCEPNCHTQKWLVLGKVRV 959

Query: 213 GIFATRDIKKGENLTYDYQ 231
           GIFA+RDI  GE LT+DY+
Sbjct: 960 GIFASRDIAAGEELTFDYR 978


>gi|388852139|emb|CCF54145.1| uncharacterized protein [Ustilago hordei]
          Length = 1338

 Score =  109 bits (273), Expect = 8e-22,   Method: Composition-based stats.
 Identities = 74/205 (36%), Positives = 106/205 (51%), Gaps = 18/205 (8%)

Query: 38  KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SG 96
           K K   Y  I RN Y ++    +L+ +   CSC   PGS   C  DC   ML+  C    
Sbjct: 648 KRKPPRYQQISRNKYYSRP---KLQGEVPLCSC--QPGSG--CGSDCINRMLMFICDPKT 700

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G++C N     RP  K  +    + G G+   E IK+ +F+ EY GEVID     +R
Sbjct: 701 CPSGSNCTNISLGRRPTVKTAVHYYGRRGFGLKTLEPIKKDDFIDEYRGEVIDLHEASKR 760

Query: 157 LWKMKHLGETNFYLCEINRDM--VIDATYKGNKSRYINHSCCPNTEMQKWIIDG------ 208
           +   ++    NFYL + +     ++D   KGN +R+ NHSC PN  ++K+II G      
Sbjct: 761 VTD-EYKATGNFYLLDYDTAAGELLDGGRKGNITRFANHSCEPNCRIEKFIICGTDEALS 819

Query: 209 -ETRIGIFATRDIKKGENLTYDYQY 232
            E +IG+FA RDI+ GE LTY+Y +
Sbjct: 820 AEFQIGLFALRDIEAGEELTYNYGW 844


>gi|308485102|ref|XP_003104750.1| CRE-MET-1 protein [Caenorhabditis remanei]
 gi|308257448|gb|EFP01401.1| CRE-MET-1 protein [Caenorhabditis remanei]
          Length = 1582

 Score =  109 bits (273), Expect = 9e-22,   Method: Composition-based stats.
 Identities = 69/197 (35%), Positives = 103/197 (52%), Gaps = 16/197 (8%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSC 103
           Y+ I  N Y  +   ++ + D + C C  S       D +C    +L+ C S C    +C
Sbjct: 632 YITIAENKYFNRNANKK-KTDSLICEC--SRLGLTCSDNNCVNRAMLTECPSSCPA--NC 686

Query: 104 LNKPFQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
            N+ F  +    ++   T    G G+ A +DIK+G F+IEYVGEV++    E+R  K K+
Sbjct: 687 KNQRFAKKKYASVEAFHTGTAKGCGLRALKDIKKGRFIIEYVGEVVERDDYEKR--KKKY 744

Query: 163 LGE---TNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGE----TRIGIF 215
             +    + YLC+      IDAT  GN SR++NHSC PN   +KW +       +RIG F
Sbjct: 745 AADEKHKHHYLCDTGV-YTIDATVYGNPSRFVNHSCEPNAVCEKWSVPKTPGDISRIGFF 803

Query: 216 ATRDIKKGENLTYDYQY 232
           A + IK GE +T+DYQ+
Sbjct: 804 AKKSIKAGEEITFDYQF 820


>gi|307106603|gb|EFN54848.1| hypothetical protein CHLNCDRAFT_35816, partial [Chlorella
           variabilis]
          Length = 239

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 56/122 (45%), Positives = 74/122 (60%), Gaps = 2/122 (1%)

Query: 116 MKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINR 175
           M++  T   G G+ A   I RG F++EY GEVIDD+ C  R    K   E +FY+ E+  
Sbjct: 1   MEVYLTPDKGWGVRAMAFIPRGTFIVEYAGEVIDDKECSRRAEDAKARNEPHFYMMEMAP 60

Query: 176 DMVIDATYKGNKSRYINHSCCPNTEMQKWIIDG--ETRIGIFATRDIKKGENLTYDYQYE 233
            ++IDA  KGN +R +N SC PN E QKW   G  E R+GIF+ RD+  GE LTYDYQ++
Sbjct: 61  GLIIDARSKGNLARLLNSSCDPNCETQKWHDAGNSEVRVGIFSLRDVLPGEELTYDYQFQ 120

Query: 234 FL 235
             
Sbjct: 121 HF 122


>gi|345566121|gb|EGX49068.1| hypothetical protein AOL_s00079g289 [Arthrobotrys oligospora ATCC
           24927]
          Length = 980

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 61/172 (35%), Positives = 91/172 (52%), Gaps = 3/172 (1%)

Query: 62  EDDGIFCSCTAS--PGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKL 118
           E++ + C C      G +  C ++C        C  G   CG  C N+ FQ R    + +
Sbjct: 161 EEELMSCDCRPEYDDGVNHACSQNCINAETFVECVDGDSNCGGQCQNQRFQKREYANVSV 220

Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
           +QTE  G G+ A+  ++ G F+ EYVGEVI +    +R          +FY   I     
Sbjct: 221 IQTEMKGYGLRANTSMEPGTFIYEYVGEVIGESQFRKRRELYGKEDIKHFYFMSIKVGEY 280

Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
           IDAT +G  +R+ NHSC PN+ ++KW++ G+ R+GIFA   I+ GE LT+DY
Sbjct: 281 IDATKRGCLARFCNHSCNPNSMVEKWVVGGKLRMGIFAKVKIEAGEELTFDY 332


>gi|340506525|gb|EGR32648.1| SET domain protein [Ichthyophthirius multifiliis]
          Length = 978

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 59/142 (41%), Positives = 90/142 (63%), Gaps = 2/142 (1%)

Query: 93  CSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
           C+  C+CG+ C N+ FQ      +    T   G G+ A + I +G F+I+Y+GEV D  +
Sbjct: 27  CNLSCRCGDLCQNRRFQKHEDACVYPYPTSGKGWGLCAGQYIPKGTFIIQYIGEVFDINS 86

Query: 153 CEERLWKMKHLGETN-FYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETR 211
            EE + ++K    +   YL +I+++ VID T+KGN +R+INHSC PN   QKW + GE  
Sbjct: 87  -EEGIKRVKDYSRSTCTYLMKIDKNEVIDPTFKGNLARFINHSCDPNCITQKWHVLGEIC 145

Query: 212 IGIFATRDIKKGENLTYDYQYE 233
           IGIFA ++IK+ + LT+DYQ++
Sbjct: 146 IGIFAIKNIKEDDELTFDYQFD 167


>gi|301115083|ref|XP_002999311.1| histone-lysine N-methyltransferase, putative [Phytophthora
           infestans T30-4]
 gi|262111405|gb|EEY69457.1| histone-lysine N-methyltransferase, putative [Phytophthora
           infestans T30-4]
          Length = 905

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 80/252 (31%), Positives = 115/252 (45%), Gaps = 39/252 (15%)

Query: 1   MPAAKKNSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRR 60
           +P    ++ N    H F +LLK I   V  + P          Y  + R+IY  K+ K  
Sbjct: 422 LPKRDPDASNPNDCHHF-RLLKNIIEDVRLQPPT---------YRKLGRSIYTFKQPKVS 471

Query: 61  LEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC----------------SSGCKCGNSCL 104
           LED    C C         C  DC   +  + C                 S C  G +C 
Sbjct: 472 LED-APTCVCKER------CGDDCINRLSFTECFGPAPTPGMKFNKHNRESNCMLGENCG 524

Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
           N+    +   + +   T + G  +   E  K G+ VIEYVGEVI+++  E RL  + H  
Sbjct: 525 NRALHQKVYPRFQKFHTVEKGWALRLLEPAKAGQLVIEYVGEVINEEEKERRL--LDHAK 582

Query: 165 ET----NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDI 220
            +    N Y+ E+ +   IDA +KG+ SR+INHSC PN  + KW + G  RI I A +DI
Sbjct: 583 NSPEDKNMYIMELGKGEYIDARFKGSVSRFINHSCDPNCHLLKWRVKGVNRIAITALKDI 642

Query: 221 KKGENLTYDYQY 232
           + G  L+YDYQ+
Sbjct: 643 EPGTELSYDYQF 654


>gi|336463582|gb|EGO51822.1| hypothetical protein NEUTE1DRAFT_125465 [Neurospora tetrasperma
           FGSC 2508]
          Length = 1162

 Score =  108 bits (271), Expect = 1e-21,   Method: Composition-based stats.
 Identities = 67/204 (32%), Positives = 105/204 (51%), Gaps = 26/204 (12%)

Query: 38  KPKAIPYVFIKRNIYLTK-----RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSS 92
           +PK   Y  + +N ++ +     +     ED    C CT   G    C +DC   ++L  
Sbjct: 630 QPKPAAYRTMTKNRFIGQAAAIWKKTPHFEDFASKCVCTPEDG----CAQDCQNRVMLYE 685

Query: 93  CS-SGCKCGNS-CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDD 150
           C  + C  G   C N+ FQ      M   +T+K G   +  E       ++EY GE+I D
Sbjct: 686 CDDTNCNVGKEFCQNRAFQ------MLTERTKKGGRYRIGVE------IIMEYTGEIITD 733

Query: 151 QTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGET 210
           + CE R+   ++     +YL   +++M+IDAT  G+ +R++NHSC PN  M KWI+ G+ 
Sbjct: 734 EECERRM-NEEYKNNECYYLMSFDQNMIIDAT-TGSIARFVNHSCSPNCRMIKWIVSGQP 791

Query: 211 RIGIFA-TRDIKKGENLTYDYQYE 233
           R+ +FA  R I+ GE LTYDY ++
Sbjct: 792 RMALFAGDRPIQTGEELTYDYNFD 815


>gi|350297194|gb|EGZ78171.1| hypothetical protein NEUTE2DRAFT_101826 [Neurospora tetrasperma
           FGSC 2509]
          Length = 1162

 Score =  108 bits (271), Expect = 1e-21,   Method: Composition-based stats.
 Identities = 67/204 (32%), Positives = 105/204 (51%), Gaps = 26/204 (12%)

Query: 38  KPKAIPYVFIKRNIYLTK-----RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSS 92
           +PK   Y  + +N ++ +     +     ED    C CT   G    C +DC   ++L  
Sbjct: 630 QPKPAAYRTMTKNRFIGQAAAIWKKTPHFEDFASKCVCTPEDG----CAQDCQNRVMLYE 685

Query: 93  CS-SGCKCGNS-CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDD 150
           C  + C  G   C N+ FQ      M   +T+K G   +  E       ++EY GE+I D
Sbjct: 686 CDDTNCNVGKEFCQNRAFQ------MLTERTKKGGRYRIGVE------IIMEYTGEIITD 733

Query: 151 QTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGET 210
           + CE R+   ++     +YL   +++M+IDAT  G+ +R++NHSC PN  M KWI+ G+ 
Sbjct: 734 EECERRM-NEEYKNNECYYLMSFDQNMIIDAT-TGSIARFVNHSCSPNCRMIKWIVSGQP 791

Query: 211 RIGIFA-TRDIKKGENLTYDYQYE 233
           R+ +FA  R I+ GE LTYDY ++
Sbjct: 792 RMALFAGDRPIQTGEELTYDYNFD 815


>gi|348683753|gb|EGZ23568.1| hypothetical protein PHYSODRAFT_487680 [Phytophthora sojae]
          Length = 868

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 76/238 (31%), Positives = 111/238 (46%), Gaps = 39/238 (16%)

Query: 15  HAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASP 74
           H F +L++ I   V  + P          Y  + R+IY  K+ K  LED    C C    
Sbjct: 442 HHF-RLMRNIIEDVRLQPPT---------YRKLGRSIYTFKQPKVSLED-APMCECKLQ- 489

Query: 75  GSSGVCDRDCHCGMLLSSC----------------SSGCKCGNSCLNKPFQNRPVKKMKL 118
                C  DC   +  + C                 S C  G +C N+    +   + + 
Sbjct: 490 -----CGDDCINRLSFTECFGPAPTPGMKFNKQNRESNCMLGENCGNRALHQKIYPRFEK 544

Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGET----NFYLCEIN 174
             T + G  +   E +K G+ VIEYVGEVI+++  E RL  + H   +    N Y+ E+ 
Sbjct: 545 FHTVEKGWALRVLEPVKAGQLVIEYVGEVINEEEKERRL--LDHAKNSPEDKNMYIMELG 602

Query: 175 RDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           +   IDA +KG+ SR+INHSC PN  + KW + G  RI I A +DI+ G  L+YDYQ+
Sbjct: 603 KGEYIDARFKGSVSRFINHSCDPNCHLLKWRVKGVNRIAITALKDIEPGTELSYDYQF 660


>gi|242011020|ref|XP_002426255.1| protein MLP1, putative [Pediculus humanus corporis]
 gi|212510318|gb|EEB13517.1| protein MLP1, putative [Pediculus humanus corporis]
          Length = 2688

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 66/209 (31%), Positives = 108/209 (51%), Gaps = 5/209 (2%)

Query: 31   ELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGV-CDRDCHCGML 89
            +LP   I P +  Y  I+ NIY   + +     +   C+CT    +    C  +C   M+
Sbjct: 1861 QLPGRDIVP-SWNYKKIRTNIYYDVKPQMTGTYEAQACNCTVPLNTKQKGCGEECINRMV 1919

Query: 90   LSSCSSG-CKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADEDIKRGEFVIEYVGEV 147
             + CS   C C   C N+  Q       ++   T+  G GI   + IK G+F++EYVGEV
Sbjct: 1920 FAECSPQLCPCKEKCSNQRIQKHEFAPGLQKFMTKNKGWGIRTKQPIKAGDFILEYVGEV 1979

Query: 148  IDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIID 207
            + D+  ++R+  + ++ + + Y   ++  +VID    G   R++NHSC PN EMQKW ++
Sbjct: 1980 VSDKEFKDRMASI-YVNDKHHYCLHLDGGLVIDGHRMGGDGRFVNHSCNPNCEMQKWSVN 2038

Query: 208  GETRIGIFATRDIKKGENLTYDYQYEFLH 236
            G  R+ +FA R+I   + LTYDY +   +
Sbjct: 2039 GLFRMALFALRNIPAHQELTYDYNFSLFN 2067


>gi|164429563|ref|XP_964116.2| hypothetical protein NCU01932 [Neurospora crassa OR74A]
 gi|157073530|gb|EAA34880.2| predicted protein [Neurospora crassa OR74A]
          Length = 1162

 Score =  108 bits (271), Expect = 2e-21,   Method: Composition-based stats.
 Identities = 67/204 (32%), Positives = 105/204 (51%), Gaps = 26/204 (12%)

Query: 38  KPKAIPYVFIKRNIYLTK-----RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSS 92
           +PK   Y  + +N ++ +     +     ED    C CT   G    C +DC   ++L  
Sbjct: 630 QPKPAAYRTMTKNRFIGQAAAIWKKTPHFEDFASKCVCTPEDG----CAQDCQNRVMLYE 685

Query: 93  CS-SGCKCGNS-CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDD 150
           C  + C  G   C N+ FQ      M   +T+K G   +  E       ++EY GE+I D
Sbjct: 686 CDDTNCNVGKEFCQNRAFQ------MLTERTKKGGRYRIGVE------IIMEYTGEIITD 733

Query: 151 QTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGET 210
           + CE R+   ++     +YL   +++M+IDAT  G+ +R++NHSC PN  M KWI+ G+ 
Sbjct: 734 EECERRM-NEEYKNNECYYLMSFDQNMIIDAT-TGSIARFVNHSCSPNCRMIKWIVSGQP 791

Query: 211 RIGIFA-TRDIKKGENLTYDYQYE 233
           R+ +FA  R I+ GE LTYDY ++
Sbjct: 792 RMALFAGDRPIQTGEELTYDYNFD 815


>gi|401884695|gb|EJT48845.1| hypothetical protein A1Q1_02180 [Trichosporon asahii var. asahii
           CBS 2479]
          Length = 1192

 Score =  108 bits (270), Expect = 2e-21,   Method: Composition-based stats.
 Identities = 71/200 (35%), Positives = 103/200 (51%), Gaps = 12/200 (6%)

Query: 38  KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGC 97
           K +   Y  I  N +L +R K  + D  + C C    G    C       +   SC +G 
Sbjct: 471 KKRPPQYEMITSNQFL-ERPKIPVSDKEV-CHCVVGTGCGLSCTNRLQGILCGKSCPNGP 528

Query: 98  KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
            CGN  L +    RP K + +  +   G G+ A EDI  GEFV++Y GEVI   T  +R+
Sbjct: 529 DCGNQALCR----RPAKAIIVALSSLHGYGVFAAEDIAAGEFVVDYRGEVISVDTFIDRI 584

Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKW--IIDG--ETRIG 213
               H  + + +    +RD +ID++ KGN +R+INHSC PN  ++K+  + DG  E   G
Sbjct: 585 G--SHSEDDSVFAIAYDRDEIIDSSSKGNSARFINHSCDPNLVLRKFDTLGDGHEEHEFG 642

Query: 214 IFATRDIKKGENLTYDYQYE 233
           +++ R IK GE LTYDY  E
Sbjct: 643 LWSRRPIKAGEELTYDYNAE 662


>gi|451852041|gb|EMD65336.1| hypothetical protein COCSADRAFT_35392 [Cochliobolus sativus ND90Pr]
          Length = 787

 Score =  108 bits (270), Expect = 2e-21,   Method: Composition-based stats.
 Identities = 66/196 (33%), Positives = 98/196 (50%), Gaps = 17/196 (8%)

Query: 59  RRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGN-SCLNKPFQ------- 109
           +R + D   C C    G    C   CH  ++   C S+ C      C N+PF        
Sbjct: 366 KRDKQDTSQCYCDVDDG----CGESCHNRIMAYECDSTNCPLAKEQCNNRPFAELRKRAK 421

Query: 110 -NRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNF 168
            NR    ++++ TE  G G+ A    +  + ++EY GE+I    CE R+ K  +  +  +
Sbjct: 422 GNRYDYGVEVLNTEDRGFGVRAMRTFEPHQIIVEYAGEIITQSECERRM-KQVYKKDKCY 480

Query: 169 YLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLT 227
           YL   +  M+IDAT +G  +R++NHSC PN EM KW + GE R+ +FA +R I  GE LT
Sbjct: 481 YLMSFDNKMIIDAT-RGTIARFVNHSCEPNCEMIKWTVGGEPRMALFAGSRGIMTGEELT 539

Query: 228 YDYQYEFLHDSLIAYC 243
           YDY ++      I  C
Sbjct: 540 YDYNFDPFSQKNIQEC 555


>gi|451997634|gb|EMD90099.1| hypothetical protein COCHEDRAFT_1225655 [Cochliobolus
           heterostrophus C5]
          Length = 787

 Score =  108 bits (269), Expect = 2e-21,   Method: Composition-based stats.
 Identities = 66/196 (33%), Positives = 98/196 (50%), Gaps = 17/196 (8%)

Query: 59  RRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC-SSGCK-CGNSCLNKPFQ------- 109
           +R + D   C C    G    C   CH  ++   C S+ C      C N+PF        
Sbjct: 366 KRDKQDTSQCYCDVDDG----CGESCHNRIMAYECDSTNCPLTKEQCNNRPFAELKKRAK 421

Query: 110 -NRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNF 168
            NR    ++++ TE  G G+ A    +  + ++EY GE+I    CE R+ K  +  +  +
Sbjct: 422 GNRYDYGVEVLNTEDRGFGVRAMRTFEPHQIIVEYAGEIITQSECERRM-KQVYKKDKCY 480

Query: 169 YLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLT 227
           YL   +  M+IDAT +G  +R++NHSC PN EM KW + GE R+ +FA +R I  GE LT
Sbjct: 481 YLMSFDNKMIIDAT-RGTIARFVNHSCEPNCEMIKWTVGGEPRMALFAGSRGIMTGEELT 539

Query: 228 YDYQYEFLHDSLIAYC 243
           YDY ++      I  C
Sbjct: 540 YDYNFDPFSQKNIQEC 555


>gi|116193789|ref|XP_001222707.1| hypothetical protein CHGG_06612 [Chaetomium globosum CBS 148.51]
 gi|88182525|gb|EAQ89993.1| hypothetical protein CHGG_06612 [Chaetomium globosum CBS 148.51]
          Length = 907

 Score =  108 bits (269), Expect = 2e-21,   Method: Composition-based stats.
 Identities = 60/178 (33%), Positives = 97/178 (54%), Gaps = 18/178 (10%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQNRPVKKMK-------- 117
           C C  + G    CD DC   ++L  C  + C  G + C N+ FQ+   +  K        
Sbjct: 449 CVCQPADG----CDEDCQNRIMLYECDDTNCNFGKAHCQNRAFQDLQERTKKGGRYRVGV 504

Query: 118 -LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            +V+T   G G+ ++   +  + ++EY GE+I +  CE R+   ++     +YL   +++
Sbjct: 505 EVVKTGDRGYGVRSNRCFEANQIIMEYTGEIITEAECERRM-NEEYKDNECYYLMSFDQN 563

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
           M+IDAT  G+ +R++NHSC PN  M KWI+ G+ R+ +FA  R I  GE LTYDY ++
Sbjct: 564 MIIDAT-TGSIARFVNHSCSPNCRMIKWIVAGQPRMALFAGDRPIMTGEELTYDYNFD 620


>gi|429862858|gb|ELA37465.1| histone-lysine n-methyltransferase [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 805

 Score =  108 bits (269), Expect = 3e-21,   Method: Composition-based stats.
 Identities = 61/188 (32%), Positives = 97/188 (51%), Gaps = 18/188 (9%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCG-NSCLNKPFQNRPVKK---------M 116
           C CT   G    C   C   ++L  C    C  G  +C N+ F +   +K         +
Sbjct: 394 CVCTPDDG----CGESCQNRIMLYECDEKNCNVGRENCTNRAFADLQERKAGGGKYRVGV 449

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
           ++++T   G GI A+      + ++EY GE+I D+ C  R+ + K+     +YL   +++
Sbjct: 450 EVIKTADRGYGIRANRCFAPNQIIMEYTGEIITDEECSNRM-ETKYKDNKCYYLMSFDQN 508

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYEFL 235
           M+IDAT  G+ +R++NHSC PN  M KWI+ G+ R+ +FA  + I  GE LTYDY +   
Sbjct: 509 MIIDAT-TGSIARFVNHSCAPNCRMIKWIVSGQPRMALFAGDKPIMTGEELTYDYNFSPF 567

Query: 236 HDSLIAYC 243
            D  +  C
Sbjct: 568 SDENVQKC 575


>gi|301100332|ref|XP_002899256.1| histone-lysine N-methyltransferase, putative [Phytophthora
           infestans T30-4]
 gi|262104173|gb|EEY62225.1| histone-lysine N-methyltransferase, putative [Phytophthora
           infestans T30-4]
          Length = 1501

 Score =  108 bits (269), Expect = 3e-21,   Method: Composition-based stats.
 Identities = 59/154 (38%), Positives = 89/154 (57%), Gaps = 6/154 (3%)

Query: 93  CSSGCKCGNSCLNKPFQNRPVKK--MKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDD 150
           CS+ C     C N+  Q+ P +   ++  +TE+ G G+   + I +   V EYVGE+ID 
Sbjct: 103 CSASCPAAQYCRNQRLQH-PERYPLLEPFKTEQKGYGVRTRQHIPQLSIVGEYVGEIIDQ 161

Query: 151 QTCEERLWKM-KHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGE 209
           +    RL  + +H  E NFY   +   + IDA  KG+ +R++NHSC PN + +KW + GE
Sbjct: 162 KELARRLKSVPRH--ELNFYYLLLAPGVYIDARNKGSFTRFVNHSCEPNCKTEKWTVKGE 219

Query: 210 TRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
           TRI + A RDI+ GE LT+DY+++ L    I  C
Sbjct: 220 TRIAVSALRDIEVGEELTFDYKWKALGSRQITCC 253


>gi|357483173|ref|XP_003611873.1| Histone-lysine N-methyltransferase SUVR5 [Medicago truncatula]
 gi|355513208|gb|AES94831.1| Histone-lysine N-methyltransferase SUVR5 [Medicago truncatula]
          Length = 1507

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 67/167 (40%), Positives = 91/167 (54%), Gaps = 21/167 (12%)

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
            G L+  C+  C+C  SC N+  QN    K+++ +TEK G G+ A E I RG FV EY+GE
Sbjct: 1325 GYLVYECNRMCRCNKSCPNRILQNGVRVKLEVFKTEKKGWGVRAGEAILRGTFVCEYIGE 1384

Query: 147  VIDDQTCEERLWKMKHLGETN-FYLCEIN---RDM----------VIDATYKGNKSRYIN 192
            V+D Q    R    K  G  N  Y  +IN    DM          VIDA+  GN SR+IN
Sbjct: 1385 VLDVQEAHNR---RKRYGTGNCSYFYDINARVNDMSRMIEEKAQYVIDASKNGNVSRFIN 1441

Query: 193  HSCCPNTEMQKWIIDG----ETRIGIFATRDIKKGENLTYDYQYEFL 235
            HSC PN    + +++      + IG +A++DI  GE LTY +QYE +
Sbjct: 1442 HSCSPNLVSHQVLVESMDCERSHIGFYASQDIALGEELTYGFQYELV 1488


>gi|258573585|ref|XP_002540974.1| predicted protein [Uncinocarpus reesii 1704]
 gi|237901240|gb|EEP75641.1| predicted protein [Uncinocarpus reesii 1704]
          Length = 727

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 72/225 (32%), Positives = 116/225 (51%), Gaps = 27/225 (12%)

Query: 33  PDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSS 92
           PD + K     +V    +I+   +IK     +   C+CT   G    CD +CH   +   
Sbjct: 333 PDEWRKANKNVFVGDATSIWKANKIK-----EHSTCTCTPETG----CDENCHNRYMFYE 383

Query: 93  CS-SGCKCGNS-CLNKPFQ--NRPVKK-------MKLVQTEKCGAGIVADEDIKRGEFVI 141
           C  + CK G   C N+PF    R  K        +++++TE  G G+ ++      + ++
Sbjct: 384 CDDTNCKLGPELCQNRPFSELRRRSKAGGKFNIGVEVIKTEDRGYGVRSNRAFNPNQIIV 443

Query: 142 EYVGEVIDDQTCEERLWKMKHLGETN--FYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
           EY GE++  + CE R   M+ + + N  +YL   +++MVIDAT +G+ +R+INHSC PN 
Sbjct: 444 EYTGEILTQEECERR---MRTVYKKNDCYYLMYFDQNMVIDAT-RGSIARFINHSCEPNC 499

Query: 200 EMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEFLHDSLIAYC 243
            M+KW + G+ R+ +FA  D I  GE LTYDY ++      +  C
Sbjct: 500 RMEKWTVAGKPRMALFAGEDGIMTGEELTYDYNFDPYSQKNVQEC 544


>gi|350631656|gb|EHA20027.1| hypothetical protein ASPNIDRAFT_179076 [Aspergillus niger ATCC
           1015]
          Length = 853

 Score =  107 bits (268), Expect = 4e-21,   Method: Composition-based stats.
 Identities = 57/177 (32%), Positives = 95/177 (53%), Gaps = 17/177 (9%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK---------MK 117
           C CT   G    CD +C    +   C  G C  G  C N+ F+    +          ++
Sbjct: 427 CMCTPETG----CDEECQNRYMFYECDEGNCGVGEECGNRSFEELKQRTKAGGKYNIGVE 482

Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
           +++T   G G+ ++   +  + ++EY GE+I    CE+R+  +    E  +YL   +++M
Sbjct: 483 VIKTADRGYGVRSNRTFEPNQIIVEYTGEIITQTECEKRMRTIYKHNEC-YYLMYFDQNM 541

Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
           +IDAT +G+ +R++NHSC PN  M+KW + G+ R+ +FA  R I  GE LTYDY ++
Sbjct: 542 IIDAT-RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDRGIMTGEELTYDYNFD 597


>gi|301624151|ref|XP_002941371.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Xenopus
            (Silurana) tropicalis]
          Length = 3106

 Score =  107 bits (268), Expect = 4e-21,   Method: Composition-based stats.
 Identities = 64/204 (31%), Positives = 103/204 (50%), Gaps = 20/204 (9%)

Query: 46   FIKRNIYLTKRIKRRLEDD--GIFCSC---TASPGSSG--VCDRDCHCGMLLSSCSSGCK 98
             I+ N+YLT+R K +   D   + C C   +    + G   C  DC   +L+  CSS C 
Sbjct: 1989 LIEENVYLTERKKSKSHRDIKRMQCECPVLSKEERAQGQVACGEDCLNRLLMIECSSRCP 2048

Query: 99   CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
             G+ C N+ FQ +    ++ + TEK G G+ A +D+K   FV+EY GEV+D +  + R+ 
Sbjct: 2049 NGDYCSNRSFQKKQHAGVEAILTEKKGWGLRAAKDLKSNTFVLEYCGEVLDHKEFKSRVK 2108

Query: 159  KMKHLGETNFYLCEINRDMVIDATYKG---------NKSRYINHSCCPNTEMQKWIIDGE 209
            +       ++Y   +  D V  +  +            S ++N    PNT  QKW ++G+
Sbjct: 2109 EYARNKNIHYYFMALKNDEVSSSGQRNLSPTNLPCPRSSSWVN----PNTVSQKWTVNGQ 2164

Query: 210  TRIGIFATRDIKKGENLTYDYQYE 233
             R+G F TR +  G  LT+DYQ++
Sbjct: 2165 VRVGFFTTRVVPAGSELTFDYQFQ 2188


>gi|358369137|dbj|GAA85752.1| histone-lysine N-methyltransferase [Aspergillus kawachii IFO 4308]
          Length = 852

 Score =  107 bits (267), Expect = 4e-21,   Method: Composition-based stats.
 Identities = 57/177 (32%), Positives = 95/177 (53%), Gaps = 17/177 (9%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK---------MK 117
           C CT   G    CD +C    +   C  G C  G  C N+ F+    +          ++
Sbjct: 426 CMCTPETG----CDEECQNRYMFYECDEGNCGVGEECGNRSFEELKQRTKAGGKYNIGVE 481

Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
           +++T   G G+ ++   +  + ++EY GE+I    CE+R+  +    E  +YL   +++M
Sbjct: 482 VIKTADRGYGVRSNRTFEPNQIIVEYTGEIITQTECEKRMRTIYKHNEC-YYLMYFDQNM 540

Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
           +IDAT +G+ +R++NHSC PN  M+KW + G+ R+ +FA  R I  GE LTYDY ++
Sbjct: 541 IIDAT-RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDRGIMTGEELTYDYNFD 596


>gi|253743884|gb|EET00168.1| Histone methyltransferase HMT1 [Giardia intestinalis ATCC 50581]
          Length = 297

 Score =  107 bits (267), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 70/226 (30%), Positives = 104/226 (46%), Gaps = 40/226 (17%)

Query: 41  AIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCG 100
           ++ YV IKRN Y+    KR       FC+CT   G+   C   C    +   C   C  G
Sbjct: 52  SLHYVHIKRNTYVG--CKRPSTARKTFCTCTCREGAG--CGPGCELRRVHLECYKECLAG 107

Query: 101 NSCLNKPF---------------------------QNRPVKKMKLVQTE-----KCGAGI 128
             C  +P                             N+ +++++  +T      K G G+
Sbjct: 108 PLCFQRPIVKPLFSSIMDSPRLATFHKDNNKHGDCGNQRLQRLQYARTAVYPAGKKGYGL 167

Query: 129 VADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKS 188
            A  +I+RG  V EY+GEVI  + C  R    K     + Y   +++++ IDA  KGN+S
Sbjct: 168 FALTNIQRGTLVTEYIGEVITKEECMRR----KKDATGHLYFLALDKELYIDAARKGNES 223

Query: 189 RYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEF 234
           R+INHSC PN E+Q W +  E R  I A R I   E L++DY+++F
Sbjct: 224 RFINHSCDPNCEVQLWYVGEEPRAAIVALRSIVPYEELSFDYKFDF 269


>gi|317033153|ref|XP_001394944.2| histone-lysine N-methyltransferase (Ash1) [Aspergillus niger CBS
           513.88]
          Length = 825

 Score =  107 bits (267), Expect = 4e-21,   Method: Composition-based stats.
 Identities = 57/177 (32%), Positives = 95/177 (53%), Gaps = 17/177 (9%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK---------MK 117
           C CT   G    CD +C    +   C  G C  G  C N+ F+    +          ++
Sbjct: 399 CMCTPETG----CDEECQNRYMFYECDEGNCGVGEECGNRSFEELKQRTKAGGKYNIGVE 454

Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
           +++T   G G+ ++   +  + ++EY GE+I    CE+R+  +    E  +YL   +++M
Sbjct: 455 VIKTADRGYGVRSNRTFEPNQIIVEYTGEIITQTECEKRMRTIYKHNEC-YYLMYFDQNM 513

Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
           +IDAT +G+ +R++NHSC PN  M+KW + G+ R+ +FA  R I  GE LTYDY ++
Sbjct: 514 IIDAT-RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDRGIMTGEELTYDYNFD 569


>gi|299116669|emb|CBN74814.1| Whsc1 protein [Ectocarpus siliculosus]
          Length = 1551

 Score =  107 bits (266), Expect = 6e-21,   Method: Composition-based stats.
 Identities = 71/207 (34%), Positives = 103/207 (49%), Gaps = 19/207 (9%)

Query: 43  PYVFIKRNIYLTKRIKRRLEDDGI-FCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKC-- 99
           PY  +K N+Y    +K  +  D I  C+C    G    CD  C   +LL  C+ G +C  
Sbjct: 260 PYRKLKANLYEDSSLKGLVPVDEIPLCNCRPEDG----CDASCINRLLLMECAPG-RCPT 314

Query: 100 ----GNSCLNKPFQNRPVKKMKLVQT-EKCGAGIVADED--IKRGEFVIEYVGEVIDDQT 152
                  C N   Q +     ++ +T E  G G+   E+   + G  + EY+GEVI    
Sbjct: 315 LRGASKYCNNNAIQTKTFPATEVFRTFEGRGWGLRLAEERGAEAGTLLHEYLGEVIMMDE 374

Query: 153 CEERLWKMKHLG----ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDG 208
           C  RL K+   G      +FY   ++ ++V+D    G ++R+ NHSC PN  MQKW + G
Sbjct: 375 CRRRLRKVGRKGVEGSSGDFYFASLDGNLVLDGGPMGGEARFANHSCSPNCLMQKWSVLG 434

Query: 209 ETRIGIFATRDIKKGENLTYDYQYEFL 235
           ETR+ + A RDI  GE LTY+YQ + L
Sbjct: 435 ETRVVLVAARDISVGEELTYNYQADTL 461


>gi|378733973|gb|EHY60432.1| histone-lysine N-methyltransferase ASH1L [Exophiala dermatitidis
           NIH/UT8656]
          Length = 780

 Score =  107 bits (266), Expect = 6e-21,   Method: Composition-based stats.
 Identities = 75/242 (30%), Positives = 127/242 (52%), Gaps = 34/242 (14%)

Query: 15  HAFNKLLKQIGNPVEFELP-DWFI-----KPKAIPYVFIKRNIYLTK-----RIKRRLED 63
           +A  +LLKQ     +F+LP D F      +PK   +  + +N+++       R+ +  E 
Sbjct: 350 YAGERLLKQ---GRDFKLPFDVFSPLPAGQPKPDEWRKVNKNVFVGDAAQAWRVSKFEEH 406

Query: 64  DGIFCSCTASPGSSGVCDRDCHCGMLLSSCSS-GCKCGNS-CLNKPFQ--NRPVKK---- 115
               C C  S G    CD +C    +   C    C      C N+ F+   + VKK    
Sbjct: 407 SR--CMCKPSTG----CDHNCMNRYMFYECDDRNCNLSEELCGNRQFEALRQRVKKGGKY 460

Query: 116 ---MKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCE 172
              +++++TE  G G+ ++   +  + ++EY GE+I  + CE R+  M + G   +YL  
Sbjct: 461 NVGVEVIKTEDRGYGVRSNRTFEPNQIIVEYTGEIITQEECERRMNTM-YKGNECYYLML 519

Query: 173 INRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQ 231
            +++M+IDAT +G+ +R++NHSC PN  M+KW ++G+ R+ +FA  R I  GE LTYDY 
Sbjct: 520 FDQNMIIDAT-RGSIARFVNHSCAPNCRMEKWTVNGKPRMALFAGDRGIMTGEELTYDYN 578

Query: 232 YE 233
           ++
Sbjct: 579 FD 580


>gi|449664137|ref|XP_002169363.2| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Hydra
           magnipapillata]
          Length = 655

 Score =  106 bits (265), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 67/197 (34%), Positives = 95/197 (48%), Gaps = 24/197 (12%)

Query: 43  PYVF--IKRNIYLTKRIKRRLEDD--GIFCSCTASPGSSGV--CDRDCHCGMLLSSCSSG 96
           P  F   + NIYL  + K R   +   + C C   P       CD +C   +L+  C++ 
Sbjct: 229 PLTFDSTETNIYLIDKEKSRSSKEVRRMICECVFEPDDPFFVGCDENCLNRLLMIECNNR 288

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C     C N+ FQ    K+    + E              G FV EY GEV+D      R
Sbjct: 289 CPTREFCTNRNFQ----KQEDFYENE--------------GSFVSEYCGEVVDYTEFHRR 330

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
                  G  ++Y   +  + +IDAT KG+KSR+INHSC PN   QKW ++G  R+G FA
Sbjct: 331 TKLYNAEGMNHYYFMTLKTNEIIDATKKGSKSRFINHSCDPNCITQKWTVNGFLRVGFFA 390

Query: 217 TRDIKKGENLTYDYQYE 233
            R I+ GE L++DYQ++
Sbjct: 391 LRYIEAGEELSFDYQFQ 407


>gi|121704806|ref|XP_001270666.1| histone-lysine N-methyltransferase (Ash1), putative [Aspergillus
           clavatus NRRL 1]
 gi|119398812|gb|EAW09240.1| histone-lysine N-methyltransferase (Ash1), putative [Aspergillus
           clavatus NRRL 1]
          Length = 847

 Score =  106 bits (264), Expect = 9e-21,   Method: Composition-based stats.
 Identities = 59/179 (32%), Positives = 97/179 (54%), Gaps = 21/179 (11%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK---------MK 117
           C C    G    CD +C    +   C  G C  G  C N+ F+    +          ++
Sbjct: 420 CMCIEETG----CDENCQNRYMFYECDEGNCGLGPDCGNRSFEELKQRTKVGGKYNIGVE 475

Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN--FYLCEINR 175
           +++TE  G G+ ++   +  + ++EY GE+I    CE+R   M+ L + N  +YL   ++
Sbjct: 476 VIKTEDRGYGVRSNRTFEPNQVIVEYTGEIITQVECEKR---MRTLYKNNECYYLMYFDQ 532

Query: 176 DMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
           +M+IDAT +G+ +R++NHSC PN  M+KW + G+ R+ +FA  R I  GE LTYDY ++
Sbjct: 533 NMIIDAT-RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDRGIMTGEELTYDYNFD 590


>gi|15488418|gb|AAL01110.1|AF408059_1 ASH1-like protein 1 [Arabidopsis thaliana]
          Length = 229

 Score =  106 bits (264), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 55/127 (43%), Positives = 77/127 (60%)

Query: 108 FQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN 167
           FQ     K KL++ E  G G+VA E+IK G+F++EY GEVI  +  ++R    +  G  +
Sbjct: 1   FQKCEYAKTKLIKCEGRGWGLVALEEIKAGQFIMEYCGEVISWKEAKKRAQTYETHGVKD 60

Query: 168 FYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLT 227
            Y+  +N    IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA   I     L 
Sbjct: 61  AYIISLNASEAIDATKKGSLARFINHSCRPNCETRKWNVLGEVRVGIFAKESISPRTELA 120

Query: 228 YDYQYEF 234
           YDY +E+
Sbjct: 121 YDYNFEW 127


>gi|380493696|emb|CCF33689.1| SET domain-containing protein [Colletotrichum higginsianum]
          Length = 833

 Score =  106 bits (264), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 63/176 (35%), Positives = 96/176 (54%), Gaps = 14/176 (7%)

Query: 80  CDRDCHCGMLLSSCS-SGCKCGNS-CLNKPF---QNRPVKKMK------LVQTEKCGAGI 128
           C  DC   ++L  C    C  G   C N+ F   Q+R V   K      +++T   G GI
Sbjct: 433 CGEDCQNRIMLYECDDKNCNVGREHCTNRAFADLQDRRVGGGKYRVGVEVIKTPDRGYGI 492

Query: 129 VADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKS 188
            A+   +  + ++EY GE+I D+ C ER+ + K+     +YL   +++M+IDAT KG+ +
Sbjct: 493 RANRCFESNQIIMEYTGEIITDEECSERM-ENKYKNNKCYYLMSFDQNMIIDAT-KGSIA 550

Query: 189 RYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYEFLHDSLIAYC 243
           R++NHSC PN  M KWI+ G+ R+ +FA  + I  GE LTYDY +    D  +  C
Sbjct: 551 RFVNHSCAPNCRMIKWIVSGQPRMALFAGDKPIMTGEELTYDYNFSPFSDENVQTC 606


>gi|449497711|ref|XP_004160490.1| PREDICTED: histone-lysine N-methyltransferase SUVR5-like [Cucumis
            sativus]
          Length = 1419

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 60/166 (36%), Positives = 90/166 (54%), Gaps = 19/166 (11%)

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
            G L+  C+  C C  +C N+  QN    K+++  TE  G  + A E I RG FV EYVGE
Sbjct: 1245 GYLVYECNERCSCSRACPNRVLQNGVHVKLEVFMTETKGWAVRAGEAIMRGTFVCEYVGE 1304

Query: 147  VIDDQTCEERLWKMKHLGETNFYLCEINRDM-------------VIDATYKGNKSRYINH 193
            V+D+Q    R  + K+  E N Y  +++  +             +IDAT+ GN SR+INH
Sbjct: 1305 VLDEQEANRR--RDKYNSEGNCYFLDVDAHINDISRLVDGSARYIIDATHYGNVSRFINH 1362

Query: 194  SCCPNTEMQKWIIDG----ETRIGIFATRDIKKGENLTYDYQYEFL 235
            SC PN    + +++      + IG++A R+I  GE LT++Y+ E L
Sbjct: 1363 SCSPNLVTYQVLVESMEYQRSHIGLYANRNIATGEELTFNYRRELL 1408


>gi|310794022|gb|EFQ29483.1| SET domain-containing protein [Glomerella graminicola M1.001]
          Length = 828

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 60/176 (34%), Positives = 96/176 (54%), Gaps = 14/176 (7%)

Query: 80  CDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQNRPVKK---------MKLVQTEKCGAGI 128
           C  DC   ++L  C    C  G   C N+ F +   +K         +++++T   G GI
Sbjct: 419 CGEDCQNRIMLYECDDKNCNVGREHCTNRAFADLQERKSGGGKYRVGVEVIKTSDRGYGI 478

Query: 129 VADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKS 188
            A+   +  + ++EY GE+I D+ C ER+ + K+     +YL   +++M+IDAT KG+ +
Sbjct: 479 RANRCFEPNQIIMEYTGEIITDEECSERM-ENKYKDSKCYYLMSFDQNMIIDAT-KGSIA 536

Query: 189 RYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYEFLHDSLIAYC 243
           R++NHSC PN  M KWI+ G+ R+ +FA  + I  GE LTYDY +    D  +  C
Sbjct: 537 RFVNHSCAPNCRMIKWIVSGQPRMALFAGDKPIMTGEELTYDYNFSPFSDENVQKC 592


>gi|145533713|ref|XP_001452601.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124420300|emb|CAK85204.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1137

 Score =  105 bits (262), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 66/181 (36%), Positives = 94/181 (51%), Gaps = 7/181 (3%)

Query: 63  DDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKKMKLVQT 121
           +D I  S    P  S  C   C      + C    C C   C N+ FQ     +   V  
Sbjct: 12  EDCIKFSRIQGPQYSYNCGDRCLNKFTCTECDVELCPCAELCKNRRFQKH---EDACVYP 68

Query: 122 EKCGA---GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
            KCG    G+ A + I++G+F+++Y+GE+    +   R    ++   T  YL ++N   V
Sbjct: 69  LKCGGKGMGLYAGDRIQKGQFIMQYIGEIFQINSALGRRRVQEYSKSTCTYLMKLNNQEV 128

Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
           ID T KGN +R+INHSC PN   +KW + GE  IGIFA+RDI + E LT+DYQ++  H  
Sbjct: 129 IDPTTKGNLARFINHSCEPNCITEKWNVLGEVCIGIFASRDINEDEELTFDYQFDVFHTP 188

Query: 239 L 239
           L
Sbjct: 189 L 189


>gi|367041896|ref|XP_003651328.1| hypothetical protein THITE_2111453 [Thielavia terrestris NRRL 8126]
 gi|346998590|gb|AEO64992.1| hypothetical protein THITE_2111453 [Thielavia terrestris NRRL 8126]
          Length = 950

 Score =  105 bits (262), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 58/178 (32%), Positives = 94/178 (52%), Gaps = 18/178 (10%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNS-CLNKPFQNRPVKKMK-------- 117
           C C    G    C  DC   ++L  C  + C  G   C N+ FQ+   +  K        
Sbjct: 453 CVCKPEDG----CGEDCQNRIMLYECDETNCNIGREYCTNRAFQDLQERTKKGGRYRVGV 508

Query: 118 -LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            + +T   G G+ ++   +  + ++EY GE+I ++ CE R+   ++     +YL   +++
Sbjct: 509 EVFKTPDRGYGVRSNRCFEPNQIIMEYTGEIITEEECERRM-NEEYKDNECYYLMSFDQN 567

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
           M+IDAT  G+ +R++NHSC PN  M KWI+ G+ R+ +FA  R I  GE LTYDY ++
Sbjct: 568 MIIDAT-TGSIARFVNHSCSPNCRMIKWIVSGQPRMALFAGDRPIMTGEELTYDYNFD 624


>gi|71003762|ref|XP_756547.1| hypothetical protein UM00400.1 [Ustilago maydis 521]
 gi|46095711|gb|EAK80944.1| hypothetical protein UM00400.1 [Ustilago maydis 521]
          Length = 1367

 Score =  105 bits (262), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 71/205 (34%), Positives = 106/205 (51%), Gaps = 18/205 (8%)

Query: 38  KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SG 96
           K K   Y  I +N Y+T+    +L+ +   C+C   PGS   C  DC   ML+  C    
Sbjct: 654 KRKPPRYQQINKNKYVTRA---KLQGEVPLCNC--KPGSG--CGHDCINRMLMFICDPKT 706

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C   ++C N     RP  K  +    + G G+   E IKR +F+ EY GEVI+     +R
Sbjct: 707 CPSASNCTNISLGRRPHVKTAVAYYGRRGFGLKTLEAIKRDDFIDEYRGEVINLSEAAKR 766

Query: 157 LWKMKHLGETNFYLCEINRDM--VIDATYKGNKSRYINHSCCPNTEMQKWIIDG------ 208
           + + ++    N+YL + +     ++D   KGN +R+ NHSC PN  ++K+II G      
Sbjct: 767 VTE-EYKATGNYYLLDYDSAAGELLDGGRKGNITRFANHSCDPNCRIEKFIICGTDEALS 825

Query: 209 -ETRIGIFATRDIKKGENLTYDYQY 232
            E +IG+FA RDI  GE LTY+Y +
Sbjct: 826 AEFQIGLFANRDIAAGEELTYNYGW 850


>gi|323507954|emb|CBQ67825.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
          Length = 1423

 Score =  105 bits (262), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 70/205 (34%), Positives = 107/205 (52%), Gaps = 18/205 (8%)

Query: 38  KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SG 96
           K K   Y  I +N Y+++    +L+ +   C+C   PGS   C  DC   ML+  C    
Sbjct: 679 KRKPPRYQQISKNKYVSRA---KLQGETPLCNC--KPGSG--CGADCINRMLMFICDPRT 731

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C   ++C N     RP  K  +    + G G+   E IK+ +F+ EY GEVI+     +R
Sbjct: 732 CPSASNCTNVSLGRRPTVKTTVAYYGRRGFGLKTLEAIKKDDFIDEYRGEVINLGEAAKR 791

Query: 157 LWKMKHLGETNFYLCEINRDM--VIDATYKGNKSRYINHSCCPNTEMQKWIIDG------ 208
           + + ++    NFYL + +     ++D   KGN +R+ NHSC PN  ++K+II G      
Sbjct: 792 VTE-EYKATGNFYLLDYDSAAGELLDGGRKGNITRFANHSCDPNCRIEKFIICGTDEALS 850

Query: 209 -ETRIGIFATRDIKKGENLTYDYQY 232
            E +IG+FA RDI +GE LTY+Y +
Sbjct: 851 AEFQIGLFANRDIAEGEELTYNYGW 875


>gi|406694205|gb|EKC97537.1| hypothetical protein A1Q2_08152 [Trichosporon asahii var. asahii
           CBS 8904]
          Length = 1192

 Score =  105 bits (262), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 71/200 (35%), Positives = 101/200 (50%), Gaps = 12/200 (6%)

Query: 38  KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGC 97
           K +   Y  I  N +L +R K    D  + C C    G    C       +   SC +G 
Sbjct: 471 KKRPPQYEMITSNQFL-ERPKIPASDKEV-CHCVVGTGCGLSCTNRLQGILCGKSCPNGP 528

Query: 98  KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
            CGN  L +    RP K + +  +   G G  A EDI  GEFV++Y GEVI   T  +R+
Sbjct: 529 DCGNQALCR----RPAKAIIVALSSLHGYGEFAAEDIAAGEFVVDYRGEVISVDTFIDRI 584

Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKW--IIDG--ETRIG 213
               H  + + +    +RD +ID++ KGN +R+INHSC PN  ++K+  + DG  E   G
Sbjct: 585 G--SHSEDDSVFAIAYDRDEIIDSSSKGNSARFINHSCDPNLVLRKFDTLGDGHEEHEFG 642

Query: 214 IFATRDIKKGENLTYDYQYE 233
           +++ R IK GE LTYDY  E
Sbjct: 643 LWSRRPIKAGEELTYDYNAE 662


>gi|308162821|gb|EFO65192.1| Histone methyltransferase HMT1 [Giardia lamblia P15]
          Length = 298

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 69/227 (30%), Positives = 102/227 (44%), Gaps = 41/227 (18%)

Query: 41  AIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCG 100
           ++ Y  IKRNIY+    KR       FC+C    G+   C   C    +   C   C  G
Sbjct: 52  SLHYTHIKRNIYVG--CKRPSAARKTFCTCACKEGAG--CGASCELRKVHLECYKECCAG 107

Query: 101 NSCLNKPF----------------------------QNRPVKKMKLVQTE-----KCGAG 127
             C  +                               N+ +++M+  +T      + G G
Sbjct: 108 TVCFKQAIVNPLFNSLIDPPRDIAESYKDSSKFIDCGNQRLQRMQYARTAVYPAGRKGYG 167

Query: 128 IVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNK 187
           + A   I+RG  V EYVGEVI  + C  R    K   + + Y   ++ ++ IDA +KGN+
Sbjct: 168 LFALNSIQRGALVTEYVGEVITREECLRR----KRSAKGHLYFLALDGELYIDAAHKGNE 223

Query: 188 SRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEF 234
           SR+INHSC PN E+Q W +  E R  I A R I   E L++DY+++F
Sbjct: 224 SRFINHSCDPNCEVQLWYVGEEPRAAIVALRSIAPHEELSFDYKFDF 270


>gi|443714650|gb|ELU06966.1| hypothetical protein CAPTEDRAFT_176480 [Capitella teleta]
          Length = 936

 Score =  105 bits (261), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 69/216 (31%), Positives = 108/216 (50%), Gaps = 10/216 (4%)

Query: 33  PDWFIKPKAIPYVFIKRNIYL-TKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLS 91
           P     PK   Y  I+ N+Y+ TK   +  E     C   + P     C   C   M+ +
Sbjct: 141 PQKLASPK---YKKIRHNVYVDTKPQCKGWEPHPCSCRIPSDPNEPA-CGDYCLNRMVYT 196

Query: 92  SCSSG-CKCGNSCLNKPFQ-NRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVID 149
            C++G C CG+ C N+  Q +   + ++   T   G G+ +   +  G+++ EY+GEV+ 
Sbjct: 197 ECNAGACPCGDRCSNQRIQRHHHAEGLEKFVTADRGHGVRSKHPLVNGQYICEYLGEVVS 256

Query: 150 DQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGE 209
           ++    R+    +    + Y   ++   VID    G+ SR+INHSC PN EMQKW I+G 
Sbjct: 257 EEEFRRRMAD-DYSAAPHHYCLNLDSGTVIDGYRMGSISRFINHSCEPNCEMQKWNINGV 315

Query: 210 TRIGIFATRDIKKGENLTYDYQYEF--LHDSLIAYC 243
            RI +F+ +DI  GE LTYDY ++   +H   I  C
Sbjct: 316 YRIALFSLKDIPPGEELTYDYNFQSYNVHSQQICKC 351


>gi|330920531|ref|XP_003299046.1| hypothetical protein PTT_09957 [Pyrenophora teres f. teres 0-1]
 gi|311327461|gb|EFQ92879.1| hypothetical protein PTT_09957 [Pyrenophora teres f. teres 0-1]
          Length = 791

 Score =  105 bits (261), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 65/196 (33%), Positives = 97/196 (49%), Gaps = 17/196 (8%)

Query: 59  RRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCG-NSCLNKPFQ------- 109
           +R + D   C C    G    C   CH  ++   C ++ C      C N+PF        
Sbjct: 364 KRDKQDSSQCYCDVEDG----CGEACHNRIMAYECDNTNCPLPPEQCNNRPFAELKRRAK 419

Query: 110 -NRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNF 168
            NR    ++++ TE  G G+ A    +  + ++EY GE+I    CE R+ K  +  +  +
Sbjct: 420 GNRYDYGVEVLDTEDRGYGVRAMRTFEPHQIIVEYAGEIITQSECERRM-KQVYKKDKCY 478

Query: 169 YLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLT 227
           YL   +  M+IDAT +G  +R++NHSC PN EM KW + GE R+ +FA  R I  GE LT
Sbjct: 479 YLMSFDNKMIIDAT-RGTIARFVNHSCEPNCEMIKWTVGGEPRMALFAGPRGIMTGEELT 537

Query: 228 YDYQYEFLHDSLIAYC 243
           YDY ++      I  C
Sbjct: 538 YDYNFDPFSQKNIQEC 553


>gi|296413387|ref|XP_002836395.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295630213|emb|CAZ80586.1| unnamed protein product [Tuber melanosporum]
          Length = 901

 Score =  105 bits (261), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 57/180 (31%), Positives = 96/180 (53%), Gaps = 18/180 (10%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCG-NSCLNKPFQN---------RPVKKM 116
           C C    G    CD +C        C  G C  G   C N+ F +         +  + +
Sbjct: 395 CVCAEEVG----CDENCLNMCTWVECDEGNCNVGVERCTNRAFADLKERVKSGTKFAEGV 450

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
           ++V+T  CG G+ A       + ++EY GE+I  +  E R+ ++ +    N+YL   +++
Sbjct: 451 EVVKTANCGHGLRATRGFMPNQIIVEYTGEIITQEESERRMVEV-YKDNKNYYLMLFHQN 509

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEFL 235
           M++DAT +G+ +R++NHSC PN  M+KW+++G  R+ +FA  D I+ GE LTYDY + + 
Sbjct: 510 MILDAT-RGSVARFVNHSCDPNCRMEKWLVEGRPRMALFAGDDGIEAGEELTYDYNFNWF 568


>gi|346977253|gb|EGY20705.1| hypothetical protein VDAG_10334 [Verticillium dahliae VdLs.17]
          Length = 787

 Score =  105 bits (261), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 58/178 (32%), Positives = 97/178 (54%), Gaps = 18/178 (10%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNS-CLNKPFQNRPVKKM--------- 116
           C C    G    CD DC   ++L  C  + C  G + C N+ F +   +K+         
Sbjct: 337 CVCKPQDG----CDEDCQNRIMLYECDENNCNVGKAFCTNRAFADLHERKVAGGKYRTGV 392

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
           ++++T   G GI ++      + ++EY GE+I ++ CE R+   K+     +YL   +++
Sbjct: 393 EVIKTSDRGHGIRSNRCFDANQIIMEYTGEIITEEECENRM-NTKYKNNDCYYLMSFDQN 451

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
           M+IDAT  G+ +R++NHSC PN +M KWI+ G+ R+ +FA    I  G+ LTYDY ++
Sbjct: 452 MIIDAT-TGSIARFVNHSCKPNCKMIKWIVGGQPRMALFAGDAPIMTGDELTYDYNFD 508


>gi|115402571|ref|XP_001217362.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114189208|gb|EAU30908.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 790

 Score =  104 bits (260), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 56/177 (31%), Positives = 93/177 (52%), Gaps = 17/177 (9%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK---------MK 117
           C CT   G    CD +C    +   C  G C  G  C N+ F     +          ++
Sbjct: 390 CMCTEETG----CDENCQNRYMFYECDDGNCSLGPGCGNRNFNELKQRTKAGGKYNIGVE 445

Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
           +++T   G G+ ++      + ++EY GE+I    CE+R+  +    E  +YL   +++M
Sbjct: 446 VIKTADRGYGVRSNRTFDPNQIIVEYTGEIITQSECEKRMRTIYKNNEC-YYLMYFDQNM 504

Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
           +IDAT +G+ +R++NHSC PN  M+KW + G+ R+ +FA  R +  GE LTYDY ++
Sbjct: 505 IIDAT-RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDRGVMTGEELTYDYNFD 560


>gi|358346394|ref|XP_003637253.1| Histone-lysine N-methyltransferase SETDB1, partial [Medicago
           truncatula]
 gi|355503188|gb|AES84391.1| Histone-lysine N-methyltransferase SETDB1, partial [Medicago
           truncatula]
          Length = 338

 Score =  104 bits (260), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/164 (37%), Positives = 90/164 (54%), Gaps = 15/164 (9%)

Query: 87  GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
           G L+  C+  CKC  +C N+  QN    K+++ +TEK G G+ A E I RG FV EY+GE
Sbjct: 156 GYLVYECNEECKCDKTCPNRILQNGIHVKLEVFKTEKKGWGVRACEAISRGTFVCEYIGE 215

Query: 147 VIDDQTCEERLWKMKHLGETNFY-----------LCEINRDMVIDATYKGNKSRYINHSC 195
           V+D+Q    R  +        FY           L E     VID+T  GN SR+IN+SC
Sbjct: 216 VLDEQEARNRRERYGKEHCDYFYDVDARVNDMSRLIEREARYVIDSTRYGNVSRFINNSC 275

Query: 196 CPNTEMQKWIIDG----ETRIGIFATRDIKKGENLTYDYQYEFL 235
            PN    + +++      + IG++A++DI KG+ LTY+Y YE +
Sbjct: 276 SPNLVNYQVLVESMDCKRSHIGLYASQDIAKGDELTYNYHYELV 319


>gi|171676169|ref|XP_001903038.1| hypothetical protein [Podospora anserina S mat+]
 gi|170936150|emb|CAP60810.1| unnamed protein product [Podospora anserina S mat+]
          Length = 894

 Score =  104 bits (260), Expect = 3e-20,   Method: Composition-based stats.
 Identities = 59/185 (31%), Positives = 96/185 (51%), Gaps = 18/185 (9%)

Query: 61  LEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQNRPVKKMK- 117
            ED    C C    G    C  DC   ++L  C  + C  G   C N+ FQ+   +  K 
Sbjct: 433 FEDFSSKCVCKPEDG----CAEDCQNRIMLYECDDTNCNAGREFCQNRAFQDLQERTKKG 488

Query: 118 --------LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFY 169
                   +++T   G G+ +    +  + ++EY GE+I ++ CE R+   K+     +Y
Sbjct: 489 GRFRVGVEVLKTSDRGYGVRSTRCFEPNQIIMEYTGEIITEEECERRM-NEKYKDNECYY 547

Query: 170 LCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTY 228
           L   +++M+IDAT  G+ +R++NHSC PN  M KWI+ G+ R+ +FA  + I  G+ LTY
Sbjct: 548 LMSFDQNMIIDAT-TGSMARFVNHSCSPNCRMIKWIVSGQPRMALFAGDKPIMTGDELTY 606

Query: 229 DYQYE 233
           DY ++
Sbjct: 607 DYNFD 611


>gi|359476736|ref|XP_002269759.2| PREDICTED: histone-lysine N-methyltransferase SUVR5-like [Vitis
            vinifera]
          Length = 1517

 Score =  104 bits (259), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 64/166 (38%), Positives = 91/166 (54%), Gaps = 18/166 (10%)

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
            G L+  C+  C C  +C N+  QN    K+++ +TE+ G  + A E I RG F+ EY+GE
Sbjct: 1334 GYLVYECNGKCSCNRTCQNRVLQNGVRVKLEVFRTEEKGWAVRAGEAILRGTFICEYIGE 1393

Query: 147  VIDDQTCEERLWKMKHLGETNFYLCEIN---RDM----------VIDATYKGNKSRYINH 193
            V+ +Q  ++R    +H  E   Y  +I+    DM          VIDAT  GN SR+INH
Sbjct: 1394 VLSEQEADKR-GNNRHGEEGCSYFYDIDSHINDMSRLVEGQVPYVIDATRYGNVSRFINH 1452

Query: 194  SCCPNTEMQKWIIDGE----TRIGIFATRDIKKGENLTYDYQYEFL 235
            SC PN    + +++        IG+FA RDI  GE LTYDY+Y+ L
Sbjct: 1453 SCSPNLINHQVLVESMDCQLAHIGLFANRDISLGEELTYDYRYKPL 1498


>gi|320168697|gb|EFW45596.1| Setd1a protein [Capsaspora owczarzaki ATCC 30864]
          Length = 1312

 Score =  104 bits (259), Expect = 4e-20,   Method: Composition-based stats.
 Identities = 51/132 (38%), Positives = 79/132 (59%), Gaps = 3/132 (2%)

Query: 115  KMKLVQTEKCGA---GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLC 171
            + KL++ ++ G    G+ + E+I   + VIEYVGEVI     + R    +  G  + Y  
Sbjct: 1171 RKKLLKFQRSGIHAFGLFSQENISANDLVIEYVGEVIRQSISDIREHHYERRGIGSSYFF 1230

Query: 172  EINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQ 231
             I+ D V+DATYKGN +R++NH C PN   +  ++DG  RI I++ RDIKKGE +TYDY+
Sbjct: 1231 RIDEDHVVDATYKGNLARFMNHCCEPNCYAKIIMVDGHQRIVIYSKRDIKKGEEITYDYK 1290

Query: 232  YEFLHDSLIAYC 243
            + +  + +   C
Sbjct: 1291 FPYEENKIPCLC 1302


>gi|268566865|ref|XP_002639833.1| C. briggsae CBR-MET-1 protein [Caenorhabditis briggsae]
          Length = 2074

 Score =  104 bits (259), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 68/190 (35%), Positives = 103/190 (54%), Gaps = 18/190 (9%)

Query: 52  YLTKRI-KRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQN 110
           YLT+   K+R E   + C C  + G+    D  C    +++ C S C     C N+ F  
Sbjct: 672 YLTRSANKKRTE--SLLCECGRTGGT--CSDNTCVNRAMMTECPSSCTA--KCKNQRFAK 725

Query: 111 RPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGET--- 166
           + +  ++   T    G G+ A +DIK+G F+IEY+GEV++    E+R  K+K+  +    
Sbjct: 726 KKIASVEAYHTGTAKGCGLRALKDIKKGRFIIEYIGEVVERDDYEKR--KLKYAADKKHK 783

Query: 167 NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGE----TRIGIFATRDIKK 222
           + YLC+      IDAT  GN SR++NHSC PN   +KW +       +RIG FA + IK 
Sbjct: 784 HHYLCDTG-IYTIDATVYGNASRFVNHSCDPNAVCEKWSVPKTPGDISRIGFFAKKFIKS 842

Query: 223 GENLTYDYQY 232
           GE +T+DYQ+
Sbjct: 843 GEEITFDYQF 852


>gi|396469794|ref|XP_003838493.1| hypothetical protein LEMA_P114290.1 [Leptosphaeria maculans JN3]
 gi|312215061|emb|CBX95014.1| hypothetical protein LEMA_P114290.1 [Leptosphaeria maculans JN3]
          Length = 880

 Score =  104 bits (259), Expect = 4e-20,   Method: Composition-based stats.
 Identities = 63/193 (32%), Positives = 96/193 (49%), Gaps = 17/193 (8%)

Query: 62  EDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCG-NSCLNKPFQ--------NR 111
           + D   C C    G    C   CH  ++   C S+ C+     C N+PF         NR
Sbjct: 371 KQDSSQCYCGPEDG----CGEACHNRIMAYECDSTNCRLSPEQCGNRPFAELKRRAKGNR 426

Query: 112 PVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLC 171
               +++V T   G G+ A    +  + ++EY GE+I    CE R+ K  +  +  +YL 
Sbjct: 427 YDYGVEVVDTHDRGFGVRAMRTFEPHQIIVEYAGEIITQWECERRM-KQVYKKDKCYYLM 485

Query: 172 EINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDY 230
             +  M+IDAT +G  +R++NHSC PN EM KW + GE R+ +FA +R +  G+ LTYDY
Sbjct: 486 SFDNKMIIDAT-RGTIARFVNHSCEPNCEMIKWTVGGEPRMALFAGSRGVMTGDELTYDY 544

Query: 231 QYEFLHDSLIAYC 243
            ++      I  C
Sbjct: 545 NFDPFSQKNIQVC 557


>gi|297735229|emb|CBI17591.3| unnamed protein product [Vitis vinifera]
          Length = 1315

 Score =  104 bits (259), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 64/166 (38%), Positives = 91/166 (54%), Gaps = 18/166 (10%)

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
            G L+  C+  C C  +C N+  QN    K+++ +TE+ G  + A E I RG F+ EY+GE
Sbjct: 1132 GYLVYECNGKCSCNRTCQNRVLQNGVRVKLEVFRTEEKGWAVRAGEAILRGTFICEYIGE 1191

Query: 147  VIDDQTCEERLWKMKHLGETNFYLCEIN---RDM----------VIDATYKGNKSRYINH 193
            V+ +Q  ++R    +H  E   Y  +I+    DM          VIDAT  GN SR+INH
Sbjct: 1192 VLSEQEADKR-GNNRHGEEGCSYFYDIDSHINDMSRLVEGQVPYVIDATRYGNVSRFINH 1250

Query: 194  SCCPNTEMQKWIIDGE----TRIGIFATRDIKKGENLTYDYQYEFL 235
            SC PN    + +++        IG+FA RDI  GE LTYDY+Y+ L
Sbjct: 1251 SCSPNLINHQVLVESMDCQLAHIGLFANRDISLGEELTYDYRYKPL 1296


>gi|392560212|gb|EIW53395.1| hypothetical protein TRAVEDRAFT_154887 [Trametes versicolor FP-101664
            SS1]
          Length = 1014

 Score =  104 bits (259), Expect = 4e-20,   Method: Composition-based stats.
 Identities = 53/117 (45%), Positives = 69/117 (58%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A E I RGE VIEYVGEVI  Q  ++R    +  G  + YL  I+ D+V+DAT KGN
Sbjct: 888  GLYAMEKISRGEMVIEYVGEVIRAQVADKREKAYERQGIGSSYLFRIDEDLVVDATKKGN 947

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
              R INHSC PN   +   I GE +I I+A +DI+ G  +TYDY +    D +   C
Sbjct: 948  LGRLINHSCDPNCTAKIITISGEKKIVIYAKQDIELGSEITYDYHFPIEQDKIPCLC 1004


>gi|407918366|gb|EKG11637.1| hypothetical protein MPH_11130 [Macrophomina phaseolina MS6]
          Length = 980

 Score =  103 bits (258), Expect = 4e-20,   Method: Composition-based stats.
 Identities = 64/204 (31%), Positives = 100/204 (49%), Gaps = 26/204 (12%)

Query: 58  KRRLEDDGIFCSCTASPGSSGVCDRD-CHCGMLLSSCSS-GCKCGNSCLNKPFQN----- 110
           K++ + +   C+C       G C+RD C    L   C    C  G +C N+ F N     
Sbjct: 436 KKQRQMEASTCNCV------GRCERDGCFNASLFFECDDRSCNLGPNCGNRQFNNLQKRY 489

Query: 111 ----RPVKKMK-------LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
               R  K  K       +++T   G G+ A     + + V+EY+GE+I  Q  + R+ +
Sbjct: 490 KDDTRQGKYFKSFNVGVEVMETPDRGHGVRAMRPFHQDQIVVEYIGEIITQQESDRRVNE 549

Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
           + +     FYL      ++ID  Y+GN +R++NHSC PN  M+KW ++GE R+ +FA R+
Sbjct: 550 V-YKDHKAFYLMNFYDKLIIDG-YRGNVARFVNHSCDPNCRMEKWTVNGEQRMALFANRN 607

Query: 220 IKKGENLTYDYQYEFLHDSLIAYC 243
           I  GE LT+ Y +E        YC
Sbjct: 608 IMTGEELTWHYNFESYGKEQPCYC 631


>gi|389646281|ref|XP_003720772.1| hypothetical protein MGG_02937 [Magnaporthe oryzae 70-15]
 gi|351638164|gb|EHA46029.1| hypothetical protein MGG_02937 [Magnaporthe oryzae 70-15]
          Length = 1015

 Score =  103 bits (258), Expect = 4e-20,   Method: Composition-based stats.
 Identities = 66/200 (33%), Positives = 105/200 (52%), Gaps = 20/200 (10%)

Query: 46  FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS-C 103
           F+  ++ L K  K    D+   C CT   G    C  DC    +L  C+ + C  G   C
Sbjct: 448 FVGESLSLWK--KNYYYDNRSTCVCTKDDG----CGEDCLNRSVLYECNDTNCNVGREHC 501

Query: 104 LNKPFQNRPVKKMK---------LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCE 154
            N+ FQ+   +  K         +V T   G G+ A    + G+ ++EY GE+I ++ CE
Sbjct: 502 QNRAFQDLQDRNKKGGSYRVGVEVVHTGPRGFGVRASRCFEPGQIIMEYAGEIITEEECE 561

Query: 155 ERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGI 214
            R+ ++    E  +YL   +++M++DAT  G+ +R++NHSC PN  M KWI+ G+ R+ +
Sbjct: 562 RRMNEVYKDNEA-YYLMSFDQNMILDAT-TGSIARFVNHSCSPNCRMIKWIVCGKPRMAL 619

Query: 215 FATRD-IKKGENLTYDYQYE 233
           FA  + I  GE LTYDY ++
Sbjct: 620 FAGDNPIMTGEELTYDYNFD 639


>gi|449448546|ref|XP_004142027.1| PREDICTED: histone-lysine N-methyltransferase SUVR5-like [Cucumis
            sativus]
          Length = 1406

 Score =  103 bits (258), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 58/163 (35%), Positives = 89/163 (54%), Gaps = 19/163 (11%)

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
            G L+  C+  C C  +C N+  QN    K+++  TE  G  + A E I RG FV EYVGE
Sbjct: 1245 GYLVYECNERCSCSRACPNRVLQNGVHVKLEVFMTETKGWAVRAGEAIMRGTFVCEYVGE 1304

Query: 147  VIDDQTCEERLWKMKHLGETNFYLCEINRDM-------------VIDATYKGNKSRYINH 193
            V+D+Q    R  + K+  E N Y  +++  +             +IDAT+ GN SR+INH
Sbjct: 1305 VLDEQEANRR--RDKYNSEGNCYFLDVDAHINDISRLVDGSARYIIDATHYGNVSRFINH 1362

Query: 194  SCCPNTEMQKWIIDG----ETRIGIFATRDIKKGENLTYDYQY 232
            SC PN    + +++      + IG++A R+I  GE LT++Y++
Sbjct: 1363 SCSPNLVTYQVLVESMEYQRSHIGLYANRNIATGEELTFNYRW 1405


>gi|425778490|gb|EKV16615.1| Histone-lysine N-methyltransferase (Ash1), putative [Penicillium
           digitatum PHI26]
 gi|425784214|gb|EKV22005.1| Histone-lysine N-methyltransferase (Ash1), putative [Penicillium
           digitatum Pd1]
          Length = 770

 Score =  103 bits (258), Expect = 5e-20,   Method: Composition-based stats.
 Identities = 55/187 (29%), Positives = 99/187 (52%), Gaps = 17/187 (9%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK---------MK 117
           C CT   G    CD +C    +   C +G C+ G  C N+ F+    +          ++
Sbjct: 389 CMCTEDTG----CDENCQNRYMFYECDNGNCRLGPECGNRNFEGLKHRTKAGGKYNIGVE 444

Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
           +++T   G G+ ++      + ++EY GE++    CE+R+  +    E  +YL   +++M
Sbjct: 445 VIKTADRGYGVRSNRSFDPNQIIVEYTGEILTQLECEKRMRTVYKNNEC-YYLMYFDQNM 503

Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYEFLH 236
           +IDAT +G+ +R++NH+C PN  M+KW + G+ R+ +FA  R I  GE L+YDY ++   
Sbjct: 504 IIDAT-RGSIARFVNHACEPNCRMEKWTVAGKPRMALFAGDRGISTGEELSYDYNFDPYS 562

Query: 237 DSLIAYC 243
           +  +  C
Sbjct: 563 NKNVQQC 569


>gi|452836869|gb|EME38812.1| hypothetical protein DOTSEDRAFT_83490 [Dothistroma septosporum
           NZE10]
          Length = 1236

 Score =  103 bits (258), Expect = 5e-20,   Method: Composition-based stats.
 Identities = 60/184 (32%), Positives = 105/184 (57%), Gaps = 15/184 (8%)

Query: 68  CSCTASPGSSGV-CDRDCHCGMLLSSCSS-GCKC-GNSCLNKPFQ--NRPVKK------- 115
           C C+A      + CD  C   ++   C+   C     +C N+PF   +R +KK       
Sbjct: 550 CVCSAPTEEDDIGCDETCLNRVMQYECNEDNCALDAATCSNRPFAEIDRRLKKGGSFDIG 609

Query: 116 MKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINR 175
           +++V+T+K G GI +    +  + ++EY GE+I +  C+ R+ + ++  + N+YL E+ R
Sbjct: 610 VEVVKTDKRGFGIRSTRSFRPDQIIMEYTGEIISEGECQRRM-REEYKDKPNYYLMELER 668

Query: 176 DMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEF 234
            +VID T KG+ +R+INH+C PN  ++ + ++G  R+G+FA +  I  GE LTYDY ++ 
Sbjct: 669 GLVIDGT-KGSMARFINHACEPNCTVKMFRVNGVARMGVFAGKSGIMTGEELTYDYNFDN 727

Query: 235 LHDS 238
             +S
Sbjct: 728 FGES 731


>gi|170095481|ref|XP_001878961.1| histone methyltransferase [Laccaria bicolor S238N-H82]
 gi|164646265|gb|EDR10511.1| histone methyltransferase [Laccaria bicolor S238N-H82]
          Length = 144

 Score =  103 bits (258), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 54/117 (46%), Positives = 72/117 (61%)

Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
           G+ A E I +GE VIEYVGEVI  Q  E+R    +  G  + YL  I+ D+V+DAT KGN
Sbjct: 18  GLYAMEKISKGEMVIEYVGEVIRAQVAEKREKTYERQGIGSSYLFRIDEDLVVDATKKGN 77

Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
             R INHSC PN   +   I GE +I I+A +DI+ G+ +TYDY + F  D ++  C
Sbjct: 78  LGRLINHSCDPNCTAKIITISGEKKIVIYAKQDIELGDEITYDYHFPFEQDKILCLC 134


>gi|71000549|ref|XP_754958.1| histone-lysine N-methyltransferase (Ash1) [Aspergillus fumigatus
           Af293]
 gi|66852595|gb|EAL92920.1| histone-lysine N-methyltransferase (Ash1), putative [Aspergillus
           fumigatus Af293]
 gi|159127971|gb|EDP53086.1| histone-lysine N-methyltransferase (Ash1), putative [Aspergillus
           fumigatus A1163]
          Length = 845

 Score =  103 bits (257), Expect = 6e-20,   Method: Composition-based stats.
 Identities = 56/177 (31%), Positives = 94/177 (53%), Gaps = 17/177 (9%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK---------MK 117
           C CT   G    CD +C    +   C  G C  G  C N+ F+    +          ++
Sbjct: 418 CMCTEETG----CDENCQNRYMFYECDDGNCGLGPGCGNRSFEELKQRTKAGGKYNIGVE 473

Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
           +++T   G G+ ++   +  + ++EY GE+I    CE+R+  +    E  +YL   +++M
Sbjct: 474 VIKTADRGYGVRSNRTFEPNQIIVEYTGEIITQAECEKRMRTIYKNNEC-YYLMYFDQNM 532

Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
           +IDAT +G+ +R++NHSC PN  M+KW + G+ R+ +FA    I  GE LTYDY ++
Sbjct: 533 IIDAT-RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDHGIMTGEELTYDYNFD 588


>gi|255945051|ref|XP_002563293.1| Pc20g07700 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211588028|emb|CAP86099.1| Pc20g07700 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 788

 Score =  103 bits (257), Expect = 6e-20,   Method: Composition-based stats.
 Identities = 53/187 (28%), Positives = 100/187 (53%), Gaps = 17/187 (9%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK---------MK 117
           C CT   G    CD +C    +   C +G C+ G  C N+ F+    +          ++
Sbjct: 389 CMCTEDTG----CDENCQNRYMFYECDNGNCRLGPECGNRNFEGLKQRTKAGGKYNIGVE 444

Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
           +++T   G G+ ++   +  + ++EY GE++    CE+R+ +  +     +YL   +++M
Sbjct: 445 VIKTADRGYGVRSNRSFEPNQIIVEYTGEILTQLECEKRM-RTVYKNNDCYYLMYFDQNM 503

Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYEFLH 236
           +IDAT +G+ +R++NH+C PN  M+KW + G+ R+ +FA  R +  GE L+YDY ++   
Sbjct: 504 IIDAT-RGSIARFVNHACEPNCRMEKWTVAGKPRMALFAGDRGVSTGEELSYDYNFDPYS 562

Query: 237 DSLIAYC 243
           +  +  C
Sbjct: 563 NKNVQEC 569


>gi|356502205|ref|XP_003519911.1| PREDICTED: histone-lysine N-methyltransferase SUVR5-like [Glycine
            max]
          Length = 1494

 Score =  103 bits (257), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 62/164 (37%), Positives = 86/164 (52%), Gaps = 15/164 (9%)

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
            G L+  C+  CKC  +C N+  QN    K+++ +TEK G  + A E I RG FV EY+GE
Sbjct: 1312 GYLVYECNQMCKCNKTCPNRILQNGIRIKLEVFKTEKKGWAVRAGEAILRGTFVCEYIGE 1371

Query: 147  VIDDQTCEERLWKMKHLGETNFY-----------LCEINRDMVIDATYKGNKSRYINHSC 195
            V+D Q  + R  +      + FY           L E     VID T  GN SR+IN+SC
Sbjct: 1372 VLDKQEAQNRRKRYGKEHCSYFYDVDDHVNDMGRLIEGQAHYVIDTTRFGNVSRFINNSC 1431

Query: 196  CPNTEMQKWIIDG----ETRIGIFATRDIKKGENLTYDYQYEFL 235
             PN    + +++        IG++A RDI  GE LTY+Y Y+ L
Sbjct: 1432 SPNLVSYQVLVESMDCERAHIGLYANRDIALGEELTYNYHYDLL 1475


>gi|238505954|ref|XP_002384179.1| histone-lysine N-methyltransferase (Ash1), putative [Aspergillus
           flavus NRRL3357]
 gi|220690293|gb|EED46643.1| histone-lysine N-methyltransferase (Ash1), putative [Aspergillus
           flavus NRRL3357]
 gi|391868614|gb|EIT77824.1| hypothetical protein Ao3042_05901 [Aspergillus oryzae 3.042]
          Length = 789

 Score =  103 bits (257), Expect = 7e-20,   Method: Composition-based stats.
 Identities = 71/240 (29%), Positives = 123/240 (51%), Gaps = 33/240 (13%)

Query: 16  AFNKLLKQIGNPVEFELP-DWFI-----KPKAIPYVFIKRNIYLTK-----RIKRRLEDD 64
           A ++LLKQ     +F+LP D F      +PK   +    +N+++ +     R  + LE  
Sbjct: 322 AGDRLLKQ---GRDFQLPFDIFSPLPSGQPKPNEWRKTNKNVFVGEASSIWRANKPLELS 378

Query: 65  GIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKK-------- 115
              C C    G    CD +C    +   C  + C  G  C N+ F+    +         
Sbjct: 379 K--CMCAEETG----CDEECQNRYMFYECDDTNCGVGPECGNRNFEELKQRTKAGGKYNI 432

Query: 116 -MKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEIN 174
            +++++TE  G G+ ++   +  + ++EY GE+I    CE+R+  +    E  +YL   +
Sbjct: 433 GVEVIKTEDRGYGVRSNRTFEPNQVIVEYTGEIITQAECEKRMRTIYKNNEC-YYLMYFD 491

Query: 175 RDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
           ++M+IDAT +G+ +R++NHSC PN  M+KW + G+ R+ +FA  R I  G+ LTYDY ++
Sbjct: 492 QNMIIDAT-RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDRGIMTGDELTYDYNFD 550


>gi|317151143|ref|XP_001824470.2| histone-lysine N-methyltransferase (Ash1) [Aspergillus oryzae
           RIB40]
          Length = 796

 Score =  103 bits (257), Expect = 7e-20,   Method: Composition-based stats.
 Identities = 70/238 (29%), Positives = 122/238 (51%), Gaps = 33/238 (13%)

Query: 18  NKLLKQIGNPVEFELP-DWFI-----KPKAIPYVFIKRNIYLTK-----RIKRRLEDDGI 66
           ++LLKQ     +F+LP D F      +PK   +    +N+++ +     R  + LE    
Sbjct: 331 DRLLKQ---GRDFQLPFDIFSPLPSGQPKPNEWRKTNKNVFVGEASSIWRANKPLELSK- 386

Query: 67  FCSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKK---------M 116
            C C    G    CD +C    +   C  + C  G  C N+ F+    +          +
Sbjct: 387 -CMCAEETG----CDEECQNRYMFYECDDTNCGVGPECGNRNFEELKQRTKAGGKYNIGV 441

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
           ++++TE  G G+ ++   +  + ++EY GE+I    CE+R+  +    E  +YL   +++
Sbjct: 442 EVIKTEDRGYGVRSNRTFEPNQVIVEYTGEIITQAECEKRMRTIYKNNEC-YYLMYFDQN 500

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
           M+IDAT +G+ +R++NHSC PN  M+KW + G+ R+ +FA  R I  G+ LTYDY ++
Sbjct: 501 MIIDAT-RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDRGIMTGDELTYDYNFD 557


>gi|357143271|ref|XP_003572863.1| PREDICTED: histone-lysine N-methyltransferase SUVR5-like
            [Brachypodium distachyon]
          Length = 1625

 Score =  103 bits (256), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 66/171 (38%), Positives = 90/171 (52%), Gaps = 16/171 (9%)

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
            G  +  C+S C C  SC NK  Q   + K++L  TE  G  + A + I RG FV EYVGE
Sbjct: 1447 GFPVYECNSLCTCDASCQNKVLQQGLLVKLELFSTENKGWAVRAADPIPRGTFVCEYVGE 1506

Query: 147  VIDDQTCEERLWKMKHLGETNFYLCEINRDM-------------VIDATYKGNKSRYINH 193
            V+ D     R  + +  GE + YL +IN  +             +IDAT  GN SR+INH
Sbjct: 1507 VVKDDEAM-RNTEREAKGECS-YLLQINSHIDQERAKTLGTIPYMIDATRYGNVSRFINH 1564

Query: 194  SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLH-DSLIAYC 243
            SC PN   +  ++D    +G+FA +DI  GE L+YDY+ + L  D    YC
Sbjct: 1565 SCSPNLNTRLVLVDQLAHVGLFANQDIAVGEELSYDYRQKLLSGDGCPCYC 1615


>gi|113470945|gb|ABI34874.1| nuclear receptor binding SET domain protein 1b [Danio rerio]
          Length = 119

 Score =  103 bits (256), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 45/102 (44%), Positives = 68/102 (66%)

Query: 137 GEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCC 196
           G F+ EYVGEVID++ C  R+   +     NFY+  +++D +IDA  KGN++R++NH C 
Sbjct: 1   GGFISEYVGEVIDEEECRARIKHAQENNIGNFYMLTLDKDRIIDAGPKGNEARFMNHCCQ 60

Query: 197 PNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
           PN E QKW ++G+TR+G+F+  DI  G  LT++Y  E L + 
Sbjct: 61  PNRETQKWTVNGDTRVGLFSLTDIPAGTELTFNYNLECLGNG 102


>gi|357480379|ref|XP_003610475.1| Histone-lysine N-methyltransferase SETDB1 [Medicago truncatula]
 gi|357497957|ref|XP_003619267.1| Histone-lysine N-methyltransferase SETDB1 [Medicago truncatula]
 gi|355494282|gb|AES75485.1| Histone-lysine N-methyltransferase SETDB1 [Medicago truncatula]
 gi|355511530|gb|AES92672.1| Histone-lysine N-methyltransferase SETDB1 [Medicago truncatula]
          Length = 1047

 Score =  103 bits (256), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 62/167 (37%), Positives = 92/167 (55%), Gaps = 21/167 (12%)

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
            G L+  C+  C+C  +C N+  QN    K+++ +TEK G G+ A E I RG FV EY+GE
Sbjct: 865  GYLVYECNDKCRCDKTCPNRILQNGIRVKLEVFKTEKKGWGVRAGEAISRGTFVCEYIGE 924

Query: 147  VIDDQTCEERLWKMKHLGETN---FY-----------LCEINRDMVIDATYKGNKSRYIN 192
            V+++Q    R    K  GE +   FY           L E     +ID+T  GN SR++N
Sbjct: 925  VLEEQEAHNRC---KSYGEEHCSYFYVVDARVNDMSRLIERQAQYIIDSTRYGNVSRFVN 981

Query: 193  HSCCPNTEMQKWIIDG----ETRIGIFATRDIKKGENLTYDYQYEFL 235
            +SC PN    + +++      +RIG++A+RDI  GE LT +Y YE +
Sbjct: 982  NSCSPNLLSYQVLVESMDCKRSRIGLYASRDIAFGEELTCNYHYELV 1028


>gi|224005002|ref|XP_002296152.1| SET-domain containing protein [Thalassiosira pseudonana CCMP1335]
 gi|209586184|gb|ACI64869.1| SET-domain containing protein [Thalassiosira pseudonana CCMP1335]
          Length = 173

 Score =  103 bits (256), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 52/153 (33%), Positives = 84/153 (54%), Gaps = 2/153 (1%)

Query: 81  DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
           D  C        C + C  G  C NK  Q R  KK+++      G G++  E+ K+G+F+
Sbjct: 5   DETCALFACQEECGNNCAAGPLCGNKRIQRREWKKLQVFDAGLKGRGLMVGEECKKGDFI 64

Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
            EYVG  +  +  +    + K   E   Y+  ++ D+ +DA ++G  +RYINHSC PN  
Sbjct: 65  CEYVGVAVKRRYLDGLFARYKS--ERMLYIMALDGDIYLDARHRGGIARYINHSCEPNCA 122

Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           + +W + G  R G+FA R+I +GE L++DYQ++
Sbjct: 123 VHRWKVRGIIRAGVFALREILEGEELSFDYQWD 155


>gi|119493172|ref|XP_001263805.1| histone-lysine N-methyltransferase (Ash1), putative [Neosartorya
           fischeri NRRL 181]
 gi|119411965|gb|EAW21908.1| histone-lysine N-methyltransferase (Ash1), putative [Neosartorya
           fischeri NRRL 181]
          Length = 839

 Score =  103 bits (256), Expect = 8e-20,   Method: Composition-based stats.
 Identities = 56/177 (31%), Positives = 93/177 (52%), Gaps = 17/177 (9%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK---------MK 117
           C CT   G    CD +C    +   C  G C  G  C N+ F     +          ++
Sbjct: 418 CMCTEETG----CDENCQNRYMFYECDDGNCGLGPDCGNRSFDELKQRTKAGGKYNIGVE 473

Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
           +++T   G G+ ++   +  + ++EY GE+I    CE+R+  +    E  +YL   +++M
Sbjct: 474 VIKTADRGYGVRSNRTFEPNQIIVEYTGEIITQAECEKRMRTIYKNNEC-YYLMYFDQNM 532

Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
           +IDAT +G+ +R++NHSC PN  M+KW + G+ R+ +FA    I  GE LTYDY ++
Sbjct: 533 IIDAT-RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDHGIMTGEELTYDYNFD 588


>gi|296804338|ref|XP_002843021.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
 gi|238845623|gb|EEQ35285.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
          Length = 719

 Score =  103 bits (256), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 78/241 (32%), Positives = 128/241 (53%), Gaps = 35/241 (14%)

Query: 20  LLKQIGNPVEFELP-DWFI-----KPKAIPYVFIKRNIYLTKRI---KRRLEDDGIFCSC 70
           LL + G P  F+LP D F      +PK   +  I +NI++       K  +  +   C C
Sbjct: 318 LLLEKGRP--FKLPYDIFSPLPRGQPKPDEWRKINKNIFVGDAAGIWKATMPTEQSTCLC 375

Query: 71  TASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS-CLNKPF---QNRPVKK-------MKL 118
           T   G    C  +C    +L  C  + CK G   C N+ F   +NR +K        +++
Sbjct: 376 TPEMG----CGENCQNRHMLYECDDNNCKLGEDLCRNRNFAKLRNR-IKTGGKYNIGVEV 430

Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN--FYLCEINRD 176
           ++TE  G G+ ++   +  + ++EY GE++     + R   MK + + N  FYL + +++
Sbjct: 431 IKTESRGYGVRSNRTFEPNQIIVEYTGEILTQIEAQRR---MKTIYKKNECFYLMDFDQN 487

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEFL 235
           M+IDAT +G+ +R++NHSC PN +M+KWI+ G+ RI +FA  + I  GE LTYDY +E+ 
Sbjct: 488 MIIDAT-RGSIARFVNHSCEPNCKMEKWIVAGKPRIALFAGENGIMTGEELTYDYNFEYY 546

Query: 236 H 236
            
Sbjct: 547 Q 547


>gi|395329295|gb|EJF61682.1| hypothetical protein DICSQDRAFT_85722 [Dichomitus squalens LYAD-421
            SS1]
          Length = 1095

 Score =  103 bits (256), Expect = 9e-20,   Method: Composition-based stats.
 Identities = 52/117 (44%), Positives = 69/117 (58%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A E I RG+ VIEYVGEVI  Q  ++R    +  G  + YL  I+ D+V+DAT KGN
Sbjct: 969  GLYAMEKINRGDLVIEYVGEVIRAQVADKREKAYERQGIGSSYLFRIDEDLVVDATKKGN 1028

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
              R INHSC PN   +   I GE +I I+A +DI+ G  +TYDY +    D +   C
Sbjct: 1029 LGRLINHSCDPNCTAKIITISGEKKIVIYAKQDIELGSEITYDYHFPIEQDKIPCLC 1085


>gi|302682536|ref|XP_003030949.1| hypothetical protein SCHCODRAFT_77129 [Schizophyllum commune H4-8]
 gi|300104641|gb|EFI96046.1| hypothetical protein SCHCODRAFT_77129 [Schizophyllum commune H4-8]
          Length = 171

 Score =  102 bits (255), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 54/117 (46%), Positives = 72/117 (61%)

Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
           G+ A E I RGE VIEYVGEVI  Q  ++R    +  G  + YL  I+ ++V+DAT KGN
Sbjct: 45  GLYAMERIARGEMVIEYVGEVIRAQVADKREATYERQGIGSSYLFRIDEEIVVDATKKGN 104

Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
             R INHSC PN   +   I+GE +I I+A RDI+ G+ +TYDY + F  D +   C
Sbjct: 105 LGRLINHSCDPNCTAKIITINGEKKIVIYAKRDIELGDEITYDYHFPFEQDKIPCLC 161


>gi|443894422|dbj|GAC71770.1| histone H3 (Lys4) methyltransferase complex, subunit SET1 [Pseudozyma
            antarctica T-34]
          Length = 1366

 Score =  102 bits (255), Expect = 1e-19,   Method: Composition-based stats.
 Identities = 49/120 (40%), Positives = 75/120 (62%)

Query: 114  KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
            K++K  ++     G+ A E I  G+ VIEYVGEV+  Q  +ER  + +  G  + YL  +
Sbjct: 1225 KQLKFAKSPIHDWGLYAMELIPAGDMVIEYVGEVVRQQVADEREKQYERQGNFSTYLFRV 1284

Query: 174  NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
            + D+V+DAT+KGN +R +NH C PN   +   ++GE RI +FA   I+ GE LTYDY+++
Sbjct: 1285 DDDLVVDATHKGNIARLMNHCCTPNCNAKILTLNGEKRIVLFAKSPIRPGEELTYDYKFQ 1344


>gi|336372757|gb|EGO01096.1| hypothetical protein SERLA73DRAFT_50848 [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 260

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 54/117 (46%), Positives = 71/117 (60%)

Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
           G+ A E I RGE VIEYVGEVI  Q  ++R    +  G  + YL  I+ D+V+DAT KGN
Sbjct: 134 GLYAMERISRGEMVIEYVGEVIRAQVADKREKVYERQGIGSSYLFRIDEDLVVDATKKGN 193

Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
             R INHSC PN   +   I+GE +I I+A +DI+ GE +TYDY +    D +   C
Sbjct: 194 LGRLINHSCDPNCTAKIITINGEKKIVIYAKQDIELGEEITYDYHFPIEQDKIPCLC 250


>gi|307194684|gb|EFN76943.1| Histone-lysine N-methyltransferase ash1 [Harpegnathos saltator]
          Length = 129

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 47/114 (41%), Positives = 70/114 (61%), Gaps = 1/114 (0%)

Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
           G G+     IK G F++EYVGEV+ ++  + R+   ++  +T+ Y   ++  +VID    
Sbjct: 3   GLGVRTQLAIKSGNFILEYVGEVVSEREFKSRM-ATRYASDTHHYCLHLDGGLVIDGHRM 61

Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
           G   R++NHSC PN EMQKW + G  R+ +FA+RDIK GE LTYDY +   + S
Sbjct: 62  GGDGRFVNHSCEPNCEMQKWSVHGLPRMALFASRDIKPGEELTYDYNFALFNPS 115


>gi|403223606|dbj|BAM41736.1| uncharacterized protein TOT_040000118 [Theileria orientalis strain
           Shintoku]
          Length = 944

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 65/177 (36%), Positives = 96/177 (54%), Gaps = 13/177 (7%)

Query: 63  DDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSS-GCKCGN-SCLNKPFQNRPVKKMKLVQ 120
           D    C CT S      C ++C   M    C+S  C   + +C NK FQN  + K+KLV 
Sbjct: 592 DTNTRCVCTKS------CGKECQNVMKNVECTSKNCALHDVNCGNKRFQNFLLPKLKLVY 645

Query: 121 TEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHL-----GETNFYLCEINR 175
            E  G G VA E+I+  E V EYVGEVI      + L            + ++Y+ ++++
Sbjct: 646 FEGKGIGAVATEEIRENELVCEYVGEVITQTDFHKSLASSSFAEIDDDNQCHWYVMKVHK 705

Query: 176 DMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           ++ ID+T+ GN +R+INHSC PN       + G  R+G+FA+R I KGE +TY+Y +
Sbjct: 706 EVYIDSTHLGNVARFINHSCDPNCSSIPINVRGSYRMGVFASRKILKGEEVTYNYGF 762


>gi|46130624|ref|XP_389092.1| hypothetical protein FG08916.1 [Gibberella zeae PH-1]
          Length = 786

 Score =  102 bits (254), Expect = 1e-19,   Method: Composition-based stats.
 Identities = 55/178 (30%), Positives = 95/178 (53%), Gaps = 18/178 (10%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNS-CLNKPFQNRPVKK---------M 116
           C C    G    C   C   ++L  C    C  G   C N+ F N   ++         +
Sbjct: 407 CVCKPEDG----CGESCQNRIMLYECDEQNCNAGKKYCTNRAFANLTARRNRGGKYRVGV 462

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
           ++++T   G G+ ++   +  + ++EY GE+I ++ CE R+ ++    E  +YL   +++
Sbjct: 463 EVIKTSDRGYGVRSNRCFRPNQIIMEYAGEIITEEECERRMTEVYKDNEC-YYLMSFDQN 521

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
           M+IDAT  G+ +R++NHSC PN  M KWI+ G+ R+ +FA  + I  G+ LTYDY ++
Sbjct: 522 MIIDAT-TGSIARFVNHSCNPNCRMIKWIVSGQPRMALFAGDKPIMTGDELTYDYNFD 578


>gi|408391575|gb|EKJ70949.1| hypothetical protein FPSE_08917 [Fusarium pseudograminearum CS3096]
          Length = 786

 Score =  102 bits (254), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 55/178 (30%), Positives = 95/178 (53%), Gaps = 18/178 (10%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNS-CLNKPFQNRPVKK---------M 116
           C C    G    C   C   ++L  C    C  G   C N+ F N   ++         +
Sbjct: 407 CVCKPEDG----CAESCQNRIMLYECDEQNCNAGKKYCTNRAFANLTARRNRGGKYRVGV 462

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
           ++++T   G G+ ++   +  + ++EY GE+I ++ CE R+ ++    E  +YL   +++
Sbjct: 463 EVIKTSDRGYGVRSNRCFRPNQIIMEYAGEIITEEECERRMTEVYKDNEC-YYLMSFDQN 521

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
           M+IDAT  G+ +R++NHSC PN  M KWI+ G+ R+ +FA  + I  G+ LTYDY ++
Sbjct: 522 MIIDAT-TGSIARFVNHSCNPNCRMIKWIVSGQPRMALFAGDKPIMTGDELTYDYNFD 578


>gi|169595256|ref|XP_001791052.1| hypothetical protein SNOG_00365 [Phaeosphaeria nodorum SN15]
 gi|160701054|gb|EAT91860.2| hypothetical protein SNOG_00365 [Phaeosphaeria nodorum SN15]
          Length = 950

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 70/199 (35%), Positives = 99/199 (49%), Gaps = 21/199 (10%)

Query: 46  FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNS-C 103
           FI     L KR K+    D   C C A  G    C   CH  ++   C ++ C  G   C
Sbjct: 762 FIGEASALWKRDKQ----DASQCYCDAEDG----CGEACHNRIMAYECDNTNCPLGPELC 813

Query: 104 LNKPFQ--------NRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEE 155
            N+PF         NR    +++  T   G G+ A    +  + ++EY GE+I    CE 
Sbjct: 814 GNRPFAELKRRAKGNRYDYGVEVTDTPDRGYGVRAMRMFEPHQIIVEYAGEIITQSECER 873

Query: 156 RLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIF 215
           R+ K  +  +  +YL   +  M+IDAT +G  +R++NHSC PN EM KW + GE R+ +F
Sbjct: 874 RM-KQVYKKDKCYYLMSFDNKMIIDAT-RGTIARFVNHSCEPNCEMIKWTVGGEPRMALF 931

Query: 216 A-TRDIKKGENLTYDYQYE 233
           A  R I  GE LTYDY +E
Sbjct: 932 AGPRGIMTGEELTYDYNFE 950


>gi|453080036|gb|EMF08088.1| hypothetical protein SEPMUDRAFT_152375 [Mycosphaerella populorum
           SO2202]
          Length = 881

 Score =  102 bits (253), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 60/199 (30%), Positives = 106/199 (53%), Gaps = 19/199 (9%)

Query: 58  KRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCG-NSCLNKPFQNRPVKK 115
           K + +D   +C+C    G    CD  C   ++   C  S C+   ++C N+PF     ++
Sbjct: 369 KNKNQDYVSYCTCLPETG----CDESCLNAVMGYECDESNCRLEPDNCSNRPFSELERRR 424

Query: 116 MK---------LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGET 166
            K         +V+T+  G G+ A    + G+ ++EY GE+I +  C+ER+    +L  T
Sbjct: 425 TKGGRYDIGVEVVKTKNRGHGVRAARPFQPGQLIMEYTGEIITEDECQERM-ATTYLNAT 483

Query: 167 NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGEN 225
           N+++ E+   +++D   KG+++R+INHSC PN E++   +   +R+G++A    I  G+ 
Sbjct: 484 NYFVMEMENGLILDGN-KGSEARFINHSCDPNCEVKMTRVGQVSRLGVYAGPAGIMTGQE 542

Query: 226 LTYDYQYEFLHDSLIA-YC 243
           LTYDY ++   D   A YC
Sbjct: 543 LTYDYNFQNFSDHRQACYC 561


>gi|409047697|gb|EKM57176.1| hypothetical protein PHACADRAFT_142398 [Phanerochaete carnosa
            HHB-10118-sp]
          Length = 1389

 Score =  102 bits (253), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 51/117 (43%), Positives = 69/117 (58%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A E I RGE VIEYVGE+I  Q  ++R    +  G  + YL  I+ D+V+DAT KGN
Sbjct: 1263 GLYAMEKISRGEMVIEYVGEIIRAQVADKREKAYERQGIGSSYLFRIDEDLVVDATKKGN 1322

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
              R INHSC PN   +   I+ E +I I+A +DI+ G  +TYDY +    D +   C
Sbjct: 1323 LGRLINHSCDPNCTAKIITINSEKKIVIYAKQDIELGSEITYDYHFPIEQDKIPCLC 1379


>gi|367021898|ref|XP_003660234.1| hypothetical protein MYCTH_2298281 [Myceliophthora thermophila ATCC
           42464]
 gi|347007501|gb|AEO54989.1| hypothetical protein MYCTH_2298281 [Myceliophthora thermophila ATCC
           42464]
          Length = 942

 Score =  102 bits (253), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 64/213 (30%), Positives = 106/213 (49%), Gaps = 23/213 (10%)

Query: 38  KPKAIPYVFIKRNIYLTK-----RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSS 92
           +PK   Y  + +N ++       +    +ED    C C    G    C  DC   ++L  
Sbjct: 422 QPKPAAYRTMTKNRFVGDAASYWKKTPHIEDFASKCVCKPEDG----CGEDCQNRIMLYE 477

Query: 93  C-SSGCKCGNS-CLNKPFQNRPVKKMK---------LVQTEKCGAGIVADEDIKRGEFVI 141
           C  + C  G   C N+ FQ+   +  K         + +T+  G G+ ++   +  + ++
Sbjct: 478 CDETNCGIGKEYCQNRAFQDLQERTKKGGRYRIGVEVFKTKDRGYGVRSNRCFEPNQIIM 537

Query: 142 EYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
           EY GE+I    CE R+   ++     +YL   +++M+IDAT  G+ +R++NHSC PN  M
Sbjct: 538 EYTGEIITVAECERRM-NEEYKDNECYYLMSFDQNMIIDAT-TGSIARFVNHSCSPNCRM 595

Query: 202 QKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
            KWI+ G+ R+ +FA  R I  GE LTYDY ++
Sbjct: 596 IKWIVAGQPRMALFAGDRPIMTGEELTYDYNFD 628


>gi|343429488|emb|CBQ73061.1| related to regulatory protein SET1 [Sporisorium reilianum SRZ2]
          Length = 1453

 Score =  102 bits (253), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 49/120 (40%), Positives = 75/120 (62%)

Query: 114  KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
            K++K  ++     G+ A E I  G+ VIEYVGEV+  Q  +ER  + +  G  + YL  +
Sbjct: 1312 KQLKFAKSPIHDWGLYAMEFIPAGDMVIEYVGEVVRQQVADEREKQYERQGNFSTYLFRV 1371

Query: 174  NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
            + D+V+DAT+KGN +R +NH C PN   +   ++GE RI +FA   I+ GE LTYDY+++
Sbjct: 1372 DDDLVVDATHKGNIARLMNHCCTPNCNAKILTLNGEKRIVLFAKSPIRAGEELTYDYKFQ 1431


>gi|134079644|emb|CAK97070.1| unnamed protein product [Aspergillus niger]
          Length = 885

 Score =  102 bits (253), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 57/178 (32%), Positives = 92/178 (51%), Gaps = 28/178 (15%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK---------MK 117
           C CT   G    CD +C    +   C  G C  G  C N+ F+    +          ++
Sbjct: 395 CMCTPETG----CDEECQNRYMFYECDEGNCGVGEECGNRSFEELKQRTKAGGKYNIGVE 450

Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKM-KHLGETNFYLCEINRD 176
           +++T   G G+ ++   +  + ++EY GE+I    CE+R+  + KH           N +
Sbjct: 451 VIKTADRGYGVRSNRTFEPNQIIVEYTGEIITQTECEKRMRTIYKH-----------NEN 499

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
           M+IDAT +G+ +R++NHSC PN  M+KW + G+ R+ +FA  R I  GE LTYDY ++
Sbjct: 500 MIIDAT-RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDRGIMTGEELTYDYNFD 556


>gi|302505008|ref|XP_003014725.1| histone-lysine N-methyltransferase (Ash1), putative [Arthroderma
           benhamiae CBS 112371]
 gi|291178031|gb|EFE33822.1| histone-lysine N-methyltransferase (Ash1), putative [Arthroderma
           benhamiae CBS 112371]
          Length = 688

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 76/237 (32%), Positives = 124/237 (52%), Gaps = 33/237 (13%)

Query: 20  LLKQIGNPVEFELP-DWFI-----KPKAIPYVFIKRNIYLTKRI---KRRLEDDGIFCSC 70
           LL + G P  F LP D F      +PK   +  I +NI++       K  +  +   C C
Sbjct: 320 LLLEKGRP--FRLPYDIFSPLPRGQPKPDEWRKINKNIFIGDAAGIWKATMPTEQSTCLC 377

Query: 71  TASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNS-CLNKPFQN--RPVKK-------MKLV 119
           T   G    C+ +C    +   C  + CK G   C N+ F    + +K        ++++
Sbjct: 378 TPEMG----CEENCQNRHMFYECDENNCKLGEDLCRNRNFSELRKRIKTGGKYNIGVEVI 433

Query: 120 QTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN--FYLCEINRDM 177
           +TE  G G+ ++   +  + ++EY GE++     + R   MK + + N  FYL + ++DM
Sbjct: 434 KTESRGYGVRSNRTFEPNQIIVEYTGEILTQIEAQRR---MKTIYKKNECFYLMDFDQDM 490

Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYE 233
           +IDAT +G+ +R++NHSC PN +M+KWI+ G+ RI +FA  + I  GE LTYDY +E
Sbjct: 491 IIDAT-RGSIARFVNHSCEPNCKMEKWIVAGKPRIALFAGDNGIMTGEELTYDYNFE 546


>gi|71015569|ref|XP_758824.1| hypothetical protein UM02677.1 [Ustilago maydis 521]
 gi|74702458|sp|Q4PB36.1|SET1_USTMA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|46098614|gb|EAK83847.1| hypothetical protein UM02677.1 [Ustilago maydis 521]
          Length = 1468

 Score =  101 bits (252), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 49/120 (40%), Positives = 75/120 (62%)

Query: 114  KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
            K++K  ++     G+ A E I  G+ VIEYVGEV+  Q  +ER  + +  G  + YL  +
Sbjct: 1327 KQLKFAKSPIHDWGLYAMELIPAGDMVIEYVGEVVRQQVADEREKQYERQGNFSTYLFRV 1386

Query: 174  NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
            + D+V+DAT+KGN +R +NH C PN   +   ++GE RI +FA   I+ GE LTYDY+++
Sbjct: 1387 DDDLVVDATHKGNIARLMNHCCTPNCNAKILTLNGEKRIVLFAKTAIRAGEELTYDYKFQ 1446


>gi|328714763|ref|XP_003245446.1| PREDICTED: hypothetical protein LOC100571907 [Acyrthosiphon pisum]
          Length = 2204

 Score =  101 bits (252), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 56/134 (41%), Positives = 80/134 (59%), Gaps = 13/134 (9%)

Query: 114  KKMK--------LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCE--ERLWKMKHL 163
            KKMK        L +++  G G+ A  D+++   VIEY+GE+I  Q C+  E+L++ K+ 
Sbjct: 2056 KKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRSQLCDYREKLYEAKNR 2115

Query: 164  GETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKG 223
            G    Y+  ++ D V+DAT  G  +RYINHSC PN   +K  +D E RI IFA R I +G
Sbjct: 2116 G---IYMFRLDDDRVVDATISGGLARYINHSCNPNCVTEKVEVDRELRIIIFAKRRIARG 2172

Query: 224  ENLTYDYQYEFLHD 237
            E L YDYQ++   D
Sbjct: 2173 EELAYDYQFDIEDD 2186


>gi|402084863|gb|EJT79881.1| hypothetical protein GGTG_04964 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 1014

 Score =  101 bits (252), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 60/200 (30%), Positives = 105/200 (52%), Gaps = 28/200 (14%)

Query: 46  FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS-C 103
           + K+N Y  + +          C CT + G    C   C    +L  C+ G C  G   C
Sbjct: 460 YWKKNTYYDRSV----------CVCTPADG----CGEHCLNRSVLYECNEGNCNVGRELC 505

Query: 104 LNKPFQNRPVKK---------MKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCE 154
            N+ FQ+   +          +++  T   G G+ A+   + G+ ++EY GE+I ++ C+
Sbjct: 506 KNRAFQDLQDRTKTGGSYRVGVEVYHTGDRGFGVRANRCFEPGQIIMEYAGEIITEEECD 565

Query: 155 ERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGI 214
            R+ ++ +  +  +YL   +++M++DAT  G+ +R++NHSC PN  M KWI+ G  R+ +
Sbjct: 566 RRMNEV-YKDKQCYYLMSFDQNMILDAT-TGSIARFVNHSCSPNCRMIKWIVSGVPRMAL 623

Query: 215 FA-TRDIKKGENLTYDYQYE 233
           FA  R I+ G+ LTYDY ++
Sbjct: 624 FAGDRQIQTGDELTYDYNFD 643


>gi|358385026|gb|EHK22623.1| hypothetical protein TRIVIDRAFT_212935 [Trichoderma virens Gv29-8]
          Length = 738

 Score =  101 bits (252), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 56/178 (31%), Positives = 96/178 (53%), Gaps = 18/178 (10%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQNRPVKK---------M 116
           C C    G    C   C   ++L  C  + C  G + C N+ F     ++         +
Sbjct: 396 CVCKPEDG----CAETCQNRIMLYECDDTNCNIGRAHCTNRAFAELTARRNRGGKYRVGV 451

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
           ++++T   G G+ ++   +  + ++EY GE+I ++ CE R+ ++    E  +YL   +++
Sbjct: 452 EVIKTSDRGYGVRSNRCFEPHQIIMEYAGEIITEEECERRMNEIYKNNEC-YYLMSFDQN 510

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
           M+IDAT  G+ +R++NHSC PN  M KWI+ G+ R+ +FA  R I  GE LTYDY ++
Sbjct: 511 MIIDAT-TGSIARFVNHSCNPNCRMIKWIVSGQPRMALFAGDRPIMTGEELTYDYNFD 567


>gi|315047188|ref|XP_003172969.1| SUV39H [Arthroderma gypseum CBS 118893]
 gi|311343355|gb|EFR02558.1| SUV39H [Arthroderma gypseum CBS 118893]
          Length = 703

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 76/247 (30%), Positives = 128/247 (51%), Gaps = 33/247 (13%)

Query: 20  LLKQIGNPVEFELP-DWFI-----KPKAIPYVFIKRNIYLTKRI---KRRLEDDGIFCSC 70
           LL + G P  F LP D F      +PK   +  I +NI++       +  +  +   C C
Sbjct: 318 LLLERGRP--FRLPYDIFSPLPRGQPKPDEWRKINKNIFIGDAAGIWRATMPTEQSTCLC 375

Query: 71  TASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNS-CLNKPF-----QNRPVKK----MKLV 119
           T   G    C+ +C    +   C  + CK G   C N+ F     +N+   K    ++++
Sbjct: 376 TPEMG----CEENCQNRHMFYECDENNCKLGEDICRNRNFSELRRRNKTGGKYNIGVEVI 431

Query: 120 QTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN--FYLCEINRDM 177
           +TE  G G+ ++   +  + ++EY GE++     + R   MK + + N  FYL + ++DM
Sbjct: 432 KTESRGYGVRSNRTFEPNQIIVEYTGEILTQIEAQRR---MKTIYKKNECFYLMDFDQDM 488

Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEFLH 236
           +IDAT +G+ +R++NHSC PN +M+KWI+ G+ RI +FA  + I  GE LTYDY ++   
Sbjct: 489 IIDAT-RGSIARFVNHSCEPNCKMEKWIVAGKPRIALFAGDNGIMTGEELTYDYNFDPYS 547

Query: 237 DSLIAYC 243
           +  +  C
Sbjct: 548 NKNVQEC 554


>gi|84997445|ref|XP_953444.1| hypothetical protein [Theileria annulata strain Ankara]
 gi|65304440|emb|CAI76819.1| hypothetical protein, conserved [Theileria annulata]
          Length = 1083

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 63/160 (39%), Positives = 89/160 (55%), Gaps = 7/160 (4%)

Query: 80  CDRDCHCGMLLSSCS-SGCKCGN-SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRG 137
           C  DC   M  + C+   C   + +C N+ F N    K+KL   +  G G VA EDI  G
Sbjct: 732 CGSDCSNVMKNTECTVKNCNLMDENCGNRRFLNFTGPKLKLNYVDGKGVGTVATEDINEG 791

Query: 138 EFVIEYVGEVI---DDQTC--EERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYIN 192
           E V EYVGEVI   D Q C       ++    ++++Y+ +I RD  ID+T+ GN +R+IN
Sbjct: 792 ELVCEYVGEVISQADFQRCLASASFAEIDDGNQSHWYVMKIQRDTYIDSTHLGNVARFIN 851

Query: 193 HSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           HSC PN       + G  R+G+FA R IK+GE +TY+Y +
Sbjct: 852 HSCDPNCASVPINVRGTYRMGVFAQRKIKQGEEVTYNYGF 891


>gi|302664384|ref|XP_003023822.1| histone-lysine N-methyltransferase (Ash1), putative [Trichophyton
           verrucosum HKI 0517]
 gi|291187840|gb|EFE43204.1| histone-lysine N-methyltransferase (Ash1), putative [Trichophyton
           verrucosum HKI 0517]
          Length = 709

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 76/247 (30%), Positives = 127/247 (51%), Gaps = 33/247 (13%)

Query: 20  LLKQIGNPVEFELP-DWFI-----KPKAIPYVFIKRNIYLTKRI---KRRLEDDGIFCSC 70
           LL + G P  F LP D F      +PK   +  I +NI++       K  +  +   C C
Sbjct: 316 LLLEKGRP--FRLPYDIFSPLPRGQPKPDEWRKINKNIFIGDAAGIWKATMPTEQSTCLC 373

Query: 71  TASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNS-CLNKPFQN--RPVKK-------MKLV 119
           T   G    C+ +C    +   C  + CK G   C N+ F    + +K        ++++
Sbjct: 374 TPEMG----CEENCQNRHMFYECDENNCKLGEDLCRNRNFSELRKRIKTGGKYNIGVEVI 429

Query: 120 QTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN--FYLCEINRDM 177
           +TE  G G+ ++   +  + ++EY GE++     + R   MK + + N  FYL + ++DM
Sbjct: 430 KTESRGYGVRSNRTFEPNQIIVEYTGEILTQIEAQRR---MKTIYKKNECFYLMDFDQDM 486

Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEFLH 236
           +IDAT +G+ +R++NHSC PN +M+KWI+ G+ RI +FA  + I  GE LTYDY ++   
Sbjct: 487 IIDAT-RGSIARFVNHSCEPNCKMEKWIVAGKPRIALFAGDNGIMTGEELTYDYNFDPYS 545

Query: 237 DSLIAYC 243
           +  +  C
Sbjct: 546 NKNVQEC 552


>gi|224084157|ref|XP_002307227.1| SET domain protein [Populus trichocarpa]
 gi|222856676|gb|EEE94223.1| SET domain protein [Populus trichocarpa]
          Length = 196

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 57/164 (34%), Positives = 86/164 (52%), Gaps = 15/164 (9%)

Query: 87  GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
           G L+  C+S C C  +C N+  QN    K+++ +T+  G  + A E I RG F+ EY GE
Sbjct: 15  GYLVYECNSMCNCNKTCPNRVLQNGIRVKLEVFKTDNKGWAVRAGEPILRGTFICEYTGE 74

Query: 147 VIDDQTCEER-----------LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSC 195
           ++++Q    R           ++K+         + E      IDAT  GN SR+INHSC
Sbjct: 75  ILNEQEASNRRDRYGKEVCSYMYKIDAHTNDMSRMVEGQAHYFIDATKYGNVSRFINHSC 134

Query: 196 CPNTEMQKWIIDG----ETRIGIFATRDIKKGENLTYDYQYEFL 235
            PN    + ++D        IG++A++DI  GE LTY+Y+YE L
Sbjct: 135 MPNLVNHQVLVDSMDSQRAHIGLYASQDIAFGEELTYNYRYELL 178


>gi|302809376|ref|XP_002986381.1| hypothetical protein SELMODRAFT_123930 [Selaginella moellendorffii]
 gi|300145917|gb|EFJ12590.1| hypothetical protein SELMODRAFT_123930 [Selaginella moellendorffii]
          Length = 206

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 57/170 (33%), Positives = 89/170 (52%), Gaps = 21/170 (12%)

Query: 87  GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
           G L+  C++ C C   C N+  Q     K+++ +T   G  + A ++I RG FV EY+GE
Sbjct: 18  GYLVYECNTSCMCSEECPNRVLQRGVKVKLEVFKTRHKGWAVRAAQNISRGTFVCEYLGE 77

Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEIN-----------------RDMVIDATYKGNKSR 189
           V++DQ    R  +   +G +  Y  +++                 +  VIDAT  GN +R
Sbjct: 78  VLNDQEANRRGERYDQVGCSYLYDIDVHLNTGGRSRRGPSRVPRIKPFVIDATKHGNVAR 137

Query: 190 YINHSCCPNTEMQKWIIDGE----TRIGIFATRDIKKGENLTYDYQYEFL 235
           +INHSC PN    + +++        IG+FA+RDI  GE L+YDY+Y+ L
Sbjct: 138 FINHSCSPNLVNYQVLVESMDYQLAHIGLFASRDILCGEELSYDYRYKLL 187


>gi|25141373|ref|NP_491340.2| Protein MET-1, isoform a [Caenorhabditis elegans]
 gi|351050564|emb|CCD65166.1| Protein MET-1, isoform a [Caenorhabditis elegans]
          Length = 1604

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 66/197 (33%), Positives = 103/197 (52%), Gaps = 16/197 (8%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSC 103
           +  I  + YLT+   ++ + + + C C  + G     D  C    +L+ C S C+    C
Sbjct: 618 FELISESKYLTRNANKK-KTESLTCECHRTGG--NCSDNTCVNRAMLTECPSSCQV--KC 672

Query: 104 LNKPFQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
            N+ F  +    ++   T    G G+ A +DIK+G F+IEY+GEV++    E+R  K K+
Sbjct: 673 KNQRFAKKKYAAVEAFHTGTAKGCGLRAVKDIKKGRFIIEYIGEVVERDDYEKR--KTKY 730

Query: 163 LGET---NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGE----TRIGIF 215
             +    + YLC+      IDAT  GN SR++NHSC PN   +KW +        R+G F
Sbjct: 731 AADKKHKHHYLCDTGV-YTIDATVYGNPSRFVNHSCDPNAICEKWSVPRTPGDVNRVGFF 789

Query: 216 ATRDIKKGENLTYDYQY 232
           + R IK GE +T+DYQ+
Sbjct: 790 SKRFIKAGEEITFDYQF 806


>gi|327306067|ref|XP_003237725.1| hypothetical protein TERG_02440 [Trichophyton rubrum CBS 118892]
 gi|326460723|gb|EGD86176.1| hypothetical protein TERG_02440 [Trichophyton rubrum CBS 118892]
          Length = 711

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 76/247 (30%), Positives = 127/247 (51%), Gaps = 33/247 (13%)

Query: 20  LLKQIGNPVEFELP-DWFI-----KPKAIPYVFIKRNIYLTKRI---KRRLEDDGIFCSC 70
           LL + G P  F LP D F      +PK   +  I +NI++       K  +  +   C C
Sbjct: 319 LLLEKGRP--FRLPYDIFSPLPRGQPKPDEWRKINKNIFIGDAAGIWKATMPTEQSTCLC 376

Query: 71  TASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNS-CLNKPFQN--RPVKK-------MKLV 119
           T   G    C+ +C    +   C  + CK G   C N+ F    + +K        ++++
Sbjct: 377 TPEMG----CEENCQNRHMFYECDENNCKLGEDLCRNRNFSELRKRIKTGGKYNIGVEVI 432

Query: 120 QTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN--FYLCEINRDM 177
           +TE  G G+ ++   +  + ++EY GE++     + R   MK + + N  FYL + ++DM
Sbjct: 433 KTESRGYGVRSNRTFEPNQIIVEYTGEILTQIEAQRR---MKTIYKKNECFYLMDFDQDM 489

Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEFLH 236
           +IDAT +G+ +R++NHSC PN +M+KWI+ G+ RI +FA  + I  GE LTYDY ++   
Sbjct: 490 IIDAT-RGSIARFVNHSCEPNCKMEKWIVAGKPRIALFAGDNGIMTGEELTYDYNFDPYS 548

Query: 237 DSLIAYC 243
           +  +  C
Sbjct: 549 NKNVQEC 555


>gi|260833262|ref|XP_002611576.1| hypothetical protein BRAFLDRAFT_117164 [Branchiostoma floridae]
 gi|229296947|gb|EEN67586.1| hypothetical protein BRAFLDRAFT_117164 [Branchiostoma floridae]
          Length = 734

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 55/147 (37%), Positives = 80/147 (54%), Gaps = 3/147 (2%)

Query: 88  MLLSSCSSG-CKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADEDIKRGEFVIEYVG 145
           M    CS   C   + C N+  Q       ++ + T+  G G+ +   I +G F++EYVG
Sbjct: 57  MSFMECSPNTCPYADQCANQRIQRHEWDPGLERIVTKDRGYGVRSKTPIPQGNFILEYVG 116

Query: 146 EVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWI 205
           EV+ +Q    R  ++ H    + Y   ++   VID    G + R++NHSC PN EMQKW 
Sbjct: 117 EVVSEQEFRRRTVEIYH-DHNHHYCLNLHSGAVIDGYKYGCEGRFVNHSCEPNCEMQKWS 175

Query: 206 IDGETRIGIFATRDIKKGENLTYDYQY 232
           ++G  RIG+FA RDI  GE LTYDY +
Sbjct: 176 VNGVYRIGLFALRDIPAGEELTYDYNF 202


>gi|407921620|gb|EKG14761.1| hypothetical protein MPH_08036 [Macrophomina phaseolina MS6]
          Length = 1167

 Score =  101 bits (251), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 54/127 (42%), Positives = 75/127 (59%), Gaps = 3/127 (2%)

Query: 110  NRPVKKMKLVQTEKCGA---GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGET 166
            N+  K+ KLV+ ++      G+ A EDI   + +IEYVGE +  +  + R  K    G  
Sbjct: 1018 NQLKKRKKLVKFDRSAIHNWGLYAQEDISANDMIIEYVGEKVRQKVADIREIKYDKQGVG 1077

Query: 167  NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENL 226
            + YL  I+ D V+DAT KG  +R+INHSC PN   +   +DG  RI I+A RDIK  E L
Sbjct: 1078 SSYLFRIDEDSVVDATKKGGIARFINHSCSPNCTAKIIRVDGTKRIVIYALRDIKTNEEL 1137

Query: 227  TYDYQYE 233
            TYDY++E
Sbjct: 1138 TYDYKFE 1144


>gi|193202392|ref|NP_871842.2| Protein MET-1, isoform b [Caenorhabditis elegans]
 gi|25395254|pir||B87754 protein C43E11.3 [imported] - Caenorhabditis elegans
 gi|351050565|emb|CCD65167.1| Protein MET-1, isoform b [Caenorhabditis elegans]
          Length = 1590

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 66/197 (33%), Positives = 103/197 (52%), Gaps = 16/197 (8%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSC 103
           +  I  + YLT+   ++ + + + C C  + G     D  C    +L+ C S C+    C
Sbjct: 604 FELISESKYLTRNANKK-KTESLTCECHRTGG--NCSDNTCVNRAMLTECPSSCQV--KC 658

Query: 104 LNKPFQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
            N+ F  +    ++   T    G G+ A +DIK+G F+IEY+GEV++    E+R  K K+
Sbjct: 659 KNQRFAKKKYAAVEAFHTGTAKGCGLRAVKDIKKGRFIIEYIGEVVERDDYEKR--KTKY 716

Query: 163 LGET---NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGE----TRIGIF 215
             +    + YLC+      IDAT  GN SR++NHSC PN   +KW +        R+G F
Sbjct: 717 AADKKHKHHYLCDTGV-YTIDATVYGNPSRFVNHSCDPNAICEKWSVPRTPGDVNRVGFF 775

Query: 216 ATRDIKKGENLTYDYQY 232
           + R IK GE +T+DYQ+
Sbjct: 776 SKRFIKAGEEITFDYQF 792


>gi|346320473|gb|EGX90073.1| histone-lysine N-methyltransferase (Ash1), putative [Cordyceps
           militaris CM01]
          Length = 962

 Score =  101 bits (251), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 55/178 (30%), Positives = 95/178 (53%), Gaps = 18/178 (10%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCSS-GCKCGN-SCLNKPFQNRPVKKMK-------- 117
           C CT   G    C  +C   ++L  C+   C  G  +C N+ F     ++ K        
Sbjct: 546 CVCTPEDG----CGDNCQNRIMLYECNDINCNAGKATCTNRAFATLTARRAKGGKYRVGV 601

Query: 118 -LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            +++T   G G+ ++   +  + ++EY GE+I +  C+ R+ K ++     +YL   +++
Sbjct: 602 EVIKTSDRGYGVRSNRCFRPHQIIMEYAGEIITEDECDRRM-KNEYKNNECYYLMSFDQN 660

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
           M+IDAT  G+ +R++NHSC PN  M KWI+ G+ R+ +FA    I  G+ LTYDY ++
Sbjct: 661 MIIDAT-TGSIARFVNHSCNPNCRMIKWIVSGQPRMALFAGDSPIMTGDELTYDYNFD 717


>gi|302814007|ref|XP_002988688.1| hypothetical protein SELMODRAFT_128419 [Selaginella moellendorffii]
 gi|300143509|gb|EFJ10199.1| hypothetical protein SELMODRAFT_128419 [Selaginella moellendorffii]
          Length = 206

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 57/170 (33%), Positives = 89/170 (52%), Gaps = 21/170 (12%)

Query: 87  GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
           G L+  C++ C C   C N+  Q     K+++ +T   G  + A ++I RG FV EY+GE
Sbjct: 18  GYLVYECNTSCMCSEECPNRVLQRGVNVKLEVFKTRHKGWAVRAAQNISRGTFVCEYLGE 77

Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEIN-----------------RDMVIDATYKGNKSR 189
           V++DQ    R  +   +G +  Y  +++                 +  VIDAT  GN +R
Sbjct: 78  VLNDQEANRRGERYDQVGCSYLYDIDVHLNTGGRSRRGPSRVPRIKPFVIDATKHGNVAR 137

Query: 190 YINHSCCPNTEMQKWIIDGE----TRIGIFATRDIKKGENLTYDYQYEFL 235
           +INHSC PN    + +++        IG+FA+RDI  GE L+YDY+Y+ L
Sbjct: 138 FINHSCSPNLVNYQVLVESMDYQLAHIGLFASRDILCGEELSYDYRYKLL 187


>gi|219111565|ref|XP_002177534.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217412069|gb|EEC51997.1| predicted protein, partial [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 144

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 51/119 (42%), Positives = 75/119 (63%), Gaps = 2/119 (1%)

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL--WKMKHLGETNFYLCEIN 174
           K ++ +  G G+V  + I +G+ V+EYVG VID +  E+RL  W+  H  + NFY+  + 
Sbjct: 6   KPLREQGKGWGLVPCDKIGKGDLVLEYVGNVIDAKEKEDRLSEWERDHPNDPNFYIMSLR 65

Query: 175 RDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
               IDA +K N SR+INHSC PN  + +  ++G  R GIFA RDI+ GE L+YDY ++
Sbjct: 66  DQWYIDARHKANLSRFINHSCAPNCFLTQINVNGYARNGIFAKRDIQAGEFLSYDYHFD 124


>gi|326476825|gb|EGE00835.1| histone-lysine N-methyltransferase [Trichophyton tonsurans CBS
           112818]
          Length = 628

 Score =  100 bits (250), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 76/247 (30%), Positives = 127/247 (51%), Gaps = 33/247 (13%)

Query: 20  LLKQIGNPVEFELP-DWFI-----KPKAIPYVFIKRNIYLTKRI---KRRLEDDGIFCSC 70
           LL + G P  F LP D F      +PK   +  I +NI++       K  +  +   C C
Sbjct: 235 LLLEKGRP--FRLPYDIFSPLPRGQPKPDEWRKINKNIFIGDAAGIWKATMPTEQSTCLC 292

Query: 71  TASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNS-CLNKPFQ--NRPVKK-------MKLV 119
           T   G    C+ +C    +   C  + CK G   C N+ F    + +K        ++++
Sbjct: 293 TPEMG----CEENCQNRHMFYECDENNCKLGEDLCRNRNFSELRKRIKTGGKYNIGVEVI 348

Query: 120 QTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN--FYLCEINRDM 177
           +TE  G G+ ++   +  + ++EY GE++     + R   MK + + N  FYL + ++DM
Sbjct: 349 KTESRGYGVRSNRTFEPNQIIVEYTGEILTQIEAQRR---MKTIYKKNECFYLMDFDQDM 405

Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEFLH 236
           +IDAT +G+ +R++NHSC PN +M+KWI+ G+ RI +FA  + I  GE LTYDY ++   
Sbjct: 406 IIDAT-RGSIARFVNHSCEPNCKMEKWIVAGKPRIALFAGDNGIMTGEELTYDYNFDPYS 464

Query: 237 DSLIAYC 243
           +  +  C
Sbjct: 465 NKNVQEC 471


>gi|356561269|ref|XP_003548905.1| PREDICTED: histone-lysine N-methyltransferase SUVR5-like [Glycine
            max]
          Length = 1496

 Score =  100 bits (250), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 61/164 (37%), Positives = 86/164 (52%), Gaps = 15/164 (9%)

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
            G L+  C+  CKC  +C N+  QN    K+++ +TEK G  + A E I RG FV EY+GE
Sbjct: 1314 GYLVYECNQMCKCYKTCPNRILQNGLRVKLEVFKTEKKGWALRAGEAILRGTFVCEYIGE 1373

Query: 147  VIDDQTCEERLWKMKHLGETNFY-----------LCEINRDMVIDATYKGNKSRYINHSC 195
            V+D +  + R  +      + FY           L E     VID T  GN SR+IN+SC
Sbjct: 1374 VLDTREAQNRRKRYGKEHCSYFYDVDDHVNDMSRLIEGQAHYVIDTTRFGNVSRFINNSC 1433

Query: 196  CPNTEMQKWIIDG----ETRIGIFATRDIKKGENLTYDYQYEFL 235
             PN    + +++        IG++A RDI  GE LTY+Y YE +
Sbjct: 1434 SPNLVSYQVLVESMDCERAHIGLYANRDIALGEELTYNYHYELV 1477


>gi|302921594|ref|XP_003053314.1| hypothetical protein NECHADRAFT_123121 [Nectria haematococca mpVI
           77-13-4]
 gi|256734254|gb|EEU47601.1| hypothetical protein NECHADRAFT_123121 [Nectria haematococca mpVI
           77-13-4]
          Length = 776

 Score =  100 bits (250), Expect = 4e-19,   Method: Composition-based stats.
 Identities = 55/178 (30%), Positives = 94/178 (52%), Gaps = 18/178 (10%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNS-CLNKPFQNRPVKK---------M 116
           C C    G    C   C   ++L  C    C  G + C N+ F     ++         +
Sbjct: 405 CVCKPEDG----CGESCQNRIMLYECDEQNCNVGKAHCTNRAFATLTARRNQGGKYRVGV 460

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
           ++++T   G G+ ++   +  + ++EY GE+I +  CE R+ ++    E  +YL   +++
Sbjct: 461 EVIKTSDRGYGVRSNRCFRPNQIIMEYAGEIITEAECERRMTEVYKDNEC-YYLMSFDQN 519

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
           M+IDAT  G+ +R++NHSC PN  M KWI+ G+ R+ +FA  + I  GE LTYDY ++
Sbjct: 520 MIIDAT-TGSIARFVNHSCNPNCRMIKWIVSGQPRMALFAGDKPITTGEELTYDYNFD 576


>gi|443896289|dbj|GAC73633.1| putative transcription factor ASH1/LIN-59 [Pseudozyma antarctica
           T-34]
          Length = 1319

 Score =  100 bits (250), Expect = 5e-19,   Method: Composition-based stats.
 Identities = 71/205 (34%), Positives = 105/205 (51%), Gaps = 18/205 (8%)

Query: 38  KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SG 96
           K K   Y  I +N Y ++    +L+ +   C+C   PGS   C  DC   ML   C    
Sbjct: 600 KRKPPRYQQISKNKYYSRP---KLQGEVPLCNC--KPGSG--CGSDCINRMLQFICDPRT 652

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  GNSC N     R   K  +    + G G+   E IK+ +F+ EY GEVI+     +R
Sbjct: 653 CPNGNSCTNISLGRRTGIKTAVAYYGRRGFGLKTLEAIKQHDFIDEYRGEVINLSEAAKR 712

Query: 157 LWKMKHLGETNFYLCEINRDM--VIDATYKGNKSRYINHSCCPNTEMQKWIIDG------ 208
           + + ++    N+YL + +     ++D   KGN +R+ NHSC PN  ++K+II G      
Sbjct: 713 VTE-EYKATGNYYLLDYDSAAGELLDGGRKGNITRFANHSCDPNCRIEKFIICGTDEALS 771

Query: 209 -ETRIGIFATRDIKKGENLTYDYQY 232
            E +IG+FA RDI+ GE LTY+Y +
Sbjct: 772 AEFQIGLFANRDIEAGEELTYNYGW 796


>gi|261193681|ref|XP_002623246.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
           SLH14081]
 gi|239588851|gb|EEQ71494.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
           SLH14081]
          Length = 776

 Score =  100 bits (249), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 61/188 (32%), Positives = 98/188 (52%), Gaps = 24/188 (12%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQN-RPVKKM-------- 116
           C CT   G    CD +C    +   C  + CK G   C N+ F+  R   K+        
Sbjct: 383 CMCTPESG----CDENCQNRYMFYECDDNNCKLGAELCGNRNFEGLRQRSKLGGKYNIGV 438

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
           ++++T   G G+ ++      + ++EY GE++  + CE R+        T +YL   +R+
Sbjct: 439 EVIKTADRGYGVRSNRTFAPNQIIVEYTGEIVTQEECERRM-------RTCYYLMYFDRN 491

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYEFL 235
           M+IDAT +G+ +R++NHSC PN +M+KW + G+ R+ +FA    I  GE LTYDY +E+ 
Sbjct: 492 MIIDAT-RGSIARFVNHSCEPNCKMEKWTVAGKPRMALFAGEHGIMTGEELTYDYNFEYS 550

Query: 236 HDSLIAYC 243
              L   C
Sbjct: 551 PLHLAPQC 558


>gi|388853505|emb|CCF52904.1| related to regulatory protein SET1 [Ustilago hordei]
          Length = 1489

 Score =  100 bits (249), Expect = 5e-19,   Method: Composition-based stats.
 Identities = 48/120 (40%), Positives = 74/120 (61%)

Query: 114  KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
            K++K  ++     G+ A E I  G+ VIEYVGE++  Q  + R  + +  G  + YL  +
Sbjct: 1348 KQLKFAKSPIHDWGLYAMEYIPAGDMVIEYVGEMVRQQVADNREKQYERQGNFSTYLFRV 1407

Query: 174  NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
            + D+V+DAT+KGN +R +NH C PN   +   ++GE RI +FA   IK GE LTYDY+++
Sbjct: 1408 DDDLVVDATHKGNIARLMNHCCTPNCNAKILTVNGEKRIVLFAKSPIKAGEELTYDYKFQ 1467


>gi|225555248|gb|EEH03540.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
          Length = 803

 Score =  100 bits (249), Expect = 6e-19,   Method: Composition-based stats.
 Identities = 65/201 (32%), Positives = 104/201 (51%), Gaps = 21/201 (10%)

Query: 45  VFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS- 102
           VF+     + K IK R       C CT   G    CD +C    +   C  + CK G   
Sbjct: 355 VFVGDAACIWKAIKLRERST---CMCTPELG----CDENCQNRYMFYECDDNNCKLGAEL 407

Query: 103 CLNKPFQN-RPVKKM--------KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTC 153
           C N+ F+  R   KM        ++++T   G G+ ++      + ++EY GE+I  + C
Sbjct: 408 CGNRSFEGLRQRIKMGGRYNIGVEVIKTADRGYGVRSNRTFAPNQIIVEYTGEIITQEEC 467

Query: 154 EERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIG 213
           E R+  +    E  +YL   +++M+IDAT +G+ +R++NHSC PN  M+KW + G+ R+ 
Sbjct: 468 ERRMRTVYKDNEC-YYLMYFDQNMIIDAT-RGSIARFVNHSCEPNCRMEKWTVAGKPRMA 525

Query: 214 IFATRD-IKKGENLTYDYQYE 233
           +FA  + I  GE LTYDY ++
Sbjct: 526 LFAGENGIMTGEELTYDYNFD 546


>gi|390605099|gb|EIN14490.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
           SS5]
          Length = 164

 Score =  100 bits (249), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 53/117 (45%), Positives = 71/117 (60%)

Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
           G+ A E I RGE VIEYVGEVI  Q  ++R    +  G  + YL  I+ D+V+DAT KGN
Sbjct: 38  GLYAMERIARGEMVIEYVGEVIRAQVADKREKAYERQGIGSSYLFRIDEDLVVDATKKGN 97

Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
             R INHSC PN   +   I+GE +I I+A +DI+ G+ +TYDY +    D +   C
Sbjct: 98  LGRLINHSCDPNCTAKIITINGEKKIVIYAKQDIELGDEITYDYHFPIEQDKIPCLC 154


>gi|255562775|ref|XP_002522393.1| set domain protein, putative [Ricinus communis]
 gi|223538471|gb|EEF40077.1| set domain protein, putative [Ricinus communis]
          Length = 1516

 Score =  100 bits (248), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 61/166 (36%), Positives = 91/166 (54%), Gaps = 19/166 (11%)

Query: 87   GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
            G L+  C+  C C  +C N+  QN    K+++ +T+  G  + A E I  G FV EY+GE
Sbjct: 1334 GYLVYECNQMCSCSKTCPNRVLQNGIRVKLEVYKTKNKGWAVRAGEPILSGTFVCEYIGE 1393

Query: 147  VIDDQTCEERLWKMKHLGETNFYLCEIN---RDM----------VIDATYKGNKSRYINH 193
            V+D+    +R  + ++  E+  Y+ +I+    DM          VIDAT  GN SR+INH
Sbjct: 1394 VLDEVEANQR--RGRYSEESCSYMYDIDAHTNDMSRLMEGQVKYVIDATKHGNVSRFINH 1451

Query: 194  SCCPNTEMQKWIIDG----ETRIGIFATRDIKKGENLTYDYQYEFL 235
            SC PN    + II+        IG++A+RDI  GE LTY+Y+Y  +
Sbjct: 1452 SCLPNLVNHQVIINSMDAQRAHIGLYASRDIAFGEELTYNYRYNLV 1497


>gi|322694149|gb|EFY85986.1| histone-lysine N-methyltransferase (Ash1), putative [Metarhizium
           acridum CQMa 102]
          Length = 760

 Score =  100 bits (248), Expect = 7e-19,   Method: Composition-based stats.
 Identities = 52/158 (32%), Positives = 90/158 (56%), Gaps = 14/158 (8%)

Query: 88  MLLSSCSSG-CKCGNS-CLNKPFQNRPVKKMK---------LVQTEKCGAGIVADEDIKR 136
           ++L  C +G C  G   C N+ F +   ++ K         +++T   G G+ ++   K 
Sbjct: 391 IMLYECDAGNCNIGKELCTNRSFSDLAARRSKGGKYRVGVEVIKTPDRGYGVRSNRCFKA 450

Query: 137 GEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCC 196
            + ++EY GE+I ++ CE R+   ++     +YL   +++M+IDAT  G+ +R++NHSC 
Sbjct: 451 NQIIMEYTGEIITEEECERRM-NEEYKNNECYYLMSFDQNMIIDAT-TGSIARFVNHSCN 508

Query: 197 PNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
           PN  M KWI+ G+ R+ +FA  R I  G+ LTYDY ++
Sbjct: 509 PNCRMIKWIVSGQPRMALFAGDRPIMTGDELTYDYNFD 546


>gi|168031194|ref|XP_001768106.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680544|gb|EDQ66979.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 740

 Score =  100 bits (248), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 60/177 (33%), Positives = 91/177 (51%), Gaps = 29/177 (16%)

Query: 87  GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
           G ++  C+S C+C +SC N+  Q     K+++ ++   G G+ A E I RG FV EY+GE
Sbjct: 546 GYMVYECNSSCQCKDSCRNRVLQKGVRLKLEVFKSRHKGWGVRAAEPISRGTFVCEYIGE 605

Query: 147 VIDDQTCEERLWKMKHLGETNFYLC------------------------EINRDMVIDAT 182
           V++D+   ER  +  H   T+ ++C                         +++  VIDAT
Sbjct: 606 VLNDKEANERGKREPH-EYTDTFVCSRYDQVGCSYLYNIDAHLDVIGSKSVSKPFVIDAT 664

Query: 183 YKGNKSRYINHSCCPNTEMQKWIIDGE----TRIGIFATRDIKKGENLTYDYQYEFL 235
             GN +R+INHSC PN    + +++        IG FA RDI  GE L YDY+Y+ L
Sbjct: 665 KYGNVARFINHSCEPNLINYEVLVESMDCQLAHIGFFANRDIAIGEELAYDYRYKLL 721


>gi|189202688|ref|XP_001937680.1| SET domain containing protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187984779|gb|EDU50267.1| SET domain containing protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 646

 Score = 99.8 bits (247), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 65/196 (33%), Positives = 97/196 (49%), Gaps = 17/196 (8%)

Query: 59  RRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCG-NSCLNKPFQ------- 109
           +R + D   C C    G    C   CH  ++   C ++ C      C N+PF        
Sbjct: 334 KRDKQDSSQCYCDVEDG----CGEACHNRIMAYECDNTNCPLPPEQCNNRPFAELKRRSK 389

Query: 110 -NRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNF 168
            NR    ++++ TE  G G+ A    +  + ++EY GE+I    CE R+ K  +  +  +
Sbjct: 390 GNRYDYGVEVLDTEDRGYGVRAMRTFEPHQIIVEYAGEIITQSECERRM-KQVYKKDKCY 448

Query: 169 YLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLT 227
           YL   +  M+IDAT +G  +R++NHSC PN EM KW + GE R+ +FA  R I  GE LT
Sbjct: 449 YLMSFDNKMIIDAT-RGTIARFVNHSCEPNCEMIKWTVGGEPRMALFAGPRGIMTGEELT 507

Query: 228 YDYQYEFLHDSLIAYC 243
           YDY ++      I  C
Sbjct: 508 YDYNFDPFSQKNIQEC 523


>gi|84310015|emb|CAJ18337.1| putative H3K9 histone methyltransferase [Araneus diadematus]
          Length = 467

 Score = 99.8 bits (247), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 56/152 (36%), Positives = 83/152 (54%), Gaps = 7/152 (4%)

Query: 87  GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEK-CGAGIVADEDIKRGEFVIEYVG 145
           G  +  C+  CKC NSC+N+  Q+ P  K+ + +T   CG G+   E ++RG+FV+EY+G
Sbjct: 284 GYPIYECNRRCKCDNSCINRVVQHGPKVKVAIFRTTNGCGWGLKTLELVQRGQFVLEYLG 343

Query: 146 EVIDDQTCEERLWKMKHLGETNFYLCEINRD--MVIDATYKGNKSRYINHSCCPNTEMQK 203
           E+I  +  EER     HLG T  +  +  +D    +D+   GN S +INHSC PN     
Sbjct: 344 EIITSEHAEERGEVYDHLGRTYLFDMDWEKDCKYTVDSMLFGNASHFINHSCDPNLATYT 403

Query: 204 WIIDGE----TRIGIFATRDIKKGENLTYDYQ 231
             I+ +     RI  FA + I   E LT+DY+
Sbjct: 404 VWINQQDPMLPRIAFFAKKKINPDEELTFDYK 435


>gi|339253208|ref|XP_003371827.1| histone-lysine N-methyltransferase, H3 lysine-36 specific
           [Trichinella spiralis]
 gi|316967862|gb|EFV52228.1| histone-lysine N-methyltransferase, H3 lysine-36 specific
           [Trichinella spiralis]
          Length = 187

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 48/108 (44%), Positives = 68/108 (62%)

Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
           G G+ A +DI + EF++E+VGEVI  +    R  +      T+ Y  ++   +VIDAT  
Sbjct: 76  GIGLKARQDILKDEFIVEFVGEVITSKENCRRNKQQMRWANTHQYPMQLTSGLVIDATSF 135

Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           GN +R+INHSC PN   +KW+++G+ R+ I A RDI KGE LT DY Y
Sbjct: 136 GNAARFINHSCDPNCRAEKWMVEGKHRLAIVANRDISKGEELTLDYGY 183


>gi|224094656|ref|XP_002310194.1| SET domain protein [Populus trichocarpa]
 gi|222853097|gb|EEE90644.1| SET domain protein [Populus trichocarpa]
          Length = 174

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 56/154 (36%), Positives = 82/154 (53%), Gaps = 15/154 (9%)

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C C  +C N+  QN    K+++ +T+  G  + A E I RG F+ EY+GEV+D+Q   +R
Sbjct: 2   CSCNKTCPNRVLQNGIRVKLEVFKTDNKGWAVRAGEPILRGTFICEYIGEVLDEQEANDR 61

Query: 157 LWKMKHLGETNFY-----------LCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWI 205
             +    G +  Y           + E      IDAT  GN SR+INHSC PN    + +
Sbjct: 62  RDRYGKEGCSYMYKIDAHTNDMSRMVEGQSHYFIDATKYGNVSRFINHSCMPNLANHQVL 121

Query: 206 IDG----ETRIGIFATRDIKKGENLTYDYQYEFL 235
           ++        IG++A+RDI  GE LTY+Y+YE L
Sbjct: 122 VNSMDSQRAHIGLYASRDISFGEELTYNYRYELL 155


>gi|392590566|gb|EIW79895.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
          Length = 160

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 55/142 (38%), Positives = 79/142 (55%), Gaps = 2/142 (1%)

Query: 104 LNKPFQNRPVKK--MKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
           LN  F     +K  ++  ++     G+ A E + RGE VIEYVGEV+  Q  ++R    +
Sbjct: 9   LNVKFNQLQTRKKHLRFARSPIHDWGLYAMERVSRGEMVIEYVGEVVRAQVADKREKAYE 68

Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
             G  + YL  I+ D+V+DAT KGN  R INHSC PN   +   I GE +I I+A +DI+
Sbjct: 69  RQGIGSSYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTARIITISGEKKIVIYAKQDIE 128

Query: 222 KGENLTYDYQYEFLHDSLIAYC 243
            G+ +TYDY +    D +   C
Sbjct: 129 LGDEITYDYHFPIEQDKIPCLC 150


>gi|341874300|gb|EGT30235.1| hypothetical protein CAEBREN_24272 [Caenorhabditis brenneri]
          Length = 249

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 66/194 (34%), Positives = 98/194 (50%), Gaps = 12/194 (6%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSC 103
           Y  I + I  T   K++  DD   C C  S       D  C    L   C   CK  N C
Sbjct: 43  YTHIDKCINRTGNNKQKRSDDPDECDCHKS--RKRCSDVTCLNRALKIECPKSCKARN-C 99

Query: 104 LNKPFQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
            N+    +   K+ +  T    G G+ A ++IK G+F+ E++GE+I +   EE   + K 
Sbjct: 100 ANQRIAKQMYAKVHVQHTNSNKGFGLYAKKEIKAGQFITEFMGEIITE---EEEFRRRKA 156

Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGET----RIGIFATR 218
                 Y+ E + +++ DAT  GN++R+INHSC PNT  ++WI+ G T    RI  FA  
Sbjct: 157 KPGVYRYILE-SANLIFDATKYGNEARFINHSCAPNTRCERWIVPGRTSNIPRIAFFANT 215

Query: 219 DIKKGENLTYDYQY 232
            IK G+ LT+DY++
Sbjct: 216 TIKAGQELTFDYKF 229


>gi|398388287|ref|XP_003847605.1| hypothetical protein MYCGRDRAFT_97419 [Zymoseptoria tritici IPO323]
 gi|339467478|gb|EGP82581.1| hypothetical protein MYCGRDRAFT_97419 [Zymoseptoria tritici IPO323]
          Length = 1069

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 63/190 (33%), Positives = 100/190 (52%), Gaps = 17/190 (8%)

Query: 68  CSCTASPGSSGV-CDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQNRPVKKMK------- 117
           C C   PG   + CD DC   ++   C+   C+   S C N+ F     +  K       
Sbjct: 661 CICR-DPGQGDLGCDYDCINRVMQYECNDQNCRLPASVCSNRAFAQLAARTKKGGLFDVG 719

Query: 118 --LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINR 175
             +V+T + G GI +      G+ ++EY GE+I +  C+ R+ ++ H  +  +YL E+ R
Sbjct: 720 VEVVKTHERGFGIRSCRSFNAGQIIMEYTGEIISEGECQRRMREVYH-DKACYYLMELER 778

Query: 176 DMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYE- 233
           ++VID T KG+ +R+INHSC PN E++   ++G  R+ +FA    I  GE LTYDY ++ 
Sbjct: 779 NLVIDGT-KGSMARFINHSCEPNCEVRMVKVNGTPRMAVFAGESGIMTGEELTYDYNFDN 837

Query: 234 FLHDSLIAYC 243
           F   +   YC
Sbjct: 838 FGESAQKCYC 847


>gi|400595512|gb|EJP63307.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 824

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 54/178 (30%), Positives = 95/178 (53%), Gaps = 18/178 (10%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCSS-GCKCGN-SCLNKPFQNRPVKKMK-------- 117
           C C    G    C  +C   ++L  C+   C  G  +C N+ F     ++ K        
Sbjct: 416 CVCKPEDG----CGDNCQNRIMLYECNDINCNAGKETCTNRAFATLTARRAKGGKYRVGV 471

Query: 118 -LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
            +++T   G G+ ++   +  + ++EY GE+I ++ CE R+   ++     +YL   +++
Sbjct: 472 EVIKTSDRGYGVRSNRCFRPHQIIMEYAGEIITEEECERRM-NNEYKNNECYYLMSFDQN 530

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYE 233
           M+IDAT  G+ +R++NHSC PN  M KWI+ G+ R+ +FA  + I  G+ LTYDY ++
Sbjct: 531 MIIDAT-TGSIARFVNHSCNPNCRMIKWIVSGQPRMALFAGDNPIMTGDELTYDYNFD 587


>gi|328875267|gb|EGG23632.1| hypothetical protein DFA_05766 [Dictyostelium fasciculatum]
          Length = 1603

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 47/106 (44%), Positives = 72/106 (67%), Gaps = 1/106 (0%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G  A + I  GEF+IEYVGE+I  +  +ER  K ++ G  + ++  I +D+++DAT+KGN
Sbjct: 1478 GAFAKKSIVAGEFIIEYVGEIIRQKIADERERKYQNDG-VDCFMFAIEKDVIVDATFKGN 1536

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            ++R+ NHSC PN + +   IDG  +I I A++DI K E +TYDYQ+
Sbjct: 1537 RARFANHSCEPNAKTKIISIDGVKKIIIVASKDIAKNEEITYDYQF 1582


>gi|389746109|gb|EIM87289.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 191

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 53/117 (45%), Positives = 70/117 (59%)

Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
           G+ A E I RGE VIEYVGEVI  Q  ++R    +  G  + YL  I+ D+V+DAT KGN
Sbjct: 65  GLYAMERIARGEMVIEYVGEVIRAQIADKREKAYERQGIGSSYLFRIDEDLVVDATKKGN 124

Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
             R INHSC PN   +   I GE +I I+A +DI+ G+ +TYDY +    D +   C
Sbjct: 125 LGRLINHSCDPNCTAKIITILGEKKIVIYAKQDIELGDEITYDYHFPIEQDKIPCLC 181


>gi|213402529|ref|XP_002172037.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
           yFS275]
 gi|212000084|gb|EEB05744.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
           yFS275]
          Length = 977

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 48/117 (41%), Positives = 73/117 (62%)

Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
           G+ A E+I + + VIEYVGE++  +  + R  K    G  + YL  I++D ++DAT KGN
Sbjct: 851 GLFAMENIDKNDMVIEYVGEIVRQRVADTRERKYVREGIGDSYLFRIDKDAIVDATKKGN 910

Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
            +R+INHSC PN   +   ++G  +I I+A RDI++GE LTYDY++    D +   C
Sbjct: 911 IARFINHSCAPNCIAKIIRVEGHQKIVIYADRDIEEGEELTYDYKFPEEVDKIPCLC 967


>gi|403176252|ref|XP_003334956.2| hypothetical protein PGTG_16563 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375172169|gb|EFP90537.2| hypothetical protein PGTG_16563 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 765

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 74/228 (32%), Positives = 113/228 (49%), Gaps = 28/228 (12%)

Query: 25  GNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLED-DGIFCSCTASPGSSGVCDRD 83
           G+P   +  +W  +P A  Y+ IK N+Y  +  K  + D     C+CT      G C   
Sbjct: 431 GDP-NLDAENW--EPPA--YIKIKSNVYPFR--KPEISDFPPAICNCT-----DGGCGEK 478

Query: 84  CHCGMLLSSCSSG-CKCGNSCLNKPFQNRPV----------KKMKLVQTEKC-GAGIVAD 131
           C    +   C    C+ G +C N PF  R            K +K+  T    G G+  +
Sbjct: 479 CLNRAMFYLCDPKLCQLGQACKNLPFNQRKDAIDESSPRLGKGLKVFYTGPIKGWGLKTE 538

Query: 132 EDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYI 191
             +K+  F+I+Y GE+I    C +R+   ++ G  +FYL + +   VIDA  KGN +R+I
Sbjct: 539 IKLKKDRFLIDYRGEIISRDACYKRVLN-EYSGCKHFYLMDYDGSDVIDAGLKGNCARFI 597

Query: 192 NHSCCPNTEMQKWIIDG--ETRIGIFATRDIKKGENLTYDYQYEFLHD 237
           NHSC PN  ++++ + G  E + GIF  RDI+ GE L+YDY +    D
Sbjct: 598 NHSCTPNLRVERFKLSGLEEYQFGIFTLRDIEIGEELSYDYGWRNFSD 645


>gi|225685245|gb|EEH23529.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
          Length = 756

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 59/188 (31%), Positives = 101/188 (53%), Gaps = 18/188 (9%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCSS-GCKCGNS-CLNKPFQN--RPVKK-------M 116
           C CT   G    CD +C    +   C    CK G   C N+ F++  + +K        +
Sbjct: 312 CMCTPESG----CDENCQNRCMFYECDDDNCKLGAELCRNRSFEDLRQRIKAGGKYNIGV 367

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
           ++++T   G G+ ++      + ++EY GE+I  + CE R+  +    E  +YL   +++
Sbjct: 368 EVIKTADRGYGVRSNRTFAPNQIIVEYTGEIITQKECERRMRTVYKNNEC-YYLMYFDQN 426

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEFL 235
           M+IDAT +G+ +R++NHSC PN EM+KW + G+ R+ +FA ++ I  GE LTYDY ++  
Sbjct: 427 MIIDAT-RGSIARFVNHSCEPNCEMEKWTVAGKPRMALFAGKNGITTGEELTYDYNFDPY 485

Query: 236 HDSLIAYC 243
               +  C
Sbjct: 486 SQKNVQEC 493


>gi|239613825|gb|EEQ90812.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ER-3]
          Length = 776

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 60/188 (31%), Positives = 98/188 (52%), Gaps = 24/188 (12%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQN-RPVKKM-------- 116
           C CT   G    CD +C    +   C  + CK G   C N+ F+  R   K+        
Sbjct: 383 CMCTPESG----CDENCQNRYMFYECDDNNCKLGAELCGNRNFEGLRQRSKLGGKYNIGV 438

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
           ++++T   G G+ ++      + ++EY GE++  + CE R+        T +YL   +R+
Sbjct: 439 EVIKTADRGYGVRSNRTFAPNQIIVEYTGEIVTQEECERRM-------RTCYYLMYFDRN 491

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYEFL 235
           M+IDAT +G+ +R++NHSC PN +++KW + G+ R+ +FA    I  GE LTYDY +E+ 
Sbjct: 492 MIIDAT-RGSIARFVNHSCEPNCKIEKWTVAGKPRMALFAGEHGIMTGEELTYDYNFEYS 550

Query: 236 HDSLIAYC 243
              L   C
Sbjct: 551 PLHLAPQC 558


>gi|328768995|gb|EGF79040.1| hypothetical protein BATDEDRAFT_35515 [Batrachochytrium dendrobatidis
            JAM81]
          Length = 1367

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 49/119 (41%), Positives = 72/119 (60%), Gaps = 4/119 (3%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCE--ERLWKMKHLGETNFYLCEINRDMVIDATYK 184
            G+ A E I   + VIEY+GE+I  +  +  E+L++   +G +  YL  ++ D +IDAT  
Sbjct: 1241 GLFAMEPIDANDMVIEYIGEIIRQKVADHREKLYEASGIGSS--YLFRVDEDTIIDATKT 1298

Query: 185  GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
            GN +R+INH C PN   +   +DG  RI I+A RDIK+GE LTYDY++    D +   C
Sbjct: 1299 GNLARFINHCCEPNCNAKVISVDGTKRIVIYANRDIKEGEELTYDYKFPIEEDKIPCLC 1357


>gi|295663144|ref|XP_002792125.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226279300|gb|EEH34866.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 816

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 58/188 (30%), Positives = 103/188 (54%), Gaps = 18/188 (9%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQN--RPVKK-------M 116
           C CT   G    CD +C    +   C  + CK G   C N+ F++  + +K        +
Sbjct: 373 CMCTPENG----CDENCQNRCMFYECDDNNCKLGAELCRNRSFEDLRQRIKAGGKYNIGV 428

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
           ++++T   G G+ ++   +  + ++EY GE++  + CE R+  +    E  +YL   +++
Sbjct: 429 EVIKTADRGYGVRSNRTFEPNQIIVEYTGEIVTQKECERRMRTVYKNNEC-YYLMYFDQN 487

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEFL 235
           M+IDAT +G+ +R++NHSC PN EM+KW + G+ R+ +FA ++ I  GE LTYDY ++  
Sbjct: 488 MIIDAT-RGSIARFVNHSCEPNCEMEKWTVAGKPRMALFAGKNGITTGEELTYDYNFDPY 546

Query: 236 HDSLIAYC 243
               +  C
Sbjct: 547 SQKNVQEC 554


>gi|358253208|dbj|GAA52505.1| histone-lysine N-methyltransferase SETD2 [Clonorchis sinensis]
          Length = 757

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 47/97 (48%), Positives = 61/97 (62%)

Query: 136 RGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSC 195
           +  F+IEYVGEVID      R+ + + LG  + Y   +  D  IDA  KGN +R++NHSC
Sbjct: 4   KSTFIIEYVGEVIDFAEFRRRIRRYERLGHAHHYFMALESDRFIDAGAKGNWARFVNHSC 63

Query: 196 CPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            PN   QKW +DGE RIG FA  DI+ GE +T DYQ+
Sbjct: 64  EPNCVTQKWSVDGEIRIGFFAREDIEAGEEITIDYQF 100


>gi|301114847|ref|XP_002999193.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262111287|gb|EEY69339.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 295

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 51/119 (42%), Positives = 74/119 (62%), Gaps = 2/119 (1%)

Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
           G G+   EDIK G FV EY+GE++ +Q  E  + ++ +  E + Y+  ++   VIDAT  
Sbjct: 13  GLGMKLLEDIKAGSFVGEYMGEIVTEQ--EYYMRRVLYHNEKHRYMMVLSGGEVIDATRM 70

Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
           G  +R+INHSC PN  ++KW ++GE R  IFA RDI  GE LT+DY++E    + I  C
Sbjct: 71  GGWARFINHSCNPNCGVEKWDVNGEERCAIFALRDIVAGEELTFDYKFESFSKAEITEC 129


>gi|340519773|gb|EGR50011.1| predicted protein [Trichoderma reesei QM6a]
          Length = 654

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 55/176 (31%), Positives = 95/176 (53%), Gaps = 14/176 (7%)

Query: 80  CDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQNRPVKKMK---------LVQTEKCGAGI 128
           C   C   ++L  C  + C  G + C N+ F +   ++ +         +++T   G G+
Sbjct: 416 CGESCQNRIMLYECDDTNCNIGKARCTNRAFADLTARRARGGKYRVGVEVIKTADRGYGV 475

Query: 129 VADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKS 188
            ++   K  + ++EY GE+I ++ CE R+ ++    E  +YL   +++M+IDAT  G+ +
Sbjct: 476 RSNRCFKPHQIIMEYAGEIITEEECERRMNEVYKNNEC-YYLMSFDQNMIIDATT-GSIA 533

Query: 189 RYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYEFLHDSLIAYC 243
           R++NHSC PN  M KWI+ G+ R+ +FA  R I  GE LTYDY ++      +  C
Sbjct: 534 RFVNHSCNPNCRMIKWIVSGQPRMALFAGDRPIMTGEELTYDYNFDPFSAKNVQKC 589


>gi|359492362|ref|XP_002284621.2| PREDICTED: uncharacterized protein LOC100245350 [Vitis vinifera]
          Length = 2184

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 61/190 (32%), Positives = 87/190 (45%), Gaps = 40/190 (21%)

Query: 44   YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
            ++ I+ N++L +  + +  D+ + C C         C  +C   ML   C  G C CG+ 
Sbjct: 1217 WMLIRSNLFLHRSRRTQTIDEVMVCHCKRPVEGRFGCGDECLNRMLNIECVQGTCPCGDL 1276

Query: 103  CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
            C N+                                       +V+D QT E R  +   
Sbjct: 1277 CSNQ---------------------------------------QVLDLQTYEARQKEYAS 1297

Query: 163  LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
             G  +FY   +N   VIDA  KGN  R+INHSC PN   +KW+++GE  IG+FA RDIKK
Sbjct: 1298 RGHKHFYFMTLNGSEVIDACAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKK 1357

Query: 223  GENLTYDYQY 232
            GE +T+DY Y
Sbjct: 1358 GEEVTFDYNY 1367


>gi|341896622|gb|EGT52557.1| hypothetical protein CAEBREN_29129 [Caenorhabditis brenneri]
          Length = 873

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 67/194 (34%), Positives = 96/194 (49%), Gaps = 14/194 (7%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSC 103
           Y  I  N Y TK +      +   C C  + G     +  C    +++ C S CK    C
Sbjct: 182 YEHIVENQY-TKELMSEERTESAHCECHEN-GKDCSSEATCTNMGMMTECPSTCK-AEGC 238

Query: 104 LNKPFQNRPVKKMKL-VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
            N+  Q R   ++ +   +E+ G G++  + I+RG+FVIEYVGEV      E+     +H
Sbjct: 239 RNQRIQRREYAELAIKFISEEVGHGVITLKPIRRGQFVIEYVGEVTQIGDEEKATSDSRH 298

Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGE----TRIGIFATR 218
                F+  E +R   IDAT KGN SR+INH C PN E  KW + G      R+  FA  
Sbjct: 299 ---RYFFDSESHR---IDATRKGNISRFINHHCSPNLECHKWSVPGTPDNLQRLVFFAKN 352

Query: 219 DIKKGENLTYDYQY 232
           DI+ GE + +DYQ+
Sbjct: 353 DIEAGEEINFDYQF 366


>gi|168060592|ref|XP_001782279.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666292|gb|EDQ52951.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 690

 Score = 97.4 bits (241), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 55/163 (33%), Positives = 84/163 (51%), Gaps = 14/163 (8%)

Query: 87  GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
           G ++  C+S C C   C N+  Q     K+++ ++   G  + + + I  G FV EY+GE
Sbjct: 509 GYMVYECNSSCLCREDCQNRVLQKGVRVKLEVFKSRHKGWAVRSAQPIPSGTFVCEYIGE 568

Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDM----------VIDATYKGNKSRYINHSCC 196
           V++D+   +R  +    G +  Y  + + DM          VIDAT  GN +R+INHSC 
Sbjct: 569 VVNDREANQRGVRYDQDGCSYLYDIDAHLDMSISRAGAKPFVIDATKHGNVARFINHSCA 628

Query: 197 PNTEMQKWIIDGE----TRIGIFATRDIKKGENLTYDYQYEFL 235
           PN    + +++        IG FA RDI  GE L YDY+Y+ L
Sbjct: 629 PNLINYEVLVESMDCQLAHIGFFANRDISAGEELAYDYRYKLL 671


>gi|299746032|ref|XP_002910994.1| Setd1a protein [Coprinopsis cinerea okayama7#130]
 gi|298406870|gb|EFI27500.1| Setd1a protein [Coprinopsis cinerea okayama7#130]
          Length = 1614

 Score = 97.4 bits (241), Expect = 4e-18,   Method: Composition-based stats.
 Identities = 51/114 (44%), Positives = 68/114 (59%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A E I +GE VIEYVGEVI  Q  ++R    +  G  + YL  I+ + V+DAT KGN
Sbjct: 1487 GLYAMEKISKGEMVIEYVGEVIRAQVADKREKTYEKQGIGSSYLFRIDEEFVVDATKKGN 1546

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLI 240
              R INHSC PN   +   I G  +I I+A +DI+ GE +TYDY +    D+ I
Sbjct: 1547 LGRLINHSCDPNCTAKIITISGVKKIVIYAKQDIELGEEITYDYHFPIEQDNKI 1600


>gi|409078063|gb|EKM78427.1| hypothetical protein AGABI1DRAFT_41599 [Agaricus bisporus var.
           burnettii JB137-S8]
 gi|426194069|gb|EKV44001.1| histone methyltransferase [Agaricus bisporus var. bisporus H97]
          Length = 163

 Score = 97.4 bits (241), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 53/130 (40%), Positives = 74/130 (56%)

Query: 114 KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
           K ++  ++     G+ A E I RGE VIEYVGEVI     ++R    +  G  + YL  I
Sbjct: 24  KHLRFAKSPIHDWGLYAMERIARGEMVIEYVGEVIRAAVADKREKAYERQGIGSSYLFRI 83

Query: 174 NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           + D+V+DAT KGN  R INHSC PN   +   I G  +I I+A +DI+ G+ +TYDY + 
Sbjct: 84  DEDLVVDATKKGNLGRLINHSCDPNCTAKIITISGVKKIVIYAKQDIELGDEITYDYHFP 143

Query: 234 FLHDSLIAYC 243
           F  D +   C
Sbjct: 144 FEQDKIPCLC 153


>gi|326429534|gb|EGD75104.1| set-domain-containing protein [Salpingoeca sp. ATCC 50818]
          Length = 508

 Score = 97.4 bits (241), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 59/157 (37%), Positives = 84/157 (53%), Gaps = 11/157 (7%)

Query: 84  CHCGMLLSSCSS----GCKCGN---SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKR 136
           C C   +  C       C  G+   SC N   Q R  K + L  ++  G GI   EDI++
Sbjct: 338 CPCAAAVRECDPDLCISCGAGDEEPSCKNCGIQRRQHKHLLLAPSDVAGWGIFTKEDIQK 397

Query: 137 GEFVIEYVGEVIDDQTCEERLWKM-KHLGETNFYLCEINRDMVIDATYKGNKSRYINHSC 195
           G F+ EY GEVI  +  E R     KH+     +L  +N + V+DAT KGNK R+ NH+ 
Sbjct: 398 GAFISEYCGEVISQEEAERRGKVYDKHMCS---FLFNLNAEYVVDATRKGNKIRFANHAN 454

Query: 196 CPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            PN   +  +++GE RIGI+A R+I  G+ L +DY+Y
Sbjct: 455 DPNCNARVMLVNGEHRIGIYAKRNIPAGKELFFDYRY 491


>gi|242066244|ref|XP_002454411.1| hypothetical protein SORBIDRAFT_04g030350 [Sorghum bicolor]
 gi|241934242|gb|EES07387.1| hypothetical protein SORBIDRAFT_04g030350 [Sorghum bicolor]
          Length = 633

 Score = 97.1 bits (240), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 60/166 (36%), Positives = 85/166 (51%), Gaps = 18/166 (10%)

Query: 87  GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
           G  +  C+S C C +SC NK  Q   + K++L +TE  G  I A E I +G FV EY+GE
Sbjct: 450 GYPIYECNSSCTCDSSCQNKVLQKGLLVKLELFRTENKGWAIRAAEPIPQGTFVCEYIGE 509

Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDM-------------VIDATYKGNKSRYINH 193
           V+      +    +   G  + YL +I   +             +IDAT  GN SRYINH
Sbjct: 510 VVKADKTMKNAESVSSKGGCS-YLFDIASQIDMERVRTVGAIEYLIDATRSGNVSRYINH 568

Query: 194 SCCPNTEMQKWIIDGE----TRIGIFATRDIKKGENLTYDYQYEFL 235
           SC PN   +  +++ +      IG+FA RDI  GE L YDY+ + +
Sbjct: 569 SCSPNLSTRLVLVESKDCQLAHIGLFANRDIAVGEELAYDYRQKLV 614


>gi|302416827|ref|XP_003006245.1| histone-lysine N-methyltransferase [Verticillium albo-atrum VaMs.102]
 gi|261355661|gb|EEY18089.1| histone-lysine N-methyltransferase [Verticillium albo-atrum VaMs.102]
          Length = 1135

 Score = 97.1 bits (240), Expect = 5e-18,   Method: Composition-based stats.
 Identities = 49/120 (40%), Positives = 71/120 (59%)

Query: 114  KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
            K +K  ++     G+  +E+I + + +IEYVGE +  Q  E R  +    G  + YL  I
Sbjct: 993  KPVKFARSAIHNWGLYTEENINKDDMIIEYVGEQVRQQISEIREVRYLKQGMGSSYLFRI 1052

Query: 174  NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
            + + VIDAT KG  +R+INHSC PN   +   +DG  RI I+A RDI + E LTYDY++E
Sbjct: 1053 DENTVIDATKKGGIARFINHSCMPNCTAKIIKVDGSKRIVIYALRDIARTEELTYDYKFE 1112


>gi|345315868|ref|XP_001516412.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2,
           partial [Ornithorhynchus anatinus]
          Length = 951

 Score = 97.1 bits (240), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 60/158 (37%), Positives = 84/158 (53%), Gaps = 4/158 (2%)

Query: 40  KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
           K  PY  IK N    K ++    D      C   P     C  D +C   ML+  C    
Sbjct: 795 KPPPYKHIKVNKPCGK-VQIYTADISEIPKCNCKPTDESPCGFDSECLNRMLMYECHPQV 853

Query: 97  CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
           C  G  C N+ F  R   + K+V+T+  G G++A  DIK+GEFV EYVGE+ID++ C  R
Sbjct: 854 CPAGEQCQNQCFTKRQYPETKIVKTDGKGWGLLAKRDIKKGEFVNEYVGELIDEEECMAR 913

Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
           +   +    T+FY+  I++D +IDA  KGN SR++NHS
Sbjct: 914 IKHAQENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHS 951


>gi|384499027|gb|EIE89518.1| hypothetical protein RO3G_14229 [Rhizopus delemar RA 99-880]
          Length = 1674

 Score = 97.1 bits (240), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 48/119 (40%), Positives = 74/119 (62%)

Query: 114  KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
            K++K  ++     G+ A+E I   + VIEYVGE+I  Q  EER  + +  G  + YL  +
Sbjct: 1535 KQLKFAKSPIHDWGLFAEEHIDVNDMVIEYVGEMIRQQVAEEREKQYERCGIGSSYLFRV 1594

Query: 174  NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            + D VIDAT +G+ +R+INH C PN   +   +D + +I I+A RDI+ GE +TYDY++
Sbjct: 1595 DDDTVIDATKRGSIARFINHCCSPNCSAKIITVDKQKKIVIYANRDIEPGEEITYDYKF 1653


>gi|346974289|gb|EGY17741.1| histone-lysine N-methyltransferase [Verticillium dahliae VdLs.17]
          Length = 1148

 Score = 97.1 bits (240), Expect = 6e-18,   Method: Composition-based stats.
 Identities = 49/120 (40%), Positives = 71/120 (59%)

Query: 114  KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
            K +K  ++     G+  +E+I + + +IEYVGE +  Q  E R  +    G  + YL  I
Sbjct: 1006 KPVKFARSAIHNWGLYTEENINKDDMIIEYVGEQVRQQISEIREVRYLKQGMGSSYLFRI 1065

Query: 174  NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
            + + VIDAT KG  +R+INHSC PN   +   +DG  RI I+A RDI + E LTYDY++E
Sbjct: 1066 DENTVIDATKKGGIARFINHSCMPNCTAKIIKVDGSKRIVIYALRDIARTEELTYDYKFE 1125


>gi|389740685|gb|EIM81875.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 397

 Score = 96.7 bits (239), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 71/202 (35%), Positives = 102/202 (50%), Gaps = 18/202 (8%)

Query: 49  RNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG--CKCG------ 100
           RN + +KR KRR       C+C+ S   +     DC C      C      +CG      
Sbjct: 183 RNCHCSKRCKRRWRG----CTCSKSKRRNLCRTNDCPCWRAHRECDPELCTRCGAKYLFD 238

Query: 101 NSCLNKPFQNRPVKKMKL-VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
           + C N   Q   +K+ +L   T   G  + ADE    G++++EYVGE++ D+T E R   
Sbjct: 239 DECKNIAIQQNHIKRTELRSATYGLGLFLAADETALEGDYIMEYVGELVYDETAESRQDV 298

Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINH--SCCPNTEMQKWIIDGETRIGIFAT 217
            K+LG   FY   +N  + IDA+  GN++R+INH  S   N   +  +++GE RIGIFA 
Sbjct: 299 AKYLGRNYFY--RLNETLNIDASRVGNEARFINHAPSRLANCNAEVLLVNGEHRIGIFAA 356

Query: 218 RDIKKGENLTYDYQYE-FLHDS 238
           R I+  E L  DY    F+HDS
Sbjct: 357 RYIESNEELFLDYGPGFFIHDS 378


>gi|429329896|gb|AFZ81655.1| hypothetical protein BEWA_010720 [Babesia equi]
          Length = 1153

 Score = 96.7 bits (239), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 62/172 (36%), Positives = 89/172 (51%), Gaps = 13/172 (7%)

Query: 74  PGSSGVCDRDCHCGMLLSSCSSGCKCG--------NSCLNKPFQNRPVKKMKLVQTEKCG 125
           P S+  C  +  CG   S+     +C          +C N+ F N    K+KL   E  G
Sbjct: 587 PDSNTRCTCEISCGPDCSNILKNVECTPKNCSFHEKNCGNRRFTNISAPKLKLGFVEGKG 646

Query: 126 AGIVADEDIKRGEFVIEYVGEVIDDQTCEERL--WKMKHLGETN---FYLCEINRDMVID 180
            G  A EDI+  E V EYVGEVI     +  L  W    + + N   +Y+ +I++D+ ID
Sbjct: 647 IGAFATEDIEIDELVCEYVGEVITHSDFQRSLSSWSFAEIDDNNQCHWYIMKIHKDIYID 706

Query: 181 ATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
           +T+ GN +R+INHSC PN       + G  R+G+FA R I KGE +TY+Y +
Sbjct: 707 STHLGNVARFINHSCDPNCSSIPINVRGIYRMGVFAQRKIIKGEEVTYNYGF 758


>gi|406868443|gb|EKD21480.1| SET domain-containing protein [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 796

 Score = 96.7 bits (239), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 58/188 (30%), Positives = 101/188 (53%), Gaps = 18/188 (9%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCG-NSCLNKPFQNRPVKK---------M 116
           C C    G    CD DC    +L  C  G C  G + C N+ F +   ++         +
Sbjct: 420 CVCKPEEG----CDEDCMNRTMLYECDDGNCASGRDRCTNRAFADLQERRKAGGKYRIGV 475

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
           ++++T   G G+ ++   +  + ++EY GE+I ++ C+ R+ +     E  +YL   +++
Sbjct: 476 EVIKTADRGYGVRSNRCFEANQIIVEYTGEIITEEECDRRMNEDYKNNEC-YYLMSFDQN 534

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEFL 235
           M+IDAT KG+ +R++NHSC PN +M KWI+ G+ R+ +FA  + I  G+ LTYDY ++  
Sbjct: 535 MIIDAT-KGSIARFVNHSCNPNCKMVKWIVGGKPRMALFAGDNPIMTGDELTYDYNFDPF 593

Query: 236 HDSLIAYC 243
               +  C
Sbjct: 594 SAKNVQEC 601


>gi|219130356|ref|XP_002185333.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403248|gb|EEC43202.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 164

 Score = 96.7 bits (239), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 45/141 (31%), Positives = 76/141 (53%), Gaps = 2/141 (1%)

Query: 93  CSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
           C   C     C N+   N   + +++V+  + G G+V +ED+  G+ ++EYVG  +  + 
Sbjct: 1   CPKTCPGDMFCGNRRITNGEFRTVRVVEAGRKGRGLVVEEDVDVGDMILEYVGRAVPQKQ 60

Query: 153 CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
             +   + +H  +   Y+  +   + IDA  KG  +RYINHSC PN ++Q+W + G  R 
Sbjct: 61  LAKYFRRYQH--DRRLYIMSLGDGIYIDARSKGGLARYINHSCEPNCQVQRWKVKGVLRA 118

Query: 213 GIFATRDIKKGENLTYDYQYE 233
            +  TR +  G  LT+DYQ+E
Sbjct: 119 VVVPTRSLSAGTELTFDYQWE 139


>gi|226294579|gb|EEH49999.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
          Length = 808

 Score = 96.7 bits (239), Expect = 9e-18,   Method: Composition-based stats.
 Identities = 56/178 (31%), Positives = 95/178 (53%), Gaps = 27/178 (15%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCSS-GCKCGNS-CLNKPFQN--RPVKK-------M 116
           C CT   G    CD +C    +   C    CK G   C N+ F++  + +K        +
Sbjct: 373 CMCTPESG----CDENCQNRCMFYECDDDNCKLGAELCRNRSFEDLRQRIKAGGKYNIGV 428

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
           ++++T   G G+ ++      + ++EY GE+I  + CE R+  +           + N +
Sbjct: 429 EVIKTADRGYGVRSNRTFAPNQIIVEYTGEIITQKECERRMRTVY----------KNNEN 478

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYE 233
           M+IDAT +G+ +R++NHSC PN EM+KW + G+ R+ +FA ++ I  GE LTYDY ++
Sbjct: 479 MIIDAT-RGSIARFVNHSCEPNCEMEKWTVAGKPRMALFAGKNGITTGEELTYDYNFD 535


>gi|405968248|gb|EKC33333.1| Histone-lysine N-methyltransferase SUV39H2 [Crassostrea gigas]
          Length = 628

 Score = 96.3 bits (238), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 62/152 (40%), Positives = 83/152 (54%), Gaps = 8/152 (5%)

Query: 87  GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVG 145
           G  +  C+  CKCG  C N+  Q     K+ L +T    G G+   + IK G FV+EYVG
Sbjct: 312 GTPIYECNKRCKCGPECPNRVVQQGRKFKVCLFRTANGRGWGVKTLQKIKEGSFVVEYVG 371

Query: 146 EVIDDQTCEERLWKMKHLGETNFYLCEINR---DMVIDATYKGNKSRYINHSCCPNTEM- 201
           EVI D+  E R  +   +G T  +  + N       +DA Y GN S +INHSC PN E+ 
Sbjct: 372 EVITDKEAERRGKQYDAVGRTYLFDLDYNPGDCPFTVDAGYYGNVSHFINHSCDPNLEVF 431

Query: 202 QKWI--IDGE-TRIGIFATRDIKKGENLTYDY 230
             WI  +D    RI +F+ RDI+KGE LT+DY
Sbjct: 432 AVWINTLDPRLPRIALFSKRDIEKGEELTFDY 463


>gi|254565269|ref|XP_002489745.1| Histone methyltransferase with a role in transcriptional elongation
           [Komagataella pastoris GS115]
 gi|238029541|emb|CAY67464.1| Histone methyltransferase with a role in transcriptional elongation
           [Komagataella pastoris GS115]
 gi|328350161|emb|CCA36561.1| histone-lysine N-methyltransferase ASH1L [Komagataella pastoris CBS
           7435]
          Length = 589

 Score = 96.3 bits (238), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 59/184 (32%), Positives = 92/184 (50%), Gaps = 34/184 (18%)

Query: 91  SSCSSGCKCGNSCLNKPFQNRPVKKM-KLVQTEKCGA----------------------- 126
           +SC+    CG+ CLN+  Q      + KL +T  CG                        
Sbjct: 184 NSCNCRVNCGDLCLNRILQIECDSTICKLSETRSCGNTQFTTLELGISSLFREGTRVCRI 243

Query: 127 ------GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVID 180
                 G+VA  +    E + EY GEVI+ +  +ERL K K     ++Y   + + + ID
Sbjct: 244 DDKKGYGLVAIREFAPYELICEYTGEVINQEVVKERLSKKKVF---HYYHLSLEQGLSID 300

Query: 181 ATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYEFLHDSL 239
           +T KG+ +R++NHSC PN E+QKW +  E RIG+FA ++ I  G+ +TYDY + +L ++ 
Sbjct: 301 STVKGSVARFVNHSCAPNAEVQKWYVQDEPRIGLFAGSKGIIPGDEITYDYNFIWLENAE 360

Query: 240 IAYC 243
              C
Sbjct: 361 PQLC 364


>gi|348683877|gb|EGZ23692.1| hypothetical protein PHYSODRAFT_462592 [Phytophthora sojae]
          Length = 292

 Score = 96.3 bits (238), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 51/127 (40%), Positives = 78/127 (61%), Gaps = 2/127 (1%)

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
           +LV   + G G+   E++K G FV EY+GE++ +Q  E  + ++ +  E + Y+  ++  
Sbjct: 5   QLVDCGRKGLGLKLLENVKAGSFVGEYMGEIVTEQ--EYYMRRVLYHNEKHRYMMVLSGG 62

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLH 236
            VIDAT  G  +R+INHSC PN  ++KW ++GE R  IFA RDI  GE LT+DY++E   
Sbjct: 63  EVIDATRMGGWARFINHSCDPNCGVEKWDVNGEERCAIFALRDIVVGEELTFDYKFESFS 122

Query: 237 DSLIAYC 243
            + I  C
Sbjct: 123 KAEITEC 129


>gi|339244151|ref|XP_003378001.1| putative SET domain protein [Trichinella spiralis]
 gi|316973127|gb|EFV56754.1| putative SET domain protein [Trichinella spiralis]
          Length = 1416

 Score = 96.3 bits (238), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 56/141 (39%), Positives = 85/141 (60%), Gaps = 10/141 (7%)

Query: 101  NSCLNKPFQNRPVKK-----MKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCE- 154
             S + K +Q R +K+     + L +++  G G+ A+ D++    VIEYVGEVI ++  E 
Sbjct: 1258 QSLVAKCYQYRKLKQEWKSNVYLARSKIQGLGLFANRDVEMNAMVIEYVGEVIRNEVAER 1317

Query: 155  -ERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIG 213
             E+ ++ ++ G    Y+  ++ D VIDAT  G  +RYINHSC PN   ++   D E+RI 
Sbjct: 1318 REKSYQKRNRG---VYMFRLDSDHVIDATVAGGPARYINHSCDPNCIAERIDFDRESRIV 1374

Query: 214  IFATRDIKKGENLTYDYQYEF 234
            I + R I KGE LTYDYQ++F
Sbjct: 1375 IMSCRPICKGEELTYDYQFDF 1395


>gi|62858103|ref|NP_001016508.1| histone-lysine N-methyltransferase SUV39H2 [Xenopus (Silurana)
           tropicalis]
 gi|143586876|sp|Q28CQ7.2|SUV92_XENTR RecName: Full=Histone-lysine N-methyltransferase SUV39H2; AltName:
           Full=Suppressor of variegation 3-9 homolog 2;
           Short=Su(var)3-9 homolog 2
          Length = 406

 Score = 96.3 bits (238), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 60/163 (36%), Positives = 86/163 (52%), Gaps = 10/163 (6%)

Query: 87  GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVG 145
           G  +  C+S CKCG  C N+  Q  P   + + +T+   G G+   + IK+  FV+EYVG
Sbjct: 219 GRPIYECNSRCKCGPDCPNRVVQKGPPYSLCIFRTDNGRGWGVKTLQKIKKNSFVMEYVG 278

Query: 146 EVIDDQTCEERLWKMKHLGETNFYLCEINRD-MVIDATYKGNKSRYINHSCCPNTEMQKW 204
           EVI  +  E R  +    G T  +  +   D   +DA   GN S ++NHSC PN ++   
Sbjct: 279 EVITSEEAERRGQQYDSRGITYLFDLDYEADEFTVDAARYGNVSHFVNHSCDPNLQVFNV 338

Query: 205 IIDGE----TRIGIFATRDIKKGENLTYDYQY----EFLHDSL 239
            ID       RI +F+TR+IK GE LT+DYQ     +F  DS+
Sbjct: 339 FIDNLDVRLPRIALFSTRNIKAGEELTFDYQMKGSGDFSTDSI 381


>gi|391331299|ref|XP_003740087.1| PREDICTED: uncharacterized protein LOC100899404 [Metaseiulus
            occidentalis]
          Length = 2686

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 50/121 (41%), Positives = 73/121 (60%), Gaps = 5/121 (4%)

Query: 125  GAGIVADEDIKRGEFVIEYVGEVIDDQTCE--ERLWKMKHLGETNFYLCEINRDMVIDAT 182
            G G+   +DI +GE +IEY GEVI    C+  E+ ++ + LG    Y+  ++ D V+DAT
Sbjct: 2559 GRGLYCKKDIAKGEMIIEYAGEVIRASLCDRREKYYEGRGLG---CYMFRMDNDEVVDAT 2615

Query: 183  YKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAY 242
             KGN +R+INHSC PN   +   +D +  I I+A R+I+ GE LTYDY++    D L   
Sbjct: 2616 VKGNAARFINHSCDPNCYSKMITVDNKKHIVIYALREIRTGEELTYDYKFPIEDDKLHCT 2675

Query: 243  C 243
            C
Sbjct: 2676 C 2676


>gi|242786320|ref|XP_002480782.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
 gi|218720929|gb|EED20348.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
          Length = 1155

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 49/112 (43%), Positives = 68/112 (60%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q  + R  +    G  + YL  I+ + VIDAT +G 
Sbjct: 1026 GLYAEENIAANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENAVIDATKRGG 1085

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI K E LTYDY++E   DS
Sbjct: 1086 IARFINHSCTPNCTAKIIRVDGSKRIVIYALRDISKDEELTYDYKFEREWDS 1137


>gi|165971516|gb|AAI58300.1| suppressor of variegation 3-9 homolog 2 [Xenopus (Silurana)
           tropicalis]
          Length = 406

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 57/151 (37%), Positives = 81/151 (53%), Gaps = 6/151 (3%)

Query: 87  GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVG 145
           G  +  C+S CKCG  C N+  Q  P   + + +T+   G G+   + IK+  FV+EYVG
Sbjct: 219 GRPIYECNSRCKCGPDCPNRVVQKGPPYSLCIFRTDNGRGWGVKTLQKIKKNSFVMEYVG 278

Query: 146 EVIDDQTCEERLWKMKHLGETNFYLCEINRD-MVIDATYKGNKSRYINHSCCPNTEMQKW 204
           EVI  +  E R  +    G T  +  +   D   +DA   GN S ++NHSC PN ++   
Sbjct: 279 EVITSEEAERRGQQYDSRGITYLFDLDYEADEFTVDAARYGNVSHFVNHSCDPNLQVFNV 338

Query: 205 IIDGE----TRIGIFATRDIKKGENLTYDYQ 231
            ID       RI +F+TR+IK GE LT+DYQ
Sbjct: 339 FIDNLDVRLPRIALFSTRNIKAGEELTFDYQ 369


>gi|341876597|gb|EGT32532.1| hypothetical protein CAEBREN_21874 [Caenorhabditis brenneri]
          Length = 1048

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 60/160 (37%), Positives = 90/160 (56%), Gaps = 14/160 (8%)

Query: 83  DCHC--GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEF 139
           D HC    +++ C S C+    C N+ F  +    ++   T    G G+ A +DIK G+F
Sbjct: 654 DSHCVNRAMMTECPSSCR-AKKCKNQRFAKKKYAAVEPFHTGTAKGCGLRAVKDIKAGKF 712

Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGET---NFYLCEINRDMVIDATYKGNKSRYINHSCC 196
           +IEY+GEV++ +  E+R  K+K+  +    + YLC+      IDAT  GN SR++NHSC 
Sbjct: 713 IIEYIGEVLEREDYEKR--KVKYAADKKHKHHYLCDTGV-YTIDATEFGNPSRFVNHSCD 769

Query: 197 PNTEMQKWIIDGE----TRIGIFATRDIKKGENLTYDYQY 232
           PN   +KW +       +RIG FA R IK GE + +DYQ+
Sbjct: 770 PNAVCEKWSVPKTPGDISRIGFFAKRFIKAGEEICFDYQF 809


>gi|353243391|emb|CCA74938.1| related to regulatory protein SET1 [Piriformospora indica DSM 11827]
          Length = 1224

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 51/130 (39%), Positives = 74/130 (56%)

Query: 114  KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
            K+++  ++     G+ A E I +GE VIEYVGEVI  Q  E+R    +  G  + YL  I
Sbjct: 1085 KQLQFARSPIHDWGLYALERIPKGEMVIEYVGEVIRQQVAEKRERAYERSGIGSSYLFRI 1144

Query: 174  NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
            + D+V+DAT  GN  R INHSC PN   +   I G+ +I I+A  DI  G+ +TYDY + 
Sbjct: 1145 DDDLVVDATKIGNLGRLINHSCDPNCTAKIITIGGQKKIVIYAKVDIHPGDEVTYDYHFP 1204

Query: 234  FLHDSLIAYC 243
              ++ +   C
Sbjct: 1205 IENEKIPCLC 1214


>gi|347840183|emb|CCD54755.1| hypothetical protein [Botryotinia fuckeliana]
          Length = 806

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 58/198 (29%), Positives = 101/198 (51%), Gaps = 22/198 (11%)

Query: 64  DGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCG-NSCLNKPFQ------------ 109
           D  F  C   P +   CD DC   ++L  C  + C  G +SC N+ F             
Sbjct: 399 DSFFSKCICKPDAG--CDEDCQNKIMLYECDDTNCGAGRDSCTNRAFAELVNRRKGNSFR 456

Query: 110 ---NRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGET 166
              N+    +++++T   G G+ ++      + ++EY GE+I +  C+ R+    +    
Sbjct: 457 KGGNKYEIGVEVIKTADRGYGVRSNRCFNANQIIVEYTGEIITEDECDRRM-NEDYKDND 515

Query: 167 NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGEN 225
            +YL   +++M+IDAT +G+ +R++NHSC PN  M KWI++G+ R+ +FA  + I  G+ 
Sbjct: 516 CYYLMSFDQNMIIDAT-RGSIARFVNHSCRPNCRMVKWIVEGKPRMALFAGDNPIMTGDE 574

Query: 226 LTYDYQYEFLHDSLIAYC 243
           LTYDY ++      +  C
Sbjct: 575 LTYDYNFDPFSAKNVQAC 592


>gi|89267215|emb|CAJ81421.1| suppressor of variegation 3-9 homolog 2 [Xenopus (Silurana)
           tropicalis]
          Length = 410

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 57/151 (37%), Positives = 81/151 (53%), Gaps = 6/151 (3%)

Query: 87  GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVG 145
           G  +  C+S CKCG  C N+  Q  P   + + +T+   G G+   + IK+  FV+EYVG
Sbjct: 223 GRPIYECNSRCKCGPDCPNRVVQKGPPYSLCIFRTDNGRGWGVKTLQKIKKNSFVMEYVG 282

Query: 146 EVIDDQTCEERLWKMKHLGETNFYLCEINRD-MVIDATYKGNKSRYINHSCCPNTEMQKW 204
           EVI  +  E R  +    G T  +  +   D   +DA   GN S ++NHSC PN ++   
Sbjct: 283 EVITSEEAERRGQQYDSRGITYLFDLDYEADEFTVDAARYGNVSHFVNHSCDPNLQVFNV 342

Query: 205 IIDGE----TRIGIFATRDIKKGENLTYDYQ 231
            ID       RI +F+TR+IK GE LT+DYQ
Sbjct: 343 FIDNLDVRLPRIALFSTRNIKAGEELTFDYQ 373


>gi|358056651|dbj|GAA97314.1| hypothetical protein E5Q_03992 [Mixia osmundae IAM 14324]
          Length = 844

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 72/209 (34%), Positives = 101/209 (48%), Gaps = 14/209 (6%)

Query: 43  PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGN 101
           PY  I  N Y  ++ +  +     FC   A PG  G C  +C   ++   C    C C +
Sbjct: 305 PYKIIASNAYPERQKQPSVLQAICFCKPPA-PGEVG-CGDNCINRVMAYMCDPKLCPCKD 362

Query: 102 SCLNKPFQNRPVKKMK-------LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCE 154
            C N P Q R     K       +  T   G G+ A   IK G F++EY GEVI      
Sbjct: 363 KCTNGPLQTRKSAAGKGDKEGVAVFYTGSRGFGLKATLPIKSGAFIMEYKGEVISINESY 422

Query: 155 ERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM--QKWIIDGETRI 212
            R+ K+ + G  N+YL   + + V+DA  KGN +R+INHSC PN E+   K+    E +I
Sbjct: 423 RRV-KLIYPG-PNYYLLSYDDEEVLDAGLKGNATRFINHSCDPNCEVVRLKFADYDEFQI 480

Query: 213 GIFATRDIKKGENLTYDYQYEFLHDSLIA 241
           G+FA RDI   E +TY+Y ++    S  A
Sbjct: 481 GLFALRDISPEEEITYNYGWQSFSSSSTA 509


>gi|340959767|gb|EGS20948.1| hypothetical protein CTHT_0027870 [Chaetomium thermophilum var.
            thermophilum DSM 1495]
          Length = 1295

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 51/120 (42%), Positives = 71/120 (59%)

Query: 114  KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
            K +K  ++     G+ A E+I + E +IEYVGE +  Q  E R  +    G  + YL  I
Sbjct: 1153 KPVKFARSAIHNWGLYAMENIPKDEMIIEYVGEEVRQQIAELRESRYLKSGIGSSYLFRI 1212

Query: 174  NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
            + + VIDAT KG  +R+INHSC PN   +   ++G  RI I+A RDI K E LTYDY++E
Sbjct: 1213 DDNTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAKNEELTYDYKFE 1272


>gi|212543321|ref|XP_002151815.1| SET domain protein [Talaromyces marneffei ATCC 18224]
 gi|210066722|gb|EEA20815.1| SET domain protein [Talaromyces marneffei ATCC 18224]
          Length = 1188

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 49/112 (43%), Positives = 68/112 (60%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q  + R  +    G  + YL  I+ + VIDAT +G 
Sbjct: 1059 GLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENAVIDATKRGG 1118

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI K E LTYDY++E   DS
Sbjct: 1119 IARFINHSCTPNCTAKIIRVDGSKRIVIYALRDISKDEELTYDYKFEREWDS 1170


>gi|156351239|ref|XP_001622422.1| predicted protein [Nematostella vectensis]
 gi|156208959|gb|EDO30322.1| predicted protein [Nematostella vectensis]
          Length = 688

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 54/138 (39%), Positives = 76/138 (55%), Gaps = 12/138 (8%)

Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
             +C N   Q    K M L  ++  G GI   + +K+ EF+ EY GEVI     + R   
Sbjct: 538 SKTCKNVSLQRGQRKHMLLAPSDVAGWGIYIKQSVKKNEFISEYCGEVISQDEADRR--- 594

Query: 160 MKHLGET-NFYLCE----INRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGI 214
               G+  + Y+C     +N D V+DAT KGNK R+ NHS  PN   +  +++G+ RIGI
Sbjct: 595 ----GKVYDKYMCSFLFNLNNDFVVDATRKGNKIRFANHSISPNCYAKVMMVNGDHRIGI 650

Query: 215 FATRDIKKGENLTYDYQY 232
           FA RDI+ GE L +DY+Y
Sbjct: 651 FAKRDIEAGEELFFDYRY 668


>gi|147901855|ref|NP_001091337.1| suppressor of variegation 3-9 homolog 2 [Xenopus laevis]
 gi|125858896|gb|AAI29544.1| LOC100037174 protein [Xenopus laevis]
          Length = 406

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 58/157 (36%), Positives = 83/157 (52%), Gaps = 6/157 (3%)

Query: 87  GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVG 145
           G  +  C+S CKCG  C N+  Q  P   + + +T+   G G+   + IK+  FV+EYVG
Sbjct: 219 GRPIFECNSRCKCGPDCPNRVVQKGPPYSLCIFRTDNGRGWGVKTLQKIKKNSFVMEYVG 278

Query: 146 EVIDDQTCEERLWKMKHLGETNFYLCEINRD-MVIDATYKGNKSRYINHSCCPNTEMQKW 204
           EVI  +  E R  +    G T  +  +   D   +DA   GN S ++NHSC PN ++   
Sbjct: 279 EVITSEEAERRGQQYDSKGITYLFDLDYEADEFTVDAARYGNVSHFVNHSCDPNLQVFNV 338

Query: 205 IIDGE----TRIGIFATRDIKKGENLTYDYQYEFLHD 237
            ID       RI +F+TR+IK GE LT+DYQ +   D
Sbjct: 339 FIDNLDVRLPRIALFSTRNIKAGEELTFDYQMKGYGD 375


>gi|296422581|ref|XP_002840838.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295637063|emb|CAZ85029.1| unnamed protein product [Tuber melanosporum]
          Length = 1200

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 50/120 (41%), Positives = 71/120 (59%)

Query: 114  KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
            K +K  ++     G+ A E+I  G+ +IEYVGE+I  Q  + R  K    G  + YL  I
Sbjct: 1058 KPVKFARSAIHNWGLYAMENISAGDMIIEYVGEIIRQQVADMREKKYLKSGIGSSYLFRI 1117

Query: 174  NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
            +   VIDAT  G  +R+INHSC PN   +   ++G  RI I+A RDI++ E LTYDY++E
Sbjct: 1118 DDTTVIDATKAGGIARFINHSCTPNCTAKIIKVEGSKRIVIYALRDIRENEELTYDYKFE 1177


>gi|357139465|ref|XP_003571302.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-4
           specific-like [Brachypodium distachyon]
          Length = 319

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 46/130 (35%), Positives = 81/130 (62%), Gaps = 1/130 (0%)

Query: 114 KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
           KK+++ +++    G++A  DI +GE ++EY+GE +     + R  + +  G+ +++  +I
Sbjct: 181 KKLRVERSKIHELGVMAVTDIMKGELIVEYIGERMPKWVADLRGLRYEKAGKGDYFF-KI 239

Query: 174 NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           +  +VIDAT +G  +RYINHSC PN E +  + +G+ RI I+A + IK G  LTYDY++ 
Sbjct: 240 DAGLVIDATLRGGIARYINHSCEPNCETRVILSNGQRRIFIYANQKIKAGTELTYDYKFP 299

Query: 234 FLHDSLIAYC 243
           F  + +   C
Sbjct: 300 FEENKIPCSC 309


>gi|384484496|gb|EIE76676.1| hypothetical protein RO3G_01380 [Rhizopus delemar RA 99-880]
          Length = 565

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 47/117 (40%), Positives = 70/117 (59%)

Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
           G+ A E I   + VIEY+GEVI  Q  E R    + +G  + YL  ++ DMVIDAT KG 
Sbjct: 439 GLYAGESIDAHDIVIEYIGEVIRQQVAEIREKHYERIGIGSSYLFRVDDDMVIDATKKGG 498

Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
            +R+INH C PN   +   +D + ++ I+A RDI+ GE +TYDY++    + +  +C
Sbjct: 499 MARFINHCCTPNCSAKIITVDKQKKVVIYANRDIEPGEEITYDYKFPIEAEKIPCFC 555


>gi|195438882|ref|XP_002067361.1| GK16377 [Drosophila willistoni]
 gi|194163446|gb|EDW78347.1| GK16377 [Drosophila willistoni]
          Length = 2510

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 62/179 (34%), Positives = 93/179 (51%), Gaps = 25/179 (13%)

Query: 82   RDCHCGMLLSSCSSGCKCGNSC-----LNKPFQNRPV-------KKMK--------LVQT 121
            R  H      SCSS  +  NS      +  P+  + V       KKMK        L ++
Sbjct: 2318 RKPHTQRTAGSCSSTQRMANSAAIAGEVACPYSKQFVHSKSSQYKKMKQEWRNNVYLARS 2377

Query: 122  EKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCE--ERLWKMKHLGETNFYLCEINRDMVI 179
            +  G G+ A  DI++   +IEY+GEVI  +  E  E+ ++ K+ G    Y+  ++ D V+
Sbjct: 2378 KIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQYEAKNRG---IYMFRLDEDRVV 2434

Query: 180  DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
            DAT  G  +RYINHSC PN   +   +D + RI IFA R I +GE L+YDY+++   +S
Sbjct: 2435 DATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDES 2493


>gi|317037780|ref|XP_001399137.2| histone-lysine N-methyltransferase, H3 lysine-4 specific [Aspergillus
            niger CBS 513.88]
          Length = 1239

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 48/112 (42%), Positives = 69/112 (61%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q  + R  +    G  + YL  I+ + VIDAT +G 
Sbjct: 1110 GLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1169

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI++ E LTYDY++E   DS
Sbjct: 1170 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1221


>gi|327349992|gb|EGE78849.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ATCC
           18188]
          Length = 775

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 58/188 (30%), Positives = 98/188 (52%), Gaps = 18/188 (9%)

Query: 68  CSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQN-RPVKKM-------- 116
           C CT   G    CD +C    +   C  + CK G   C N+ F+  R   K+        
Sbjct: 376 CMCTPESG----CDENCQNRYMFYECDDNNCKLGAELCGNRNFEGLRQRSKLGGKYNIGV 431

Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
           ++++T   G G+ ++      + ++EY GE++  + CE R+  +    E  +YL   +R+
Sbjct: 432 EVIKTADRGYGVRSNRTFAPNQIIVEYTGEIVTQEECERRMRTVYKNNEC-YYLMYFDRN 490

Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYEFL 235
           M+IDAT +G+ +R++NHSC PN +++KW + G+ R+ +FA    I  GE LTYDY ++  
Sbjct: 491 MIIDAT-RGSIARFVNHSCEPNCKIEKWTVAGKPRMALFAGEHGIMTGEELTYDYNFDPY 549

Query: 236 HDSLIAYC 243
               +  C
Sbjct: 550 SQKNVQEC 557


>gi|261201264|ref|XP_002627032.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
            SLH14081]
 gi|239592091|gb|EEQ74672.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
            SLH14081]
 gi|239611745|gb|EEQ88732.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ER-3]
          Length = 1259

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 48/112 (42%), Positives = 69/112 (61%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q  + R  +    G  + YL  I+ + VIDAT +G 
Sbjct: 1130 GLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1189

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI++ E LTYDY++E   DS
Sbjct: 1190 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1241


>gi|154278862|ref|XP_001540244.1| hypothetical protein HCAG_04084 [Ajellomyces capsulatus NAm1]
 gi|150412187|gb|EDN07574.1| hypothetical protein HCAG_04084 [Ajellomyces capsulatus NAm1]
          Length = 1266

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 48/112 (42%), Positives = 69/112 (61%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q  + R  +    G  + YL  I+ + VIDAT +G 
Sbjct: 1137 GLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1196

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI++ E LTYDY++E   DS
Sbjct: 1197 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1248


>gi|350630881|gb|EHA19253.1| hypothetical protein ASPNIDRAFT_56859 [Aspergillus niger ATCC 1015]
          Length = 1101

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 48/112 (42%), Positives = 69/112 (61%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q  + R  +    G  + YL  I+ + VIDAT +G 
Sbjct: 972  GLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1031

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI++ E LTYDY++E   DS
Sbjct: 1032 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1083


>gi|255938628|ref|XP_002560084.1| Pc14g00900 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211584705|emb|CAP74231.1| Pc14g00900 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 1202

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 48/112 (42%), Positives = 69/112 (61%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q  + R  +    G  + YL  I+ + VIDAT +G 
Sbjct: 1073 GLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1132

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI++ E LTYDY++E   DS
Sbjct: 1133 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1184


>gi|358338843|dbj|GAA57433.1| histone-lysine N-methyltransferase trithorax, partial [Clonorchis
           sinensis]
          Length = 328

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 50/119 (42%), Positives = 69/119 (57%), Gaps = 1/119 (0%)

Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
           G G+ A    +  E VIEY+GE+I +  CE R  + +  G  + Y+  I+ D+VIDATY 
Sbjct: 201 GRGLFALRGFREDEMVIEYMGELIRNFVCETREIRYRSAG-VDCYMFRIDSDLVIDATYA 259

Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
           GN +R+INHSC PN   +   +D +  I I A R I  GE LTYDY++    D L+  C
Sbjct: 260 GNAARFINHSCDPNCYAKVVTVDDKKHIVILAQRRIYPGEELTYDYRFPKESDKLLCNC 318


>gi|396457972|ref|XP_003833599.1| hypothetical protein LEMA_P063600.1 [Leptosphaeria maculans JN3]
 gi|312210147|emb|CBX90234.1| hypothetical protein LEMA_P063600.1 [Leptosphaeria maculans JN3]
          Length = 1372

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 51/127 (40%), Positives = 74/127 (58%), Gaps = 3/127 (2%)

Query: 110  NRPVKKMKLVQTEKCGA---GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGET 166
            N+  K+ KLV+ ++      G+ A+E+I   + +IEYVGE +  +  + R  K    G  
Sbjct: 1223 NQLKKRKKLVKFDRSAIHNWGLYAEENIAANDMIIEYVGEKVRQRVADLREAKYDQQGVG 1282

Query: 167  NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENL 226
            + YL  I+ D VIDAT  G  +R+INHSC PN   +   +D   RI I+A RDI + E L
Sbjct: 1283 SSYLFRIDEDTVIDATKMGGIARFINHSCTPNCTAKIIRVDTSKRIVIYALRDISQDEEL 1342

Query: 227  TYDYQYE 233
            TYDY++E
Sbjct: 1343 TYDYKFE 1349


>gi|358373521|dbj|GAA90119.1| SET domain protein [Aspergillus kawachii IFO 4308]
          Length = 1239

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 48/112 (42%), Positives = 69/112 (61%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q  + R  +    G  + YL  I+ + VIDAT +G 
Sbjct: 1110 GLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1169

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI++ E LTYDY++E   DS
Sbjct: 1170 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1221


>gi|115400872|ref|XP_001216024.1| hypothetical protein ATEG_07403 [Aspergillus terreus NIH2624]
 gi|114189965|gb|EAU31665.1| hypothetical protein ATEG_07403 [Aspergillus terreus NIH2624]
          Length = 1230

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 48/112 (42%), Positives = 69/112 (61%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q  + R  +    G  + YL  I+ + VIDAT +G 
Sbjct: 1101 GLYAEENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGSSYLFRIDENTVIDATKRGG 1160

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI++ E LTYDY++E   DS
Sbjct: 1161 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1212


>gi|134084734|emb|CAK43391.1| unnamed protein product [Aspergillus niger]
          Length = 1079

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 48/112 (42%), Positives = 69/112 (61%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q  + R  +    G  + YL  I+ + VIDAT +G 
Sbjct: 950  GLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1009

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI++ E LTYDY++E   DS
Sbjct: 1010 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1061


>gi|345567899|gb|EGX50801.1| hypothetical protein AOL_s00054g887 [Arthrobotrys oligospora ATCC
            24927]
          Length = 1338

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 50/112 (44%), Positives = 66/112 (58%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A E+I   E +IEYVGE++  Q  + R       G  + YL  I+   VIDAT KG 
Sbjct: 1209 GLYAMENISNNEMIIEYVGEIVRQQVADLREKNYLRSGIGSSYLFRIDETTVIDATKKGG 1268

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   ++G  RI I+A RDI K E LTYDY++E   DS
Sbjct: 1269 IARFINHSCTPNCTAKIIKVEGTKRIVIYALRDIHKDEELTYDYKFEREIDS 1320


>gi|299117155|emb|CBN75119.1| histone methyltransferase [Ectocarpus siliculosus]
          Length = 2067

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 43/119 (36%), Positives = 75/119 (63%)

Query: 114  KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
            KK++  ++     G+ ADE I  G+ VIEY GE+I +   ++R  + + +   + Y+  +
Sbjct: 1925 KKLRFGRSSVHAWGVFADEPIAAGDLVIEYRGEIIGNAVADKREKQYEDMQIGSDYMFRV 1984

Query: 174  NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            + D V+DAT+KG+ +R+INHSC P+   +   ++G  +I I+A RD+  GE L+YDY++
Sbjct: 1985 DEDTVVDATFKGSLARFINHSCDPSCTTRIITVEGSKKIVIYAERDVAMGEELSYDYKF 2043


>gi|341896557|gb|EGT52492.1| hypothetical protein CAEBREN_15183 [Caenorhabditis brenneri]
          Length = 1112

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 60/146 (41%), Positives = 84/146 (57%), Gaps = 9/146 (6%)

Query: 93  CSSGCKCGNSCLNKPFQNRPVKKMK--LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDD 150
           C   CK  N C N+ F  +   K +   +  EK G G+V+DEDI  G+F+ EY+G   +D
Sbjct: 290 CPRKCKAAN-CQNQRFTEKKYAKTRRAFMGPEK-GYGLVSDEDIPVGKFIYEYLGRATND 347

Query: 151 QTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII---D 207
           +  +E   K     + + YL +     VIDAT KGN+SR+INHSC PN   ++WI+    
Sbjct: 348 EGVKEIQKKNDADSKRHRYLLKSG-TYVIDATDKGNESRFINHSCDPNAVCERWIVPKTP 406

Query: 208 GE-TRIGIFATRDIKKGENLTYDYQY 232
           GE +RIG FA + IKKGE + + Y Y
Sbjct: 407 GELSRIGFFAIKPIKKGEEILFKYNY 432


>gi|119185079|ref|XP_001243361.1| hypothetical protein CIMG_07257 [Coccidioides immitis RS]
 gi|121936913|sp|Q1DR06.1|SET1_COCIM RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|392866240|gb|EAS28850.2| histone-lysine N-methyltransferase, H3 lysine-4 specific
            [Coccidioides immitis RS]
          Length = 1271

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 48/112 (42%), Positives = 68/112 (60%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q  + R  +    G  + YL  I+ + VIDAT +G 
Sbjct: 1142 GLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1201

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI + E LTYDY++E   DS
Sbjct: 1202 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREWDS 1253


>gi|326477398|gb|EGE01408.1| histone-lysine N-methyltransferase [Trichophyton equinum CBS 127.97]
          Length = 1331

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 48/112 (42%), Positives = 68/112 (60%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q  + R  +    G  + YL  I+ + VIDAT  G 
Sbjct: 1202 GLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGG 1261

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI++ E LTYDY++E   DS
Sbjct: 1262 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1313


>gi|326472906|gb|EGD96915.1| histone-lysine N-methyltransferase [Trichophyton tonsurans CBS
            112818]
          Length = 1330

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 48/112 (42%), Positives = 68/112 (60%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q  + R  +    G  + YL  I+ + VIDAT  G 
Sbjct: 1201 GLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGG 1260

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI++ E LTYDY++E   DS
Sbjct: 1261 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1312


>gi|315045626|ref|XP_003172188.1| histone-lysine N-methyltransferase [Arthroderma gypseum CBS 118893]
 gi|311342574|gb|EFR01777.1| histone-lysine N-methyltransferase [Arthroderma gypseum CBS 118893]
          Length = 1334

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 48/112 (42%), Positives = 68/112 (60%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q  + R  +    G  + YL  I+ + VIDAT  G 
Sbjct: 1205 GLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGG 1264

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI++ E LTYDY++E   DS
Sbjct: 1265 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1316


>gi|296805347|ref|XP_002843498.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
 gi|238844800|gb|EEQ34462.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
          Length = 1344

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 48/112 (42%), Positives = 68/112 (60%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q  + R  +    G  + YL  I+ + VIDAT  G 
Sbjct: 1215 GLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGG 1274

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI++ E LTYDY++E   DS
Sbjct: 1275 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1326


>gi|320032561|gb|EFW14513.1| histone-lysine N-methyltransferase [Coccidioides posadasii str.
            Silveira]
          Length = 1271

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 48/112 (42%), Positives = 68/112 (60%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q  + R  +    G  + YL  I+ + VIDAT +G 
Sbjct: 1142 GLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1201

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI + E LTYDY++E   DS
Sbjct: 1202 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREWDS 1253


>gi|449017355|dbj|BAM80757.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 529

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 42/99 (42%), Positives = 62/99 (62%), Gaps = 2/99 (2%)

Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
           +  +F++EY+GEVI       R  K  H  E + Y   +++   +DA+ KG   R++NHS
Sbjct: 66  QESDFIVEYMGEVIGPTELARR--KRDHALERHVYFMTLDQSTFLDASRKGTWGRFLNHS 123

Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
           C PN   QKW++ G+ R+GIFA+RDI  GE LT+DY+ E
Sbjct: 124 CEPNCHTQKWLVLGKVRVGIFASRDIAAGEELTFDYRME 162


>gi|303313714|ref|XP_003066866.1| SET domain containing protein [Coccidioides posadasii C735 delta
            SOWgp]
 gi|240106533|gb|EER24721.1| SET domain containing protein [Coccidioides posadasii C735 delta
            SOWgp]
          Length = 1271

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 48/112 (42%), Positives = 68/112 (60%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q  + R  +    G  + YL  I+ + VIDAT +G 
Sbjct: 1142 GLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1201

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI + E LTYDY++E   DS
Sbjct: 1202 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREWDS 1253


>gi|255558564|ref|XP_002520307.1| huntingtin interacting protein, putative [Ricinus communis]
 gi|223540526|gb|EEF42093.1| huntingtin interacting protein, putative [Ricinus communis]
          Length = 1746

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 62/190 (32%), Positives = 83/190 (43%), Gaps = 40/190 (21%)

Query: 44  YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
           +  I  N +L +  K +  D+ + C C         C  +C   ML   C  G C CG+ 
Sbjct: 801 FTRISTNQFLHRSRKTQTIDEIMVCHCKLPLDGRLGCRDECLNRMLNIECVRGTCPCGDL 860

Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
           C N+                                       +V+D  T E R  +   
Sbjct: 861 CSNQ---------------------------------------QVLDMHTYEARQREYAF 881

Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
            G  +FY   +N   VIDA  KGN  R+INHSC PN   +KW+++GE  IG+FA RDIKK
Sbjct: 882 QGHKHFYFMTLNGSEVIDACAKGNLGRFINHSCDPNCRTEKWVVNGEICIGLFALRDIKK 941

Query: 223 GENLTYDYQY 232
           GE LT+DY Y
Sbjct: 942 GEELTFDYNY 951


>gi|302666919|ref|XP_003025054.1| hypothetical protein TRV_00712 [Trichophyton verrucosum HKI 0517]
 gi|291189136|gb|EFE44443.1| hypothetical protein TRV_00712 [Trichophyton verrucosum HKI 0517]
          Length = 1376

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 48/112 (42%), Positives = 68/112 (60%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q  + R  +    G  + YL  I+ + VIDAT  G 
Sbjct: 1247 GLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGG 1306

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI++ E LTYDY++E   DS
Sbjct: 1307 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1358


>gi|327348240|gb|EGE77097.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ATCC
            18188]
          Length = 1280

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 48/112 (42%), Positives = 68/112 (60%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q    R  +    G  + YL  I+ + VIDAT +G 
Sbjct: 1151 GLYAEENISANDMIIEYVGEKVRQQVANMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1210

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI++ E LTYDY++E   DS
Sbjct: 1211 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1262


>gi|121709862|ref|XP_001272547.1| SET domain protein [Aspergillus clavatus NRRL 1]
 gi|119400697|gb|EAW11121.1| SET domain protein [Aspergillus clavatus NRRL 1]
          Length = 1232

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 48/112 (42%), Positives = 69/112 (61%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q  + R  +    G  + YL  I+ + VIDAT +G 
Sbjct: 1103 GLYAEENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGSSYLFRIDENTVIDATKRGG 1162

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI++ E LTYDY++E   DS
Sbjct: 1163 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1214


>gi|325089235|gb|EGC42545.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus H88]
          Length = 1267

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 48/112 (42%), Positives = 69/112 (61%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q  + R  +    G  + YL  I+ + VIDAT +G 
Sbjct: 1138 GLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1197

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI++ E LTYDY++E   DS
Sbjct: 1198 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1249


>gi|169769549|ref|XP_001819244.1| histone-lysine N-methyltransferase, H3 lysine-4 specific [Aspergillus
            oryzae RIB40]
 gi|121933328|sp|Q2UMH3.1|SET1_ASPOR RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
            specific; AltName: Full=COMPASS component SET1; AltName:
            Full=SET domain-containing protein 1
 gi|83767103|dbj|BAE57242.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 1229

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 48/112 (42%), Positives = 69/112 (61%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q  + R  +    G  + YL  I+ + VIDAT +G 
Sbjct: 1100 GLYAEENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGSSYLFRIDENTVIDATKRGG 1159

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI++ E LTYDY++E   DS
Sbjct: 1160 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1211


>gi|327304525|ref|XP_003236954.1| histone-lysine N-methyltransferase [Trichophyton rubrum CBS 118892]
 gi|326459952|gb|EGD85405.1| histone-lysine N-methyltransferase [Trichophyton rubrum CBS 118892]
          Length = 1337

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 48/112 (42%), Positives = 68/112 (60%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q  + R  +    G  + YL  I+ + VIDAT  G 
Sbjct: 1208 GLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGG 1267

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI++ E LTYDY++E   DS
Sbjct: 1268 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1319


>gi|391863483|gb|EIT72791.1| histone H3 (Lys4) methyltransferase complex, subunit SET1
            [Aspergillus oryzae 3.042]
          Length = 1223

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 48/112 (42%), Positives = 69/112 (61%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q  + R  +    G  + YL  I+ + VIDAT +G 
Sbjct: 1094 GLYAEENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGSSYLFRIDENTVIDATKRGG 1153

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI++ E LTYDY++E   DS
Sbjct: 1154 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1205


>gi|225554361|gb|EEH02660.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus G186AR]
          Length = 1267

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 48/112 (42%), Positives = 69/112 (61%)

Query: 127  GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
            G+ A+E+I   + +IEYVGE +  Q  + R  +    G  + YL  I+ + VIDAT +G 
Sbjct: 1138 GLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1197

Query: 187  KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
             +R+INHSC PN   +   +DG  RI I+A RDI++ E LTYDY++E   DS
Sbjct: 1198 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1249


>gi|449295340|gb|EMC91362.1| hypothetical protein BAUCODRAFT_80239 [Baudoinia compniacensis UAMH
            10762]
          Length = 1279

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 49/127 (38%), Positives = 77/127 (60%), Gaps = 3/127 (2%)

Query: 110  NRPVKKMKLVQTEKC---GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGET 166
            N+  K+ KLV+ ++    G G+ A+E+I   + +IEYVGE +  +  + R  + +  G  
Sbjct: 1130 NQLKKRKKLVKFDRSAIHGWGLYAEENITVNDLIIEYVGEKVRQKVADLRELRYEKQGVG 1189

Query: 167  NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENL 226
            + YL  +  D ++DAT KG  +R+INHSC PN   +   ++G  RI I+A +DI K E L
Sbjct: 1190 SSYLFRMMDDEIVDATKKGGIARFINHSCSPNCTAKIIKVEGTPRIVIYALKDIGKNEEL 1249

Query: 227  TYDYQYE 233
            TYDY++E
Sbjct: 1250 TYDYKFE 1256


>gi|71029610|ref|XP_764448.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68351402|gb|EAN32165.1| hypothetical protein TP04_0811 [Theileria parva]
          Length = 995

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 60/175 (34%), Positives = 92/175 (52%), Gaps = 13/175 (7%)

Query: 71  TASPGSSGVCDRDCHCGMLLSSCSSGCKCGN--------SCLNKPFQNRPVKKMKLVQTE 122
             +P +   C  D  CG   S+ +   +C          +C N+ F +    K++L   +
Sbjct: 655 VVAPEAEMKCHCDKKCGSDCSNVTKNIECTVKNCGLADVNCGNRRFAHFSGPKLRLNYVD 714

Query: 123 KCGAGIVADEDIKRGEFVIEYVGEVI---DDQTC--EERLWKMKHLGETNFYLCEINRDM 177
             G G VA E+I  GE V EYVGEVI   D Q C       ++    ++++Y+ +I+RD 
Sbjct: 715 GKGVGAVATEEIGEGELVCEYVGEVISQADFQRCLASASFAEIDDGNQSHWYVMKIHRDT 774

Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
            ID+T+ GN +R+INHSC PN       + G  R+G+FA R IK+ E +TY+Y +
Sbjct: 775 YIDSTHLGNVARFINHSCDPNCASVPINVKGTYRMGVFALRKIKQDEEVTYNYGF 829


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.322    0.139    0.444 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,957,617,097
Number of Sequences: 23463169
Number of extensions: 163495975
Number of successful extensions: 343004
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 3845
Number of HSP's successfully gapped in prelim test: 1338
Number of HSP's that attempted gapping in prelim test: 331477
Number of HSP's gapped (non-prelim): 7075
length of query: 243
length of database: 8,064,228,071
effective HSP length: 138
effective length of query: 105
effective length of database: 9,121,278,045
effective search space: 957734194725
effective search space used: 957734194725
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 75 (33.5 bits)