BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 026129
(243 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|449505027|ref|XP_004162355.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Cucumis
sativus]
Length = 373
Score = 419 bits (1076), Expect = e-115, Method: Compositional matrix adjust.
Identities = 198/233 (84%), Positives = 217/233 (93%), Gaps = 1/233 (0%)
Query: 1 MPAAKKNSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRR 60
MPA KKNS+ SRIG+ F+KL++QIGNPV+FELPDWF K K +PY FI+RNIYLTK+ KRR
Sbjct: 1 MPAIKKNSECSRIGNVFHKLMRQIGNPVDFELPDWFSKWKPMPYTFIRRNIYLTKKFKRR 60
Query: 61 LEDDGIFCSCTASPGSSGVC-DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLV 119
+EDDGIFCSC+ SPGS GVC D+DCHCGMLLSSCSSGCKCG CLNKPFQ+RPVKKMKLV
Sbjct: 61 VEDDGIFCSCSPSPGSPGVCCDKDCHCGMLLSSCSSGCKCGVLCLNKPFQHRPVKKMKLV 120
Query: 120 QTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVI 179
+TEKCG+GIVADEDIK+GEFVIEYVGEVIDD+TCEERLW MKH GETNFYLCEINRDMVI
Sbjct: 121 KTEKCGSGIVADEDIKQGEFVIEYVGEVIDDKTCEERLWNMKHRGETNFYLCEINRDMVI 180
Query: 180 DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDI KGE+LTYDYQ+
Sbjct: 181 DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIPKGEHLTYDYQF 233
>gi|449442399|ref|XP_004138969.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Cucumis
sativus]
Length = 373
Score = 418 bits (1075), Expect = e-115, Method: Compositional matrix adjust.
Identities = 198/233 (84%), Positives = 216/233 (92%), Gaps = 1/233 (0%)
Query: 1 MPAAKKNSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRR 60
MPA KKNS+ SRIG+ F+KL++QIGNPV FELPDWF K K +PY FI+RNIYLTK+ KRR
Sbjct: 1 MPAIKKNSECSRIGNVFHKLMRQIGNPVNFELPDWFSKWKPMPYTFIRRNIYLTKKFKRR 60
Query: 61 LEDDGIFCSCTASPGSSGVC-DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLV 119
+EDDGIFCSC+ SPGS GVC D+DCHCGMLLSSCSSGCKCG CLNKPFQ+RPVKKMKLV
Sbjct: 61 VEDDGIFCSCSPSPGSPGVCCDKDCHCGMLLSSCSSGCKCGVLCLNKPFQHRPVKKMKLV 120
Query: 120 QTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVI 179
+TEKCG+GIVADEDIK+GEFVIEYVGEVIDD+TCEERLW MKH GETNFYLCEINRDMVI
Sbjct: 121 KTEKCGSGIVADEDIKQGEFVIEYVGEVIDDKTCEERLWNMKHRGETNFYLCEINRDMVI 180
Query: 180 DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDI KGE+LTYDYQ+
Sbjct: 181 DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIPKGEHLTYDYQF 233
>gi|224117806|ref|XP_002331636.1| SET domain protein [Populus trichocarpa]
gi|222874032|gb|EEF11163.1| SET domain protein [Populus trichocarpa]
Length = 351
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 199/232 (85%), Positives = 212/232 (91%), Gaps = 1/232 (0%)
Query: 1 MPAAKKNSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRR 60
MPAAKKN D RI AFNKLLK+IGN VEFELPD F K K+ Y IKRNIYLTK+IKRR
Sbjct: 1 MPAAKKNLDQ-RITLAFNKLLKEIGNQVEFELPDAFNKSKSTAYTLIKRNIYLTKKIKRR 59
Query: 61 LEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQ 120
LEDDGIFCSC+AS GSS VCDRDCHCGMLLSSCSSGCKCG+SC NKPFQ+RP+KKMKLVQ
Sbjct: 60 LEDDGIFCSCSASLGSSRVCDRDCHCGMLLSSCSSGCKCGSSCSNKPFQHRPLKKMKLVQ 119
Query: 121 TEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVID 180
TEKCG+GIVADEDIK+GEFVIEYVGEVIDD TCEERLWKMKH GETNFYLCEINR+MVID
Sbjct: 120 TEKCGSGIVADEDIKQGEFVIEYVGEVIDDNTCEERLWKMKHRGETNFYLCEINRNMVID 179
Query: 181 ATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
ATYKGNKSRYINHSC PNTEMQKWIIDGETRIGIFAT DI+KGE+LTYDYQ+
Sbjct: 180 ATYKGNKSRYINHSCSPNTEMQKWIIDGETRIGIFATHDIRKGEHLTYDYQF 231
>gi|356530969|ref|XP_003534051.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Glycine
max]
Length = 349
Score = 407 bits (1045), Expect = e-111, Method: Compositional matrix adjust.
Identities = 192/233 (82%), Positives = 210/233 (90%), Gaps = 1/233 (0%)
Query: 1 MPAAKKNSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKR- 59
MPA KKN + + G FNKL K++G PV+FELPDWF K K + Y +IKRNIYLTK++KR
Sbjct: 1 MPAMKKNPEQTHFGSVFNKLGKELGEPVDFELPDWFNKSKPMQYTYIKRNIYLTKKVKRS 60
Query: 60 RLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLV 119
R +DDGIFCSCT SPGS+ VC RDCHCGMLLSSCSSGCKCG+SCLNKPFQNRPVKKMKLV
Sbjct: 61 RFDDDGIFCSCTPSPGSTSVCGRDCHCGMLLSSCSSGCKCGSSCLNKPFQNRPVKKMKLV 120
Query: 120 QTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVI 179
+TEKCG+GIVADEDIK GEFVIEYVGEVIDD+TCEERLW MKH GETNFYLCEINRDMVI
Sbjct: 121 KTEKCGSGIVADEDIKLGEFVIEYVGEVIDDKTCEERLWNMKHRGETNFYLCEINRDMVI 180
Query: 180 DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT DI+KGE+LTYDYQ+
Sbjct: 181 DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATSDIQKGEHLTYDYQF 233
>gi|356559949|ref|XP_003548258.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Glycine
max]
Length = 349
Score = 402 bits (1032), Expect = e-110, Method: Compositional matrix adjust.
Identities = 192/233 (82%), Positives = 210/233 (90%), Gaps = 1/233 (0%)
Query: 1 MPAAKKNSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKR- 59
MPA KKN + + IG FN L K++G PV+FELPD F K K + Y +IKRNIYLTK++KR
Sbjct: 1 MPAMKKNPELTCIGSVFNMLGKELGEPVDFELPDSFNKSKPMQYTYIKRNIYLTKKVKRS 60
Query: 60 RLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLV 119
R +DDGIFCSCT SPGS+ VC RDCHCGMLLSSCSSGCKCG+SCLNKPFQNRPVKKMKLV
Sbjct: 61 RFDDDGIFCSCTPSPGSTSVCGRDCHCGMLLSSCSSGCKCGSSCLNKPFQNRPVKKMKLV 120
Query: 120 QTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVI 179
+TEKCG+GIVADEDIK GEFVIEYVGEVIDD+TCEERLW MKH GETNFYLCEINRDMVI
Sbjct: 121 KTEKCGSGIVADEDIKLGEFVIEYVGEVIDDKTCEERLWNMKHSGETNFYLCEINRDMVI 180
Query: 180 DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDI+KGE+LTYDYQ+
Sbjct: 181 DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIQKGEHLTYDYQF 233
>gi|297737225|emb|CBI26426.3| unnamed protein product [Vitis vinifera]
Length = 438
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 190/235 (80%), Positives = 211/235 (89%), Gaps = 3/235 (1%)
Query: 1 MPAAKK---NSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRI 57
MP+AKK N + +G AF+KLLK++GNPV+FELP F K K Y FI+RNIYLTK+I
Sbjct: 83 MPSAKKLKQNIEQKGVGGAFSKLLKELGNPVDFELPSSFNKWKPTSYTFIRRNIYLTKKI 142
Query: 58 KRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMK 117
KRRLEDDGIFCSC++ GSSGVC RDC CGML SSCSSGCKCG SCLNKPFQ+RPVKKMK
Sbjct: 143 KRRLEDDGIFCSCSSGSGSSGVCGRDCLCGMLQSSCSSGCKCGTSCLNKPFQSRPVKKMK 202
Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
+V+TEKCG+GIVADEDIK+GEFVIEYVGEVIDD+TCE+RLWKMKHLGETNFYLCEINRDM
Sbjct: 203 MVETEKCGSGIVADEDIKQGEFVIEYVGEVIDDKTCEDRLWKMKHLGETNFYLCEINRDM 262
Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
VIDATYKGNKSRYINHSC PNTEMQKW IDGETRIGIFATRDIK+GE+LTYDYQ+
Sbjct: 263 VIDATYKGNKSRYINHSCDPNTEMQKWRIDGETRIGIFATRDIKRGEHLTYDYQF 317
>gi|359489946|ref|XP_002268035.2| PREDICTED: histone-lysine N-methyltransferase ASHH3-like [Vitis
vinifera]
Length = 377
Score = 368 bits (945), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 186/228 (81%), Positives = 207/228 (90%)
Query: 5 KKNSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDD 64
K+N + +G AF+KLLK++GNPV+FELP F K K Y FI+RNIYLTK+IKRRLEDD
Sbjct: 13 KQNIEQKGVGGAFSKLLKELGNPVDFELPSSFNKWKPTSYTFIRRNIYLTKKIKRRLEDD 72
Query: 65 GIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKC 124
GIFCSC++ GSSGVC RDC CGML SSCSSGCKCG SCLNKPFQ+RPVKKMK+V+TEKC
Sbjct: 73 GIFCSCSSGSGSSGVCGRDCLCGMLQSSCSSGCKCGTSCLNKPFQSRPVKKMKMVETEKC 132
Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
G+GIVADEDIK+GEFVIEYVGEVIDD+TCE+RLWKMKHLGETNFYLCEINRDMVIDATYK
Sbjct: 133 GSGIVADEDIKQGEFVIEYVGEVIDDKTCEDRLWKMKHLGETNFYLCEINRDMVIDATYK 192
Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
GNKSRYINHSC PNTEMQKW IDGETRIGIFATRDIK+GE+LTYDYQ+
Sbjct: 193 GNKSRYINHSCDPNTEMQKWRIDGETRIGIFATRDIKRGEHLTYDYQF 240
>gi|357498513|ref|XP_003619545.1| Histone-lysine N-methyltransferase ASHH3 [Medicago truncatula]
gi|355494560|gb|AES75763.1| Histone-lysine N-methyltransferase ASHH3 [Medicago truncatula]
Length = 348
Score = 355 bits (910), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 175/232 (75%), Positives = 195/232 (84%), Gaps = 3/232 (1%)
Query: 1 MPAAKKNSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRR 60
MP KKNS+ +R F KL+ +G V+FELPD IK + Y IK ++Y+TK KR+
Sbjct: 1 MPIMKKNSEQNRFEAVFTKLMNNLGETVDFELPD-SIKNRTTQYTHIK-HVYVTKNRKRK 58
Query: 61 LEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQ 120
DDGIFCSC +S S+ VC RDCHCGMLLSSCSSGCKCG SCLNK FQ+RPVKKMKLV+
Sbjct: 59 -SDDGIFCSCESSSDSTAVCGRDCHCGMLLSSCSSGCKCGISCLNKAFQHRPVKKMKLVK 117
Query: 121 TEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVID 180
TEKCG+GIVADEDIK GEFVIEYVGEVIDD+TCE+RLW MK GETNFYLCEINRDMVID
Sbjct: 118 TEKCGSGIVADEDIKLGEFVIEYVGEVIDDKTCEQRLWNMKDRGETNFYLCEINRDMVID 177
Query: 181 ATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
AT KGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA+RDIKKGE+LTYDYQ+
Sbjct: 178 ATNKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFASRDIKKGEHLTYDYQF 229
>gi|297824409|ref|XP_002880087.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297325926|gb|EFH56346.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 363
Score = 337 bits (863), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 179/234 (76%), Positives = 200/234 (85%), Gaps = 2/234 (0%)
Query: 1 MPAAKKNSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRR 60
MPA KK SD + +G FNKLL QIG EFELPDW K K PY+FI+RNIYLTK++KRR
Sbjct: 1 MPATKKISDRNHLGQVFNKLLNQIGESEEFELPDWLNKGKPTPYIFIRRNIYLTKKVKRR 60
Query: 61 LEDDGIFCSCTASPGSS--GVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKL 118
+EDDGIFCSC++S S VC +CHCGML SSCSS CKCG+ C NKPFQ R VKKMKL
Sbjct: 61 VEDDGIFCSCSSSSPGSSSTVCGSNCHCGMLFSSCSSSCKCGSECNNKPFQQRHVKKMKL 120
Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
+QTEKCG+GIVA+E+IK GEF+IEYVGEVIDD+TCEERLWKMKH GETNFYLCEI RDMV
Sbjct: 121 IQTEKCGSGIVAEEEIKPGEFIIEYVGEVIDDKTCEERLWKMKHRGETNFYLCEITRDMV 180
Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
IDAT+KGNKSRYINHSC PNT+MQKWIIDGETRIGIFATR IKKGE+LTYDYQ+
Sbjct: 181 IDATHKGNKSRYINHSCNPNTQMQKWIIDGETRIGIFATRGIKKGEHLTYDYQF 234
>gi|18406465|ref|NP_566010.1| histone-lysine N-methyltransferase ASHH3 [Arabidopsis thaliana]
gi|94707125|sp|Q945S8.2|ASHH3_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHH3; AltName:
Full=ASH1 homolog 3; AltName: Full=Protein SET DOMAIN
GROUP 7
gi|15028059|gb|AAK76560.1| unknown protein [Arabidopsis thaliana]
gi|20197070|gb|AAC23419.2| expressed protein [Arabidopsis thaliana]
gi|20259301|gb|AAM14386.1| unknown protein [Arabidopsis thaliana]
gi|330255289|gb|AEC10383.1| histone-lysine N-methyltransferase ASHH3 [Arabidopsis thaliana]
Length = 363
Score = 333 bits (853), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 176/234 (75%), Positives = 201/234 (85%), Gaps = 2/234 (0%)
Query: 1 MPAAKKNSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRR 60
MPA+KK SD + +G F+KLL QIG EFELP+W K K PY+FI+RNIYLTK++KRR
Sbjct: 1 MPASKKISDRNHLGQVFDKLLNQIGESEEFELPEWLNKGKPTPYIFIRRNIYLTKKVKRR 60
Query: 61 LEDDGIFCSCTASPGSS--GVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKL 118
+EDDGIFCSC++S S VC +CHCGML SSCSS CKCG+ C NKPFQ R VKKMKL
Sbjct: 61 VEDDGIFCSCSSSSPGSSSTVCGSNCHCGMLFSSCSSSCKCGSECNNKPFQQRHVKKMKL 120
Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
+QTEKCG+GIVA+E+I+ GEF+IEYVGEVIDD+TCEERLWKMKH GETNFYLCEI RDMV
Sbjct: 121 IQTEKCGSGIVAEEEIEAGEFIIEYVGEVIDDKTCEERLWKMKHRGETNFYLCEITRDMV 180
Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
IDAT+KGNKSRYINHSC PNT+MQKWIIDGETRIGIFATR IKKGE+LTYDYQ+
Sbjct: 181 IDATHKGNKSRYINHSCNPNTQMQKWIIDGETRIGIFATRGIKKGEHLTYDYQF 234
>gi|218201888|gb|EEC84315.1| hypothetical protein OsI_30811 [Oryza sativa Indica Group]
Length = 360
Score = 304 bits (779), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 145/218 (66%), Positives = 178/218 (81%), Gaps = 4/218 (1%)
Query: 15 HAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASP 74
+ F++L+K I PV+F+LP + K+ YV IKRN Y+T++ R+EDDGIFCSCT S
Sbjct: 13 NVFDQLIKAIKAPVDFDLPPVLKEWKSNYYVPIKRNAYMTRK---RVEDDGIFCSCTPS- 68
Query: 75 GSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDI 134
GSS CD+DC CGML S CSS CKC N C NKPFQ+R ++K KL++TEKCG G+VA+EDI
Sbjct: 69 GSSATCDKDCQCGMLFSCCSSTCKCENKCANKPFQHRTLRKTKLIKTEKCGNGVVAEEDI 128
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K+GEFVIEYVGEVIDD+TCE+RLWKMK G+TNFYLCE++ +MVIDAT KGN SR+INHS
Sbjct: 129 KKGEFVIEYVGEVIDDRTCEQRLWKMKRQGDTNFYLCEVSSNMVIDATNKGNMSRFINHS 188
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PNTEMQKW ++GETR+GIFA RDIK GE LTYDY++
Sbjct: 189 CEPNTEMQKWTVEGETRVGIFALRDIKTGEELTYDYKF 226
>gi|222641285|gb|EEE69417.1| hypothetical protein OsJ_28789 [Oryza sativa Japonica Group]
Length = 360
Score = 304 bits (778), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 145/218 (66%), Positives = 178/218 (81%), Gaps = 4/218 (1%)
Query: 15 HAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASP 74
+ F++L+K I PV+F+LP + K+ YV IKRN Y+T++ R+EDDGIFCSCT S
Sbjct: 13 NVFDQLIKAIKAPVDFDLPPVLKEWKSNYYVPIKRNAYITRK---RVEDDGIFCSCTPS- 68
Query: 75 GSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDI 134
GSS CD+DC CGML S CSS CKC N C NKPFQ+R ++K KL++TEKCG G+VA+EDI
Sbjct: 69 GSSATCDKDCQCGMLFSCCSSTCKCENKCANKPFQHRTLRKTKLIKTEKCGNGVVAEEDI 128
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K+GEFVIEYVGEVIDD+TCE+RLWKMK G+TNFYLCE++ +MVIDAT KGN SR+INHS
Sbjct: 129 KKGEFVIEYVGEVIDDRTCEQRLWKMKRQGDTNFYLCEVSSNMVIDATNKGNMSRFINHS 188
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PNTEMQKW ++GETR+GIFA RDIK GE LTYDY++
Sbjct: 189 CEPNTEMQKWTVEGETRVGIFALRDIKTGEELTYDYKF 226
>gi|115478464|ref|NP_001062827.1| Os09g0307800 [Oryza sativa Japonica Group]
gi|51091678|dbj|BAD36461.1| putative SET domain protein 110 [Oryza sativa Japonica Group]
gi|51091893|dbj|BAD36704.1| putative SET domain protein 110 [Oryza sativa Japonica Group]
gi|113631060|dbj|BAF24741.1| Os09g0307800 [Oryza sativa Japonica Group]
Length = 340
Score = 304 bits (778), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 145/218 (66%), Positives = 178/218 (81%), Gaps = 4/218 (1%)
Query: 15 HAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASP 74
+ F++L+K I PV+F+LP + K+ YV IKRN Y+T++ R+EDDGIFCSCT S
Sbjct: 13 NVFDQLIKAIKAPVDFDLPPVLKEWKSNYYVPIKRNAYITRK---RVEDDGIFCSCTPS- 68
Query: 75 GSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDI 134
GSS CD+DC CGML S CSS CKC N C NKPFQ+R ++K KL++TEKCG G+VA+EDI
Sbjct: 69 GSSATCDKDCQCGMLFSCCSSTCKCENKCANKPFQHRTLRKTKLIKTEKCGNGVVAEEDI 128
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K+GEFVIEYVGEVIDD+TCE+RLWKMK G+TNFYLCE++ +MVIDAT KGN SR+INHS
Sbjct: 129 KKGEFVIEYVGEVIDDRTCEQRLWKMKRQGDTNFYLCEVSSNMVIDATNKGNMSRFINHS 188
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PNTEMQKW ++GETR+GIFA RDIK GE LTYDY++
Sbjct: 189 CEPNTEMQKWTVEGETRVGIFALRDIKTGEELTYDYKF 226
>gi|326496078|dbj|BAJ90660.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 362
Score = 294 bits (752), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 141/221 (63%), Positives = 173/221 (78%), Gaps = 3/221 (1%)
Query: 12 RIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCT 71
R+ K LK + V+F+LP + K Y+ IKRN+YLTK+ R+EDDGIFCSC+
Sbjct: 29 RVFDELTKALKDPADFVDFDLPSALKEWKLGYYIPIKRNVYLTKK---RVEDDGIFCSCS 85
Query: 72 ASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVAD 131
S GS C +DC CGML S CSS CKCGN+C NK FQ RP+ K KL++TEKCG G++A+
Sbjct: 86 LSSGSPVTCGKDCQCGMLFSCCSSNCKCGNNCANKSFQLRPLFKTKLIKTEKCGFGLIAE 145
Query: 132 EDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYI 191
++IK+GEFVIEYVGEVIDD+TCEERLWKMK TNFYLCE++ +MVIDAT KGNKSR+I
Sbjct: 146 DEIKKGEFVIEYVGEVIDDRTCEERLWKMKRQRYTNFYLCEVSSNMVIDATNKGNKSRFI 205
Query: 192 NHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
NHSC PNTEMQKW +DGETR+GIFA RDI++GE LTYDY++
Sbjct: 206 NHSCEPNTEMQKWTVDGETRVGIFALRDIERGEELTYDYKF 246
>gi|357157974|ref|XP_003577976.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like
[Brachypodium distachyon]
Length = 338
Score = 293 bits (751), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 144/220 (65%), Positives = 173/220 (78%), Gaps = 6/220 (2%)
Query: 16 AFNKLLKQIGNP---VEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTA 72
FNKL +++ +P V+F+LP + K Y+ IKRN+YLTK+ R+EDDGIFCSC
Sbjct: 6 VFNKLTRKLQDPADFVDFDLPPALKEWKLGYYIPIKRNVYLTKK---RVEDDGIFCSCPF 62
Query: 73 SPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADE 132
GSS C +DC CGML S CSS CKC N C NK FQ RP+KK KL++TEKCG G+VAD+
Sbjct: 63 ISGSSVACGKDCQCGMLFSCCSSNCKCENRCANKSFQLRPLKKTKLIKTEKCGFGLVADD 122
Query: 133 DIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYIN 192
I++GEF+IEYVGEVIDD+TCEERLWKMK TNFYLCE++ +MVIDAT KGNKSR+IN
Sbjct: 123 GIQKGEFIIEYVGEVIDDRTCEERLWKMKRQRYTNFYLCEVSSNMVIDATNKGNKSRFIN 182
Query: 193 HSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
HSC PNTEMQKW +DGETR+GIFA RDIKKGE LTYDY++
Sbjct: 183 HSCQPNTEMQKWTVDGETRVGIFALRDIKKGEELTYDYKF 222
>gi|357116306|ref|XP_003559923.1| PREDICTED: histone-lysine N-methyltransferase ASHH3-like
[Brachypodium distachyon]
Length = 349
Score = 293 bits (750), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 143/220 (65%), Positives = 172/220 (78%), Gaps = 6/220 (2%)
Query: 16 AFNKLLKQIGNP---VEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTA 72
FNKL K++ +P V+F+LP + K Y+ IKRN+YL +K+R+EDDGIFC+C
Sbjct: 17 VFNKLTKKLQDPADFVDFDLPSALKEWKLGYYIPIKRNVYL---MKKRVEDDGIFCACPF 73
Query: 73 SPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADE 132
GSS C +DC CGML S CSS CKC N C NK FQ RP+KK KL++TEKCG G+VADE
Sbjct: 74 ISGSSVACGKDCQCGMLFSCCSSNCKCENRCANKSFQLRPMKKTKLIKTEKCGFGLVADE 133
Query: 133 DIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYIN 192
I++GEF+IEYVGEVIDD+TCEERLWKMK TNFYLCE++ +MVIDAT KGNKSR+IN
Sbjct: 134 GIQQGEFIIEYVGEVIDDRTCEERLWKMKRQRYTNFYLCEVSSNMVIDATNKGNKSRFIN 193
Query: 193 HSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
HSC PNTEMQKW +DGETR+GIFA DIKKGE LTYDY++
Sbjct: 194 HSCQPNTEMQKWTVDGETRVGIFALHDIKKGEELTYDYKF 233
>gi|15232214|ref|NP_191555.1| putative histone-lysine N-methyltransferase ASHH4 [Arabidopsis
thaliana]
gi|75264575|sp|Q9M1X9.1|ASHH4_ARATH RecName: Full=Putative histone-lysine N-methyltransferase ASHH4;
AltName: Full=ASH1 homolog 4; AltName: Full=Protein SET
DOMAIN GROUP 24
gi|7019690|emb|CAB75815.1| putative protein [Arabidopsis thaliana]
gi|332646470|gb|AEE79991.1| putative histone-lysine N-methyltransferase ASHH4 [Arabidopsis
thaliana]
Length = 352
Score = 290 bits (743), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 162/245 (66%), Positives = 189/245 (77%), Gaps = 7/245 (2%)
Query: 1 MPAAKKNSDNSRIGHAFNKLLKQIGNPVEFELPDWF--IKPKAIPYVFIKRNIYLTKRIK 58
M ++KK SD ++I + KL KQIG + E PD +KP +FIKRNIYL K++K
Sbjct: 1 MSSSKKGSDRNQIRKSLRKLKKQIGELEKLESPDRLNNVKP-----IFIKRNIYLKKKLK 55
Query: 59 RRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKL 118
++++D GIFCSC+ PGSS +C DC+CG+LLSSCSS CKC + C NKPFQ R +KKMKL
Sbjct: 56 KKVKDHGIFCSCSLDPGSSTLCGSDCNCGILLSSCSSSCKCSSECTNKPFQQRHIKKMKL 115
Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
VQTEKCG GIVADEDI GEF+IEYVGEVIDD+ CEERLWK+ H ETNFYLC+IN +MV
Sbjct: 116 VQTEKCGYGIVADEDINSGEFIIEYVGEVIDDKICEERLWKLNHKVETNFYLCQINWNMV 175
Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
IDAT+KGNKSRYINHSC PNTEMQKWIIDGETRIGIFATR I KGE LTYDYQ+
Sbjct: 176 IDATHKGNKSRYINHSCSPNTEMQKWIIDGETRIGIFATRFINKGEQLTYDYQFVQFGAD 235
Query: 239 LIAYC 243
YC
Sbjct: 236 QDCYC 240
>gi|224095256|ref|XP_002310367.1| SET domain protein [Populus trichocarpa]
gi|222853270|gb|EEE90817.1| SET domain protein [Populus trichocarpa]
Length = 281
Score = 283 bits (723), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 133/145 (91%), Positives = 141/145 (97%)
Query: 88 MLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEV 147
MLLSSCSSGCKCG+SC NKPFQ+R VKKMKLVQTEKCG+GIVADEDIK+GEFVIEYVGEV
Sbjct: 1 MLLSSCSSGCKCGSSCSNKPFQHRQVKKMKLVQTEKCGSGIVADEDIKQGEFVIEYVGEV 60
Query: 148 IDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIID 207
IDD+TCEERLWKMKH GETNFYLCEINRDMVIDATYKGNKSRYINHSC PNTEMQKWIID
Sbjct: 61 IDDKTCEERLWKMKHCGETNFYLCEINRDMVIDATYKGNKSRYINHSCSPNTEMQKWIID 120
Query: 208 GETRIGIFATRDIKKGENLTYDYQY 232
GETRIGIFATRDI+KGE+LTYDYQ+
Sbjct: 121 GETRIGIFATRDIRKGEHLTYDYQF 145
>gi|242048842|ref|XP_002462165.1| hypothetical protein SORBIDRAFT_02g020844 [Sorghum bicolor]
gi|241925542|gb|EER98686.1| hypothetical protein SORBIDRAFT_02g020844 [Sorghum bicolor]
Length = 341
Score = 280 bits (716), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 139/231 (60%), Positives = 175/231 (75%), Gaps = 7/231 (3%)
Query: 15 HAFNKLLKQIGNPVEFELPDWFIKPKAI--PYVFIKRNIYLTKRIKRRLEDDGIFCSCTA 72
+ F +L+ +I P +F+ P F+K + Y IKRN+Y TKR R+ED GI C C
Sbjct: 21 NVFEQLISKI-EPADFD-PRPFLKQLNVLGRYEPIKRNVYCTKR---RVEDYGISCRCKP 75
Query: 73 SPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADE 132
SPGSS VC RDCHCGML S CSS C+C N+C NK FQ+RP+KK KL++TEKCG G+VA++
Sbjct: 76 SPGSSVVCGRDCHCGMLFSCCSSQCECDNACTNKSFQHRPLKKTKLIKTEKCGHGLVAED 135
Query: 133 DIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYIN 192
+IK+GEFVIEYVGEVIDD+ CE RLW MK L +T+FYLCE++ +MVIDAT KGN SR+IN
Sbjct: 136 EIKKGEFVIEYVGEVIDDRACENRLWTMKRLNDTDFYLCEVSSNMVIDATNKGNLSRFIN 195
Query: 193 HSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
HSC PNT+MQKW +DGETR+GIFA RDIK GE LTYDY++ + + +C
Sbjct: 196 HSCEPNTKMQKWTVDGETRVGIFALRDIKIGEELTYDYKFVQFGAAQVCHC 246
>gi|297817294|ref|XP_002876530.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297322368|gb|EFH52789.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 354
Score = 279 bits (713), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 160/244 (65%), Positives = 186/244 (76%), Gaps = 3/244 (1%)
Query: 1 MPAAKKNSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRR 60
M ++KK SD ++I + KL KQIG + E P + K P +FIKRNIYL K+ K+R
Sbjct: 1 MSSSKKGSDRNQIRKSLRKLKKQIGELEKLESPPEPLN-KGKP-IFIKRNIYLKKKFKKR 58
Query: 61 LEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQ 120
++D GIFCSC+ PGSS +C DC+CG+LLSSCSS CKC + C NKPFQ R +KKMKLVQ
Sbjct: 59 VKDHGIFCSCSLDPGSSTICGSDCNCGILLSSCSSSCKCSSECTNKPFQQRHIKKMKLVQ 118
Query: 121 TEKCGAGIVADEDIKRGEFVIEYVGE-VIDDQTCEERLWKMKHLGETNFYLCEINRDMVI 179
TEKCG GIVADEDI GEF+IEYVGE VID++ CEERLWK+ H E NFYLC+IN +MVI
Sbjct: 119 TEKCGYGIVADEDINSGEFIIEYVGEVVIDEKICEERLWKLNHKVEKNFYLCQINWNMVI 178
Query: 180 DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSL 239
DAT+KGNKSRYINHSC PNTEMQKWIIDGETRIGIFATR I KGE LTYDYQ+
Sbjct: 179 DATHKGNKSRYINHSCNPNTEMQKWIIDGETRIGIFATRFINKGEQLTYDYQFVQFGADQ 238
Query: 240 IAYC 243
YC
Sbjct: 239 DCYC 242
>gi|226493201|ref|NP_001149253.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20specific [Zea mays]
gi|194704072|gb|ACF86120.1| unknown [Zea mays]
gi|195625808|gb|ACG34734.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20specific [Zea mays]
gi|238014446|gb|ACR38258.1| unknown [Zea mays]
gi|414589294|tpg|DAA39865.1| TPA: putative histone-lysine N-methyltransferase family protein
isoform 1 [Zea mays]
gi|414589295|tpg|DAA39866.1| TPA: putative histone-lysine N-methyltransferase family protein
isoform 2 [Zea mays]
Length = 339
Score = 277 bits (709), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 136/233 (58%), Positives = 177/233 (75%), Gaps = 5/233 (2%)
Query: 12 RIGHAFNKLLKQIGNPVEFELPDWFIKPKAIP-YVFIKRNIYLTKRIKRRLEDDGIFCSC 70
+I + F++L+ +I + +F+ + + + Y IKRN+Y TKR R+ED GI C C
Sbjct: 17 QIDNVFDQLINKIEH-ADFDTRPFLKQLNVLGRYEPIKRNVYCTKR---RIEDYGISCHC 72
Query: 71 TASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVA 130
SPGSS VC RDCHCGML S CSS C+C ++C NK FQ+RP+KK KL++TEKCG G+VA
Sbjct: 73 KPSPGSSVVCGRDCHCGMLFSCCSSQCECDDTCTNKSFQHRPLKKTKLIKTEKCGHGLVA 132
Query: 131 DEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRY 190
+++IK+GEFVIEYVGEVIDD+TCE RLW MK L +T+FYLCE++ +MVIDAT KGN+SR+
Sbjct: 133 EDEIKKGEFVIEYVGEVIDDRTCENRLWTMKRLLDTDFYLCEVSSNMVIDATNKGNRSRF 192
Query: 191 INHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
INHSC PNT MQKW +DGETR+GIFA RDIK GE LTYDY++ + + +C
Sbjct: 193 INHSCEPNTAMQKWTVDGETRVGIFALRDIKIGEELTYDYKFVQFGAAQVCHC 245
>gi|414589296|tpg|DAA39867.1| TPA: putative histone-lysine N-methyltransferase family protein
[Zea mays]
Length = 343
Score = 275 bits (703), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 138/235 (58%), Positives = 175/235 (74%), Gaps = 7/235 (2%)
Query: 12 RIGHAFNKLLKQIGNPVEFELPDWFIKPKAIP-YVFIKRNIYLTKRIKRRLEDDGIFCSC 70
+I + F++L+ +I + +F+ + + + Y IKRN+Y TKR R+ED GI C C
Sbjct: 17 QIDNVFDQLINKIEH-ADFDTRPFLKQLNVLGRYEPIKRNVYCTKR---RIEDYGISCHC 72
Query: 71 TASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVA 130
SPGSS VC RDCHCGML S CSS C+C ++C NK FQ+RP+KK KL++TEKCG G+VA
Sbjct: 73 KPSPGSSVVCGRDCHCGMLFSCCSSQCECDDTCTNKSFQHRPLKKTKLIKTEKCGHGLVA 132
Query: 131 DEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRY 190
+++IK+GEFVIEYVGEVIDD+TCE RLW MK L +T+FYLCE++ +MVIDAT KGN+SR+
Sbjct: 133 EDEIKKGEFVIEYVGEVIDDRTCENRLWTMKRLLDTDFYLCEVSSNMVIDATNKGNRSRF 192
Query: 191 INHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQ--YEFLHDSLIAYC 243
INHSC PNT MQKW +DGETR+GIFA RDIK GE LTYDY Y F+ C
Sbjct: 193 INHSCEPNTAMQKWTVDGETRVGIFALRDIKIGEELTYDYNIMYRFVQFGAAQVC 247
>gi|162460550|ref|NP_001105653.1| LOC542662 [Zea mays]
gi|24021802|gb|AAN41254.1| SET domain protein 110 [Zea mays]
gi|195652527|gb|ACG45731.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20specific [Zea mays]
Length = 342
Score = 267 bits (683), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 134/234 (57%), Positives = 170/234 (72%), Gaps = 5/234 (2%)
Query: 12 RIGHAFNKLLKQIGNPVEFELPDWFIKPKAI--PYVFIKRNIYLTKRIKRRLEDDGIFCS 69
+I + F++L+ +I P F+K + Y IKRN+Y TKR +ED GI C
Sbjct: 18 QIENVFDQLISKIEQADPDFDPRPFLKQLNVLGRYEPIKRNVYFTKRY---IEDYGISCH 74
Query: 70 CTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIV 129
C SPGSS VC RDC+C ML S CSS C+C +C NK FQ+RP+ K KL++TEKCG G+V
Sbjct: 75 CKPSPGSSVVCGRDCYCSMLFSCCSSQCECDIACTNKSFQHRPLTKTKLIKTEKCGHGLV 134
Query: 130 ADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSR 189
A+++IK+GEFVIEYVGEVIDD+TCE RLW MK L +T+FYLCE++ +MVIDAT KGN SR
Sbjct: 135 AEDEIKKGEFVIEYVGEVIDDRTCENRLWTMKRLDDTDFYLCEVSSNMVIDATNKGNLSR 194
Query: 190 YINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
+INHSC PNT MQKW +DGETR+GIFA RDIK GE LTYDY++ + + +C
Sbjct: 195 FINHSCEPNTAMQKWTVDGETRVGIFALRDIKIGEELTYDYKFVQFGAAQVCHC 248
>gi|414884958|tpg|DAA60972.1| TPA: putative histone-lysine N-methyltransferase family protein
isoform 1 [Zea mays]
gi|414884959|tpg|DAA60973.1| TPA: putative histone-lysine N-methyltransferase family protein
isoform 2 [Zea mays]
Length = 337
Score = 267 bits (683), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 134/234 (57%), Positives = 170/234 (72%), Gaps = 5/234 (2%)
Query: 12 RIGHAFNKLLKQIGNPVEFELPDWFIKPKAI--PYVFIKRNIYLTKRIKRRLEDDGIFCS 69
+I + F++L+ +I P F+K + Y IKRN+Y TKR +ED GI C
Sbjct: 18 QIENVFDQLISKIEQADPDFDPRPFLKQLNVLGRYEPIKRNVYFTKRY---IEDYGISCH 74
Query: 70 CTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIV 129
C SPGSS VC RDC+C ML S CSS C+C +C NK FQ+RP+ K KL++TEKCG G+V
Sbjct: 75 CKPSPGSSVVCGRDCYCSMLFSCCSSQCECDIACTNKSFQHRPLTKTKLIKTEKCGHGLV 134
Query: 130 ADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSR 189
A+++IK+GEFVIEYVGEVIDD+TCE RLW MK L +T+FYLCE++ +MVIDAT KGN SR
Sbjct: 135 AEDEIKKGEFVIEYVGEVIDDRTCENRLWTMKRLDDTDFYLCEVSSNMVIDATNKGNLSR 194
Query: 190 YINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
+INHSC PNT MQKW +DGETR+GIFA RDIK GE LTYDY++ + + +C
Sbjct: 195 FINHSCEPNTAMQKWTVDGETRVGIFALRDIKIGEELTYDYKFVQFGAAQVCHC 248
>gi|302825340|ref|XP_002994293.1| hypothetical protein SELMODRAFT_432224 [Selaginella moellendorffii]
gi|300137824|gb|EFJ04637.1| hypothetical protein SELMODRAFT_432224 [Selaginella moellendorffii]
Length = 820
Score = 245 bits (625), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 113/209 (54%), Positives = 150/209 (71%), Gaps = 7/209 (3%)
Query: 27 PVEFELPDWFIK---PKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRD 83
P +F LP +IK ++ PY+ I+RN+YLTK+ + +DDG+ CSC S +C D
Sbjct: 90 PCDFRLPSSWIKRLDQESPPYIHIRRNVYLTKKTPKVSKDDGMQCSCKGSK----ICSSD 145
Query: 84 CHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEY 143
C C +L +SCSS C C C N PFQ R ++++L +TE CG G+ ADE+I+RG+F+IEY
Sbjct: 146 CICRLLYTSCSSSCACEGKCENLPFQKRDGRRLRLKETENCGWGLFADENIERGDFLIEY 205
Query: 144 VGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQK 203
+GEVIDD+TCEERLW +K GE NFYLCE+ D VIDAT+KGN SR+INHSC PN +++K
Sbjct: 206 IGEVIDDKTCEERLWDLKERGENNFYLCEVGHDKVIDATFKGNMSRFINHSCNPNAQLRK 265
Query: 204 WIIDGETRIGIFATRDIKKGENLTYDYQY 232
W DGE RIG+FA I KG+ +TYDY+Y
Sbjct: 266 WQCDGELRIGVFAVSRILKGQEITYDYKY 294
>gi|302795285|ref|XP_002979406.1| hypothetical protein SELMODRAFT_110353 [Selaginella moellendorffii]
gi|300153174|gb|EFJ19814.1| hypothetical protein SELMODRAFT_110353 [Selaginella moellendorffii]
Length = 274
Score = 238 bits (606), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 112/207 (54%), Positives = 149/207 (71%), Gaps = 7/207 (3%)
Query: 29 EFELPDWFIK---PKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCH 85
+F LP +IK ++ PY+ I+RN+YLTK+ + +DDG+ CSC S +C DC
Sbjct: 1 DFRLPSSWIKRLDQESPPYIHIRRNVYLTKKTPKVSKDDGMQCSCKGSK----ICSSDCI 56
Query: 86 CGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVG 145
C +L +SCSS C C C N PFQ R ++++L +TE CG G+ ADE+I+RG+F+IEY+G
Sbjct: 57 CRLLYTSCSSSCACEGKCENLPFQKRDGRRLRLKETENCGWGLFADENIERGDFLIEYIG 116
Query: 146 EVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWI 205
EVIDD+TCEERLW +K GE NFYLCE+ D VIDAT+KGN SR+INHSC PN +++KW
Sbjct: 117 EVIDDKTCEERLWDLKERGENNFYLCEVGHDKVIDATFKGNMSRFINHSCDPNAQLRKWQ 176
Query: 206 IDGETRIGIFATRDIKKGENLTYDYQY 232
DGE RIG+FA I KG+ +TYDY+Y
Sbjct: 177 CDGELRIGVFAVSRILKGQEITYDYKY 203
>gi|168031756|ref|XP_001768386.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680311|gb|EDQ66748.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 615
Score = 223 bits (568), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 103/148 (69%), Positives = 122/148 (82%)
Query: 85 HCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYV 144
H ML +CSS C CG+SC N PFQ P KMK V+TE+CG G+VADEDIK G F++EYV
Sbjct: 295 HVRMLFMTCSSNCGCGDSCTNLPFQKLPGSKMKAVKTERCGWGLVADEDIKAGSFLVEYV 354
Query: 145 GEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKW 204
GEVIDDQTCEERLW MK GE NFY+CEI+R+MVIDAT+KGN SR+INHSC PN+E+QKW
Sbjct: 355 GEVIDDQTCEERLWAMKKQGEMNFYMCEISREMVIDATFKGNLSRFINHSCQPNSELQKW 414
Query: 205 IIDGETRIGIFATRDIKKGENLTYDYQY 232
IDGETRIG+FA DIK+G+ +TYDYQ+
Sbjct: 415 DIDGETRIGVFAITDIKRGDFVTYDYQF 442
>gi|225445763|ref|XP_002272781.1| PREDICTED: histone-lysine N-methyltransferase ASHR3 [Vitis
vinifera]
gi|297743704|emb|CBI36587.3| unnamed protein product [Vitis vinifera]
Length = 494
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 105/206 (50%), Positives = 138/206 (66%), Gaps = 8/206 (3%)
Query: 38 KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGC 97
K + PYV I+RN+YL K+ KR DGI C+ S VC +C C + SCS C
Sbjct: 260 KMEPTPYVHIRRNVYLVKK-KRDDAADGIGCT-----NCSSVCSENCVCRVQCISCSKSC 313
Query: 98 KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
C +C N+PF R KK+K+V+TE CG G+ A E I +G+FVIEY+GEVIDD CE RL
Sbjct: 314 HCSENCTNRPF--RKEKKIKIVKTELCGWGVDAAESINKGDFVIEYIGEVIDDALCERRL 371
Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
W MK G+ NFY+CEI +D IDAT+KGN SR++NHSC PN +++KW ++GETR+G+FA
Sbjct: 372 WDMKDRGDQNFYMCEIRKDFTIDATFKGNASRFLNHSCDPNCKLEKWQVEGETRVGVFAA 431
Query: 218 RDIKKGENLTYDYQYEFLHDSLIAYC 243
R IK GE LTYDY++ + +C
Sbjct: 432 RSIKAGEPLTYDYRFVRFGPEVKCHC 457
>gi|255576669|ref|XP_002529224.1| set domain protein, putative [Ricinus communis]
gi|223531342|gb|EEF33180.1| set domain protein, putative [Ricinus communis]
Length = 156
Score = 211 bits (536), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 104/123 (84%), Positives = 109/123 (88%), Gaps = 2/123 (1%)
Query: 1 MPAAKKN--SDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIK 58
MP A KN D++RI AFNKLLKQIGNPVEFELP+WF K K PY FIKRNIYLTKRIK
Sbjct: 6 MPVATKNFDCDHNRINIAFNKLLKQIGNPVEFELPEWFNKCKPTPYTFIKRNIYLTKRIK 65
Query: 59 RRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKL 118
RRLEDDGIFCSC SPGSSGVCDRDCHCGMLLSSCSSGCKCG+SCLNKPFQ+RPVKKMKL
Sbjct: 66 RRLEDDGIFCSCGPSPGSSGVCDRDCHCGMLLSSCSSGCKCGDSCLNKPFQHRPVKKMKL 125
Query: 119 VQT 121
VQT
Sbjct: 126 VQT 128
>gi|242065740|ref|XP_002454159.1| hypothetical protein SORBIDRAFT_04g025725 [Sorghum bicolor]
gi|241933990|gb|EES07135.1| hypothetical protein SORBIDRAFT_04g025725 [Sorghum bicolor]
Length = 328
Score = 210 bits (534), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 100/204 (49%), Positives = 142/204 (69%), Gaps = 6/204 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKC 99
K PY I+RN+YL K+ + + D +C A C DC + +SCS C+C
Sbjct: 30 KPPPYTSIRRNVYLIKKKRTSVRVDIGCTNCRAD----STCKEDCEFRGISTSCSKNCRC 85
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
+ C N+PF R KK+K+V+T++CG G VA E ++RG+FVIEYVGEVIDD TCE+RLW
Sbjct: 86 SDLCTNRPF--RKDKKIKIVKTKRCGWGAVALEPLERGDFVIEYVGEVIDDATCEQRLWD 143
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
+++ G+ NFY+CEI++D IDAT+KGN SR++NHSC PN +++KW +DGETR+G+FA+R
Sbjct: 144 IRYRGDKNFYMCEISKDFTIDATFKGNVSRFLNHSCEPNCKLEKWQVDGETRVGVFASRS 203
Query: 220 IKKGENLTYDYQYEFLHDSLIAYC 243
IK GE LTYDY++ + + +C
Sbjct: 204 IKVGEPLTYDYRFVHFGEKVKCHC 227
>gi|357142828|ref|XP_003572707.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
ASHR3-like [Brachypodium distachyon]
Length = 466
Score = 207 bits (528), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 101/204 (49%), Positives = 140/204 (68%), Gaps = 6/204 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKC 99
K Y+ I+RN+YL +K++ D CT S VC DC C L SCS C C
Sbjct: 162 KPPSYIHIRRNVYL---VKKKRADSSAETGCTNCRADS-VCKDDCECRGLSMSCSKSCHC 217
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
+ C NKPF R KK+K+V++E CG G +A E +++ +F+IEYVGEVIDD TCE+RLW+
Sbjct: 218 SDLCTNKPF--RKDKKIKIVKSEGCGWGAIALEPLEKCDFIIEYVGEVIDDATCEQRLWE 275
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
MK G+ NFY+CEI++D IDAT+KGN SR++NHSC PN +++KW +DGETR+G+FA+R
Sbjct: 276 MKRRGDKNFYMCEISKDYTIDATFKGNTSRFLNHSCDPNCKLEKWQVDGETRVGVFASRF 335
Query: 220 IKKGENLTYDYQYEFLHDSLIAYC 243
I+ GE LTYDY++ + + +C
Sbjct: 336 IEVGEPLTYDYRFVHFGEKVKCHC 359
>gi|18417683|ref|NP_567859.1| histone-lysine N-methyltransferase ASHR3 [Arabidopsis thaliana]
gi|75164864|sp|Q949T8.1|ASHR3_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR3; AltName:
Full=ASH1-related protein 3; AltName: Full=Protein SET
DOMAIN GROUP 4; AltName: Full=Protein stamen loss
gi|15292921|gb|AAK92831.1| unknown protein [Arabidopsis thaliana]
gi|20465681|gb|AAM20309.1| unknown protein [Arabidopsis thaliana]
gi|56201422|dbj|BAD72877.1| stamen loss [Arabidopsis thaliana]
gi|332660421|gb|AEE85821.1| histone-lysine N-methyltransferase ASHR3 [Arabidopsis thaliana]
Length = 497
Score = 207 bits (527), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 102/205 (49%), Positives = 137/205 (66%), Gaps = 8/205 (3%)
Query: 28 VEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG 87
++ D +K YV I+RNIYL K+ KR +DG+ C+ P CDR C C
Sbjct: 248 IDLAWKDSVVKEDPPSYVHIRRNIYLVKK-KRDNANDGVGCT-NCGPN----CDRSCVCR 301
Query: 88 MLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEV 147
+ SCS GC C SC N+PF R KK+K+V+TE CG G+ A E I + +F++EY+GEV
Sbjct: 302 VQCISCSKGCSCPESCGNRPF--RKEKKIKIVKTEHCGWGVEAAESINKEDFIVEYIGEV 359
Query: 148 IDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIID 207
I D CE+RLW MKH G +FY+CEI +D IDAT+KGN SR++NHSC PN ++KW ++
Sbjct: 360 ISDAQCEQRLWDMKHKGMKDFYMCEIQKDFTIDATFKGNASRFLNHSCNPNCVLEKWQVE 419
Query: 208 GETRIGIFATRDIKKGENLTYDYQY 232
GETR+G+FA R I+ GE LTYDY++
Sbjct: 420 GETRVGVFAARQIEAGEPLTYDYRF 444
>gi|326522849|dbj|BAJ88470.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 394
Score = 205 bits (522), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 101/204 (49%), Positives = 139/204 (68%), Gaps = 6/204 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKC 99
K Y I+RN+YL +K++ D CT S VC DC C L SCS C C
Sbjct: 94 KPPSYTHIRRNVYL---VKKKRADSSAETGCTNC-KSDSVCKDDCECRGLSMSCSKSCHC 149
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
+ C NKPF R KK+K+V++E CG G VA E +++G+F+IEYVGEVI+D TCE+RLW
Sbjct: 150 SDLCSNKPF--RKDKKIKIVKSEGCGWGAVALEPLEKGDFIIEYVGEVINDATCEQRLWD 207
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
MK G+ NFY+CEI++D IDAT+KGN SR++NHSC PN +++KW ++GETR+G+FA+R
Sbjct: 208 MKRRGDKNFYMCEISKDFTIDATFKGNTSRFLNHSCDPNCKLEKWQVEGETRVGVFASRI 267
Query: 220 IKKGENLTYDYQYEFLHDSLIAYC 243
I+ GE LTYDY++ + + +C
Sbjct: 268 IEVGEPLTYDYRFVHFGEKVKCHC 291
>gi|297802948|ref|XP_002869358.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297315194|gb|EFH45617.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 497
Score = 205 bits (522), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 98/205 (47%), Positives = 136/205 (66%), Gaps = 8/205 (3%)
Query: 28 VEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG 87
++ D +K PYV I+RNIY K+ KR +DG+ C+ C R C C
Sbjct: 248 IDLTWKDSVVKDDLPPYVHIRRNIYFVKK-KRDNANDGVGCT-----NCGPTCCRSCVCR 301
Query: 88 MLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEV 147
+ SCS GC+C +C N+PF R KK+++V+TE CG G+ A E I + +F++EY+GEV
Sbjct: 302 VQCISCSKGCRCPETCGNRPF--RKEKKIRIVKTEHCGWGVEAAESINKEDFIVEYIGEV 359
Query: 148 IDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIID 207
I D CE+RLW MKH G +FY+CEI +D IDAT+KGN SR++NHSC PN ++KW ++
Sbjct: 360 ISDAQCEQRLWDMKHKGMKDFYMCEIQKDFTIDATFKGNASRFLNHSCSPNCVLEKWQVE 419
Query: 208 GETRIGIFATRDIKKGENLTYDYQY 232
GETR+G+FA R I+ GE LTYDY++
Sbjct: 420 GETRVGVFAARQIEAGEPLTYDYRF 444
>gi|449457959|ref|XP_004146715.1| PREDICTED: histone-lysine N-methyltransferase ASHR3-like [Cucumis
sativus]
Length = 502
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 99/190 (52%), Positives = 130/190 (68%), Gaps = 7/190 (3%)
Query: 43 PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNS 102
PY I+RNIYL K+ + DG+ C+ C DC C + SCS C C ++
Sbjct: 270 PYTHIRRNIYLVKKKRDTGVADGLGCT-----NCKTECSDDCVCRVQCISCSRACHCRDT 324
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C N+PF R KK+K+V+TE CG G+ A E I +GEFVIEY+GEVIDD CE+RLW MK+
Sbjct: 325 CTNRPF--RKEKKVKIVKTELCGWGVEAAESIGKGEFVIEYIGEVIDDALCEQRLWDMKY 382
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
G NFY+CEI +D IDAT+KGN SR++NHSC PN ++KW ++GETR+G+FA R I+
Sbjct: 383 KGMKNFYMCEIRKDFTIDATFKGNASRFLNHSCDPNCSLEKWQVEGETRVGVFAARSIEV 442
Query: 223 GENLTYDYQY 232
GE LTYDY++
Sbjct: 443 GEPLTYDYRF 452
>gi|449530608|ref|XP_004172286.1| PREDICTED: histone-lysine N-methyltransferase ASHR3-like, partial
[Cucumis sativus]
Length = 285
Score = 204 bits (520), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 99/190 (52%), Positives = 131/190 (68%), Gaps = 7/190 (3%)
Query: 43 PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNS 102
PY I+RNIYL K+ + DG+ C+ + C DC C + SCS C C ++
Sbjct: 53 PYTHIRRNIYLVKKKRDTGVADGLGCTNCKTE-----CSDDCVCRVQCISCSRACHCRDT 107
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C N+PF R KK+K+V+TE CG G+ A E I +GEFVIEY+GEVIDD CE+RLW MK+
Sbjct: 108 CTNRPF--RKEKKVKIVKTELCGWGVEAAESIGKGEFVIEYIGEVIDDALCEQRLWDMKY 165
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
G NFY+CEI +D IDAT+KGN SR++NHSC PN ++KW ++GETR+G+FA R I+
Sbjct: 166 KGMKNFYMCEIRKDFTIDATFKGNASRFLNHSCDPNCSLEKWQVEGETRVGVFAARSIEV 225
Query: 223 GENLTYDYQY 232
GE LTYDY++
Sbjct: 226 GEPLTYDYRF 235
>gi|326487338|dbj|BAJ89653.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 340
Score = 204 bits (519), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 101/204 (49%), Positives = 138/204 (67%), Gaps = 6/204 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKC 99
K Y I+RN+YL +K++ D CT S VC DC C L SCS C C
Sbjct: 40 KPPSYTHIRRNVYL---VKKKRADSSAETGCTNCKSDS-VCKDDCECRGLSMSCSKSCHC 95
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
+ C NKPF R KK+K+V++E CG G VA E +++G+F+IEYVGEVI+D TCE+RLW
Sbjct: 96 SDLCSNKPF--RKDKKIKIVKSEGCGWGAVALEPLEKGDFIIEYVGEVINDATCEQRLWD 153
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
MK G+ NFY+CEI++D IDAT+KGN SR++NHSC PN +++KW ++GETR G+FA+R
Sbjct: 154 MKRRGDKNFYMCEISKDFTIDATFKGNTSRFLNHSCDPNCKLEKWQVEGETRAGVFASRI 213
Query: 220 IKKGENLTYDYQYEFLHDSLIAYC 243
I+ GE LTYDY++ + + +C
Sbjct: 214 IEVGEPLTYDYRFVHFGEKVKCHC 237
>gi|284434727|gb|ADB85424.1| putative retrotransposon protein [Phyllostachys edulis]
Length = 1946
Score = 202 bits (513), Expect = 1e-49, Method: Composition-based stats.
Identities = 88/125 (70%), Positives = 104/125 (83%)
Query: 107 PFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGET 166
P + P L TEKCG G+VA+++IK+GEFVIEYVGEVIDD+TCEERLWKMK +T
Sbjct: 95 PLTSNPFYHRSLTLTEKCGFGLVAEDEIKKGEFVIEYVGEVIDDRTCEERLWKMKRQRDT 154
Query: 167 NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENL 226
NFYLCE++ +MVIDAT KGN SR+INHSC PNTEMQKW +DGETR+GIFA RDIK+GE L
Sbjct: 155 NFYLCEVSSNMVIDATNKGNMSRFINHSCEPNTEMQKWTVDGETRVGIFALRDIKEGEEL 214
Query: 227 TYDYQ 231
TYDY+
Sbjct: 215 TYDYK 219
>gi|222623224|gb|EEE57356.1| hypothetical protein OsJ_07493 [Oryza sativa Japonica Group]
Length = 384
Score = 201 bits (512), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 101/226 (44%), Positives = 140/226 (61%), Gaps = 15/226 (6%)
Query: 25 GNPVEFELPDWFIKPKAI-------PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
G P E + +W K A+ F+ KR R E CS +
Sbjct: 55 GRPFEEYVKEWKAKKAALGVPAGRCELPFLTGTPKAKKRPDSRAEAGCTNCS------AD 108
Query: 78 GVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRG 137
C DC C L SCS C C + C NKPF R KK+K V+T++CG G ++ E +++G
Sbjct: 109 STCKDDCECRGLYMSCSKNCHCSDMCTNKPF--RKDKKIKAVKTKRCGWGAISLEPLEKG 166
Query: 138 EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
+F+IEYVGEVI+D TCE+RLW MK G+ NFY+CEI++D IDAT+KGN SR++NHSC P
Sbjct: 167 DFIIEYVGEVINDATCEQRLWDMKRRGDKNFYMCEISKDFTIDATFKGNTSRFLNHSCDP 226
Query: 198 NTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
N +++KW +DGETR+G+FA+R I+ GE+LTYDY++ + + YC
Sbjct: 227 NCKLEKWQVDGETRVGVFASRSIQVGEHLTYDYRFVHFGEKVKCYC 272
>gi|224143888|ref|XP_002325110.1| SET domain protein [Populus trichocarpa]
gi|222866544|gb|EEF03675.1| SET domain protein [Populus trichocarpa]
Length = 516
Score = 200 bits (509), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 98/201 (48%), Positives = 135/201 (67%), Gaps = 8/201 (3%)
Query: 43 PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNS 102
PYV I+RN+YL K+ KR D + C+ S C +C C + SCS C+C +
Sbjct: 285 PYVHIRRNVYLVKK-KRDDSDGDVGCT-----NCSSTCCENCVCRVQCISCSKACRCPET 338
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C N+PF R KK+K+V+TE CG G+ A E + +G+F+IEY+GEVIDD+ CE+RLW MK+
Sbjct: 339 CTNRPF--RKEKKIKIVKTEFCGWGVEAAEPLNKGDFIIEYIGEVIDDKLCEQRLWDMKY 396
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
G NFY+CEI +D IDAT+KGN SR++NHSC PN ++KW ++GETR+G+FA I+
Sbjct: 397 KGVQNFYMCEIRKDFTIDATFKGNSSRFLNHSCKPNCILEKWDVEGETRVGVFAAGSIRV 456
Query: 223 GENLTYDYQYEFLHDSLIAYC 243
GE LTYDY++ + YC
Sbjct: 457 GEPLTYDYRFVRFGPEVKCYC 477
>gi|356514491|ref|XP_003525939.1| PREDICTED: histone-lysine N-methyltransferase ASHR3-like [Glycine
max]
Length = 459
Score = 196 bits (498), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 102/190 (53%), Positives = 132/190 (69%), Gaps = 6/190 (3%)
Query: 43 PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNS 102
PYV I+RNIYL K+ KR DDG + S S+ C DC C + SCS C+C +
Sbjct: 224 PYVHIRRNIYLVKK-KRSDADDG---AGCTSCSSTSTCSDDCVCRVQCISCSKACRCSEN 279
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C N+PF R KK+K+V+TE CG G+ A E I +G F+IEY+GEVIDD CE+RLW MK+
Sbjct: 280 CNNRPF--RKEKKIKIVKTELCGWGVEAAETIDKGGFIIEYIGEVIDDALCEKRLWDMKY 337
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
G NFY+CEI +D IDAT+KGN SR++NHSC PN ++KW +DGETR+G+FA I+
Sbjct: 338 RGVQNFYMCEIRKDFTIDATFKGNTSRFLNHSCDPNCVLEKWQVDGETRVGVFAACSIEA 397
Query: 223 GENLTYDYQY 232
GE LTYDY++
Sbjct: 398 GEPLTYDYRF 407
>gi|238013758|gb|ACR37914.1| unknown [Zea mays]
Length = 252
Score = 189 bits (480), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 84/152 (55%), Positives = 118/152 (77%), Gaps = 2/152 (1%)
Query: 92 SCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQ 151
SCS C+C + C N+PF R KK+K+V+T++CG G VA E ++RG+FVIEYVGEVIDD
Sbjct: 2 SCSKNCRCSDLCTNRPF--RKDKKIKIVKTKRCGWGAVALEPLERGDFVIEYVGEVIDDA 59
Query: 152 TCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETR 211
TCE+RLW ++ G+ NFY+CEI++D IDAT+KGN SR++NHSC PN +++KW +DGETR
Sbjct: 60 TCEQRLWDIRRRGDKNFYMCEISKDFTIDATFKGNVSRFLNHSCEPNCKLEKWQVDGETR 119
Query: 212 IGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
+G+FA+R I+ GE LTYDY++ + + +C
Sbjct: 120 VGVFASRSIEVGEPLTYDYRFVHFGEKVKCHC 151
>gi|297599580|ref|NP_001047405.2| Os02g0611300 [Oryza sativa Japonica Group]
gi|255671080|dbj|BAF09319.2| Os02g0611300 [Oryza sativa Japonica Group]
Length = 263
Score = 189 bits (479), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 83/152 (54%), Positives = 117/152 (76%), Gaps = 2/152 (1%)
Query: 92 SCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQ 151
SCS C C + C NKPF R KK+K V+T++CG G ++ E +++G+F+IEYVGEVI+D
Sbjct: 2 SCSKNCHCSDMCTNKPF--RKDKKIKAVKTKRCGWGAISLEPLEKGDFIIEYVGEVINDA 59
Query: 152 TCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETR 211
TCE+RLW MK G+ NFY+CEI++D IDAT+KGN SR++NHSC PN +++KW +DGETR
Sbjct: 60 TCEQRLWDMKRRGDKNFYMCEISKDFTIDATFKGNTSRFLNHSCDPNCKLEKWQVDGETR 119
Query: 212 IGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
+G+FA+R I+ GE+LTYDY++ + + YC
Sbjct: 120 VGVFASRSIQVGEHLTYDYRFVHFGEKVKCYC 151
>gi|218191149|gb|EEC73576.1| hypothetical protein OsI_08033 [Oryza sativa Indica Group]
Length = 163
Score = 187 bits (474), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 84/153 (54%), Positives = 118/153 (77%), Gaps = 2/153 (1%)
Query: 89 LLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVI 148
L SCS C C + C NKPF R KK+K V+T++CG G ++ E +++G+F+IEYVGEVI
Sbjct: 4 LYMSCSKNCHCSDMCTNKPF--RKDKKIKAVKTKRCGWGAISLEPLEKGDFIIEYVGEVI 61
Query: 149 DDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDG 208
+D TCE+RLW MK G+ NFY+CEI++D IDAT+KGN SR++NHSC PN +++KW +DG
Sbjct: 62 NDATCEQRLWDMKRRGDKNFYMCEISKDFTIDATFKGNTSRFLNHSCDPNCKLEKWQVDG 121
Query: 209 ETRIGIFATRDIKKGENLTYDYQYEFLHDSLIA 241
ETR+G+FA+R I+ GE+LTYDY+Y + L++
Sbjct: 122 ETRVGVFASRSIQVGEHLTYDYRYSSCIEKLLS 154
>gi|255574954|ref|XP_002528383.1| set domain protein, putative [Ricinus communis]
gi|223532171|gb|EEF33976.1| set domain protein, putative [Ricinus communis]
Length = 500
Score = 186 bits (473), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 97/190 (51%), Positives = 131/190 (68%), Gaps = 8/190 (4%)
Query: 43 PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNS 102
PYV I+RN+YL K+ + +++D SC++S VC C SCS C C +
Sbjct: 267 PYVHIRRNVYLVKKKRDDVDNDVGCTSCSSSCCEDCVCRVQC------ISCSRACHCSEN 320
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C N+PF R KK+K+V+TE CG G+ E I +G+F+IEY+GEVIDD CE+RLW MK+
Sbjct: 321 CTNRPF--RKEKKIKIVKTEFCGWGVETVEPINKGDFIIEYIGEVIDDAVCEQRLWDMKY 378
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
G NFY+CEI +D IDAT+KGN SR++NHSC PN ++KW ++GETR+G+FA R IK
Sbjct: 379 KGVQNFYMCEIRKDFTIDATFKGNSSRFLNHSCDPNCILEKWQVEGETRVGVFAARSIKV 438
Query: 223 GENLTYDYQY 232
GE LTYDY++
Sbjct: 439 GEPLTYDYRF 448
>gi|357141465|ref|XP_003572234.1| PREDICTED: histone-lysine N-methyltransferase ASHR3-like
[Brachypodium distachyon]
Length = 347
Score = 184 bits (466), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 91/208 (43%), Positives = 139/208 (66%), Gaps = 8/208 (3%)
Query: 26 NPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCH 85
+P++ E + KP YV +KR+I + + + D I CT C C
Sbjct: 128 DPIKTEELESLTKPPT--YVLLKRSILC---MIHKCDGDAIEGGCTDC-DPPLACKTMCS 181
Query: 86 CGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVG 145
C + SCS CKC N C N+PF+ K++++V+T+ CG G+VA E I++G+FVIE+VG
Sbjct: 182 CRSVWISCSRACKCSNECTNRPFRRE--KRIEVVKTQHCGWGVVALESIQKGDFVIEFVG 239
Query: 146 EVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWI 205
EVIDD TCEERL MK G+ NFY+C++N++ VIDAT++GN R++NHSC PN +++KW
Sbjct: 240 EVIDDVTCEERLEDMKRRGDQNFYMCKVNKNFVIDATFRGNACRFLNHSCEPNCQLEKWQ 299
Query: 206 IDGETRIGIFATRDIKKGENLTYDYQYE 233
++G+TR+G+FA++ I+ G+ LTY Y+++
Sbjct: 300 VNGKTRLGVFASQAIEVGKPLTYSYRFK 327
>gi|147765700|emb|CAN77905.1| hypothetical protein VITISV_024391 [Vitis vinifera]
Length = 290
Score = 182 bits (463), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 90/134 (67%), Positives = 100/134 (74%), Gaps = 17/134 (12%)
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
C P+ K++ TEKCG+GIVADEDIK+GEFVIEYVGEVIDD+TCE+RLW
Sbjct: 56 CAPPSYTAKLDGMPMHLSKVMLTEKCGSGIVADEDIKQGEFVIEYVGEVIDDKTCEDRLW 115
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
KMKHLGETNFYLCEINRDMVIDATYKGNK IDGETRIGIFATR
Sbjct: 116 KMKHLGETNFYLCEINRDMVIDATYKGNKR-----------------IDGETRIGIFATR 158
Query: 219 DIKKGENLTYDYQY 232
DIK+GE+LTYDYQ+
Sbjct: 159 DIKRGEHLTYDYQF 172
>gi|2980780|emb|CAA18207.1| putative protein [Arabidopsis thaliana]
gi|7269987|emb|CAB79804.1| putative protein [Arabidopsis thaliana]
Length = 477
Score = 175 bits (444), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 77/137 (56%), Positives = 103/137 (75%), Gaps = 2/137 (1%)
Query: 96 GCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEE 155
GC C SC N+PF R KK+K+V+TE CG G+ A E I + +F++EY+GEVI D CE+
Sbjct: 290 GCSCPESCGNRPF--RKEKKIKIVKTEHCGWGVEAAESINKEDFIVEYIGEVISDAQCEQ 347
Query: 156 RLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIF 215
RLW MKH G +FY+CEI +D IDAT+KGN SR++NHSC PN ++KW ++GETR+G+F
Sbjct: 348 RLWDMKHKGMKDFYMCEIQKDFTIDATFKGNASRFLNHSCNPNCVLEKWQVEGETRVGVF 407
Query: 216 ATRDIKKGENLTYDYQY 232
A R I+ GE LTYDY++
Sbjct: 408 AARQIEAGEPLTYDYRF 424
>gi|168025972|ref|XP_001765507.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683357|gb|EDQ69768.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 993
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 83/145 (57%), Positives = 102/145 (70%), Gaps = 3/145 (2%)
Query: 89 LLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVI 148
L SCS CKC + C NKPF R K++K+ +T CG G IK+ EFVIEY GEVI
Sbjct: 792 LSMSCSKDCKCSDKCCNKPF--RKDKRLKVSKTAHCGWGAFTSVAIKKDEFVIEYTGEVI 849
Query: 149 DDQTCEERLWKMKHLGET-NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIID 207
DD CE+RLW+MK NFY+CEI +D +IDAT KGN SRY+NHSC PN ++KW +D
Sbjct: 850 DDAMCEKRLWEMKGRRSICNFYMCEIAKDFIIDATRKGNASRYLNHSCQPNCRLEKWRVD 909
Query: 208 GETRIGIFATRDIKKGENLTYDYQY 232
GETR+G+FA R+I GE LTYDY+Y
Sbjct: 910 GETRVGVFAGRNIIAGEELTYDYKY 934
>gi|212721872|ref|NP_001132822.1| uncharacterized protein LOC100194312 [Zea mays]
gi|194695492|gb|ACF81830.1| unknown [Zea mays]
gi|413923000|gb|AFW62932.1| putative SET-domain containing protein family [Zea mays]
Length = 418
Score = 165 bits (418), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 74/142 (52%), Positives = 106/142 (74%), Gaps = 1/142 (0%)
Query: 103 CLNKPFQNRPVKKMKLVQ-TEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
C+ + N V+ + Q T++CG G VA E ++RG+FVIEYVGEVIDD TCE+RLW ++
Sbjct: 176 CVAEDTSNFTVESFRCPQHTKRCGWGAVALEPLERGDFVIEYVGEVIDDATCEQRLWDIR 235
Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
G+ NFY+CEI++D IDAT+KGN SR++NHSC PN +++KW +DGETR+G+FA+R I+
Sbjct: 236 RRGDKNFYMCEISKDFTIDATFKGNVSRFLNHSCEPNCKLEKWQVDGETRVGVFASRSIE 295
Query: 222 KGENLTYDYQYEFLHDSLIAYC 243
GE LTYDY++ + + +C
Sbjct: 296 VGEPLTYDYRFVHFGEKVKCHC 317
>gi|42407424|dbj|BAD10031.1| SET domain protein-like [Oryza sativa Japonica Group]
Length = 437
Score = 163 bits (412), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 72/133 (54%), Positives = 105/133 (78%), Gaps = 2/133 (1%)
Query: 101 NSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKM 160
+ C NKPF+ + KK+++V+T+ CG G A E I++ +FVIE+VGEVIDD+TCEERL M
Sbjct: 278 HECTNKPFRRQ--KKIEIVKTQYCGWGSRALEAIEKDDFVIEFVGEVIDDETCEERLEDM 335
Query: 161 KHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDI 220
+ G+ NFY+C++ +D VIDAT+KGN R+ NHSC PN ++QKW ++G+TR+G+FA++ I
Sbjct: 336 RRRGDKNFYMCKVKKDFVIDATFKGNDCRFFNHSCEPNCQLQKWQVNGKTRLGVFASKAI 395
Query: 221 KKGENLTYDYQYE 233
+ GE LTYDY++E
Sbjct: 396 EVGEPLTYDYRFE 408
>gi|145353759|ref|XP_001421172.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|145357147|ref|XP_001422783.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581408|gb|ABO99465.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144583027|gb|ABP01142.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 503
Score = 162 bits (409), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 84/208 (40%), Positives = 120/208 (57%), Gaps = 10/208 (4%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRL-EDDGIFCSCTASP--GSSGV----CDRDCHCGMLLSS 92
K P+ I R++++++ +L + + C C P G S C ++C L S
Sbjct: 199 KPPPFERIHRSVFVSRPPPVKLHKSETAVCDCHPPPSRGDSETIRDGCGQECLNRKLRFS 258
Query: 93 CSS-GCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQ 151
C S C CG++C N+P P K K+++TE G G+ E ++ G F++EY GE++D+
Sbjct: 259 CDSRTCPCGDACSNRPLSQLPAPKTKIIRTENRGWGLTLQEPVRAGTFIVEYAGEILDEH 318
Query: 152 TCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWI--IDGE 209
C ERLW K GE NFYL EI+ + VIDA +KG+ +R+IN SC PN E Q+W+ E
Sbjct: 319 ECAERLWYDKQSGEENFYLMEISANYVIDAKFKGSIARFINSSCHPNCETQRWVDASTNE 378
Query: 210 TRIGIFATRDIKKGENLTYDYQYEFLHD 237
TR+GIFAT DI G LTYDY + D
Sbjct: 379 TRVGIFATEDIASGTELTYDYNFAHFGD 406
>gi|440801495|gb|ELR22513.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 981
Score = 159 bits (401), Expect = 1e-36, Method: Composition-based stats.
Identities = 78/197 (39%), Positives = 114/197 (57%), Gaps = 2/197 (1%)
Query: 38 KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SG 96
KP IP+ +IK+N+YL + ++ D+ C+C G C +C +L C
Sbjct: 332 KPPPIPFSYIKKNMYLHRSMRHWSSDEVPVCNCRLVSGKKA-CGENCINRVLNIECKLKH 390
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C CG +C N+ FQ R K+ T K G G+ A E I +G F+IEYVGEVI C++R
Sbjct: 391 CPCGTNCSNRQFQLRKYAKIDRFLTGKKGWGLRAREKIPKGTFIIEYVGEVISTDMCQDR 450
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + +G ++Y ++ IDA+ KGN +R+INHSC PN + KW +D E R+GIFA
Sbjct: 451 MKYYEEMGLEHYYFLTLDGSECIDASQKGNLARFINHSCNPNAKTHKWTVDKEIRVGIFA 510
Query: 217 TRDIKKGENLTYDYQYE 233
DI G+ +T+DYQ+E
Sbjct: 511 EEDIPVGQEITFDYQFE 527
>gi|168044865|ref|XP_001774900.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673794|gb|EDQ60312.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1980
Score = 157 bits (397), Expect = 4e-36, Method: Composition-based stats.
Identities = 80/191 (41%), Positives = 112/191 (58%), Gaps = 3/191 (1%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTA-SPGSSGVCDRDCHCGMLLSSC-SSGCKCGN 101
+ IKRNI+ + K + D+ + C C PG G C DC ML C C CG
Sbjct: 832 WTLIKRNIFKHRSQKHQDADEAMVCQCAPPKPGEVG-CGEDCLNRMLNVECLPQQCPCGP 890
Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
C N+ FQ R ++L + K G G+ A E+I RG F+IEYVGEV+D + E R +
Sbjct: 891 FCTNQQFQKRLYANVELFRCGKKGHGLRALENIPRGTFIIEYVGEVLDMPSFEARQKEYS 950
Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
+ +FY ++ + +IDA KGN R+INHSC PN + +KW++DGE IG+FA RD+K
Sbjct: 951 MNSQKHFYFMTLSANEIIDACSKGNLGRFINHSCEPNCQTEKWMVDGEVCIGLFAIRDVK 1010
Query: 222 KGENLTYDYQY 232
KGE +T+DY +
Sbjct: 1011 KGEEVTFDYNF 1021
>gi|145355325|ref|XP_001421914.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582153|gb|ABP00208.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 860
Score = 155 bits (392), Expect = 1e-35, Method: Composition-based stats.
Identities = 75/186 (40%), Positives = 113/186 (60%), Gaps = 3/186 (1%)
Query: 49 RNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKP 107
RNI+ + + EDD + C+C +P S C DC ++LS C C CG++C N+
Sbjct: 114 RNIFAHRAPRTPSEDDAMICAC--APESGAGCGSDCLNRLVLSECDPAHCPCGSACGNQR 171
Query: 108 FQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN 167
+ + +T K G G+ A E + GEFV+EY GEV+ ++ +ER + + G ++
Sbjct: 172 MSRGESRATTVRRTGKKGHGLFAAERVGAGEFVLEYCGEVLHEEAYKERKRRYQDEGRSH 231
Query: 168 FYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLT 227
+Y ++ IDAT +GN+ R++NHSC PN E QKW++ GE IGIFATRDI++GE LT
Sbjct: 232 YYFMTLSSSETIDATIRGNEGRFLNHSCAPNCETQKWMVRGELCIGIFATRDIEEGEELT 291
Query: 228 YDYQYE 233
DY++E
Sbjct: 292 IDYKFE 297
>gi|390342260|ref|XP_003725626.1| PREDICTED: uncharacterized protein LOC578079 isoform 1
[Strongylocentrotus purpuratus]
Length = 3023
Score = 154 bits (389), Expect = 3e-35, Method: Composition-based stats.
Identities = 80/212 (37%), Positives = 117/212 (55%), Gaps = 8/212 (3%)
Query: 30 FELPDWFI--KPKAIPYVFIKRNIYLTKRIKRRL--EDDGIFCSCTASPGSS----GVCD 81
+ +W I + K +V + N YLT+R + + E + C C S C
Sbjct: 1571 LQYAEWGIVERSKMPRFVELNENYYLTERRRSKAGKEARRMVCCCATSREERRRGIQACG 1630
Query: 82 RDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVI 141
DC +L+ C S C CG+ C N+ FQ R ++ + TE+ G G+ A E++K EFV+
Sbjct: 1631 EDCLNRVLMLECGSRCPCGDYCTNRRFQKRENARVGVFYTEEKGHGLKAKEELKDNEFVM 1690
Query: 142 EYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
EYVGEV++ + R + +FY + D +IDAT KGN SR++NHSC PN E
Sbjct: 1691 EYVGEVLNFHEFKHRAKQYSKDKNLHFYFMALKSDEIIDATEKGNVSRFMNHSCDPNCET 1750
Query: 202 QKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
QKW ++G+ R+G F R +K GE LT+DYQ+E
Sbjct: 1751 QKWTVNGQLRVGFFTKRQVKPGEELTFDYQFE 1782
>gi|390342258|ref|XP_783359.3| PREDICTED: uncharacterized protein LOC578079 isoform 2
[Strongylocentrotus purpuratus]
Length = 3024
Score = 154 bits (389), Expect = 3e-35, Method: Composition-based stats.
Identities = 80/212 (37%), Positives = 117/212 (55%), Gaps = 8/212 (3%)
Query: 30 FELPDWFI--KPKAIPYVFIKRNIYLTKRIKRRL--EDDGIFCSCTASPGSS----GVCD 81
+ +W I + K +V + N YLT+R + + E + C C S C
Sbjct: 1571 LQYAEWGIVERSKMPRFVELNENYYLTERRRSKAGKEARRMVCCCATSREERRRGIQACG 1630
Query: 82 RDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVI 141
DC +L+ C S C CG+ C N+ FQ R ++ + TE+ G G+ A E++K EFV+
Sbjct: 1631 EDCLNRVLMLECGSRCPCGDYCTNRRFQKRENARVGVFYTEEKGHGLKAKEELKDNEFVM 1690
Query: 142 EYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
EYVGEV++ + R + +FY + D +IDAT KGN SR++NHSC PN E
Sbjct: 1691 EYVGEVLNFHEFKHRAKQYSKDKNLHFYFMALKSDEIIDATEKGNVSRFMNHSCDPNCET 1750
Query: 202 QKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
QKW ++G+ R+G F R +K GE LT+DYQ+E
Sbjct: 1751 QKWTVNGQLRVGFFTKRQVKPGEELTFDYQFE 1782
>gi|320169023|gb|EFW45922.1| ASHH3 [Capsaspora owczarzaki ATCC 30864]
Length = 885
Score = 154 bits (388), Expect = 4e-35, Method: Composition-based stats.
Identities = 80/214 (37%), Positives = 123/214 (57%), Gaps = 12/214 (5%)
Query: 38 KPKAIPYVFIKRNIYLTKRIKRRLEDDGIF---CSCTASPGS---SGVCDRDCHCGMLLS 91
+P+ P+ I + Y+ K+ D F C C+ P + + C ++C ML++
Sbjct: 120 EPRDKPFQVITHSEYINLSAKK----DPKFKFECECSFDPATDDPATACGKNCLNRMLMA 175
Query: 92 SCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDD 150
CS C CG C N+ FQNR M++ +TEK G G++ ED+ G+F+IEYVG+V+ +
Sbjct: 176 ECSPKRCPCGTYCTNQRFQNRQYPAMEVFRTEKKGNGLMVLEDLAPGQFLIEYVGDVVHN 235
Query: 151 QTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGET 210
+ ++R +FY ++ D VIDAT +G+ SR+INHSC PN E QKW++D
Sbjct: 236 REFKKRTKSYHERQYDHFYFMTLSSDEVIDATVRGSISRFINHSCEPNCETQKWVVDRRI 295
Query: 211 RIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
R+GIFA + IK G +T+DY++E F + YC
Sbjct: 296 RVGIFAKKAIKAGTEITFDYKFERFSDEGQACYC 329
>gi|222640644|gb|EEE68776.1| hypothetical protein OsJ_27487 [Oryza sativa Japonica Group]
Length = 405
Score = 152 bits (384), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 66/118 (55%), Positives = 92/118 (77%)
Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
L+QT+ CG G A E I++ +FVIE+VGEVIDD+TCEERL M+ G+ NFY+C++ +D
Sbjct: 281 LLQTQYCGWGSRALEAIEKDDFVIEFVGEVIDDETCEERLEDMRRRGDKNFYMCKVKKDF 340
Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
VIDAT+KGN R+ NHSC PN ++QKW ++G+TR+G+FA++ I+ GE LTYDY + L
Sbjct: 341 VIDATFKGNDCRFFNHSCEPNCQLQKWQVNGKTRLGVFASKAIEVGEPLTYDYSFRLL 398
>gi|414870328|tpg|DAA48885.1| TPA: putative SET-domain containing protein family [Zea mays]
Length = 266
Score = 152 bits (384), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 69/137 (50%), Positives = 100/137 (72%), Gaps = 2/137 (1%)
Query: 96 GCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEE 155
GC C C NKPF R KK+K+V+T +CG G +A E I + +FVIE+VGEVIDD CE+
Sbjct: 111 GCPCSVKCSNKPF--RREKKIKIVKTRQCGWGAIALETIGKDDFVIEFVGEVIDDAMCED 168
Query: 156 RLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIF 215
RL M+ + NFY+C++++D VID T+KGN R+ NHSC PN ++KW ++G+TR+G+F
Sbjct: 169 RLQDMRQRRDQNFYMCKVDKDFVIDPTFKGNACRFFNHSCQPNCRLEKWQVNGKTRLGVF 228
Query: 216 ATRDIKKGENLTYDYQY 232
A++ I+ G LTY+Y++
Sbjct: 229 ASQTIEVGMPLTYNYRF 245
>gi|218201221|gb|EEC83648.1| hypothetical protein OsI_29393 [Oryza sativa Indica Group]
Length = 451
Score = 152 bits (384), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 65/118 (55%), Positives = 92/118 (77%)
Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
L+QT+ CG G A E I++ +FVIE+VGEVIDD+TCEERL M+ G+ NFY+C++ +D
Sbjct: 327 LLQTQYCGWGSRALEAIEKDDFVIEFVGEVIDDETCEERLEDMRRRGDKNFYMCKVKKDF 386
Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
VIDAT+KGN R+ NHSC PN ++QKW ++G+TR+G+FA++ I+ GE LT+DY + L
Sbjct: 387 VIDATFKGNDCRFFNHSCEPNCQLQKWQVNGKTRLGVFASKAIEVGEPLTFDYSFRLL 444
>gi|405952170|gb|EKC20012.1| Histone-lysine N-methyltransferase SETD2 [Crassostrea gigas]
Length = 1451
Score = 152 bits (383), Expect = 2e-34, Method: Composition-based stats.
Identities = 77/201 (38%), Positives = 113/201 (56%), Gaps = 6/201 (2%)
Query: 39 PKAIPYVFIKRNIYLTKRIKRRLEDD--GIFCSCTASPGSSGV----CDRDCHCGMLLSS 92
PK P+ I+ NIYL +R K + D + C C+ S + C DC ML
Sbjct: 403 PKKPPFEPIEDNIYLCERKKNKKMKDVRRMVCDCSTSKEDRDMGYEACGEDCLNRMLYIE 462
Query: 93 CSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
C + C CG C NK FQ + ++ T+ G G+ A ++ G+FV+EYVGEV+D +
Sbjct: 463 CGNRCPCGEYCTNKRFQKKQYADVEAFVTDWKGMGLRATAALQPGDFVMEYVGEVLDYKQ 522
Query: 153 CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
+ R+ + +G+ + Y +N D VIDA+YKGN SRY+NHSC PN E QKW ++G R+
Sbjct: 523 FKSRVKQQAKMGQEHHYFMALNSDEVIDASYKGNVSRYMNHSCDPNCETQKWTVNGVLRV 582
Query: 213 GIFATRDIKKGENLTYDYQYE 233
G F + ++ L +DYQ+E
Sbjct: 583 GFFVKKAVEPLTELNFDYQFE 603
>gi|189237403|ref|XP_973596.2| PREDICTED: similar to AGAP011688-PA [Tribolium castaneum]
gi|270007628|gb|EFA04076.1| hypothetical protein TcasGA2_TC014310 [Tribolium castaneum]
Length = 1569
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 79/195 (40%), Positives = 114/195 (58%), Gaps = 7/195 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASP-----GSSGVCDRDCHCGMLLSSCSSGCK 98
+V +K N+YLT R+ + E + C C +P G G C DC +L+ C C
Sbjct: 500 FVHLKENLYLTDRMSCK-EAKKMTCDCFLTPEEIERGELG-CGEDCLNRLLMIECGGLCP 557
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
G+ C NK FQ +++ +TEK G G+ A +I GEF++EYVGEV+D + + R
Sbjct: 558 VGDRCTNKKFQKSQFAPVEVFKTEKKGLGLRAAANIPYGEFILEYVGEVLDPEEFDNRAD 617
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
+ ++Y + D +IDAT KGN SR+INHSC PN E QKW ++GE RIG F+TR
Sbjct: 618 DYSNDKNKHYYFMSLRADAIIDATMKGNISRFINHSCDPNAETQKWTVNGELRIGFFSTR 677
Query: 219 DIKKGENLTYDYQYE 233
I GE +T+DY+++
Sbjct: 678 TILAGEEITFDYRFQ 692
>gi|168009924|ref|XP_001757655.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691349|gb|EDQ77712.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1715
Score = 151 bits (382), Expect = 2e-34, Method: Composition-based stats.
Identities = 78/191 (40%), Positives = 111/191 (58%), Gaps = 3/191 (1%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTA-SPGSSGVCDRDCHCGMLLSSC-SSGCKCGN 101
+ IKRNI+ + K + D+ + C C PG G C DC +L C C CG
Sbjct: 670 WTLIKRNIFKHRNQKHQDADEAMVCQCAPPKPGEVG-CGEDCLNRILNVECLPQQCPCGP 728
Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
C N+ FQ R ++L + K G G+ A E+I RG F+IEYVGEV+D + E R +
Sbjct: 729 LCTNQQFQKRTYANVELFRCGKKGHGLRALENIPRGTFIIEYVGEVLDMPSFEARQKEYS 788
Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
+ +FY ++ + +IDA KGN R+INHSC PN + +KW++DGE IG+FA RDIK
Sbjct: 789 MNSQKHFYFMTLSANEIIDACNKGNLGRFINHSCEPNCQTEKWMVDGEVCIGLFAIRDIK 848
Query: 222 KGENLTYDYQY 232
+ E +T+DY +
Sbjct: 849 EREEVTFDYNF 859
>gi|158301050|ref|XP_001238385.2| AGAP011688-PA [Anopheles gambiae str. PEST]
gi|157013454|gb|EAU75883.2| AGAP011688-PA [Anopheles gambiae str. PEST]
Length = 2404
Score = 151 bits (381), Expect = 3e-34, Method: Composition-based stats.
Identities = 83/217 (38%), Positives = 119/217 (54%), Gaps = 10/217 (4%)
Query: 25 GNPVEFELPDWFIK---PKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTAS-----PGS 76
G+ EL + +++ K + I+ NIY + RI R E + C C + G
Sbjct: 1185 GDSANVELSEEYLRDMEEKLSRFETIRENIYHSDRIVSR-EAKKMTCDCFLTHEEIERGE 1243
Query: 77 SGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKR 136
G C DC +L+ C S C G+ C N+ FQ + ++ +TEK G GI A I
Sbjct: 1244 HG-CGEDCLNRLLMIECGSRCTVGDRCTNRRFQRQEYAHCQVFRTEKKGFGIQASSAIAP 1302
Query: 137 GEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCC 196
GEF++EYVGEV++ +ER ++Y + D +IDAT KGN SR+INHSC
Sbjct: 1303 GEFIMEYVGEVLNSAQFDERAEAYSREKNKHYYFMALRSDGIIDATTKGNISRFINHSCD 1362
Query: 197 PNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
PN E QKW ++GE RIG F+T+ I GE +T+DYQ++
Sbjct: 1363 PNAETQKWTVNGELRIGFFSTKYILPGEEITFDYQFQ 1399
>gi|427779581|gb|JAA55242.1| Putative histone-lysine n-methyltransferase setd2 [Rhipicephalus
pulchellus]
Length = 2038
Score = 150 bits (380), Expect = 3e-34, Method: Composition-based stats.
Identities = 80/203 (39%), Positives = 115/203 (56%), Gaps = 12/203 (5%)
Query: 40 KAIPYVFIKRNIYLTKRIK-------RRLEDDGIFCSCTASPGSSGV--CDRDCHCGMLL 90
K++ Y I+ N+YL +R K RR+ D CS T G+ C+ DC +L+
Sbjct: 789 KSMDYEDIEENLYLFERRKTKSKKEVRRMICD---CSLTKDEKDRGIMGCEEDCLNRLLM 845
Query: 91 SSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDD 150
C S C G +C N+ FQ + K++ TEK G G+ E + G FV+EYVGEV+
Sbjct: 846 IECGSRCPNGENCSNRRFQKKSYIKVEKFMTEKKGWGLRTLETVSSGTFVMEYVGEVLTP 905
Query: 151 QTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGET 210
+ +R+ + ++Y + D +IDAT KGN SR+INHSC PN E QKW ++GE
Sbjct: 906 EDFRKRVKQYARDNNQHYYFMALRADEIIDATQKGNVSRFINHSCDPNCETQKWTVNGEL 965
Query: 211 RIGIFATRDIKKGENLTYDYQYE 233
RIG F R ++ GE LT+DYQ++
Sbjct: 966 RIGFFTRRPLRAGEELTFDYQFQ 988
>gi|301614673|ref|XP_002936809.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Xenopus
(Silurana) tropicalis]
Length = 1298
Score = 150 bits (378), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 83/202 (41%), Positives = 120/202 (59%), Gaps = 10/202 (4%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P S C D +C ML+ C
Sbjct: 931 KPPPYKHIKVNKPYGK-VQVYTADISEIPKCNCKPSSEKPCGFDSECLNRMLMYECHPQV 989
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R + K+++TE G G++A DIK+GEFV EY+GE+ID++ C
Sbjct: 990 CPAGDRCQNQCFNKRQYPETKIIKTEGKGWGLIATRDIKKGEFVNEYIGELIDEEEC--- 1046
Query: 157 LWKMKHLGE---TNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIG 213
+++++H E T+FY+ I++D +IDA KGN SR++NHSC PN E QKW ++G+TR+G
Sbjct: 1047 MYRIRHAQENDITHFYMLTIDKDRIIDAGPKGNFSRFMNHSCQPNCETQKWSVNGDTRVG 1106
Query: 214 IFATRDIKKGENLTYDYQYEFL 235
+FA RDI GE LT++Y + L
Sbjct: 1107 LFAVRDIPAGEELTFNYNLDCL 1128
>gi|412985970|emb|CCO17170.1| predicted protein [Bathycoccus prasinos]
Length = 1058
Score = 150 bits (378), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 70/188 (37%), Positives = 115/188 (61%), Gaps = 1/188 (0%)
Query: 47 IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLN 105
+ RNI+ + ++ EDD + C+C P + C+ +C ++L+ C C C +C N
Sbjct: 167 VSRNIFKHREPRKLTEDDIMVCNCAIDPETGSGCEEECVNRLVLTECDPKFCPCSTACKN 226
Query: 106 KPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGE 165
+ F + K++++ ++ K G G+ ++E+IK GEFV+EYVGEV+ ++ EER +
Sbjct: 227 QRFSRKKFKELEVKRSSKKGHGLFSEENIKPGEFVLEYVGEVLHEEAYEERKRQYAREKR 286
Query: 166 TNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGEN 225
+FY ++ IDAT +G R++NHSC PN E QKW++ GE IG+FA R+I+ GE
Sbjct: 287 RHFYFMTLSSSETIDATERGGVGRFLNHSCDPNCETQKWMVKGELCIGVFALREIQPGEE 346
Query: 226 LTYDYQYE 233
+T DY++E
Sbjct: 347 ITIDYKFE 354
>gi|297602734|ref|NP_001052807.2| Os04g0429100 [Oryza sativa Japonica Group]
gi|255675465|dbj|BAF14721.2| Os04g0429100, partial [Oryza sativa Japonica Group]
Length = 612
Score = 149 bits (377), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 81/204 (39%), Positives = 120/204 (58%), Gaps = 12/204 (5%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG------MLLSSCSSG- 96
Y+ I+ N +L +R KR+ E+D C C + + D D CG + + C+ G
Sbjct: 108 YIHIETNDFLHRRHKRQKEEDIAVCECQYN-----LLDPDSACGDRCLNVLTSTECTPGY 162
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C CG C N+ FQ +LV+TE G G++ADE+I G+FV+EY GEVI + + R
Sbjct: 163 CLCGVYCKNQRFQKSQYAATRLVKTEGRGWGLLADENIMAGQFVMEYCGEVISWKEAKRR 222
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
++ G T+ Y+ +N D IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA
Sbjct: 223 SQAYENQGLTDAYIIYLNADESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFA 282
Query: 217 TRDIKKGENLTYDYQYEFLHDSLI 240
+DI G L+YDY +E+ +++
Sbjct: 283 KQDIPIGTELSYDYNFEWFGGAMV 306
>gi|432879768|ref|XP_004073538.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Oryzias latipes]
Length = 2321
Score = 149 bits (376), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 81/196 (41%), Positives = 115/196 (58%), Gaps = 2/196 (1%)
Query: 40 KAIPYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-C 97
K PY IK N + K +I + C+C AS + D +C ML+ C C
Sbjct: 1789 KPPPYRHIKVNRPIGKVQIITADLSEVPRCNCKASDENPCGADSECINRMLMYECHPQVC 1848
Query: 98 KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
G C N+ F R +++ +T CG G+ A DIK+G FV EYVGEVID++ C R+
Sbjct: 1849 AAGERCQNQAFTKRQYTTVEIYRTLSCGWGLRAVSDIKKGAFVSEYVGEVIDEEECRARI 1908
Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
+ NFY+ +++D VIDA KGN++R++NHSC PN E QKW ++G+TR+G+FA
Sbjct: 1909 RHAQEHDICNFYMLTLDKDRVIDAGPKGNQARFMNHSCQPNCETQKWTVNGDTRVGLFAL 1968
Query: 218 RDIKKGENLTYDYQYE 233
+DI KGE LT++Y E
Sbjct: 1969 QDIAKGEELTFNYNLE 1984
>gi|255078218|ref|XP_002502689.1| set domain protein [Micromonas sp. RCC299]
gi|226517954|gb|ACO63947.1| set domain protein [Micromonas sp. RCC299]
Length = 1065
Score = 149 bits (376), Expect = 1e-33, Method: Composition-based stats.
Identities = 71/195 (36%), Positives = 109/195 (55%), Gaps = 1/195 (0%)
Query: 47 IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLN 105
I RNI+ + K EDD + C+C P + C DC +L C C CG+ C N
Sbjct: 104 IHRNIFTHREPKSVTEDDVLICACVPDPETGKGCGPDCINRDVLVECDPAFCPCGDGCQN 163
Query: 106 KPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGE 165
+ FQ + K+ + +T + G G+ + +K+G+F+IEY+GEV+ + R + G
Sbjct: 164 QKFQKKDYAKLDIQRTGRKGHGLFTKQALKKGQFIIEYIGEVLHEDEYRSRKARYDDEGR 223
Query: 166 TNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGEN 225
++Y ++ IDA +GN R++NHSC PN E QKW+++GE IGI+A DI G+
Sbjct: 224 RHYYFMTLSSSETIDAAERGNAGRFLNHSCDPNCETQKWMVNGELCIGIYALTDIDAGDE 283
Query: 226 LTYDYQYEFLHDSLI 240
LT+DY +E D+ I
Sbjct: 284 LTFDYNFERYGDNPI 298
>gi|241753587|ref|XP_002401135.1| huntingtin interacting protein, putative [Ixodes scapularis]
gi|215508354|gb|EEC17808.1| huntingtin interacting protein, putative [Ixodes scapularis]
Length = 1594
Score = 149 bits (376), Expect = 1e-33, Method: Composition-based stats.
Identities = 79/212 (37%), Positives = 119/212 (56%), Gaps = 12/212 (5%)
Query: 40 KAIPYVFIKRNIY-----LTKRIKRRLEDDGIFCSCTASPGSSGV--CDRDCHCGMLLSS 92
+++ Y I+ N+Y K ++R + D CS T G+ C+ DC +L+
Sbjct: 541 RSLAYEDIEENLYRKKSKCKKEVRRMICD----CSLTKDERDRGIMGCEEDCLNRLLMIE 596
Query: 93 CSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
C S C G+SC N+ FQ + K++ TEK G G+ E + G FV+EYVGEV+ +
Sbjct: 597 CGSRCPNGDSCSNRRFQKKSYIKVEKFLTEKKGWGLRTVETLASGAFVMEYVGEVLTPED 656
Query: 153 CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
+R+ + ++Y + D +IDAT KGN SR+INHSC PN E QKW ++GE RI
Sbjct: 657 FRKRVKQYARDNHQHYYFMALRSDEIIDATQKGNVSRFINHSCDPNCETQKWTVNGELRI 716
Query: 213 GIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
G F R ++ GE LT+DYQ++ + ++ YC
Sbjct: 717 GFFTRRPLRAGEELTFDYQFQRYGKEAQKCYC 748
>gi|198418893|ref|XP_002124393.1| PREDICTED: similar to SET domain containing 2 [Ciona intestinalis]
Length = 2228
Score = 149 bits (376), Expect = 1e-33, Method: Composition-based stats.
Identities = 78/200 (39%), Positives = 113/200 (56%), Gaps = 4/200 (2%)
Query: 47 IKRNIYLTKRIKRRLEDD--GIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCL 104
I NIYL ++ K R+ + + C C + + C DC +L+ CS+ C G C
Sbjct: 995 ISDNIYLCEKKKSRVRKEIRRMVCECDNNEDGTP-CGSDCLNRLLMIECSARCPLGEQCQ 1053
Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
NK FQ + ++ QT+ G GI A E++ G V+EY GEV+D Q R
Sbjct: 1054 NKRFQRKQYVPTEVFQTKWKGWGIRATENLSPGMLVMEYCGEVLDLQEFGRRSLLYSRGN 1113
Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
+ +FY +++D +IDAT KGN SR+INHSC PN E QKW ++G R+G F RDI KGE
Sbjct: 1114 QQHFYFMALSQDEIIDATTKGNTSRFINHSCDPNCETQKWTVNGRLRVGFFTMRDINKGE 1173
Query: 225 NLTYDYQYE-FLHDSLIAYC 243
+T+DYQ++ + ++ YC
Sbjct: 1174 EITFDYQFQRYGKEAQACYC 1193
>gi|302141761|emb|CBI18964.3| unnamed protein product [Vitis vinifera]
Length = 1958
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 79/190 (41%), Positives = 111/190 (58%), Gaps = 1/190 (0%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
++ I+ N++L + + + D+ + C C C +C ML C G C CG+
Sbjct: 1194 WMLIRSNLFLHRSRRTQTIDEVMVCHCKRPVEGRFGCGDECLNRMLNIECVQGTCPCGDL 1253
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C N+ FQ R K+K + K G G+ +DI +G+F+IEYVGEV+D QT E R +
Sbjct: 1254 CSNQQFQKRGYAKLKWFKCGKKGYGLQLQQDISQGQFLIEYVGEVLDLQTYEARQKEYAS 1313
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
G +FY +N VIDA KGN R+INHSC PN +KW+++GE IG+FA RDIKK
Sbjct: 1314 RGHKHFYFMTLNGSEVIDACAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKK 1373
Query: 223 GENLTYDYQY 232
GE +T+DY Y
Sbjct: 1374 GEEVTFDYNY 1383
>gi|312378119|gb|EFR24776.1| hypothetical protein AND_10404 [Anopheles darlingi]
Length = 2632
Score = 149 bits (375), Expect = 1e-33, Method: Composition-based stats.
Identities = 78/192 (40%), Positives = 108/192 (56%), Gaps = 7/192 (3%)
Query: 47 IKRNIYLTKRIKRRLEDDGIFCSCTAS-----PGSSGVCDRDCHCGMLLSSCSSGCKCGN 101
I+ NIY + RI R E + C C + G G C DC +L+ C S C G
Sbjct: 1391 IRENIYYSDRIVSR-EAKKMTCDCFLTHEDIERGEMG-CGEDCLNRLLMIECGSRCTVGE 1448
Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
C N+ FQ + ++ +TEK G GI A I GEF++EYVGEV++ ++R
Sbjct: 1449 RCTNRRFQRQEYAHCQVFRTEKKGFGIQASAPIAPGEFIMEYVGEVLNGSQFDQRAEAYS 1508
Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
++Y + D +IDAT KGN SR+INHSC PN E QKW ++GE RIG F+T+ I
Sbjct: 1509 RDKNKHYYFMALRSDGIIDATTKGNISRFINHSCDPNAETQKWTVNGELRIGFFSTKYIL 1568
Query: 222 KGENLTYDYQYE 233
GE +T+DYQ++
Sbjct: 1569 PGEEITFDYQFQ 1580
>gi|332028801|gb|EGI68830.1| Putative histone-lysine N-methyltransferase NSD2 [Acromyrmex
echinatior]
Length = 1304
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 82/192 (42%), Positives = 110/192 (57%), Gaps = 2/192 (1%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CK 98
K PYV +K N + +K + + C C + + DC +L+ CS G C
Sbjct: 874 KPPPYVKLKVNKPVGN-VKPTEVESIVACECDPEWDNPCAPNTDCLNRILMVECSPGICP 932
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
G C+N+ F R M+ T G G+ EDIK G+FVIEYVGE+IDD + RL
Sbjct: 933 AGAKCMNQSFVLRQYPAMEPFHTMGRGWGLRTLEDIKTGQFVIEYVGEIIDDAEYKRRLH 992
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
+ K L NFY I+ + IDA KGN SR++NHSC PN E QKW ++G+TRIG+FA R
Sbjct: 993 RKKELKNENFYFLTIDNNRTIDAEPKGNLSRFMNHSCAPNCETQKWTVNGDTRIGLFALR 1052
Query: 219 DIKKGENLTYDY 230
DI+ GE LT++Y
Sbjct: 1053 DIESGEELTFNY 1064
>gi|116309320|emb|CAH66406.1| OSIGBa0093L02.2 [Oryza sativa Indica Group]
Length = 519
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 81/204 (39%), Positives = 120/204 (58%), Gaps = 12/204 (5%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG------MLLSSCSSG- 96
Y+ I+ N +L +R KR+ E+D C C + + D D CG + + C+ G
Sbjct: 15 YIHIETNDFLHRRHKRQKEEDIAVCECQYN-----LLDPDSACGDRCLNVLTSTECTPGY 69
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C CG C N+ FQ +LV+TE G G++ADE+I G+FV+EY GEVI + + R
Sbjct: 70 CLCGVYCKNQRFQKSQYAATRLVKTEGRGWGLLADENIMAGQFVMEYCGEVISWKEAKRR 129
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
++ G T+ Y+ +N D IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA
Sbjct: 130 SQAYENQGLTDAYIIYLNADESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFA 189
Query: 217 TRDIKKGENLTYDYQYEFLHDSLI 240
+DI G L+YDY +E+ +++
Sbjct: 190 KQDIPIGTELSYDYNFEWFGGAMV 213
>gi|240254387|ref|NP_177854.6| histone-lysine N-methyltransferase SETD2 [Arabidopsis thaliana]
gi|157734196|gb|ABV68921.1| SDG8 [Arabidopsis thaliana]
gi|332197839|gb|AEE35960.1| histone-lysine N-methyltransferase SETD2 [Arabidopsis thaliana]
Length = 1805
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 78/190 (41%), Positives = 112/190 (58%), Gaps = 1/190 (0%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
+ IK N +L + K + D+ + C C SP C +C ML C G C G+
Sbjct: 955 FKAIKTNQFLHRNRKSQTIDEIMVCHCKPSPDGRLGCGEECLNRMLNIECLQGTCPAGDL 1014
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C N+ FQ R K + Q+ K G G+ ED++ G+F+IEYVGEV+D Q+ E R +
Sbjct: 1015 CSNQQFQKRKYVKFERFQSGKKGYGLRLLEDVREGQFLIEYVGEVLDMQSYETRQKEYAF 1074
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
G+ +FY +N + VIDA KGN R+INHSC PN +KW+++GE +GIF+ +D+KK
Sbjct: 1075 KGQKHFYFMTLNGNEVIDAGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKK 1134
Query: 223 GENLTYDYQY 232
G+ LT+DY Y
Sbjct: 1135 GQELTFDYNY 1144
>gi|3540208|gb|AAC34358.1| Hypothetical protein [Arabidopsis thaliana]
Length = 1767
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 78/190 (41%), Positives = 112/190 (58%), Gaps = 1/190 (0%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
+ IK N +L + K + D+ + C C SP C +C ML C G C G+
Sbjct: 955 FKAIKTNQFLHRNRKSQTIDEIMVCHCKPSPDGRLGCGEECLNRMLNIECLQGTCPAGDL 1014
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C N+ FQ R K + Q+ K G G+ ED++ G+F+IEYVGEV+D Q+ E R +
Sbjct: 1015 CSNQQFQKRKYVKFERFQSGKKGYGLRLLEDVREGQFLIEYVGEVLDMQSYETRQKEYAF 1074
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
G+ +FY +N + VIDA KGN R+INHSC PN +KW+++GE +GIF+ +D+KK
Sbjct: 1075 KGQKHFYFMTLNGNEVIDAGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKK 1134
Query: 223 GENLTYDYQY 232
G+ LT+DY Y
Sbjct: 1135 GQELTFDYNY 1144
>gi|328725324|ref|XP_003248429.1| PREDICTED: hypothetical protein LOC100165877 isoform 2
[Acyrthosiphon pisum]
Length = 1384
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 93/250 (37%), Positives = 135/250 (54%), Gaps = 32/250 (12%)
Query: 18 NKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKR----------------IKRRL 61
N+LL+ N +EF+ K KA + + N+Y R +K+R+
Sbjct: 393 NQLLEARINDIEFDE-----KLKATNFTLVDENVYRPNRQTLKSYPPSEIPGLRKLKQRI 447
Query: 62 EDDGI--FCSCTASP-----GSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVK 114
I +C CT S G++G CD C +L C GC C NK FQN+ K
Sbjct: 448 ISGDINDYCECTLSKEDIIEGNTG-CDDRCLNRLLKVECGLGCSLKRYCTNKQFQNKQFK 506
Query: 115 KMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEIN 174
K +++T+ G GI A EDI +G + EYVGEVID RL K ++ Y+ ++N
Sbjct: 507 KTNIIKTDNKGYGICAVEDIPKGALISEYVGEVIDYNEMCNRLTKKEYKNLN--YMVQLN 564
Query: 175 RDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEF 234
D +ID+T KGN +R+INHSC PN+ +KW + G++RIG F+TR I+KGE +T+DY ++
Sbjct: 565 PDEIIDSTSKGNVTRFINHSCDPNSVGEKWHVLGQSRIGFFSTRHIEKGEEITFDYSFQI 624
Query: 235 LHD-SLIAYC 243
D + I YC
Sbjct: 625 FGDGAQICYC 634
>gi|222628880|gb|EEE61012.1| hypothetical protein OsJ_14832 [Oryza sativa Japonica Group]
Length = 518
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 81/204 (39%), Positives = 120/204 (58%), Gaps = 12/204 (5%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG------MLLSSCSSG- 96
Y+ I+ N +L +R KR+ E+D C C + + D D CG + + C+ G
Sbjct: 14 YIHIETNDFLHRRHKRQKEEDIAVCECQYN-----LLDPDSACGDRCLNVLTSTECTPGY 68
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C CG C N+ FQ +LV+TE G G++ADE+I G+FV+EY GEVI + + R
Sbjct: 69 CLCGVYCKNQRFQKSQYAATRLVKTEGRGWGLLADENIMAGQFVMEYCGEVISWKEAKRR 128
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
++ G T+ Y+ +N D IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA
Sbjct: 129 SQAYENQGLTDAYIIYLNADESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFA 188
Query: 217 TRDIKKGENLTYDYQYEFLHDSLI 240
+DI G L+YDY +E+ +++
Sbjct: 189 KQDIPIGTELSYDYNFEWFGGAMV 212
>gi|255078696|ref|XP_002502928.1| set domain protein [Micromonas sp. RCC299]
gi|226518194|gb|ACO64186.1| set domain protein [Micromonas sp. RCC299]
Length = 377
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 75/156 (48%), Positives = 96/156 (61%), Gaps = 3/156 (1%)
Query: 80 CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGE 138
C +C L++C C CG SC N+PF K + TE G G+ E +K G
Sbjct: 50 CGAECFNRTCLTTCDPRVCPCGPSCSNRPFHQLKSPKTDTLLTENRGWGLFLAEPVKAGT 109
Query: 139 FVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPN 198
F++EYVGE++D+ T E+RLW+ K GE NFYL E+ + IDA YKGN SR+IN SC PN
Sbjct: 110 FIVEYVGEILDEHTTEKRLWEDKKRGEDNFYLMEVMPNQCIDARYKGNLSRFINSSCHPN 169
Query: 199 TEMQKW--IIDGETRIGIFATRDIKKGENLTYDYQY 232
E QKW GETR+GIFA +DI +G LTYDY +
Sbjct: 170 CETQKWQDSATGETRVGIFAIQDIPEGTELTYDYNF 205
>gi|328725322|ref|XP_001947915.2| PREDICTED: hypothetical protein LOC100165877 isoform 3
[Acyrthosiphon pisum]
Length = 1568
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 93/250 (37%), Positives = 135/250 (54%), Gaps = 32/250 (12%)
Query: 18 NKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKR----------------IKRRL 61
N+LL+ N +EF+ K KA + + N+Y R +K+R+
Sbjct: 393 NQLLEARINDIEFDE-----KLKATNFTLVDENVYRPNRQTLKSYPPSEIPGLRKLKQRI 447
Query: 62 EDDGI--FCSCTASP-----GSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVK 114
I +C CT S G++G CD C +L C GC C NK FQN+ K
Sbjct: 448 ISGDINDYCECTLSKEDIIEGNTG-CDDRCLNRLLKVECGLGCSLKRYCTNKQFQNKQFK 506
Query: 115 KMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEIN 174
K +++T+ G GI A EDI +G + EYVGEVID RL K ++ Y+ ++N
Sbjct: 507 KTNIIKTDNKGYGICAVEDIPKGALISEYVGEVIDYNEMCNRLTKKEYKNLN--YMVQLN 564
Query: 175 RDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEF 234
D +ID+T KGN +R+INHSC PN+ +KW + G++RIG F+TR I+KGE +T+DY ++
Sbjct: 565 PDEIIDSTSKGNVTRFINHSCDPNSVGEKWHVLGQSRIGFFSTRHIEKGEEITFDYSFQI 624
Query: 235 LHD-SLIAYC 243
D + I YC
Sbjct: 625 FGDGAQICYC 634
>gi|145327721|ref|NP_001077836.1| histone-lysine N-methyltransferase SETD2 [Arabidopsis thaliana]
gi|332197840|gb|AEE35961.1| histone-lysine N-methyltransferase SETD2 [Arabidopsis thaliana]
Length = 1501
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 78/190 (41%), Positives = 112/190 (58%), Gaps = 1/190 (0%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
+ IK N +L + K + D+ + C C SP C +C ML C G C G+
Sbjct: 955 FKAIKTNQFLHRNRKSQTIDEIMVCHCKPSPDGRLGCGEECLNRMLNIECLQGTCPAGDL 1014
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C N+ FQ R K + Q+ K G G+ ED++ G+F+IEYVGEV+D Q+ E R +
Sbjct: 1015 CSNQQFQKRKYVKFERFQSGKKGYGLRLLEDVREGQFLIEYVGEVLDMQSYETRQKEYAF 1074
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
G+ +FY +N + VIDA KGN R+INHSC PN +KW+++GE +GIF+ +D+KK
Sbjct: 1075 KGQKHFYFMTLNGNEVIDAGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKK 1134
Query: 223 GENLTYDYQY 232
G+ LT+DY Y
Sbjct: 1135 GQELTFDYNY 1144
>gi|94707110|sp|Q2LAE1.1|ASHH2_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHH2; AltName:
Full=ASH1 homolog 2; AltName: Full=H3-K4-HMTase; AltName:
Full=Histone H3-K36 methyltransferase 8;
Short=H3-K36-HMTase 8; AltName: Full=Protein EARLY
FLOWERING IN SHORT DAYS; AltName: Full=Protein SET DOMAIN
GROUP 8
gi|85036158|gb|ABC69038.1| SDG8 [Arabidopsis thaliana]
Length = 1759
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 78/190 (41%), Positives = 112/190 (58%), Gaps = 1/190 (0%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
+ IK N +L + K + D+ + C C SP C +C ML C G C G+
Sbjct: 955 FKAIKTNQFLHRNRKSQTIDEIMVCHCKPSPDGRLGCGEECLNRMLNIECLQGTCPAGDL 1014
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C N+ FQ R K + Q+ K G G+ ED++ G+F+IEYVGEV+D Q+ E R +
Sbjct: 1015 CSNQQFQKRKYVKFERFQSGKKGYGLRLLEDVREGQFLIEYVGEVLDMQSYETRQKEYAF 1074
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
G+ +FY +N + VIDA KGN R+INHSC PN +KW+++GE +GIF+ +D+KK
Sbjct: 1075 KGQKHFYFMTLNGNEVIDAGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKK 1134
Query: 223 GENLTYDYQY 232
G+ LT+DY Y
Sbjct: 1135 GQELTFDYNY 1144
>gi|218194855|gb|EEC77282.1| hypothetical protein OsI_15918 [Oryza sativa Indica Group]
Length = 472
Score = 147 bits (372), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 81/204 (39%), Positives = 120/204 (58%), Gaps = 12/204 (5%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG------MLLSSCSSG- 96
Y+ I+ N +L +R KR+ E+D C C + + D D CG + + C+ G
Sbjct: 9 YIHIETNDFLHRRHKRQKEEDIAVCECQYN-----LLDPDSACGDRCLNVLTSTECTPGY 63
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C CG C N+ FQ +LV+TE G G++ADE+I G+FV+EY GEVI + + R
Sbjct: 64 CLCGVYCKNQRFQKSQYAATRLVKTEGRGWGLLADENIMAGQFVMEYCGEVISWKEAKRR 123
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
++ G T+ Y+ +N D IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA
Sbjct: 124 SQAYENQGLTDAYIIYLNADESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFA 183
Query: 217 TRDIKKGENLTYDYQYEFLHDSLI 240
+DI G L+YDY +E+ +++
Sbjct: 184 KQDIPIGTELSYDYNFEWFGGAMV 207
>gi|348527268|ref|XP_003451141.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Oreochromis
niloticus]
Length = 1605
Score = 147 bits (371), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 79/194 (40%), Positives = 112/194 (57%), Gaps = 4/194 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY FIK N + K ++ + D C P C + C ML C
Sbjct: 1229 KPPPYKFIKSNKPVGK-VQVHVADLSEIQRCNCKPADEHPCSLESQCLNRMLQYECHPQV 1287
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+SC N+ F R + ++++TE G G+ ++ +K+G+FV EYVGEVID + C++R
Sbjct: 1288 CPAGDSCENQCFSKRLYAETEVIKTEGRGWGLRTNQALKKGDFVTEYVGEVIDSEECQQR 1347
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D VIDA KGN SR+INHSC PN E QKW ++G+ RIGIFA
Sbjct: 1348 IKRAHENHVTNFYMLTLTKDRVIDAGPKGNSSRFINHSCSPNCETQKWTVNGDVRIGIFA 1407
Query: 217 TRDIKKGENLTYDY 230
DI+ G LT++Y
Sbjct: 1408 LCDIEAGTELTFNY 1421
>gi|449665927|ref|XP_002164851.2| PREDICTED: histone-lysine N-methyltransferase NSD2-like [Hydra
magnipapillata]
Length = 1214
Score = 147 bits (371), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 89/221 (40%), Positives = 124/221 (56%), Gaps = 5/221 (2%)
Query: 16 AFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPG 75
AF +L KQ E KP+ P+ IK N ++ R + D + C S
Sbjct: 755 AFAELEKQRAEVSEILGRKITNKPQ--PFKLIKSNKPVS--CIRNILDQSEWPVCECSKE 810
Query: 76 SSGVCDRDCHCGMLLSSCSS-GCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDI 134
+ D +C MLL C++ C GN C N+ Q KK + E G G++AD DI
Sbjct: 811 TFCSSDSECLNRMLLFECNAKTCPAGNLCQNQQIQKNESKKCHPFKCEGRGWGLMADTDI 870
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K+GEFVIEYVGE+ID++TC R+ + ++Y I++D +IDA KGN SR++NHS
Sbjct: 871 KQGEFVIEYVGELIDEETCHRRVREYHEKDIFDYYFLTIDKDNIIDAYPKGNMSRFMNHS 930
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
C PN E QKW ++GE R+ +FATRDIK GE L ++Y + L
Sbjct: 931 CNPNCETQKWTVNGEIRVALFATRDIKMGEELCFNYNLDSL 971
>gi|159473232|ref|XP_001694743.1| histone methyltransferase [Chlamydomonas reinhardtii]
gi|158276555|gb|EDP02327.1| histone methyltransferase [Chlamydomonas reinhardtii]
Length = 1105
Score = 147 bits (370), Expect = 6e-33, Method: Composition-based stats.
Identities = 81/211 (38%), Positives = 115/211 (54%), Gaps = 4/211 (1%)
Query: 37 IKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS 94
+K + IP V+ +K N+ ++ K + EDD + C C C DC ML C
Sbjct: 119 LKRRRIPAVWQLLKDNVLTHRKRKVQDEDDIMICHCKPVWRGGDGCGPDCINRMLCIECV 178
Query: 95 SG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTC 153
G C + C N+ F R +++ + G G+ A EDIK G+F+IEY+GEV+++
Sbjct: 179 PGFCPSEDKCTNQMFSKRMYANLEIRRAGAKGFGLFALEDIKAGQFIIEYIGEVLEEDEY 238
Query: 154 EERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIG 213
+ R +G+ ++Y I VIDA KGN SR+INHSC PN E QKW++ GE IG
Sbjct: 239 QRRKEYYMSVGQRHYYFMNIGNGEVIDACRKGNISRFINHSCEPNCETQKWLVHGELAIG 298
Query: 214 IFATRDIKKGENLTYDYQYEFLHDS-LIAYC 243
+FA RDI K LT+DY +E D + YC
Sbjct: 299 LFAVRDIPKDTELTFDYNFERYGDKPMRCYC 329
>gi|321468162|gb|EFX79148.1| hypothetical protein DAPPUDRAFT_319776 [Daphnia pulex]
Length = 1408
Score = 146 bits (369), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 78/205 (38%), Positives = 114/205 (55%), Gaps = 11/205 (5%)
Query: 36 FIKPKAIPYVFIKRNIYLTKR-----IKRRLEDDGIFCSCTASPGSSGV--CDRDCHCGM 88
I+ + + ++ N YLT R +KR L D CS T + G C DC +
Sbjct: 386 IIQERLKSFEIVEENQYLTSRKTSKEVKRMLCD----CSLTKEEIARGELGCGEDCINRL 441
Query: 89 LLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVI 148
L+ C C+ C NK FQ R K+++ TEK G G+ A +D+ G+F+IEYVGEVI
Sbjct: 442 LMIECGPRCQLAARCTNKRFQKRQYGKIEVFNTEKKGVGLRALQDMDPGDFIIEYVGEVI 501
Query: 149 DDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDG 208
D + R ++Y + D +IDAT +GN SR+INHSC PN E QKW ++G
Sbjct: 502 DPREFHRRAKDYAREKNKHYYFMALKSDAIIDATQQGNVSRFINHSCDPNAETQKWTVNG 561
Query: 209 ETRIGIFATRDIKKGENLTYDYQYE 233
+ R+G FA + +K G+ +T+DYQ++
Sbjct: 562 DLRVGFFARKSLKSGDEVTFDYQFQ 586
>gi|195478285|ref|XP_002100470.1| GE17076 [Drosophila yakuba]
gi|194187994|gb|EDX01578.1| GE17076 [Drosophila yakuba]
Length = 2397
Score = 146 bits (369), Expect = 7e-33, Method: Composition-based stats.
Identities = 81/205 (39%), Positives = 110/205 (53%), Gaps = 6/205 (2%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSC--TASPGSSG--VCDRDCHCGMLLSSCSSGCKC 99
+ +K N Y R + E+ + C C T + G C C ML+ C C
Sbjct: 1355 FQLLKENYYRCAR-QVSQENAEMQCDCFLTGDEEAQGHLSCGAGCINRMLMIECGPLCTN 1413
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
G C NK FQ ++ +TEK G GI A+ I GEF++EYVGEVID + E R
Sbjct: 1414 GARCTNKRFQQHQCWPCRVFRTEKKGCGITAELQIPPGEFIMEYVGEVIDSEEFERRQHL 1473
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
++Y + + +IDAT KGN SRYINHSC PN E QKW ++GE RIG F+ +
Sbjct: 1474 YSKDRNRHYYFMALRGEAIIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKP 1533
Query: 220 IKKGENLTYDYQYE-FLHDSLIAYC 243
I+ GE +T+DYQY+ + D+ YC
Sbjct: 1534 IQPGEEITFDYQYQRYGRDAQRCYC 1558
>gi|194766778|ref|XP_001965501.1| GF22528 [Drosophila ananassae]
gi|190619492|gb|EDV35016.1| GF22528 [Drosophila ananassae]
Length = 2414
Score = 146 bits (368), Expect = 8e-33, Method: Composition-based stats.
Identities = 80/202 (39%), Positives = 111/202 (54%), Gaps = 6/202 (2%)
Query: 47 IKRNIYLTKRIKRRLEDDGIFCSC--TASPGSSG--VCDRDCHCGMLLSSCSSGCKCGNS 102
++ N Y R + E+ + C C T + G C C ML+ C C G+
Sbjct: 1354 LRENYYRCAR-QVSQENAEMQCDCFLTGDEEAQGHLCCGSGCINRMLMIECGPLCSNGDR 1412
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C NK FQ ++ +TEK G GI A+ I GEF++EYVGEVID + E R
Sbjct: 1413 CTNKRFQQHQCWPCRVFRTEKKGCGITAELQIPPGEFIMEYVGEVIDSEEFERRQHLYSR 1472
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
+ ++Y + + +IDAT KGN SRYINHSC PN E QKW ++GE RIG F+ + I+
Sbjct: 1473 DRKRHYYFMALRGEAIIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKTIQP 1532
Query: 223 GENLTYDYQYE-FLHDSLIAYC 243
GE +T+DYQY+ + D+ YC
Sbjct: 1533 GEEITFDYQYQRYGRDAQRCYC 1554
>gi|224084984|ref|XP_002307459.1| SET domain protein [Populus trichocarpa]
gi|222856908|gb|EEE94455.1| SET domain protein [Populus trichocarpa]
Length = 594
Score = 146 bits (368), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 81/194 (41%), Positives = 106/194 (54%), Gaps = 1/194 (0%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CK 98
K + I N +L + K + D+ + C C A C +C ML C G C
Sbjct: 86 KEYEFTRITTNQFLHRSRKTQTIDEIMVCYCKAPVAGRLGCGDECLNRMLNIECVQGTCP 145
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
CG+ C N+ FQ R KM + K G G+ DEDI RG+F+IEYVGEV+D E R
Sbjct: 146 CGDHCSNQQFQKRNYAKMTWERCGKKGFGLRLDEDISRGQFLIEYVGEVLDVHAYEARQK 205
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
G +FY ++ VIDA KGN R+INHSC PN +KW+++GE IG+FA R
Sbjct: 206 DYASKGHKHFYFMTLDGSEVIDACAKGNLGRFINHSCDPNCRTEKWVVNGEICIGLFALR 265
Query: 219 DIKKGENLTYDYQY 232
DIK GE +T+DY Y
Sbjct: 266 DIKMGEEVTFDYNY 279
>gi|332026544|gb|EGI66662.1| Histone-lysine N-methyltransferase SETD2 [Acromyrmex echinatior]
Length = 1841
Score = 146 bits (368), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 82/216 (37%), Positives = 118/216 (54%), Gaps = 7/216 (3%)
Query: 23 QIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASP-----GSS 77
+I P+ EL D ++ + + + N+YLT+R + E + C C + G
Sbjct: 778 KISLPMVLELEDREMEERLSQFEHLYENLYLTERYTNK-ETKRMVCDCFLTEEEIERGEL 836
Query: 78 GVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRG 137
G C DC +L+ C C G+ C NK FQN K ++ +TEK G G+ A DI G
Sbjct: 837 G-CGEDCLNRLLMIECGPRCVVGDRCTNKRFQNCQYAKCEVFRTEKKGFGLRAVVDIMAG 895
Query: 138 EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
EF++EYVGEV+D + R + ++Y + D +IDAT KGN SR+INHSC P
Sbjct: 896 EFIMEYVGEVVDPKDFRRRAKEYSKDKNRHYYFMALKSDQIIDATMKGNISRFINHSCDP 955
Query: 198 NTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
N E QKW ++GE RIG F + I GE +T+DY ++
Sbjct: 956 NAETQKWTVNGELRIGFFNKKFIAAGEEITFDYHFQ 991
>gi|350421470|ref|XP_003492853.1| PREDICTED: hypothetical protein LOC100746901 [Bombus impatiens]
Length = 1777
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 79/212 (37%), Positives = 119/212 (56%), Gaps = 7/212 (3%)
Query: 27 PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASP-----GSSGVCD 81
P+ E+ D ++ + + +++ N+YLT+R + E + C C + G G C
Sbjct: 731 PMVPEIKDREMEERLSQFEYLRENLYLTERYTNK-ETKRMVCDCFLTEEEIERGELG-CG 788
Query: 82 RDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVI 141
DC +L+ C C G+ C NK FQN K ++ +TEK G G+ A D+ GEF++
Sbjct: 789 EDCLNRLLMIECGPRCVVGDRCTNKRFQNCEYAKCEVFRTEKKGFGLRAMVDLLAGEFIM 848
Query: 142 EYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
EYVGEV+D + R + ++Y + D +IDAT KGN SR+INHSC PN+E
Sbjct: 849 EYVGEVVDPKDFRRRAKEYSKDKNKHYYFMALKSDQIIDATLKGNVSRFINHSCDPNSET 908
Query: 202 QKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
QKW ++GE RIG F + I GE +T+DY ++
Sbjct: 909 QKWTVNGELRIGFFNKKFIAAGEEITFDYHFQ 940
>gi|47225482|emb|CAG11965.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1625
Score = 145 bits (366), Expect = 1e-32, Method: Composition-based stats.
Identities = 77/205 (37%), Positives = 115/205 (56%), Gaps = 9/205 (4%)
Query: 38 KPKAIPYVF--IKRNIYLTKRIKRRLEDD--GIFCSCTASPG---SSGV--CDRDCHCGM 88
K K +P F I+ N+YLT+R K + D + C C P S G C DC +
Sbjct: 207 KSKKMPPYFDLIEENLYLTERKKSKSHRDIKRMQCECPVLPREERSKGAMACGEDCLNRL 266
Query: 89 LLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVI 148
L+ CSS C+ G C N+ FQ R + ++ TE G G+ A +D+ FV+EY GEV+
Sbjct: 267 LMIECSSRCQNGAYCSNRRFQMRQHAEFDVILTENKGWGLRAAKDLPSNTFVLEYCGEVL 326
Query: 149 DDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDG 208
D + + R+ + ++Y + + +IDAT KGN SR++NHSC PN E QKW ++G
Sbjct: 327 DHKEFKTRVKEYARNKNIHYYFMSLKNNEIIDATLKGNLSRFMNHSCEPNCETQKWTVNG 386
Query: 209 ETRIGIFATRDIKKGENLTYDYQYE 233
+ R+G F T+ + G LT+DYQ++
Sbjct: 387 QLRVGFFTTKAVTAGTELTFDYQFQ 411
>gi|167526044|ref|XP_001747356.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774191|gb|EDQ87823.1| predicted protein [Monosiga brevicollis MX1]
Length = 734
Score = 145 bits (366), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 86/224 (38%), Positives = 118/224 (52%), Gaps = 21/224 (9%)
Query: 17 FNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS 76
+L +Q V + DW I Y I+ IY K K LE+D + C C
Sbjct: 28 LQQLREQHKKAVHYTANDW------IEYEQIEAPIYKAKMHK--LEED-MECEC-----Q 73
Query: 77 SGVCDRDCHCG------MLLSSCS-SGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIV 129
D D +CG +L+ C+ + C CGN C N+ Q + ++++ +TEK G G+
Sbjct: 74 PNAADEDSYCGSNCLNRLLMVECNVARCPCGNKCRNRRLQKQQHARVEIFKTEKKGWGLR 133
Query: 130 ADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSR 189
A E I++G+F+ EY GEV D ER + G ++Y ++ D VIDAT KG SR
Sbjct: 134 ALEPIRKGDFIYEYCGEVFDQAVFRERQLEYAQEGRFHYYFMSLSADTVIDATRKGAVSR 193
Query: 190 YINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
+INHSC PN E QKW + G RIG F RDI E +T+DYQYE
Sbjct: 194 FINHSCDPNAETQKWTVGGVLRIGFFCIRDIAVNEEITFDYQYE 237
>gi|194895514|ref|XP_001978270.1| GG17783 [Drosophila erecta]
gi|190649919|gb|EDV47197.1| GG17783 [Drosophila erecta]
Length = 2384
Score = 145 bits (366), Expect = 1e-32, Method: Composition-based stats.
Identities = 81/205 (39%), Positives = 110/205 (53%), Gaps = 6/205 (2%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSC--TASPGSSG--VCDRDCHCGMLLSSCSSGCKC 99
+ +K N Y R + E+ + C C T + G C C ML+ C C
Sbjct: 1342 FQLLKENYYRCAR-QVSQENAEMQCDCFLTGDEEAQGHLSCGAGCINRMLMIECGPLCTN 1400
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
G C NK FQ ++ +TEK G GI A+ I GEF++EYVGEVID + E R
Sbjct: 1401 GARCTNKRFQQHQCWPCRVFRTEKKGCGITAELQIPPGEFIMEYVGEVIDSEEFERRQHL 1460
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
++Y + + +IDAT KGN SRYINHSC PN E QKW ++GE RIG F+ +
Sbjct: 1461 YSKDRNRHYYFMALRGEAIIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKP 1520
Query: 220 IKKGENLTYDYQYE-FLHDSLIAYC 243
I+ GE +T+DYQY+ + D+ YC
Sbjct: 1521 IQPGEEITFDYQYQRYGRDAQRCYC 1545
>gi|340726897|ref|XP_003401788.1| PREDICTED: hypothetical protein LOC100652142 [Bombus terrestris]
Length = 1777
Score = 145 bits (366), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 79/212 (37%), Positives = 119/212 (56%), Gaps = 7/212 (3%)
Query: 27 PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASP-----GSSGVCD 81
P+ E+ D ++ + + +++ N+YLT+R + E + C C + G G C
Sbjct: 731 PMVPEIKDREMEERLSQFEYLRENLYLTERYTNK-ETKRMVCDCFLTEEEIERGELG-CG 788
Query: 82 RDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVI 141
DC +L+ C C G+ C NK FQN K ++ +TEK G G+ A D+ GEF++
Sbjct: 789 EDCLNRLLMIECGPRCVVGDRCTNKRFQNCEYAKCEVFRTEKKGFGLRAMVDLLAGEFIM 848
Query: 142 EYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
EYVGEV+D + R + ++Y + D +IDAT KGN SR+INHSC PN+E
Sbjct: 849 EYVGEVVDPKDFRRRAKEYSKDKNKHYYFMALKSDQIIDATLKGNVSRFINHSCDPNSET 908
Query: 202 QKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
QKW ++GE RIG F + I GE +T+DY ++
Sbjct: 909 QKWTVNGELRIGFFNKKFIAAGEEITFDYHFQ 940
>gi|302801428|ref|XP_002982470.1| hypothetical protein SELMODRAFT_421873 [Selaginella moellendorffii]
gi|300149569|gb|EFJ16223.1| hypothetical protein SELMODRAFT_421873 [Selaginella moellendorffii]
Length = 1285
Score = 145 bits (366), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 78/204 (38%), Positives = 112/204 (54%), Gaps = 1/204 (0%)
Query: 41 AIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKC 99
+ + I+ NI+ + K +DD + C C S C DC M+ CS C C
Sbjct: 501 GVSWTQIRHNIFQHRHQKTLDDDDTLICLCKPPKDGSPGCGEDCLNRMVNVECSPDTCPC 560
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
G C N+ F R + LV+ K G G+ A E+I +G FVIEYVGEV+D ++ E R +
Sbjct: 561 GERCSNQQFGKREYSNVALVRCGKKGFGLKALENIAKGSFVIEYVGEVLDSRSFELRQKE 620
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
+ +FY +N VIDA KGN R+INHSC PN + +KW ++GE IG+FA RD
Sbjct: 621 YARQRQKHFYFMTLNSSEVIDACRKGNLGRFINHSCEPNCQTEKWCVNGEICIGLFAIRD 680
Query: 220 IKKGENLTYDYQYEFLHDSLIAYC 243
+ K E +T++Y +E L+ + C
Sbjct: 681 VAKNEEITFNYNFERLYGAAAKKC 704
>gi|224063022|ref|XP_002300966.1| SET domain protein [Populus trichocarpa]
gi|222842692|gb|EEE80239.1| SET domain protein [Populus trichocarpa]
Length = 605
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 82/195 (42%), Positives = 109/195 (55%), Gaps = 2/195 (1%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS-GVCDRDCHCGMLLSSCSSG-C 97
K + I N +L + K + D+ + C C A G G C +C ML C G C
Sbjct: 51 KEYEFTRITTNQFLHRTRKTQTIDEIMVCYCKAPVGGRLGGCGDECLNRMLNIECVQGTC 110
Query: 98 KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
CG+ C N+ FQ KM + K G G+ +EDI RG+F+IEYVGEV+D E R
Sbjct: 111 PCGDLCSNQQFQKHNYAKMTWDRCGKKGFGLRLEEDITRGQFLIEYVGEVLDVHAYEARQ 170
Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
+ G +FY ++ VIDA KGN R+INHSC PN +KW+++GE IG+FA
Sbjct: 171 KEYASKGHKHFYFMTLDGSEVIDACVKGNLGRFINHSCDPNCRTEKWVVNGEICIGLFAL 230
Query: 218 RDIKKGENLTYDYQY 232
RDIKKGE +T+DY Y
Sbjct: 231 RDIKKGEEVTFDYNY 245
>gi|383860108|ref|XP_003705533.1| PREDICTED: uncharacterized protein LOC100883855 [Megachile
rotundata]
Length = 1766
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 79/212 (37%), Positives = 118/212 (55%), Gaps = 7/212 (3%)
Query: 27 PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASP-----GSSGVCD 81
P+ E+ D ++ + + ++ N+YLT+R + E + C C + G G C
Sbjct: 723 PMVPEIKDREMEERLSQFEYLSENLYLTERYTNK-ETKRMVCDCFLTEEEIERGELG-CG 780
Query: 82 RDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVI 141
DC +L+ C C G+ C NK FQN K ++ +TEK G G+ A D+ GEF++
Sbjct: 781 EDCLNRLLMIECGPRCVVGDRCTNKRFQNCEYAKCEVFRTEKKGFGLRAMADMLAGEFIM 840
Query: 142 EYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
EYVGEV+D + R + ++Y + D +IDAT KGN SR+INHSC PN+E
Sbjct: 841 EYVGEVVDPKDFRRRAKEYSKDKNRHYYFMALKSDQIIDATMKGNVSRFINHSCDPNSET 900
Query: 202 QKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
QKW ++GE RIG F + I GE +T+DY ++
Sbjct: 901 QKWTVNGELRIGFFNKKFIAAGEEITFDYHFQ 932
>gi|242073096|ref|XP_002446484.1| hypothetical protein SORBIDRAFT_06g016720 [Sorghum bicolor]
gi|241937667|gb|EES10812.1| hypothetical protein SORBIDRAFT_06g016720 [Sorghum bicolor]
Length = 521
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 81/198 (40%), Positives = 115/198 (58%), Gaps = 12/198 (6%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG------MLLSSCSSG- 96
Y+ I RN + ++ KR+ E+D C C + D + CG + + C+ G
Sbjct: 13 YIHIDRNDFSYRKHKRQTEEDIAVCECQYD-----ILDPESPCGDRCLNLLTNTECTPGY 67
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C+CG C N+ FQ + +LV+TE G G+VADE+I G+FVIEY GEVI + + R
Sbjct: 68 CRCGVYCKNQRFQKCQYARTRLVRTEGRGWGLVADENIMAGQFVIEYCGEVISWKESKRR 127
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ G + Y+ +N D IDAT KGN +R+INHSC PN E +KW + GE R+GIFA
Sbjct: 128 AQAYETQGLKDAYIIYLNADESIDATRKGNFARFINHSCQPNCETRKWNVLGEVRVGIFA 187
Query: 217 TRDIKKGENLTYDYQYEF 234
+DI G L+YDY +E+
Sbjct: 188 KQDIPFGTELSYDYNFEW 205
>gi|359078405|ref|XP_002697155.2| PREDICTED: histone-lysine N-methyltransferase SETD2 [Bos taurus]
Length = 1448
Score = 145 bits (365), Expect = 2e-32, Method: Composition-based stats.
Identities = 75/207 (36%), Positives = 115/207 (55%), Gaps = 9/207 (4%)
Query: 36 FIKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHC 86
+ K +P F I+ N+YLT+R K + D C +P S C DC
Sbjct: 386 YAKQGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLN 445
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
+L+ CSS C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GE
Sbjct: 446 RLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 505
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
V+D + + R+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 506 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 565
Query: 207 DGETRIGIFATRDIKKGENLTYDYQYE 233
+G+ R+G F T+ + G LT+DYQ++
Sbjct: 566 NGQLRVGFFTTKLVPSGSELTFDYQFQ 592
>gi|380019005|ref|XP_003693408.1| PREDICTED: uncharacterized protein LOC100869667 [Apis florea]
Length = 1392
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 80/216 (37%), Positives = 120/216 (55%), Gaps = 7/216 (3%)
Query: 23 QIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASP-----GSS 77
+I P+ E+ D ++ + + ++ N+YLT+R + E + C C + G
Sbjct: 344 RISLPMVSEIEDREMEERLSQFENLRENLYLTERYTNK-ETKRMVCDCFLTEEEIERGEL 402
Query: 78 GVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRG 137
G C DC +L+ C C G+ C NK FQN K ++ +TEK G G+ A D+ G
Sbjct: 403 G-CGEDCLNRLLMIECGPRCVVGDRCTNKRFQNCEYAKCEVFRTEKKGFGLRAMVDLLAG 461
Query: 138 EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
EF++EYVGEV+D + R + ++Y + D +IDAT KGN SR+INHSC P
Sbjct: 462 EFIMEYVGEVVDPKDFRRRAKEYSKDKNKHYYFMALKSDQIIDATMKGNVSRFINHSCDP 521
Query: 198 NTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
N+E QKW ++GE RIG F + I GE +T+DY ++
Sbjct: 522 NSETQKWTVNGELRIGFFNKKFIAAGEEITFDYHFQ 557
>gi|449492020|ref|XP_004174653.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD2 [Taeniopygia guttata]
Length = 2489
Score = 144 bits (364), Expect = 3e-32, Method: Composition-based stats.
Identities = 76/207 (36%), Positives = 117/207 (56%), Gaps = 9/207 (4%)
Query: 36 FIKPKAIPYVF--IKRNIYLTKRIKRRLEDD--GIFCSC---TASPGSSG--VCDRDCHC 86
F K +P F I+ N+YLT+R K + D + C C + + G C DC
Sbjct: 1447 FAKQGKMPCYFDLIEENVYLTERKKNKSHRDIKRMLCECPPLSKEERAQGEVACGEDCLN 1506
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
+L+ CSS C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GE
Sbjct: 1507 RLLMIECSSRCPNGDYCSNRRFQKKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1566
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
V+D + + R+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1567 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1626
Query: 207 DGETRIGIFATRDIKKGENLTYDYQYE 233
+G+ R+G F T+ + G LT+DYQ++
Sbjct: 1627 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1653
>gi|195012609|ref|XP_001983710.1| GH16034 [Drosophila grimshawi]
gi|193897192|gb|EDV96058.1| GH16034 [Drosophila grimshawi]
Length = 2059
Score = 144 bits (364), Expect = 3e-32, Method: Composition-based stats.
Identities = 81/206 (39%), Positives = 108/206 (52%), Gaps = 7/206 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSC-----TASPGSSGVCDRDCHCGMLLSSCSSGCK 98
+ ++ N Y R R E+ + C C S G C C ML+ C C
Sbjct: 1084 FQMLRDNYYRCARQVSR-ENAEMQCDCFVTGDEESLGQMRCCGDGCINRMLMIECGPLCT 1142
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
G C NK FQ ++ +TEK G GI A+ + GEF++EYVGEVID + E R
Sbjct: 1143 NGERCTNKRFQQHQGWPCRVFRTEKKGCGITAELQMPSGEFIMEYVGEVIDSEEFERRQH 1202
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
++Y + D +IDAT KGN SRYINHSC PN E QKW ++GE RIG F+ +
Sbjct: 1203 LYSEDRNRHYYFMALRSDSIIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSLK 1262
Query: 219 DIKKGENLTYDYQYE-FLHDSLIAYC 243
I GE +T+DYQY+ + D+ YC
Sbjct: 1263 TIMPGEEITFDYQYQRYGRDAQRCYC 1288
>gi|322788177|gb|EFZ13959.1| hypothetical protein SINV_06678 [Solenopsis invicta]
Length = 1093
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 73/149 (48%), Positives = 95/149 (63%), Gaps = 1/149 (0%)
Query: 83 DCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVI 141
DC +L+ CS G C + C+N+ F R M+ TE G G+ A EDI G+FVI
Sbjct: 858 DCLNRILMVECSPGICPAEDKCMNQSFVLRQYPAMEPFHTEGRGWGLRALEDIAAGKFVI 917
Query: 142 EYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
EYVGE+IDD + RL + K L NFY I+ + IDA KGN SR++NHSC PN E
Sbjct: 918 EYVGEIIDDAEYKRRLHRKKELKNENFYFLTIDNNRTIDAEPKGNLSRFMNHSCAPNCET 977
Query: 202 QKWIIDGETRIGIFATRDIKKGENLTYDY 230
QKW ++G+TRIG+FA RDI+ GE LT++Y
Sbjct: 978 QKWTVNGDTRIGLFALRDIESGEELTFNY 1006
>gi|345480373|ref|XP_001606723.2| PREDICTED: hypothetical protein LOC100123115 [Nasonia vitripennis]
Length = 1746
Score = 144 bits (364), Expect = 3e-32, Method: Composition-based stats.
Identities = 74/192 (38%), Positives = 110/192 (57%), Gaps = 7/192 (3%)
Query: 47 IKRNIYLTKRIKRRLEDDGIFCSCTASP-----GSSGVCDRDCHCGMLLSSCSSGCKCGN 101
+K N+YLT+R + E + C C + G G C DC +L+ C S C G+
Sbjct: 775 LKENLYLTERFTSK-ETKRMVCECFLTEEEFQRGELG-CGEDCLNRLLMIECGSRCVVGD 832
Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
C NK FQN ++ +TEK G G+ A +++ G+F++EYVGEV+D + +R +
Sbjct: 833 RCTNKRFQNCEYANCEVFRTEKKGFGLRATTNLEAGDFIMEYVGEVLDPKDFRKRAKEYS 892
Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
++Y + D +IDAT KGN SR+INHSC PN E QKW ++GE RIG F + +
Sbjct: 893 KDKNRHYYFMALKSDQIIDATMKGNISRFINHSCDPNAETQKWTVNGELRIGFFNKKFVA 952
Query: 222 KGENLTYDYQYE 233
GE +T+DY ++
Sbjct: 953 AGEEITFDYHFQ 964
>gi|328790605|ref|XP_003251435.1| PREDICTED: hypothetical protein LOC100578450 [Apis mellifera]
Length = 1394
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 80/216 (37%), Positives = 120/216 (55%), Gaps = 7/216 (3%)
Query: 23 QIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASP-----GSS 77
+I P+ E+ D ++ + + ++ N+YLT+R + E + C C + G
Sbjct: 344 RISLPMVSEIEDREMEERLSQFENLRENLYLTERYTNK-ETKRMVCDCFLTEEEIERGEL 402
Query: 78 GVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRG 137
G C DC +L+ C C G+ C NK FQN K ++ +TEK G G+ A D+ G
Sbjct: 403 G-CGEDCLNRLLMIECGPRCVVGDRCTNKRFQNCEYAKCEVFRTEKKGFGLRAMVDLLAG 461
Query: 138 EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
EF++EYVGEV+D + R + ++Y + D +IDAT KGN SR+INHSC P
Sbjct: 462 EFIMEYVGEVVDPKDFRRRAKEYSKDKNKHYYFMALKSDQIIDATMKGNVSRFINHSCDP 521
Query: 198 NTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
N+E QKW ++GE RIG F + I GE +T+DY ++
Sbjct: 522 NSETQKWTVNGELRIGFFNKKFIAAGEEITFDYHFQ 557
>gi|242061944|ref|XP_002452261.1| hypothetical protein SORBIDRAFT_04g022620 [Sorghum bicolor]
gi|241932092|gb|EES05237.1| hypothetical protein SORBIDRAFT_04g022620 [Sorghum bicolor]
Length = 1840
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 75/194 (38%), Positives = 107/194 (55%), Gaps = 1/194 (0%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSS-GCK 98
K + ++ N++L + + + D+ + C+C C C ML C+ C
Sbjct: 1056 KPPSWTLVRSNLFLHRNRRTQSIDESMVCNCKPPQDGRMGCRDGCLNRMLNIECAKRTCP 1115
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
CG C N+ FQ R K++ + K G G+ ED+ G F+IEYVGEV+D + E R
Sbjct: 1116 CGEQCSNQKFQRRSYAKLRWFYSGKKGYGLQLQEDVTEGRFLIEYVGEVLDITSYESRQR 1175
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
G+ +FY +N VIDA KGN R+INHSC PN +KW+++GE IGIF+ R
Sbjct: 1176 YYASKGQKHFYFMALNGGEVIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFSLR 1235
Query: 219 DIKKGENLTYDYQY 232
+IKKGE LT+DY Y
Sbjct: 1236 NIKKGEELTFDYNY 1249
>gi|195126250|ref|XP_002007587.1| GI12297 [Drosophila mojavensis]
gi|193919196|gb|EDW18063.1| GI12297 [Drosophila mojavensis]
Length = 1972
Score = 144 bits (363), Expect = 3e-32, Method: Composition-based stats.
Identities = 80/212 (37%), Positives = 113/212 (53%), Gaps = 4/212 (1%)
Query: 36 FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGV---CDRDCHCGMLLSS 92
F++ + + I R+ Y + E+ + C C + G C C ML+
Sbjct: 968 FLRQEGLNTFQILRDNYYRCARQVSKENAEMQCDCFVTGDEEGQGMRCGDGCINRMLMIE 1027
Query: 93 CSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
C C G+ C NK FQ ++ +TEK G GI A+ I+ GEF++EYVGEVID +
Sbjct: 1028 CGPLCSYGDRCTNKRFQQHQGWPCRVFRTEKKGCGITAELQIQPGEFIMEYVGEVIDSEE 1087
Query: 153 CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
E R ++Y + + +IDAT KGN SRYINHSC PN E QKW ++GE RI
Sbjct: 1088 FERRQHLYSEDRNRHYYFMALRGEAIIDATTKGNISRYINHSCDPNAETQKWTVNGELRI 1147
Query: 213 GIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
G F+ + I GE +T+DYQY+ + D+ YC
Sbjct: 1148 GFFSVKTIMPGEEITFDYQYQRYGRDAQRCYC 1179
>gi|198467361|ref|XP_001354372.2| GA14357 [Drosophila pseudoobscura pseudoobscura]
gi|198149208|gb|EAL31425.2| GA14357 [Drosophila pseudoobscura pseudoobscura]
Length = 2918
Score = 144 bits (363), Expect = 3e-32, Method: Composition-based stats.
Identities = 79/202 (39%), Positives = 111/202 (54%), Gaps = 6/202 (2%)
Query: 47 IKRNIYLTKRIKRRLEDDGIFCSC--TASPGSSG--VCDRDCHCGMLLSSCSSGCKCGNS 102
++ N Y R + E+ + C C T + G C C ML+ C C G+
Sbjct: 1841 LRENYYRCAR-QVSQENAEMQCDCFLTGDEEAQGHLCCGAGCINRMLMIECGPLCTNGDR 1899
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C NK FQ ++ +TEK G GI A+ I GEF++EYVGEVID + E R +
Sbjct: 1900 CTNKRFQLHQCWPCRVFRTEKKGCGITAELQIPAGEFIMEYVGEVIDSEEFERRQHRYSK 1959
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
++Y + + +IDAT +GN SRYINHSC PN E QKW ++GE RIG F+ ++I
Sbjct: 1960 DRNRHYYFMALRGEAIIDATMRGNISRYINHSCDPNAETQKWTVNGELRIGFFSLKNILP 2019
Query: 223 GENLTYDYQYE-FLHDSLIAYC 243
GE +T+DYQY+ + D+ YC
Sbjct: 2020 GEEITFDYQYQRYGRDAQRCYC 2041
>gi|354484245|ref|XP_003504300.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Cricetulus
griseus]
gi|344236054|gb|EGV92157.1| Histone-lysine N-methyltransferase SETD2 [Cricetulus griseus]
Length = 2412
Score = 144 bits (363), Expect = 3e-32, Method: Composition-based stats.
Identities = 76/207 (36%), Positives = 115/207 (55%), Gaps = 9/207 (4%)
Query: 36 FIKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHC 86
F K +P F I+ N+YLT+R K + D C +P S C DC
Sbjct: 1312 FAKLGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEVACGEDCLN 1371
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
+L+ CSS C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GE
Sbjct: 1372 RLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1431
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
V+D + + R+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1432 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1491
Query: 207 DGETRIGIFATRDIKKGENLTYDYQYE 233
+G+ R+G F T+ + G LT+DYQ++
Sbjct: 1492 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1518
>gi|197927225|ref|NP_001074809.2| SET domain containing 2 [Mus musculus]
Length = 2537
Score = 144 bits (363), Expect = 3e-32, Method: Composition-based stats.
Identities = 76/207 (36%), Positives = 115/207 (55%), Gaps = 9/207 (4%)
Query: 36 FIKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHC 86
F K +P F I+ N+YLT+R K + D C +P S C DC
Sbjct: 1437 FAKLGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEVACGEDCLN 1496
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
+L+ CSS C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GE
Sbjct: 1497 RLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1556
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
V+D + + R+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1557 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1616
Query: 207 DGETRIGIFATRDIKKGENLTYDYQYE 233
+G+ R+G F T+ + G LT+DYQ++
Sbjct: 1617 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1643
>gi|432092361|gb|ELK24976.1| Histone-lysine N-methyltransferase SETD2 [Myotis davidii]
Length = 2865
Score = 144 bits (363), Expect = 4e-32, Method: Composition-based stats.
Identities = 72/195 (36%), Positives = 110/195 (56%), Gaps = 7/195 (3%)
Query: 46 FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSGCK 98
I+ N+YLT+R K + D C +P S C DC +L+ CSS C
Sbjct: 1778 LIEENVYLTERKKNKSHRDIKRMQCECAPLSKEERAQGEIACGEDCLNRLLMIECSSRCP 1837
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R+
Sbjct: 1838 NGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKARVK 1897
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
+ ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F T+
Sbjct: 1898 EYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTK 1957
Query: 219 DIKKGENLTYDYQYE 233
+ G LT+DYQ++
Sbjct: 1958 LVPSGSELTFDYQFQ 1972
>gi|281343603|gb|EFB19187.1| hypothetical protein PANDA_000629 [Ailuropoda melanoleuca]
Length = 2535
Score = 144 bits (363), Expect = 4e-32, Method: Composition-based stats.
Identities = 75/207 (36%), Positives = 115/207 (55%), Gaps = 9/207 (4%)
Query: 36 FIKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHC 86
+ K +P F I+ N+YLT+R K + D C +P S C DC
Sbjct: 1434 YAKQGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLN 1493
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
+L+ CSS C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GE
Sbjct: 1494 RLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1553
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
V+D + + R+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1554 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1613
Query: 207 DGETRIGIFATRDIKKGENLTYDYQYE 233
+G+ R+G F T+ + G LT+DYQ++
Sbjct: 1614 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1640
>gi|301754075|ref|XP_002912890.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Ailuropoda
melanoleuca]
Length = 2549
Score = 144 bits (363), Expect = 4e-32, Method: Composition-based stats.
Identities = 75/207 (36%), Positives = 115/207 (55%), Gaps = 9/207 (4%)
Query: 36 FIKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHC 86
+ K +P F I+ N+YLT+R K + D C +P S C DC
Sbjct: 1448 YAKQGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLN 1507
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
+L+ CSS C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GE
Sbjct: 1508 RLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1567
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
V+D + + R+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1568 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1627
Query: 207 DGETRIGIFATRDIKKGENLTYDYQYE 233
+G+ R+G F T+ + G LT+DYQ++
Sbjct: 1628 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1654
>gi|395516140|ref|XP_003762252.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Sarcophilus
harrisii]
Length = 2570
Score = 144 bits (362), Expect = 4e-32, Method: Composition-based stats.
Identities = 72/195 (36%), Positives = 110/195 (56%), Gaps = 7/195 (3%)
Query: 46 FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSGCK 98
I+ N+YLT+R K + D C +P S C DC +L+ CSS C
Sbjct: 1477 LIEENVYLTERKKNKSHRDIKRMQCECTPVSKDDRAQGEIACGEDCLNRLLMIECSSRCP 1536
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R+
Sbjct: 1537 NGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKARVK 1596
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
+ ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F T+
Sbjct: 1597 EYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTK 1656
Query: 219 DIKKGENLTYDYQYE 233
+ G LT+DYQ++
Sbjct: 1657 LVPSGSELTFDYQFQ 1671
>gi|195352880|ref|XP_002042939.1| GM11634 [Drosophila sechellia]
gi|194126986|gb|EDW49029.1| GM11634 [Drosophila sechellia]
Length = 1965
Score = 144 bits (362), Expect = 4e-32, Method: Composition-based stats.
Identities = 82/205 (40%), Positives = 109/205 (53%), Gaps = 6/205 (2%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSC--TASPGSSG--VCDRDCHCGMLLSSCSSGCKC 99
+ +K N Y R + E+ + C C T + G C C ML+ C C
Sbjct: 1194 FQLLKENFYRCAR-QVSQENAEMQCDCFLTGDEEAQGHLSCGAGCINRMLMIECGPLCSN 1252
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
G C NK FQ ++ +TEK G GI A+ I GEF++EYVGEVID + E R
Sbjct: 1253 GARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFIMEYVGEVIDSEEFERRQHL 1312
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
++Y + + VIDAT KGN SRYINHSC PN E QKW ++GE RIG F+ +
Sbjct: 1313 YSKDRNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKP 1372
Query: 220 IKKGENLTYDYQY-EFLHDSLIAYC 243
I+ GE +T+DYQY + D+ YC
Sbjct: 1373 IQPGEEITFDYQYLRYGRDAQRCYC 1397
>gi|281360813|ref|NP_001162740.1| Set2, isoform B [Drosophila melanogaster]
gi|118582047|sp|Q9VYD1.2|C1716_DROME RecName: Full=Probable histone-lysine N-methyltransferase CG1716
gi|92109778|gb|ABE73213.1| LD27386p [Drosophila melanogaster]
gi|272506087|gb|ACZ95275.1| Set2, isoform B [Drosophila melanogaster]
Length = 2313
Score = 144 bits (362), Expect = 4e-32, Method: Composition-based stats.
Identities = 82/205 (40%), Positives = 109/205 (53%), Gaps = 6/205 (2%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSC--TASPGSSG--VCDRDCHCGMLLSSCSSGCKC 99
+ +K N Y R + E+ + C C T + G C C ML+ C C
Sbjct: 1289 FQLLKENFYRCAR-QVSQENAEMQCDCFLTGDEEAQGHLSCGAGCINRMLMIECGPLCSN 1347
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
G C NK FQ ++ +TEK G GI A+ I GEF++EYVGEVID + E R
Sbjct: 1348 GARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFIMEYVGEVIDSEEFERRQHL 1407
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
++Y + + VIDAT KGN SRYINHSC PN E QKW ++GE RIG F+ +
Sbjct: 1408 YSKDRNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKP 1467
Query: 220 IKKGENLTYDYQY-EFLHDSLIAYC 243
I+ GE +T+DYQY + D+ YC
Sbjct: 1468 IQPGEEITFDYQYLRYGRDAQRCYC 1492
>gi|24641786|ref|NP_572888.2| Set2, isoform A [Drosophila melanogaster]
gi|22832197|gb|AAF48273.2| Set2, isoform A [Drosophila melanogaster]
Length = 2362
Score = 144 bits (362), Expect = 4e-32, Method: Composition-based stats.
Identities = 82/205 (40%), Positives = 109/205 (53%), Gaps = 6/205 (2%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSC--TASPGSSG--VCDRDCHCGMLLSSCSSGCKC 99
+ +K N Y R + E+ + C C T + G C C ML+ C C
Sbjct: 1338 FQLLKENFYRCAR-QVSQENAEMQCDCFLTGDEEAQGHLSCGAGCINRMLMIECGPLCSN 1396
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
G C NK FQ ++ +TEK G GI A+ I GEF++EYVGEVID + E R
Sbjct: 1397 GARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFIMEYVGEVIDSEEFERRQHL 1456
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
++Y + + VIDAT KGN SRYINHSC PN E QKW ++GE RIG F+ +
Sbjct: 1457 YSKDRNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKP 1516
Query: 220 IKKGENLTYDYQY-EFLHDSLIAYC 243
I+ GE +T+DYQY + D+ YC
Sbjct: 1517 IQPGEEITFDYQYLRYGRDAQRCYC 1541
>gi|334333796|ref|XP_001375978.2| PREDICTED: histone-lysine N-methyltransferase SETD2 [Monodelphis
domestica]
Length = 2592
Score = 144 bits (362), Expect = 4e-32, Method: Composition-based stats.
Identities = 72/195 (36%), Positives = 110/195 (56%), Gaps = 7/195 (3%)
Query: 46 FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSGCK 98
I+ N+YLT+R K + D C +P S C DC +L+ CSS C
Sbjct: 1498 LIEENVYLTERKKNKSHRDIKRMQCECTPVSKDDRAQGEIACGEDCLNRLLMIECSSRCP 1557
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R+
Sbjct: 1558 NGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKARVK 1617
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
+ ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F T+
Sbjct: 1618 EYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTK 1677
Query: 219 DIKKGENLTYDYQYE 233
+ G LT+DYQ++
Sbjct: 1678 LVPSGSELTFDYQFQ 1692
>gi|196013861|ref|XP_002116791.1| hypothetical protein TRIADDRAFT_31338 [Trichoplax adhaerens]
gi|190580769|gb|EDV20850.1| hypothetical protein TRIADDRAFT_31338 [Trichoplax adhaerens]
Length = 725
Score = 144 bits (362), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 77/195 (39%), Positives = 116/195 (59%), Gaps = 6/195 (3%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSC-SSG 96
K Y IK N + I+ L + + C+ P C +C ML+ C SS
Sbjct: 369 KPPSYKHIKMNKLVDALIRPNLTE---YTPCSCKPTDEAPCGSSSNCINRMLMCECNSSM 425
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ K ++ ++ +CG G+ + EDI G V+EYVGE+++++TC +R
Sbjct: 426 CPAGDKCQNQRFQKLEYAKSEIFKSNQCGWGLKSAEDIYAGTLVVEYVGELLNEKTCYQR 485
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + GE NFY+ I++D++IDA KGN +R++NHSC PN E KW ++G T IG+FA
Sbjct: 486 IKMAQSKGEKNFYMLNIDKDVIIDAGQKGNLARFMNHSCQPNCETHKWTVNGLTCIGLFA 545
Query: 217 TRDIKKGENLTYDYQ 231
DIK+GE LT+DY+
Sbjct: 546 IDDIKQGEELTFDYR 560
>gi|308810955|ref|XP_003082786.1| Histone H3 (Lys4) methyltransferase complex, subunit SET1 and
related methyltransferases (ISS) [Ostreococcus tauri]
gi|116061255|emb|CAL56643.1| Histone H3 (Lys4) methyltransferase complex, subunit SET1 and
related methyltransferases (ISS) [Ostreococcus tauri]
Length = 501
Score = 144 bits (362), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 80/218 (36%), Positives = 114/218 (52%), Gaps = 9/218 (4%)
Query: 24 IGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRD 83
+G P +E + P V + + T K+ L D G +P C +
Sbjct: 191 MGKPAAYERINRSTFVSVPPPVKMHISESATCECKQPLRDGG------GTPTLRNGCGPE 244
Query: 84 CHCGMLLSSCSS-GCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIE 142
C L SC S C CG++C N+P P K K+++TE G G++ E + G F++E
Sbjct: 245 CINRKLRFSCDSRTCPCGDACSNRPLSQLPTPKTKIIRTENRGWGLMLQEPVTAGTFIVE 304
Query: 143 YVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQ 202
Y GE++++ RLW K GE NFYL EI+ + VIDA +KG+ +R+IN C PN E Q
Sbjct: 305 YTGEILNEADVANRLWLDKQEGEENFYLMEISTNYVIDAKFKGSLARFINSGCHPNCETQ 364
Query: 203 KWI--IDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+W+ ETR+GIFA DI G LTYDY + D+
Sbjct: 365 RWVDASTNETRVGIFAIEDIPAGTELTYDYHFAHFGDA 402
>gi|440891718|gb|ELR45266.1| Histone-lysine N-methyltransferase SETD2, partial [Bos grunniens
mutus]
Length = 2533
Score = 144 bits (362), Expect = 4e-32, Method: Composition-based stats.
Identities = 75/207 (36%), Positives = 115/207 (55%), Gaps = 9/207 (4%)
Query: 36 FIKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHC 86
+ K +P F I+ N+YLT+R K + D C +P S C DC
Sbjct: 1432 YAKQGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLN 1491
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
+L+ CSS C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GE
Sbjct: 1492 RLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1551
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
V+D + + R+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1552 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1611
Query: 207 DGETRIGIFATRDIKKGENLTYDYQYE 233
+G+ R+G F T+ + G LT+DYQ++
Sbjct: 1612 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1638
>gi|119914792|ref|XP_589886.3| PREDICTED: histone-lysine N-methyltransferase SETD2 [Bos taurus]
Length = 2547
Score = 144 bits (362), Expect = 4e-32, Method: Composition-based stats.
Identities = 75/207 (36%), Positives = 115/207 (55%), Gaps = 9/207 (4%)
Query: 36 FIKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHC 86
+ K +P F I+ N+YLT+R K + D C +P S C DC
Sbjct: 1446 YAKQGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLN 1505
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
+L+ CSS C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GE
Sbjct: 1506 RLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1565
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
V+D + + R+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1566 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1625
Query: 207 DGETRIGIFATRDIKKGENLTYDYQYE 233
+G+ R+G F T+ + G LT+DYQ++
Sbjct: 1626 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1652
>gi|296474690|tpg|DAA16805.1| TPA: Wolf-Hirschhorn syndrome candidate 1 protein-like [Bos taurus]
Length = 2547
Score = 144 bits (362), Expect = 4e-32, Method: Composition-based stats.
Identities = 75/207 (36%), Positives = 115/207 (55%), Gaps = 9/207 (4%)
Query: 36 FIKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHC 86
+ K +P F I+ N+YLT+R K + D C +P S C DC
Sbjct: 1446 YAKQGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLN 1505
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
+L+ CSS C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GE
Sbjct: 1506 RLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1565
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
V+D + + R+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1566 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1625
Query: 207 DGETRIGIFATRDIKKGENLTYDYQYE 233
+G+ R+G F T+ + G LT+DYQ++
Sbjct: 1626 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1652
>gi|195566590|ref|XP_002106863.1| GD17127 [Drosophila simulans]
gi|194204255|gb|EDX17831.1| GD17127 [Drosophila simulans]
Length = 2246
Score = 144 bits (362), Expect = 4e-32, Method: Composition-based stats.
Identities = 82/205 (40%), Positives = 109/205 (53%), Gaps = 6/205 (2%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSC--TASPGSSG--VCDRDCHCGMLLSSCSSGCKC 99
+ +K N Y R + E+ + C C T + G C C ML+ C C
Sbjct: 1289 FQLLKENFYRCAR-QVSQENAEMQCDCFLTGDEEAQGHLSCGAGCINRMLMIECGPLCSN 1347
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
G C NK FQ ++ +TEK G GI A+ I GEF++EYVGEVID + E R
Sbjct: 1348 GARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFIMEYVGEVIDSEEFERRQHL 1407
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
++Y + + VIDAT KGN SRYINHSC PN E QKW ++GE RIG F+ +
Sbjct: 1408 YSKDRNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKP 1467
Query: 220 IKKGENLTYDYQY-EFLHDSLIAYC 243
I+ GE +T+DYQY + D+ YC
Sbjct: 1468 IQPGEEITFDYQYLRYGRDAQRCYC 1492
>gi|73985747|ref|XP_864158.1| PREDICTED: histone-lysine N-methyltransferase SETD2 isoform 11 [Canis
lupus familiaris]
Length = 2562
Score = 144 bits (362), Expect = 4e-32, Method: Composition-based stats.
Identities = 72/195 (36%), Positives = 110/195 (56%), Gaps = 7/195 (3%)
Query: 46 FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSGCK 98
I+ N+YLT+R K + D C +P S C DC +L+ CSS C
Sbjct: 1473 LIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSRCP 1532
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R+
Sbjct: 1533 NGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKARVK 1592
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
+ ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F T+
Sbjct: 1593 EYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTK 1652
Query: 219 DIKKGENLTYDYQYE 233
+ G LT+DYQ++
Sbjct: 1653 LVPSGSELTFDYQFQ 1667
>gi|195448204|ref|XP_002071555.1| GK25076 [Drosophila willistoni]
gi|194167640|gb|EDW82541.1| GK25076 [Drosophila willistoni]
Length = 2217
Score = 144 bits (362), Expect = 5e-32, Method: Composition-based stats.
Identities = 81/202 (40%), Positives = 109/202 (53%), Gaps = 6/202 (2%)
Query: 47 IKRNIYLTKRIKRRLEDDGIFCSC--TASPGSSG--VCDRDCHCGMLLSSCSSGCKCGNS 102
+K N Y R + E+ + C C T + G C C ML+ C C G+
Sbjct: 1088 LKENYYRCAR-QVSQENAEMQCDCFLTGDEEAQGHLCCGARCINRMLMIECGPLCTNGDR 1146
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C NK FQ ++ +TEK G GI A+ I GEF++EYVGEVID + E R
Sbjct: 1147 CTNKRFQQHQCWPCRVFRTEKKGCGITAELQIPPGEFIMEYVGEVIDAEEFERRQHLYSK 1206
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
++Y + + +IDAT KGN SRYINHSC PN E QKW ++GE RIG F+ + I
Sbjct: 1207 DRNRHYYFMALRGEAIIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKTILP 1266
Query: 223 GENLTYDYQYE-FLHDSLIAYC 243
GE +T+DYQY+ + D+ YC
Sbjct: 1267 GEEITFDYQYQRYGRDAQRCYC 1288
>gi|410914004|ref|XP_003970478.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Takifugu rubripes]
Length = 1169
Score = 143 bits (361), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 77/199 (38%), Positives = 115/199 (57%), Gaps = 2/199 (1%)
Query: 38 KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG- 96
KP ++ + R I + I L + C+C AS + D +C ML+ C
Sbjct: 701 KPPPYRHIKVNRAIGKVQFITADLSEIP-RCNCKASDENPCGVDSECINRMLMYECHPQV 759
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R +++ +T CG G+ A DIK+G FVIEYVGEVID++ C R
Sbjct: 760 CAAGERCQNQAFTKREYTPVEIYRTLSCGWGLRAVSDIKKGAFVIEYVGEVIDEEECRAR 819
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + NFY+ +++D +IDA KGN++R++NH C PN E QKW ++G+TR+G+FA
Sbjct: 820 IKHAQENDIFNFYMLTLDKDRIIDAGPKGNQARFMNHCCQPNCETQKWTVNGDTRVGLFA 879
Query: 217 TRDIKKGENLTYDYQYEFL 235
+DI KG+ L ++Y E L
Sbjct: 880 LQDIPKGKELNFNYNLECL 898
>gi|348582642|ref|XP_003477085.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Cavia
porcellus]
Length = 2565
Score = 143 bits (361), Expect = 5e-32, Method: Composition-based stats.
Identities = 75/207 (36%), Positives = 115/207 (55%), Gaps = 9/207 (4%)
Query: 36 FIKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHC 86
+ K +P F I+ N+YLT+R K + D C +P S C DC
Sbjct: 1464 YAKQGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEVACGEDCLN 1523
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
+L+ CSS C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GE
Sbjct: 1524 RLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1583
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
V+D + + R+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1584 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1643
Query: 207 DGETRIGIFATRDIKKGENLTYDYQYE 233
+G+ R+G F T+ + G LT+DYQ++
Sbjct: 1644 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1670
>gi|297842509|ref|XP_002889136.1| hypothetical protein ARALYDRAFT_476894 [Arabidopsis lyrata subsp.
lyrata]
gi|297334977|gb|EFH65395.1| hypothetical protein ARALYDRAFT_476894 [Arabidopsis lyrata subsp.
lyrata]
Length = 1766
Score = 143 bits (361), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 76/190 (40%), Positives = 111/190 (58%), Gaps = 1/190 (0%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
+ IK N +L + K + D+ + C C P C +C ML C G C G+
Sbjct: 955 FKAIKTNQFLHRNRKSQTIDEIMVCHCKPPPDGRLGCGEECLNRMLNIECLQGTCPAGDL 1014
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C N+ FQ R K + Q+ K G G+ ED++ G+F+IEYVGEV+D Q+ + R +
Sbjct: 1015 CSNQQFQKRKYVKFERFQSGKKGYGLRLLEDVREGQFLIEYVGEVLDMQSYDTRQKEYAC 1074
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
G+ +FY +N + VIDA KGN R+INHSC PN +KW+++GE +GIF+ +D+KK
Sbjct: 1075 KGQKHFYFMTLNGNEVIDAGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMKDLKK 1134
Query: 223 GENLTYDYQY 232
G+ LT+DY Y
Sbjct: 1135 GQELTFDYNY 1144
>gi|195376627|ref|XP_002047094.1| GJ13235 [Drosophila virilis]
gi|194154252|gb|EDW69436.1| GJ13235 [Drosophila virilis]
Length = 2005
Score = 143 bits (361), Expect = 5e-32, Method: Composition-based stats.
Identities = 76/187 (40%), Positives = 102/187 (54%), Gaps = 5/187 (2%)
Query: 62 EDDGIFCSCTASPGSSGV----CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMK 117
E+ + C C + G+ C C ML+ C C G C NK FQ +
Sbjct: 1017 ENAEMQCDCFVTGDEEGLGQLRCGDGCINRMLMIECGPLCTYGERCTNKRFQQHQGWPCR 1076
Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
+ +TEK G GI A+ I GEF++EYVGEVID + E R ++Y + +
Sbjct: 1077 VFRTEKKGCGITAELQIPPGEFIMEYVGEVIDSEEFERRQHLYSEDRNRHYYFMALRGEA 1136
Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLH 236
+IDAT KGN SRYINHSC PN E QKW ++GE RIG F+ + I GE +T+DYQY+ +
Sbjct: 1137 IIDATTKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKTILPGEEITFDYQYQRYGR 1196
Query: 237 DSLIAYC 243
D+ YC
Sbjct: 1197 DAQRCYC 1203
>gi|301122693|ref|XP_002909073.1| histone-lysine N-methyltransferase, putative [Phytophthora
infestans T30-4]
gi|262099835|gb|EEY57887.1| histone-lysine N-methyltransferase, putative [Phytophthora
infestans T30-4]
Length = 751
Score = 143 bits (361), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 74/168 (44%), Positives = 101/168 (60%), Gaps = 4/168 (2%)
Query: 80 CDRDCHCGMLLSSCS-SGCKCGNS--CLNKPFQNRPVKKMKLVQTEK-CGAGIVADEDIK 135
C C+ ML SCS C + C N+ + R +K M++ G G++ +EDI
Sbjct: 164 CGEGCYNRMLFISCSDETCSAPDPSVCSNRAIKRRQLKSMRVEYIPGGPGFGLITNEDIN 223
Query: 136 RGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSC 195
GEFVIEYVGEVIDD+ CE R+ + GE NFY+ E+ +++VIDA Y+ N SR+INHSC
Sbjct: 224 AGEFVIEYVGEVIDDKECERRMITYRDNGEVNFYMMELEKNIVIDAKYRSNDSRFINHSC 283
Query: 196 CPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
PN+ QKW +DG RIGIFA R+I E +T DY + ++ C
Sbjct: 284 DPNSVTQKWNVDGMQRIGIFARRNIAPNEEITIDYNFSHFGEAADCRC 331
>gi|380812066|gb|AFE77908.1| histone-lysine N-methyltransferase SETD2 [Macaca mulatta]
Length = 2565
Score = 143 bits (361), Expect = 6e-32, Method: Composition-based stats.
Identities = 72/195 (36%), Positives = 110/195 (56%), Gaps = 7/195 (3%)
Query: 46 FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSGCK 98
I+ N+YLT+R K + D C +P S C DC +L+ CSS C
Sbjct: 1476 LIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSRCP 1535
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R+
Sbjct: 1536 NGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKARVK 1595
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
+ ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F T+
Sbjct: 1596 EYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTK 1655
Query: 219 DIKKGENLTYDYQYE 233
+ G LT+DYQ++
Sbjct: 1656 LVPSGSELTFDYQFQ 1670
>gi|356507632|ref|XP_003522568.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like [Glycine
max]
Length = 2081
Score = 143 bits (361), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 78/190 (41%), Positives = 109/190 (57%), Gaps = 1/190 (0%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
+ I N +L + K + D+ + C C S G C +C +L C G C CG+
Sbjct: 1154 FTNILTNEFLHRSHKTQTIDEIMVCHCKPSQGGKLGCGDECLNRILNIECVQGTCPCGDR 1213
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C N+ FQ +K + K G G+ A ED+ +G+F+IEYVGEV+D QT E R +
Sbjct: 1214 CSNQQFQKHKYASLKWFKCGKKGYGLKAIEDVAQGQFLIEYVGEVLDMQTYEARQREYAL 1273
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
G +FY +N VIDA+ KGN R+INHSC PN +KW+++GE IG+FA R++KK
Sbjct: 1274 KGHRHFYFMTLNGSEVIDASAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRNVKK 1333
Query: 223 GENLTYDYQY 232
E LT+DY Y
Sbjct: 1334 DEELTFDYNY 1343
>gi|109040979|ref|XP_001113652.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like isoform 2
[Macaca mulatta]
Length = 2550
Score = 143 bits (361), Expect = 6e-32, Method: Composition-based stats.
Identities = 72/195 (36%), Positives = 110/195 (56%), Gaps = 7/195 (3%)
Query: 46 FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSGCK 98
I+ N+YLT+R K + D C +P S C DC +L+ CSS C
Sbjct: 1461 LIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSRCP 1520
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R+
Sbjct: 1521 NGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKARVK 1580
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
+ ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F T+
Sbjct: 1581 EYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTK 1640
Query: 219 DIKKGENLTYDYQYE 233
+ G LT+DYQ++
Sbjct: 1641 LVPSGSELTFDYQFQ 1655
>gi|402860278|ref|XP_003894560.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Papio anubis]
Length = 2521
Score = 143 bits (361), Expect = 6e-32, Method: Composition-based stats.
Identities = 72/195 (36%), Positives = 110/195 (56%), Gaps = 7/195 (3%)
Query: 46 FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSGCK 98
I+ N+YLT+R K + D C +P S C DC +L+ CSS C
Sbjct: 1432 LIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSRCP 1491
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R+
Sbjct: 1492 NGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKARVK 1551
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
+ ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F T+
Sbjct: 1552 EYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTK 1611
Query: 219 DIKKGENLTYDYQYE 233
+ G LT+DYQ++
Sbjct: 1612 LVPSGSELTFDYQFQ 1626
>gi|157824020|ref|NP_001101659.1| histone-lysine N-methyltransferase SETD2 [Rattus norvegicus]
gi|149018436|gb|EDL77077.1| kinesin family member 9 (predicted) [Rattus norvegicus]
Length = 2294
Score = 143 bits (361), Expect = 6e-32, Method: Composition-based stats.
Identities = 76/207 (36%), Positives = 115/207 (55%), Gaps = 9/207 (4%)
Query: 36 FIKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHC 86
F K +P F I+ N+YLT+R K + D C +P S C DC
Sbjct: 1194 FAKLGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEVACGEDCLN 1253
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
+L+ CSS C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GE
Sbjct: 1254 RLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1313
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
V+D + + R+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1314 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1373
Query: 207 DGETRIGIFATRDIKKGENLTYDYQYE 233
+G+ R+G F T+ + G LT+DYQ++
Sbjct: 1374 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1400
>gi|15150415|gb|AAK84931.1| SD01656p [Drosophila melanogaster]
Length = 1443
Score = 143 bits (361), Expect = 6e-32, Method: Composition-based stats.
Identities = 82/205 (40%), Positives = 109/205 (53%), Gaps = 6/205 (2%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSC--TASPGSSG--VCDRDCHCGMLLSSCSSGCKC 99
+ +K N Y R + E+ + C C T + G C C ML+ C C
Sbjct: 419 FQLLKENFYRCAR-QVSQENAEMQCDCFLTGDEEAQGHLSCGAGCINRMLMIECGPLCSN 477
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
G C NK FQ ++ +TEK G GI A+ I GEF++EYVGEVID + E R
Sbjct: 478 GARCTNKRFQQHQCWPCRVFRTEKKGCGITAELLIPPGEFIMEYVGEVIDSEEFERRQHL 537
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
++Y + + VIDAT KGN SRYINHSC PN E QKW ++GE RIG F+ +
Sbjct: 538 YSKDRNRHYYFMALRGEAVIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKP 597
Query: 220 IKKGENLTYDYQY-EFLHDSLIAYC 243
I+ GE +T+DYQY + D+ YC
Sbjct: 598 IQPGEEITFDYQYLRYGRDAQRCYC 622
>gi|363729887|ref|XP_418510.3| PREDICTED: histone-lysine N-methyltransferase SETD2 [Gallus gallus]
Length = 2554
Score = 143 bits (361), Expect = 6e-32, Method: Composition-based stats.
Identities = 76/207 (36%), Positives = 117/207 (56%), Gaps = 9/207 (4%)
Query: 36 FIKPKAIPYVF--IKRNIYLTKRIKRRLEDD--GIFCSC---TASPGSSG--VCDRDCHC 86
F K +P F I+ N+YLT+R K + D + C C + + G C DC
Sbjct: 1461 FAKQGKMPCYFDLIEENVYLTERKKSKSHRDIKRMLCECPPLSKEERAQGEVACGEDCLN 1520
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
+L+ CSS C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GE
Sbjct: 1521 RLLMIECSSRCPNGDYCSNRRFQKKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1580
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
V+D + + R+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1581 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1640
Query: 207 DGETRIGIFATRDIKKGENLTYDYQYE 233
+G+ R+G F T+ + G LT+DYQ++
Sbjct: 1641 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1667
>gi|426340342|ref|XP_004034089.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Gorilla gorilla
gorilla]
Length = 2564
Score = 143 bits (361), Expect = 6e-32, Method: Composition-based stats.
Identities = 72/195 (36%), Positives = 110/195 (56%), Gaps = 7/195 (3%)
Query: 46 FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSGCK 98
I+ N+YLT+R K + D C +P S C DC +L+ CSS C
Sbjct: 1475 LIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSRCP 1534
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R+
Sbjct: 1535 NGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKARVK 1594
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
+ ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F T+
Sbjct: 1595 EYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTK 1654
Query: 219 DIKKGENLTYDYQYE 233
+ G LT+DYQ++
Sbjct: 1655 LVPSGSELTFDYQFQ 1669
>gi|115446669|ref|NP_001047114.1| Os02g0554000 [Oryza sativa Japonica Group]
gi|50725771|dbj|BAD33302.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|113536645|dbj|BAF09028.1| Os02g0554000 [Oryza sativa Japonica Group]
Length = 637
Score = 143 bits (360), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 75/190 (39%), Positives = 104/190 (54%), Gaps = 1/190 (0%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSS-GCKCGNS 102
+ ++ N++L + + + D+ + C+C C C +L C+ C CG
Sbjct: 124 FALLRSNLFLHRNRRTQSIDESMVCNCKPPHDDRMGCRDGCLNRILNIECTKRTCPCGEH 183
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C N+ FQ R K+ T K G G+ ED+ G F+IEYVGEV+D E R
Sbjct: 184 CSNQQFQRRTYAKLGKFHTGKKGYGLQLKEDVSEGRFLIEYVGEVLDITAYESRQRYYAS 243
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
G+ +FY +N VIDA KGN R+INHSC PN +KW+++GE IGIFA R+IKK
Sbjct: 244 KGQKHFYFMALNGGEVIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFAMRNIKK 303
Query: 223 GENLTYDYQY 232
GE LT+DY Y
Sbjct: 304 GEELTFDYNY 313
>gi|357163489|ref|XP_003579748.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like
[Brachypodium distachyon]
Length = 517
Score = 143 bits (360), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 80/202 (39%), Positives = 117/202 (57%), Gaps = 6/202 (2%)
Query: 43 PYVFIKRNIYLTKRIKRRLEDDGIFCSCT---ASPGSSGVCDRDCHCGMLLSSCSSG-CK 98
PY+ I+ N +L +R KR+ E+D C C P S+ C C + C+ G C+
Sbjct: 10 PYIHIETNDFLHRRHKRQKEEDIAVCECQYNLMDPDSA--CGERCWNVSTNTECTPGYCR 67
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
CG C N+ FQ + +LV+TE G G++A+E+I G+FVIEY GEVI + + R
Sbjct: 68 CGVYCKNQRFQKCQYARTRLVKTEGRGWGLLAEENIMAGQFVIEYCGEVISWKEAKRRSQ 127
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
+ G Y+ +N IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA +
Sbjct: 128 AYEDQGLMEAYIIYLNTAESIDATKKGSLARFINHSCQPNCETRKWNVLGEVRVGIFAKQ 187
Query: 219 DIKKGENLTYDYQYEFLHDSLI 240
DI G L+YDY +E+ +++
Sbjct: 188 DIPIGMELSYDYNFEWFGGAIV 209
>gi|302798461|ref|XP_002980990.1| hypothetical protein SELMODRAFT_3415 [Selaginella moellendorffii]
gi|300151044|gb|EFJ17691.1| hypothetical protein SELMODRAFT_3415 [Selaginella moellendorffii]
Length = 242
Score = 143 bits (360), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 78/201 (38%), Positives = 111/201 (55%), Gaps = 1/201 (0%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS 102
+ I+ NI+ + K +DD + C C S C DC M+ CS C CG
Sbjct: 1 WTQIRHNIFQHRHQKTLDDDDTLICLCKPPKDGSPGCGEDCLNRMVNVECSPDTCPCGER 60
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C N+ F R + LV+ K G G+ A E+I +G FVIEYVGEV+D ++ E R +
Sbjct: 61 CSNQQFGKREYSNVALVRCGKKGFGLKALENIAKGSFVIEYVGEVLDSRSFELRQKEYAR 120
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
+ +FY +N VIDA KGN R+INHSC PN + +KW ++GE IG+FA RD+ K
Sbjct: 121 QRQKHFYFMTLNSSEVIDACRKGNLGRFINHSCEPNCQTEKWCVNGEICIGLFAIRDVAK 180
Query: 223 GENLTYDYQYEFLHDSLIAYC 243
E +T++Y +E L+ + C
Sbjct: 181 NEEITFNYNFERLYGAAAKKC 201
>gi|327289513|ref|XP_003229469.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Anolis
carolinensis]
Length = 2579
Score = 143 bits (360), Expect = 7e-32, Method: Composition-based stats.
Identities = 77/207 (37%), Positives = 115/207 (55%), Gaps = 9/207 (4%)
Query: 36 FIKPKAIPYVF--IKRNIYLTKRIKRRLEDD--GIFCSC-TASPGSSG----VCDRDCHC 86
F K +P F I+ N+YLT+R K + D + C C T S C DC
Sbjct: 1481 FAKHGRMPCYFDLIEENVYLTERKKNKSHRDIKRMLCECPTLSKDERAQGEVACGEDCLN 1540
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
+L+ CSS C G C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GE
Sbjct: 1541 RLLMIECSSRCPNGEHCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1600
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
V+D + + R+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1601 VLDHKEFKTRVKEYARSKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1660
Query: 207 DGETRIGIFATRDIKKGENLTYDYQYE 233
+G+ R+G F T+ + G LT+DYQ++
Sbjct: 1661 NGQLRVGFFTTKMVPSGSELTFDYQFQ 1687
>gi|345493934|ref|XP_001600694.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Nasonia
vitripennis]
Length = 1382
Score = 143 bits (360), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 81/192 (42%), Positives = 109/192 (56%), Gaps = 2/192 (1%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CK 98
K PYV +K N + +K D + C C + D DC +L+ CS C
Sbjct: 932 KPPPYVKLKVNKPVG-NVKVPEVDSMVACDCNPNQPYPCSPDSDCLNRILMIECSPDTCP 990
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
C N+ F R MK TE+ G G+V+ E IK G+F+IEYVGEVID+ + RL
Sbjct: 991 ASTKCQNQLFVQRKYPAMKPAHTEERGWGLVSLEPIKHGQFIIEYVGEVIDEAEYKLRLQ 1050
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
+ K N+Y I+ +IDA KGN SR++NHSC PN E QKW ++G+TRIG+FA R
Sbjct: 1051 QKKERKNENYYFLTIDNSRMIDAEPKGNLSRFMNHSCQPNCETQKWKVNGDTRIGLFALR 1110
Query: 219 DIKKGENLTYDY 230
DI+ GE LT++Y
Sbjct: 1111 DIEPGEELTFNY 1122
>gi|412991390|emb|CCO16235.1| unnamed protein product [Bathycoccus prasinos]
Length = 825
Score = 143 bits (360), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 70/156 (44%), Positives = 96/156 (61%), Gaps = 3/156 (1%)
Query: 80 CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGE 138
C DC +C + C CG+ C N+PFQ+ P K+K+ TE G G+ +D+ G+
Sbjct: 461 CGHDCVNRETRYTCDTRVCPCGDDCSNRPFQHLPQPKVKVQLTENRGYGLFLQQDVFEGD 520
Query: 139 FVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPN 198
F++EY+GE++D++ C RL K E NFYL EI +IDA + GN +R+IN SC PN
Sbjct: 521 FIVEYMGEIVDEEECTRRLLACKGKNEPNFYLMEITPSQIIDARFCGNNARFINSSCHPN 580
Query: 199 TEMQKWI--IDGETRIGIFATRDIKKGENLTYDYQY 232
E Q+W+ ETR+GIFAT DIK G LTYDY +
Sbjct: 581 CETQRWVDASTNETRVGIFATEDIKSGTELTYDYNF 616
>gi|410905477|ref|XP_003966218.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Takifugu
rubripes]
Length = 1950
Score = 142 bits (359), Expect = 9e-32, Method: Composition-based stats.
Identities = 77/205 (37%), Positives = 114/205 (55%), Gaps = 9/205 (4%)
Query: 38 KPKAIPYVF--IKRNIYLTKRIKRRLEDD--GIFCSCTASPG---SSGV--CDRDCHCGM 88
K K +P F I+ N+YLT+R K + D + C C P S G C DC +
Sbjct: 837 KSKKMPPYFDLIEENLYLTERKKSKSHRDIKRMQCECPVLPREERSKGALACGEDCLNRL 896
Query: 89 LLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVI 148
L+ CSS C+ G C N+ FQ R ++ TE G G+ A +D+ FV+EY GEV+
Sbjct: 897 LMIECSSRCQNGAYCSNRRFQMRQHADFDVILTEDKGWGLRAAKDLPSNTFVLEYCGEVL 956
Query: 149 DDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDG 208
D + + R+ + ++Y + + +IDAT KGN SR++NHSC PN E QKW ++G
Sbjct: 957 DHKEFKTRVKEYARNKNIHYYFMALKNNEIIDATLKGNLSRFMNHSCEPNCETQKWTVNG 1016
Query: 209 ETRIGIFATRDIKKGENLTYDYQYE 233
+ R+G F T+ + G LT+DYQ++
Sbjct: 1017 QLRVGFFTTKAVTAGTELTFDYQFQ 1041
>gi|195171947|ref|XP_002026763.1| GL27000 [Drosophila persimilis]
gi|194111702|gb|EDW33745.1| GL27000 [Drosophila persimilis]
Length = 944
Score = 142 bits (359), Expect = 9e-32, Method: Composition-based stats.
Identities = 71/165 (43%), Positives = 97/165 (58%), Gaps = 1/165 (0%)
Query: 80 CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
C C ML+ C C G+ C NK FQ ++ +TEK G GI A+ I GEF
Sbjct: 19 CGAGCINRMLMIECGPLCTNGDRCTNKRFQLHQCWPCRVFRTEKKGCGITAELQIPAGEF 78
Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
++EYVGEVID + E R + ++Y + + +IDAT +GN SRYINHSC PN
Sbjct: 79 IMEYVGEVIDSEEFERRQHRYSKDRNRHYYFMALRGEAIIDATMRGNISRYINHSCDPNA 138
Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
E QKW ++GE RIG F+ ++I GE +T+DYQY+ + D+ YC
Sbjct: 139 ETQKWTVNGELRIGFFSLKNILPGEEITFDYQYQRYGRDAQRCYC 183
>gi|327265653|ref|XP_003217622.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Anolis carolinensis]
Length = 2106
Score = 142 bits (359), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 111/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ D C P C D C MLL C
Sbjct: 1297 KPPPYKHIKVNRPIGK-VQIFTADLSEIPRCNCKPADENPCGLDSECINRMLLYECHPAV 1355
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C GN C N+ F R ++++ +T G G+ A DIK+GEFV EYVGE+ID++ C R
Sbjct: 1356 CPAGNRCQNQCFTKRQYPEVEIFRTLARGWGLQAKRDIKKGEFVNEYVGELIDEEECRAR 1415
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ +++D +IDA KGN +R++NH C PN E QKW ++G+TR+G+FA
Sbjct: 1416 IRHAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFA 1475
Query: 217 TRDIKKGENLTYDYQYEFL 235
++K G LT++Y E L
Sbjct: 1476 ITNVKAGTELTFNYNLECL 1494
>gi|345493936|ref|XP_003427184.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2 [Nasonia
vitripennis]
Length = 1317
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 81/192 (42%), Positives = 109/192 (56%), Gaps = 2/192 (1%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CK 98
K PYV +K N + +K D + C C + D DC +L+ CS C
Sbjct: 867 KPPPYVKLKVNKPVG-NVKVPEVDSMVACDCNPNQPYPCSPDSDCLNRILMIECSPDTCP 925
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
C N+ F R MK TE+ G G+V+ E IK G+F+IEYVGEVID+ + RL
Sbjct: 926 ASTKCQNQLFVQRKYPAMKPAHTEERGWGLVSLEPIKHGQFIIEYVGEVIDEAEYKLRLQ 985
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
+ K N+Y I+ +IDA KGN SR++NHSC PN E QKW ++G+TRIG+FA R
Sbjct: 986 QKKERKNENYYFLTIDNSRMIDAEPKGNLSRFMNHSCQPNCETQKWKVNGDTRIGLFALR 1045
Query: 219 DIKKGENLTYDY 230
DI+ GE LT++Y
Sbjct: 1046 DIEPGEELTFNY 1057
>gi|52545752|emb|CAH56331.1| hypothetical protein [Homo sapiens]
Length = 881
Score = 142 bits (359), Expect = 1e-31, Method: Composition-based stats.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 80 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 139
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 140 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 199
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 200 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 248
>gi|291223879|ref|XP_002731935.1| PREDICTED: Wolf-Hirschhorn syndrome candidate 1 protein-like
[Saccoglossus kowalevskii]
Length = 1787
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 81/200 (40%), Positives = 117/200 (58%), Gaps = 4/200 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLE-DDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-C 97
K P+ FIK N + K + ++ + C C A + D DC MLL C S C
Sbjct: 1247 KPAPFKFIKTNKPVGKVVMQQCDISQCTPCECKADMKNPCGPDSDCLNRMLLIECHSQVC 1306
Query: 98 KCGNSCLNKPFQNRPVKKMKLVQTEKCGAG--IVADEDIKRGEFVIEYVGEVIDDQTCEE 155
G++C N+ FQ + +T++ G G + +DIK+G+FV EYVGE++D++TC E
Sbjct: 1307 PAGDNCQNQRFQKMQYPETIPFRTDEKGRGWGLKTTQDIKKGDFVHEYVGELVDEETCRE 1366
Query: 156 RLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIF 215
R+ K + L NFY+ I++D VIDA KGN +R++NHSC PN E KW I +TR+G+F
Sbjct: 1367 RIKKCQQLDIDNFYMLTIDKDHVIDAGPKGNLARFMNHSCDPNCETMKWTILPDTRVGLF 1426
Query: 216 ATRDIKKGENLTYDYQYEFL 235
A RDI G LT++Y + L
Sbjct: 1427 AKRDITAGSELTFNYNLDCL 1446
>gi|410923178|ref|XP_003975059.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Takifugu
rubripes]
Length = 1499
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 78/194 (40%), Positives = 109/194 (56%), Gaps = 4/194 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY FIK N + K ++ + D C P C C ML C
Sbjct: 1121 KPPPYKFIKSNKPVGK-VQMHVADLSEIQRCNCRPTDEHPCGLHSQCLNRMLQYECHPQV 1179
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G++C N+ F R + ++V+T G G+ A++ +K+GEFV EYVGEVID + C++R
Sbjct: 1180 CPAGDNCENQGFTKRLYAETEVVKTADRGWGLKANQPLKKGEFVTEYVGEVIDAEECQQR 1239
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D VIDA KGN SR+INHSC PN E QKW ++G+ IG+FA
Sbjct: 1240 IKRAHENHMTNFYMLTLTKDRVIDAAQKGNLSRFINHSCSPNCETQKWTVNGDVHIGLFA 1299
Query: 217 TRDIKKGENLTYDY 230
DI G LT++Y
Sbjct: 1300 LCDIDAGTELTFNY 1313
>gi|292621054|ref|XP_683890.4| PREDICTED: hypothetical protein LOC556086 [Danio rerio]
Length = 2055
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 80/198 (40%), Positives = 115/198 (58%), Gaps = 2/198 (1%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGI-FCSCTASPGSSGVCDRDCHCGMLLSSCSSG-C 97
K PY IK N + K + + I C+C A+ + D +C MLL C S C
Sbjct: 1549 KPPPYKHIKVNKQIGKVLIITADLSEIPRCNCKATDENPCGIDSECINRMLLYECHSQVC 1608
Query: 98 KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
G C N+ F R ++++ +T G G+ + DIK+G FV EYVGEVID++ C R+
Sbjct: 1609 PAGERCQNQSFTKRQYTEVEIFRTLSRGWGLRSISDIKKGAFVNEYVGEVIDEEECRSRI 1668
Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
+ NFY+ +++D +IDA KGN+SR++NHSC PN E QKW ++G+TR+G+FA
Sbjct: 1669 KNAQDNDICNFYMLTLDKDRIIDAGPKGNESRFMNHSCQPNCETQKWTVNGDTRVGLFAL 1728
Query: 218 RDIKKGENLTYDYQYEFL 235
DI KG LT++Y E L
Sbjct: 1729 EDIPKGVELTFNYNLECL 1746
>gi|170039780|ref|XP_001847701.1| huntingtin interacting protein [Culex quinquefasciatus]
gi|167863380|gb|EDS26763.1| huntingtin interacting protein [Culex quinquefasciatus]
Length = 2379
Score = 142 bits (358), Expect = 1e-31, Method: Composition-based stats.
Identities = 80/216 (37%), Positives = 116/216 (53%), Gaps = 10/216 (4%)
Query: 26 NPVEFELPDWFIK---PKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASP-----GSS 77
N EL D +++ K + I+ N+Y ++ + E + C C + G
Sbjct: 1236 NVTNIELSDEYLRDMEEKLSRFETIRENMYHCDKVISK-EAKKMMCDCFLTEEEKDRGEL 1294
Query: 78 GVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRG 137
G C DC +L+ C S C G C NK FQ ++ +TEK G GI A +I G
Sbjct: 1295 G-CGEDCLNKLLMIECGSRCVVGERCTNKRFQKLEYANTQVFRTEKKGFGIQASTEIAPG 1353
Query: 138 EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
EF++EYVGEV++ + +ER ++Y + D +IDAT KGN SR+INHSC
Sbjct: 1354 EFIMEYVGEVLNSEQFDERAELYSKEKNKHYYFMALRSDAIIDATTKGNISRFINHSCDA 1413
Query: 198 NTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
N E QKW ++GE RIG F+T+ I GE +T+DYQ++
Sbjct: 1414 NAETQKWTVNGELRIGFFSTKYILPGEEVTFDYQFQ 1449
>gi|444706655|gb|ELW47981.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Tupaia chinensis]
Length = 2687
Score = 142 bits (358), Expect = 1e-31, Method: Composition-based stats.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1887 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1946
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1947 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2006
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 2007 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2055
>gi|76666643|ref|XP_613048.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Bos taurus]
gi|297476142|ref|XP_002688498.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Bos
taurus]
gi|296486298|tpg|DAA28411.1| TPA: Wolf-Hirschhorn syndrome candidate 1 [Bos taurus]
Length = 1365
Score = 142 bits (357), Expect = 1e-31, Method: Composition-based stats.
Identities = 79/196 (40%), Positives = 110/196 (56%), Gaps = 4/196 (2%)
Query: 43 PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG-CKC 99
PY IK N K ++ D C P C D C ML+ C C
Sbjct: 990 PYKHIKVNKPYGK-VQVHTADISEIPKCNCKPTDENPCGSDSQCLNRMLMFECHPQVCPA 1048
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
G SC N+ F R + K+V+T+ G G+VA DI++GEFV EYVGE+ID++ C R+ +
Sbjct: 1049 GESCQNQCFTKRQYPETKIVRTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMARIKR 1108
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA D
Sbjct: 1109 AHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCD 1168
Query: 220 IKKGENLTYDYQYEFL 235
I G LT++Y + L
Sbjct: 1169 IPAGTELTFNYNLDCL 1184
>gi|432873648|ref|XP_004072321.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Oryzias
latipes]
Length = 1597
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 77/194 (39%), Positives = 111/194 (57%), Gaps = 4/194 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ + D C P C D C ML C
Sbjct: 1222 KPPPYKVIKSNKPVGK-VQMHVADLSEIPRCNCKPVDEHPCSIDSQCLNRMLQYECHPQV 1280
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G++C N+ F R + ++V+TE G G+ ++ +++G+FV EYVGEVID + C++R
Sbjct: 1281 CPAGDNCENQNFTKRLYAETEVVKTEGRGWGLQTNQALRKGDFVAEYVGEVIDSEECQQR 1340
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D VIDA KGN +R+INHSC PN E QKW ++G+ RIGIFA
Sbjct: 1341 IKRAHENHVTNFYMLTLTKDRVIDAGPKGNSARFINHSCNPNCETQKWTVNGDVRIGIFA 1400
Query: 217 TRDIKKGENLTYDY 230
DI+ G LT++Y
Sbjct: 1401 LCDIEAGTELTFNY 1414
>gi|326921432|ref|XP_003206963.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SETD2-like [Meleagris gallopavo]
Length = 2147
Score = 142 bits (357), Expect = 2e-31, Method: Composition-based stats.
Identities = 76/207 (36%), Positives = 117/207 (56%), Gaps = 9/207 (4%)
Query: 36 FIKPKAIPYVF--IKRNIYLTKRIKRRLEDD--GIFCSC---TASPGSSG--VCDRDCHC 86
F K +P F I+ N+YLT+R K + D + C C + + G C DC
Sbjct: 1268 FAKQGKMPCYFDLIEENVYLTERKKSKSHRDIKRMLCECPPLSKEERAQGEVACGEDCLN 1327
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
+L+ CSS C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GE
Sbjct: 1328 RLLMIECSSRCPNGDFCSNRRFQKKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1387
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
V+D + + R+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1388 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1447
Query: 207 DGETRIGIFATRDIKKGENLTYDYQYE 233
+G+ R+G F T+ + G LT+DYQ++
Sbjct: 1448 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1474
>gi|384253874|gb|EIE27348.1| hypothetical protein COCSUDRAFT_55364 [Coccomyxa subellipsoidea
C-169]
Length = 1591
Score = 142 bits (357), Expect = 2e-31, Method: Composition-based stats.
Identities = 71/163 (43%), Positives = 94/163 (57%), Gaps = 4/163 (2%)
Query: 74 PGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADE 132
P +G C +C + C C CG+ C N+PF P K +L TE G G+ A E
Sbjct: 477 PERTG-CGENCLNRLSYIHCDPKQCPCGDYCSNRPFHLLPQPKTELFLTEDRGWGVKATE 535
Query: 133 DIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYIN 192
I RG F++EY GEVI++ C R+ + K G +FY+ E+ ++IDA KGN +R+IN
Sbjct: 536 HIPRGTFIVEYAGEVIEEHECRRRMAQAKVTGLQHFYMMELAPGLIIDARVKGNMARFIN 595
Query: 193 HSCCPNTEMQKW--IIDGETRIGIFATRDIKKGENLTYDYQYE 233
SC PN E QKW GE RIGIFA DI+ G L YDYQ++
Sbjct: 596 SSCAPNCESQKWHDAATGEIRIGIFAADDIEPGTELAYDYQFQ 638
>gi|413937237|gb|AFW71788.1| hypothetical protein ZEAMMB73_686749 [Zea mays]
Length = 1815
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 76/194 (39%), Positives = 105/194 (54%), Gaps = 1/194 (0%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC-SSGCK 98
K + ++ N++L + + + D+ + C+C C C ML C C
Sbjct: 993 KPPSWTLVRSNLFLHRNRRTQSIDESMVCNCKPPQDGRMGCRDGCLNRMLNIECVKRTCP 1052
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
CG C N+ FQ R K++ + K G G+ ED+ G F+IEYVGEV+D + E R
Sbjct: 1053 CGEQCSNQKFQRRSYAKLRWFHSGKKGYGLQLQEDVTEGRFLIEYVGEVLDITSYESRQR 1112
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
G+ +FY +N VIDA KGN R+INHSC PN +KW+++GE IGIFA R
Sbjct: 1113 YYACKGQKHFYFMALNGGEVIDACTKGNLGRFINHSCSPNCCTEKWMVNGEVCIGIFALR 1172
Query: 219 DIKKGENLTYDYQY 232
IKKGE LT+DY Y
Sbjct: 1173 SIKKGEELTFDYNY 1186
>gi|241998002|ref|XP_002433644.1| set domain protein, putative [Ixodes scapularis]
gi|215495403|gb|EEC05044.1| set domain protein, putative [Ixodes scapularis]
Length = 729
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 75/196 (38%), Positives = 112/196 (57%), Gaps = 6/196 (3%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGI-FCSCTASPGSSGVCDRDCHCGMLLSSCSSG-C 97
K P+ I+ N + + L+ + C CTA C DC +LL C C
Sbjct: 500 KPAPFRMIRVNRPVGQVAVPALDLRTVSVCVCTAQ----DPCQADCLNRLLLYECRPDLC 555
Query: 98 KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
G C N+ F R ++ +++ E G G+ D+ + G+FV+EYVGE+I++Q CE RL
Sbjct: 556 PAGEHCKNQHFLRREYAQVTVIRAEGRGWGLRTDQALTAGDFVMEYVGEIINEQECERRL 615
Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
++ +NFY ++RD +IDA +GN SR++NHSC PN E QKW ++G+TR+GIFA
Sbjct: 616 SRLHLEHSSNFYFLTLDRDRIIDAGPRGNLSRFMNHSCDPNCETQKWTVNGDTRVGIFAI 675
Query: 218 RDIKKGENLTYDYQYE 233
RDI G LT++Y +
Sbjct: 676 RDIAPGTELTFNYNLD 691
>gi|348535504|ref|XP_003455240.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Oreochromis niloticus]
Length = 2122
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 73/197 (37%), Positives = 114/197 (57%), Gaps = 2/197 (1%)
Query: 38 KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG- 96
KP ++ + R I + I L + C+C A + D +C ML+ C
Sbjct: 1570 KPPPYRHIRVNRPIGKVQIITADL-SEVPRCNCKALDENPCGIDSECINRMLMYECHPQV 1628
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R +++ +T CG G+ DI++G F+ EYVGEVID++ C R
Sbjct: 1629 CAAGDRCQNQSFTKRQYTPVEIFRTLSCGWGLRGLSDIRKGAFISEYVGEVIDEEECRAR 1688
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + NFY+ +++D +IDA KGN++R++NHSC PN E QKW ++G+TR+G+FA
Sbjct: 1689 IRHAQENDICNFYMLTLDKDRIIDAGPKGNQARFMNHSCQPNCETQKWTVNGDTRVGLFA 1748
Query: 217 TRDIKKGENLTYDYQYE 233
+D+ KGE LT++Y E
Sbjct: 1749 LQDVPKGEELTFNYNLE 1765
>gi|432901504|ref|XP_004076868.1| PREDICTED: uncharacterized protein LOC101161079 [Oryzias latipes]
Length = 2214
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 78/198 (39%), Positives = 115/198 (58%), Gaps = 2/198 (1%)
Query: 40 KAIPYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-C 97
K PY IK N + K +I + C+C A+ S D +C MLL C+ C
Sbjct: 1749 KPPPYKHIKVNRPIGKVQIFTADLSEVPRCNCKATDESPCGSDSECINRMLLYECNPQVC 1808
Query: 98 KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
G CLN+ F R ++++ +T G G+ DIK+G+FV EYVGEVID++ C R+
Sbjct: 1809 PAGEKCLNQAFTKRQYSQVEIFRTLSRGWGLRCVHDIKKGQFVSEYVGEVIDEEECRSRI 1868
Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
+ NFY+ +++D +IDA KGN++R++NH C PN E QKW ++G+TR+G+FA
Sbjct: 1869 RHAQENDICNFYMLTLDKDRIIDAGPKGNEARFMNHCCQPNCETQKWTVNGDTRVGLFAL 1928
Query: 218 RDIKKGENLTYDYQYEFL 235
D+K G LT++Y E L
Sbjct: 1929 VDVKAGTELTFNYNLECL 1946
>gi|410898830|ref|XP_003962900.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Takifugu rubripes]
Length = 1329
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 79/207 (38%), Positives = 113/207 (54%), Gaps = 4/207 (1%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY FIK N + K ++ D C P C + +C ML C
Sbjct: 947 KPPPYKFIKVNKPVGK-VQVYAADVSEIPKCNCKPSDERPCGFESECLNRMLQYECHPQV 1005
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R K+++T G G++ DIK+GEFV EY+GE+ID++ C R
Sbjct: 1006 CPSGERCGNQDFTQRLYPDTKIIKTPGKGWGLITLRDIKKGEFVNEYIGELIDEEECRAR 1065
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ I++D +IDA KGN SR++NHSC PN E QKW ++G+TR+G+FA
Sbjct: 1066 IKYAQENNVTNFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFA 1125
Query: 217 TRDIKKGENLTYDYQYEFLHDSLIAYC 243
D+ G LT++Y + L + A C
Sbjct: 1126 ICDVPAGTELTFNYNLDCLGNEKTACC 1152
>gi|426232375|ref|XP_004010202.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Ovis aries]
Length = 1273
Score = 141 bits (356), Expect = 2e-31, Method: Composition-based stats.
Identities = 78/196 (39%), Positives = 110/196 (56%), Gaps = 4/196 (2%)
Query: 43 PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG-CKC 99
PY IK N K ++ D C P C + C ML+ C C
Sbjct: 936 PYKHIKVNKPYGK-VQVHTADISEIPKCNCKPTDENPCGSESQCLNRMLMFECHPQVCPA 994
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
G SC N+ F R + K+V+T+ G G+VA DI++GEFV EYVGE+ID++ C R+ +
Sbjct: 995 GESCQNQCFTKRQYPETKIVRTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMARIKR 1054
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA D
Sbjct: 1055 AHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCD 1114
Query: 220 IKKGENLTYDYQYEFL 235
I G LT++Y + L
Sbjct: 1115 IPAGTELTFNYNLDCL 1130
>gi|47226564|emb|CAG08580.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1404
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 79/194 (40%), Positives = 110/194 (56%), Gaps = 4/194 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY FIK N + K ++ + D C P C C ML C
Sbjct: 1059 KPPPYKFIKSNKPVGK-VQMHIADLSEVQRCNCRPTDEHPCGLQSQCLNRMLQYECHPQV 1117
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G++C N+ F R + ++V+T G G+ A++ IK+GEFVIEYVGEVID + C++R
Sbjct: 1118 CPAGDNCENQCFTKRLYAETEVVKTADRGWGLKANQPIKKGEFVIEYVGEVIDAEECQQR 1177
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D VIDA KGN SR+INHSC PN E QKW ++G+ IG+FA
Sbjct: 1178 IKRAHENHMTNFYMLTLTKDRVIDAGQKGNLSRFINHSCSPNCETQKWTVNGDVHIGLFA 1237
Query: 217 TRDIKKGENLTYDY 230
DI+ LT++Y
Sbjct: 1238 LCDIETDTELTFNY 1251
>gi|413937236|gb|AFW71787.1| hypothetical protein ZEAMMB73_686749 [Zea mays]
Length = 1756
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 76/194 (39%), Positives = 105/194 (54%), Gaps = 1/194 (0%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC-SSGCK 98
K + ++ N++L + + + D+ + C+C C C ML C C
Sbjct: 934 KPPSWTLVRSNLFLHRNRRTQSIDESMVCNCKPPQDGRMGCRDGCLNRMLNIECVKRTCP 993
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
CG C N+ FQ R K++ + K G G+ ED+ G F+IEYVGEV+D + E R
Sbjct: 994 CGEQCSNQKFQRRSYAKLRWFHSGKKGYGLQLQEDVTEGRFLIEYVGEVLDITSYESRQR 1053
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
G+ +FY +N VIDA KGN R+INHSC PN +KW+++GE IGIFA R
Sbjct: 1054 YYACKGQKHFYFMALNGGEVIDACTKGNLGRFINHSCSPNCCTEKWMVNGEVCIGIFALR 1113
Query: 219 DIKKGENLTYDYQY 232
IKKGE LT+DY Y
Sbjct: 1114 SIKKGEELTFDYNY 1127
>gi|426229361|ref|XP_004008759.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Ovis aries]
Length = 2698
Score = 141 bits (355), Expect = 2e-31, Method: Composition-based stats.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1898 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 1957
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1958 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2017
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 2018 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2066
>gi|440898362|gb|ELR49876.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Bos grunniens mutus]
Length = 2698
Score = 141 bits (355), Expect = 2e-31, Method: Composition-based stats.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1898 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 1957
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1958 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2017
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 2018 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2066
>gi|10438794|dbj|BAB15346.1| unnamed protein product [Homo sapiens]
Length = 1069
Score = 141 bits (355), Expect = 2e-31, Method: Composition-based stats.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 268 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 327
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 328 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 387
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 388 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 436
>gi|291227185|ref|XP_002733567.1| PREDICTED: HSPC069-like [Saccoglossus kowalevskii]
Length = 2376
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 79/196 (40%), Positives = 110/196 (56%), Gaps = 6/196 (3%)
Query: 44 YVFIKRNIYLTKR--IKRRLEDDGIFCSCT--ASPGSSGV--CDRDCHCGMLLSSCSSGC 97
Y I N+YLT+R K R E + C C+ A G+ C DC +L+ C+S C
Sbjct: 1089 YEDIAENVYLTERKKSKARKEIKRMQCDCSTCAEDRDMGILACGDDCLNRLLMIECTSRC 1148
Query: 98 KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
CG+ C N+ FQ R K+++ +T G G+ +I G+FV+EYVGEV++ + R
Sbjct: 1149 PCGDYCTNRSFQRRENAKVEIFKTPWKGFGLRTCAEIPEGKFVLEYVGEVLNYSEFKSRT 1208
Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
++Y + D +IDAT KGN SR+INHSC PN E QKW ++G R+G F
Sbjct: 1209 KHYNKDNRKHYYFMALTSDEIIDATKKGNVSRFINHSCDPNCETQKWTVNGHIRVGFFTK 1268
Query: 218 RDIKKGENLTYDYQYE 233
R I GE LT+DYQ+E
Sbjct: 1269 RAIPAGEELTFDYQFE 1284
>gi|119895257|ref|XP_592234.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Bos taurus]
Length = 2389
Score = 141 bits (355), Expect = 3e-31, Method: Composition-based stats.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1589 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 1648
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1649 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1708
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 1709 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1757
>gi|157112020|ref|XP_001657377.1| huntingtin interacting protein [Aedes aegypti]
gi|108878208|gb|EAT42433.1| AAEL006013-PA [Aedes aegypti]
Length = 2367
Score = 141 bits (355), Expect = 3e-31, Method: Composition-based stats.
Identities = 75/192 (39%), Positives = 107/192 (55%), Gaps = 7/192 (3%)
Query: 47 IKRNIYLTKRIKRRLEDDGIFCSCTASP-----GSSGVCDRDCHCGMLLSSCSSGCKCGN 101
I+ N+Y ++ + E + C C + G G C DC +L+ C S C G
Sbjct: 1202 IRENMYHCDKVISK-EAKKMNCDCFLTTEEIDRGELG-CGEDCLNRLLMIECGSRCTIGE 1259
Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
C NK FQ ++ +TEK G GI A +I G+F++EYVGEV++ + +ER
Sbjct: 1260 RCTNKRFQKLEYANCQVFRTEKKGFGIQASTEIVPGDFIMEYVGEVLNSEQFDERAELYS 1319
Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
++Y + D +IDAT KGN SR+INHSC PN E QKW ++GE RIG F T+ I
Sbjct: 1320 KEKNQHYYFMALRSDAIIDATTKGNISRFINHSCDPNAETQKWTVNGELRIGFFCTKYIM 1379
Query: 222 KGENLTYDYQYE 233
GE +T+DYQ++
Sbjct: 1380 PGEEITFDYQFQ 1391
>gi|149039889|gb|EDL94005.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
CRA_b [Rattus norvegicus]
Length = 2586
Score = 141 bits (355), Expect = 3e-31, Method: Composition-based stats.
Identities = 70/169 (41%), Positives = 103/169 (60%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R +++ +T + G
Sbjct: 1790 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPDVEIFRTLQRGW 1849
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1850 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1909
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 1910 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1958
>gi|354471955|ref|XP_003498206.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Cricetulus griseus]
Length = 2690
Score = 141 bits (355), Expect = 3e-31, Method: Composition-based stats.
Identities = 70/169 (41%), Positives = 103/169 (60%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R +++ +T + G
Sbjct: 1898 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPDVEIFRTLQRGW 1957
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1958 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2017
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 2018 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2066
>gi|221058949|ref|XP_002260120.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|193810193|emb|CAQ41387.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 2872
Score = 140 bits (354), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 82/216 (37%), Positives = 119/216 (55%), Gaps = 21/216 (9%)
Query: 42 IPYVFIKRNIYLTKRIKRRL--EDDGIFCSCTASPGSSGVCD-RDCHCGMLLSSCSSGCK 98
+ + +I +N+YL + K L + D C C G CD C+ + CS +
Sbjct: 2371 LNFEYISKNVYLNDQNKNLLACKSDDYRCLC------QGECDPYSCYNSLSKIQCSKN-R 2423
Query: 99 CG-------NSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQ 151
C C N+PF+ +K +++ QTE+ G G+ + DIK GE + EYVGEV+ +
Sbjct: 2424 CNLPIQIQDKKCFNRPFRQSAIKDLEIRQTERTGYGVFSKRDIKNGELICEYVGEVLAKK 2483
Query: 152 TCEERLW----KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIID 207
EER+ + K N+Y +INRD+ ID+ KGN SR++NHSC PN+ QKWI+
Sbjct: 2484 EFEERVEAYQEESKKTNMYNWYSIQINRDVHIDSRKKGNISRFVNHSCSPNSVSQKWIVR 2543
Query: 208 GETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
G RIGIFA RDI GE +TY+Y Y F+ ++ C
Sbjct: 2544 GFYRIGIFAQRDIPAGEEITYNYSYNFVFNNFECLC 2579
>gi|344240382|gb|EGV96485.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Cricetulus griseus]
Length = 2318
Score = 140 bits (354), Expect = 3e-31, Method: Composition-based stats.
Identities = 70/169 (41%), Positives = 103/169 (60%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R +++ +T + G
Sbjct: 1526 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPDVEIFRTLQRGW 1585
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1586 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1645
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 1646 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1694
>gi|432114829|gb|ELK36567.1| Putative histone-lysine N-methyltransferase NSD2 [Myotis davidii]
Length = 1037
Score = 140 bits (354), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 79/199 (39%), Positives = 111/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N K ++ D C P C D C ML+ C
Sbjct: 601 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDDSPCGLDSECLNRMLMFECHPQV 659
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G +C N+ F R + K+V+T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 660 CPAGEACQNQSFSKRQYPETKIVRTDGKGWGLVATRDIRKGEFVNEYVGELIDEEECMAR 719
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 720 IKHAQENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 779
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 780 VCDIPAGTELTFNYNLDCL 798
>gi|348675982|gb|EGZ15800.1| hypothetical protein PHYSODRAFT_263017 [Phytophthora sojae]
Length = 823
Score = 140 bits (354), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 72/167 (43%), Positives = 100/167 (59%), Gaps = 3/167 (1%)
Query: 80 CDRDCHCGMLLSSCS-SGCKCGN--SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKR 136
C C+ ML SCS C + C N+ + R +K +++ G G+VA+E I
Sbjct: 198 CGEGCYNRMLFISCSDETCSAPDLSMCSNRAIKRRELKSVRVEYIPGPGFGLVANEKINA 257
Query: 137 GEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCC 196
GEF+IEYVGEVIDD CE R+ + + GE NFY+ E+ +++VIDA Y+ N SR+INH C
Sbjct: 258 GEFIIEYVGEVIDDIECERRMIQYRDNGEVNFYMMELEKNIVIDAKYRSNDSRFINHCCD 317
Query: 197 PNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
PN+ QKW +DG RIGIFA R+I E +T DY + ++ C
Sbjct: 318 PNSVTQKWNVDGMQRIGIFARRNIAPDEEITIDYNFSHFGEAADCKC 364
>gi|291387888|ref|XP_002710468.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 1
[Oryctolagus cuniculus]
Length = 2700
Score = 140 bits (354), Expect = 4e-31, Method: Composition-based stats.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1898 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 1957
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1958 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2017
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 2018 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2066
>gi|383864320|ref|XP_003707627.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Megachile rotundata]
Length = 1302
Score = 140 bits (354), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 82/192 (42%), Positives = 109/192 (56%), Gaps = 2/192 (1%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CK 98
K PYV +K N + +K + + C C A + DC +LL CS G C
Sbjct: 834 KPPPYVKLKVNKPVGN-VKPAEVESIVACDCDAEWNNPCAPGTDCLNRILLVECSPGICP 892
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
G C N+ F R M+ T G G+ + E IK G+FVIEYVGEVID+ + RL
Sbjct: 893 AGPKCNNQAFVRRQYPAMEPFHTVARGWGLRSLEFIKAGQFVIEYVGEVIDEAEYKRRLH 952
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
+ K L NFY I+ + +IDA KGN SR++NHSC PN E QKW ++G+TRIG+FA
Sbjct: 953 RKKELKNENFYFLTIDNNRMIDAEPKGNLSRFMNHSCSPNCETQKWTVNGDTRIGLFALC 1012
Query: 219 DIKKGENLTYDY 230
DI+ GE LT++Y
Sbjct: 1013 DIEPGEELTFNY 1024
>gi|359067302|ref|XP_002689078.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Bos taurus]
Length = 1470
Score = 140 bits (354), Expect = 4e-31, Method: Composition-based stats.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 670 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 729
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 730 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 789
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 790 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 838
>gi|332216412|ref|XP_003257344.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Nomascus
leucogenys]
Length = 2499
Score = 140 bits (354), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 1458 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1517
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R
Sbjct: 1518 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1577
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 1578 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1637
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 1638 TKLVPSGSELTFDYQFQ 1654
>gi|296225059|ref|XP_002758501.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Callithrix
jacchus]
Length = 2510
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 1419 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1478
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R
Sbjct: 1479 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1538
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 1539 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1598
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 1599 TKLVPSGSELTFDYQFQ 1615
>gi|410220670|gb|JAA07554.1| SET domain containing 2 [Pan troglodytes]
gi|410261336|gb|JAA18634.1| SET domain containing 2 [Pan troglodytes]
gi|410295964|gb|JAA26582.1| SET domain containing 2 [Pan troglodytes]
gi|410339683|gb|JAA38788.1| SET domain containing 2 [Pan troglodytes]
Length = 2564
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 1473 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1532
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R
Sbjct: 1533 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1592
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 1593 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1652
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 1653 TKLVPSGSELTFDYQFQ 1669
>gi|297671474|ref|XP_002813857.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Pongo abelii]
Length = 2563
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 1472 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1531
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R
Sbjct: 1532 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1591
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 1592 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1651
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 1652 TKLVPSGSELTFDYQFQ 1668
>gi|397495290|ref|XP_003818492.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Pan paniscus]
Length = 2564
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 1473 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1532
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R
Sbjct: 1533 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1592
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 1593 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1652
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 1653 TKLVPSGSELTFDYQFQ 1669
>gi|114586572|ref|XP_516423.2| PREDICTED: histone-lysine N-methyltransferase SETD2 isoform 3 [Pan
troglodytes]
Length = 2549
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 1458 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1517
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R
Sbjct: 1518 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1577
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 1578 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1637
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 1638 TKLVPSGSELTFDYQFQ 1654
>gi|195130337|ref|XP_002009608.1| GI15146 [Drosophila mojavensis]
gi|193908058|gb|EDW06925.1| GI15146 [Drosophila mojavensis]
Length = 1885
Score = 140 bits (353), Expect = 4e-31, Method: Composition-based stats.
Identities = 71/165 (43%), Positives = 96/165 (58%), Gaps = 1/165 (0%)
Query: 80 CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
C C ML+ C C G+ C NK FQ ++ +T+K G GI A+ I GEF
Sbjct: 948 CGAGCINRMLMIECGPLCTYGDRCTNKRFQQHQGWPCRVFRTKKKGCGITAEMLIPPGEF 1007
Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
++EYVGEVID + E R + ++Y + + +IDAT KGN SRYINHSC PN
Sbjct: 1008 IMEYVGEVIDSEEFERRQHHYSQIRNRHYYFMALRGEAIIDATVKGNISRYINHSCDPNA 1067
Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
E QKW ++GE RIG F+ + I GE +T+DYQY+ + D+ YC
Sbjct: 1068 ETQKWTVNGELRIGFFSVKTILPGEEITFDYQYQRYGRDAQRCYC 1112
>gi|148677064|gb|EDL09011.1| mCG15806 [Mus musculus]
Length = 2034
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 75/207 (36%), Positives = 115/207 (55%), Gaps = 9/207 (4%)
Query: 36 FIKPKAIP--YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHC 86
F K +P + I+ N+YLT+R K + D C +P S C DC
Sbjct: 934 FAKLGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEVACGEDCLN 993
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
+L+ CSS C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GE
Sbjct: 994 RLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1053
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
V+D + + R+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1054 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1113
Query: 207 DGETRIGIFATRDIKKGENLTYDYQYE 233
+G+ R+G F T+ + G LT+DYQ++
Sbjct: 1114 NGQLRVGFFTTKLVPSGSELTFDYQFQ 1140
>gi|405966105|gb|EKC31425.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Crassostrea gigas]
Length = 1079
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 113/199 (56%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSC-SSG 96
K P+ F+K NI + + + + I C P S C D DC ML+ C S
Sbjct: 635 KPAPFKFVKNNIPVGSVVIPKPDLTSIP-RCECDPNSEAPCSSDTDCLNRMLMYECHPSV 693
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C+CG C N+ FQ R +TE G G+ + DIK+G+FV EYVGE+ID++ + R
Sbjct: 694 CQCGEKCHNQRFQRREYPDCTPFKTEGRGWGLRTNVDIKKGQFVHEYVGELIDEEEVKRR 753
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + +N+Y+ ++++ VIDA KGN SR++NHSC PN E QKW +G+ R+G+FA
Sbjct: 754 IDESHENNISNYYMLTLDKNRVIDAGPKGNLSRFMNHSCAPNCETQKWTANGDVRVGLFA 813
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y E L
Sbjct: 814 IYDIPAGTELTFNYNLECL 832
>gi|403268536|ref|XP_003926329.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Saimiri
boliviensis boliviensis]
Length = 2057
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 966 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1025
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R
Sbjct: 1026 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1085
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 1086 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1145
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 1146 TKLVPSGSELTFDYQFQ 1162
>gi|119585214|gb|EAW64810.1| SET domain containing 2, isoform CRA_f [Homo sapiens]
Length = 2342
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 1251 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1310
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R
Sbjct: 1311 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1370
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 1371 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1430
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 1431 TKLVPSGSELTFDYQFQ 1447
>gi|20521978|dbj|BAB21823.2| KIAA1732 protein [Homo sapiens]
Length = 1915
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 824 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 883
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R
Sbjct: 884 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 943
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 944 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1003
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 1004 TKLVPSGSELTFDYQFQ 1020
>gi|128485462|ref|NP_001076020.1| probable histone-lysine N-methyltransferase NSD2 [Danio rerio]
Length = 1461
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 77/199 (38%), Positives = 112/199 (56%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K P+ +IK N R++ D C P + C + C MLL C
Sbjct: 1078 KPPPFKYIKVN-KPCGRVQVYTADISEIPKCNCKPSTERPCSFESECLNRMLLYECHPQV 1136
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T G G+++ DIK+GEFV EYVGE+ID++ C R
Sbjct: 1137 CPAGERCQNQDFTKRLYPETKIIRTAGKGWGLISLRDIKKGEFVNEYVGELIDEEECRSR 1196
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + T+FY+ I++D +IDA KGN SR++NHSC PN E QKW ++G+TR+G+FA
Sbjct: 1197 IRHAQENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFA 1256
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1257 VCDIPAGTELTFNYNLDCL 1275
>gi|338714932|ref|XP_001495700.3| PREDICTED: histone-lysine N-methyltransferase SETD2 [Equus caballus]
Length = 2064
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 973 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1032
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R
Sbjct: 1033 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1092
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 1093 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1152
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 1153 TKLVPSGSELTFDYQFQ 1169
>gi|296485540|tpg|DAA27655.1| TPA: nuclear receptor binding SET domain protein 1 [Bos taurus]
Length = 1275
Score = 140 bits (353), Expect = 5e-31, Method: Composition-based stats.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 473 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 532
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 533 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 592
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 593 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 641
>gi|357619110|gb|EHJ71815.1| putative huntingtin interacting protein [Danaus plexippus]
Length = 225
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 78/194 (40%), Positives = 106/194 (54%), Gaps = 5/194 (2%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSC--TASPGSSG--VCDRDCHCGMLLSSCSSGCKC 99
Y + N YL +R+ R E + C C T G C DC +L+ C+S C
Sbjct: 4 YQHLDENEYLCERMVSR-ETKKMICDCFMTKEELERGELACGEDCLNRLLMIECNSRCPV 62
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
G C N+ FQ + +K+ +K G G+ A DI GEF++EYVGEV+D +R
Sbjct: 63 GERCTNRRFQKKENGPLKVFYADKKGCGVEATTDITNGEFLMEYVGEVLDYDQFYKRAQA 122
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
+ Y + D VIDAT KGN SR+INHSC PN E QKW ++GE RIG F+ R+
Sbjct: 123 YSDDNNLHHYFMSLKGDTVIDATLKGNISRFINHSCEPNAETQKWTVNGELRIGFFSKRE 182
Query: 220 IKKGENLTYDYQYE 233
I GE +T+DYQ++
Sbjct: 183 ISAGEEITFDYQFQ 196
>gi|348534024|ref|XP_003454503.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Oreochromis
niloticus]
Length = 2253
Score = 140 bits (353), Expect = 5e-31, Method: Composition-based stats.
Identities = 76/207 (36%), Positives = 116/207 (56%), Gaps = 9/207 (4%)
Query: 36 FIKPKAIP--YVFIKRNIYLTKRIKRRLEDD--GIFCSCTASPG---SSGV--CDRDCHC 86
F K K +P + I+ N+YLT+R K + D + C C P + GV C DC
Sbjct: 968 FAKSKKMPPYFYLIEDNMYLTERKKSKSHRDIKRMQCECPVLPREDRARGVLACGDDCLN 1027
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
+L+ CSS C G C N+ FQ + +++ TE G G+ A +D+ FV+EY GE
Sbjct: 1028 RLLMIECSSRCLNGAYCSNRRFQMKQHADFEVILTEDKGWGLRAAKDLAPNTFVLEYCGE 1087
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
V+D + + R+ + ++Y + + +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 1088 VLDHKEFKTRVKEYARNKNIHYYFMSLKNNEIIDATQKGNCSRFMNHSCEPNCETQKWTV 1147
Query: 207 DGETRIGIFATRDIKKGENLTYDYQYE 233
+G+ R+G F T+ + G LT+DYQ++
Sbjct: 1148 NGQLRVGFFTTKAVTAGTELTFDYQFQ 1174
>gi|60688116|gb|AAH90954.1| SETD2 protein [Homo sapiens]
Length = 1845
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 754 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 813
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R
Sbjct: 814 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 873
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 874 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 933
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 934 TKLVPSGSELTFDYQFQ 950
>gi|213624868|gb|AAI71696.1| Wolf-Hirschhorn syndrome candidate 1 [Danio rerio]
Length = 1461
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 77/199 (38%), Positives = 112/199 (56%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K P+ +IK N R++ D C P + C + C MLL C
Sbjct: 1078 KPPPFKYIKVN-KPCGRVQVYTADISEIPKCNCKPSTERPCSFESECLNRMLLYECHPQV 1136
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T G G+++ DIK+GEFV EYVGE+ID++ C R
Sbjct: 1137 CPAGERCQNQDFTKRLYPETKIIRTAGKGWGLISLRDIKKGEFVNEYVGELIDEEECRSR 1196
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + T+FY+ I++D +IDA KGN SR++NHSC PN E QKW ++G+TR+G+FA
Sbjct: 1197 IRHAQENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFA 1256
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1257 VCDIPAGTELTFNYNLDCL 1275
>gi|109658484|gb|AAI17163.1| SET domain containing 2 [Homo sapiens]
gi|109658962|gb|AAI17165.1| SET domain containing 2 [Homo sapiens]
Length = 2061
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 970 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1029
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R
Sbjct: 1030 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1089
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 1090 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1149
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 1150 TKLVPSGSELTFDYQFQ 1166
>gi|12697196|emb|CAC28349.1| huntingtin interacting protein 1 [Homo sapiens]
gi|50512435|gb|AAT77612.1| HSPC069 isoform a [Homo sapiens]
Length = 2061
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 970 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1029
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R
Sbjct: 1030 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1089
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 1090 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1149
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 1150 TKLVPSGSELTFDYQFQ 1166
>gi|340378403|ref|XP_003387717.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like
[Amphimedon queenslandica]
Length = 862
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 77/204 (37%), Positives = 110/204 (53%), Gaps = 11/204 (5%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDG-----IFCSCTASPGS---SGVCDRDCHCGMLLSSCSS 95
+ FI N+Y+ +R R DG + C+C P + S C +C +L+ C S
Sbjct: 48 FDFITENVYIVERGTTR---DGKRSKRMMCTCQFDPETDNHSEACGENCLNRLLMIECGS 104
Query: 96 GCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEE 155
C CG C NK F +++++TE G G+ A DI R FV+EY GEV + E
Sbjct: 105 RCPCGEYCTNKRFTRSSYANVEVIKTEMKGWGLKATCDISRYSFVMEYCGEVCSLEEFER 164
Query: 156 RLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIF 215
R + ++Y + D ++DAT KGN SR+INHSC PN E QKW ++G R+G F
Sbjct: 165 RRNIYEKESRRHYYFMSLKTDEILDATRKGNLSRFINHSCEPNCETQKWTVNGRLRVGFF 224
Query: 216 ATRDIKKGENLTYDYQYEFLHDSL 239
A R I GE LT+DYQ++ +S+
Sbjct: 225 ALRHIPAGEELTFDYQFQRFGESV 248
>gi|197313748|ref|NP_054878.5| histone-lysine N-methyltransferase SETD2 [Homo sapiens]
gi|296452963|sp|Q9BYW2.3|SETD2_HUMAN RecName: Full=Histone-lysine N-methyltransferase SETD2; AltName:
Full=HIF-1; AltName: Full=Huntingtin yeast partner B;
AltName: Full=Huntingtin-interacting protein 1;
Short=HIP-1; AltName: Full=Huntingtin-interacting protein
B; AltName: Full=Lysine N-methyltransferase 3A; AltName:
Full=SET domain-containing protein 2; Short=hSET2;
AltName: Full=p231HBP
Length = 2564
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 1473 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1532
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R
Sbjct: 1533 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1592
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 1593 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1652
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 1653 TKLVPSGSELTFDYQFQ 1669
>gi|326674803|ref|XP_003200208.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Danio
rerio]
Length = 1428
Score = 140 bits (353), Expect = 5e-31, Method: Composition-based stats.
Identities = 79/221 (35%), Positives = 122/221 (55%), Gaps = 10/221 (4%)
Query: 21 LKQIGNPVEFELPDWFIKPKAIPYV-FIKRNIYLTKRIKRRLEDD--GIFCSC---TASP 74
LK+ +P F D + K PY I+ N+YLT+R K + D + C C +
Sbjct: 482 LKEFSDP--FVWRDKAKQKKMPPYFDLIEENLYLTERKKNKSHRDIKRMQCECAILSKEE 539
Query: 75 GSSGV--CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADE 132
+ G+ C DC +L+ CSS C G C N+ FQ + +++ TE G G+ A +
Sbjct: 540 RARGILACGEDCLNRLLMIECSSRCLNGAYCSNRRFQMKQHADYEVILTESKGWGLRAAK 599
Query: 133 DIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYIN 192
D++ FV+EY GEV+D + + R+ + ++Y + + +IDAT KGN SR++N
Sbjct: 600 DLQPNTFVLEYCGEVLDHREFKARVKEYARNKNIHYYFMALKNNEIIDATLKGNCSRFMN 659
Query: 193 HSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
HSC PN E QKW ++G+ RIG F T+ + G LT+DYQ++
Sbjct: 660 HSCEPNCETQKWTVNGQLRIGFFTTKAVTAGTELTFDYQFQ 700
>gi|431905124|gb|ELK10179.1| Histone-lysine N-methyltransferase SETD2 [Pteropus alecto]
Length = 2482
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 1391 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1450
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R
Sbjct: 1451 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1510
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 1511 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1570
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 1571 TKLVPSGSELTFDYQFQ 1587
>gi|410951014|ref|XP_003982197.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Felis catus]
Length = 2064
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 973 FDLIEENVYLTERKKNKSHRDIKRMQCECAPLSKDERAQGEIACGEDCLNRLLMIECSSR 1032
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R
Sbjct: 1033 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1092
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 1093 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1152
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 1153 TKLVPSGSELTFDYQFQ 1169
>gi|417406999|gb|JAA50136.1| Putative clathrin coat binding protein/huntingtin [Desmodus rotundus]
Length = 2557
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 1466 FDLIEENVYLTERKKNKSHRDIKRMQCECAPLSKDERAQGEIACGEDCLNRLLMIECSSR 1525
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R
Sbjct: 1526 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1585
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 1586 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1645
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 1646 TKLVPSGSELTFDYQFQ 1662
>gi|344276291|ref|XP_003409942.1| PREDICTED: histone-lysine N-methyltransferase SETD2 [Loxodonta
africana]
Length = 2551
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 110/197 (55%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 1459 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1518
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A D+ FV+EY GEV+D + + R
Sbjct: 1519 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAARDLPSNTFVLEYCGEVLDHKEFKAR 1578
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 1579 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1638
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 1639 TKLVPSGSELTFDYQFQ 1655
>gi|355559685|gb|EHH16413.1| hypothetical protein EGK_11693 [Macaca mulatta]
Length = 2343
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 1252 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1311
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R
Sbjct: 1312 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1371
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 1372 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1431
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 1432 TKLVPSGSELTFDYQFQ 1448
>gi|6841376|gb|AAF29041.1|AF161554_1 HSPC069 [Homo sapiens]
Length = 591
Score = 140 bits (352), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 23 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDKELKGEIACGEDCLNRLLMIECSSR 82
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R
Sbjct: 83 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 142
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 143 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 202
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 203 TKLVPSGSELTFDYQFQ 219
>gi|50512437|gb|AAT77613.1| HSPC069 isoform b [Homo sapiens]
Length = 1211
Score = 140 bits (352), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 970 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1029
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R
Sbjct: 1030 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1089
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 1090 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1149
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 1150 TKLVPSGSELTFDYQFQ 1166
>gi|119585209|gb|EAW64805.1| SET domain containing 2, isoform CRA_a [Homo sapiens]
Length = 1538
Score = 140 bits (352), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 970 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1029
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R
Sbjct: 1030 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1089
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 1090 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1149
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 1150 TKLVPSGSELTFDYQFQ 1166
>gi|86278478|gb|ABC88477.1| Wolf-Hirschhorn syndrome candidate 1 protein [Danio rerio]
Length = 1366
Score = 140 bits (352), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 77/199 (38%), Positives = 112/199 (56%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K P+ +IK N R++ D C P + C + C MLL C
Sbjct: 983 KPPPFKYIKVN-KPCGRVQVYTADISEIPKCNCKPSTERPCSFESECLNRMLLYECHPQV 1041
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T G G+++ DIK+GEFV EYVGE+ID++ C R
Sbjct: 1042 CPAGERCQNQDFTKRLYPETKIIRTAGKGWGLISLRDIKKGEFVNEYVGELIDEEECRSR 1101
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + T+FY+ I++D +IDA KGN SR++NHSC PN E QKW ++G+TR+G+FA
Sbjct: 1102 IRHAQENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFA 1161
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1162 VCDIPAGTELTFNYNLDCL 1180
>gi|148709229|gb|EDL41175.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_a [Mus
musculus]
Length = 2588
Score = 140 bits (352), Expect = 6e-31, Method: Composition-based stats.
Identities = 70/169 (41%), Positives = 103/169 (60%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R +++ +T + G
Sbjct: 1793 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGVRCQNQCFSKRQYPDVEIFRTLQRGW 1852
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1853 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1912
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 1913 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1961
>gi|68565655|sp|O88491.1|NSD1_MOUSE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific; AltName: Full=H3-K36-HMTase; AltName:
Full=H4-K20-HMTase; AltName: Full=Nuclear
receptor-binding SET domain-containing protein 1;
Short=NR-binding SET domain-containing protein
gi|3329465|gb|AAC40182.1| NSD1 protein [Mus musculus]
Length = 2588
Score = 140 bits (352), Expect = 6e-31, Method: Composition-based stats.
Identities = 70/169 (41%), Positives = 103/169 (60%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R +++ +T + G
Sbjct: 1793 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGVRCQNQCFSKRQYPDVEIFRTLQRGW 1852
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1853 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1912
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 1913 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1961
>gi|350420879|ref|XP_003492658.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
isoform 1 [Bombus impatiens]
Length = 1230
Score = 140 bits (352), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 75/166 (45%), Positives = 99/166 (59%), Gaps = 1/166 (0%)
Query: 66 IFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKC 124
+ C C A + DC +LL CS G C G C N+ F R M+ T
Sbjct: 786 VACDCDAEWENPCAPGTDCLNRILLVECSPGICPAGAKCKNQAFVQRQYPAMEPFHTVGR 845
Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
G G+ + E IK G+FVIEYVGEVID+ + RL + K L NFY I+ + +IDA K
Sbjct: 846 GWGLRSLEHIKAGQFVIEYVGEVIDEAEYKRRLHRKKELKNENFYFLTIDNNRMIDAEPK 905
Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
GN SR++NHSC PN E QKW ++G+TRIG+FA DI++GE LT++Y
Sbjct: 906 GNLSRFMNHSCSPNCETQKWTVNGDTRIGLFALCDIERGEELTFNY 951
>gi|356518575|ref|XP_003527954.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like [Glycine
max]
Length = 2037
Score = 140 bits (352), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 77/190 (40%), Positives = 107/190 (56%), Gaps = 1/190 (0%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
+ I N +L + K + D+ + C C S C +C +L C G C CG+
Sbjct: 1110 FTHILTNEFLHRSHKTQTIDEIMVCHCKPSQEGKLGCGDECLNRILNIECVQGTCPCGDR 1169
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C N+ FQ +K + K G G+ A E++ +G+F+IEYVGEV+D Q E R +
Sbjct: 1170 CSNQQFQKHKYASLKWFKCGKKGYGLKAIENVAQGQFLIEYVGEVLDMQAYEARQREYAL 1229
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
G +FY +N VIDA+ KGN R+INHSC PN +KW+++GE IG+FA RDIKK
Sbjct: 1230 KGHRHFYFMTLNGSEVIDASAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKK 1289
Query: 223 GENLTYDYQY 232
E LT+DY Y
Sbjct: 1290 DEELTFDYNY 1299
>gi|443722431|gb|ELU11300.1| hypothetical protein CAPTEDRAFT_160470, partial [Capitella teleta]
Length = 282
Score = 140 bits (352), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 74/171 (43%), Positives = 93/171 (54%), Gaps = 7/171 (4%)
Query: 70 CTASPGSS-------GVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTE 122
C SP S C DC ML+ C S C CG+ C NK FQ R K + + E
Sbjct: 3 CDCSPPSQEDISMGLAFCGEDCLNRMLMIECGSRCPCGDMCTNKRFQRRHYAKTEPFRAE 62
Query: 123 KCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDAT 182
G G+ A D+ G FV+EYVGEV+D R + T+ Y +N D +IDAT
Sbjct: 63 VKGWGLRATSDLSSGVFVMEYVGEVLDYPNFRLRCKQYAEDNHTHHYFMALNGDEIIDAT 122
Query: 183 YKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
KGN SR+INHSC PN E QKW ++G+ R+G F R I G LT+DYQ+E
Sbjct: 123 QKGNTSRFINHSCDPNCETQKWTVNGQLRVGFFTLRSIPAGTELTFDYQFE 173
>gi|350420881|ref|XP_003492659.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
isoform 2 [Bombus impatiens]
Length = 1239
Score = 140 bits (352), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 75/166 (45%), Positives = 99/166 (59%), Gaps = 1/166 (0%)
Query: 66 IFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKC 124
+ C C A + DC +LL CS G C G C N+ F R M+ T
Sbjct: 795 VACDCDAEWENPCAPGTDCLNRILLVECSPGICPAGAKCKNQAFVQRQYPAMEPFHTVGR 854
Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
G G+ + E IK G+FVIEYVGEVID+ + RL + K L NFY I+ + +IDA K
Sbjct: 855 GWGLRSLEHIKAGQFVIEYVGEVIDEAEYKRRLHRKKELKNENFYFLTIDNNRMIDAEPK 914
Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
GN SR++NHSC PN E QKW ++G+TRIG+FA DI++GE LT++Y
Sbjct: 915 GNLSRFMNHSCSPNCETQKWTVNGDTRIGLFALCDIERGEELTFNY 960
>gi|328865276|gb|EGG13662.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 1418
Score = 140 bits (352), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 83/202 (41%), Positives = 113/202 (55%), Gaps = 8/202 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
+ F NIYL ++ + +++ I C+C A C C C SG C G+
Sbjct: 991 FKFTGSNIYLERKKRTEEQENIIMCNCPAE----SPCLESCLNRKSYFECHSGYCIHGDR 1046
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C N+ FQ + ++ EK G G+ A ++I F+IEY GEVI Q C ER+ + +
Sbjct: 1047 CRNQRFQKQQYCRILPFSAEKKGWGLKAVDNIGAKTFIIEYCGEVISKQKCLERMTESE- 1105
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
E FY ++R +DA+ KGN +R+INHSC PN E QKW +DGE RIGIFA RDIK+
Sbjct: 1106 -SEKYFYFLTLDRLECLDASRKGNLARFINHSCDPNCETQKWNVDGEVRIGIFAIRDIKR 1164
Query: 223 GENLTYDYQYE-FLHDSLIAYC 243
GE LT+DY YE F + YC
Sbjct: 1165 GEELTFDYNYERFGTSKQVCYC 1186
>gi|195392836|ref|XP_002055060.1| GJ19006 [Drosophila virilis]
gi|194149570|gb|EDW65261.1| GJ19006 [Drosophila virilis]
Length = 2101
Score = 140 bits (352), Expect = 7e-31, Method: Composition-based stats.
Identities = 72/165 (43%), Positives = 94/165 (56%), Gaps = 1/165 (0%)
Query: 80 CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
C C ML+ C C G C NK FQ ++ +TEK G GI A+ I GEF
Sbjct: 1028 CGAGCINRMLMIECGPLCTNGERCTNKRFQLHQCWPCRVFRTEKKGCGITAELQIPPGEF 1087
Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
++EYVGEVID + E R ++Y + + +IDAT KGN SRYINHSC PN
Sbjct: 1088 IMEYVGEVIDSEEFERRQHIYSRDRNRHYYFMALRGEAIIDATAKGNISRYINHSCDPNA 1147
Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
E QKW ++GE RIG F+ + I GE +T+DYQY+ + D+ YC
Sbjct: 1148 ETQKWTVNGELRIGFFSVKTIMPGEEITFDYQYQRYGRDAQRCYC 1192
>gi|119585211|gb|EAW64807.1| SET domain containing 2, isoform CRA_c [Homo sapiens]
Length = 1819
Score = 140 bits (352), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 1251 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1310
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R
Sbjct: 1311 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 1370
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 1371 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1430
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 1431 TKLVPSGSELTFDYQFQ 1447
>gi|355746723|gb|EHH51337.1| hypothetical protein EGM_10693 [Macaca fascicularis]
Length = 2343
Score = 140 bits (352), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 110/197 (55%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 1252 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 1311
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A D+ FV+EY GEV+D + + R
Sbjct: 1312 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAARDLPSNTFVLEYCGEVLDHKEFKAR 1371
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 1372 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 1431
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 1432 TKLVPSGSELTFDYQFQ 1448
>gi|340372263|ref|XP_003384664.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Amphimedon queenslandica]
Length = 1171
Score = 139 bits (351), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 75/193 (38%), Positives = 109/193 (56%), Gaps = 5/193 (2%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSS-GCKCGNS 102
+V IKRN YLT R E + C+ ++ G S +C + + CS+ C G+
Sbjct: 709 FVHIKRNKYLTSRPPLETETEPCKCTESSPCGPSS----NCINRAIFTECSTENCPAGDK 764
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C N+ + T G G+ + G+FVIEYVGE++D +ERL K +
Sbjct: 765 CQNQRMLRNESVPTQTFYTGNRGWGLKTMRSLSPGDFVIEYVGEIVDMAAVQERLKKTQE 824
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
++FY + R+++IDA K N +R+INHSC PN E QKW ++GETRIGIFA +DIK+
Sbjct: 825 ASVSSFYFLTLERNLIIDARVKSNHARFINHSCDPNCETQKWTVNGETRIGIFAIKDIKE 884
Query: 223 GENLTYDYQYEFL 235
LT+DYQ++ L
Sbjct: 885 DTELTFDYQFDCL 897
>gi|242017983|ref|XP_002429463.1| set domain protein, putative [Pediculus humanus corporis]
gi|212514395|gb|EEB16725.1| set domain protein, putative [Pediculus humanus corporis]
Length = 2269
Score = 139 bits (350), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 73/197 (37%), Positives = 113/197 (57%), Gaps = 2/197 (1%)
Query: 39 PKAIPYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG- 96
P+ P+V IK N + K ++ + + C C + S DC MLL C+
Sbjct: 1622 PRPPPFVKIKTNRPVGKVQLGSGYQSEPQTCDCDPNYESPCGPGSDCLNRMLLVECNPNY 1681
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F+ R ++ +TE+ G G+ + DI +G FVIEYVGEV+DD+ + R
Sbjct: 1682 CNAGTRCKNQSFEKREYPPLEPYRTERRGWGLRSTVDIPKGSFVIEYVGEVVDDEEFKRR 1741
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + + + N+Y I++D +IDA KGN +R++NHSC PN E QKW ++ +TR+G+FA
Sbjct: 1742 MKRKQETMDNNYYFLTIDKDRIIDAGPKGNLARFMNHSCSPNCETQKWTVNMDTRVGLFA 1801
Query: 217 TRDIKKGENLTYDYQYE 233
DI G LT++Y +
Sbjct: 1802 LEDIPAGTELTFNYNLD 1818
>gi|66828443|ref|XP_647576.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|60475584|gb|EAL73519.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 898
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 81/211 (38%), Positives = 118/211 (55%), Gaps = 6/211 (2%)
Query: 25 GNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGI-FCSCTASPGSSGVCDRD 83
NP ++ K + + FI RN ++ + K E D I C+C+ S GS VC D
Sbjct: 528 ANPDYSDIAVLIQKSRNKKFGFISRNFFIERTEKILYEIDDIDICNCSKSSGS--VCGDD 585
Query: 84 CHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIE 142
C C+ C+ G C N+ FQ + +K T K G G++A+EDI+ +F++E
Sbjct: 586 CLNRESYVECNIEHCELGKKCTNQRFQRKQYSNIKPAFTGKKGWGLIANEDIEEKQFIME 645
Query: 143 YVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQ 202
Y GEVI QTC R+ + ++ E FY ++ +DA+ +GN +R++NHSC PN E Q
Sbjct: 646 YCGEVISKQTCLRRMKEAEN--EKFFYFLTLDSKECLDASKRGNLARFMNHSCDPNCETQ 703
Query: 203 KWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
KW + GE +IGIFA + I KG LT+DY YE
Sbjct: 704 KWTVGGEVKIGIFAIKPIPKGTELTFDYNYE 734
>gi|302839691|ref|XP_002951402.1| histone H3 Lys 36 methyltransferase/ASH1 [Volvox carteri f.
nagariensis]
gi|300263377|gb|EFJ47578.1| histone H3 Lys 36 methyltransferase/ASH1 [Volvox carteri f.
nagariensis]
Length = 2345
Score = 139 bits (350), Expect = 1e-30, Method: Composition-based stats.
Identities = 84/214 (39%), Positives = 115/214 (53%), Gaps = 16/214 (7%)
Query: 30 FELP-DWFIK--PKAIPYVFIKRNIYLTKRIKRRLEDDGI-FCSCT----ASPG--SSGV 79
F LP DW + P+ P I+RN++L++ +RL D I C+C A+PG
Sbjct: 1478 FALPRDWLVTRPPRYEP---IRRNVWLSRNKPKRLPKDEINICACRPPPFAAPGELQRMG 1534
Query: 80 CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGE 138
C ++C + C + C CG C N+ K ++ TE G G+ E + +G
Sbjct: 1535 CAQNCLNRLSSVLCDAKLCPCGELCSNRSLHLLRQPKTEVFLTENRGWGVRTMEPLSKGA 1594
Query: 139 FVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPN 198
F+IEY GEVIDD+ R+ + GE +FY+ E+ + IDA KGN +R IN SC PN
Sbjct: 1595 FIIEYAGEVIDDRELGRRMEHARMNGEPHFYIMELAAGLYIDARRKGNIARLINSSCDPN 1654
Query: 199 TEMQKW--IIDGETRIGIFATRDIKKGENLTYDY 230
E QKW GE R+GIFA+RDI GE L YDY
Sbjct: 1655 CETQKWHDASTGEIRVGIFASRDIPPGEELVYDY 1688
>gi|157278865|gb|AAI15212.1| Whsc1 protein [Danio rerio]
Length = 486
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 77/199 (38%), Positives = 112/199 (56%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K P+ +IK N R++ D C P + C + C MLL C
Sbjct: 103 KPPPFKYIKVN-KPCGRVQVYTADISEIPKCNCKPSTERPCSFESECLNRMLLYECHPQV 161
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T G G+++ DIK+GEFV EYVGE+ID++ C R
Sbjct: 162 CPAGERCQNQDFTKRLYPETKIIRTAGKGWGLISLRDIKKGEFVNEYVGELIDEEECRSR 221
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + T+FY+ I++D +IDA KGN SR++NHSC PN E QKW ++G+TR+G+FA
Sbjct: 222 IRHAQENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFA 281
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 282 VCDIPAGTELTFNYNLDCL 300
>gi|242019388|ref|XP_002430143.1| histone-lysine N-methyltransferase SUVR5, putative [Pediculus
humanus corporis]
gi|212515234|gb|EEB17405.1| histone-lysine N-methyltransferase SUVR5, putative [Pediculus
humanus corporis]
Length = 1448
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 68/154 (44%), Positives = 90/154 (58%)
Query: 80 CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
C DC ML C S C G+ C NK FQ K+ +TEK G G+ A+ED+ F
Sbjct: 528 CGEDCLNRMLFLECGSRCALGDRCTNKRFQKLQYANCKIFKTEKKGFGLRAEEDLSGNTF 587
Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
++EYVGEV++ + R+ +FY + D VIDAT KGN SR+INHSC PN
Sbjct: 588 IMEYVGEVVNQKEFGRRVKMYAKENNKHFYFMALKGDAVIDATNKGNISRFINHSCDPNA 647
Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
E QKW I+GE R+G F R + GE +T+DYQ++
Sbjct: 648 ETQKWTINGELRVGFFTRRFVAAGEEITFDYQFQ 681
>gi|395505173|ref|XP_003756919.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Sarcophilus harrisii]
Length = 2717
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 71/169 (42%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C AS + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1898 CNCKASDDNPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1957
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1958 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2017
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 2018 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2066
>gi|357149500|ref|XP_003575133.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like
[Brachypodium distachyon]
Length = 1022
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 77/196 (39%), Positives = 108/196 (55%), Gaps = 5/196 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSS-GCK 98
+A + +K N +L + + + D+ + C+C C C ML C+ C
Sbjct: 171 RAPSWTAVKTNSFLHRNRRTQSIDESMVCNCKPPQDGRMGCRDGCLNRMLNIECTKRTCP 230
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT--CEER 156
CG C N+ FQ R K+ + K G G+ E++ G F+IEYVGEV+D C +R
Sbjct: 231 CGEYCSNQQFQRRSYAKISWFCSGKKGFGLQLKEEVTEGRFLIEYVGEVLDITAYECRQR 290
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ K G+ +FY +N VIDA KGN R+INHSC PN +KW+++GE IGIFA
Sbjct: 291 YYASK--GQKHFYFMALNGGEVIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFA 348
Query: 217 TRDIKKGENLTYDYQY 232
R+IKKGE LT+DY Y
Sbjct: 349 MRNIKKGEELTFDYNY 364
>gi|449474840|ref|XP_002193971.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Taeniopygia guttata]
Length = 1651
Score = 139 bits (349), Expect = 2e-30, Method: Composition-based stats.
Identities = 77/199 (38%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSS-G 96
K PY IK N + K ++ D C P C D C MLL C
Sbjct: 595 KPPPYKHIKVNRPVGK-VQIFTADLSEIPRCNCKPTDENPCGLDSECINRMLLYECHPLV 653
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R ++++ +T G G+ A DI++GEFV EYVGE+ID++ C R
Sbjct: 654 CPAGERCQNQCFSKRQYPEVQIFRTLARGWGLQAKTDIRKGEFVNEYVGELIDEEECRAR 713
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ +++D +IDA KGN +R++NH C PN E QKW ++G+TR+G+FA
Sbjct: 714 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWCVNGDTRVGLFA 773
Query: 217 TRDIKKGENLTYDYQYEFL 235
+IK G LT++Y E L
Sbjct: 774 LVNIKAGTELTFNYNLECL 792
>gi|356576073|ref|XP_003556159.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Glycine
max]
Length = 480
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 77/194 (39%), Positives = 110/194 (56%), Gaps = 2/194 (1%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCT-ASPGSSGVCDRDCHCGMLLSSCSSG-CKCGN 101
Y+ + +N + +R K++ E+D C C + C C + + C+ G C CG
Sbjct: 11 YIHMNQNEFCIRRHKKQKEEDIAICECKYDADDPDNACGDSCLNVLTSTECTPGYCPCGV 70
Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
C N+ FQ K KL +TE G G++ADEDIK G+FVIEY GEVI + + R +
Sbjct: 71 LCKNQKFQKCEYAKTKLFKTEGRGWGLLADEDIKAGQFVIEYCGEVISWKEAKRRSQAYE 130
Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
+ G + ++ +N IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA DI
Sbjct: 131 NQGLKDAFIICLNASESIDATRKGSLARFINHSCQPNCETRKWNVLGEIRVGIFAKHDIP 190
Query: 222 KGENLTYDYQYEFL 235
G L YDY +E+
Sbjct: 191 IGNELAYDYNFEWF 204
>gi|68611259|emb|CAD41011.3| OSJNBa0042L16.10 [Oryza sativa Japonica Group]
Length = 1153
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 81/222 (36%), Positives = 121/222 (54%), Gaps = 30/222 (13%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG------MLLSSCSSG- 96
Y+ I+ N +L +R KR+ E+D C C + + D D CG + + C+ G
Sbjct: 194 YIHIETNDFLHRRHKRQKEEDIAVCECQYN-----LLDPDSACGDRCLNVLTSTECTPGY 248
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDI------------------KRGE 138
C CG C N+ FQ +LV+TE G G++ADE+I + G+
Sbjct: 249 CLCGVYCKNQRFQKSQYAATRLVKTEGRGWGLLADENIMVTEFTLILWSANVVKYIQAGQ 308
Query: 139 FVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPN 198
FV+EY GEVI + + R ++ G T+ Y+ +N D IDAT KG+ +R+INHSC PN
Sbjct: 309 FVMEYCGEVISWKEAKRRSQAYENQGLTDAYIIYLNADESIDATKKGSLARFINHSCQPN 368
Query: 199 TEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLI 240
E +KW + GE R+GIFA +DI G L+YDY +E+ +++
Sbjct: 369 CETRKWNVLGEVRVGIFAKQDIPIGTELSYDYNFEWFGGAMV 410
>gi|326488341|dbj|BAJ93839.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1070
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 75/194 (38%), Positives = 106/194 (54%), Gaps = 1/194 (0%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSS-GCK 98
+A + ++ N YL + + + D+ + C+C C C ML C+ C
Sbjct: 269 RAPSWTNLRSNAYLHRNRRNQSIDESMVCNCKHPQDGRMGCRDGCLNRMLNIECAKRTCP 328
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
C C N+ FQ R K++ + K G G+ E++ G F+IEYVGEV+D T E R
Sbjct: 329 CEEQCSNQQFQRRNYAKIEWFHSGKKGYGLQLLEEVSEGRFLIEYVGEVLDITTYESRQR 388
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
G+ +FY ++ VIDA KGN R+INHSC PN +KW+++GE IGIFA R
Sbjct: 389 DYASKGKKHFYFMALDGGEVIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFAMR 448
Query: 219 DIKKGENLTYDYQY 232
+IKKGE LT+DY Y
Sbjct: 449 NIKKGEELTFDYNY 462
>gi|350587283|ref|XP_003128857.3| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Sus
scrofa]
Length = 1338
Score = 138 bits (348), Expect = 2e-30, Method: Composition-based stats.
Identities = 69/169 (40%), Positives = 103/169 (60%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C + + D +C ML+ C C G C N+ F R + K+++T+ G
Sbjct: 989 CNCKPTDENPCGLDSECLNRMLMFECHPQVCPAGEFCQNQCFTKRQYPETKIIKTDGKGW 1048
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+VA DI++GEFV EYVGE+ID++ C R+ + + T FY+ I++D +IDA KGN
Sbjct: 1049 GLVAKRDIRKGEFVNEYVGELIDEEECMARIRRAQEHDITRFYMLTIDKDRIIDAGPKGN 1108
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
SR++NHSC PN E KW ++G+TR+G+FA DI G LT++Y + L
Sbjct: 1109 YSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCL 1157
>gi|403072167|pdb|4FMU|A Chain A, Crystal Structure Of Methyltransferase Domain Of Human Set
Domain- Containing Protein 2 Compound: Pr-Snf
gi|407944022|pdb|4H12|A Chain A, The Crystal Structure Of Methyltransferase Domain Of Human
Set Domain- Containing Protein 2 In Complex With
S-Adenosyl-L-Homocysteine
Length = 278
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHCGMLLSSCSSG 96
+ I+ N+YLT+R K + D C +P S C DC +L+ CSS
Sbjct: 40 FDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLNRLLMIECSSR 99
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GEV+D + + R
Sbjct: 100 CPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKAR 159
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++Y + D +IDAT KGN SR++NHSC PN E QKW ++G+ R+G F
Sbjct: 160 VKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFT 219
Query: 217 TRDIKKGENLTYDYQYE 233
T+ + G LT+DYQ++
Sbjct: 220 TKLVPSGSELTFDYQFQ 236
>gi|126303359|ref|XP_001372863.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Monodelphis
domestica]
Length = 1435
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 82/222 (36%), Positives = 118/222 (53%), Gaps = 6/222 (2%)
Query: 17 FNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS 76
F++L Q N EL KP PY IK N + K ++ ++ D C P
Sbjct: 1046 FHELKAQRENKEALELERNSRKPP--PYKHIKANKVIGK-VQIQVADLSEIPRCNCKPAD 1102
Query: 77 SGVCDRDCHC--GMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADED 133
C + C ML C C G+ C N+ F R ++++TE+ G G+
Sbjct: 1103 ENPCGLESECLNRMLQYECHPQVCPAGDRCQNQCFTKRLYPDAEVIKTERRGWGLRTKRS 1162
Query: 134 IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
IK+GEFV EYVGE+ID++ C R+ + TNFY+ + +D +IDA KGN SR++NH
Sbjct: 1163 IKKGEFVNEYVGELIDEEECRLRIKRAHENSITNFYMLTVTKDRIIDAGPKGNYSRFMNH 1222
Query: 194 SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
SC PN E QKW ++G+ R+G+FA DI G LT++Y + L
Sbjct: 1223 SCNPNCETQKWTVNGDIRVGLFALCDIPAGVELTFNYNLDCL 1264
>gi|334311241|ref|XP_003339591.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Monodelphis
domestica]
Length = 2705
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 71/169 (42%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C AS + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1897 CNCKASDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1956
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1957 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2016
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 2017 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2065
>gi|363739108|ref|XP_414538.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Gallus gallus]
Length = 2412
Score = 138 bits (348), Expect = 2e-30, Method: Composition-based stats.
Identities = 77/199 (38%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSS-G 96
K PY IK N + K ++ D C P C D C MLL C
Sbjct: 1581 KPPPYKHIKVNRPVGK-VQIFTADLSEIPRCNCKPTDENPCGLDSECINRMLLYECHPLV 1639
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R ++++ +T G G+ A DI++GEFV EYVGE+ID++ C R
Sbjct: 1640 CPAGERCQNQCFSKRQYPEVQIFRTLARGWGLQAKTDIRKGEFVNEYVGELIDEEECRAR 1699
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ +++D +IDA KGN +R++NH C PN E QKW ++G+TR+G+FA
Sbjct: 1700 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWCVNGDTRVGLFA 1759
Query: 217 TRDIKKGENLTYDYQYEFL 235
+IK G LT++Y E L
Sbjct: 1760 IVNIKAGTELTFNYNLECL 1778
>gi|157126101|ref|XP_001654536.1| set domain protein [Aedes aegypti]
gi|108873380|gb|EAT37605.1| AAEL010414-PA [Aedes aegypti]
Length = 1480
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 74/195 (37%), Positives = 106/195 (54%), Gaps = 1/195 (0%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSS-GCKCGNS 102
+V IK N Y+ + E DG C C A+ D +C L+ C+ C G
Sbjct: 1149 FVKIKSNRYVPPLKAPKDEMDGNVCVCKATDSDPCGPDSNCINRALMVECNPKSCPAGEL 1208
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C N+ F+ R + + + G G+VA EDI++G+FVIEYVGEVI ++ E RL
Sbjct: 1209 CQNQCFEKRQYPSLAARRIPQKGWGLVAQEDIRQGQFVIEYVGEVISNEELERRLQHKVA 1268
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
+ N+Y ++ ++ IDA KGN +R+INHSC PN E W + G +G+FA DIK
Sbjct: 1269 QKDENYYFLTVDSELTIDAGPKGNLARFINHSCEPNCETMLWTVGGAQSVGLFAIMDIKA 1328
Query: 223 GENLTYDYQYEFLHD 237
GE LT++Y +E D
Sbjct: 1329 GEELTFNYNFESKSD 1343
>gi|444511191|gb|ELV09829.1| Histone-lysine N-methyltransferase NSD3 [Tupaia chinensis]
Length = 1235
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 80/222 (36%), Positives = 117/222 (52%), Gaps = 6/222 (2%)
Query: 17 FNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS 76
F +L Q + E+ KP PY IK N + K ++ ++ D C P
Sbjct: 846 FQELKAQRESKEALEMEKHSRKPP--PYKHIKANKVIGK-VQIQVADLAEIPRCNCKPAD 902
Query: 77 SGVCDRDCHC--GMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADED 133
C + C ML C C G+ C N+ F R ++++TE+ G G+
Sbjct: 903 ENPCGLESECLNRMLQYECHPQVCPAGDRCQNQCFTKRLYPDAEVIKTERRGWGLRTKRS 962
Query: 134 IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
IK+GEFV EYVGE+ID++ C R+ + TNFY+ + +D +IDA KGN SR++NH
Sbjct: 963 IKKGEFVNEYVGELIDEEECRLRIQRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNH 1022
Query: 194 SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
SC PN E QKW ++G+ R+G+FA DI G LT++Y + L
Sbjct: 1023 SCNPNCETQKWTVNGDVRVGLFALCDIPAGMELTFNYNLDCL 1064
>gi|94732456|emb|CAK03662.1| novel protein similar to vertebrate Wolf-Hirschhorn syndrome
candidate 1 (WHSC1) [Danio rerio]
Length = 728
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 77/199 (38%), Positives = 112/199 (56%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K P+ +IK N R++ D C P + C + C MLL C
Sbjct: 345 KPPPFKYIKVN-KPCGRVQVYTADISEIPKCNCKPSTERPCSFESECLNRMLLYECHPQV 403
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T G G+++ DIK+GEFV EYVGE+ID++ C R
Sbjct: 404 CPAGERCQNQDFTKRLYPETKIIRTAGKGWGLISLRDIKKGEFVNEYVGELIDEEECRSR 463
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + T+FY+ I++D +IDA KGN SR++NHSC PN E QKW ++G+TR+G+FA
Sbjct: 464 IRHAQENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFA 523
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 524 VCDIPAGTELTFNYNLDCL 542
>gi|417406466|gb|JAA49891.1| Putative histone-lysine n-methyltransferase nsd3-like isoform 3
[Desmodus rotundus]
Length = 1438
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 76/199 (38%), Positives = 108/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C D C ML C
Sbjct: 1070 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLDSECLNRMLQYECHPQV 1128
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1129 CPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1188
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1249 LCDIPAGMELTFNYNLDCL 1267
>gi|384250559|gb|EIE24038.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
C-169]
Length = 295
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 72/195 (36%), Positives = 104/195 (53%), Gaps = 1/195 (0%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
+ I+ NIY + K + EDD + C C C DC ML C G C CG
Sbjct: 11 WQLIRENIYTHRAPKVQDEDDVMLCHCRLPNDGGPGCGPDCLNRMLNMECVPGYCPCGER 70
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C N+ F R K++ + G G+ A +D+ G+F++EY+GEV++++ R +
Sbjct: 71 CSNQQFSKRQYAKLEKRRAGAKGFGLFATQDLVAGQFIVEYIGEVLEEEEYLRRKDYYQE 130
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
G+ ++Y I VIDA KG R+INHSC PN E QKW++ GE IG++A +DI
Sbjct: 131 SGQRHYYFMNIGNGEVIDAARKGALGRFINHSCNPNCETQKWVVRGELAIGLYALKDIPA 190
Query: 223 GENLTYDYQYEFLHD 237
G LT+DY +E D
Sbjct: 191 GVELTFDYNFERYGD 205
>gi|302846429|ref|XP_002954751.1| histone H3 methyltransferase [Volvox carteri f. nagariensis]
gi|300259934|gb|EFJ44157.1| histone H3 methyltransferase [Volvox carteri f. nagariensis]
Length = 261
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 81/210 (38%), Positives = 113/210 (53%), Gaps = 4/210 (1%)
Query: 38 KPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSS 95
K + IP V+ +K N+ ++ K + EDD + C C C DC ML C
Sbjct: 1 KRRRIPAVWQLLKDNVLAHRKRKVQDEDDIMICHCKPPWRGGDGCGPDCINRMLCIECVP 60
Query: 96 G-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCE 154
G C C N+ F R +++ + G G+ A EDIK G+F+IEY+GEV+++ +
Sbjct: 61 GFCPSEEKCTNQMFSKRMYANLEIRRAGAKGFGLFALEDIKAGQFIIEYIGEVLEEDEYQ 120
Query: 155 ERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGI 214
R +G+ ++Y I VIDA KGN SR+INHSC PN E QKW++ GE IG+
Sbjct: 121 RRKEYYMSVGQRHYYFMNIGNGEVIDACRKGNISRFINHSCEPNCETQKWLVRGELAIGL 180
Query: 215 FATRDIKKGENLTYDYQYEFLHDS-LIAYC 243
FA RDI K LT+DY +E D + YC
Sbjct: 181 FAVRDIPKDTELTFDYNFERYGDKPMRCYC 210
>gi|224082490|ref|XP_002306713.1| SET domain protein [Populus trichocarpa]
gi|222856162|gb|EEE93709.1| SET domain protein [Populus trichocarpa]
Length = 495
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 76/195 (38%), Positives = 116/195 (59%), Gaps = 6/195 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTAS---PGSSGVCDRDCHCGMLLSSCSSG-CKC 99
Y I+RN ++ ++ K++ E+D C C + P S+ C C + + C+ G C C
Sbjct: 11 YEHIERNDFIYRKHKKQKEEDIAICECKFNGDDPDSA--CGERCLNLLTSTECTPGYCPC 68
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
G C N+ FQ K +L +TE G G++ADE+IK G+F+IEY GEVI + ++R
Sbjct: 69 GVYCKNQRFQKFEYAKTQLFKTEGRGWGLLADEEIKAGQFIIEYCGEVISWKEAKKRSQV 128
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
++ G + ++ +N IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA ++
Sbjct: 129 YENQGLKDAFIISLNSTESIDATKKGSLARFINHSCQPNCETRKWTVLGEIRVGIFAKQN 188
Query: 220 IKKGENLTYDYQYEF 234
I G L YDY +E+
Sbjct: 189 ISIGTELAYDYNFEW 203
>gi|326928449|ref|XP_003210391.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like, partial [Meleagris gallopavo]
Length = 2336
Score = 137 bits (346), Expect = 3e-30, Method: Composition-based stats.
Identities = 77/199 (38%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSS-G 96
K PY IK N + K ++ D C P C D C MLL C
Sbjct: 1569 KPPPYKHIKVNRPVGK-VQIFTADLSEIPRCNCKPTDENPCGLDSECINRMLLYECHPLV 1627
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R ++++ +T G G+ A DI++GEFV EYVGE+ID++ C R
Sbjct: 1628 CPAGERCQNQCFSKRQYPEVQIFRTLARGWGLQAKTDIRKGEFVNEYVGELIDEEECRAR 1687
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ +++D +IDA KGN +R++NH C PN E QKW ++G+TR+G+FA
Sbjct: 1688 IRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWCVNGDTRVGLFA 1747
Query: 217 TRDIKKGENLTYDYQYEFL 235
+IK G LT++Y E L
Sbjct: 1748 IVNIKAGTELTFNYNLECL 1766
>gi|255575537|ref|XP_002528669.1| set domain protein, putative [Ricinus communis]
gi|223531892|gb|EEF33708.1| set domain protein, putative [Ricinus communis]
Length = 495
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 79/205 (38%), Positives = 119/205 (58%), Gaps = 7/205 (3%)
Query: 34 DWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSC---TASPGSSGVCDRDCHCGMLL 90
D +I A+ Y I++N + ++ +++ E+D C C + P S+ C C +
Sbjct: 2 DQWIDALAL-YEHIQQNDFSYRKHRKQKEEDIAICECRFDASDPESA--CGERCLNVLTS 58
Query: 91 SSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVID 149
+ C+ G C+CG C N+ FQ K +L +TE G G++ADEDIK G+F+IEY GEVI
Sbjct: 59 TECTPGYCRCGIFCKNQRFQKCEYFKTRLFKTEGRGWGLLADEDIKAGQFIIEYCGEVIS 118
Query: 150 DQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGE 209
+ + R + G + ++ +N IDAT KG+ +R+INHSC PN E +KW + GE
Sbjct: 119 WKEAKRRSQAYERQGLKDAFIISLNSSESIDATRKGSLARFINHSCQPNCETRKWNVLGE 178
Query: 210 TRIGIFATRDIKKGENLTYDYQYEF 234
R+GIFA +DI G L YDY +E+
Sbjct: 179 IRVGIFAKQDISIGTELAYDYNFEW 203
>gi|118572948|sp|Q6P2L6.2|NSD3_MOUSE RecName: Full=Histone-lysine N-methyltransferase NSD3; AltName:
Full=Nuclear SET domain-containing protein 3; AltName:
Full=Wolf-Hirschhorn syndrome candidate 1-like protein 1
homolog; Short=WHSC1-like protein 1
Length = 1439
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 80/222 (36%), Positives = 118/222 (53%), Gaps = 6/222 (2%)
Query: 17 FNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS 76
F +L Q + E+ KP PY IK N + K ++ ++ D C PG
Sbjct: 1051 FQELKAQRESKEALEMERTSRKPP--PYKHIKANKVIGK-VQVQVADLSEIPRCNCKPGD 1107
Query: 77 SGVCDRDCHCGMLLSS--CSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADED 133
C + C +S C C G+ C N+ F R ++++TE+ G G+
Sbjct: 1108 ENPCGLESQCLNRMSQYECHPQVCPAGDRCQNQCFTKRLYPDAEVIKTERRGWGLRTKRS 1167
Query: 134 IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
IK+GEFV EYVGE+ID++ C R+ + TNFY+ + +D +IDA KGN SR++NH
Sbjct: 1168 IKKGEFVNEYVGELIDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNH 1227
Query: 194 SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
SC PN E QKW ++G+ R+G+FA DI G LT++Y + L
Sbjct: 1228 SCNPNCETQKWTVNGDVRVGLFALCDIPAGMELTFNYNLDCL 1269
>gi|124486903|ref|NP_001074738.1| histone-lysine N-methyltransferase NSD3 isoform 2 [Mus musculus]
gi|189442807|gb|AAI67226.1| Wolf-Hirschhorn syndrome candidate 1-like 1 (human) [synthetic
construct]
Length = 1446
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 80/222 (36%), Positives = 118/222 (53%), Gaps = 6/222 (2%)
Query: 17 FNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS 76
F +L Q + E+ KP PY IK N + K ++ ++ D C PG
Sbjct: 1058 FQELKAQRESKEALEMERTSRKPP--PYKHIKANKVIGK-VQVQVADLSEIPRCNCKPGD 1114
Query: 77 SGVCDRDCHCGMLLSS--CSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADED 133
C + C +S C C G+ C N+ F R ++++TE+ G G+
Sbjct: 1115 ENPCGLESQCLNRMSQYECHPQVCPAGDRCQNQCFTKRLYPDAEVIKTERRGWGLRTKRS 1174
Query: 134 IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
IK+GEFV EYVGE+ID++ C R+ + TNFY+ + +D +IDA KGN SR++NH
Sbjct: 1175 IKKGEFVNEYVGELIDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNH 1234
Query: 194 SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
SC PN E QKW ++G+ R+G+FA DI G LT++Y + L
Sbjct: 1235 SCNPNCETQKWTVNGDVRVGLFALCDIPAGMELTFNYNLDCL 1276
>gi|348574862|ref|XP_003473209.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Cavia
porcellus]
Length = 2509
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1708 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1767
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1768 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1827
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 1828 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1876
>gi|395543169|ref|XP_003773493.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Sarcophilus harrisii]
Length = 1464
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 79/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 988 KPPPYKHIKVNKPCGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMYECHPQV 1046
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DIK+GEFV EYVGE+ID++ C R
Sbjct: 1047 CPAGEQCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIKKGEFVNEYVGELIDEEECMAR 1106
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1107 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1166
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1167 VCDIPAGTELTFNYNLDCL 1185
>gi|355750457|gb|EHH54795.1| hypothetical protein EGM_15701 [Macaca fascicularis]
Length = 2695
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1894 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1953
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1954 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2013
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 2014 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2062
>gi|380815580|gb|AFE79664.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Macaca mulatta]
gi|383420749|gb|AFH33588.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Macaca mulatta]
Length = 2695
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1894 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1953
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1954 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2013
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 2014 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2062
>gi|73953273|ref|XP_865778.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 5 [Canis lupus familiaris]
Length = 2698
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1896 CNCKATDDNPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 1955
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1956 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2015
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 2016 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2064
>gi|19923586|ref|NP_071900.2| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Homo sapiens]
gi|32469769|sp|Q96L73.1|NSD1_HUMAN RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific; AltName: Full=Androgen receptor
coactivator 267 kDa protein; AltName: Full=Androgen
receptor-associated protein of 267 kDa; AltName:
Full=H3-K36-HMTase; AltName: Full=H4-K20-HMTase; AltName:
Full=Lysine N-methyltransferase 3B; AltName: Full=Nuclear
receptor-binding SET domain-containing protein 1;
Short=NR-binding SET domain-containing protein
gi|17530097|gb|AAL40694.1|AF395588_1 putative nuclear protein NSD1 [Homo sapiens]
gi|16751269|gb|AAL06645.1| androgen receptor associated coregulator 267-b [Homo sapiens]
gi|119605438|gb|EAW85032.1| nuclear receptor binding SET domain protein 1, isoform CRA_b [Homo
sapiens]
Length = 2696
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1895 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1954
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1955 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2014
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 2015 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2063
>gi|355691890|gb|EHH27075.1| hypothetical protein EGK_17188 [Macaca mulatta]
Length = 2695
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1894 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1953
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1954 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2013
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 2014 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2062
>gi|410216828|gb|JAA05633.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410260118|gb|JAA18025.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2428
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1627 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1686
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1687 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1746
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 1747 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1795
>gi|441595720|ref|XP_004087266.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific [Nomascus
leucogenys]
Length = 2697
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1896 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1955
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1956 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2015
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 2016 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2064
>gi|410949106|ref|XP_003981265.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Felis catus]
Length = 2432
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1630 CNCKATDDNPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 1689
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1690 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1749
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 1750 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1798
>gi|402873563|ref|XP_003900641.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Papio anubis]
Length = 2343
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1539 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1598
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1599 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1658
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 1659 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1707
>gi|296193510|ref|XP_002806650.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific [Callithrix
jacchus]
Length = 2692
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1895 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1954
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1955 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2014
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 2015 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2063
>gi|410303854|gb|JAA30527.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410341931|gb|JAA39912.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2428
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1627 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1686
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1687 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1746
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 1747 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1795
>gi|114603589|ref|XP_527132.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 8 [Pan troglodytes]
gi|397470588|ref|XP_003806901.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Pan paniscus]
gi|410303856|gb|JAA30528.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410341933|gb|JAA39913.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2697
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1896 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1955
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1956 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2015
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 2016 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2064
>gi|51849607|dbj|BAD42330.1| hypothetical protein [Nannochloris bacillaris]
Length = 334
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 71/198 (35%), Positives = 107/198 (54%), Gaps = 4/198 (2%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGV---CDRDCHCGMLLSSCSSG-CKC 99
+ I +NIY+ + K+ ED+ + C C G+ C +C ML C + C C
Sbjct: 61 WQLIAKNIYMHRERKQLDEDEVMICQCKPIWGTDTTTIGCGENCLNRMLNIECVAKYCPC 120
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
G C N+ F R K+++ + G G+ A ED+K G+F++EYVGEV++++ R
Sbjct: 121 GERCTNRGFSKRAYAKLEIRRAGAKGFGLFAAEDVKAGQFIVEYVGEVLEEEEYARRKEF 180
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
G+ ++Y + VIDA +G R+INHSC PN E QKW++ GE IG+FA D
Sbjct: 181 YIATGQRHYYFMNVGNGEVIDAARRGGLGRFINHSCEPNCETQKWVVRGELAIGLFALED 240
Query: 220 IKKGENLTYDYQYEFLHD 237
+ G LT+DY +E D
Sbjct: 241 VPAGSVLTFDYNFERYGD 258
>gi|301785552|ref|XP_002928188.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Ailuropoda melanoleuca]
gi|281342107|gb|EFB17691.1| hypothetical protein PANDA_018107 [Ailuropoda melanoleuca]
Length = 2699
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1899 CNCKATDDNPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 1958
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1959 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2018
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 2019 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2067
>gi|410216830|gb|JAA05634.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410260120|gb|JAA18026.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2697
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1896 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1955
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1956 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2015
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 2016 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2064
>gi|351708443|gb|EHB11362.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Heterocephalus glaber]
Length = 2698
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1895 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1954
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1955 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2014
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 2015 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2063
>gi|148700883|gb|EDL32830.1| mCG14519 [Mus musculus]
Length = 1381
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 80/222 (36%), Positives = 118/222 (53%), Gaps = 6/222 (2%)
Query: 17 FNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS 76
F +L Q + E+ KP PY IK N + K ++ ++ D C PG
Sbjct: 993 FQELKAQRESKEALEMERTSRKPP--PYKHIKANKVIGK-VQVQVADLSEIPRCNCKPGD 1049
Query: 77 SGVCDRDCHCGMLLSS--CSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADED 133
C + C +S C C G+ C N+ F R ++++TE+ G G+
Sbjct: 1050 ENPCGLESQCLNRMSQYECHPQVCPAGDRCQNQCFTKRLYPDAEVIKTERRGWGLRTKRS 1109
Query: 134 IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
IK+GEFV EYVGE+ID++ C R+ + TNFY+ + +D +IDA KGN SR++NH
Sbjct: 1110 IKKGEFVNEYVGELIDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNH 1169
Query: 194 SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
SC PN E QKW ++G+ R+G+FA DI G LT++Y + L
Sbjct: 1170 SCNPNCETQKWTVNGDVRVGLFALCDIPAGMELTFNYNLDCL 1211
>gi|403287002|ref|XP_003934751.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Saimiri
boliviensis boliviensis]
Length = 1368
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 79/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 987 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1045
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGEVID++ C R
Sbjct: 1046 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGEVIDEEECMAR 1105
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1106 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184
>gi|380815578|gb|AFE79663.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Macaca mulatta]
gi|383420747|gb|AFH33587.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Macaca mulatta]
Length = 2426
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1625 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1684
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1685 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1744
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 1745 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1793
>gi|403294386|ref|XP_003938171.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Saimiri
boliviensis boliviensis]
Length = 1438
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 76/205 (37%), Positives = 111/205 (54%), Gaps = 4/205 (1%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1070 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1128
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1129 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1188
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248
Query: 217 TRDIKKGENLTYDYQYEFLHDSLIA 241
DI G LT++Y + L + A
Sbjct: 1249 LCDIPAGMELTFNYNLDCLGNGRTA 1273
>gi|296197020|ref|XP_002746091.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Callithrix jacchus]
Length = 1365
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 79/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 987 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1045
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGEVID++ C R
Sbjct: 1046 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGEVIDEEECMAR 1105
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1106 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184
>gi|403290056|ref|XP_003936149.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Saimiri boliviensis boliviensis]
Length = 2697
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1896 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1955
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1956 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2015
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 2016 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2064
>gi|390461098|ref|XP_003732596.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 2
[Callithrix jacchus]
Length = 1400
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 79/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 1022 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1080
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGEVID++ C R
Sbjct: 1081 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGEVIDEEECMAR 1140
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1141 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1200
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1201 VCDIPAGTELTFNYNLDCL 1219
>gi|187956219|gb|AAI50629.1| Nuclear receptor binding SET domain protein 1 [Homo sapiens]
Length = 2427
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1626 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1685
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1686 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1745
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 1746 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1794
>gi|27477095|ref|NP_758859.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Homo sapiens]
gi|16755530|gb|AAL27991.1|AF380302_1 androgen receptor-associated coregulator 267-a [Homo sapiens]
gi|119605437|gb|EAW85031.1| nuclear receptor binding SET domain protein 1, isoform CRA_a [Homo
sapiens]
Length = 2427
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1626 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1685
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1686 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1745
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 1746 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1794
>gi|119605439|gb|EAW85033.1| nuclear receptor binding SET domain protein 1, isoform CRA_c [Homo
sapiens]
Length = 2593
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1792 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1851
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1852 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1911
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 1912 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1960
>gi|126332220|ref|XP_001374612.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2
[Monodelphis domestica]
Length = 1366
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 79/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 988 KPPPYKHIKVNKPCGK-VQIYTADVSEIPKCNCKPTDENPCGFDSECLNRMLMYECHPQV 1046
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DIK+GEFV EYVGE+ID++ C R
Sbjct: 1047 CPAGEQCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIKKGEFVNEYVGELIDEEECMAR 1106
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1107 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1166
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1167 VCDIPAGTELTFNYNLDCL 1185
>gi|356535766|ref|XP_003536414.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Glycine
max]
gi|34529091|dbj|BAC85636.1| unnamed protein product [Homo sapiens]
Length = 480
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 77/194 (39%), Positives = 109/194 (56%), Gaps = 2/194 (1%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCT-ASPGSSGVCDRDCHCGMLLSSCSSG-CKCGN 101
Y+ I +N + +R K++ E+D C C + C C + + C+ G C C
Sbjct: 11 YIHINQNEFCIRRHKKQKEEDIAICECKYDADDPDNACGDSCLNVLTSTECTPGYCHCDI 70
Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
C N+ FQ K KL +TE G G++ADEDIK G+FVIEY GEVI + + R +
Sbjct: 71 LCKNQKFQKCEYAKTKLFKTEGRGWGLLADEDIKAGQFVIEYCGEVISWKEAKRRSQAYE 130
Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
+ G + ++ +N IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA DI
Sbjct: 131 NQGLKDAFIIFLNVSESIDATRKGSLARFINHSCQPNCETRKWNVLGEIRVGIFAKHDIP 190
Query: 222 KGENLTYDYQYEFL 235
G L YDY +E+
Sbjct: 191 IGTELAYDYNFEWF 204
>gi|395861196|ref|XP_003802879.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Otolemur garnettii]
Length = 2410
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1611 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1670
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1671 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1730
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 1731 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1779
>gi|340718068|ref|XP_003397494.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Bombus
terrestris]
Length = 1238
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 75/166 (45%), Positives = 98/166 (59%), Gaps = 1/166 (0%)
Query: 66 IFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKC 124
+ C C A + DC +LL CS G C G C N+ F R M+ T
Sbjct: 794 VACDCDAEWENPCAPGTDCLNRILLVECSPGICPAGAKCKNQAFVQRQYPAMEPFHTVGR 853
Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
G G+ + E IK G+FVIEYVGEVID+ + RL + K L NFY I+ + +IDA K
Sbjct: 854 GWGLRSLEHIKAGQFVIEYVGEVIDEAEYKRRLHRKKELKNENFYFLTIDNNRMIDAEPK 913
Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
GN SR++NHSC PN E QKW ++G+TRIG+FA DI+ GE LT++Y
Sbjct: 914 GNLSRFMNHSCSPNCETQKWTVNGDTRIGLFALCDIECGEELTFNY 959
>gi|326438002|gb|EGD83572.1| hypothetical protein PTSG_04178 [Salpingoeca sp. ATCC 50818]
Length = 1398
Score = 137 bits (345), Expect = 4e-30, Method: Composition-based stats.
Identities = 74/238 (31%), Positives = 117/238 (49%), Gaps = 16/238 (6%)
Query: 21 LKQIGNPVEFELPDWFIKPK-----------AIPYVFIKRNIYLTKRIKR---RLEDDGI 66
L + P F LP +++ P+ I+ N+++ K ++ + +
Sbjct: 274 LNLLCQPEPFRLPASYVQASRQGTLAITEEACAPFRKIRVNVFVDKETEQFMAKHRQEVS 333
Query: 67 FCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRP-VKKMKLVQTEKCG 125
C C P C DC ++ S C C CG C NK FQ R +K T G
Sbjct: 334 RCECIPPPDGGPGCTHDCLNRLMYSECGPDCPCGRQCTNKRFQQRAWCTAIKRAPTPGKG 393
Query: 126 AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKG 185
G+ A I +G FVIEY GE++ +R + + +F+ ++R +VIDA G
Sbjct: 394 YGVFATAYIPKGTFVIEYTGEIMTSSAFTQRANTL-YRARKHFHCLNLDRGLVIDAGQAG 452
Query: 186 NKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
+++R+INHSC PN ++KW ++G R G+FA+RDIK E L+YDY + ++ L+ C
Sbjct: 453 SEARFINHSCDPNCHIEKWNVNGHWRAGVFASRDIKGDEELSYDYNFHNFNEKLVCRC 510
>gi|149726051|ref|XP_001502479.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Equus caballus]
Length = 2700
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1898 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 1957
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1958 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2017
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 2018 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2066
>gi|417407050|gb|JAA50158.1| Putative histone-lysine n-methyltransferase h3 lysine-36 and h4
lysine-20 specific [Desmodus rotundus]
Length = 2699
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1898 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 1957
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1958 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2017
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 2018 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2066
>gi|410958014|ref|XP_003985618.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Felis
catus]
Length = 1300
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 81/202 (40%), Positives = 113/202 (55%), Gaps = 10/202 (4%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 922 KPPPYKHIKVNKPYGK-VQIHTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 980
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 981 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1040
Query: 157 LWKMKHLGE---TNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIG 213
+ KH E T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G
Sbjct: 1041 I---KHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVG 1097
Query: 214 IFATRDIKKGENLTYDYQYEFL 235
+FA DI G LT++Y + L
Sbjct: 1098 LFAVCDIPAGTELTFNYNLDCL 1119
>gi|350580826|ref|XP_003123715.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Sus scrofa]
Length = 2392
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1590 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 1649
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1650 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1709
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 1710 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1758
>gi|16549858|dbj|BAB70868.1| unnamed protein product [Homo sapiens]
Length = 1059
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 858 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 917
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 918 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 977
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 978 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1026
>gi|291387890|ref|XP_002710469.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 2
[Oryctolagus cuniculus]
Length = 2431
Score = 137 bits (345), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1629 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 1688
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1689 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1748
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 1749 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1797
>gi|432099958|gb|ELK28852.1| Histone-lysine N-methyltransferase NSD3 [Myotis davidii]
Length = 1641
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1273 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1331
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1332 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1391
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1392 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1451
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1452 LCDIPAGMELTFNYNLDCL 1470
>gi|389585110|dbj|GAB67841.1| hypothetical protein PCYB_124070 [Plasmodium cynomolgi strain B]
Length = 2974
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 79/216 (36%), Positives = 120/216 (55%), Gaps = 21/216 (9%)
Query: 42 IPYVFIKRNIYLTKRIKRRL--EDDGIFCSCTASPGSSGVCDR-DCHCGMLLSSCSSGCK 98
+ + +I +N+YL + K L + D C C G CD C+ + CS +
Sbjct: 2473 LNFEYISKNVYLNDQNKNLLACKSDDYRCLC------QGECDPFSCYNSLSKIQCSRN-R 2525
Query: 99 CG-------NSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQ 151
C C N+PF++ +K++++ QTE+ G G+ DIK GE + EYVGEV+ +
Sbjct: 2526 CNLPIQVQDKKCFNRPFKHSAIKELEIKQTERTGFGVFCKRDIKNGELICEYVGEVLGKK 2585
Query: 152 TCEERLWKMKHLGET----NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIID 207
EER+ + + N+Y +INRD+ ID+ KG+ SR++NHSC PN+ QKWI+
Sbjct: 2586 EFEERMEAYQEESKKTDMYNWYSIQINRDVHIDSRRKGSISRFVNHSCSPNSVSQKWIVR 2645
Query: 208 GETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
G RIGIFA +DI GE +TY+Y Y F+ ++ C
Sbjct: 2646 GFYRIGIFAQQDIPAGEEITYNYSYNFVFNNFECLC 2681
>gi|426359420|ref|XP_004046973.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Gorilla gorilla
gorilla]
Length = 1397
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1029 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1087
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1088 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1147
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1148 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1207
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1208 LCDIPAGMELTFNYNLDCL 1226
>gi|92098122|gb|AAI15007.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Homo sapiens]
Length = 1437
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1069 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1127
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1128 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1187
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1188 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1247
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1248 LCDIPAGMELTFNYNLDCL 1266
>gi|157821603|ref|NP_001099560.1| histone-lysine N-methyltransferase NSD3 [Rattus norvegicus]
gi|149057818|gb|EDM09061.1| Wolf-Hirschhorn syndrome candidate 1-like 1 (predicted) [Rattus
norvegicus]
Length = 1396
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 80/222 (36%), Positives = 118/222 (53%), Gaps = 6/222 (2%)
Query: 17 FNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS 76
F +L Q + E+ KP PY IK N + K ++ ++ D C PG
Sbjct: 1008 FQELKAQRESKEALEIEKTSRKPP--PYKHIKANKVIGK-VQIQVADLSEIPRCNCKPGD 1064
Query: 77 SGVCDRDCHCGMLLSS--CSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADED 133
C + C +S C C G+ C N+ F R ++++TE+ G G+
Sbjct: 1065 ENPCGLESECLNRMSQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRS 1124
Query: 134 IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
IK+GEFV EYVGE+ID++ C R+ + TNFY+ + +D +IDA KGN SR++NH
Sbjct: 1125 IKKGEFVNEYVGELIDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNH 1184
Query: 194 SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
SC PN E QKW ++G+ R+G+FA DI G LT++Y + L
Sbjct: 1185 SCNPNCETQKWTVNGDVRVGLFALCDIPAGMELTFNYNLDCL 1226
>gi|355708046|gb|AES03147.1| nuclear receptor binding SET domain protein 1 [Mustela putorius
furo]
Length = 261
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 71 CNCKATDDNPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 130
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 131 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 190
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 191 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 239
>gi|344265319|ref|XP_003404732.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Loxodonta africana]
Length = 2702
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1899 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFTKRQYPEVEIFRTLQRGW 1958
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1959 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2018
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 2019 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2067
>gi|12697312|emb|CAC28350.1| putative chromatin modulator [Homo sapiens]
Length = 1437
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1069 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1127
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1128 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1187
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1188 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1247
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1248 LCDIPAGMELTFNYNLDCL 1266
>gi|380814048|gb|AFE78898.1| histone-lysine N-methyltransferase NSD3 isoform long [Macaca mulatta]
gi|383419477|gb|AFH32952.1| histone-lysine N-methyltransferase NSD3 isoform long [Macaca mulatta]
Length = 1438
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1070 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1128
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1129 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1188
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1249 LCDIPAGMELTFNYNLDCL 1267
>gi|395739588|ref|XP_002819060.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Pongo
abelii]
Length = 1438
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1070 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1128
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1129 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1188
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1249 LCDIPAGMELTFNYNLDCL 1267
>gi|12697314|emb|CAC28351.1| Putative Chromatin modulator [Homo sapiens]
Length = 1388
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1020 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1078
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1079 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1138
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1139 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1198
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1199 LCDIPAGMELTFNYNLDCL 1217
>gi|431902251|gb|ELK08752.1| Histone-lysine N-methyltransferase NSD3 [Pteropus alecto]
Length = 1322
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1029 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1087
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1088 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1147
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1148 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1207
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1208 LCDIPAGMELTFNYNLDCL 1226
>gi|359493197|ref|XP_003634539.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Vitis
vinifera]
Length = 413
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 79/195 (40%), Positives = 106/195 (54%), Gaps = 6/195 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSC---TASPGSSGVCDRDCHCGMLLSSCSSG-CKC 99
Y+ I +N + + ++ EDD C C T P S+ C C + C+ C C
Sbjct: 41 YIHINKNDFSYREHIKQEEDDITICECKYNTNDPDSA--CGERCLNVLTSIECTPHYCPC 98
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
C N+ FQ K KL +TE G G++A+EDIK G F+IEY GEVI ER
Sbjct: 99 SVHCKNQRFQKHEYAKTKLFRTEGRGWGLLANEDIKAGRFIIEYCGEVISWNEARERSLA 158
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
G + Y+ +N IDAT G+++R+INHSC PN E +KW + GE RIGIFA RD
Sbjct: 159 YASQGINDAYIISLNARECIDATKSGSQARFINHSCEPNCETRKWSVLGEVRIGIFAMRD 218
Query: 220 IKKGENLTYDYQYEF 234
I G LTYDY +++
Sbjct: 219 ISIGTELTYDYNFQW 233
>gi|13699811|ref|NP_075447.1| histone-lysine N-methyltransferase NSD3 isoform long [Homo sapiens]
gi|74761342|sp|Q9BZ95.1|NSD3_HUMAN RecName: Full=Histone-lysine N-methyltransferase NSD3; AltName:
Full=Nuclear SET domain-containing protein 3; AltName:
Full=Protein whistle; AltName: Full=WHSC1-like 1 isoform
9 with methyltransferase activity to lysine; AltName:
Full=Wolf-Hirschhorn syndrome candidate 1-like protein 1;
Short=WHSC1-like protein 1
gi|12642817|gb|AAK00355.1| putative protein WHSC1L1l [Homo sapiens]
gi|75517229|gb|AAI01718.1| WHSC1L1 protein, isoform long [Homo sapiens]
gi|109731704|gb|AAI13470.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Homo sapiens]
gi|119583726|gb|EAW63322.1| Wolf-Hirschhorn syndrome candidate 1-like 1, isoform CRA_c [Homo
sapiens]
gi|168277388|dbj|BAG10672.1| histone-lysine N-methyltransferase NSD3 [synthetic construct]
Length = 1437
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1069 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1127
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1128 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1187
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1188 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1247
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1248 LCDIPAGMELTFNYNLDCL 1266
>gi|397521377|ref|XP_003830773.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 3 [Pan
paniscus]
Length = 1388
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1020 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1078
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1079 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1138
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1139 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1198
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1199 LCDIPAGMELTFNYNLDCL 1217
>gi|114619717|ref|XP_001170799.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 4 [Pan
troglodytes]
Length = 1388
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1020 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1078
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1079 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1138
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1139 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1198
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1199 LCDIPAGMELTFNYNLDCL 1217
>gi|348520760|ref|XP_003447895.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Oreochromis niloticus]
Length = 1167
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 77/199 (38%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY FIK N + K ++ D C P C + +C ML C
Sbjct: 779 KPPPYKFIKVNKPVGK-VQVYTADISEIPKCNCKPSGERPCGFESECLNRMLQYECHPQV 837
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T G G++ DIK+GEFV EY+GE+ID++ C R
Sbjct: 838 CPSGERCCNQDFTKRLYPETKIIKTPGKGWGLICLRDIKKGEFVNEYIGELIDEEECRAR 897
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E QKW ++G+TR+G+FA
Sbjct: 898 IKYAHENNITDFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFA 957
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 958 VCDIPAGTELTFNYNLDCL 976
>gi|296222023|ref|XP_002757009.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 3
[Callithrix jacchus]
Length = 1389
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1021 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1079
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1080 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1139
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1140 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1199
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1200 LCDIPAGMELTFNYNLDCL 1218
>gi|410307858|gb|JAA32529.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
Length = 1437
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1069 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1127
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1128 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1187
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1188 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1247
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1248 LCDIPAGMELTFNYNLDCL 1266
>gi|402878017|ref|XP_003902703.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Papio anubis]
Length = 1438
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1070 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1128
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1129 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1188
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1249 LCDIPAGMELTFNYNLDCL 1267
>gi|397521373|ref|XP_003830771.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Pan
paniscus]
Length = 1437
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1069 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1127
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1128 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1187
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1188 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1247
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1248 LCDIPAGMELTFNYNLDCL 1266
>gi|296222021|ref|XP_002757008.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
[Callithrix jacchus]
Length = 1438
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1070 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1128
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1129 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1188
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1249 LCDIPAGMELTFNYNLDCL 1267
>gi|224050217|ref|XP_002195834.1| PREDICTED: histone-lysine N-methyltransferase NSD2 [Taeniopygia
guttata]
Length = 1339
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 79/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 991 KPPPYKHIKVNKPCGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMYECHPQV 1049
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DIK+GEFV EYVGE+ID++ C R
Sbjct: 1050 CPAGERCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIKKGEFVNEYVGELIDEEECMAR 1109
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1110 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1169
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1170 VCDIPAGTELTFNYNLDCL 1188
>gi|114619713|ref|XP_519713.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 6 [Pan
troglodytes]
gi|410226716|gb|JAA10577.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
gi|410266866|gb|JAA21399.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
gi|410353855|gb|JAA43531.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
Length = 1437
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1069 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1127
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1128 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1187
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1188 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1247
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1248 LCDIPAGMELTFNYNLDCL 1266
>gi|327284319|ref|XP_003226886.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Anolis
carolinensis]
Length = 1438
Score = 137 bits (344), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 74/199 (37%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1070 KPPPYKHIKSNKVVGK-VQIQVADLSEIPRCNCKPSDENPCGLESECLNRMLQYECHPQV 1128
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + ++++T++ G G+ +IK+GEFV EYVGE+ID++ C R
Sbjct: 1129 CPAGEGCQNQCFTKRLYPEAEIIKTDRRGWGLRTKRNIKKGEFVNEYVGELIDEEECRLR 1188
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCHPNCETQKWTVNGDVRVGLFA 1248
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1249 VCDIPAGMELTFNYNLDCL 1267
>gi|149634094|ref|XP_001506476.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Ornithorhynchus anatinus]
Length = 1437
Score = 137 bits (344), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 81/222 (36%), Positives = 116/222 (52%), Gaps = 6/222 (2%)
Query: 17 FNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS 76
F +L Q + EL KP PY IK N + K ++ ++ D C P
Sbjct: 1048 FQELKAQRESKEALELERNSRKPP--PYKHIKANKVIGK-VQIQVADLSEIPRCNCKPAD 1104
Query: 77 SGVCDRDCHC--GMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADED 133
C + C ML C C G C N+ F R ++++TE+ G G+
Sbjct: 1105 ENPCGLESECLNRMLQYECHPQVCPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRS 1164
Query: 134 IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
IK+GEFV EYVGE+ID++ C R+ + TNFY+ + +D +IDA KGN SR++NH
Sbjct: 1165 IKKGEFVNEYVGELIDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNH 1224
Query: 194 SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
SC PN E QKW ++G+ R+G+FA DI G LT++Y + L
Sbjct: 1225 SCNPNCETQKWTVNGDVRVGLFALCDIPAGMELTFNYNLDCL 1266
>gi|344281620|ref|XP_003412576.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Loxodonta africana]
Length = 1438
Score = 137 bits (344), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1070 KPPPYKHIKANKVVGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1128
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1129 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1188
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1249 LCDIPAGMELTFNYNLDCL 1267
>gi|291409090|ref|XP_002720827.1| PREDICTED: WHSC1L1 protein [Oryctolagus cuniculus]
Length = 1435
Score = 136 bits (343), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1067 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESQCLNRMLQYECHPQV 1125
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1126 CPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1185
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1186 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1245
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1246 LCDIPAGMELTFNYNLDCL 1264
>gi|348527922|ref|XP_003451468.1| PREDICTED: hypothetical protein LOC100692734 [Oreochromis niloticus]
Length = 2421
Score = 136 bits (343), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 77/198 (38%), Positives = 112/198 (56%), Gaps = 2/198 (1%)
Query: 40 KAIPYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-C 97
K PY IK N + K +I + C+C + S D +C MLL C C
Sbjct: 1924 KPPPYKHIKVNRPIGKVQIFTADLSEIPRCNCKKTDDSPCGMDSECINRMLLYECHPQVC 1983
Query: 98 KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
CLN+ F R ++++ +T G G+ DIK+G+FV EYVGEVID++ C R+
Sbjct: 1984 PAAERCLNQAFTKRQYSQVEIFRTLSRGWGLRCVHDIKKGQFVSEYVGEVIDEEECRSRI 2043
Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
+ NFY+ +++D +IDA KGN++R++NHSC PN E QKW ++G+TR+G+FA
Sbjct: 2044 RHAQDNNICNFYMLTLDKDRIIDAGPKGNEARFMNHSCQPNCETQKWTVNGDTRVGLFAL 2103
Query: 218 RDIKKGENLTYDYQYEFL 235
DI G LT++Y E L
Sbjct: 2104 IDIAAGTELTFNYNLECL 2121
>gi|410914796|ref|XP_003970873.1| PREDICTED: uncharacterized protein LOC101068764 [Takifugu rubripes]
Length = 2363
Score = 136 bits (343), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 77/198 (38%), Positives = 112/198 (56%), Gaps = 2/198 (1%)
Query: 40 KAIPYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-C 97
K PY IK N + K +I + C+C A+ S + +C MLL C C
Sbjct: 1872 KPPPYKHIKVNRPIGKVQIFTADLSEIPRCNCKATDESPCGMESECINRMLLYECHPQVC 1931
Query: 98 KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
G CLN+ F R ++++ +T G G+ DIK+G+FV EYVGEVID++ C R+
Sbjct: 1932 PAGERCLNQAFTKRQYSQVEIFRTLSRGWGLRCVHDIKKGQFVSEYVGEVIDEEECRSRI 1991
Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
+ NFY+ +++D +IDA KGN++R++NH C PN E QKW + G+TR+G+FA
Sbjct: 1992 RHAQENDICNFYMLTLDKDRIIDAGPKGNEARFMNHCCQPNCETQKWTVSGDTRVGLFAL 2051
Query: 218 RDIKKGENLTYDYQYEFL 235
DI G LT++Y E L
Sbjct: 2052 VDIPAGTELTFNYNLECL 2069
>gi|347972366|ref|XP_316738.5| AGAP004656-PA [Anopheles gambiae str. PEST]
gi|333469400|gb|EAA11974.5| AGAP004656-PA [Anopheles gambiae str. PEST]
Length = 1259
Score = 136 bits (343), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 86/206 (41%), Positives = 114/206 (55%), Gaps = 13/206 (6%)
Query: 40 KAIP-YVFIKRNIYLTK----RIKRRLEDDGIFCS-CTASPGSSGVCDRDCHC---GMLL 90
KA+P Y IK N Y+ + R+L DG+ S C P C C +++
Sbjct: 941 KAVPVYKKIKTNRYIPPLKPPSVNRQL--DGVEDSVCRCQPSDDDPCGPTSACLNRAIMM 998
Query: 91 SSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDD 150
S C SC N+ F R +++ G G+VA ED+K G+FVIEYVGEVI+
Sbjct: 999 ECSSKTCPAKESCSNQRFTKRIYPALEVRFFSDKGFGLVALEDLKSGQFVIEYVGEVINS 1058
Query: 151 QTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGET 210
+ + R+ M+ ETN+Y + D+ IDA KGN SR+INHSC PN E QKW I GET
Sbjct: 1059 EEFDRRVMMMQAAKETNYYFLTVEPDLTIDAGPKGNVSRFINHSCEPNCETQKWTI-GET 1117
Query: 211 R-IGIFATRDIKKGENLTYDYQYEFL 235
R IG+FA +DI GE LT++Y E L
Sbjct: 1118 RVIGLFAIKDINAGEELTFNYNLESL 1143
>gi|295424164|ref|NP_001074571.2| histone-lysine N-methyltransferase NSD2 isoform 1 [Mus musculus]
Length = 1366
Score = 136 bits (343), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N K ++ D C P C D C ML+ C
Sbjct: 988 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGSDSECLNRMLMFECHPQV 1046
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 1047 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1106
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1107 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1166
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1167 VCDIPAGTELTFNYNLDCL 1185
>gi|148705490|gb|EDL37437.1| mCG16344 [Mus musculus]
Length = 1298
Score = 136 bits (343), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N K ++ D C P C D C ML+ C
Sbjct: 920 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGSDSECLNRMLMFECHPQV 978
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 979 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1038
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1039 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1098
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1099 VCDIPAGTELTFNYNLDCL 1117
>gi|344244292|gb|EGW00396.1| putative histone-lysine N-methyltransferase NSD2 [Cricetulus griseus]
Length = 1344
Score = 136 bits (343), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N K ++ D C P C D C ML+ C
Sbjct: 966 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGSDSECLNRMLMFECHPQV 1024
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 1025 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1084
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1085 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1144
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1145 VCDIPAGTELTFNYNLDCL 1163
>gi|351699801|gb|EHB02720.1| Histone-lysine N-methyltransferase NSD3 [Heterocephalus glaber]
Length = 1439
Score = 136 bits (343), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ + D C P C + C ML C
Sbjct: 1071 KPPPYKHIKANKVIGK-VQIHVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1129
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1130 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1189
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1190 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1249
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1250 LCDIPAGMELTFNYNLDCL 1268
>gi|350593412|ref|XP_003483678.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
NSD3-like [Sus scrofa]
Length = 1438
Score = 136 bits (343), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1070 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1128
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1129 CPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1188
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1249 LCDIPAGMELTFNYNLDCL 1267
>gi|157822347|ref|NP_001100807.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Rattus norvegicus]
gi|149039888|gb|EDL94004.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
CRA_a [Rattus norvegicus]
Length = 2381
Score = 136 bits (343), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 103/169 (60%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R +++ +T + G
Sbjct: 1585 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPDVEIFRTLQRGW 1644
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1645 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1704
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 1705 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1753
>gi|162318272|gb|AAI56161.1| Wolf-Hirschhorn syndrome candidate 1 (human) [synthetic construct]
gi|162318442|gb|AAI56968.1| Wolf-Hirschhorn syndrome candidate 1 (human) [synthetic construct]
Length = 1346
Score = 136 bits (343), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N K ++ D C P C D C ML+ C
Sbjct: 968 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGSDSECLNRMLMFECHPQV 1026
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 1027 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1086
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1087 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1146
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1147 VCDIPAGTELTFNYNLDCL 1165
>gi|295424166|ref|NP_780440.2| histone-lysine N-methyltransferase NSD2 isoform 2 [Mus musculus]
gi|118572947|sp|Q8BVE8.2|NSD2_MOUSE RecName: Full=Histone-lysine N-methyltransferase NSD2; AltName:
Full=Multiple myeloma SET domain-containing protein;
Short=MMSET; AltName: Full=Nuclear SET domain-containing
protein 2; Short=NSD2; AltName: Full=Wolf-Hirschhorn
syndrome candidate 1 protein homolog; Short=WHSC1
Length = 1365
Score = 136 bits (343), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N K ++ D C P C D C ML+ C
Sbjct: 987 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGSDSECLNRMLMFECHPQV 1045
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 1046 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1105
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1106 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184
>gi|348554221|ref|XP_003462924.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Cavia
porcellus]
Length = 1438
Score = 136 bits (343), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ + D C P C + C ML C
Sbjct: 1070 KPPPYKHIKANKVIGK-VQIHVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1128
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1129 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1188
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1249 LCDIPAGMELTFNYNLDCL 1267
>gi|297676794|ref|XP_002816309.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 1 [Pongo abelii]
Length = 2697
Score = 136 bits (343), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1896 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1955
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1956 GLRTKIDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2015
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 2016 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2064
>gi|395847335|ref|XP_003796334.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Otolemur garnettii]
Length = 1438
Score = 136 bits (343), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1070 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1128
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1129 CPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1188
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1248
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1249 LCDIPAGMELTFNYNLDCL 1267
>gi|315364634|pdb|3OOI|A Chain A, Crystal Structure Of Human Histone-Lysine
N-Methyltransferase Nsd1 Set Domain In Complex With
S-Adenosyl-L-Methionine
Length = 232
Score = 136 bits (343), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 45 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 104
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 105 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 164
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 165 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 213
>gi|354483938|ref|XP_003504149.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Cricetulus griseus]
Length = 1365
Score = 136 bits (343), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N K ++ D C P C D C ML+ C
Sbjct: 987 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGSDSECLNRMLMFECHPQV 1045
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 1046 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1105
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1106 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184
>gi|426256406|ref|XP_004021831.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2 [Ovis
aries]
Length = 1439
Score = 136 bits (343), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1071 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1129
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1130 CPAGERCQNQCFTKRLYPDAEVIRTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1189
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1190 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1249
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1250 LCDIPAGMELTFNYNLDCL 1268
>gi|395507430|ref|XP_003758027.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
[Sarcophilus harrisii]
Length = 1389
Score = 136 bits (343), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 81/222 (36%), Positives = 117/222 (52%), Gaps = 6/222 (2%)
Query: 17 FNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS 76
F +L Q + EL KP PY IK N + K ++ ++ D C P
Sbjct: 1000 FQELKAQRESKEALELERNSRKPP--PYKHIKANKVIGK-VQIQVADLSEIPRCNCKPAD 1056
Query: 77 SGVCDRDCHC--GMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADED 133
C + C ML C C G+ C N+ F R ++++TE+ G G+
Sbjct: 1057 ENPCGLESECLNRMLQYECHPQVCPAGDRCQNQCFTKRLYPDAEVIKTERRGWGLRTKRS 1116
Query: 134 IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
IK+GEFV EYVGE+ID++ C R+ + TNFY+ + +D +IDA KGN SR++NH
Sbjct: 1117 IKKGEFVNEYVGELIDEEECRLRIKRAHENSITNFYMLTVTKDRIIDAGPKGNYSRFMNH 1176
Query: 194 SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
SC PN E QKW ++G+ R+G+FA DI G LT++Y + L
Sbjct: 1177 SCNPNCETQKWTVNGDVRVGLFALCDIPAGVELTFNYNLDCL 1218
>gi|432952957|ref|XP_004085262.1| PREDICTED: histone-lysine N-methyltransferase NSD2-like, partial
[Oryzias latipes]
Length = 1167
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY FIK N R++ D C P C + +C ML C
Sbjct: 881 KPPPYKFIKVNRPYG-RVQVYTADVSEIPKCNCKPTDERPCGFESECLNRMLQYECHPQV 939
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T G G+VA DIK+G+FV EY+GE+ID++ C R
Sbjct: 940 CPSGERCCNQDFTKRLYPETKIIKTPGKGWGLVALRDIKKGKFVNEYIGELIDEEECRAR 999
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ TNFY+ I++D +IDA KGN SR++NHSC PN E QKW ++G+TR+G+FA
Sbjct: 1000 IKYAHENNITNFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFA 1059
Query: 217 TRDIKKGENLTYDYQYEFL 235
I G LT++Y + L
Sbjct: 1060 VCHIPAGTELTFNYNLDCL 1078
>gi|395736540|ref|XP_003776772.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 2 [Pongo abelii]
Length = 2594
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 104/169 (61%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1793 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1852
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1853 GLRTKIDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1912
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 1913 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1961
>gi|395507428|ref|XP_003758026.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Sarcophilus harrisii]
Length = 1437
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 81/222 (36%), Positives = 117/222 (52%), Gaps = 6/222 (2%)
Query: 17 FNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS 76
F +L Q + EL KP PY IK N + K ++ ++ D C P
Sbjct: 1048 FQELKAQRESKEALELERNSRKPP--PYKHIKANKVIGK-VQIQVADLSEIPRCNCKPAD 1104
Query: 77 SGVCDRDCHC--GMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADED 133
C + C ML C C G+ C N+ F R ++++TE+ G G+
Sbjct: 1105 ENPCGLESECLNRMLQYECHPQVCPAGDRCQNQCFTKRLYPDAEVIKTERRGWGLRTKRS 1164
Query: 134 IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
IK+GEFV EYVGE+ID++ C R+ + TNFY+ + +D +IDA KGN SR++NH
Sbjct: 1165 IKKGEFVNEYVGELIDEEECRLRIKRAHENSITNFYMLTVTKDRIIDAGPKGNYSRFMNH 1224
Query: 194 SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
SC PN E QKW ++G+ R+G+FA DI G LT++Y + L
Sbjct: 1225 SCNPNCETQKWTVNGDVRVGLFALCDIPAGVELTFNYNLDCL 1266
>gi|358419376|ref|XP_003584219.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Bos
taurus]
Length = 1439
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1071 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1129
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1130 CPAGERCQNQCFTKRLYPDAEVIRTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1189
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1190 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1249
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1250 LCDIPAGMELTFNYNLDCL 1268
>gi|440907576|gb|ELR57709.1| Histone-lysine N-methyltransferase NSD3 [Bos grunniens mutus]
Length = 1446
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1080 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1138
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1139 CPAGERCQNQCFTKRLYPDAEVIRTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1198
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1199 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1258
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1259 LCDIPAGMELTFNYNLDCL 1277
>gi|449490008|ref|XP_004176439.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
ASH1L-like [Taeniopygia guttata]
Length = 2968
Score = 136 bits (342), Expect = 8e-30, Method: Composition-based stats.
Identities = 80/227 (35%), Positives = 123/227 (54%), Gaps = 18/227 (7%)
Query: 19 KLLKQIGNPVEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFC 68
K L+Q ++F+LP W + KP Y I+ N+Y+ +K + C
Sbjct: 2041 KYLRQ--KRIDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTC 2096
Query: 69 SCTASPGSSGV-CDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRP-VKKMKLVQTEKCG 125
+C SG C DC M+ + CS + C CG C N+ Q V+ ++ + E+ G
Sbjct: 2097 NCKKPEDDSGKGCVEDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKG 2156
Query: 126 AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKG 185
GI E +K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ G
Sbjct: 2157 WGIRTKEPLKAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMG 2215
Query: 186 NKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
N++R+INHSC PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2216 NEARFINHSCNPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2262
>gi|357443507|ref|XP_003592031.1| hypothetical protein MTR_1g098000 [Medicago truncatula]
gi|355481079|gb|AES62282.1| hypothetical protein MTR_1g098000 [Medicago truncatula]
Length = 479
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 76/194 (39%), Positives = 107/194 (55%), Gaps = 2/194 (1%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS-SGVCDRDCHCGMLLSSCSSG-CKCGN 101
Y+ I +N + +R K++ E+D C C C C + + C+ G C C
Sbjct: 12 YIHINQNDFFMRRHKKQKEEDIAICECRYDEDDPDSACGDGCLNVLTSTECTPGFCPCDI 71
Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
C N+ FQ K KL +TE G G++ADE+IK G+FVIEY GEVI + + R +
Sbjct: 72 HCKNQKFQKCEYAKTKLFKTEGRGWGLLADEEIKAGQFVIEYCGEVISCKEAKRRSHTYE 131
Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
G + ++ +N IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA DI
Sbjct: 132 IQGLKDAFIISLNASESIDATRKGSLARFINHSCQPNCETRKWNVMGEIRVGIFALEDIP 191
Query: 222 KGENLTYDYQYEFL 235
G L YDY +E+
Sbjct: 192 IGTELAYDYNFEWF 205
>gi|300796853|ref|NP_001178481.1| probable histone-lysine N-methyltransferase NSD2 [Rattus norvegicus]
Length = 1346
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N K ++ D C P C D C ML+ C
Sbjct: 968 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGSDSECLNRMLMFECHPQV 1026
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 1027 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1086
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1087 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1146
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1147 VCDIPAGTELTFNYNLDCL 1165
>gi|149047443|gb|EDM00113.1| similar to Wolf-Hirschhorn syndrome candidate 1 protein isoform 3
(predicted) [Rattus norvegicus]
Length = 1298
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N K ++ D C P C D C ML+ C
Sbjct: 920 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGSDSECLNRMLMFECHPQV 978
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 979 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1038
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1039 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1098
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1099 VCDIPAGTELTFNYNLDCL 1117
>gi|395847337|ref|XP_003796335.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
[Otolemur garnettii]
Length = 1389
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1021 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1079
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1080 CPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1139
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1140 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1199
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1200 LCDIPAGMELTFNYNLDCL 1218
>gi|355557406|gb|EHH14186.1| Putative histone-lysine N-methyltransferase NSD2 [Macaca mulatta]
Length = 1365
Score = 136 bits (342), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 987 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1045
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 1046 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1105
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1106 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184
>gi|224080887|ref|XP_002197925.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Taeniopygia
guttata]
Length = 1435
Score = 136 bits (342), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1067 KPPPYKHIKSNKVIGK-VQIQVADLSEIPRCNCKPSDENPCGLESECLNRMLQYECHPQV 1125
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1126 CPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1185
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1186 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1245
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1246 LCDIPAGMELTFNYNLDCL 1264
>gi|449270866|gb|EMC81514.1| Histone-lysine N-methyltransferase NSD3 [Columba livia]
Length = 1440
Score = 136 bits (342), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1072 KPPPYKHIKSNKVIGK-VQIQVADLSEIPRCNCKPSDENPCGLESECLNRMLQYECHPQV 1130
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1131 CPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1190
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1191 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1250
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1251 LCDIPAGMELTFNYNLDCL 1269
>gi|383421363|gb|AFH33895.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
mulatta]
gi|384949270|gb|AFI38240.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
mulatta]
gi|387540940|gb|AFJ71097.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
mulatta]
Length = 1365
Score = 136 bits (342), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 987 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1045
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 1046 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1105
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1106 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184
>gi|355744804|gb|EHH49429.1| Putative histone-lysine N-methyltransferase NSD2 [Macaca
fascicularis]
Length = 1365
Score = 136 bits (342), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 987 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1045
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 1046 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1105
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1106 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184
>gi|170050731|ref|XP_001861443.1| set domain protein [Culex quinquefasciatus]
gi|167872245|gb|EDS35628.1| set domain protein [Culex quinquefasciatus]
Length = 1181
Score = 136 bits (342), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 76/210 (36%), Positives = 112/210 (53%), Gaps = 10/210 (4%)
Query: 34 DWFIKPKAIPYVFIKRNIY---LTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC---G 87
D KP YV IK N Y L R R E++ C C P + C D +C
Sbjct: 869 DLSFKPPM--YVKIKSNKYVAPLRGRNAARDEEEDSICEC--KPSDTDPCGLDSNCINRA 924
Query: 88 MLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEV 147
+L+ C G+SC N+ F+ + + + G G+VA EDI +G+FVIEYVGEV
Sbjct: 925 LLVECNPKTCPAGDSCQNQCFERKQYPALAAKRIPNKGWGLVAQEDIHQGQFVIEYVGEV 984
Query: 148 IDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIID 207
I+ + R+ + + + N+Y ++ ++ IDA KGN +R+INHSC PN E W +
Sbjct: 985 INGEELARRIKQKQEQKDENYYFLTVDSELTIDAGPKGNLARFINHSCEPNCETLLWKVG 1044
Query: 208 GETRIGIFATRDIKKGENLTYDYQYEFLHD 237
G +G+FA +D+K GE LT++Y +E D
Sbjct: 1045 GSQSVGLFALKDLKAGEELTFNYNFETFGD 1074
>gi|297672976|ref|XP_002814554.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Pongo abelii]
Length = 1365
Score = 136 bits (342), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 987 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1045
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 1046 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1105
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1106 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184
>gi|426343599|ref|XP_004038381.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Gorilla gorilla gorilla]
Length = 1365
Score = 136 bits (342), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 987 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1045
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 1046 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1105
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1106 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184
>gi|19913348|ref|NP_579877.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|19913350|ref|NP_579878.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|19913358|ref|NP_579890.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|109633019|ref|NP_001035889.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|74706096|sp|O96028.1|NSD2_HUMAN RecName: Full=Histone-lysine N-methyltransferase NSD2; AltName:
Full=Multiple myeloma SET domain-containing protein;
Short=MMSET; AltName: Full=Nuclear SET domain-containing
protein 2; Short=NSD2; AltName: Full=Protein trithorax-5;
AltName: Full=Wolf-Hirschhorn syndrome candidate 1
protein; Short=WHSC1
gi|3249713|gb|AAC24150.1| MMSET type II [Homo sapiens]
gi|4378019|gb|AAD19343.1| putative WHSC1 protein [Homo sapiens]
gi|4521954|gb|AAD21770.1| putative WHSC1 protein [Homo sapiens]
gi|4521955|gb|AAD21771.1| putative WHSC1 protein [Homo sapiens]
gi|5123789|emb|CAB45386.1| TRX5 protein [Homo sapiens]
gi|6683809|gb|AAF23370.1| MMSET type II [Homo sapiens]
gi|119602958|gb|EAW82552.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
gi|119602959|gb|EAW82553.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
gi|119602962|gb|EAW82556.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
gi|168273154|dbj|BAG10416.1| histone-lysine N-methyltransferase NSD2 [synthetic construct]
gi|187252511|gb|AAI66668.1| Wolf-Hirschhorn syndrome candidate 1 [synthetic construct]
Length = 1365
Score = 136 bits (342), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 987 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1045
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 1046 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1105
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1106 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184
>gi|326432726|gb|EGD78296.1| hypothetical protein PTSG_09362 [Salpingoeca sp. ATCC 50818]
Length = 1279
Score = 136 bits (342), Expect = 1e-29, Method: Composition-based stats.
Identities = 78/199 (39%), Positives = 106/199 (53%), Gaps = 10/199 (5%)
Query: 38 KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SG 96
KP A V + N+Y R + ++ C SP + CD C ++ C
Sbjct: 995 KPPAFQRV--RTNVYTIPRKRTPGQE------CCCSPATD-TCDDSCLNRIVHCECDPKT 1045
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C + C N+ FQ R K+ T+ G G+ A EDI G+FVIEYVGE+ID C R
Sbjct: 1046 CPVKDKCQNRRFQRRQYPKLIPFLTQSKGWGLKAGEDIAEGQFVIEYVGEIIDATECRRR 1105
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
L + + +FY+ ++ +DA K N +R+INHSC PN E QKW + GETR+GIFA
Sbjct: 1106 LAASQAANDHSFYILSLSGSSFVDARNKANLARFINHSCGPNCETQKWNVLGETRVGIFA 1165
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI KG LT+DYQ + L
Sbjct: 1166 KEDIPKGTELTFDYQLDSL 1184
>gi|114592860|ref|XP_001146084.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 6
[Pan troglodytes]
gi|114592864|ref|XP_001146248.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 7
[Pan troglodytes]
gi|114592866|ref|XP_001146323.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 8
[Pan troglodytes]
gi|114592870|ref|XP_001146473.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
10 [Pan troglodytes]
gi|397483594|ref|XP_003812984.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Pan
paniscus]
gi|410227780|gb|JAA11109.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
gi|410259494|gb|JAA17713.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
gi|410299310|gb|JAA28255.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
gi|410334709|gb|JAA36301.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
Length = 1365
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 987 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1045
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 1046 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1105
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1106 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184
>gi|124513208|ref|XP_001349960.1| SET domain protein, putative [Plasmodium falciparum 3D7]
gi|23615377|emb|CAD52368.1| SET domain protein, putative [Plasmodium falciparum 3D7]
Length = 2548
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 84/215 (39%), Positives = 119/215 (55%), Gaps = 19/215 (8%)
Query: 42 IPYVFIKRNIYLTKRIKRRL--EDDGIFCSCTASPGSSGVCD-RDCHCGMLLSSCS-SGC 97
+ + +I +NIYL + K L + D C C G C+ C+ + CS S C
Sbjct: 2044 LKFEYISKNIYLNDKNKNLLACKSDDYKCLC------QGECNLYTCYNSLSNIQCSKSRC 2097
Query: 98 KC-----GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
C N+PF+ VK +++ +TEK G G+ DIK GE + EYVGEV+ +
Sbjct: 2098 NLPEKIQDRKCFNRPFRKSFVKDLEIKKTEKTGYGVFCKRDIKNGELICEYVGEVLGKRE 2157
Query: 153 CEERLW----KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDG 208
E+RL + K N+Y+ +IN+D+ ID+ KG+ SR+INHSC PN+ QKWI+ G
Sbjct: 2158 FEKRLEVYQEESKKTDMYNWYIIQINKDVYIDSGKKGSISRFINHSCSPNSVSQKWIVRG 2217
Query: 209 ETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
RIGIFA RDI GE +TY+Y Y FL ++ C
Sbjct: 2218 FYRIGIFALRDIPSGEEITYNYSYNFLFNNFECLC 2252
>gi|357443463|ref|XP_003592009.1| Histone-lysine N-methyltransferase NSD3 [Medicago truncatula]
gi|355481057|gb|AES62260.1| Histone-lysine N-methyltransferase NSD3 [Medicago truncatula]
Length = 616
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 78/196 (39%), Positives = 110/196 (56%), Gaps = 6/196 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSC---TASPGSSGVCDRDCHCGMLLSSCSSG-CKC 99
Y+ I +N + +R K++ E+D C C P S+ C C + + C+ G C C
Sbjct: 149 YIHINQNDFFMRRHKKQKEEDIAICECRYDEDDPDSA--CGDGCLNVLTSTECTPGFCPC 206
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
C N+ FQ K KL +TE G G++ADE+IK G+FVIEY GEVI + + R
Sbjct: 207 DIHCKNQKFQKCEYAKTKLFKTEGRGWGLLADEEIKAGQFVIEYCGEVISCKEAKRRSHT 266
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
+ G + ++ +N IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA D
Sbjct: 267 YEIQGLKDAFIISLNASESIDATRKGSLARFINHSCQPNCETRKWNVMGEIRVGIFALED 326
Query: 220 IKKGENLTYDYQYEFL 235
I G L YDY +E+
Sbjct: 327 IPIGTELAYDYNFEWF 342
>gi|296081208|emb|CBI18234.3| unnamed protein product [Vitis vinifera]
Length = 862
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 76/193 (39%), Positives = 106/193 (54%), Gaps = 2/193 (1%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS-SGVCDRDCHCGMLLSSCSSG-CKCGN 101
Y+ I RN + ++ ++ E+D C C C C + + C+ G C+CG
Sbjct: 426 YIHINRNDFSYRKHIKQQEEDIAICECKYDANDPDSACGEACLNVLTSTECTPGYCRCGL 485
Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
C N+ FQ K KL +TE G G++ADE+IK G FVIEY GEVI + R
Sbjct: 486 FCKNQRFQKCEYAKTKLFRTEGRGWGLLADENIKAGRFVIEYCGEVISWKEARGRSQVYA 545
Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
LG + ++ +N IDAT KG+ R+INHSC PN E +KW + GE R+GIFA +DI
Sbjct: 546 SLGLKDAFIISLNGSECIDATKKGSLGRFINHSCQPNCETRKWTVLGEVRVGIFAKQDIS 605
Query: 222 KGENLTYDYQYEF 234
G L Y+Y +E+
Sbjct: 606 IGTELAYNYNFEW 618
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 78/188 (41%), Positives = 103/188 (54%), Gaps = 7/188 (3%)
Query: 51 IYLTKRIKRRLEDDGIFCSC---TASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNK 106
+++ IK+ EDD C C T P S+ C C + C+ C C C N+
Sbjct: 11 MFMHLHIKQE-EDDITICECKYNTNDPDSA--CGERCLNVLTSIECTPHYCPCSVHCKNQ 67
Query: 107 PFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGET 166
FQ K KL +TE G G++A+EDIK G F+IEY GEVI ER G
Sbjct: 68 RFQKHEYAKTKLFRTEGRGWGLLANEDIKAGRFIIEYCGEVISWNEARERSLAYASQGIN 127
Query: 167 NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENL 226
+ Y+ +N IDAT G+++R+INHSC PN E +KW + GE RIGIFA RDI G L
Sbjct: 128 DAYIISLNARECIDATKSGSQARFINHSCEPNCETRKWSVLGEVRIGIFAMRDISIGTEL 187
Query: 227 TYDYQYEF 234
TYDY +++
Sbjct: 188 TYDYNFQW 195
>gi|148709230|gb|EDL41176.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_b [Mus
musculus]
Length = 2382
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 103/169 (60%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R +++ +T + G
Sbjct: 1587 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGVRCQNQCFSKRQYPDVEIFRTLQRGW 1646
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1647 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 1706
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 1707 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1755
>gi|260941922|ref|XP_002615127.1| hypothetical protein CLUG_05142 [Clavispora lusitaniae ATCC 42720]
gi|238851550|gb|EEQ41014.1| hypothetical protein CLUG_05142 [Clavispora lusitaniae ATCC 42720]
Length = 717
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 72/194 (37%), Positives = 108/194 (55%), Gaps = 7/194 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTAS----PGSSGVCDRDCHCGMLLSS--CSSG- 96
+V + YLTK++ + + + C C + C D +C +S C +G
Sbjct: 24 FVPLAECTYLTKQLGGSGQKETMTCDCVEEWNEGKAQNLACGEDSNCINRATSVECVNGS 83
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C CG +C N+ FQ + +K++QTEK G G+VA D+ G FV EY+GEVID++T R
Sbjct: 84 CTCGQNCQNQRFQKKQYSSVKVIQTEKKGYGLVAQADVPEGSFVYEYIGEVIDEKTFRAR 143
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ +FY + +D IDAT KG+ +R+ NHSC PN + KW++ + R+GIFA
Sbjct: 144 MLDYDRRNLRHFYFMMLTKDAFIDATEKGSLARFCNHSCSPNAYVDKWVVGDKLRMGIFA 203
Query: 217 TRDIKKGENLTYDY 230
R I GE +T+DY
Sbjct: 204 RRRILAGEEITFDY 217
>gi|147816102|emb|CAN75267.1| hypothetical protein VITISV_028162 [Vitis vinifera]
Length = 460
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 80/188 (42%), Positives = 104/188 (55%), Gaps = 7/188 (3%)
Query: 51 IYLTKRIKRRLEDDGIFCSC---TASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNK 106
I+L + IK+ EDD C C T P S+ C C + C+ C C C N+
Sbjct: 96 IFLIRHIKQE-EDDITICECKYNTNDPDSA--CGERCLNVLTSIECTPHYCPCSVHCKNQ 152
Query: 107 PFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGET 166
FQ K KL +TE G G++A+EDIK G F+IEY GEVI ER G
Sbjct: 153 RFQKHEYAKTKLFRTEGRGWGLLANEDIKAGRFIIEYCGEVISWNEARERSLAYASQGIN 212
Query: 167 NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENL 226
+ Y+ +N IDAT G+++R+INHSC PN E +KW + GE RIGIFA RDI G L
Sbjct: 213 DAYIISLNARECIDATKSGSQARFINHSCEPNCETRKWSVLGEVRIGIFAMRDISIGTEL 272
Query: 227 TYDYQYEF 234
TYDY +++
Sbjct: 273 TYDYNFQW 280
>gi|301762334|ref|XP_002916587.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2-like [Ailuropoda melanoleuca]
Length = 1364
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 986 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1044
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 1045 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1104
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1105 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1164
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1165 VCDIPAGTELTFNYNLDCL 1183
>gi|328781326|ref|XP_003249962.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Apis mellifera]
Length = 1218
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 74/166 (44%), Positives = 96/166 (57%), Gaps = 1/166 (0%)
Query: 66 IFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKC 124
+ C C + DC +LL CS G C G C N+ F R M+ T
Sbjct: 778 VACDCDPEWENPCAPGTDCLNRILLVECSPGICPAGPKCNNQAFVRRQYPAMEPFHTIGR 837
Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
G G+ + E IK G+FVIEYVGEVID+ + RL + K L NFY I+ + IDA K
Sbjct: 838 GWGLRSLEHIKAGQFVIEYVGEVIDEAEYKRRLHRKKELKNENFYFLTIDNNRTIDAEPK 897
Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
GN SR++NHSC PN E QKW ++G+TRIG+FA DI+ GE LT++Y
Sbjct: 898 GNLSRFMNHSCSPNCETQKWTVNGDTRIGLFALCDIEPGEELTFNY 943
>gi|359493199|ref|XP_003634540.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Vitis
vinifera]
Length = 515
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 76/193 (39%), Positives = 106/193 (54%), Gaps = 2/193 (1%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS-SGVCDRDCHCGMLLSSCSSG-CKCGN 101
Y+ I RN + ++ ++ E+D C C C C + + C+ G C+CG
Sbjct: 25 YIHINRNDFSYRKHIKQQEEDIAICECKYDANDPDSACGEACLNVLTSTECTPGYCRCGL 84
Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
C N+ FQ K KL +TE G G++ADE+IK G FVIEY GEVI + R
Sbjct: 85 FCKNQRFQKCEYAKTKLFRTEGRGWGLLADENIKAGRFVIEYCGEVISWKEARGRSQVYA 144
Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
LG + ++ +N IDAT KG+ R+INHSC PN E +KW + GE R+GIFA +DI
Sbjct: 145 SLGLKDAFIISLNGSECIDATKKGSLGRFINHSCQPNCETRKWTVLGEVRVGIFAKQDIS 204
Query: 222 KGENLTYDYQYEF 234
G L Y+Y +E+
Sbjct: 205 IGTELAYNYNFEW 217
>gi|375294191|ref|NP_001069063.2| histone-lysine N-methyltransferase NSD3 [Bos taurus]
gi|359080579|ref|XP_003588019.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Bos
taurus]
Length = 1439
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 76/208 (36%), Positives = 111/208 (53%), Gaps = 6/208 (2%)
Query: 31 ELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GM 88
E+P P PY +K N + K ++ ++ D C P C + C M
Sbjct: 1064 EVPQSLRDPP--PYQHLKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRM 1120
Query: 89 LLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEV 147
L C C G C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+
Sbjct: 1121 LQYECHPQVCPAGERCQNQCFTKRLYPDAEVIRTERRGWGLRTKRSIKKGEFVNEYVGEL 1180
Query: 148 IDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIID 207
ID++ C R+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++
Sbjct: 1181 IDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVN 1240
Query: 208 GETRIGIFATRDIKKGENLTYDYQYEFL 235
G+ R+G+FA DI G LT++Y + L
Sbjct: 1241 GDVRVGLFALCDIPAGMELTFNYNLDCL 1268
>gi|118918400|ref|NP_032765.3| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Mus musculus]
Length = 2691
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 103/169 (60%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R +++ +T + G
Sbjct: 1896 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGVRCQNQCFSKRQYPDVEIFRTLQRGW 1955
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + TNFY+ +++D +IDA KGN
Sbjct: 1956 GLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGN 2015
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 2016 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 2064
>gi|281339990|gb|EFB15574.1| hypothetical protein PANDA_004672 [Ailuropoda melanoleuca]
Length = 1363
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 985 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1043
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 1044 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1103
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1104 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1163
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1164 VCDIPAGTELTFNYNLDCL 1182
>gi|348571625|ref|XP_003471596.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Cavia porcellus]
Length = 1366
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 987 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1045
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 1046 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1105
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1106 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184
>gi|348571627|ref|XP_003471597.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 2
[Cavia porcellus]
Length = 1367
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 988 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1046
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 1047 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1106
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1107 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1166
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1167 VCDIPAGTELTFNYNLDCL 1185
>gi|345798392|ref|XP_536224.3| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Canis lupus familiaris]
Length = 1364
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 986 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1044
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 1045 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1104
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1105 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1164
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1165 VCDIPAGTELTFNYNLDCL 1183
>gi|449446403|ref|XP_004140961.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Cucumis
sativus]
Length = 497
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 76/193 (39%), Positives = 110/193 (56%), Gaps = 2/193 (1%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS-GVCDRDCHCGMLLSSCSSG-CKCGN 101
Y IK+N + ++ K++ E+D C C + C C + + C+ G C G
Sbjct: 11 YKDIKQNEFCYRKHKKQKEEDIAVCECKFDDNDNDSACGEGCLNVLTSTECTPGHCPSGV 70
Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
C N+ FQ K KL +TE G G++ADE+IK G+F+IEY GEVI + + R +
Sbjct: 71 HCRNQRFQKCEYAKTKLFKTEGRGWGLLADENIKNGQFIIEYCGEVISWKEAKRRSHTYE 130
Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
+ G + Y+ +N IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA +DI
Sbjct: 131 NQGLKDAYIISLNASESIDATRKGSLARFINHSCFPNCETRKWNVLGEIRVGIFAKQDIS 190
Query: 222 KGENLTYDYQYEF 234
G L YDY +E+
Sbjct: 191 IGTELAYDYNFEW 203
>gi|118101386|ref|XP_001232891.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Gallus
gallus]
Length = 1436
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 74/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1068 KPPPYKHIKSNKVIGK-VQIQVADLSEIPRCNCKPSDENPCGLESECLNRMLQYECHPQV 1126
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R ++++T++ G G+ +IK+GEFV EYVGE+ID++ C R
Sbjct: 1127 CPAGERCQNQCFTKRLYPDAEIIKTDRRGWGLRTKRNIKKGEFVNEYVGELIDEEECRLR 1186
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1187 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1246
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1247 LCDIPAGMELTFNYNLDCL 1265
>gi|326932813|ref|XP_003212507.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like isoform 1
[Meleagris gallopavo]
Length = 1436
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 74/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1068 KPPPYKHIKSNKVIGK-VQIQVADLSEIPRCNCKPSDENPCGLESECLNRMLQYECHPQV 1126
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R ++++T++ G G+ +IK+GEFV EYVGE+ID++ C R
Sbjct: 1127 CPAGERCQNQCFTKRLYPDAEIIKTDRRGWGLRTKRNIKKGEFVNEYVGELIDEEECRLR 1186
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1187 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1246
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1247 LCDIPAGMELTFNYNLDCL 1265
>gi|37360238|dbj|BAC98097.1| mKIAA1090 protein [Mus musculus]
Length = 857
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N K ++ D C P C D C ML+ C
Sbjct: 479 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGSDSECLNRMLMFECHPQV 537
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 538 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 597
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 598 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 657
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 658 VCDIPAGTELTFNYNLDCL 676
>gi|28204960|gb|AAH46473.1| Whsc1 protein, partial [Mus musculus]
Length = 851
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N K ++ D C P C D C ML+ C
Sbjct: 473 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGSDSECLNRMLMFECHPQV 531
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 532 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 591
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 592 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 651
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 652 VCDIPAGTELTFNYNLDCL 670
>gi|156099290|ref|XP_001615647.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148804521|gb|EDL45920.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 3021
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 78/216 (36%), Positives = 119/216 (55%), Gaps = 21/216 (9%)
Query: 42 IPYVFIKRNIYLTKRIKRRL--EDDGIFCSCTASPGSSGVCDR-DCHCGMLLSSCSSGCK 98
+ + +I +N+Y+ + K L + D C C G CD C+ + CS +
Sbjct: 2520 LNFEYISKNVYVNDQNKNLLSCKSDDYRCLC------QGECDPFSCYNSLSKIQCSKN-R 2572
Query: 99 CG-------NSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQ 151
C C N+PF++ +K +++ QTE+ G G+ DIK GE + EYVGEV+ +
Sbjct: 2573 CNLPIQIQDKKCFNRPFKHSAIKDLEIKQTERTGYGVFCKRDIKNGELICEYVGEVLGKK 2632
Query: 152 TCEERLWKMKHLGET----NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIID 207
EER+ + + N+Y +INRD+ ID+ KG+ SR++NHSC PN+ QKWI+
Sbjct: 2633 EFEERMEAYQEESKKTDMYNWYSIQINRDVHIDSRRKGSISRFVNHSCSPNSVSQKWIVR 2692
Query: 208 GETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
G RIGIFA +DI GE +TY+Y Y F+ ++ C
Sbjct: 2693 GFYRIGIFAQQDIPAGEEITYNYSYNFVFNNFECLC 2728
>gi|149756942|ref|XP_001488967.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Equus caballus]
Length = 1365
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 987 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1045
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 1046 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1105
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1106 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1166 VCDIPAGTELTFNYNLDCL 1184
>gi|326919530|ref|XP_003206033.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Meleagris gallopavo]
Length = 1348
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 971 KPPPYKHIKVNKPCGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMYECHPQV 1029
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + ++++T+ G G+VA DIK+GEFV EYVGE+ID++ C R
Sbjct: 1030 CPAGERCQNQCFTKREYPETEIIKTDGKGWGLVAKRDIKKGEFVNEYVGELIDEEECMAR 1089
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1090 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1149
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1150 VCDIPAGTELTFNYNLDCL 1168
>gi|351704076|gb|EHB06995.1| Putative histone-lysine N-methyltransferase NSD2 [Heterocephalus
glaber]
Length = 1372
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 77/196 (39%), Positives = 109/196 (55%), Gaps = 4/196 (2%)
Query: 43 PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG-CKC 99
PY IK N K ++ D C P C D +C ML+ C C
Sbjct: 996 PYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQVCPA 1054
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R+
Sbjct: 1055 GEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMARIKY 1114
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA D
Sbjct: 1115 AHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCD 1174
Query: 220 IKKGENLTYDYQYEFL 235
I G LT++Y + L
Sbjct: 1175 IPAGTELTFNYNLDCL 1190
>gi|326932815|ref|XP_003212508.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like isoform 2
[Meleagris gallopavo]
Length = 1386
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 74/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1018 KPPPYKHIKSNKVIGK-VQIQVADLSEIPRCNCKPSDENPCGLESECLNRMLQYECHPQV 1076
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R ++++T++ G G+ +IK+GEFV EYVGE+ID++ C R
Sbjct: 1077 CPAGERCQNQCFTKRLYPDAEIIKTDRRGWGLRTKRNIKKGEFVNEYVGELIDEEECRLR 1136
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1137 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1196
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1197 LCDIPAGMELTFNYNLDCL 1215
>gi|118101388|ref|XP_424390.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2 [Gallus
gallus]
Length = 1386
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 74/199 (37%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1018 KPPPYKHIKSNKVIGK-VQIQVADLSEIPRCNCKPSDENPCGLESECLNRMLQYECHPQV 1076
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R ++++T++ G G+ +IK+GEFV EYVGE+ID++ C R
Sbjct: 1077 CPAGERCQNQCFTKRLYPDAEIIKTDRRGWGLRTKRNIKKGEFVNEYVGELIDEEECRLR 1136
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1137 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDIRVGLFA 1196
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1197 LCDIPAGMELTFNYNLDCL 1215
>gi|431897323|gb|ELK06585.1| Putative histone-lysine N-methyltransferase NSD2 [Pteropus alecto]
Length = 502
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N K ++ D C P C D C ML+ C
Sbjct: 57 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 115
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID+ C R
Sbjct: 116 CPAGEFCQNQCFTKRQYPETKIIRTDGKGWGLVAKRDIRKGEFVNEYVGELIDEDECMAR 175
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 176 IKRAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 235
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 236 VCDIPAGTELTFNYNLDCL 254
>gi|118090799|ref|XP_420839.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Gallus
gallus]
Length = 1369
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 992 KPPPYKHIKVNKPCGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMYECHPQV 1050
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + ++++T+ G G+VA DIK+GEFV EYVGE+ID++ C R
Sbjct: 1051 CPAGERCQNQCFTKREYPETEIIKTDGKGWGLVAKRDIKKGEFVNEYVGELIDEEECMAR 1110
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1111 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1170
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1171 VCDIPAGTELTFNYNLDCL 1189
>gi|402852477|ref|XP_003890948.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Papio anubis]
Length = 1013
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 635 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 693
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 694 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 753
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 754 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 813
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 814 VCDIPAGTELTFNYNLDCL 832
>gi|26347387|dbj|BAC37342.1| unnamed protein product [Mus musculus]
Length = 601
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N K ++ D C P C D C ML+ C
Sbjct: 223 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGSDSECLNRMLMFECHPQV 281
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 282 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 341
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 342 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 401
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 402 VCDIPAGTELTFNYNLDCL 420
>gi|313226807|emb|CBY21952.1| unnamed protein product [Oikopleura dioica]
Length = 216
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 66/153 (43%), Positives = 93/153 (60%)
Query: 80 CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
C +C L+ CSS C+ GN C NK FQ R K+ + G G+ A I RG+F
Sbjct: 16 CGEECLNRALMVECSSRCRTGNRCTNKNFQKRKYAPAKIFWAKGKGHGLKATAFIPRGKF 75
Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
+IEY+GEV+ + ++R + G+ + Y E++R IDA +KG SR+INHSC PN+
Sbjct: 76 IIEYLGEVVSAKEFKKRSHEYARSGKQHHYFMELSRQATIDAYHKGAISRFINHSCEPNS 135
Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
E QKW ++G RIG FA RDI+ E +T+DYQ+
Sbjct: 136 ETQKWTVNGLLRIGFFAIRDIQPEEEITFDYQF 168
>gi|190349638|gb|ACE75882.1| multiple-myeloma-related WHSC1/MMSET isoform RE-IIBP [Homo sapiens]
Length = 704
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N K ++ D C +P C D C ML+ C
Sbjct: 326 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCTPTDENPCGSDSECLNRMLMFECHPQV 384
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 385 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 444
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 445 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 504
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 505 VCDIPAGTELTFNYNLDCL 523
>gi|342209884|gb|AEL16989.1| ASH1-like protein [Phaseolus vulgaris]
Length = 481
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 78/212 (36%), Positives = 113/212 (53%), Gaps = 11/212 (5%)
Query: 26 NPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS-GVCDRDC 84
+P ELP Y+ I +N + +R K++ E+D C C + C C
Sbjct: 2 DPFTEELPQ---------YIHINQNEFFMRRQKKQKEEDIAICECKYDANDTDSACGDSC 52
Query: 85 HCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEY 143
+ + C+ G C C C N+ FQ K KL +TE G G++A ED+K G+FVIEY
Sbjct: 53 LNVLTSTECTPGYCPCDILCKNQKFQKCEYAKTKLFKTEGRGWGLLAGEDLKAGQFVIEY 112
Query: 144 VGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQK 203
GEVI + + R ++ G + ++ +N IDAT KG+ +R+INHSC PN E +K
Sbjct: 113 CGEVISWKEAKRRSQAYENQGLKDAFIICLNASESIDATRKGSLARFINHSCRPNCETRK 172
Query: 204 WIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
W + GE R+GIFA D+ G L YDY +E+
Sbjct: 173 WNVLGEIRVGIFAKHDVPIGTELAYDYNFEWF 204
>gi|432909264|ref|XP_004078147.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Oryzias
latipes]
Length = 1665
Score = 135 bits (339), Expect = 2e-29, Method: Composition-based stats.
Identities = 73/207 (35%), Positives = 115/207 (55%), Gaps = 9/207 (4%)
Query: 36 FIKPKAIPYVF--IKRNIYLTKRIKRRLEDD--GIFCSC---TASPGSSGV--CDRDCHC 86
F + K +P F I+ N+YLT+R K + D + C C + GV C DC
Sbjct: 544 FARSKKMPPYFDLIEENLYLTERKKSKSHRDIKRMQCECPVLSREERGKGVMACGEDCLN 603
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
+L+ CSS C G C N+ FQ + +++ T+ G G+ A +++ FV+EY GE
Sbjct: 604 RLLMIECSSRCLNGAYCSNRRFQMKQHADFEVILTDDKGWGLRAAKEMAPNTFVLEYCGE 663
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
V+D + + R+ + ++Y + + +IDAT KGN SR++NHSC PN E QKW +
Sbjct: 664 VLDHKEFKTRVKEYARNKNIHYYFMSLKNNEIIDATLKGNCSRFMNHSCEPNCETQKWTV 723
Query: 207 DGETRIGIFATRDIKKGENLTYDYQYE 233
+G+ R+G F T+ + G LT+DYQ++
Sbjct: 724 NGQLRVGFFTTKAVAAGTELTFDYQFQ 750
>gi|31418293|gb|AAH53454.1| Whsc1 protein, partial [Mus musculus]
Length = 558
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N K ++ D C P C D C ML+ C
Sbjct: 180 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGSDSECLNRMLMFECHPQV 238
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 239 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 298
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 299 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 358
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 359 VCDIPAGTELTFNYNLDCL 377
>gi|384484414|gb|EIE76594.1| hypothetical protein RO3G_01298 [Rhizopus delemar RA 99-880]
Length = 291
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 81/218 (37%), Positives = 119/218 (54%), Gaps = 16/218 (7%)
Query: 29 EFELPDWF-----------IKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
EFELP + +K K + I+ N+++ ++ K D+ C C G +
Sbjct: 49 EFELPPFIWQERELNLIKSLKDKRPVFTKIRSNLFVERKPKL-CPDERNVCHCKEPTGEN 107
Query: 78 GV-CDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVK-KMKLVQTEKCGAGIVADEDI 134
V C DC +L C C CG C N FQ + K K+++ +T++ G G+ EDI
Sbjct: 108 EVGCGDDCINRLLFYECDPKHCPCGPKCSNNRFQKKQRKYKLEVFKTDQRGWGLRTLEDI 167
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K+G FVIEY GE+I + CEER+ ++ E NFY E ++ VIDA KG ++R+INHS
Sbjct: 168 KKGAFVIEYRGEIISQKLCEERMC-TDYVNENNFYFLEYSKGEVIDACTKGTEARFINHS 226
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN ++KW GE G+FA++DI L+YDY +
Sbjct: 227 CDPNCHIEKWSYRGEAHFGVFASKDIPAYSELSYDYNF 264
>gi|354472091|ref|XP_003498274.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Cricetulus griseus]
Length = 1436
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 79/222 (35%), Positives = 116/222 (52%), Gaps = 6/222 (2%)
Query: 17 FNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS 76
F +L Q + E+ KP PY IK N + K + ++ D C P
Sbjct: 1048 FQELKAQRESKEALEIEKTSRKPP--PYKHIKANKVIGK-VPIQVADLSEIPRCNCKPAD 1104
Query: 77 SGVCDRDCHCGMLLSS--CSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADED 133
C + C +S C C G+ C N+ F R ++++TE+ G G+
Sbjct: 1105 ENPCGLESECLNRMSQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERKGWGLRTKRS 1164
Query: 134 IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
IK+GEFV EYVGE+ID++ C R+ + TNFY+ + +D +IDA KGN SR++NH
Sbjct: 1165 IKKGEFVNEYVGELIDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNH 1224
Query: 194 SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
SC PN E QKW ++G+ R+G+FA DI G LT++Y + L
Sbjct: 1225 SCNPNCETQKWTVNGDVRVGLFAICDIPAGMELTFNYNLDCL 1266
>gi|195503632|ref|XP_002098733.1| GE10528 [Drosophila yakuba]
gi|194184834|gb|EDW98445.1| GE10528 [Drosophila yakuba]
Length = 1441
Score = 134 bits (338), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 76/205 (37%), Positives = 114/205 (55%), Gaps = 7/205 (3%)
Query: 43 PYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG-CK 98
PYV IK N + R + LED +C PG C D C ML + C+ CK
Sbjct: 1166 PYVKIKTNKAVPPVRFTQNLED---LSACYCQPGDDHPCGPDSGCINRMLFNECNPEFCK 1222
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
GN C N+ F+ R ++++V + G G+V E I G+FVIEYVGEVI+ + R+
Sbjct: 1223 AGNRCENQMFEQRKSPRLEVVYMNERGFGLVNREPIAEGDFVIEYVGEVINHAEFQRRME 1282
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
+ + + N+Y + +D +IDA KGN +R++NHSC PN E QKW ++ R+G+FA +
Sbjct: 1283 QKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGLFAIK 1342
Query: 219 DIKKGENLTYDYQYEFLHDSLIAYC 243
DI LT++Y ++ L ++ C
Sbjct: 1343 DIPVNTELTFNYLWDDLMNNSKKAC 1367
>gi|354472093|ref|XP_003498275.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
[Cricetulus griseus]
Length = 1387
Score = 134 bits (338), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 79/222 (35%), Positives = 116/222 (52%), Gaps = 6/222 (2%)
Query: 17 FNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS 76
F +L Q + E+ KP PY IK N + K + ++ D C P
Sbjct: 999 FQELKAQRESKEALEIEKTSRKPP--PYKHIKANKVIGK-VPIQVADLSEIPRCNCKPAD 1055
Query: 77 SGVCDRDCHCGMLLSS--CSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADED 133
C + C +S C C G+ C N+ F R ++++TE+ G G+
Sbjct: 1056 ENPCGLESECLNRMSQYECHPQVCPAGDRCQNQCFTKRLYPDAEIIKTERKGWGLRTKRS 1115
Query: 134 IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
IK+GEFV EYVGE+ID++ C R+ + TNFY+ + +D +IDA KGN SR++NH
Sbjct: 1116 IKKGEFVNEYVGELIDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNH 1175
Query: 194 SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
SC PN E QKW ++G+ R+G+FA DI G LT++Y + L
Sbjct: 1176 SCNPNCETQKWTVNGDVRVGLFAICDIPAGMELTFNYNLDCL 1217
>gi|281346901|gb|EFB22485.1| hypothetical protein PANDA_005493 [Ailuropoda melanoleuca]
Length = 926
Score = 134 bits (338), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ + D C P C + C ML C
Sbjct: 558 KPPPYKHIKANKVIGK-VQIHVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 616
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C+ R
Sbjct: 617 CPAGERCQNQCFTKRLYPDAEVIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECKLR 676
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 677 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 736
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 737 LCDIPAGMELTFNYNLDCL 755
>gi|441664377|ref|XP_003279042.2| PREDICTED: histone-lysine N-methyltransferase NSD2-like [Nomascus
leucogenys]
Length = 780
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N K ++ D C P C D C ML+ C
Sbjct: 482 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 540
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 541 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 600
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 601 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 660
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 661 VCDIPAGTELTFNYNLDCL 679
>gi|156230137|gb|AAI52413.1| WHSC1 protein [Homo sapiens]
Length = 713
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N K ++ D C P C D C ML+ C
Sbjct: 335 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 393
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 394 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 453
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 454 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 513
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 514 VCDIPAGTELTFNYNLDCL 532
>gi|297282129|ref|XP_002802212.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Macaca mulatta]
Length = 713
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N K ++ D C P C D C ML+ C
Sbjct: 335 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 393
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 394 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 453
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 454 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 513
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 514 VCDIPAGTELTFNYNLDCL 532
>gi|40789042|dbj|BAA83042.2| KIAA1090 protein [Homo sapiens]
Length = 715
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N K ++ D C P C D C ML+ C
Sbjct: 337 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 395
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 396 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 455
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 456 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 515
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 516 VCDIPAGTELTFNYNLDCL 534
>gi|313227685|emb|CBY22833.1| unnamed protein product [Oikopleura dioica]
Length = 1179
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 74/189 (39%), Positives = 107/189 (56%), Gaps = 4/189 (2%)
Query: 55 KRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVK 114
KRI+ L + I C C CD +C L + CSS C G+ C N+ F R
Sbjct: 936 KRIEANLAETEI-CDCHKR---DNPCDEECTNRQLQNECSSKCLNGDKCQNRRFTKRHYP 991
Query: 115 KMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEIN 174
+ QT+ G G+ A + I +G F+IEY+GE+I RL + +G TN+Y+ E++
Sbjct: 992 PLLKFQTDWGGNGLKAKKLIPKGSFIIEYIGEIISHDESRIRLEESAKIGVTNYYILELD 1051
Query: 175 RDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEF 234
+IDA +GN +R+INHSC PN + WI+ G+TRIGIF+ RDI++GE LT++YQ +
Sbjct: 1052 NLRMIDAGPRGNIARFINHSCDPNCGIDPWIVQGDTRIGIFSKRDIQEGEELTFNYQLQQ 1111
Query: 235 LHDSLIAYC 243
D C
Sbjct: 1112 SSDEGKTKC 1120
>gi|119602957|gb|EAW82551.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_d [Homo sapiens]
Length = 742
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N K ++ D C P C D C ML+ C
Sbjct: 364 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 422
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 423 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 482
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 483 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 542
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 543 VCDIPAGTELTFNYNLDCL 561
>gi|12642795|gb|AAK00344.1|AF330040_1 IL-5 promoter REII-region-binding protein [Homo sapiens]
gi|119602961|gb|EAW82555.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_g [Homo sapiens]
gi|133777178|gb|AAH94825.2| Wolf-Hirschhorn syndrome candidate 1 [Homo sapiens]
Length = 584
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N K ++ D C P C D C ML+ C
Sbjct: 206 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 264
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 265 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 324
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 325 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 384
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 385 VCDIPAGTELTFNYNLDCL 403
>gi|326671180|ref|XP_694414.5| PREDICTED: histone-lysine N-methyltransferase NSD3 [Danio rerio]
Length = 1562
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 73/199 (36%), Positives = 107/199 (53%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
+ PY IK N K ++ + D C P C +D C ML C
Sbjct: 1182 RPPPYKLIKSNKPFGK-VQLHVADLSEIPRCNCKPTDERPCSQDSQCLNRMLQYECHPQV 1240
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++T G G+ +D+K+G+FV+EYVGE+ID + C++R
Sbjct: 1241 CPAGDRCHNQCFSKRLYPDTEVIKTTGRGWGLKTKQDLKKGDFVMEYVGELIDSEECKQR 1300
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ TNFY+ + +D VIDA KGN SR++NHSC PN E QKW ++G+ RIG+F
Sbjct: 1301 IRTANENHVTNFYMLTLTKDRVIDAGPKGNLSRFMNHSCSPNCETQKWTVNGDVRIGLFT 1360
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI LT++Y + L
Sbjct: 1361 LCDISADTELTFNYNLDCL 1379
>gi|355729163|gb|AES09785.1| Wolf-Hirschhorn syndrome candidate 1 [Mustela putorius furo]
Length = 409
Score = 134 bits (337), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 110/199 (55%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 32 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 90
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 91 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 150
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 151 IKYAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 210
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 211 VCDIPAGTELTFNYNLDCL 229
>gi|328715143|ref|XP_001944096.2| PREDICTED: hypothetical protein LOC100167890 [Acyrthosiphon pisum]
Length = 2332
Score = 134 bits (337), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 69/164 (42%), Positives = 97/164 (59%), Gaps = 2/164 (1%)
Query: 80 CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
C +C +L C C + C NK FQN+ K+ K+++T G G+ A EDI G F
Sbjct: 862 CATECLNRLLNIECGKACVLKSLCTNKQFQNKQFKRTKIIKTADKGYGVFALEDIPSGTF 921
Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
V EY+GEVID CE + K L + N Y+ ++ D +IDAT KGN +R+INHSC PN
Sbjct: 922 VDEYMGEVIDQ--CEMIIRMKKILYKNNNYMVQLKHDEIIDATRKGNITRFINHSCEPNC 979
Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
+KW + GE+R+G F+ I+KGE +T+DY +E D+ C
Sbjct: 980 VAEKWNVLGESRMGFFSKELIRKGEEITFDYSFEIFGDAAQQKC 1023
>gi|156390320|ref|XP_001635219.1| predicted protein [Nematostella vectensis]
gi|156222310|gb|EDO43156.1| predicted protein [Nematostella vectensis]
Length = 690
Score = 134 bits (337), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 78/192 (40%), Positives = 110/192 (57%), Gaps = 5/192 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC-SSGCK 98
K Y IK N T +R + D C +P + D +C ML C +S C
Sbjct: 305 KPPQYKHIKTNKCTT--AQRIILDPSEMPVCECTPDQACGQDANCLNLMLQFECVASRCP 362
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
G+ C N+ FQ R + + G G+ + + IK+G FVIEYVGE+IDD TC ER+
Sbjct: 363 AGDKCQNQRFQKRQYVDCEPFRAHSRGWGLRSKQAIKKGTFVIEYVGELIDDATCRERVK 422
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
K +TN+Y+ I++D +IDA GN SR++NHSC PN E QKW ++GE R+G+F +R
Sbjct: 423 KGD--DDTNYYMLTIDKDCIIDAGPMGNLSRFMNHSCYPNCETQKWTVNGEVRVGLFTSR 480
Query: 219 DIKKGENLTYDY 230
D++ E LT+DY
Sbjct: 481 DVESQEELTFDY 492
>gi|15488420|gb|AAL01111.1|AF408060_1 ASH1-like protein 3 [Arabidopsis thaliana]
Length = 125
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 81/125 (64%), Positives = 99/125 (79%), Gaps = 2/125 (1%)
Query: 11 SRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSC 70
+ +G F+KLL QIG EFELP+W K K PY+FI+RNIYLT+++KRR+EDDGIFCSC
Sbjct: 1 NHLGQVFDKLLNQIGESEEFELPEWLNKGKPTPYIFIRRNIYLTEKVKRRVEDDGIFCSC 60
Query: 71 TASPGSS--GVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGI 128
++S S VC +CHCGML SSCSS CKCG+ C NKPFQ R VKKMKL+QTEKCG+GI
Sbjct: 61 SSSSPGSSSTVCGSNCHCGMLFSSCSSSCKCGSECNNKPFQQRHVKKMKLIQTEKCGSGI 120
Query: 129 VADED 133
VA+E+
Sbjct: 121 VAEEE 125
>gi|380797995|gb|AFE70873.1| putative histone-lysine N-methyltransferase NSD2 isoform 1, partial
[Macaca mulatta]
Length = 421
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 109/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N K ++ D C P C D C ML+ C
Sbjct: 43 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 101
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 102 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 161
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 162 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 221
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 222 VCDIPAGTELTFNYNLDCL 240
>gi|355729169|gb|AES09787.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Mustela putorius furo]
Length = 596
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 76/205 (37%), Positives = 108/205 (52%), Gaps = 10/205 (4%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG--------MLLS 91
K PY IK N + K ++ + D C P + CG ML
Sbjct: 223 KPPPYKHIKANKVIGK-VQIHVADLSEIPRCNCKPADENSPADENPCGLESECLNRMLQY 281
Query: 92 SCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDD 150
C C G C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID+
Sbjct: 282 ECHPQVCPAGERCQNQCFTKRLYPDAEVIKTERRGWGLRTKRSIKKGEFVNEYVGELIDE 341
Query: 151 QTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGET 210
+ C R+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+
Sbjct: 342 EECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDV 401
Query: 211 RIGIFATRDIKKGENLTYDYQYEFL 235
R+G+FA RDI G LT++Y + L
Sbjct: 402 RVGLFALRDIPAGMELTFNYNLDCL 426
>gi|162463380|ref|NP_001105665.1| SET domain-containing protein SET102 [Zea mays]
gi|22121720|gb|AAM89289.1| SET domain-containing protein SET102 [Zea mays]
gi|414587223|tpg|DAA37794.1| TPA: SET domain-containing protein SET102 [Zea mays]
Length = 513
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 77/208 (37%), Positives = 115/208 (55%), Gaps = 12/208 (5%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG------MLLSSC 93
+A Y+ I N + ++ K + E+D C C + D + CG + + C
Sbjct: 9 EAPSYIHIDSNAFSYRKHKSQTEEDIAVCECQYD-----ILDPESPCGDRCLNLLTNTEC 63
Query: 94 SSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
+ G C+CG C N+ FQ + +LV+T G G+VADE+I G+FVIEY GEVI +
Sbjct: 64 TPGYCRCGVYCKNQRFQKCQYARTRLVRTGGRGWGLVADENIMAGQFVIEYCGEVISWKE 123
Query: 153 CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
+ R + + Y+ +N D IDAT KGN +R+INHSC PN E +KW + GE R+
Sbjct: 124 AKRRAQAYETQCLKDAYIIYLNADESIDATRKGNLARFINHSCQPNCETRKWNVLGEVRV 183
Query: 213 GIFATRDIKKGENLTYDYQYEFLHDSLI 240
GIFA ++I G L+YDY +E+ ++
Sbjct: 184 GIFAKQNIPFGTELSYDYNFEWYGGVMV 211
>gi|270014006|gb|EFA10454.1| hypothetical protein TcasGA2_TC012700 [Tribolium castaneum]
Length = 1740
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 79/208 (37%), Positives = 117/208 (56%), Gaps = 6/208 (2%)
Query: 29 EFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHC 86
+FE + +KP PYV IK N + ++ + SC P C D DC
Sbjct: 1275 KFEEKNSGMKPP--PYVRIKVNKPVG-NVRVFDGNTSNTTSCDCDPNQPHPCGPDSDCLN 1331
Query: 87 GMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVG 145
+LL+ C+ C G+ C N+ F+ R + +T G G+ I++G+FVIEYVG
Sbjct: 1332 RLLLTECNPDVCPAGDRCNNQCFEKREYPPLVPHRTLYRGWGLKTLAPIRKGQFVIEYVG 1391
Query: 146 EVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWI 205
E+ID+Q + R+ KM E N+Y I++D ++DA KGN +R++NHSC PN E QKW
Sbjct: 1392 EMIDEQEYQRRVQKMHEQKEENYYFLTIDKDRMLDAGPKGNVARFMNHSCDPNCETQKWT 1451
Query: 206 IDGETRIGIFATRDIKKGENLTYDYQYE 233
++G+TR+G+FA DI G LT++Y E
Sbjct: 1452 VNGDTRVGLFANCDIPAGTELTFNYNLE 1479
>gi|91090902|ref|XP_973711.1| PREDICTED: similar to NSD1 [Tribolium castaneum]
Length = 1795
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 79/208 (37%), Positives = 117/208 (56%), Gaps = 6/208 (2%)
Query: 29 EFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHC 86
+FE + +KP PYV IK N + ++ + SC P C D DC
Sbjct: 1330 KFEEKNSGMKPP--PYVRIKVNKPVG-NVRVFDGNTSNTTSCDCDPNQPHPCGPDSDCLN 1386
Query: 87 GMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVG 145
+LL+ C+ C G+ C N+ F+ R + +T G G+ I++G+FVIEYVG
Sbjct: 1387 RLLLTECNPDVCPAGDRCNNQCFEKREYPPLVPHRTLYRGWGLKTLAPIRKGQFVIEYVG 1446
Query: 146 EVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWI 205
E+ID+Q + R+ KM E N+Y I++D ++DA KGN +R++NHSC PN E QKW
Sbjct: 1447 EMIDEQEYQRRVQKMHEQKEENYYFLTIDKDRMLDAGPKGNVARFMNHSCDPNCETQKWT 1506
Query: 206 IDGETRIGIFATRDIKKGENLTYDYQYE 233
++G+TR+G+FA DI G LT++Y E
Sbjct: 1507 VNGDTRVGLFANCDIPAGTELTFNYNLE 1534
>gi|301615056|ref|XP_002936997.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Xenopus (Silurana) tropicalis]
Length = 2440
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 79/198 (39%), Positives = 113/198 (57%), Gaps = 2/198 (1%)
Query: 40 KAIPYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC-SSGC 97
K PY IK N + K +I + C+C A+ + D +C MLL C S C
Sbjct: 1685 KPPPYKHIKVNRPVGKVQILTADLSEIPRCNCKATDENPCGQDSECINRMLLYECHPSVC 1744
Query: 98 KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
G C N+ F R ++++ +T G G+ DIK+GEFV EYVGE+ID++ C R+
Sbjct: 1745 PAGERCQNQAFSKRQYPEVEIFRTLSRGWGLRCRTDIKKGEFVNEYVGEMIDEEECRARI 1804
Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
+ TNFY+ +++D VIDA KGN +R++NH C PN E QKW ++G+TR+G+FA
Sbjct: 1805 RYAQEQDITNFYMLTLDKDRVIDAGPKGNFARFMNHCCQPNCETQKWTVNGDTRVGLFAL 1864
Query: 218 RDIKKGENLTYDYQYEFL 235
DIK LT++Y E L
Sbjct: 1865 CDIKAXVELTFNYNLECL 1882
>gi|307111585|gb|EFN59819.1| hypothetical protein CHLNCDRAFT_18588, partial [Chlorella
variabilis]
Length = 380
Score = 134 bits (336), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 74/203 (36%), Positives = 110/203 (54%), Gaps = 9/203 (4%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCT---ASPGSSGVCDRDCHCGMLLSSC-SSGCKC 99
+ I N+Y + K + EDD + C C A+ G++ C +C ML C C
Sbjct: 99 WQLITSNLYTHRERKEQDEDDIMICQCKKIWATDGTTVGCGAECLNRMLNIECVEEYCPS 158
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER--- 156
G+ C N+ F R K+ + + G G+ A ED+K G+F+IEY+GEV++++ R
Sbjct: 159 GHRCSNQMFTKREYSKLDVKRAGAKGFGLFAAEDMKAGQFLIEYLGEVLEEEEYHRRQGA 218
Query: 157 LWKMKHL--GETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGI 214
WK + G+ ++Y + VIDA+ +GN R+INHSC PN E QKW++ GE IG+
Sbjct: 219 AWKEYFIETGQRHYYFMNVGNGEVIDASRRGNLGRFINHSCEPNCETQKWVVHGELAIGL 278
Query: 215 FATRDIKKGENLTYDYQYEFLHD 237
F DI G LT+DY +E D
Sbjct: 279 FTLEDISAGTELTFDYNFERYGD 301
>gi|444722051|gb|ELW62755.1| putative histone-lysine N-methyltransferase NSD2 [Tupaia chinensis]
Length = 1421
Score = 133 bits (335), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 70/169 (41%), Positives = 100/169 (59%), Gaps = 3/169 (1%)
Query: 70 CTASPGSSGVC--DRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C P C D +C ML+ C C G C N+ F R + K+++T+ G
Sbjct: 859 CNCKPTDENPCGFDSECLNRMLMFECHPQVCPAGEYCQNQCFTKRQYPETKIIKTDGKGW 918
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+VA DI++GEFV EYVGE+ID++ C R+ T+FY+ I++D +IDA KGN
Sbjct: 919 GLVAKRDIRKGEFVNEYVGELIDEEECMARIKYAHENDITHFYMLTIDKDRIIDAGPKGN 978
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
SR++NHSC PN E KW ++G+TR+G+FA DI G LT++Y + L
Sbjct: 979 YSRFMNHSCQPNCETLKWTVNGDTRVGLFALCDIPAGTELTFNYNLDCL 1027
>gi|15213542|gb|AAK92049.1|AF322907_1 NSD1 [Homo sapiens]
Length = 2596
Score = 133 bits (335), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 71/172 (41%), Positives = 107/172 (62%), Gaps = 7/172 (4%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1792 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1851
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGE---TNFYLCEINRDMVIDATY 183
G+ EDI++GEFV EYVGE+ID++ C + ++KH E T+FY+ I++D +IDA
Sbjct: 1852 GLRTQEDIRKGEFVNEYVGELIDEEEC---MARIKHAHENDITHFYMLTIDKDRIIDAGP 1908
Query: 184 KGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
KGN SR++NHSC PN E KW ++G+TR+G+FA DI G LT++Y + L
Sbjct: 1909 KGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCL 1960
>gi|194907101|ref|XP_001981487.1| GG12082 [Drosophila erecta]
gi|190656125|gb|EDV53357.1| GG12082 [Drosophila erecta]
Length = 1441
Score = 133 bits (335), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 76/211 (36%), Positives = 116/211 (54%), Gaps = 7/211 (3%)
Query: 37 IKPKAIPYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSC 93
+K PYV IK N + R + LED +C PG C D C ML + C
Sbjct: 1160 VKLHPPPYVKIKTNKAVPPVRFTQNLED---LSACYCQPGDEHPCGPDSGCLNRMLFNEC 1216
Query: 94 SSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
+ CK G+ C N+ F+ R ++++V + G G+V E I G+FVIEYVGEVI+
Sbjct: 1217 NPEFCKAGSRCENQMFEQRKSPRLEVVYMNERGFGLVNREPIAEGDFVIEYVGEVINHAE 1276
Query: 153 CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
+ R+ + + + N+Y + +D +IDA KGN +R++NHSC PN E QKW ++ R+
Sbjct: 1277 FQRRMEQKQRGRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRV 1336
Query: 213 GIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
G+FA +DI LT++Y ++ L ++ C
Sbjct: 1337 GLFAIKDIPANTELTFNYLWDDLMNNSKKAC 1367
>gi|198432159|ref|XP_002123225.1| PREDICTED: similar to Wolf-Hirschhorn syndrome candidate 1 protein,
partial [Ciona intestinalis]
Length = 752
Score = 133 bits (335), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 66/156 (42%), Positives = 97/156 (62%), Gaps = 1/156 (0%)
Query: 81 DRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
+ +C ML+ C S C G C N+ FQ R ++ +T G + A + IK+GEF
Sbjct: 247 ESECLNRMLMYECHSDLCPAGEKCQNQRFQKREYPSSEVFKTSWGGWALRAKDLIKKGEF 306
Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
V EYVGE++D + C R+ TNFY+ I++D +IDA KGN SR++NHSC PN
Sbjct: 307 VSEYVGELVDSEECMRRIEDAHKNNVTNFYMLTIDKDRIIDAGPKGNYSRFMNHSCDPNC 366
Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
E QKW+++G+TR+G+FA R+I+ GE L ++Y + L
Sbjct: 367 ETQKWMVNGDTRVGLFALREIQDGEELMFNYNLDCL 402
>gi|320166309|gb|EFW43208.1| ASHH4 [Capsaspora owczarzaki ATCC 30864]
Length = 453
Score = 133 bits (335), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 72/194 (37%), Positives = 110/194 (56%), Gaps = 9/194 (4%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
+ I++N Y+ + + + D+ C+C AS C C ++ C G C+ +
Sbjct: 153 FTRIRKNAYIDVKRPKLIADNE--CNCKAS----APCTESCLNRLMFVECVRGSCRMEDR 206
Query: 103 CLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
C N+ FQ K +K+ QT G G+ + I G+FV+EYVGEV+ D E R+W
Sbjct: 207 CQNRNFQRHNWTKNLKVFQTPNAGYGLRCTDPIAPGQFVMEYVGEVVSDAERERRMWG-P 265
Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
+ G N Y E+ + ++IDA KG +R+INHSC PN ++KW ++GE R+GIFA+R I
Sbjct: 266 YAGNPNHYFLELEKGVLIDACSKGCDARFINHSCDPNCHVEKWNVNGEFRVGIFASRAIA 325
Query: 222 KGENLTYDYQYEFL 235
E L+YDY++E L
Sbjct: 326 PNEELSYDYRFETL 339
>gi|167526088|ref|XP_001747378.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774213|gb|EDQ87845.1| predicted protein [Monosiga brevicollis MX1]
Length = 1048
Score = 133 bits (335), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 76/196 (38%), Positives = 109/196 (55%), Gaps = 6/196 (3%)
Query: 44 YVFIKRNIYLTKRIK--RRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGC-KCG 100
+ IKRN+YL K R E+ I C C A+ + + C M+ C + C G
Sbjct: 694 FKLIKRNVYLPPAAKPPHRKEEQEI-CRCDAALNCADM--HSCLNRMMYIECDAKCCNNG 750
Query: 101 NSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKM 160
+C N+ FQ R K+ +TE G G+ +D++ G+ VIEYVGEVID C R+ +
Sbjct: 751 KNCRNQRFQRREYPKLIPFKTEHRGWGLRLGQDVEEGDLVIEYVGEVIDGAECRRRIDQY 810
Query: 161 KHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDI 220
+ ++FY+ + D +DA K N +R+INHSC PN QKW + GETR+GIFA R +
Sbjct: 811 EERNTSSFYILSLGSDTFVDAREKANMARFINHSCDPNCVTQKWNVLGETRVGIFAKRAL 870
Query: 221 KKGENLTYDYQYEFLH 236
KG LT+DY + L+
Sbjct: 871 AKGTELTFDYMLDCLN 886
>gi|367027906|ref|XP_003663237.1| methyltransferase-like protein [Myceliophthora thermophila ATCC
42464]
gi|347010506|gb|AEO57992.1| methyltransferase-like protein [Myceliophthora thermophila ATCC
42464]
Length = 950
Score = 133 bits (334), Expect = 7e-29, Method: Composition-based stats.
Identities = 71/188 (37%), Positives = 103/188 (54%), Gaps = 6/188 (3%)
Query: 62 EDDGIFCSCTAS--PGSSGVC--DRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKM 116
+ D + C C G + C D DC C SG C CG C N+ FQ + K+
Sbjct: 120 DHDALDCDCAEDWHDGQNHACGEDSDCINRATKIECVSGDCNCGEGCQNQRFQRKQYAKV 179
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+++TEK G G+ AD D++ +FV EYVGEVI++ T R+ K G +FY + ++
Sbjct: 180 SVIKTEKKGFGLRADTDLQANDFVYEYVGEVINEPTFRSRMVKYDREGIKHFYFMSLTKN 239
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
+DAT KGN R+ NHSC PN + KW++ + R+GIFA R I+ GE L ++Y + +
Sbjct: 240 EFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFAGRAIRAGEELVFNYNVDRYG 299
Query: 236 HDSLIAYC 243
D YC
Sbjct: 300 ADPQPCYC 307
>gi|344238567|gb|EGV94670.1| Histone-lysine N-methyltransferase NSD3 [Cricetulus griseus]
Length = 620
Score = 133 bits (334), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 74/199 (37%), Positives = 108/199 (54%), Gaps = 4/199 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSS--CSSG- 96
K PY IK N + K + ++ D C P C + C +S C
Sbjct: 253 KPPPYKHIKANKVIGK-VPIQVADLSEIPRCNCKPADENPCGLESECLNRMSQYECHPQV 311
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 312 CPAGDRCQNQCFTKRLYPDAEIIKTERKGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 371
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + TNFY+ + +D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 372 IKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 431
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 432 ICDIPAGMELTFNYNLDCL 450
>gi|47221386|emb|CAF97304.1| unnamed protein product [Tetraodon nigroviridis]
Length = 722
Score = 133 bits (334), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 68/169 (40%), Positives = 100/169 (59%), Gaps = 1/169 (0%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ S + +C MLL C C G CLN+ F R ++++ +T G
Sbjct: 396 CNCKATDESPCGMESECINRMLLYECHPQVCPAGERCLNQAFTKRQYSQVEIFRTLSRGW 455
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+G FV EYVGEVID++ C R+ + NFY+ +++D +IDA KGN
Sbjct: 456 GLRCVHDIKKGHFVSEYVGEVIDEEECRARIRHAQENDICNFYMLTLDKDRIIDAGPKGN 515
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
++R++NH C PN E QKW + G+TR+G+FA D+ G LT++Y E L
Sbjct: 516 EARFMNHCCQPNCETQKWTVSGDTRVGLFALVDVPAGTELTFNYNLECL 564
>gi|156391978|ref|XP_001635826.1| predicted protein [Nematostella vectensis]
gi|156222924|gb|EDO43763.1| predicted protein [Nematostella vectensis]
Length = 348
Score = 132 bits (333), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 73/186 (39%), Positives = 105/186 (56%), Gaps = 3/186 (1%)
Query: 51 IYLTKRIKR-RLEDDGIFCSCTASPGSSGV--CDRDCHCGMLLSSCSSGCKCGNSCLNKP 107
I L+ R KR E + C C P + C DC +L+ C+ C CG+ C N+
Sbjct: 11 ILLSSRKKRINKEVRKMTCECYPEPDNPDFVGCGEDCLNRLLMIECNHRCPCGDLCTNRR 70
Query: 108 FQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN 167
FQ K+++ +TEK G G+ ED+++ +FVIEY GEV++ + + R + +
Sbjct: 71 FQEGCKIKVEVFKTEKKGWGVKTLEDLEQNQFVIEYCGEVMNYRDFQSRAQRYDRQKRRH 130
Query: 168 FYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLT 227
+Y + D +IDAT KG+ SR+INHSC PN QKW ++G RIG F R IK GE LT
Sbjct: 131 YYFMTLRADEIIDATLKGSISRFINHSCEPNCVTQKWTVNGLLRIGFFTLRTIKAGEELT 190
Query: 228 YDYQYE 233
+DYQ +
Sbjct: 191 FDYQLQ 196
>gi|344241969|gb|EGV98072.1| putative histone-lysine N-methyltransferase ASH1L [Cricetulus
griseus]
Length = 1546
Score = 132 bits (332), Expect = 1e-28, Method: Composition-based stats.
Identities = 78/227 (34%), Positives = 122/227 (53%), Gaps = 18/227 (7%)
Query: 19 KLLKQIGNPVEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFC 68
K L+Q ++F+LP W + KP Y I+ N+Y+ +K + C
Sbjct: 1132 KYLRQ--KRIDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTC 1187
Query: 69 SCTASPGSSGV-CDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRP-VKKMKLVQTEKCG 125
+C + C DC M+ + CS + C CG C N+ Q V+ ++ + E+ G
Sbjct: 1188 NCKKPDDDTRKGCGDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKG 1247
Query: 126 AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKG 185
GI E +K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ G
Sbjct: 1248 WGIRTKEPLKAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMG 1306
Query: 186 NKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
N++R+INHSC PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 1307 NEARFINHSCDPNCEMQKWSVNGVYRIGLYALKDVLAGTELTYDYNF 1353
>gi|348530060|ref|XP_003452529.1| PREDICTED: hypothetical protein LOC100707110 [Oreochromis niloticus]
Length = 2876
Score = 131 bits (329), Expect = 3e-28, Method: Composition-based stats.
Identities = 79/226 (34%), Positives = 118/226 (52%), Gaps = 16/226 (7%)
Query: 19 KLLKQIGNPVEFELPD---WFIK-------PKAIPYVFIKRNIYLTKRIKRRLEDDGIFC 68
K L+Q ++F+LP W K P Y I+ N+Y+ + E C
Sbjct: 1913 KYLRQ--KRIDFQLPYDILWLWKHDQLHKRPDVPLYKKIRSNVYVDVKPLSGYETTTCNC 1970
Query: 69 SCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRP-VKKMKLVQTEKCGA 126
G C +C M + CS S C CG+ C N+ Q V+ ++ +TE G
Sbjct: 1971 RLPEDSTEKG-CMDECLNRMSFAECSPSTCPCGDQCDNQHIQRHEWVQCLERFRTEGKGW 2029
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
GI E ++ G+F+IEY+GEV+ +Q R+ + ++ + Y ++ MVID+ GN
Sbjct: 2030 GIRTKESLRSGQFIIEYLGEVVSEQEFRSRMME-QYFSHSGHYCLNLDSGMVIDSYRMGN 2088
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
++R+INHSC PN EMQKW ++G RIG+FA +DI G LTYDY +
Sbjct: 2089 EARFINHSCEPNCEMQKWSVNGVYRIGLFALKDISSGTELTYDYNF 2134
>gi|342319484|gb|EGU11432.1| Histone-lysine N-methyltransferase, H3 lysine-36 specific
[Rhodotorula glutinis ATCC 204091]
Length = 844
Score = 131 bits (329), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 74/189 (39%), Positives = 103/189 (54%), Gaps = 9/189 (4%)
Query: 64 DGIFCSC-----TASPGSSGVCDRDCHC--GMLLSSCSSG-CKCGNSCLNKPFQNRPVKK 115
D C C T P + C +C M+ C G C+CG C N+ FQ R
Sbjct: 235 DASRCDCSWNTSTREPDADEACGEHSNCVNRMMQIECLMGDCRCGRHCQNQRFQKRQYAP 294
Query: 116 MKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINR 175
+++VQTEK G G+ A DI FV EYVGEVI Q + ++ + + G +FY ++R
Sbjct: 295 IEIVQTEKKGFGVRAGADISADTFVYEYVGEVIGPQPFQRKMKEYANEGIKHFYFMALDR 354
Query: 176 DMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-F 234
D+ IDAT KG K R++NHSC PN + KW + + R+GIF RDIK E LT++Y + +
Sbjct: 355 DVFIDATKKGGKGRFLNHSCNPNCVVAKWTVGRKMRMGIFTKRDIKMHEELTFNYNVDRY 414
Query: 235 LHDSLIAYC 243
H + YC
Sbjct: 415 GHVAQECYC 423
>gi|238883879|gb|EEQ47517.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 846
Score = 130 bits (328), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 61/136 (44%), Positives = 92/136 (67%), Gaps = 4/136 (2%)
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C CG+ C N+ FQNR K+K++QTE G G++A++DI+ +F+ EY+GEVID+ + +R
Sbjct: 125 CLCGDDCQNQRFQNRQYSKVKVIQTELKGYGLIAEQDIEENQFIYEYIGEVIDEISFRQR 184
Query: 157 L--WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGI 214
+ + ++HL +FY ++ D IDAT KG+ R+INHSC PN + KW + R+GI
Sbjct: 185 MIEYDLRHL--KHFYFMMLSNDSFIDATEKGSLGRFINHSCNPNAFVDKWHVGDRLRMGI 242
Query: 215 FATRDIKKGENLTYDY 230
FA R I +GE +T+DY
Sbjct: 243 FAKRKISRGEEITFDY 258
>gi|281206847|gb|EFA81031.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 1363
Score = 130 bits (328), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 90/246 (36%), Positives = 127/246 (51%), Gaps = 16/246 (6%)
Query: 6 KNSDNSRIGHAFNKLLKQIGNPVE-----FELPDWFIKPKAIPYVFIKRNIYLTKRIKRR 60
KN DNS + + +K I + +E ++ K + + + K + R KR
Sbjct: 874 KNEDNSIMNLPEPQAIKMIEDNIENGASYSQIAVLIQKARGKSFKYTKS----SNRRKRT 929
Query: 61 LEDDGI-FCSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKKMKL 118
L++D I C+C +G C +C C S CK G C N+ FQ + +
Sbjct: 930 LQEDDIAICNCDII--DNGDCGPECLNRRSFIECEPSNCKLGKHCKNQRFQRQEYALIAP 987
Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
+K G G+ A E I +FVIEY GEVI +R+ + GE FY ++ V
Sbjct: 988 FNAKKKGWGLKAKEKISAHQFVIEYCGEVITRAQSMDRMREAD--GEKYFYFLTLDSKEV 1045
Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+DA+ KGN +R+INHSC PN E QKW +DGETRIGIFA +DI+ G LT+DY YE + S
Sbjct: 1046 LDASRKGNLARFINHSCDPNCETQKWSVDGETRIGIFALKDIEAGTELTFDYNYERVGSS 1105
Query: 239 LIA-YC 243
+ YC
Sbjct: 1106 KQSCYC 1111
>gi|384499018|gb|EIE89509.1| hypothetical protein RO3G_14220 [Rhizopus delemar RA 99-880]
Length = 962
Score = 130 bits (328), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 74/210 (35%), Positives = 112/210 (53%), Gaps = 13/210 (6%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG--------MLLSSCSS 95
YV + N+Y + + ++ + C C V D D CG M+ C++
Sbjct: 90 YVNLIENVYCGSATGKTIAEESMPCECKYL---QEVDDLDAACGDDNYCINRMMFMECTA 146
Query: 96 -GCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCE 154
C CG C N+ FQ R ++ +++TEK G G+ A D+ F++EY+GEVI +Q
Sbjct: 147 EDCPCGRYCRNRRFQLRQFARVDVIRTEKKGFGLRALTDLPTNSFIMEYIGEVIPNQEFI 206
Query: 155 ERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGI 214
R + + G ++Y + D +IDAT KG +R+INHSC PN QKW++ RIGI
Sbjct: 207 RRTKEYEASGLEHYYFMTLKTDEIIDATKKGCLARFINHSCNPNCVTQKWVVGKNMRIGI 266
Query: 215 FATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
F R IK GE LT+DY++E + + + YC
Sbjct: 267 FTNRGIKAGEELTFDYKFERYGAQAQVCYC 296
>gi|68483319|ref|XP_714401.1| likely histone lysine methyltransferase Set2p [Candida albicans
SC5314]
gi|68483410|ref|XP_714361.1| likely histone lysine methyltransferase Set2p [Candida albicans
SC5314]
gi|74585304|sp|Q59XV0.1|SET2_CANAL RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|46435920|gb|EAK95292.1| likely histone lysine methyltransferase Set2p [Candida albicans
SC5314]
gi|46435962|gb|EAK95333.1| likely histone lysine methyltransferase Set2p [Candida albicans
SC5314]
Length = 844
Score = 130 bits (328), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 61/136 (44%), Positives = 92/136 (67%), Gaps = 4/136 (2%)
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C CG+ C N+ FQNR K+K++QTE G G++A++DI+ +F+ EY+GEVID+ + +R
Sbjct: 125 CLCGDDCQNQRFQNRQYSKVKVIQTELKGYGLIAEQDIEENQFIYEYIGEVIDEISFRQR 184
Query: 157 L--WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGI 214
+ + ++HL +FY ++ D IDAT KG+ R+INHSC PN + KW + R+GI
Sbjct: 185 MIEYDLRHL--KHFYFMMLSNDSFIDATEKGSLGRFINHSCNPNAFVDKWHVGDRLRMGI 242
Query: 215 FATRDIKKGENLTYDY 230
FA R I +GE +T+DY
Sbjct: 243 FAKRKISRGEEITFDY 258
>gi|384500869|gb|EIE91360.1| hypothetical protein RO3G_16071 [Rhizopus delemar RA 99-880]
Length = 883
Score = 130 bits (328), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 71/196 (36%), Positives = 104/196 (53%), Gaps = 6/196 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPG---SSGVCDRDCHC---GMLLSSCSSGC 97
Y I NIY+ R + ++ + C C P S C D C M + + C
Sbjct: 173 YENIGENIYIGSATGRSMAEESMPCECKYDPDLDDPSEACGDDNACINRMMFMECIAQDC 232
Query: 98 KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
CG C N+ FQ ++ +++TEK G G+ A D+ F++EY+GEVI R
Sbjct: 233 PCGRLCRNRRFQLGQYARVDVIRTEKKGYGLRALTDLSSNSFIMEYIGEVITQNEFLHRT 292
Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
+ G ++Y + D +IDAT KG +R++NHSC PN QKW+I + RIGIF +
Sbjct: 293 REYDAQGFKHYYFMTLKNDEIIDATRKGCLARFMNHSCRPNCVTQKWVIGKKMRIGIFTS 352
Query: 218 RDIKKGENLTYDYQYE 233
R+IK GE LT+DY++E
Sbjct: 353 RNIKAGEELTFDYKFE 368
>gi|357627347|gb|EHJ77076.1| hypothetical protein KGM_14526 [Danaus plexippus]
Length = 1912
Score = 130 bits (328), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 78/245 (31%), Positives = 127/245 (51%), Gaps = 16/245 (6%)
Query: 3 AAKKNSDNSRIGHAFNKLLK---------QIGNPVEFELPDWFIKPKAIP--YVFIKRNI 51
+ K +S S+I AF ++ ++ P E E D I +P YV +K N
Sbjct: 1540 SGKVSSQKSKIDAAFTMAMEHAQRACSILKMAAPNEEESSD--IASSLLPPHYVKLKVNK 1597
Query: 52 YLTKRIKRRLE-DDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSGCKCGNSCLNKPF 108
++++ ++ C P C C MLL+ C C+ G+ C N+ F
Sbjct: 1598 PCGSLCGKKIDLEESSLTQCECDPNDVDPCGPYTQCLNRMLLTECGPTCRAGDRCNNRAF 1657
Query: 109 QNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNF 168
+ R ++ +T G G+ +D++ G+FVIEYVGE+ID++ R+ + + + NF
Sbjct: 1658 EKRLYPRLGPYRTPHRGWGLRTMQDLRAGQFVIEYVGELIDEEEFRRRMNRKHEVRDENF 1717
Query: 169 YLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTY 228
Y ++++ +IDA KGN +R++NHSC PN E QKW + G+ R+G+FA RDI LT+
Sbjct: 1718 YFLTLDKERMIDAGPKGNLARFMNHSCEPNCETQKWTVLGDVRVGLFALRDIPANSELTF 1777
Query: 229 DYQYE 233
+Y E
Sbjct: 1778 NYNLE 1782
>gi|322799945|gb|EFZ21071.1| hypothetical protein SINV_04653 [Solenopsis invicta]
Length = 1817
Score = 130 bits (326), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 76/216 (35%), Positives = 114/216 (52%), Gaps = 15/216 (6%)
Query: 23 QIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASP-----GSS 77
+I P+ E D ++ + + ++ N+YLT+R + E + C C + G
Sbjct: 768 KISLPMVLETEDREMEERLSQFEHLQENLYLTERYTNK-ETKRMVCDCFLTEEEIERGEL 826
Query: 78 GVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRG 137
G C DC +L+ C C G+ C N+ FQN K ++ +TEK G G+ A +I G
Sbjct: 827 G-CGEDCLNRLLMIECGPRCVVGDRCTNRRFQNCDYAKCEVFRTEKKGFGLRAVVNITVG 885
Query: 138 EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
EF++EYVGEV+D + R + ++Y + D +IDAT KGN SR+INHSC P
Sbjct: 886 EFIMEYVGEVVDPKDFRRRAKEYSKDKNRHYYFMALKSDQIIDATMKGNISRFINHSCDP 945
Query: 198 NTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
N E QKW ++GE RI GE +T+DY ++
Sbjct: 946 NAETQKWTVNGELRIAA--------GEEITFDYHFQ 973
>gi|198476699|ref|XP_002132425.1| GA25456 [Drosophila pseudoobscura pseudoobscura]
gi|198137811|gb|EDY69827.1| GA25456 [Drosophila pseudoobscura pseudoobscura]
Length = 483
Score = 130 bits (326), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 65/154 (42%), Positives = 87/154 (56%)
Query: 80 CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
C C ML+ C C G C NK FQ ++ TEK G GI A+ I GEF
Sbjct: 137 CGSGCINRMLMIECGPLCSNGERCTNKRFQLNQCWPCRVFHTEKKGCGITAELAIPAGEF 196
Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
++EYVGEVID E R + ++Y + +IDAT GN SR++NHSC PN
Sbjct: 197 IMEYVGEVIDSAEFERRQHRYAEGRNRHYYFMALRGGAIIDATMGGNISRFMNHSCDPNA 256
Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
E QKW ++GE RIG+F+ + I GE +T+DY+Y+
Sbjct: 257 ETQKWTVNGELRIGLFSVKTIMPGEEITFDYRYQ 290
>gi|328768890|gb|EGF78935.1| hypothetical protein BATDEDRAFT_90118 [Batrachochytrium
dendrobatidis JAM81]
Length = 1361
Score = 130 bits (326), Expect = 6e-28, Method: Composition-based stats.
Identities = 80/227 (35%), Positives = 119/227 (52%), Gaps = 12/227 (5%)
Query: 18 NKLLKQIGNPVEFELPDWF-----IKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSC-- 70
N+ ++ + PV + W+ +K YV I+ NIY R ++ D + C+C
Sbjct: 135 NESVQALSGPVCGQNGQWYENESVLKEVTEAYVHIESNIY-RGRTNGKVSSDFMQCNCEY 193
Query: 71 -TASPGSSGVC--DRDCHCGMLLSSCSS-GCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
++ S C D DC L CS+ C GN+C N+ FQ +++ + G
Sbjct: 194 NSSRDPSWMACGEDSDCINRQLSLECSAEDCPTGNACQNRRFQLCQYSPIQVARAGSKGF 253
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
GI A E+I G F+IEY GEVI +R+ + + +FY + +D IDA+ KGN
Sbjct: 254 GIYARENIAGGAFIIEYCGEVIPASLFGKRITEHSNNSAQHFYFMSLKKDEYIDASKKGN 313
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
SRY+NHSC PN +QKW++ RIG+FA R I K LT+DY++E
Sbjct: 314 LSRYLNHSCDPNCSLQKWLVGDTIRIGLFALRAIPKNAELTFDYKFE 360
>gi|344302485|gb|EGW32759.1| hypothetical protein SPAPADRAFT_54781 [Spathaspora passalidarum
NRRL Y-27907]
Length = 718
Score = 130 bits (326), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 62/155 (40%), Positives = 91/155 (58%), Gaps = 3/155 (1%)
Query: 79 VCDRDCHCGMLLSSC---SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIK 135
C D HC ++S + C CGN C N+ FQ R + + QTE G G+ AD +I
Sbjct: 75 ACGEDSHCINRVTSVECMNRHCLCGNDCQNQRFQKREYAAVSVFQTELKGYGLRADLEIG 134
Query: 136 RGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSC 195
G+F+ EY GEVID+ T +R+ + +FY + D IDAT +G+ +R++NHSC
Sbjct: 135 EGQFIYEYTGEVIDEATFRQRMVEYDQKSFKHFYFMMLKSDSFIDATVRGSLARFVNHSC 194
Query: 196 CPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
PN + KW++ + R+GIFA R I +GE +T+DY
Sbjct: 195 NPNAYVDKWVVGDKLRMGIFAKRKIARGEEITFDY 229
>gi|297842431|ref|XP_002889097.1| hypothetical protein ARALYDRAFT_316589 [Arabidopsis lyrata subsp.
lyrata]
gi|297334938|gb|EFH65356.1| hypothetical protein ARALYDRAFT_316589 [Arabidopsis lyrata subsp.
lyrata]
Length = 525
Score = 130 bits (326), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 75/193 (38%), Positives = 105/193 (54%), Gaps = 2/193 (1%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS-SGVCDRDCHCGMLLSSCSSG-CKCGN 101
Y I +N + ++ K++ E+D C C G C C + + C+ G C CG
Sbjct: 17 YEHIYQNDFSYRKHKKQKEEDISICECKFDFGDPDSACGERCLNVITNTECTPGYCPCGV 76
Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
C N+ FQ K KL++ E G G+VA EDIK G+F+IEY GEVI + + R +
Sbjct: 77 YCKNQKFQKCEYAKTKLIKCEGRGWGLVALEDIKAGQFIIEYCGEVISWKEAKRRAQTYE 136
Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
G + Y+ +N IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA I
Sbjct: 137 THGVKDAYIISLNASEAIDATKKGSLARFINHSCRPNCETRKWNVLGEVRVGIFAKESIS 196
Query: 222 KGENLTYDYQYEF 234
L YDY +E+
Sbjct: 197 PRTELAYDYNFEW 209
>gi|195155589|ref|XP_002018686.1| GL25818 [Drosophila persimilis]
gi|194114839|gb|EDW36882.1| GL25818 [Drosophila persimilis]
Length = 476
Score = 130 bits (326), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 65/154 (42%), Positives = 88/154 (57%)
Query: 80 CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
C C ML+ C C G C NK FQ ++ +TEK G GI A+ I GEF
Sbjct: 137 CGSGCINRMLMIECGPLCSNGERCTNKRFQLNQCWPCRVFRTEKKGCGISAELAIPAGEF 196
Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
++EYVGEVID E R + ++Y + +IDAT GN SR++NHSC PN
Sbjct: 197 IMEYVGEVIDSAEFERRQHRYAEGRNRHYYFMALRGGAIIDATMGGNISRFMNHSCDPNA 256
Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
E QKW ++GE RIG+F+ + I GE +T+DY+Y+
Sbjct: 257 ETQKWTVNGELRIGLFSVKSIMPGEEITFDYRYQ 290
>gi|451846131|gb|EMD59442.1| hypothetical protein COCSADRAFT_258710 [Cochliobolus sativus
ND90Pr]
Length = 923
Score = 130 bits (326), Expect = 7e-28, Method: Composition-based stats.
Identities = 62/164 (37%), Positives = 92/164 (56%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG C N+ FQ + + +++TEK G G+ A++D+ GEFV
Sbjct: 175 DSDCINRATKMECVGDCGCGRRCQNQRFQRKQYANVAVIKTEKKGFGLRANKDMAPGEFV 234
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GEVID++T R+ + G +FY + + +DAT KGN R+ NHSC PN
Sbjct: 235 FEYIGEVIDERTFRRRMGQYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCF 294
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIF R +K GE L ++Y + + D YC
Sbjct: 295 VDKWVVGDKLRMGIFVERQVKAGEELVFNYNVDRYGADPQPCYC 338
>gi|359493195|ref|XP_003634538.1| PREDICTED: histone-lysine N-methyltransferase ASHH1-like [Vitis
vinifera]
Length = 413
Score = 130 bits (326), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 81/236 (34%), Positives = 117/236 (49%), Gaps = 12/236 (5%)
Query: 2 PAAKKNSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIP-YVFIKRNIYLTKRIKRR 60
PAA++ + R+ + + Q + + E +P ++ I RN + ++ ++
Sbjct: 8 PAARRTLNRGRVASSLCHIDPQGQSDQQIE---------GVPEFIHINRNDFSYRKHIKQ 58
Query: 61 LEDDGIFCSCT-ASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKL 118
E+D C C + C C + C+ C C C N+ FQ R K KL
Sbjct: 59 KENDIAICECKYKANDPDSACGERCWNVLTSIECTPRYCPCSIHCKNQRFQKREYAKTKL 118
Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
+ E G G++A E+IK GEFV+EY GEVI R G + Y+ +N
Sbjct: 119 FRAEGRGWGLLATENIKAGEFVMEYCGEVISRTEARGRSQVYVSQGLKDVYIIPLNAREC 178
Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEF 234
IDAT KGN +R+INHSC PN E KW + GE R+GIFA R+I G LTY Y +E+
Sbjct: 179 IDATKKGNLARFINHSCQPNCETMKWSVLGEDRVGIFALRNISVGTELTYSYNFEW 234
>gi|451994892|gb|EMD87361.1| hypothetical protein COCHEDRAFT_1144880 [Cochliobolus
heterostrophus C5]
Length = 923
Score = 129 bits (325), Expect = 7e-28, Method: Composition-based stats.
Identities = 75/224 (33%), Positives = 112/224 (50%), Gaps = 10/224 (4%)
Query: 27 PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTAS--PGS--SGVC-- 80
P+ +LPD K + + Y K I D + C C P + + C
Sbjct: 118 PLYLDLPDSTEAAKET-FTVLTECTYANKTIGT--TDPALECDCQEEYDPATRKNHACGE 174
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG C N+ FQ + + +++TEK G G+ A++D+ GEFV
Sbjct: 175 DSDCINRATKMECVGDCGCGRRCQNQRFQRKQYANVAVIKTEKKGFGLRANKDMAPGEFV 234
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GEVID++T R+ + G +FY + + +DAT KGN R+ NHSC PN
Sbjct: 235 FEYIGEVIDERTFRRRMGQYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCF 294
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIF R +K GE L ++Y + + D YC
Sbjct: 295 VDKWVVGDKLRMGIFVERQVKAGEELVFNYNVDRYGADPQPCYC 338
>gi|300124007|emb|CBK25278.2| unnamed protein product [Blastocystis hominis]
Length = 223
Score = 129 bits (325), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 66/146 (45%), Positives = 87/146 (59%)
Query: 95 SGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCE 154
S C CG+ C N+ F+ +K+ L E+ G G+ D I RG F+IEYVGEVI + E
Sbjct: 45 STCPCGDDCTNQMFRRHEYQKLVLFYDEQKGWGVRTDVFIPRGTFIIEYVGEVISQKESE 104
Query: 155 ERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGI 214
R K G+ + Y + D +IDAT KGN SR+INHSC PN E+QKW +GI
Sbjct: 105 YRRQVGKRQGQMHMYYMSLAPDQLIDATDKGNASRFINHSCDPNCEIQKWATSSTYSVGI 164
Query: 215 FATRDIKKGENLTYDYQYEFLHDSLI 240
FA RDI GE +T+DYQ+E + + I
Sbjct: 165 FAIRDIIPGEEITFDYQFERIGNGAI 190
>gi|391338508|ref|XP_003743600.1| PREDICTED: probable histone-lysine N-methyltransferase Mes-4-like
[Metaseiulus occidentalis]
Length = 890
Score = 129 bits (325), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 74/208 (35%), Positives = 113/208 (54%), Gaps = 6/208 (2%)
Query: 29 EFELPDWFIKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC 86
EF+L + K P F I N ++ ++R D CSC S S D C
Sbjct: 654 EFQLNLQAVHSKKKPPKFRKISSNKMVSASCRKRRLHDVDPCSCKVS---SPCVDFTCTN 710
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQT-EKCGAGIVADEDIKRGEFVIEYVG 145
+ CS C C N+ FQ R + K+ T + G G+ A ED+K+ +FVIEY+G
Sbjct: 711 RAVYVECSDDCAQREKCKNRRFQERAYRATKVFLTPDGRGWGLQAVEDLKKDDFVIEYIG 770
Query: 146 EVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWI 205
E+ID + C+ RL ++ +G+ N Y E + + +IDA GN++R+INHSC PN +KW
Sbjct: 771 EIIDSKECDRRLSQLHAIGDPNVYFLEADANYIIDAGPAGNEARFINHSCNPNLRTEKWT 830
Query: 206 IDGETRIGIFATRDIKKGENLTYDYQYE 233
++ + +G+FA R+I GE LT++Y +
Sbjct: 831 VNDKLCVGLFAKRNIAAGEELTFNYNMQ 858
>gi|50553372|ref|XP_504097.1| YALI0E18260p [Yarrowia lipolytica]
gi|74633562|sp|Q6C5G5.1|SET2_YARLI RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|49649966|emb|CAG79692.1| YALI0E18260p [Yarrowia lipolytica CLIB122]
Length = 768
Score = 129 bits (325), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 70/179 (39%), Positives = 100/179 (55%), Gaps = 5/179 (2%)
Query: 66 IFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCG 125
+ C C P + CD D C L+S +C C NK FQ + + ++ TEK G
Sbjct: 48 MACDCKPGPTA---CDEDSGCINRLTSIEC-VRCCKGCQNKRFQGKKYASVDVISTEKKG 103
Query: 126 AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKG 185
G+ A +DI GEFV EYVGEVID+ T +ER G +FY + + IDAT KG
Sbjct: 104 FGLRATKDIAAGEFVYEYVGEVIDEPTFKERTAIYTTQGVKHFYFMMLQKGEFIDATAKG 163
Query: 186 NKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
R+ NHSC PN ++KW++ R+GIFA+R I++GE +T+DY + + ++ YC
Sbjct: 164 GLGRFCNHSCAPNGHVEKWVVGKRLRMGIFASRHIQRGEEVTFDYNVDRYGAEAQACYC 222
>gi|395857586|ref|XP_003801172.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Otolemur garnettii]
Length = 1371
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 76/195 (38%), Positives = 106/195 (54%), Gaps = 4/195 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 987 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1045
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGE+ID++ C R
Sbjct: 1046 CPAGEYCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMAR 1105
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ T+FY+ I++D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1106 IKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1165
Query: 217 TRDIKKGENLTYDYQ 231
DI G L Y+
Sbjct: 1166 VCDIPAGTALGGLYE 1180
>gi|195352984|ref|XP_002042990.1| GM16309 [Drosophila sechellia]
gi|194127055|gb|EDW49098.1| GM16309 [Drosophila sechellia]
Length = 1418
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/211 (36%), Positives = 115/211 (54%), Gaps = 7/211 (3%)
Query: 37 IKPKAIPYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSC 93
IK PYV IK N + R + LED +C P C + C ML + C
Sbjct: 1147 IKLHPPPYVKIKTNKAVPPLRFSQNLED---LSTCNCLPVDEHPCGPEAGCLNRMLFNEC 1203
Query: 94 SSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
+ CK G+ C N+ F+ R ++++V + G G+V E I G+FVIEYVGEVI+
Sbjct: 1204 NPEYCKAGSLCENRMFEQRKSPRLEVVYMNERGFGLVNREPIAAGDFVIEYVGEVINHAE 1263
Query: 153 CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
+ R+ + + + N+Y + +D +IDA KGN +R++NHSC PN E QKW ++ R+
Sbjct: 1264 FQRRMEQKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRV 1323
Query: 213 GIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
GIFA +DI LT++Y ++ L ++ C
Sbjct: 1324 GIFAIKDIPVNTELTFNYLWDDLMNNSKKAC 1354
>gi|326933478|ref|XP_003212830.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L-like
[Meleagris gallopavo]
Length = 2974
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/218 (34%), Positives = 118/218 (54%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2054 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDEN 2111
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
G C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2112 GKGCMEDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2171
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2172 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2230
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2231 CNPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2268
>gi|363742848|ref|XP_422858.3| PREDICTED: probable histone-lysine N-methyltransferase ASH1L [Gallus
gallus]
Length = 2954
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/218 (34%), Positives = 118/218 (54%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2034 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDEN 2091
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
G C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2092 GKGCMEDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2151
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2152 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2210
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2211 CNPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2248
>gi|432881031|ref|XP_004073771.1| PREDICTED: histone-lysine N-methyltransferase ASH1L-like [Oryzias
latipes]
Length = 2798
Score = 128 bits (322), Expect = 2e-27, Method: Composition-based stats.
Identities = 80/227 (35%), Positives = 120/227 (52%), Gaps = 18/227 (7%)
Query: 19 KLLKQIGNPVEFELP---------DWFIKPKAIP-YVFIKRNIYLTKRIKRRLEDDGIFC 68
K L+Q ++F+LP D K +P Y I+ N+Y+ +K + C
Sbjct: 1850 KYLRQ--KRIDFQLPYDILWLWKHDQLSKRPDVPLYKKIRSNVYVD--VKPFSGYETTTC 1905
Query: 69 SCTASPGSSGV-CDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRP-VKKMKLVQTEKCG 125
SC S C DC M + CS S C CG C N+ Q V+ ++ + E G
Sbjct: 1906 SCKPPENSDDKGCLDDCLNRMSYAECSPSTCPCGEQCDNQHIQRHEWVQCLERFRAEGKG 1965
Query: 126 AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKG 185
GI E ++ G+F+IEY+GEV+ +Q R+ + ++ + Y ++ MVID+ G
Sbjct: 1966 WGIRTKEPLRAGQFIIEYLGEVVSEQEFRSRMME-QYFSHSGHYCLNLDSGMVIDSYRMG 2024
Query: 186 NKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
N++R+INHSC PN EMQKW ++G RIG+FA +D+ G LTYDY +
Sbjct: 2025 NEARFINHSCDPNCEMQKWSVNGVYRIGLFALKDVSSGTELTYDYNF 2071
>gi|326676505|ref|XP_692254.4| PREDICTED: probable histone-lysine N-methyltransferase ASH1L [Danio
rerio]
Length = 2933
Score = 128 bits (322), Expect = 2e-27, Method: Composition-based stats.
Identities = 78/227 (34%), Positives = 122/227 (53%), Gaps = 18/227 (7%)
Query: 19 KLLKQIGNPVEFELP---------DWFIKPKAIP-YVFIKRNIYLTKRIKRRLEDDGIFC 68
K L+Q ++F+LP D K +P Y I+ N+Y+ +K + C
Sbjct: 1957 KYLRQ--KRIDFQLPYDILWLWKHDQLYKRPDVPLYKKIRSNVYVD--VKPLSGYEATTC 2012
Query: 69 SCTASPGSSGV-CDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRP-VKKMKLVQTEKCG 125
+C SS C DC M+ + CS S C C + C N+ Q V+ ++ + E G
Sbjct: 2013 NCRLPDDSSEKGCQDDCLNRMIYAECSPSTCPCSDQCDNQRIQKHEWVQCLERFRAEGKG 2072
Query: 126 AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKG 185
GI + ++ G+F+IEY+GEV+ +Q R+ + ++ + Y ++ MVID+ G
Sbjct: 2073 WGIRTKQPLRAGQFIIEYLGEVVSEQEFRSRMME-QYFSHSGHYCLNLDSGMVIDSYRMG 2131
Query: 186 NKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
N++R++NHSC PN EMQKW ++G RIG+FA +DI G LTYDY +
Sbjct: 2132 NEARFVNHSCEPNCEMQKWSVNGVYRIGLFALKDINSGTELTYDYNF 2178
>gi|281338719|gb|EFB14303.1| hypothetical protein PANDA_018255 [Ailuropoda melanoleuca]
Length = 2981
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/218 (34%), Positives = 118/218 (54%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2060 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPEDDT 2117
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
G C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2118 GKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2177
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2178 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2236
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2237 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2274
>gi|73960946|ref|XP_537251.2| PREDICTED: probable histone-lysine N-methyltransferase ASH1L isoform
1 [Canis lupus familiaris]
Length = 2965
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/218 (34%), Positives = 118/218 (54%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2044 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPEDDT 2101
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
G C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2102 GKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2161
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2162 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2220
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2221 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2258
>gi|390355937|ref|XP_003728662.1| PREDICTED: uncharacterized protein LOC579712 isoform 2
[Strongylocentrotus purpuratus]
Length = 3111
Score = 128 bits (321), Expect = 2e-27, Method: Composition-based stats.
Identities = 80/221 (36%), Positives = 116/221 (52%), Gaps = 24/221 (10%)
Query: 29 EFELP-----DWF-----IKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTA--SPGS 76
+FELP W + +A + I+ NIY +K ++ + + CSC +P
Sbjct: 2203 DFELPYDIWWQWVHNKLPSRTQAPKFKKIRNNIYFD--LKPTIQAEVVRCSCKRPYNPEE 2260
Query: 77 SGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADEDI 134
G C DC M+ CSS C CG+ C N+ Q ++ TE G G+ + I
Sbjct: 2261 KG-CGEDCLNRMIQHECSSASCPCGDQCANQVIQRHNWSPGLRRFMTENRGWGVRTLQPI 2319
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKM---KHLGETNFYLCEINRDMVIDATYKGNKSRYI 191
+ F+IEY+GEVI + LWK + + + Y ++ MVID GN+ R++
Sbjct: 2320 RHSSFIIEYLGEVIS----VKELWKRALDDYQYQKHHYCLNLDGGMVIDGYRYGNEGRFV 2375
Query: 192 NHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
NHSC PN EMQKW+++G RIG+FA RDI+ GE LTYDY +
Sbjct: 2376 NHSCNPNCEMQKWMVNGLYRIGMFALRDIQPGEELTYDYNF 2416
>gi|452822434|gb|EME29453.1| chondroitin-glucuronate 5-epimerase [Galdieria sulphuraria]
Length = 1135
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 73/196 (37%), Positives = 110/196 (56%), Gaps = 7/196 (3%)
Query: 40 KAIPYVFIKRNIYL---TKRIKRRLEDDGIF-CSCTASPGSSGVCDRDCHCGMLLSSCSS 95
K P+V +KRN Y+ I+ + E D + C C++ D C + + C
Sbjct: 736 KPAPFVRLKRNEYICDYIGPIRGKGEMDKVMICHCSSVADLPCCMDSSCLNRVSFTECHP 795
Query: 96 G-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCE 154
C+ G+ C N+ FQ ++KL Q + G G+ A E + +G F+IEY GEVID + E
Sbjct: 796 EYCRTGSKCQNQRFQKCEYARVKLFQAGERGWGLKAAEFLPKGTFIIEYQGEVIDTEEYE 855
Query: 155 ERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGI 214
R K ++ GE +FY ++ D +IDA+ K N +R+INHSC PN +KW + GE +GI
Sbjct: 856 RR--KRRYAGERHFYFMSLDSDHMIDASRKSNMARFINHSCQPNCHTEKWTVLGEPCVGI 913
Query: 215 FATRDIKKGENLTYDY 230
FA++DI+ G L +DY
Sbjct: 914 FASQDIEAGTELVFDY 929
>gi|390355935|ref|XP_003728661.1| PREDICTED: uncharacterized protein LOC579712 isoform 1
[Strongylocentrotus purpuratus]
Length = 3164
Score = 128 bits (321), Expect = 2e-27, Method: Composition-based stats.
Identities = 80/221 (36%), Positives = 116/221 (52%), Gaps = 24/221 (10%)
Query: 29 EFELP-----DWF-----IKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTA--SPGS 76
+FELP W + +A + I+ NIY +K ++ + + CSC +P
Sbjct: 2256 DFELPYDIWWQWVHNKLPSRTQAPKFKKIRNNIYFD--LKPTIQAEVVRCSCKRPYNPEE 2313
Query: 77 SGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADEDI 134
G C DC M+ CSS C CG+ C N+ Q ++ TE G G+ + I
Sbjct: 2314 KG-CGEDCLNRMIQHECSSASCPCGDQCANQVIQRHNWSPGLRRFMTENRGWGVRTLQPI 2372
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKM---KHLGETNFYLCEINRDMVIDATYKGNKSRYI 191
+ F+IEY+GEVI + LWK + + + Y ++ MVID GN+ R++
Sbjct: 2373 RHSSFIIEYLGEVIS----VKELWKRALDDYQYQKHHYCLNLDGGMVIDGYRYGNEGRFV 2428
Query: 192 NHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
NHSC PN EMQKW+++G RIG+FA RDI+ GE LTYDY +
Sbjct: 2429 NHSCNPNCEMQKWMVNGLYRIGMFALRDIQPGEELTYDYNF 2469
>gi|62531333|gb|AAH93421.1| Whsc1 protein [Danio rerio]
Length = 320
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 60/133 (45%), Positives = 89/133 (66%)
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C N+ F R + K+++T G G+++ DIK+GEFV EYVGE+ID++ C R+ +
Sbjct: 2 CQNQDFTKRLYPETKIIRTAGKGWGLISLRDIKKGEFVNEYVGELIDEEECRSRIRNAQE 61
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
T+FY+ I++D +IDA KGN SR++NHSC PN E QKW ++G+TR+G+FA DI
Sbjct: 62 NDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPA 121
Query: 223 GENLTYDYQYEFL 235
G LT++Y + L
Sbjct: 122 GTELTFNYNLDCL 134
>gi|301785832|ref|XP_002928328.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L-like
[Ailuropoda melanoleuca]
Length = 2965
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 76/218 (34%), Positives = 118/218 (54%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2044 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPEDDT 2101
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
G C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2102 GKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2161
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2162 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2220
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2221 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2258
>gi|255716246|ref|XP_002554404.1| KLTH0F04510p [Lachancea thermotolerans]
gi|238935787|emb|CAR23967.1| KLTH0F04510p [Lachancea thermotolerans CBS 6340]
Length = 686
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 72/193 (37%), Positives = 103/193 (53%), Gaps = 6/193 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTA--SPGSSGVC--DRDCHCGMLLSSCSSGC-- 97
+ +K+ Y +K + D I C C G + C D DC + L C +
Sbjct: 36 FTELKKCSYASKELGNSNSHDFIECDCYEEFQDGVNHACGEDSDCINRLTLIECVNDLCG 95
Query: 98 KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
CGN C N+ FQ R + + QT+K G G+ A +DI+ EF+ EY+GEVI + +R+
Sbjct: 96 SCGNDCRNQRFQKREYADIAVFQTQKKGYGVRAQKDIEAHEFIYEYIGEVIAEDEFRDRM 155
Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
H G +FY + IDAT KG+ +R+ NHSC PN + KW + G+ R+GIFA
Sbjct: 156 IDYDHRGLKHFYFMMLQTGEFIDATVKGSLARFCNHSCNPNAYVNKWAVAGKLRMGIFAN 215
Query: 218 RDIKKGENLTYDY 230
R I KGE +T+DY
Sbjct: 216 RKILKGEEITFDY 228
>gi|443897443|dbj|GAC74784.1| clathrin coat binding protein [Pseudozyma antarctica T-34]
Length = 925
Score = 128 bits (321), Expect = 3e-27, Method: Composition-based stats.
Identities = 76/204 (37%), Positives = 109/204 (53%), Gaps = 7/204 (3%)
Query: 47 IKRNIYLTKRIKRRLE--DDGIFCSCT-ASPGSSGVCDRDCHC--GMLLSSCSSG-CKCG 100
I N Y K++ R DD + C C A+ + C + C M CSS C+ G
Sbjct: 117 ITHNDYHDKKLGRPPGKFDDYMLCDCNPATENTDLACTDESGCINRMTQIECSSSKCRWG 176
Query: 101 NSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKM 160
C N+ F R + +VQTEK G G+ A DI + FV EYVGEV++ T +R+
Sbjct: 177 KLCRNQRFHRRQYVYVDIVQTEKKGFGLRAAHDIPKESFVYEYVGEVMNQHTFLDRMQLY 236
Query: 161 KHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDI 220
+ G +FY + + +DAT KG K R+INHSC PN + KW + R+GIFA R+I
Sbjct: 237 RTEGIRHFYFMMLQPNEYLDATKKGGKGRFINHSCNPNCSVSKWQVGKHLRMGIFAKRNI 296
Query: 221 KKGENLTYDYQYE-FLHDSLIAYC 243
KGE LT++Y + + +D+ +C
Sbjct: 297 AKGEELTFNYNVDRYGNDAQECFC 320
>gi|344286471|ref|XP_003414981.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L
[Loxodonta africana]
Length = 2917
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 76/218 (34%), Positives = 118/218 (54%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 1996 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDA 2053
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
G C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2054 GKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2113
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2114 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2172
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2173 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2210
>gi|414587221|tpg|DAA37792.1| TPA: hypothetical protein ZEAMMB73_251567 [Zea mays]
Length = 503
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 74/192 (38%), Positives = 107/192 (55%), Gaps = 12/192 (6%)
Query: 56 RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG------MLLSSCSSG-CKCGNSCLNKPF 108
R K + E+D C C + D + CG + + C+ G C+CG C N+ F
Sbjct: 15 RHKSQTEEDIAVCECQYD-----ILDPESPCGDRCLNLLTNTECTPGYCRCGVYCKNQRF 69
Query: 109 QNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNF 168
Q + +LV+T G G+VADE+I G+FVIEY GEVI + + R + +
Sbjct: 70 QKCQYARTRLVRTGGRGWGLVADENIMAGQFVIEYCGEVISWKEAKRRAQAYETQCLKDA 129
Query: 169 YLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTY 228
Y+ +N D IDAT KGN +R+INHSC PN E +KW + GE R+GIFA ++I G L+Y
Sbjct: 130 YIIYLNADESIDATRKGNLARFINHSCQPNCETRKWNVLGEVRVGIFAKQNIPFGTELSY 189
Query: 229 DYQYEFLHDSLI 240
DY +E+ ++
Sbjct: 190 DYNFEWYGGVMV 201
>gi|390355933|ref|XP_784903.3| PREDICTED: uncharacterized protein LOC579712 isoform 3
[Strongylocentrotus purpuratus]
Length = 3326
Score = 127 bits (320), Expect = 3e-27, Method: Composition-based stats.
Identities = 80/221 (36%), Positives = 116/221 (52%), Gaps = 24/221 (10%)
Query: 29 EFELP-----DWF-----IKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTA--SPGS 76
+FELP W + +A + I+ NIY +K ++ + + CSC +P
Sbjct: 2435 DFELPYDIWWQWVHNKLPSRTQAPKFKKIRNNIYFD--LKPTIQAEVVRCSCKRPYNPEE 2492
Query: 77 SGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADEDI 134
G C DC M+ CSS C CG+ C N+ Q ++ TE G G+ + I
Sbjct: 2493 KG-CGEDCLNRMIQHECSSASCPCGDQCANQVIQRHNWSPGLRRFMTENRGWGVRTLQPI 2551
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKM---KHLGETNFYLCEINRDMVIDATYKGNKSRYI 191
+ F+IEY+GEVI + LWK + + + Y ++ MVID GN+ R++
Sbjct: 2552 RHSSFIIEYLGEVIS----VKELWKRALDDYQYQKHHYCLNLDGGMVIDGYRYGNEGRFV 2607
Query: 192 NHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
NHSC PN EMQKW+++G RIG+FA RDI+ GE LTYDY +
Sbjct: 2608 NHSCNPNCEMQKWMVNGLYRIGMFALRDIQPGEELTYDYNF 2648
>gi|47222897|emb|CAF99053.1| unnamed protein product [Tetraodon nigroviridis]
Length = 768
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 77/226 (34%), Positives = 115/226 (50%), Gaps = 29/226 (12%)
Query: 38 KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG- 96
KP ++ + R I + I L + C+C AS + D +C ML+ C
Sbjct: 366 KPPPYRHIKVNRAIGKVQFITADLSEI-PRCNCKASDENPCGVDSECINRMLMYECHPQV 424
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKR-------------------- 136
C G C N+ F R +++ +T CG G+ A DIK+
Sbjct: 425 CAAGERCQNQAFSKRQYTPVEIYRTLSCGWGLRAVSDIKKVRRGDGDGEQLQGVPGVLMT 484
Query: 137 -------GEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSR 189
G FVIEYVGEVID++ C R+ + NFY+ +++D +IDA KGN++R
Sbjct: 485 SSDATSQGAFVIEYVGEVIDEEECRARIKHAQENDIFNFYMLTLDKDRIIDAGPKGNQAR 544
Query: 190 YINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
++NH C PN E QKW ++G+TR+G+FA +DI KG+ L ++Y E L
Sbjct: 545 FMNHCCQPNCETQKWTVNGDTRVGLFALQDIPKGKELNFNYNLECL 590
>gi|395532131|ref|XP_003768125.1| PREDICTED: histone-lysine N-methyltransferase ASH1L isoform 2
[Sarcophilus harrisii]
Length = 2974
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 80/233 (34%), Positives = 123/233 (52%), Gaps = 18/233 (7%)
Query: 13 IGHAFNKLLKQIGNPVEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLE 62
+ H K L+Q ++F+LP W + KP Y I+ N+Y+ +K
Sbjct: 2041 VFHDSGKYLRQ--KRIDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSG 2096
Query: 63 DDGIFCSCT-ASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLV 119
+ C+C G C DC M+ + CS C CG C N+ Q V+ ++
Sbjct: 2097 YEATTCNCKKPDDGDRKGCMDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERF 2156
Query: 120 QTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVI 179
+ E+ G GI E +K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVI
Sbjct: 2157 RAEEKGWGIRTKEPLKAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVI 2215
Query: 180 DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
D+ GN++R+INHSC PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2216 DSYRMGNEARFINHSCNPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2268
>gi|195453973|ref|XP_002074027.1| GK14418 [Drosophila willistoni]
gi|194170112|gb|EDW85013.1| GK14418 [Drosophila willistoni]
Length = 1420
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 73/197 (37%), Positives = 111/197 (56%), Gaps = 7/197 (3%)
Query: 43 PYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG-CK 98
PYV IK N +T R+ LED + C PG C + C ML + C+ C+
Sbjct: 1124 PYVKIKANKPVTPVRLVSNLEDLSV---CDCQPGLEHPCGPESGCLNRMLFNECNPEYCR 1180
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
G+ C N+ F+ R ++ +V G G+V E I G+FVIEYVGEVI+ ++R+
Sbjct: 1181 AGDRCENQMFEMRKSPRLDVVYMNDRGFGLVCREAIAEGDFVIEYVGEVINHAEFQKRML 1240
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
+ + + N+Y + +D +IDA KGN +R++NHSC PN E QKW ++ R+G+FA +
Sbjct: 1241 QKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWSVNCIHRVGLFAIK 1300
Query: 219 DIKKGENLTYDYQYEFL 235
DI LT++Y ++ L
Sbjct: 1301 DIPANTELTFNYLWDDL 1317
>gi|442621474|ref|NP_001263029.1| Mes-4, isoform B [Drosophila melanogaster]
gi|440217972|gb|AGB96409.1| Mes-4, isoform B [Drosophila melanogaster]
Length = 1423
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 74/205 (36%), Positives = 113/205 (55%), Gaps = 7/205 (3%)
Query: 43 PYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG-CK 98
PYV IK N + R + LED +C P C + C ML + C+ CK
Sbjct: 1158 PYVKIKTNKAVPPLRFSQNLED---LSTCNCLPVDEHPCGPEAGCLNRMLFNECNPEYCK 1214
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
G+ C N+ F+ R ++++V + G G+V E I G+FVIEYVGEVI+ + R+
Sbjct: 1215 AGSLCENRMFEQRKSPRLEVVYMNERGFGLVNREPIAVGDFVIEYVGEVINHAEFQRRME 1274
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
+ + + N+Y + +D +IDA KGN +R++NHSC PN E QKW ++ R+GIFA +
Sbjct: 1275 QKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIK 1334
Query: 219 DIKKGENLTYDYQYEFLHDSLIAYC 243
DI LT++Y ++ L ++ C
Sbjct: 1335 DIPVNSELTFNYLWDDLMNNSKKAC 1359
>gi|209880293|ref|XP_002141586.1| SET domain-containing protein [Cryptosporidium muris RN66]
gi|209557192|gb|EEA07237.1| SET domain-containing protein [Cryptosporidium muris RN66]
Length = 517
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 79/190 (41%), Positives = 108/190 (56%), Gaps = 13/190 (6%)
Query: 49 RNIYLTKR-IKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG----CKCGNS- 102
RN Y KR + +RL D CSCT S CD+ C + C+ CK G+
Sbjct: 52 RNEYCFKRYVNKRLAQDLGRCSCTKS------CDQYCTNKSIYIECNKDTCQICKQGSKD 105
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C N F R K + L T+ G G+ EDIK GEFVIEYVGE+I+ + EER KM
Sbjct: 106 CGNCRFTLRQYKLVTLFDTKSKGIGLRTAEDIKLGEFVIEYVGEIINLKQLEERK-KMTS 164
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
+ Y+ + + IDATYKGN +R+INHSC PN QKW + + ++G+F+ +DIK
Sbjct: 165 KLTKHIYVFSLGNETYIDATYKGNLARFINHSCEPNLVAQKWFVGSDIKVGLFSLKDIKA 224
Query: 223 GENLTYDYQY 232
G+ LT+DY++
Sbjct: 225 GDELTFDYRF 234
>gi|431892339|gb|ELK02779.1| Putative histone-lysine N-methyltransferase ASH1L [Pteropus alecto]
Length = 1291
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 76/217 (35%), Positives = 114/217 (52%), Gaps = 14/217 (6%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ + E C
Sbjct: 374 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVDVKPLSGYEAPTCNCKKPDDAARK 433
Query: 78 GVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIK 135
G D DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +K
Sbjct: 434 GCVD-DCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLK 492
Query: 136 RGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSC 195
G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHSC
Sbjct: 493 AGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHSC 551
Query: 196 CPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
PN EMQKW ++G RIG++A RD+ G LTYDY +
Sbjct: 552 DPNCEMQKWSVNGVYRIGLYALRDMPAGTELTYDYNF 588
>gi|430813766|emb|CCJ28924.1| unnamed protein product [Pneumocystis jirovecii]
Length = 809
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 66/185 (35%), Positives = 108/185 (58%), Gaps = 4/185 (2%)
Query: 50 NIYLTKRIKRRLEDDGIFCSCTAS--PGSSGVCDRDCHCGMLLSS--CSSGCKCGNSCLN 105
NIY +K + +D+ + C C + + C ++ C ++S C+ C CG C N
Sbjct: 139 NIYHSKDLGESGQDEIMTCECKSEWDGTKNHACGKNSDCINRMTSVECTDDCNCGEDCQN 198
Query: 106 KPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGE 165
+ F+ + + +++T+K G GI A+ D++ G+F+ EYVGEVID++ +R+ +
Sbjct: 199 RRFKLKQYSNIDVIKTKKKGYGIRANSDMESGQFIYEYVGEVIDERKFRKRMKIYANEDI 258
Query: 166 TNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGEN 225
+FY + R IDAT KG +R++NHSC PN + KW++ + R+GIF R+I KGE
Sbjct: 259 KHFYFMMLQRGEYIDATKKGGLARFLNHSCSPNCYVDKWVVGTKLRMGIFCKRNILKGEE 318
Query: 226 LTYDY 230
LT+DY
Sbjct: 319 LTFDY 323
>gi|390461100|ref|XP_003732597.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 3
[Callithrix jacchus]
Length = 1389
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 108/199 (54%), Gaps = 15/199 (7%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 1022 KPPPYKHIKVNKPYGK-VQIYTADISEIPKCNCKPTDENPCGFDSECLNRMLMFECHPQV 1080
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+++T+ G G+VA DI++GEFV EYVGEVID++ C
Sbjct: 1081 CPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGEVIDEEEC--- 1137
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ ++KH E N+D +IDA KGN SR++NHSC PN E KW ++G+TR+G+FA
Sbjct: 1138 MARIKHAHE--------NQDRIIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFA 1189
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1190 VCDIPAGTELTFNYNLDCL 1208
>gi|24650756|ref|NP_733239.1| Mes-4, isoform A [Drosophila melanogaster]
gi|29427833|sp|Q8MT36.2|MES4_DROME RecName: Full=Probable histone-lysine N-methyltransferase Mes-4;
AltName: Full=Maternal-effect sterile 4 homolog
gi|23172478|gb|AAF56762.2| Mes-4, isoform A [Drosophila melanogaster]
gi|94400569|gb|ABF17912.1| FI01019p [Drosophila melanogaster]
Length = 1427
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 74/205 (36%), Positives = 113/205 (55%), Gaps = 7/205 (3%)
Query: 43 PYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG-CK 98
PYV IK N + R + LED +C P C + C ML + C+ CK
Sbjct: 1162 PYVKIKTNKAVPPLRFSQNLED---LSTCNCLPVDEHPCGPEAGCLNRMLFNECNPEYCK 1218
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
G+ C N+ F+ R ++++V + G G+V E I G+FVIEYVGEVI+ + R+
Sbjct: 1219 AGSLCENRMFEQRKSPRLEVVYMNERGFGLVNREPIAVGDFVIEYVGEVINHAEFQRRME 1278
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
+ + + N+Y + +D +IDA KGN +R++NHSC PN E QKW ++ R+GIFA +
Sbjct: 1279 QKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIK 1338
Query: 219 DIKKGENLTYDYQYEFLHDSLIAYC 243
DI LT++Y ++ L ++ C
Sbjct: 1339 DIPVNSELTFNYLWDDLMNNSKKAC 1363
>gi|383854492|ref|XP_003702755.1| PREDICTED: uncharacterized protein LOC100877317 [Megachile rotundata]
Length = 2187
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 78/237 (32%), Positives = 119/237 (50%), Gaps = 19/237 (8%)
Query: 19 KLLKQIGNPVEFELPDWFIKPKA-------IP---YVFIKRNIYLTKRIKRRLEDDGIFC 68
KLL+Q P + W++ ++ +P Y I+ N+Y +K + C
Sbjct: 1300 KLLRQRKMPFQLPYDLWWLHTQSRLPGRESVPSWNYKKIRTNVYYD--VKPTTLYEAQAC 1357
Query: 69 SCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGA 126
C G C DC M+ S CS C CG+ C N+ Q ++ TE G
Sbjct: 1358 ECKPESG----CGDDCINRMIFSECSPQLCPCGDKCENQKIQKHEWAPGLQKFMTEDKGW 1413
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ + IK G F++EYVGEV+ ++ + R+ ++ +T+ Y +++ +VID G
Sbjct: 1414 GVRTQQSIKSGVFILEYVGEVVSEREFKSRMA-TRYANDTHHYCLQLDGGLVIDGHRMGG 1472
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
R++NHSC PN EMQKW + G R+ +FA+RDIK GE LTYDY + + S C
Sbjct: 1473 DGRFVNHSCEPNCEMQKWSVHGLPRMALFASRDIKPGEELTYDYNFALFNPSEGQEC 1529
>gi|6143888|gb|AAF04434.1|AC010718_3 unknown protein; 29143-26659 [Arabidopsis thaliana]
Length = 528
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 73/193 (37%), Positives = 106/193 (54%), Gaps = 2/193 (1%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS-SGVCDRDCHCGMLLSSCSSG-CKCGN 101
Y I +N + ++ K++ E+D C C G C C + + C+ G C CG
Sbjct: 17 YEHIYQNDFSYRKHKKQKEEDISICECKFDFGDPDSACGERCLNVITNTECTPGYCPCGV 76
Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
C N+ FQ K KL++ E G G+VA E+IK G+F++EY GEVI + ++R +
Sbjct: 77 YCKNQKFQKCEYAKTKLIKCEGRGWGLVALEEIKAGQFIMEYCGEVISWKEAKKRAQTYE 136
Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
G + Y+ +N IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA I
Sbjct: 137 THGVKDAYIISLNASEAIDATKKGSLARFINHSCRPNCETRKWNVLGEVRVGIFAKESIS 196
Query: 222 KGENLTYDYQYEF 234
L YDY +E+
Sbjct: 197 PRTELAYDYNFEW 209
>gi|195574451|ref|XP_002105202.1| GD18047 [Drosophila simulans]
gi|194201129|gb|EDX14705.1| GD18047 [Drosophila simulans]
Length = 567
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 76/211 (36%), Positives = 115/211 (54%), Gaps = 7/211 (3%)
Query: 37 IKPKAIPYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSC 93
IK PYV IK N + R + LED +C P C + C ML + C
Sbjct: 296 IKLHPPPYVKIKTNKAVPPLRFSQNLED---LSTCNCLPVDEHPCGPEAGCLNRMLFNEC 352
Query: 94 SSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
+ CK G+ C N+ F+ R ++++V + G G+V E I G+FVIEYVGEVI+
Sbjct: 353 NPDYCKAGSLCENRMFEQRKSPRLEVVYMNERGFGLVNREPIAAGDFVIEYVGEVINHAE 412
Query: 153 CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
+ R+ + + + N+Y + +D +IDA KGN +R++NHSC PN E QKW ++ R+
Sbjct: 413 FQRRMEQKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRV 472
Query: 213 GIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
GIFA +DI LT++Y ++ L ++ C
Sbjct: 473 GIFAIKDIPVNTELTFNYLWDDLMNNSKKAC 503
>gi|126307634|ref|XP_001366993.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L
[Monodelphis domestica]
Length = 2968
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 76/218 (34%), Positives = 118/218 (54%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCT-ASPGS 76
++F+LP W + KP Y I+ N+Y+ +K + C+C G+
Sbjct: 2048 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDGN 2105
Query: 77 SGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2106 RKGCMDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2165
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2166 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2224
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2225 CNPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2262
>gi|395532129|ref|XP_003768124.1| PREDICTED: histone-lysine N-methyltransferase ASH1L isoform 1
[Sarcophilus harrisii]
Length = 2969
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 76/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCT-ASPGS 76
++F+LP W + KP Y I+ N+Y+ +K + C+C G
Sbjct: 2049 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDGD 2106
Query: 77 SGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2107 RKGCMDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2166
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2167 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2225
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2226 CNPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2263
>gi|21392158|gb|AAM48433.1| RE61305p [Drosophila melanogaster]
Length = 1016
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 74/205 (36%), Positives = 113/205 (55%), Gaps = 7/205 (3%)
Query: 43 PYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG-CK 98
PYV IK N + R + LED +C P C + C ML + C+ CK
Sbjct: 751 PYVKIKTNKAVPPLRFSQNLED---LSTCNCLPVDEHPCGPEAGCLNRMLFNECNPEYCK 807
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
G+ C N+ F+ R ++++V + G G+V E I G+FVIEYVGEVI+ + R+
Sbjct: 808 AGSLCENRMFEQRKSPRLEVVYMNERGFGLVNREPIAVGDFVIEYVGEVINHAEFQRRME 867
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
+ + + N+Y + +D +IDA KGN +R++NHSC PN E QKW ++ R+GIFA +
Sbjct: 868 QKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIK 927
Query: 219 DIKKGENLTYDYQYEFLHDSLIAYC 243
DI LT++Y ++ L ++ C
Sbjct: 928 DIPVNSELTFNYLWDDLMNNSKKAC 952
>gi|189195894|ref|XP_001934285.1| histone-lysine N-methyltransferase [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187980164|gb|EDU46790.1| histone-lysine N-methyltransferase [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 948
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 72/224 (32%), Positives = 117/224 (52%), Gaps = 10/224 (4%)
Query: 27 PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCT----ASPGSSGVCDR 82
P+ +LPD K +V + Y K I D + C C ++ ++ CD
Sbjct: 118 PLYLDLPDSTEAAKET-FVVLPECTYANKTIG--TTDPALECDCQEEYDSATKTNHACDE 174
Query: 83 DCHCGMLLS--SCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D C + C C CG +C N+ FQ + + +++TEK G G+ A++++ G+FV
Sbjct: 175 DSDCINRATKMECVGDCSCGRTCQNQRFQRKQYADVTVIKTEKKGFGLRANKNMVPGDFV 234
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GEVID++T R+ + G +FY + + +DAT KGN R+ NHSC PN
Sbjct: 235 FEYIGEVIDERTFRRRMGQYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCF 294
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIF R+++ GE L ++Y + + D YC
Sbjct: 295 VDKWVVGDKLRMGIFVERNVRAGEELVFNYNVDRYGADPQPCYC 338
>gi|294658433|ref|XP_460767.2| DEHA2F09350p [Debaryomyces hansenii CBS767]
gi|218511782|sp|Q6BM04.2|SET2_DEBHA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|202953126|emb|CAG89108.2| DEHA2F09350p [Debaryomyces hansenii CBS767]
Length = 731
Score = 127 bits (318), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 61/151 (40%), Positives = 93/151 (61%), Gaps = 1/151 (0%)
Query: 81 DRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
D DC + CS+ C CGN C N+ FQ + + ++QTE G G+ A+EDI F
Sbjct: 76 DSDCINRVTSVECSNKFCTCGNDCQNQRFQKKQYANVTVIQTELKGYGLRANEDISESSF 135
Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
+ EY+GEVID+++ +R+ +FY + +D IDAT KG+ +R+ NHSC PN
Sbjct: 136 IYEYIGEVIDEESFRKRMIDYDTKKLIHFYFMMLKKDSFIDATMKGSLARFCNHSCNPNA 195
Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
+ KW++ + R+GIF+ R+I+KGE +T+DY
Sbjct: 196 YVDKWVVGEKLRMGIFSKRNIQKGEEITFDY 226
>gi|22330671|ref|NP_177797.2| histone-lysine N-methyltransferase ASHH1 [Arabidopsis thaliana]
gi|42572135|ref|NP_974158.1| histone-lysine N-methyltransferase ASHH1 [Arabidopsis thaliana]
gi|75243465|sp|Q84WW6.1|ASHH1_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHH1; AltName:
Full=ASH1 homolog 1; AltName: Full=Protein SET DOMAIN
GROUP 26
gi|25054844|gb|AAN71912.1| unknown protein [Arabidopsis thaliana]
gi|225898088|dbj|BAH30376.1| hypothetical protein [Arabidopsis thaliana]
gi|332197758|gb|AEE35879.1| histone-lysine N-methyltransferase ASHH1 [Arabidopsis thaliana]
gi|332197759|gb|AEE35880.1| histone-lysine N-methyltransferase ASHH1 [Arabidopsis thaliana]
Length = 492
Score = 127 bits (318), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 73/193 (37%), Positives = 106/193 (54%), Gaps = 2/193 (1%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS-SGVCDRDCHCGMLLSSCSSG-CKCGN 101
Y I +N + ++ K++ E+D C C G C C + + C+ G C CG
Sbjct: 17 YEHIYQNDFSYRKHKKQKEEDISICECKFDFGDPDSACGERCLNVITNTECTPGYCPCGV 76
Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
C N+ FQ K KL++ E G G+VA E+IK G+F++EY GEVI + ++R +
Sbjct: 77 YCKNQKFQKCEYAKTKLIKCEGRGWGLVALEEIKAGQFIMEYCGEVISWKEAKKRAQTYE 136
Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
G + Y+ +N IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA I
Sbjct: 137 THGVKDAYIISLNASEAIDATKKGSLARFINHSCRPNCETRKWNVLGEVRVGIFAKESIS 196
Query: 222 KGENLTYDYQYEF 234
L YDY +E+
Sbjct: 197 PRTELAYDYNFEW 209
>gi|393236319|gb|EJD43868.1| hypothetical protein AURDEDRAFT_114444 [Auricularia delicata
TFB-10046 SS5]
Length = 882
Score = 127 bits (318), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 75/213 (35%), Positives = 115/213 (53%), Gaps = 12/213 (5%)
Query: 36 FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS---GVCDRDCHCGMLLSS 92
FI+ KA Y +N+ L + R +E+ G C C+ P C DC +
Sbjct: 70 FIELKACTY----QNMQLGR--SRNIEE-GFACECSYDPEEDLPDAACGSDCINKLTQIE 122
Query: 93 C-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQ 151
C C+CG C N+ F R ++ +VQTEK G G+ A D+K+ +F+ EYVGEV+++
Sbjct: 123 CLEDECRCGVHCQNQRFAKRQYAQIHIVQTEKKGFGLRAATDLKKDDFIYEYVGEVVNNT 182
Query: 152 TCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETR 211
+R+ + G +FY + ++ IDAT KG R+ NHSC PN + KW + R
Sbjct: 183 QFMKRMREYADEGIRHFYFMMLQKEEFIDATKKGGIGRFANHSCNPNCYVAKWTVGKRIR 242
Query: 212 IGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+GIFA R++K GE LT++Y + + H++ YC
Sbjct: 243 MGIFAQRNVKAGEELTFNYNVDRYGHEAQTCYC 275
>gi|189534132|ref|XP_001923087.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Danio rerio]
Length = 773
Score = 126 bits (317), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 74/198 (37%), Positives = 109/198 (55%), Gaps = 2/198 (1%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGI-FCSCTASPGSSGVCDRDCHCGMLLSSCSSG-C 97
K PY IK N K + + I C+C A+ + D +C MLL C C
Sbjct: 319 KPPPYKQIKVNKPFGKVLIISADLSEIPRCNCKATDENPCGMDSECINRMLLYECHPQVC 378
Query: 98 KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
G C N+ F R +++ +T G G+ DIK+G F+ EYVGEVID++ C R+
Sbjct: 379 PAGERCQNQCFIKRQYCQVETFRTLSRGWGLRCVHDIKKGGFISEYVGEVIDEEECRARI 438
Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
+ NFY+ +++D +IDA KGN++R++NH C PN E QKW ++G+TR+G+F+
Sbjct: 439 KHAQENNIGNFYMLTLDKDRIIDAGPKGNEARFMNHCCQPNCETQKWTVNGDTRVGLFSL 498
Query: 218 RDIKKGENLTYDYQYEFL 235
DI G LT++Y E L
Sbjct: 499 TDIPAGTELTFNYNLECL 516
>gi|327286108|ref|XP_003227773.1| PREDICTED: probable histone-lysine N-methyltransferase ASH1L-like
[Anolis carolinensis]
Length = 2957
Score = 126 bits (317), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 76/218 (34%), Positives = 118/218 (54%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C S+
Sbjct: 2036 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPEDSN 2093
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2094 KKGCVDDCLNRMIFAECSPNTCPCGEHCCNQRIQRHEWVQCLERFRAEEKGWGIRTKESL 2153
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2154 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2212
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2213 CNPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2250
>gi|241952364|ref|XP_002418904.1| SET domain-containing protein, putative; histone methyltransferase,
putative; histone-lysine N-methyltransferase, putative
[Candida dubliniensis CD36]
gi|223642243|emb|CAX44211.1| SET domain-containing protein, putative [Candida dubliniensis CD36]
Length = 837
Score = 126 bits (317), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 60/137 (43%), Positives = 91/137 (66%), Gaps = 4/137 (2%)
Query: 96 GCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEE 155
C CG+ C N+ FQN K+K++QTE G G++A++DI+ +F+ EY+GEVID+ + +
Sbjct: 124 NCLCGDDCQNQRFQNCQYSKVKVIQTELKGYGLIAEQDIEENQFIYEYIGEVIDESSFRQ 183
Query: 156 RL--WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIG 213
R+ + ++HL +FY ++ D IDAT KG+ R+INHSC PN + KW + R+G
Sbjct: 184 RMIEYDLRHL--KHFYFMMLSNDSFIDATQKGSLGRFINHSCNPNAFVDKWHVGDRLRMG 241
Query: 214 IFATRDIKKGENLTYDY 230
IFA R I +GE +T+DY
Sbjct: 242 IFAKRKILRGEEITFDY 258
>gi|347966112|ref|XP_321588.5| AGAP001535-PA [Anopheles gambiae str. PEST]
gi|333470205|gb|EAA00844.6| AGAP001535-PA [Anopheles gambiae str. PEST]
Length = 3613
Score = 126 bits (317), Expect = 7e-27, Method: Composition-based stats.
Identities = 71/214 (33%), Positives = 108/214 (50%), Gaps = 15/214 (7%)
Query: 27 PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC 86
P +P W Y I+ N+Y + D+ C+C G C DC
Sbjct: 2682 PGRNSVPSW-------NYRKIRTNVYYDVKANPS-TDNNTQCNCKPDSG----CQDDCLN 2729
Query: 87 GMLLSSC-SSGCKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRGEFVIEYV 144
M+ + C C CG+ C N Q ++ TE+ G GI + E I +G F++EY+
Sbjct: 2730 RMVYTECVPEQCPCGDRCRNTCIQRHEYAPGLERFMTEEKGWGIRSRERISKGTFIMEYL 2789
Query: 145 GEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKW 204
GEV+ ++ +ER+ M +L +T+ Y ++ +VID G+ R++NHSC PN EMQKW
Sbjct: 2790 GEVVTEREFKERMRTM-YLNDTHHYCLNLDGGLVIDGHRMGSDCRFVNHSCAPNCEMQKW 2848
Query: 205 IIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
++G R+ +FA RDI E L YDY + + S
Sbjct: 2849 SVNGLFRMALFAMRDIPPNEELCYDYNFSLFNPS 2882
>gi|414587222|tpg|DAA37793.1| TPA: hypothetical protein ZEAMMB73_251567 [Zea mays]
Length = 489
Score = 126 bits (317), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 73/190 (38%), Positives = 106/190 (55%), Gaps = 12/190 (6%)
Query: 58 KRRLEDDGIFCSCTASPGSSGVCDRDCHCG------MLLSSCSSG-CKCGNSCLNKPFQN 110
K + E+D C C + D + CG + + C+ G C+CG C N+ FQ
Sbjct: 3 KSQTEEDIAVCECQYD-----ILDPESPCGDRCLNLLTNTECTPGYCRCGVYCKNQRFQK 57
Query: 111 RPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYL 170
+ +LV+T G G+VADE+I G+FVIEY GEVI + + R + + Y+
Sbjct: 58 CQYARTRLVRTGGRGWGLVADENIMAGQFVIEYCGEVISWKEAKRRAQAYETQCLKDAYI 117
Query: 171 CEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
+N D IDAT KGN +R+INHSC PN E +KW + GE R+GIFA ++I G L+YDY
Sbjct: 118 IYLNADESIDATRKGNLARFINHSCQPNCETRKWNVLGEVRVGIFAKQNIPFGTELSYDY 177
Query: 231 QYEFLHDSLI 240
+E+ ++
Sbjct: 178 NFEWYGGVMV 187
>gi|401625193|gb|EJS43214.1| set2p [Saccharomyces arboricola H-6]
Length = 736
Score = 126 bits (317), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 70/186 (37%), Positives = 101/186 (54%), Gaps = 6/186 (3%)
Query: 51 IYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSG--CKCGNSCL 104
IY KRI +D + C C S G + CD D C + L C + CGN C
Sbjct: 51 IYANKRIGTFKNNDFMECDCYEEFSDGVNHACDEDSDCINRLTLIECVNDLCSSCGNDCQ 110
Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
N+ FQ + + + +T+ G G+ A+EDI+ +F+ EY GEVI++ ERL
Sbjct: 111 NQRFQKKQYSPIAIFKTKYKGYGVRAEEDIEANQFIYEYKGEVIEEMEFRERLIDYDQRH 170
Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
+FY + IDAT KG+ +R+ NHSC PN + KW++ G+ R+GIFA + I KGE
Sbjct: 171 FKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKGKLRMGIFAQKKILKGE 230
Query: 225 NLTYDY 230
+T+DY
Sbjct: 231 EITFDY 236
>gi|194746360|ref|XP_001955648.1| GF16138 [Drosophila ananassae]
gi|190628685|gb|EDV44209.1| GF16138 [Drosophila ananassae]
Length = 1460
Score = 126 bits (317), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 73/205 (35%), Positives = 112/205 (54%), Gaps = 7/205 (3%)
Query: 43 PYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG-CK 98
PY IK N + R + LE+ +C P C + C ML + C+ C+
Sbjct: 1171 PYAKIKANKAVAPVRFMQNLEE---LSACDCLPEHDHPCGPESGCLNRMLYNECNPDFCR 1227
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
G+ C N+ F+ R ++++V + G G+V E I G FVIEYVGEVI+ +ERL
Sbjct: 1228 AGDRCENRMFEQRISPRLEVVYMNERGFGLVCREPIAEGTFVIEYVGEVINHAEFQERLI 1287
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
+ + + N+Y + +D +IDA KGN +R++NHSC PN E QKW ++ R+GIFA +
Sbjct: 1288 QKQRNRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCVHRVGIFAIK 1347
Query: 219 DIKKGENLTYDYQYEFLHDSLIAYC 243
DI LT++Y ++ L ++ C
Sbjct: 1348 DIPANTELTFNYLWDDLMNNSKKAC 1372
>gi|328785970|ref|XP_001122948.2| PREDICTED: hypothetical protein LOC727238 [Apis mellifera]
Length = 2172
Score = 126 bits (316), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 78/237 (32%), Positives = 118/237 (49%), Gaps = 19/237 (8%)
Query: 19 KLLKQIGNPVEFELPDWFIKPKA-------IP---YVFIKRNIYLTKRIKRRLEDDGIFC 68
KLL+Q P + W++ ++ +P Y I+ N+Y +K + C
Sbjct: 1285 KLLRQRKMPFQLPYDLWWLHTQSRLPSRESVPSWNYKKIRTNVYYD--VKPTTLYEAQAC 1342
Query: 69 SCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGA 126
C G C DC M+ S CS C CG+ C N+ Q ++ TE G
Sbjct: 1343 ECKPESG----CGDDCINRMVFSECSPQLCPCGDKCENQKIQKHEWAPGLQKFMTEDKGW 1398
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ + IK G F++EYVGEV+ ++ + R+ ++ +T+ Y ++ +VID G
Sbjct: 1399 GVRTQQSIKSGVFILEYVGEVVSEREFKSRMA-TRYANDTHHYCLHLDGGLVIDGHRMGG 1457
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
R++NHSC PN EMQKW + G R+ +FA+RDIK GE LTYDY + + S C
Sbjct: 1458 DGRFVNHSCEPNCEMQKWSVHGLPRMALFASRDIKPGEELTYDYNFALFNPSEGQEC 1514
>gi|303319123|ref|XP_003069561.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
gi|240109247|gb|EER27416.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
Length = 1011
Score = 126 bits (316), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 65/164 (39%), Positives = 94/164 (57%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG+SC N+ FQ R K+ +++TEK G G+ AD D++ EF+
Sbjct: 170 DSDCINRATKMECFGDCGCGDSCQNQRFQRREYAKVSVIKTEKKGYGLRADCDLRPNEFI 229
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GEVI++ R+ + G +FY +N+ +DAT KGN R+ NHSC PN
Sbjct: 230 FEYIGEVINEPQFRRRMIQYDEEGIKHFYFMSLNKGEFVDATKKGNLGRFCNHSCNPNCY 289
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R IK GE L ++Y + + D YC
Sbjct: 290 VDKWVVGEKLRMGIFAERYIKAGEELVFNYNVDRYGADPQPCYC 333
>gi|119182346|ref|XP_001242314.1| hypothetical protein CIMG_06210 [Coccidioides immitis RS]
Length = 1003
Score = 126 bits (316), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 65/164 (39%), Positives = 94/164 (57%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG+SC N+ FQ R K+ +++TEK G G+ AD D++ EF+
Sbjct: 162 DSDCINRATKMECFGDCGCGDSCQNQRFQRREYAKVSVIKTEKKGYGLRADCDLRPNEFI 221
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GEVI++ R+ + G +FY +N+ +DAT KGN R+ NHSC PN
Sbjct: 222 FEYIGEVINEPQFRRRMIQYDEEGIKHFYFMSLNKGEFVDATKKGNLGRFCNHSCNPNCY 281
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R IK GE L ++Y + + D YC
Sbjct: 282 VDKWVVGEKLRMGIFAERYIKAGEELVFNYNVDRYGADPQPCYC 325
>gi|390360513|ref|XP_785219.3| PREDICTED: histone-lysine N-methyltransferase NSD3-like
[Strongylocentrotus purpuratus]
Length = 1736
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 74/201 (36%), Positives = 109/201 (54%), Gaps = 7/201 (3%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFC-SCTASPGSSGVC--DRDCHCGMLLSSCSSG 96
K P+ FIK N + + D C +C P C D DC +LL C
Sbjct: 1156 KPAPFKFIKTNRPVGNVVMPAF--DITQCQACECRPDMENPCGPDSDCLNRILLIECHPQ 1213
Query: 97 -CKCGNS-CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCE 154
C C N+ FQ R ++ G G+VA DIK+G+FV EYVGE++D++ C
Sbjct: 1214 ICPAKEEKCQNQRFQKRAYPDSCQMKVSHRGWGLVAMVDIKKGDFVNEYVGELVDEEECR 1273
Query: 155 ERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGI 214
R+ + T+FY +++D +IDA KGN SR++NHSC PN E QKW ++G+TR+G+
Sbjct: 1274 RRIKQAHEENITDFYFLTLDKDRIIDAGPKGNLSRFMNHSCQPNCETQKWTVNGDTRVGL 1333
Query: 215 FATRDIKKGENLTYDYQYEFL 235
FA R+I G ++++Y + L
Sbjct: 1334 FAIRNIAAGNEISFNYNLDCL 1354
>gi|320041043|gb|EFW22976.1| histone-lysine N-methyltransferase [Coccidioides posadasii str.
Silveira]
Length = 972
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 65/164 (39%), Positives = 94/164 (57%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG+SC N+ FQ R K+ +++TEK G G+ AD D++ EF+
Sbjct: 131 DSDCINRATKMECFGDCGCGDSCQNQRFQRREYAKVSVIKTEKKGYGLRADCDLRPNEFI 190
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GEVI++ R+ + G +FY +N+ +DAT KGN R+ NHSC PN
Sbjct: 191 FEYIGEVINEPQFRRRMIQYDEEGIKHFYFMSLNKGEFVDATKKGNLGRFCNHSCNPNCY 250
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R IK GE L ++Y + + D YC
Sbjct: 251 VDKWVVGEKLRMGIFAERYIKAGEELVFNYNVDRYGADPQPCYC 294
>gi|442570025|sp|Q1DU03.2|SET2_COCIM RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|392865212|gb|EAS30985.2| histone-lysine N-methyltransferase, H3 lysine-36 specific
[Coccidioides immitis RS]
Length = 1011
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 65/164 (39%), Positives = 94/164 (57%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG+SC N+ FQ R K+ +++TEK G G+ AD D++ EF+
Sbjct: 170 DSDCINRATKMECFGDCGCGDSCQNQRFQRREYAKVSVIKTEKKGYGLRADCDLRPNEFI 229
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GEVI++ R+ + G +FY +N+ +DAT KGN R+ NHSC PN
Sbjct: 230 FEYIGEVINEPQFRRRMIQYDEEGIKHFYFMSLNKGEFVDATKKGNLGRFCNHSCNPNCY 289
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R IK GE L ++Y + + D YC
Sbjct: 290 VDKWVVGEKLRMGIFAERYIKAGEELVFNYNVDRYGADPQPCYC 333
>gi|226291164|gb|EEH46592.1| histone-lysine N-methyltransferase [Paracoccidioides brasiliensis
Pb18]
Length = 980
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 64/172 (37%), Positives = 96/172 (55%), Gaps = 3/172 (1%)
Query: 75 GSSGVC--DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADE 132
G +G C D DC C C CG+ C N+ FQ R + +++TEK G G+ AD
Sbjct: 163 GRNGACGEDSDCINRATKMECVGDCGCGDECQNQRFQRRQYANVTVIKTEKKGYGLRADS 222
Query: 133 DIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYIN 192
D++ +F+ EY+GEVI++ + +R+ G +FY + + +DAT KGN R+ N
Sbjct: 223 DLRPHQFIFEYIGEVINEASFRKRMIAYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCN 282
Query: 193 HSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
HSC PN + KW++ + R+GIFA R IK GE L ++Y + + D YC
Sbjct: 283 HSCNPNCYVDKWVVGEKLRMGIFAERHIKAGEELVFNYNVDRYGADPQPCYC 334
>gi|388858360|emb|CCF48089.1| related to SET2-Histone methyltransferase [Ustilago hordei]
Length = 985
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 69/172 (40%), Positives = 96/172 (55%), Gaps = 4/172 (2%)
Query: 63 DDGIFCSCTASPGSSGVC---DRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKL 118
DD + C C S + + C M CSS C+ G C N+ F R + +
Sbjct: 165 DDYMLCDCNPHSESEDLACTDESGCINRMTQIECSSSKCRWGKLCRNQRFHRRQYVNVDI 224
Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
VQTEK G G+ A +DI + FV EYVGEV++ T ER+ + + G +FY + +
Sbjct: 225 VQTEKKGFGLRAAQDIPKETFVYEYVGEVMNQPTFLERMQQYRQEGIRHFYFMMLQPNEY 284
Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
+DAT KG K R+INHSC PN + KW + R+GIFA R+I+KGE LT++Y
Sbjct: 285 LDATKKGGKGRFINHSCNPNCSVSKWQVGKHLRMGIFAKRNIQKGEELTFNY 336
>gi|340959559|gb|EGS20740.1| hypothetical protein CTHT_0025760 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 939
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 67/186 (36%), Positives = 99/186 (53%), Gaps = 6/186 (3%)
Query: 64 DGIFCSCTAS--PGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKL 118
D + C C G + C D C + C CG C N+ FQ + K+ +
Sbjct: 125 DALDCDCAEDWRDGKNHACGEDSDCINRATKIECVIGDCNCGEGCQNQRFQRKQYAKVSV 184
Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
++TEK G G+ AD D++ +F+ EYVGEVI++ T R+ K G +FY + ++
Sbjct: 185 IKTEKKGYGLRADTDLQPNDFIYEYVGEVINEPTFRSRMLKYDKEGIKHFYFMSLTKNEF 244
Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHD 237
+DAT KGN R+ NHSC PN + KW++ + R+GIFA R IK GE L ++Y + + D
Sbjct: 245 VDATKKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFAARYIKAGEELVFNYNVDRYGAD 304
Query: 238 SLIAYC 243
YC
Sbjct: 305 PQPCYC 310
>gi|340724009|ref|XP_003400378.1| PREDICTED: hypothetical protein LOC100647786 [Bombus terrestris]
Length = 2172
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 78/237 (32%), Positives = 117/237 (49%), Gaps = 19/237 (8%)
Query: 19 KLLKQIGNPVEFELPDWFIKPKA-------IP---YVFIKRNIYLTKRIKRRLEDDGIFC 68
KLL+Q P + W++ ++ +P Y I+ N+Y +K + C
Sbjct: 1285 KLLRQRKMPFQLPYDLWWLHTQSRLPGRESVPSWNYKKIRTNVYYD--VKPTTLYEAQAC 1342
Query: 69 SCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGA 126
C G C DC M+ S CS C CG C N+ Q ++ TE G
Sbjct: 1343 ECKPESG----CGDDCINRMVFSECSPQLCPCGEKCENQKIQKHEWAPGLQKFMTEDKGW 1398
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ + IK G F++EYVGEV+ ++ + R+ ++ +T+ Y ++ +VID G
Sbjct: 1399 GVRTQQSIKSGVFILEYVGEVVSEREFKSRMA-TRYANDTHHYCLHLDGGLVIDGHRMGG 1457
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
R++NHSC PN EMQKW + G R+ +FA+RDIK GE LTYDY + + S C
Sbjct: 1458 DGRFVNHSCEPNCEMQKWSVHGLPRMALFASRDIKPGEELTYDYNFALFNPSEGQEC 1514
>gi|30704948|gb|AAH52194.1| Ash1l protein, partial [Mus musculus]
Length = 963
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 43 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 100
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 101 RKGCGDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 160
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 161 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 219
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 220 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 257
>gi|195108992|ref|XP_001999076.1| GI23270 [Drosophila mojavensis]
gi|193915670|gb|EDW14537.1| GI23270 [Drosophila mojavensis]
Length = 1433
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 72/196 (36%), Positives = 106/196 (54%), Gaps = 5/196 (2%)
Query: 43 PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG-CKC 99
PYV IK N + E D C C PGS+ C + C ML C C
Sbjct: 1139 PYVKIKTNKPVPPVRFVYNEQDLNICDC--KPGSAHPCGPESGCLNRMLYHECHPDYCHA 1196
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
G C N+ F+ R ++++V + G G+V E IK G+F+IEYVGEVI+ + R+ +
Sbjct: 1197 GTRCENQLFELRKSPRLEVVYMNERGFGLVCREPIKEGDFIIEYVGEVINHAEFQRRMAQ 1256
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
+ NFY + +D +IDA KGN +R++NHSC PN QKW ++ R+G+FA +D
Sbjct: 1257 KTKNRDENFYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCATQKWTVNCNNRVGLFAIKD 1316
Query: 220 IKKGENLTYDYQYEFL 235
I + LT++Y ++ L
Sbjct: 1317 IPENTELTFNYLWDDL 1332
>gi|13442965|gb|AAK26242.1|AF247132_1 putative chromatin remodeling factor [Mus musculus]
Length = 2669
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 1749 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 1806
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 1807 RKGCGDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 1866
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 1867 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 1925
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 1926 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 1963
>gi|395845197|ref|XP_003795328.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Otolemur
garnettii]
Length = 2961
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 76/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C S
Sbjct: 2040 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDS 2097
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2098 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2157
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2158 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2216
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2217 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2254
>gi|350427843|ref|XP_003494899.1| PREDICTED: hypothetical protein LOC100746434 [Bombus impatiens]
Length = 2217
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 78/237 (32%), Positives = 117/237 (49%), Gaps = 19/237 (8%)
Query: 19 KLLKQIGNPVEFELPDWFIKPKA-------IP---YVFIKRNIYLTKRIKRRLEDDGIFC 68
KLL+Q P + W++ ++ +P Y I+ N+Y +K + C
Sbjct: 1330 KLLRQRKMPFQLPYDLWWLHTQSRLPGRESVPSWNYKKIRTNVYYD--VKPTTLYEAQAC 1387
Query: 69 SCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGA 126
C G C DC M+ S CS C CG C N+ Q ++ TE G
Sbjct: 1388 ECKPESG----CGDDCINRMVFSECSPQLCPCGEKCENQKIQKHEWAPGLQKFMTEDKGW 1443
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ + IK G F++EYVGEV+ ++ + R+ ++ +T+ Y ++ +VID G
Sbjct: 1444 GVRTQQSIKSGVFILEYVGEVVSEREFKSRMA-TRYANDTHHYCLHLDGGLVIDGHRMGG 1502
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
R++NHSC PN EMQKW + G R+ +FA+RDIK GE LTYDY + + S C
Sbjct: 1503 DGRFVNHSCEPNCEMQKWSVHGLPRMALFASRDIKPGEELTYDYNFALFNPSEGQEC 1559
>gi|328716964|ref|XP_001950932.2| PREDICTED: hypothetical protein LOC100161826, partial [Acyrthosiphon
pisum]
Length = 1530
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 68/161 (42%), Positives = 91/161 (56%), Gaps = 7/161 (4%)
Query: 74 PGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADE 132
PGS DC ML+ C C G+ C N+ F+ M T+ G G+ E
Sbjct: 1113 PGS------DCINRMLMFECEPKVCPAGDKCNNQRFEKTLYPAMVPFLTKGRGWGLKTLE 1166
Query: 133 DIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYIN 192
DIK G FVIEYVG+VID++ + R +M E N+Y I+ IDA KGN SR++N
Sbjct: 1167 DIKEGSFVIEYVGDVIDEEEFQRRCLEMHQRNEQNYYFLTIDNSRTIDAGPKGNLSRFMN 1226
Query: 193 HSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
HSC PN QKW ++G+TRIG+FA +DI G L +DY+ +
Sbjct: 1227 HSCEPNCVTQKWTVNGDTRIGLFALQDIPTGTELVFDYRLQ 1267
>gi|225679434|gb|EEH17718.1| histone-lysine N-methyltransferase [Paracoccidioides brasiliensis
Pb03]
Length = 976
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 64/172 (37%), Positives = 96/172 (55%), Gaps = 3/172 (1%)
Query: 75 GSSGVC--DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADE 132
G +G C D DC C C CG+ C N+ FQ R + +++TEK G G+ AD
Sbjct: 163 GKNGACGEDSDCINRATKMECVGDCGCGDECQNQRFQRRQYANVTVIKTEKKGYGLRADS 222
Query: 133 DIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYIN 192
D++ +F+ EY+GEVI++ + +R+ G +FY + + +DAT KGN R+ N
Sbjct: 223 DLRPHQFIFEYIGEVINEASFRKRMIAYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCN 282
Query: 193 HSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
HSC PN + KW++ + R+GIFA R IK GE L ++Y + + D YC
Sbjct: 283 HSCNPNCYVDKWVVGEKLRMGIFAERHIKAGEELVFNYNVDRYGADPQPCYC 334
>gi|295665754|ref|XP_002793428.1| histone-lysine N-methyltransferase [Paracoccidioides sp. 'lutzii'
Pb01]
gi|226278342|gb|EEH33908.1| histone-lysine N-methyltransferase [Paracoccidioides sp. 'lutzii'
Pb01]
Length = 1001
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 64/172 (37%), Positives = 96/172 (55%), Gaps = 3/172 (1%)
Query: 75 GSSGVC--DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADE 132
G +G C D DC C C CG+ C N+ FQ R + +++TEK G G+ AD
Sbjct: 164 GKNGACGEDSDCINRATKMECVGDCGCGDECQNQRFQRRQYANVTVIKTEKKGYGLRADS 223
Query: 133 DIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYIN 192
D++ +F+ EY+GEVI++ + +R+ G +FY + + +DAT KGN R+ N
Sbjct: 224 DLRPHQFIFEYIGEVINEASFRKRMIAYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCN 283
Query: 193 HSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
HSC PN + KW++ + R+GIFA R IK GE L ++Y + + D YC
Sbjct: 284 HSCNPNCYVDKWVVGEKLRMGIFAERHIKAGEELVFNYNVDRYGADPQPCYC 335
>gi|73622271|ref|NP_619620.3| histone-lysine N-methyltransferase ASH1L [Mus musculus]
gi|341940590|sp|Q99MY8.3|ASH1L_MOUSE RecName: Full=Histone-lysine N-methyltransferase ASH1L; AltName:
Full=ASH1-like protein; AltName: Full=Absent small and
homeotic disks protein 1 homolog
Length = 2958
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2038 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2095
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2096 RKGCGDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2155
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2156 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2214
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2215 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2252
>gi|291397821|ref|XP_002715465.1| PREDICTED: absent, small, or homeotic 1-like [Oryctolagus cuniculus]
Length = 2961
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 118/218 (54%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCT-ASPGS 76
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2040 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2097
Query: 77 SGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG+ C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2098 RKGCVDDCLNRMIFAECSPNTCPCGDQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2157
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2158 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2216
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2217 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2254
>gi|417407091|gb|JAA50172.1| Putative histone-lysine n-methyltransferase ash1l isoform 1 [Desmodus
rotundus]
Length = 2962
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 118/218 (54%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2041 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2098
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG+ C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2099 RKGCVDDCLNRMIFAECSPNTCPCGDQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2158
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2159 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2217
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2218 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2255
>gi|402856517|ref|XP_003892835.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Papio anubis]
Length = 1277
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 356 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 413
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 414 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 473
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 474 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 532
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 533 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 570
>gi|354478852|ref|XP_003501628.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase ASH1L-like [Cricetulus griseus]
Length = 2962
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2039 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2096
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2097 RKGCGDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2156
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2157 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2215
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2216 CDPNCEMQKWSVNGVYRIGLYALKDVLAGTELTYDYNF 2253
>gi|417407083|gb|JAA50168.1| Putative histone-lysine n-methyltransferase ash1l isoform 1 [Desmodus
rotundus]
Length = 2832
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 118/218 (54%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2041 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2098
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG+ C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2099 RKGCVDDCLNRMIFAECSPNTCPCGDQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2158
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2159 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2217
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2218 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2255
>gi|320583119|gb|EFW97335.1| Histone methyltransferase [Ogataea parapolymorpha DL-1]
Length = 942
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 66/175 (37%), Positives = 99/175 (56%), Gaps = 7/175 (4%)
Query: 63 DDGIFCSCTASPGSSGV-----CDRDCHCGMLLSSCSSG--CKCGNSCLNKPFQNRPVKK 115
D+ + C C G+ GV D DC + C G CG+ C N+ FQN +
Sbjct: 57 DEFMACDCMEDVGADGVNHACGPDSDCINRLTNIECVDGQCSGCGDKCQNQRFQNNEIAP 116
Query: 116 MKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINR 175
+ + +TE G G+ ADEDI F++EY GE++D++ +ER + + G +FY I
Sbjct: 117 VSIFKTEHKGYGMRADEDIYPHTFIVEYKGEIVDNELYKERKEQYSNEGIKHFYFMMIQD 176
Query: 176 DMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
+ +IDAT K + R+ NHSC PN ++KW+++ R+GIFA R I KGE +T+DY
Sbjct: 177 NEIIDATKKASIGRFCNHSCDPNAYVEKWVVNKRFRMGIFAKRKIIKGEEITFDY 231
>gi|390596531|gb|EIN05933.1| hypothetical protein PUNSTDRAFT_54761 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 813
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 81/250 (32%), Positives = 128/250 (51%), Gaps = 11/250 (4%)
Query: 5 KKNSDNSRIGHAFNKLLKQI-GNPVEFELPDWFIKPKAIPYVF--IKRNIYLTKRIKRRL 61
K N D+S G + ++ +Q P + D + +A F +K N Y + R
Sbjct: 45 KTNRDSSTPGASSSRRQQQPQARPGPQLIGDLPVATEAAMATFEELKANHYQYGTLGRSR 104
Query: 62 ED-DGIFCSCT-----ASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVK 114
E +G+ C C P + DC + C G C+CG C N+ F R
Sbjct: 105 EALEGMTCDCQYDHAEDEPWMACGEGSDCINRLTQVECLPGECRCGAHCRNQRFNRREYA 164
Query: 115 KMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEIN 174
+++VQTEK G G+ A EDI++ +F+ EYVG+V+ + ++R+ + G +FY +
Sbjct: 165 PIEIVQTEKKGFGLRAREDIRKDQFIYEYVGDVVSHPSFKKRMREYAQEGIRHFYFMMLQ 224
Query: 175 RDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE- 233
+D IDAT +G R+ NHSC PN + KW + R+GIFA+R I++ E LT++Y +
Sbjct: 225 KDEYIDATKRGGIGRFANHSCSPNCYVAKWTVGTHVRMGIFASRHIRQHEELTFNYNVDR 284
Query: 234 FLHDSLIAYC 243
+ HD+ YC
Sbjct: 285 YGHDAQPCYC 294
>gi|403293713|ref|XP_003937857.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Saimiri
boliviensis boliviensis]
Length = 2970
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 118/218 (54%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C ++
Sbjct: 2049 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDNT 2106
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2107 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2166
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2167 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2225
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2226 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2263
>gi|47216786|emb|CAG03790.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1443
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 79/236 (33%), Positives = 112/236 (47%), Gaps = 33/236 (13%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY FIK N + K ++ D C P C + +C ML C
Sbjct: 990 KPPPYKFIKVNKPVGK-VQVYAADISEIPKCNCKPSDERPCGFESECLNRMLQYECHPQV 1048
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKR-------------------- 136
C G C N+ F R K+++T G G++ DIK+
Sbjct: 1049 CPSGERCCNQDFTQRLYPDTKIIKTPGKGWGLITLRDIKKVSARRPGSPVPVFLPVGRRV 1108
Query: 137 ---------GEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNK 187
GEFV EY+GE+ID++ C R+ + TNFY+ I++D +IDA KGN
Sbjct: 1109 GTSWSDVTQGEFVNEYIGELIDEEECRARIKYAQENNITNFYMLTIDKDRIIDAGPKGNY 1168
Query: 188 SRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
SR++NHSC PN E QKW ++G+TR+G+FA DI G LT++Y + L + C
Sbjct: 1169 SRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEKTVCC 1224
>gi|401888851|gb|EJT52799.1| histone-lysine N-methyltransferase [Trichosporon asahii var. asahii
CBS 2479]
Length = 905
Score = 125 bits (313), Expect = 2e-26, Method: Composition-based stats.
Identities = 63/164 (38%), Positives = 92/164 (56%), Gaps = 9/164 (5%)
Query: 81 DRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
+ DC ++ C +G CK G C N+ + + +VQTEK G G+ A EDI G
Sbjct: 206 ESDCINRIMFIECLAGECKAGKRCRNQRLARKQYAPVDVVQTEKKGFGLRAREDIPAGAL 265
Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
V EY+GEV+ ++T +R+ + G +FY + ++ IDAT KG R+ NHSC PN
Sbjct: 266 VYEYIGEVVVEKTFRKRMAEYADEGIKHFYFMMLQKEEYIDATKKGGIGRFANHSCNPNC 325
Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
E+QKW++ R+GIF RD+KK E +T++Y I YC
Sbjct: 326 EVQKWVVGRRMRMGIFTKRDVKKDEEITFNYN--------ICYC 361
>gi|298706866|emb|CBJ25830.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 810
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 67/168 (39%), Positives = 94/168 (55%), Gaps = 11/168 (6%)
Query: 76 SSGVCDRDCHCGMLLSSCSSG---------CKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
S GVC C ML + C G CK G C N+ QNR K+ + G
Sbjct: 491 SEGVCGDSCLNAMLRTECVCGNGIRAKYQNCKLGAGCGNRRLQNRVNAKVAPFREAGMGW 550
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHL--GETNFYLCEINRDMVIDATYK 184
G+ D+ +G + EYVGEVID+ E R+ + + L + FY+ E+ + + IDA K
Sbjct: 551 GLKVAVDVPKGSLIGEYVGEVIDEAMVEHRMAEQRRLRPNDGEFYIMELGQSLFIDAKEK 610
Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
GN R INHSC PN ++Q W I G TR+GI+A +D+ KGE+L+YDY++
Sbjct: 611 GNLMRLINHSCNPNCDVQAWNIAGYTRLGIYAKKDLAKGESLSYDYKF 658
>gi|390476801|ref|XP_002760038.2| PREDICTED: histone-lysine N-methyltransferase ASH1L [Callithrix
jacchus]
Length = 2970
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/217 (34%), Positives = 114/217 (52%), Gaps = 14/217 (6%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ + E C
Sbjct: 2049 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVDVKPLSGYEATTCNCKKPDDDARK 2108
Query: 78 GVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIK 135
G D DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +K
Sbjct: 2109 GCVD-DCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLK 2167
Query: 136 RGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSC 195
G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHSC
Sbjct: 2168 AGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHSC 2226
Query: 196 CPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2227 DPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2263
>gi|196001997|ref|XP_002110866.1| hypothetical protein TRIADDRAFT_54228 [Trichoplax adhaerens]
gi|190586817|gb|EDV26870.1| hypothetical protein TRIADDRAFT_54228 [Trichoplax adhaerens]
Length = 1004
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 70/186 (37%), Positives = 97/186 (52%), Gaps = 2/186 (1%)
Query: 50 NIYLTKRIKRRLEDDGIFCSCTASPGSSGV--CDRDCHCGMLLSSCSSGCKCGNSCLNKP 107
N YL+ R + + C CT C +C ML C S C CG C N+
Sbjct: 57 NTYLSGRKSKDKRIKNMTCECTYDEDDPNFIGCGENCLNRMLFIECGSKCSCGKFCTNRR 116
Query: 108 FQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN 167
FQ K+++ +TEK G G+ ED++ +FV+EY GEVID + E R +
Sbjct: 117 FQMAEYPKVEVFKTEKKGFGLRTLEDLEDNQFVLEYCGEVIDLREFERRKRDYAKKKIKH 176
Query: 168 FYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLT 227
+Y ++ + +IDA+ KG SR+INHSC PN QKW ++G RIG F R I LT
Sbjct: 177 YYFMTLSPNEIIDASRKGTFSRFINHSCDPNCVTQKWTVNGMLRIGFFTLRKIPANTELT 236
Query: 228 YDYQYE 233
+DYQ+E
Sbjct: 237 FDYQFE 242
>gi|426216789|ref|XP_004002640.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Ovis aries]
Length = 2965
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2044 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2101
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2102 KKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2161
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2162 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2220
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2221 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2258
>gi|300795068|ref|NP_001179672.1| probable histone-lysine N-methyltransferase ASH1L [Bos taurus]
gi|296489728|tpg|DAA31841.1| TPA: ash1 (absent, small, or homeotic)-like [Bos taurus]
Length = 2965
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2044 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2101
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2102 KKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2161
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2162 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2220
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2221 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2258
>gi|440903623|gb|ELR54260.1| Putative histone-lysine N-methyltransferase ASH1L [Bos grunniens
mutus]
Length = 2965
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2044 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2101
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2102 KKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2161
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2162 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2220
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2221 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2258
>gi|116207846|ref|XP_001229732.1| hypothetical protein CHGG_03216 [Chaetomium globosum CBS 148.51]
gi|121787978|sp|Q2H988.1|SET2_CHAGB RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|88183813|gb|EAQ91281.1| hypothetical protein CHGG_03216 [Chaetomium globosum CBS 148.51]
Length = 894
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 67/186 (36%), Positives = 97/186 (52%), Gaps = 6/186 (3%)
Query: 64 DGIFCSCTAS--PGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKL 118
D + C C G + C D C + S C CG C N+ FQ + + +
Sbjct: 73 DALDCDCAEEWHDGQNHACGEDSDCINRATKIECVSGDCNCGEGCENQRFQRKQYANVSV 132
Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
++TEK G G+ D D++ +FV EYVGEVI++ T R K G +FY + +
Sbjct: 133 IKTEKKGFGLRTDADLQANDFVFEYVGEVINEPTFRNRTVKYDKEGIKHFYFMSLTKSEF 192
Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHD 237
+DAT KGN R+ NHSC PN + KW++ + R+GIFATR I+ GE L ++Y + + D
Sbjct: 193 VDATKKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFATRAIRAGEELVFNYNVDRYGAD 252
Query: 238 SLIAYC 243
YC
Sbjct: 253 PQPCYC 258
>gi|417515828|gb|JAA53722.1| histone-lysine N-methyltransferase ASH1L [Sus scrofa]
Length = 2951
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2030 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2087
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2088 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2147
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2148 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2206
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2207 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2244
>gi|156051364|ref|XP_001591643.1| hypothetical protein SS1G_07089 [Sclerotinia sclerotiorum 1980]
gi|154704867|gb|EDO04606.1| hypothetical protein SS1G_07089 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 971
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 77/225 (34%), Positives = 119/225 (52%), Gaps = 10/225 (4%)
Query: 27 PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDC 84
P+ +LPD + ++ + IK IY K + + D + C C S G++ C D
Sbjct: 114 PLFNDLPDSTEEAASV-FQVIKDCIYGAKHMGAS-DHDALDCDCPEEYSDGNNHACGEDS 171
Query: 85 HCGMLLS--SCSSG---CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
C L+ C G C CG +C N+ FQ R K+ +++T+K G G+ A+ D+ +F
Sbjct: 172 DCINRLTKMECGGGHKDCNCGVACQNQRFQRRQYAKVSVIKTDKKGYGLRANTDLHPDDF 231
Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
+ EY+GEVI++ T R + G +FY + + +DAT KGN R+ NHSC PN
Sbjct: 232 IFEYIGEVINEPTFRRRTIQYDQEGIKHFYFMSLTKHEFVDATKKGNLGRFCNHSCNPNC 291
Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R IK GE L ++Y + + D YC
Sbjct: 292 YVDKWVVGEKLRMGIFAERAIKAGEELVFNYNVDRYGADPQPCYC 336
>gi|410226116|gb|JAA10277.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
gi|410264036|gb|JAA19984.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
gi|410264040|gb|JAA19986.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
gi|410306368|gb|JAA31784.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
gi|410355463|gb|JAA44335.1| ash1 (absent, small, or homeotic)-like [Pan troglodytes]
Length = 2964
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2043 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2100
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2101 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2160
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2161 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2219
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2220 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2257
>gi|410033849|ref|XP_003949641.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Pan troglodytes]
Length = 2964
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2043 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2100
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2101 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2160
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2161 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2219
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2220 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2257
>gi|354548180|emb|CCE44916.1| hypothetical protein CPAR2_407180 [Candida parapsilosis]
Length = 810
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 61/155 (39%), Positives = 92/155 (59%), Gaps = 3/155 (1%)
Query: 79 VCDRDCHCGMLLSS---CSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIK 135
C D +C L+S + C CG+ C N+ FQ R + + QTE G G+ A++ I
Sbjct: 71 ACGEDSNCINRLTSVECTNRHCLCGDDCQNQRFQKRQYADVSVFQTELKGYGLKANKPIS 130
Query: 136 RGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSC 195
G+F+ EY+GEVID+ +R+ + +FY + D IDAT KG+ +R++NHSC
Sbjct: 131 EGQFIYEYIGEVIDEGAFRQRMIEYDMKNYKHFYFMMLKPDAFIDATEKGSLARFVNHSC 190
Query: 196 CPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
PN + KW++ + R+GIFA R+I KGE +T+DY
Sbjct: 191 NPNAFVDKWVVGDKLRMGIFAKRNIAKGEEITFDY 225
>gi|367049654|ref|XP_003655206.1| hypothetical protein THITE_2118632 [Thielavia terrestris NRRL 8126]
gi|347002470|gb|AEO68870.1| hypothetical protein THITE_2118632 [Thielavia terrestris NRRL 8126]
Length = 931
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 65/165 (39%), Positives = 95/165 (57%), Gaps = 2/165 (1%)
Query: 81 DRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
D DC C G C CG+ C N+ FQ + K+ +++TEK G G+ AD D++ +F
Sbjct: 138 DSDCINRATKIECVDGECNCGDGCQNQRFQRKQYAKVSVIKTEKKGFGLRADTDLQPNDF 197
Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
V EYVGEVI++ T R+ K G +FY + ++ +DAT KGN R+ NHSC PN
Sbjct: 198 VYEYVGEVINEPTFRSRMLKYDKEGIKHFYFMSLTKNEFVDATKKGNLGRFCNHSCNPNC 257
Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R I+ GE L ++Y + + D YC
Sbjct: 258 YVDKWVVGEKLRMGIFAGRRIRAGEELVFNYNVDRYGADPQPCYC 302
>gi|7739725|gb|AAF68983.1|AF257305_1 ASH1 [Homo sapiens]
Length = 2969
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2048 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2105
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2106 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2165
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2166 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2224
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2225 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2262
>gi|397492363|ref|XP_003817092.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
ASH1L [Pan paniscus]
Length = 2964
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2043 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2100
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2101 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2160
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2161 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2219
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2220 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2257
>gi|351696657|gb|EHA99575.1| Putative histone-lysine N-methyltransferase ASH1L [Heterocephalus
glaber]
Length = 2930
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2009 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2066
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2067 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2126
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2127 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2185
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2186 CDPNCEMQKWSVNGVYRIGLYALKDMTAGTELTYDYNF 2223
>gi|357443579|ref|XP_003592067.1| hypothetical protein MTR_1g098370 [Medicago truncatula]
gi|355481115|gb|AES62318.1| hypothetical protein MTR_1g098370 [Medicago truncatula]
Length = 491
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 73/182 (40%), Positives = 102/182 (56%), Gaps = 6/182 (3%)
Query: 58 KRRLEDDGIFCSC---TASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPV 113
K++ E+D C C P S+ C C + + C+ G C C C N+ FQ
Sbjct: 38 KKQKEEDIAICECRYDEDDPDSA--CGDGCLNVLTSTECTPGFCPCDIHCKNQKFQKCEY 95
Query: 114 KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
K KL +TE G G++ADE+IK G+FVIEY GEVI + + R + G + ++ +
Sbjct: 96 AKTKLFKTEGRGWGLLADEEIKAGQFVIEYCGEVISCKEAKRRSHTYEIQGLKDAFIISL 155
Query: 174 NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
N IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA +I G L YDY +E
Sbjct: 156 NASESIDATRKGSLARFINHSCQPNCETRKWNVMGEIRVGIFALENIPIGTELAYDYNFE 215
Query: 234 FL 235
+
Sbjct: 216 WF 217
>gi|350583322|ref|XP_003125756.3| PREDICTED: probable histone-lysine N-methyltransferase ASH1L-like,
partial [Sus scrofa]
Length = 2824
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 1897 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 1954
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 1955 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2014
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2015 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2073
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2074 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2111
>gi|338724967|ref|XP_001499134.2| PREDICTED: probable histone-lysine N-methyltransferase ASH1L isoform
1 [Equus caballus]
Length = 2963
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2042 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2099
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2100 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2159
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2160 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2218
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2219 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2256
>gi|110349788|ref|NP_060959.2| histone-lysine N-methyltransferase ASH1L [Homo sapiens]
gi|225000936|gb|AAI72595.1| Ash1 (absent, small, or homeotic)-like (Drosophila) [synthetic
construct]
Length = 2964
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2043 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2100
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2101 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2160
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2161 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2219
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2220 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2257
>gi|117949323|sp|Q9NR48.2|ASH1L_HUMAN RecName: Full=Histone-lysine N-methyltransferase ASH1L; AltName:
Full=ASH1-like protein; Short=huASH1; AltName:
Full=Absent small and homeotic disks protein 1 homolog;
AltName: Full=Lysine N-methyltransferase 2H
Length = 2969
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2048 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2105
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2106 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2165
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2166 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2224
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2225 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2262
>gi|392587279|gb|EIW76613.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 793
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 68/187 (36%), Positives = 100/187 (53%), Gaps = 7/187 (3%)
Query: 64 DGIFCSCTASPGSSGVCD-----RDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMK 117
+G+ C C PG S D DC + C G C+C N C N+ FQ + ++
Sbjct: 66 EGMACDCQFEPGVSDPSDACGPHSDCINRLTEVECIPGECRCRNYCQNQRFQRKQYANIE 125
Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
+VQTEK G G+ A DI + F+ EYVG+V+ + +R+ + G +FY + +
Sbjct: 126 VVQTEKKGFGLRAGSDIPKDAFIYEYVGDVVSHPSFLKRMREYAAEGIRHFYFMMLQSNE 185
Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLH 236
IDAT +G K R+ NHSC PN + KW + R+GIFA R +K+ E LT++Y + + H
Sbjct: 186 YIDATKRGGKGRFANHSCNPNCYVAKWTVGPHVRMGIFAKRAVKRNEELTFNYNVDRYGH 245
Query: 237 DSLIAYC 243
D YC
Sbjct: 246 DPQPCYC 252
>gi|119573453|gb|EAW53068.1| ash1 (absent, small, or homeotic)-like (Drosophila) [Homo sapiens]
Length = 2969
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2048 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2105
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2106 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2165
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2166 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2224
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2225 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2262
>gi|320586545|gb|EFW99215.1| set and ww domain containing protein [Grosmannia clavigera kw1407]
Length = 1051
Score = 124 bits (312), Expect = 2e-26, Method: Composition-based stats.
Identities = 64/177 (36%), Positives = 94/177 (53%), Gaps = 5/177 (2%)
Query: 62 EDDGIFCSCTAS--PGSSGVC--DRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKM 116
E D C C A G + C D DC C G C CG +C N+ FQ + +
Sbjct: 119 EHDAFECDCAAEWHDGQNLACGEDSDCINRATKMECVRGDCNCGETCQNQRFQRKQYAAV 178
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+++T K G G+ AD ++ +FV EY+GEVI++ R+ + G +FY + +
Sbjct: 179 SVIKTAKKGFGLRADVALRANDFVYEYIGEVINEPAFRRRMLQYDEEGIRHFYFMSLTKH 238
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
+DAT KGN R+ NHSC PN + KW++ + R+GIFA+R I GE L +DY +
Sbjct: 239 EFVDATRKGNLGRFCNHSCRPNCYVDKWVVGEKLRMGIFASRAIAAGEELVFDYNVD 295
>gi|332219957|ref|XP_003259124.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
ASH1L [Nomascus leucogenys]
Length = 2892
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2043 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2100
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2101 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2160
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2161 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2219
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2220 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2257
>gi|426331996|ref|XP_004026979.1| PREDICTED: histone-lysine N-methyltransferase ASH1L [Gorilla gorilla
gorilla]
Length = 2776
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2043 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2100
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2101 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2160
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2161 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2219
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2220 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2257
>gi|357611223|gb|EHJ67375.1| hypothetical protein KGM_13830 [Danaus plexippus]
Length = 1798
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 72/215 (33%), Positives = 114/215 (53%), Gaps = 17/215 (7%)
Query: 32 LPDWFIKPKAIPYVFIKRNIYLTKRIKRRLED-DGIFCSCTASPGSSGVCDRDCHCGMLL 90
+P W Y I+ N+Y +K E+ + + C+C S C+ DC ++
Sbjct: 886 VPSWN-------YKKIRTNVYYD--VKPSAEECESVACNC----APSSACNEDCINRLVY 932
Query: 91 SSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVI 148
S CS C CG+ C N+ Q V ++ TE G G+ + I+ G+F++EYVGEV+
Sbjct: 933 SECSPQLCPCGDKCKNQRIQRHEWVPGLEKFMTENKGWGVRTKQMIRSGDFILEYVGEVV 992
Query: 149 DDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDG 208
D+ +ER+ ++ +T+ Y ++ +VID G R++NHSC PN EMQKW +G
Sbjct: 993 SDKEFKERMA-TRYARDTHHYCLHLDGGLVIDGHRVGGDGRFVNHSCRPNCEMQKWTANG 1051
Query: 209 ETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
R+ +FA RDI+ E LTYDY + + ++ C
Sbjct: 1052 TFRMALFALRDIEPDEELTYDYNFSLFNPAVGQPC 1086
>gi|410986772|ref|XP_003999683.1| PREDICTED: histone-lysine N-methyltransferase ASH1L isoform 1 [Felis
catus]
Length = 2965
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCT-ASPGS 76
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2044 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2101
Query: 77 SGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2102 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2161
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2162 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2220
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2221 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2258
>gi|380814664|gb|AFE79206.1| putative histone-lysine N-methyltransferase ASH1L [Macaca mulatta]
gi|383419979|gb|AFH33203.1| putative histone-lysine N-methyltransferase ASH1L [Macaca mulatta]
Length = 2963
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2042 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2099
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2100 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2159
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2160 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2218
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2219 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2256
>gi|348579791|ref|XP_003475662.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase ASH1L-like [Cavia porcellus]
Length = 2964
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSS 77
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2043 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKRPDDET 2100
Query: 78 GV-CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2101 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2160
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2161 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2219
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2220 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2257
>gi|410986774|ref|XP_003999684.1| PREDICTED: histone-lysine N-methyltransferase ASH1L isoform 2 [Felis
catus]
Length = 2974
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 117/218 (53%), Gaps = 16/218 (7%)
Query: 28 VEFELP-----DW-----FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCT-ASPGS 76
++F+LP W + KP Y I+ N+Y+ +K + C+C +
Sbjct: 2053 IDFQLPYDILWQWKHNQLYKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDT 2110
Query: 77 SGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +
Sbjct: 2111 RKGCVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPL 2170
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHS
Sbjct: 2171 KAGQFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHS 2229
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2230 CDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2267
>gi|258571704|ref|XP_002544655.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237904925|gb|EEP79326.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 1013
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 62/164 (37%), Positives = 91/164 (55%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG+SC N+ FQ R + +++TEK G G+ D D++ +F+
Sbjct: 171 DSDCINRATKMECVGDCGCGDSCQNQRFQQREYANVSVIKTEKKGYGLRTDSDLRPNQFI 230
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GEVI++ R+ G +FY +N+ +DAT KGN R+ NHSC PN
Sbjct: 231 FEYIGEVINEPQFRRRMINYDEEGIKHFYFMSLNKGEFVDATKKGNLGRFCNHSCNPNCY 290
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R IK GE L ++Y + + D YC
Sbjct: 291 VDKWVVGEKLRMGIFAERHIKAGEELVFNYNVDRYGADPQPCYC 334
>gi|345486764|ref|XP_001606931.2| PREDICTED: histone-lysine N-methyltransferase ash1 [Nasonia
vitripennis]
Length = 1690
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 77/237 (32%), Positives = 115/237 (48%), Gaps = 19/237 (8%)
Query: 19 KLLKQIGNPVEFELPDWFIKPKA-------IP---YVFIKRNIYLTKRIKRRLEDDGIFC 68
K L+Q P + W++ + +P Y I+ N+Y +K + + C
Sbjct: 763 KFLRQREIPFQLPYDLWWLHTHSRLPGRDLVPSWNYRKIRSNVYYD--VKPTMHYEAQAC 820
Query: 69 SCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGA 126
C G C DC M+ S CS C CG C N+ Q ++ TE G
Sbjct: 821 ECKPDAG----CGDDCINRMVFSECSPQLCPCGERCKNQKIQKHDWAPGLQRFMTESKGW 876
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ E I+ GEF++EYVGEV+ ++ + R+ ++ +T+ Y ++ +VID G
Sbjct: 877 GVRTHEPIRTGEFILEYVGEVVSEREFKTRMA-TRYANDTHHYCLHLDGGLVIDGHRMGG 935
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
R++NHSC PN EMQKW + G R+ +FA RDI GE LTYDY + + S C
Sbjct: 936 DGRFVNHSCEPNCEMQKWSVHGLPRMALFALRDITAGEELTYDYNFALFNPSEGQEC 992
>gi|74140676|dbj|BAC28183.2| unnamed protein product [Mus musculus]
Length = 418
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 71/200 (35%), Positives = 111/200 (55%), Gaps = 6/200 (3%)
Query: 36 FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCT-ASPGSSGVCDRDCHCGMLLSSCS 94
+ KP Y I+ N+Y+ +K + C+C + C DC M+ + CS
Sbjct: 9 YKKPDVPLYKKIRSNVYVD--VKPLSGYEATTCNCKKPDDDTRKGCGDDCLNRMIFAECS 66
Query: 95 -SGCKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
+ C CG C N+ Q V+ ++ + E+ G GI E +K G+F+IEY+GEV+ +Q
Sbjct: 67 PNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEVVSEQE 126
Query: 153 CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
R+ + H ++ Y ++ MVID+ GN++R+INHSC PN EMQKW ++G RI
Sbjct: 127 FRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRI 185
Query: 213 GIFATRDIKKGENLTYDYQY 232
G++A +D+ G LTYDY +
Sbjct: 186 GLYALKDMPAGTELTYDYNF 205
>gi|47219458|emb|CAG10822.1| unnamed protein product [Tetraodon nigroviridis]
Length = 2598
Score = 124 bits (311), Expect = 4e-26, Method: Composition-based stats.
Identities = 70/200 (35%), Positives = 110/200 (55%), Gaps = 6/200 (3%)
Query: 36 FIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSC-TASPGSSGVCDRDCHCGMLLSSCS 94
+ +P Y I+ N+Y+ +K + C+C T + C DC M + CS
Sbjct: 1666 YKRPDVPLYKKIRSNVYVD--VKPLSGYETTTCNCRTPDDQTEKSCLDDCLNRMSFAECS 1723
Query: 95 -SGCKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
S C C + C N+ Q V+ ++ +TE G GI + ++ G+F+IEY+GEV+ +Q
Sbjct: 1724 PSTCPCADQCDNQRIQRHEWVQCLERFRTEGKGWGIRTKQPLRAGQFIIEYLGEVVSEQE 1783
Query: 153 CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
R+ + ++ + Y ++ MVID+ GN++R+INHSC PN EMQKW ++G RI
Sbjct: 1784 FRSRMME-QYFSHSGNYCLNLDSGMVIDSYRMGNEARFINHSCEPNCEMQKWSVNGVYRI 1842
Query: 213 GIFATRDIKKGENLTYDYQY 232
G+FA +I G LTYDY +
Sbjct: 1843 GLFALGEIPSGTELTYDYNF 1862
>gi|327354316|gb|EGE83173.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ATCC
18188]
Length = 1159
Score = 124 bits (311), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 66/183 (36%), Positives = 98/183 (53%), Gaps = 7/183 (3%)
Query: 68 CSCTAS----PGSSGVC--DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQT 121
C CT +G C D DC C C CG+ C N+ FQ R + +++T
Sbjct: 155 CDCTEEWDPVAAKNGACGEDSDCINRATKMECVGDCGCGDECENQRFQRRQYANVTVIKT 214
Query: 122 EKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDA 181
EK G G+ AD D++ +F+ EY+GEVI++ +R+ G +FY +++ +DA
Sbjct: 215 EKKGYGLRADSDLRPNQFIFEYIGEVINEGNFRKRMRSYDEEGIKHFYFMSLSKGEFVDA 274
Query: 182 TYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLI 240
T KGN R+ NHSC PN + KW++ + R+GIFA R IK GE L ++Y + + D
Sbjct: 275 TKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFAERHIKAGEELVFNYNVDRYGADPQP 334
Query: 241 AYC 243
YC
Sbjct: 335 CYC 337
>gi|444323962|ref|XP_004182621.1| hypothetical protein TBLA_0J01040 [Tetrapisispora blattae CBS 6284]
gi|387515669|emb|CCH63102.1| hypothetical protein TBLA_0J01040 [Tetrapisispora blattae CBS 6284]
Length = 745
Score = 124 bits (311), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 65/169 (38%), Positives = 95/169 (56%), Gaps = 6/169 (3%)
Query: 68 CSCTA--SPGSSGVCDRDCHC---GMLLSSCSSGCK-CGNSCLNKPFQNRPVKKMKLVQT 121
C C S G + CD + C L+ +S C CG++C N+ FQ + + QT
Sbjct: 44 CDCVENYSDGVNHSCDENSDCINRLTLIECVNSKCNYCGDNCQNQRFQKHEYSNISIFQT 103
Query: 122 EKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDA 181
E G G+ A+E+++ +F+ EY+GEVIDD +R+ GE +FY + IDA
Sbjct: 104 EMKGFGVRANENLEINQFIYEYIGEVIDDDEFHKRMINYDQRGEKHFYFMMLKSGEFIDA 163
Query: 182 TYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
T KGN R+ NHSC PN + KW++ + ++GIFA R I KGE +T+DY
Sbjct: 164 TEKGNLGRFCNHSCNPNAYVNKWLVGDKLKMGIFAKRPIDKGEEITFDY 212
>gi|330946631|ref|XP_003306794.1| hypothetical protein PTT_20031 [Pyrenophora teres f. teres 0-1]
gi|311315535|gb|EFQ85104.1| hypothetical protein PTT_20031 [Pyrenophora teres f. teres 0-1]
Length = 961
Score = 124 bits (310), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 72/224 (32%), Positives = 115/224 (51%), Gaps = 10/224 (4%)
Query: 27 PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCT----ASPGSSGVC-- 80
P+ +LPD K +V + Y K I D + C C ++ ++ C
Sbjct: 119 PLYLDLPDSTEAAKET-FVVLPECTYANKTIG--TTDPALECDCQEEYDSATKTNHACGE 175
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG C N+ FQ + + +++TEK G G+ A++++ G+FV
Sbjct: 176 DSDCINRATKMECVGDCSCGRRCQNQRFQRKQYADVTVIKTEKKGFGLRANKNMVPGDFV 235
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GEVID++T R+ + G +FY + + +DAT KGN R+ NHSC PN
Sbjct: 236 FEYIGEVIDERTFRRRMGQYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCF 295
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIF R+++ GE L ++Y + + D YC
Sbjct: 296 VDKWVVGDKLRMGIFVERNVRAGEELVFNYNVDRYGADPQPCYC 339
>gi|239609047|gb|EEQ86034.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ER-3]
Length = 897
Score = 124 bits (310), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 66/183 (36%), Positives = 98/183 (53%), Gaps = 7/183 (3%)
Query: 68 CSCTAS----PGSSGVC--DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQT 121
C CT +G C D DC C C CG+ C N+ FQ R + +++T
Sbjct: 155 CDCTEEWDPVAAKNGACGEDSDCINRATKMECVGDCGCGDECENQRFQRRQYANVTVIKT 214
Query: 122 EKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDA 181
EK G G+ AD D++ +F+ EY+GEVI++ +R+ G +FY +++ +DA
Sbjct: 215 EKKGYGLRADSDLRPNQFIFEYIGEVINEGNFRKRMRSYDEEGIKHFYFMSLSKGEFVDA 274
Query: 182 TYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLI 240
T KGN R+ NHSC PN + KW++ + R+GIFA R IK GE L ++Y + + D
Sbjct: 275 TKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFAERHIKAGEELVFNYNVDRYGADPQP 334
Query: 241 AYC 243
YC
Sbjct: 335 CYC 337
>gi|71013616|ref|XP_758647.1| hypothetical protein UM02500.1 [Ustilago maydis 521]
gi|74702635|sp|Q4PBL3.1|SET2_USTMA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|46098305|gb|EAK83538.1| hypothetical protein UM02500.1 [Ustilago maydis 521]
Length = 972
Score = 124 bits (310), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 69/175 (39%), Positives = 100/175 (57%), Gaps = 4/175 (2%)
Query: 63 DDGIFCSCTASPGSSGVCDRD---CHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKKMKL 118
DD + C CT + G+ + D C M CS S C+ G C N+ F R + +
Sbjct: 191 DDYMICDCTPNSGNLDMACTDYSGCINRMTQIECSASKCRWGKQCRNQRFHRRQYVDVDI 250
Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
VQTEK G G+ A +DI + F+ EYVGEV++ T +R+ + + G +FY + +
Sbjct: 251 VQTEKKGFGLRACQDIPKETFIYEYVGEVMNQTTFLQRMQQYRIEGIRHFYFMMLQPNEY 310
Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
+DAT KG K R+INHSC PN + KW + R+GIFA R+I+KGE LT++Y +
Sbjct: 311 LDATKKGGKGRFINHSCNPNCAVSKWQVGKHLRMGIFAKRNIQKGEELTFNYNVD 365
>gi|332021876|gb|EGI62212.1| Putative histone-lysine N-methyltransferase ASH1L [Acromyrmex
echinatior]
Length = 2178
Score = 124 bits (310), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 78/237 (32%), Positives = 119/237 (50%), Gaps = 19/237 (8%)
Query: 19 KLLKQIGNPVEFELPDWFIKPKA-------IP---YVFIKRNIYLTKRIKRRLEDDGIFC 68
KLL+Q P + + ++ ++ +P Y I+ NIY +K + C
Sbjct: 1296 KLLRQRKTPFQLQYDLLWLHAQSRLSGRDLVPSWNYKKIRTNIYYD--VKPTTLYEAQAC 1353
Query: 69 SCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGA 126
C G C DC M+ S CS C CG+ C N+ Q ++ TE G
Sbjct: 1354 ECKQESG----CGDDCINRMVFSECSPQLCPCGDKCENQKIQKHEWSPGLQRFMTEDKGW 1409
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ + IK G+F++EYVGEV+ ++ + R+ ++ +T+ Y ++ +VID G
Sbjct: 1410 GVRTQQAIKAGDFILEYVGEVVSEREFKSRMA-TRYANDTHHYCLHLDGGLVIDGHRMGG 1468
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
R++NHSC PN EMQKW + G R+ +FA+RDIK GE LTYDY + + S C
Sbjct: 1469 DGRFVNHSCEPNCEMQKWSVLGLPRMALFASRDIKPGEELTYDYNFALFNPSEGQQC 1525
>gi|261189306|ref|XP_002621064.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
SLH14081]
gi|239591641|gb|EEQ74222.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
SLH14081]
Length = 897
Score = 124 bits (310), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 66/186 (35%), Positives = 99/186 (53%), Gaps = 7/186 (3%)
Query: 65 GIFCSCTAS----PGSSGVC--DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKL 118
+ C CT +G C D DC C C CG+ C N+ FQ R + +
Sbjct: 152 AMECDCTEEWDPVAAKNGACGEDSDCINRATKMECVGDCGCGDECENQRFQRRQYANVTV 211
Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
++TEK G G+ AD D++ +F+ EY+GEVI++ +R+ G +FY +++
Sbjct: 212 IKTEKKGYGLRADSDLRPNQFIFEYIGEVINEGNFRKRMRSYDEEGIKHFYFMSLSKGEF 271
Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHD 237
+DAT KGN R+ NHSC PN + KW++ + R+GIFA R IK GE L ++Y + + D
Sbjct: 272 VDATKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFAERHIKAGEELVFNYNVDRYGAD 331
Query: 238 SLIAYC 243
YC
Sbjct: 332 PQPCYC 337
>gi|440633543|gb|ELR03462.1| hypothetical protein GMDG_06195 [Geomyces destructans 20631-21]
Length = 977
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 65/183 (35%), Positives = 100/183 (54%), Gaps = 4/183 (2%)
Query: 65 GIFCSCTASPGSSGVCDRDCHCGMLLSSCS---SGCKCGNSCLNKPFQNRPVKKMKLVQT 121
G CS G + C D C L+ C CG +C N+ FQ + K+ +++T
Sbjct: 149 GCDCSEDWRDGKNHACGEDTDCINRLTKIECMDDECNCGPNCQNQRFQRKEFSKVSVIKT 208
Query: 122 EKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDA 181
EK G G+ AD D+ +F+ EY+GEV+++ T R+ + G +FY +++ +DA
Sbjct: 209 EKKGYGLRADTDLSAHDFIFEYIGEVVNEPTLRRRMRQYDEEGIKHFYFMSLSKSEFVDA 268
Query: 182 TYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLI 240
T KGN R+ NHSC PN + KW++ + R+GIFA R+IK GE L ++Y + + D
Sbjct: 269 TIKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFAERNIKVGEELVFNYNVDRYGADPQP 328
Query: 241 AYC 243
YC
Sbjct: 329 CYC 331
>gi|296081207|emb|CBI18233.3| unnamed protein product [Vitis vinifera]
Length = 370
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 73/186 (39%), Positives = 96/186 (51%), Gaps = 2/186 (1%)
Query: 51 IYLTKRIKRRLEDDGIFCSCT-ASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPF 108
+ L R ++ E+D C C + C C + C+ C C C N+ F
Sbjct: 6 LLLCSRHIKQKENDIAICECKYKANDPDSACGERCWNVLTSIECTPRYCPCSIHCKNQRF 65
Query: 109 QNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNF 168
Q R K KL + E G G++A E+IK GEFV+EY GEVI R G +
Sbjct: 66 QKREYAKTKLFRAEGRGWGLLATENIKAGEFVMEYCGEVISRTEARGRSQVYVSQGLKDV 125
Query: 169 YLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTY 228
Y+ +N IDAT KGN +R+INHSC PN E KW + GE R+GIFA R+I G LTY
Sbjct: 126 YIIPLNARECIDATKKGNLARFINHSCQPNCETMKWSVLGEDRVGIFALRNISVGTELTY 185
Query: 229 DYQYEF 234
Y +E+
Sbjct: 186 SYNFEW 191
>gi|323354369|gb|EGA86208.1| Set2p [Saccharomyces cerevisiae VL3]
Length = 683
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 68/186 (36%), Positives = 100/186 (53%), Gaps = 6/186 (3%)
Query: 51 IYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSG--CKCGNSCL 104
IY KRI +D + C C S G + CD D C + L C + CGN C
Sbjct: 51 IYANKRIGTFKNNDFMECDCYEEFSDGVNHACDEDSDCINRLTLIECVNDLCSSCGNDCQ 110
Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
N+ FQ + + + +T+ G G+ A++DI+ +F+ EY GEVI++ +RL
Sbjct: 111 NQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLIDYDQRH 170
Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
+FY + IDAT KG+ +R+ NHSC PN + KW++ + R+GIFA R I KGE
Sbjct: 171 FKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQRKILKGE 230
Query: 225 NLTYDY 230
+T+DY
Sbjct: 231 EITFDY 236
>gi|448524136|ref|XP_003868932.1| Set2 protein [Candida orthopsilosis Co 90-125]
gi|380353272|emb|CCG26028.1| Set2 protein [Candida orthopsilosis]
Length = 806
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 61/155 (39%), Positives = 91/155 (58%), Gaps = 3/155 (1%)
Query: 79 VCDRDCHCGMLLSSC---SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIK 135
C D +C L+S + C CG+ C N+ FQ R + + QTE G G+ A++ I
Sbjct: 71 ACGEDSNCINRLTSVECTNRHCLCGDDCQNQRFQKRQYADVSVFQTELKGYGLKANKPIS 130
Query: 136 RGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSC 195
G+F+ EY+GEVID+ +R+ + +FY + D IDAT KG+ +R++NHSC
Sbjct: 131 EGQFIYEYIGEVIDEGAFRQRMIEYDVKNYKHFYFMMLKPDAFIDATEKGSLARFVNHSC 190
Query: 196 CPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
PN + KW++ + R+GIFA R I KGE +T+DY
Sbjct: 191 NPNAFVDKWVVGDKLRMGIFAKRKIAKGEEITFDY 225
>gi|409043202|gb|EKM52685.1| hypothetical protein PHACADRAFT_211907 [Phanerochaete carnosa
HHB-10118-sp]
Length = 841
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 111/208 (53%), Gaps = 8/208 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLE-DDGIFCSCTASPG---SSGVCDRDCHCGMLLSSCS---SG 96
++ I N Y + + R E + + C C G S C D C L+
Sbjct: 111 FIEIPDNHYQYQSLGRSREAGESMTCDCVYEHGVSDPSDACGHDSDCINRLTQVECLPED 170
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C+C + C N+ FQ + +++VQTEK G G+ A +D+++ F+ EYVG+V++ + +R
Sbjct: 171 CRCRSHCQNQRFQRKQHAPIEIVQTEKKGFGLRAGQDLRKDTFIYEYVGDVVNHPSLLKR 230
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + G +FY + +D IDAT +G R+ NHSC PN + KW + R+GIFA
Sbjct: 231 MRQYGEEGIQHFYFMALQKDEFIDATKRGGIGRFANHSCNPNCYVAKWSVGKRVRMGIFA 290
Query: 217 TRDIKKGENLTYDYQYE-FLHDSLIAYC 243
R+IKK E LT++Y + + H+ + YC
Sbjct: 291 NRNIKKNEELTFNYNVDRYGHEPQMCYC 318
>gi|353234454|emb|CCA66479.1| related to SET2-Histone methyltransferase [Piriformospora indica
DSM 11827]
Length = 799
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 75/211 (35%), Positives = 108/211 (51%), Gaps = 27/211 (12%)
Query: 58 KRRLEDDGIFCSCTASPGSSGVCDR------DCHCGMLLSSC-SSGCKCGNSCLNKPFQN 110
K ++++D + C C PG DR DC + L C C+CG C N+ FQ
Sbjct: 93 KPKIQEDAMGCECYYKPGDP--LDRACGPNSDCINRLTLVECVEEECRCGVFCQNQRFQK 150
Query: 111 RPVKKMKLVQTEKCGAGIVADEDIKR-----------------GEFVIEYVGEVIDDQTC 153
R + +V+TEK G G+ A IKR EF+ EY+GE+I+++T
Sbjct: 151 RQYANIHVVKTEKKGYGLRAASPIKRYLIFSSADASTPHMVSSDEFIYEYIGEIINERTL 210
Query: 154 EERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIG 213
E R+ G +FY + + IDAT KG R+ NHSC PN + +W++DG R+G
Sbjct: 211 ERRMDNYGDEGIEHFYFMMLQKGEFIDATKKGGFGRFANHSCNPNCYVARWVVDGGLRMG 270
Query: 214 IFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
IFA RDI K E LT++Y + + H + YC
Sbjct: 271 IFAKRDILKDEELTFNYNADRYGHAAQPCYC 301
>gi|361129227|gb|EHL01139.1| putative Histone-lysine N-methyltransferase, H3 lysine-36 specific
[Glarea lozoyensis 74030]
Length = 839
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 60/148 (40%), Positives = 92/148 (62%), Gaps = 1/148 (0%)
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
CKCG++C N+ FQ + + +++T+K G G+ A+ D+ G+FV EY+GEVI++ T +R
Sbjct: 104 CKCGDTCQNQRFQRQDYADVSVIKTDKKGYGLRANTDLDAGDFVFEYIGEVINEPTFRKR 163
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ G +FY + +D +DAT KGN R+ NHSC PN + KWI+ + R+GIFA
Sbjct: 164 TIEYHKEGIKHFYFMSLGKDEFVDATKKGNLGRFCNHSCNPNCFVDKWIVKDKLRMGIFA 223
Query: 217 TRDIKKGENLTYDYQYE-FLHDSLIAYC 243
R I+KGE L ++Y + + D YC
Sbjct: 224 QRPIQKGEELVFNYNVDRYGADPQPCYC 251
>gi|1008361|emb|CAA89464.1| EZL1 [Saccharomyces cerevisiae]
Length = 733
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 68/186 (36%), Positives = 100/186 (53%), Gaps = 6/186 (3%)
Query: 51 IYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSG--CKCGNSCL 104
IY KRI +D + C C S G + CD D C + L C + CGN C
Sbjct: 51 IYANKRIGTFKNNDFMECDCYEEFSDGVNHACDEDSDCINRLTLIECVNDLCSSCGNDCQ 110
Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
N+ FQ + + + +T+ G G+ A++DI+ +F+ EY GEVI++ +RL
Sbjct: 111 NQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLIDYDQRH 170
Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
+FY + IDAT KG+ +R+ NHSC PN + KW++ + R+GIFA R I KGE
Sbjct: 171 FKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQRKILKGE 230
Query: 225 NLTYDY 230
+T+DY
Sbjct: 231 EITFDY 236
>gi|330443605|ref|NP_012367.2| Set2p [Saccharomyces cerevisiae S288c]
gi|347595709|sp|P46995.2|SET2_YEAST RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=Lysine N-methyltransferase 3;
AltName: Full=SET domain-containing protein 2
gi|329138914|tpg|DAA08635.2| TPA: Set2p [Saccharomyces cerevisiae S288c]
Length = 733
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 68/186 (36%), Positives = 100/186 (53%), Gaps = 6/186 (3%)
Query: 51 IYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSG--CKCGNSCL 104
IY KRI +D + C C S G + CD D C + L C + CGN C
Sbjct: 51 IYANKRIGTFKNNDFMECDCYEEFSDGVNHACDEDSDCINRLTLIECVNDLCSSCGNDCQ 110
Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
N+ FQ + + + +T+ G G+ A++DI+ +F+ EY GEVI++ +RL
Sbjct: 111 NQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLIDYDQRH 170
Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
+FY + IDAT KG+ +R+ NHSC PN + KW++ + R+GIFA R I KGE
Sbjct: 171 FKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQRKILKGE 230
Query: 225 NLTYDY 230
+T+DY
Sbjct: 231 EITFDY 236
>gi|349579035|dbj|GAA24198.1| K7_Set2p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 733
Score = 123 bits (309), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 68/186 (36%), Positives = 100/186 (53%), Gaps = 6/186 (3%)
Query: 51 IYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSG--CKCGNSCL 104
IY KRI +D + C C S G + CD D C + L C + CGN C
Sbjct: 51 IYANKRIGTFKNNDFMECDCYEEFSDGVNHACDEDSDCINRLTLIECVNDLCSSCGNDCQ 110
Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
N+ FQ + + + +T+ G G+ A++DI+ +F+ EY GEVI++ +RL
Sbjct: 111 NQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLIDYDQRH 170
Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
+FY + IDAT KG+ +R+ NHSC PN + KW++ + R+GIFA R I KGE
Sbjct: 171 FKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQRKILKGE 230
Query: 225 NLTYDY 230
+T+DY
Sbjct: 231 EITFDY 236
>gi|151944961|gb|EDN63216.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
gi|190409343|gb|EDV12608.1| SET domain protein 2 [Saccharomyces cerevisiae RM11-1a]
gi|207344135|gb|EDZ71371.1| YJL168Cp-like protein [Saccharomyces cerevisiae AWRI1631]
gi|256271359|gb|EEU06424.1| Set2p [Saccharomyces cerevisiae JAY291]
Length = 733
Score = 123 bits (309), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 68/186 (36%), Positives = 100/186 (53%), Gaps = 6/186 (3%)
Query: 51 IYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSG--CKCGNSCL 104
IY KRI +D + C C S G + CD D C + L C + CGN C
Sbjct: 51 IYANKRIGTFKNNDFMECDCYEEFSDGVNHACDEDSDCINRLTLIECVNDLCSSCGNDCQ 110
Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
N+ FQ + + + +T+ G G+ A++DI+ +F+ EY GEVI++ +RL
Sbjct: 111 NQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLIDYDQRH 170
Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
+FY + IDAT KG+ +R+ NHSC PN + KW++ + R+GIFA R I KGE
Sbjct: 171 FKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQRKILKGE 230
Query: 225 NLTYDY 230
+T+DY
Sbjct: 231 EITFDY 236
>gi|323332954|gb|EGA74356.1| Set2p [Saccharomyces cerevisiae AWRI796]
Length = 537
Score = 123 bits (308), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 68/186 (36%), Positives = 100/186 (53%), Gaps = 6/186 (3%)
Query: 51 IYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSG--CKCGNSCL 104
IY KRI +D + C C S G + CD D C + L C + CGN C
Sbjct: 51 IYANKRIGTFKNNDFMECDCYEEFSDGVNHACDEDSDCINRLTLIECVNDLCSSCGNDCQ 110
Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
N+ FQ + + + +T+ G G+ A++DI+ +F+ EY GEVI++ +RL
Sbjct: 111 NQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLIDYDQRH 170
Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
+FY + IDAT KG+ +R+ NHSC PN + KW++ + R+GIFA R I KGE
Sbjct: 171 FKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQRKILKGE 230
Query: 225 NLTYDY 230
+T+DY
Sbjct: 231 EITFDY 236
>gi|365764896|gb|EHN06414.1| Set2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 733
Score = 123 bits (308), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 68/186 (36%), Positives = 100/186 (53%), Gaps = 6/186 (3%)
Query: 51 IYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSG--CKCGNSCL 104
IY KRI +D + C C S G + CD D C + L C + CGN C
Sbjct: 51 IYANKRIGTFKNNDFMECDCYEEFSDGVNHACDEDSDCINRLTLIECVNDLCSSCGNDCQ 110
Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
N+ FQ + + + +T+ G G+ A++DI+ +F+ EY GEVI++ +RL
Sbjct: 111 NQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLIDYDQRH 170
Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
+FY + IDAT KG+ +R+ NHSC PN + KW++ + R+GIFA R I KGE
Sbjct: 171 FKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQRKILKGE 230
Query: 225 NLTYDY 230
+T+DY
Sbjct: 231 EITFDY 236
>gi|198451130|ref|XP_001358254.2| GA18567 [Drosophila pseudoobscura pseudoobscura]
gi|198131348|gb|EAL27392.2| GA18567 [Drosophila pseudoobscura pseudoobscura]
Length = 1541
Score = 123 bits (308), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 75/214 (35%), Positives = 114/214 (53%), Gaps = 11/214 (5%)
Query: 37 IKPKAIPYVFIKRN--IYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSS 92
IK PYV I+ N + K I E +C P C + +C ML +
Sbjct: 1266 IKVNPPPYVKIRINKAVPPVKFITNSEE----HSTCDCRPEDEHPCGANSNCLNRMLFNE 1321
Query: 93 CSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQ 151
C C+CG+ C N+ F+ R +M +V G G+V E I G+F+IEYVGEVI+ +
Sbjct: 1322 CHPEYCRCGDRCENRMFETRKSPRMDVVYMNARGFGLVCREPIAEGDFIIEYVGEVINQE 1381
Query: 152 TCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETR 211
+ R+ + + + NFY + ++ +IDA KGN +R++NHSC PN QKW ++ R
Sbjct: 1382 EFQRRMLRKQKDRDENFYFLGVEKEFIIDAGPKGNLARFMNHSCEPNCTSQKWTVNCTNR 1441
Query: 212 IGIFATRDIKKGENLTYDYQY-EFLHDSLIA-YC 243
+G+FA +DI LT++Y + + L+D A YC
Sbjct: 1442 VGLFAIQDIPAETELTFNYLWDDLLNDKKKACYC 1475
>gi|157818737|ref|NP_001101159.1| probable histone-lysine N-methyltransferase ASH1L [Rattus norvegicus]
gi|149048100|gb|EDM00676.1| ash1 (absent, small, or homeotic)-like (Drosophila) (predicted)
[Rattus norvegicus]
Length = 2918
Score = 123 bits (308), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 62/155 (40%), Positives = 93/155 (60%), Gaps = 3/155 (1%)
Query: 80 CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRG 137
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +K G
Sbjct: 2059 CGDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAG 2118
Query: 138 EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHSC P
Sbjct: 2119 QFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDP 2177
Query: 198 NTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
N EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2178 NCEMQKWSVNGVYRIGLYALKDVPAGTELTYDYNF 2212
>gi|365982940|ref|XP_003668303.1| hypothetical protein NDAI_0B00260 [Naumovozyma dairenensis CBS 421]
gi|343767070|emb|CCD23060.1| hypothetical protein NDAI_0B00260 [Naumovozyma dairenensis CBS 421]
Length = 795
Score = 123 bits (308), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 73/200 (36%), Positives = 107/200 (53%), Gaps = 18/200 (9%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGS----SGVCDRDCHC--GMLLSSCSSGC 97
++ I+ IY +K++ +D I C C + CD D C + L C +
Sbjct: 27 FIPIESCIYSSKKLGNSKNNDFIECDCYEDKDPQIHINHACDEDSDCINRLTLIECVNNL 86
Query: 98 --KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEE 155
CGN C N+ FQ + + + +T+ G G+ A+E+I+ G+F+ EY+GEVID+ E
Sbjct: 87 CYSCGNDCQNQRFQKKQYSNVSIFKTKLKGYGVRANENIENGQFIYEYIGEVIDEIQFRE 146
Query: 156 R-----LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGET 210
R L K KH FY + IDAT KG+ +R+ NHSC PN + KW + G+
Sbjct: 147 RMIDYDLKKFKH-----FYFMMLQNGQFIDATIKGSLARFCNHSCSPNAYVNKWEVAGKL 201
Query: 211 RIGIFATRDIKKGENLTYDY 230
R+GIFA R I KGE +T+DY
Sbjct: 202 RMGIFAKRKILKGEEITFDY 221
>gi|195143973|ref|XP_002012971.1| GL23881 [Drosophila persimilis]
gi|194101914|gb|EDW23957.1| GL23881 [Drosophila persimilis]
Length = 1466
Score = 123 bits (308), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 68/200 (34%), Positives = 106/200 (53%), Gaps = 1/200 (0%)
Query: 37 IKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG 96
IK PYV IK N + ++ C C + +C ML + C
Sbjct: 1191 IKVNPPPYVKIKINKAVPPVKFVTNSEEHSTCDCRPEDEHPCGANSNCLNRMLFNECHPE 1250
Query: 97 -CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEE 155
C+CG+ C N+ F+ R +M +V G G+V E I G+F+IEYVGEVI+ + +
Sbjct: 1251 YCRCGDRCENRMFETRKSPRMDVVYMNARGFGLVCREPIAEGDFIIEYVGEVINQEEFQR 1310
Query: 156 RLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIF 215
R+ + + + NFY + ++ +IDA KGN +R++NHSC PN QKW ++ R+G+F
Sbjct: 1311 RMLRKQKDRDENFYFLGVEKEFIIDAGPKGNLARFMNHSCEPNCTSQKWTVNCTHRVGLF 1370
Query: 216 ATRDIKKGENLTYDYQYEFL 235
A +DI LT++Y ++ L
Sbjct: 1371 AIQDIPAETELTFNYLWDDL 1390
>gi|156847005|ref|XP_001646388.1| hypothetical protein Kpol_2001p35 [Vanderwaltozyma polyspora DSM
70294]
gi|156117064|gb|EDO18530.1| hypothetical protein Kpol_2001p35 [Vanderwaltozyma polyspora DSM
70294]
Length = 693
Score = 123 bits (308), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 68/185 (36%), Positives = 102/185 (55%), Gaps = 6/185 (3%)
Query: 52 YLTKRIKRRLEDDGIFCSCTAS--PGSSGVCDR--DCHCGMLLSSCSSGC--KCGNSCLN 105
Y +K I D + C C G + CD DC + L C +G CG++C N
Sbjct: 46 YASKYIGNAKSSDFMECDCYEEFEEGINHACDENSDCINRLTLIECVNGLCDTCGDNCQN 105
Query: 106 KPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGE 165
+ FQ + + + +T+ G G+ A++DI+ +F+ EY GEVI++ ERL G
Sbjct: 106 QRFQKKEYADISVFKTKMKGYGVRANKDIEANDFIYEYKGEVIEEDEFRERLIDYDERGF 165
Query: 166 TNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGEN 225
+FY + IDAT KG+ +R+ NHSC PN + KW++ G+ ++GIFA RDI KGE
Sbjct: 166 KHFYFMMLQNAEFIDATIKGSLARFCNHSCNPNAYVNKWVVAGKLKMGIFAKRDIYKGEE 225
Query: 226 LTYDY 230
+T+DY
Sbjct: 226 ITFDY 230
>gi|344230287|gb|EGV62172.1| SET domain-containing protein [Candida tenuis ATCC 10573]
Length = 721
Score = 123 bits (308), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 67/186 (36%), Positives = 99/186 (53%), Gaps = 7/186 (3%)
Query: 52 YLTKRIKRRLEDDGIFCSC----TASPGSSGVCDRDCHCGMLLSS--CSSG-CKCGNSCL 104
Y K I +++ + C C + + C D C +S C +G C CG+ C
Sbjct: 40 YQNKSIGDSGQNENMTCDCHEKVEETTNVNHACGDDSGCINRATSVECMAGACNCGDRCQ 99
Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
N+ FQ + + QTEK G G+ AD + G F+ EY+GEVID+ T R+
Sbjct: 100 NQRFQRCEYAAISVFQTEKKGYGVRADTALDEGSFIYEYIGEVIDEATFRRRMVDYDSRQ 159
Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
+FY + +D IDAT KG +R++NHSC PN + KW++ + R+GIFA R I +GE
Sbjct: 160 LRHFYFMMLKKDAFIDATEKGALARFVNHSCSPNAYVDKWVVGDKLRMGIFAKRHIARGE 219
Query: 225 NLTYDY 230
+T+DY
Sbjct: 220 EITFDY 225
>gi|195395005|ref|XP_002056127.1| GJ10771 [Drosophila virilis]
gi|194142836|gb|EDW59239.1| GJ10771 [Drosophila virilis]
Length = 1430
Score = 122 bits (307), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 70/196 (35%), Positives = 105/196 (53%), Gaps = 5/196 (2%)
Query: 43 PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG-CKC 99
PY+ IK N + E+D C C PGS C + C ML C C
Sbjct: 1154 PYIKIKANKPVPPVRFVYNEEDLNICEC--KPGSEHPCGPESGCLNRMLYHECHPDYCHA 1211
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
G C N+ F+ R ++++V + G G+V E I G+F+IEYVGEVI+ + R+ +
Sbjct: 1212 GRQCENQLFELRKSPRLEVVYMNERGFGLVCREPIAEGDFIIEYVGEVINHAEFQRRMAQ 1271
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
+ NFY + +D +IDA KGN +R++NHSC PN QKW ++ R+G+FA +D
Sbjct: 1272 KTRDRDENFYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCATQKWTVNCINRVGLFAIKD 1331
Query: 220 IKKGENLTYDYQYEFL 235
I + LT++Y ++ L
Sbjct: 1332 IPENTELTFNYLWDDL 1347
>gi|358056897|dbj|GAA97247.1| hypothetical protein E5Q_03924 [Mixia osmundae IAM 14324]
Length = 949
Score = 122 bits (307), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 65/181 (35%), Positives = 98/181 (54%), Gaps = 5/181 (2%)
Query: 68 CSCTASPG---SSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEK 123
C C G S C +C + C G C+CG C N+ FQ + +++V T
Sbjct: 219 CECDYEHGLDAPSSACGTNCINRLTQVECVPGSCRCGKYCTNRRFQRKQYANIQIVDTGM 278
Query: 124 CGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATY 183
G G+ A ED+ + FV EY+GEV+ ++R+ G +FY + R+ IDAT
Sbjct: 279 KGFGVRAAEDMLKDAFVYEYIGEVVGAGQLQKRMKDYYEEGIEHFYFMALQREEFIDATK 338
Query: 184 KGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAY 242
KGNK R++NHSC PN + KW++ + R+GIF R I+ GE LT++Y + + H++ Y
Sbjct: 339 KGNKGRFLNHSCSPNCYVSKWVVGEKMRMGIFTKRKIQAGEELTFNYNVDRYGHEAQPCY 398
Query: 243 C 243
C
Sbjct: 399 C 399
>gi|148683294|gb|EDL15241.1| ash1 (absent, small, or homeotic)-like (Drosophila) [Mus musculus]
Length = 2918
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 62/155 (40%), Positives = 93/155 (60%), Gaps = 3/155 (1%)
Query: 80 CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRG 137
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +K G
Sbjct: 2059 CGDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAG 2118
Query: 138 EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHSC P
Sbjct: 2119 QFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDP 2177
Query: 198 NTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
N EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2178 NCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2212
>gi|147846734|emb|CAN80636.1| hypothetical protein VITISV_017995 [Vitis vinifera]
Length = 278
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 75/204 (36%), Positives = 102/204 (50%), Gaps = 4/204 (1%)
Query: 41 AIPYVFIK--RNIYLTKRIKRRLEDDGIFCSCTASPGS-SGVCDRDCHCGMLLSSCSSG- 96
+ Y F+ N+ +R ++ E+D C C C C + C+
Sbjct: 37 VVDYSFVSFLDNLEGEERHIKQKENDIAICECKYKANDPDSACGERCWNVLTSIECTPRY 96
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C C C N+ FQ R K KL + E G G++A E+IK GEFV+EY GEVI R
Sbjct: 97 CPCSIHCKNQRFQKREYAKTKLFRAEGRGWGLLATENIKAGEFVMEYCGEVISRTEARGR 156
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
G + Y+ +N IDAT KGN +R+INHSC PN E KW + GE R+GIFA
Sbjct: 157 SQVYVSQGLKDVYIIPLNARECIDATKKGNLARFINHSCQPNCETMKWSVLGEDRVGIFA 216
Query: 217 TRDIKKGENLTYDYQYEFLHDSLI 240
R+I G LTY Y +E+ + +
Sbjct: 217 LRNISVGTELTYSYNFEWYSXAKV 240
>gi|154292496|ref|XP_001546822.1| hypothetical protein BC1G_14887 [Botryotinia fuckeliana B05.10]
gi|347836524|emb|CCD51096.1| hypothetical protein [Botryotinia fuckeliana]
Length = 953
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 76/225 (33%), Positives = 118/225 (52%), Gaps = 10/225 (4%)
Query: 27 PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDC 84
P+ +LPD + ++ + IK IY K + + D + C C S G + C D
Sbjct: 114 PLFNDLPDSTEEAASV-FQVIKDCIYGAKHMGAS-DHDALDCDCPEEFSDGKNYACGEDS 171
Query: 85 HCGMLLS--SCSSG---CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
C L+ C G C CG C N+ FQ + K+ +++T+K G G+ A+ D++ +F
Sbjct: 172 DCINRLTKMECGGGHKDCNCGLDCQNQRFQRKQYAKVSVIKTDKKGYGLRANTDLQPDDF 231
Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
+ EY+GEVI++ T R + G +FY + + +DAT KGN R+ NHSC PN
Sbjct: 232 IFEYIGEVINEPTFRRRTVQYDQEGIKHFYFMSLTKHEFVDATKKGNLGRFCNHSCNPNC 291
Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R IK GE L ++Y + + D YC
Sbjct: 292 YVDKWVVGEKLRMGIFAERAIKAGEELVFNYNVDRYGADPQPCYC 336
>gi|218190961|gb|EEC73388.1| hypothetical protein OsI_07633 [Oryza sativa Indica Group]
Length = 1906
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 61/125 (48%), Positives = 76/125 (60%)
Query: 108 FQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN 167
FQ R K+ T K G G+ ED+ G F+IEYVGEV+D E R G+ +
Sbjct: 1302 FQRRTYAKLGKFHTGKKGYGLQLKEDVSEGRFLIEYVGEVLDITAYESRQRYYASKGQKH 1361
Query: 168 FYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLT 227
FY +N VIDA KGN R+INHSC PN +KW+++GE IGIFA R+IKKGE LT
Sbjct: 1362 FYFMALNGGEVIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFAMRNIKKGEELT 1421
Query: 228 YDYQY 232
+DY Y
Sbjct: 1422 FDYNY 1426
>gi|222623047|gb|EEE57179.1| hypothetical protein OsJ_07116 [Oryza sativa Japonica Group]
Length = 1963
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 61/125 (48%), Positives = 76/125 (60%)
Query: 108 FQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN 167
FQ R K+ T K G G+ ED+ G F+IEYVGEV+D E R G+ +
Sbjct: 1287 FQRRTYAKLGKFHTGKKGYGLQLKEDVSEGRFLIEYVGEVLDITAYESRQRYYASKGQKH 1346
Query: 168 FYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLT 227
FY +N VIDA KGN R+INHSC PN +KW+++GE IGIFA R+IKKGE LT
Sbjct: 1347 FYFMALNGGEVIDACTKGNLGRFINHSCSPNCRTEKWMVNGEVCIGIFAMRNIKKGEELT 1406
Query: 228 YDYQY 232
+DY Y
Sbjct: 1407 FDYNY 1411
>gi|323337021|gb|EGA78277.1| Set2p [Saccharomyces cerevisiae Vin13]
Length = 733
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 67/186 (36%), Positives = 100/186 (53%), Gaps = 6/186 (3%)
Query: 51 IYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSG--CKCGNSCL 104
IY KRI +D + C C S G + CD D C + L C + CGN C
Sbjct: 51 IYANKRIGTFKNNDFMECDCYEEFSDGVNHACDEDSDCINRLTLIECVNDLCSSCGNDCQ 110
Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
N+ FQ + + + +T+ G G+ A++DI+ +F+ EY GEVI++ +RL
Sbjct: 111 NQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLIDYDQRH 170
Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
+FY + IDAT KG+ +R+ NHSC PN + KW++ + R+GIFA R + KGE
Sbjct: 171 FKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQRKLLKGE 230
Query: 225 NLTYDY 230
+T+DY
Sbjct: 231 EITFDY 236
>gi|342872314|gb|EGU74695.1| hypothetical protein FOXB_14795 [Fusarium oxysporum Fo5176]
Length = 919
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 74/240 (30%), Positives = 121/240 (50%), Gaps = 9/240 (3%)
Query: 11 SRIGHAFNKLLKQIGNPVEFE-LPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCS 69
S++ ++ L PV F+ LPD + + I +Y +K + ++D + C
Sbjct: 59 SKLSRKASQKLAASREPVLFDHLPDMTAESCNF-FQLIPDCLYGSKHLGS-TDNDALDCE 116
Query: 70 CTAS--PGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKC 124
C G + C D C + + C C N+ FQ + + +++TEK
Sbjct: 117 CREEWHDGENIACGEDSDCINRATKMECSAEAGNCAGGCQNQRFQRKQYANVSVIKTEKK 176
Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
G G+ AD D++ +FV EY+GEVI++ T R+ + G +FY +N+ +DAT K
Sbjct: 177 GFGLRADSDLQANDFVFEYIGEVINEPTFRRRMMQYDEEGIKHFYFMSLNKSEFVDATKK 236
Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
GN R+ NHSC PN + KW++ + R+GIFA+R I+ GE L ++Y + + D YC
Sbjct: 237 GNYGRFCNHSCNPNCYVDKWVVGDKLRMGIFASRKIQSGEELVFNYNVDRYGADPQPCYC 296
>gi|392298598|gb|EIW09695.1| Set2p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 733
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 67/186 (36%), Positives = 100/186 (53%), Gaps = 6/186 (3%)
Query: 51 IYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSG--CKCGNSCL 104
IY KRI +D + C C S G + CD D C + L C + CGN C
Sbjct: 51 IYANKRIGTFKNNDFMECDCYEEFSDGVNHACDEDSDCINRLTLIECVNDLCSSCGNDCQ 110
Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
N+ FQ + + + +T+ G G+ A++DI+ +F+ EY GEVI++ +RL
Sbjct: 111 NQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLIDYDQRH 170
Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
+FY + IDAT KG+ +R+ NHSC PN + KW++ + R+GIFA R + KGE
Sbjct: 171 FKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQRKLLKGE 230
Query: 225 NLTYDY 230
+T+DY
Sbjct: 231 EITFDY 236
>gi|259147310|emb|CAY80562.1| Set2p [Saccharomyces cerevisiae EC1118]
Length = 733
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 67/186 (36%), Positives = 100/186 (53%), Gaps = 6/186 (3%)
Query: 51 IYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSG--CKCGNSCL 104
IY KRI +D + C C S G + CD D C + L C + CGN C
Sbjct: 51 IYANKRIGTFKNNDFMECDCYEEFSDGVNHACDEDSDCINRLTLIECVNDLCSSCGNDCQ 110
Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
N+ FQ + + + +T+ G G+ A++DI+ +F+ EY GEVI++ +RL
Sbjct: 111 NQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLIDYDQRH 170
Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
+FY + IDAT KG+ +R+ NHSC PN + KW++ + R+GIFA R + KGE
Sbjct: 171 FKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQRKLLKGE 230
Query: 225 NLTYDY 230
+T+DY
Sbjct: 231 EITFDY 236
>gi|149239909|ref|XP_001525830.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146449953|gb|EDK44209.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 822
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 62/172 (36%), Positives = 96/172 (55%), Gaps = 7/172 (4%)
Query: 66 IFCSCTASPGSSG----VCDRDCHCGMLLSSC---SSGCKCGNSCLNKPFQNRPVKKMKL 118
+ C C SS C D +C ++S + C CG +C N+ FQ + + +
Sbjct: 55 MTCDCEEDWDSSTEQNMACGEDSNCINRITSVECINRHCSCGENCQNQRFQKKQYADVSV 114
Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
QTE G G+ A+ ++ G+F+ EY+GEVID+ T +++ + +FY + D
Sbjct: 115 FQTELKGYGLRANTQLREGDFIYEYIGEVIDEPTFRQKMIEYDLKQYKHFYFMMLKNDAF 174
Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
IDAT KG+ +R++NHSC PN + KW++ R+GIFA RDI GE +T+DY
Sbjct: 175 IDATEKGSLARFVNHSCSPNAFVDKWVVADRLRMGIFAKRDIMAGEEITFDY 226
>gi|323348010|gb|EGA82268.1| Set2p [Saccharomyces cerevisiae Lalvin QA23]
Length = 650
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 67/186 (36%), Positives = 100/186 (53%), Gaps = 6/186 (3%)
Query: 51 IYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSG--CKCGNSCL 104
IY KRI +D + C C S G + CD D C + L C + CGN C
Sbjct: 51 IYANKRIGTFKNNDFMECDCYEEFSDGVNHACDEDSDCINRLTLIECVNDLCSSCGNDCQ 110
Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
N+ FQ + + + +T+ G G+ A++DI+ +F+ EY GEVI++ +RL
Sbjct: 111 NQRFQKKQYAPIAIFKTKHKGYGVRAEQDIEANQFIYEYKGEVIEEMEFRDRLIDYDQRH 170
Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
+FY + IDAT KG+ +R+ NHSC PN + KW++ + R+GIFA R + KGE
Sbjct: 171 FKHFYFMMLQNGEFIDATIKGSLARFCNHSCSPNAYVNKWVVKDKLRMGIFAQRKLLKGE 230
Query: 225 NLTYDY 230
+T+DY
Sbjct: 231 EITFDY 236
>gi|322798377|gb|EFZ20101.1| hypothetical protein SINV_07690 [Solenopsis invicta]
Length = 2203
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 78/237 (32%), Positives = 115/237 (48%), Gaps = 19/237 (8%)
Query: 19 KLLKQIGNPVEFELPD-WFIKPKAIP---------YVFIKRNIYLTKRIKRRLEDDGIFC 68
KLL+Q P + + W +P Y I+ NIY +K + C
Sbjct: 1310 KLLRQRKTPFQLQYDLLWLHAQSRLPGRDLVPSWNYKKIRTNIYYD--VKPTTLYEAQAC 1367
Query: 69 SCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGA 126
C G C DC M+ S CS C C + C N+ Q ++ TE G
Sbjct: 1368 ECKPESG----CGDDCINRMVFSECSPQLCPCSDKCENQKIQKHEWSPGLQRFMTEDKGW 1423
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ + IK G+F++EYVGEV+ ++ + R+ ++ +T+ Y ++ +VID G
Sbjct: 1424 GVRTQQAIKSGDFILEYVGEVVSEREFKSRMA-TRYANDTHHYCLHLDGGLVIDGHRMGG 1482
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
R++NHSC PN EMQKW + G R+ +FA+RDIK GE LTYDY + + S C
Sbjct: 1483 DGRFVNHSCEPNCEMQKWSVLGLPRMALFASRDIKPGEELTYDYNFALFNPSEGQQC 1539
>gi|396478086|ref|XP_003840449.1| hypothetical protein LEMA_P101010.1 [Leptosphaeria maculans JN3]
gi|312217021|emb|CBX96970.1| hypothetical protein LEMA_P101010.1 [Leptosphaeria maculans JN3]
Length = 962
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 73/239 (30%), Positives = 117/239 (48%), Gaps = 10/239 (4%)
Query: 12 RIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCT 71
++ A ++ + P+ +LPD KA + + Y K I D + C C
Sbjct: 129 KLSRAASQKIHARPPPLYLDLPDSTEAAKAT-FTVLPECTYANKNIG--TTDPALECDCQ 185
Query: 72 ASPGS----SGVC--DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCG 125
+ C D DC C C CG C N+ F + + +++T+K G
Sbjct: 186 TEWDPVLHVNNACGEDSDCINRATKMECVGDCTCGRKCQNQRFLRKQYADVTVIKTDKKG 245
Query: 126 AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKG 185
G+ A++D+ G+FV EY+GEVID++T R+ + H G +FY + + +DAT KG
Sbjct: 246 FGLRANKDMAPGDFVFEYIGEVIDEKTFRRRMLQYDHEGIKHFYFMSLTKGEFVDATKKG 305
Query: 186 NKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
N R+ NHSC PN + KW++ + R+GIF R ++ GE L ++Y + + D YC
Sbjct: 306 NLGRFCNHSCNPNCFVDKWVVGDKLRMGIFVERRVQAGEELVFNYNVDRYGADPQPCYC 364
>gi|150865134|ref|XP_001384227.2| hypothetical protein PICST_31672 [Scheffersomyces stipitis CBS
6054]
gi|149386390|gb|ABN66198.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 732
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 69/186 (37%), Positives = 100/186 (53%), Gaps = 7/186 (3%)
Query: 52 YLTKRIKRRLEDDGIFCSCTASPGSSG----VCDRDCHCGMLLSS--CSSG-CKCGNSCL 104
Y +K I + + + C C S C D +C ++S C +G C CG +C
Sbjct: 51 YSSKLIGSSGQQEYMTCDCVEDWDSESQRNLACSDDSNCINRVTSVECINGHCGCGKNCQ 110
Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
N+ FQ R + + QTE G G+ AD+ I G F+ EY+GEVID+ + R+
Sbjct: 111 NQRFQKRQYASVSVFQTELKGYGLRADDVIPEGGFIYEYIGEVIDEASFRARMIDYDSKN 170
Query: 165 ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGE 224
+FY + D IDAT KG+ +R+ NHSC PN + KW++ + R+GIFA R I KGE
Sbjct: 171 FKHFYFMMLKNDSFIDATIKGSLARFCNHSCSPNAFVDKWVVGDKLRMGIFAKRSISKGE 230
Query: 225 NLTYDY 230
+T+DY
Sbjct: 231 EITFDY 236
>gi|170573421|ref|XP_001892464.1| SET domain containing protein [Brugia malayi]
gi|158601976|gb|EDP38706.1| SET domain containing protein [Brugia malayi]
Length = 1603
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 88/286 (30%), Positives = 136/286 (47%), Gaps = 55/286 (19%)
Query: 12 RIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRN-IYLTKRIKR-----RLEDDG 65
+I F L +G+ ++LP + + K + I+RN + +KR K R+ D
Sbjct: 754 QISRQFANLELNLGDRKHWDLPWYRLNWKEVTRRLIERNRLEKSKREKNSSKTYRIGDKK 813
Query: 66 IFCS-------CTASPGSSGVCDRDCHCG-------------------MLLSSCSSGC-- 97
+ S +A+P SS VC+R + SCSSGC
Sbjct: 814 LPKSRKSDSFATSATPSSSKVCERSRRSSGDYIKLKQNVIVDAYPKIEQMQCSCSSGCCG 873
Query: 98 ------------KCGNSCLN-------KPFQNRPVKKMKLVQT-EKCGAGIVADEDIKRG 137
+CGNSC + F+ V++++ QT CG G+ D +I +G
Sbjct: 874 ESDECLNRVVLMECGNSCPRNALCTNKRLFRRECVERLRTFQTMNGCGIGVKTDVNIDKG 933
Query: 138 EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
+F+ EY+GEV+ +T R + + + N Y + V+DA +KGN +R+INHSC P
Sbjct: 934 QFICEYIGEVVSMETFNIR-SRTDYRYQRNHYALNLCPGFVVDAYHKGNIARFINHSCAP 992
Query: 198 NTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
N EMQ+W ++G RIG+FA R I +GE LTYDY ++ + C
Sbjct: 993 NCEMQRWSVNGHYRIGLFALRGIHEGEELTYDYNWDAFEFDDVTIC 1038
>gi|355558542|gb|EHH15322.1| hypothetical protein EGK_01394 [Macaca mulatta]
Length = 2796
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 62/155 (40%), Positives = 93/155 (60%), Gaps = 3/155 (1%)
Query: 80 CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRG 137
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +K G
Sbjct: 2029 CVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAG 2088
Query: 138 EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHSC P
Sbjct: 2089 QFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDP 2147
Query: 198 NTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
N EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2148 NCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2182
>gi|403167549|ref|XP_003327326.2| histone-lysine N-methyltransferase SETD2 [Puccinia graminis f. sp.
tritici CRL 75-36-700-3]
gi|375167080|gb|EFP82907.2| histone-lysine N-methyltransferase SETD2 [Puccinia graminis f. sp.
tritici CRL 75-36-700-3]
Length = 974
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 71/208 (34%), Positives = 109/208 (52%), Gaps = 9/208 (4%)
Query: 44 YVFIKRNIYLTKRI--KRRLEDDGIFCSCTASPGS--SGVCDRDCHCGMLLSS---CSSG 96
+ I +N Y K + ++LED + C CT +PG S C C L+
Sbjct: 122 FTQIDQNWYQNKSLGTTKQLED-AMVCDCTWTPGDDPSWACSEHSGCINYLTQIECLQDQ 180
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C+C C N+ FQ R +++V T K G G+ D+ + FV EY+GEVI + +R
Sbjct: 181 CRCREKCQNQRFQKRLYAPIEIVLTPKKGFGMRLQADVPKDTFVYEYLGEVIGVKALHKR 240
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
L G +FY E+ +D IDAT KG R++NHSC PN + KW++ + R+GIF
Sbjct: 241 LKDYGQEGIKHFYFMELQKDQYIDATKKGGFGRFLNHSCNPNCYIGKWVVGRQLRMGIFT 300
Query: 217 TRDIKKGENLTYDYQYE-FLHDSLIAYC 243
R ++ GE LT++Y + + H++ +C
Sbjct: 301 KRAVRGGEELTFNYNVDRYGHEAQECFC 328
>gi|302677464|ref|XP_003028415.1| hypothetical protein SCHCODRAFT_70386 [Schizophyllum commune H4-8]
gi|300102103|gb|EFI93512.1| hypothetical protein SCHCODRAFT_70386 [Schizophyllum commune H4-8]
Length = 883
Score = 122 bits (305), Expect = 2e-25, Method: Composition-based stats.
Identities = 71/211 (33%), Positives = 106/211 (50%), Gaps = 14/211 (6%)
Query: 47 IKRNIYLTKRI-KRRLEDDGIFCSCTASPGSSGVCDRDC--HCGMLLSSC---------- 93
I N Y ++ K R +G C C G RD H S+C
Sbjct: 66 IPENWYQYAKLGKSREAMEGSTCDCEYEEGEHAALGRDNLDHACGPRSNCINRLTQVECL 125
Query: 94 SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTC 153
C C C N+ FQ + +++V+TEK G G+ A++DI+R F+ EYVG+V++ +
Sbjct: 126 EEECHCRQFCQNQRFQRKEYAPIEIVKTEKKGFGLRAEQDIQRDTFIYEYVGDVVNSPSF 185
Query: 154 EERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIG 213
++R+ G +FY + +D IDAT G R+ NHSC PN + KW + R+G
Sbjct: 186 KKRMRDYAKEGIQHFYFMMLQKDEFIDATKSGGIGRFANHSCSPNCYVAKWTVGDRVRMG 245
Query: 214 IFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
IFA R+IKK E LT++Y + + H + YC
Sbjct: 246 IFAKRNIKKYEELTFNYNVDRYGHQAQTCYC 276
>gi|355745722|gb|EHH50347.1| hypothetical protein EGM_01160 [Macaca fascicularis]
Length = 2904
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 62/155 (40%), Positives = 93/155 (60%), Gaps = 3/155 (1%)
Query: 80 CDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRG 137
C DC M+ + CS C CG C N+ Q V+ ++ + E+ G GI E +K G
Sbjct: 2105 CVDDCLNRMIFAECSPNTCPCGEQCCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAG 2164
Query: 138 EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
+F+IEY+GEV+ +Q R+ + H ++ Y ++ MVID+ GN++R+INHSC P
Sbjct: 2165 QFIIEYLGEVVSEQEFRNRMIEQYH-NHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDP 2223
Query: 198 NTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
N EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 2224 NCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 2258
>gi|443709486|gb|ELU04158.1| hypothetical protein CAPTEDRAFT_93578, partial [Capitella teleta]
Length = 1129
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 76/233 (32%), Positives = 120/233 (51%), Gaps = 11/233 (4%)
Query: 6 KNSDNSRIGHAFNKLLKQIG------NPVEFELPDWFIKPKAIPYVFIKRNIYL-TKRIK 58
K S+++ +G +FN+ +K+ N ++ +L + K Y IK N + + +I
Sbjct: 773 KASESNSLGKSFNQAVKEAALVNKMWNTIKEQLKGDIDQKKPSAYKHIKSNFPIGSVQIY 832
Query: 59 RRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMK 117
+ + C CTA + +C + L C CK G C N+ F R
Sbjct: 833 KSPSN---VCECTAQMENPCGEQGNCLNRLTLVECHPQMCKAGEKCQNQRFMKREYPDQS 889
Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
+T G G+ + DIK+G+FV EYVG++ID + C+ R+ NFY+ ++
Sbjct: 890 AYRTGARGWGLKTNVDIKKGQFVNEYVGDLIDSEECQRRIKAAHDNNIVNFYMLTLDNRR 949
Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
+IDA KGN SR++NHSC PN QKW ++G+ R+G+FA DI G LT++Y
Sbjct: 950 IIDAGPKGNLSRFMNHSCDPNLVTQKWTVNGDVRVGLFAVCDIPAGTELTFNY 1002
>gi|302505469|ref|XP_003014441.1| hypothetical protein ARB_07003 [Arthroderma benhamiae CBS 112371]
gi|291178262|gb|EFE34052.1| hypothetical protein ARB_07003 [Arthroderma benhamiae CBS 112371]
Length = 973
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 94/164 (57%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG++C N+ FQ R + +++TEK G G+ A+ D++ +F+
Sbjct: 163 DSDCINRATKMECVDNCGCGDNCQNQRFQKRQYADVTVIKTEKKGFGLRANTDLRPNDFI 222
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GEVI++ +R+ + G +FY +N+ +DAT KGN R+ NHSC PN
Sbjct: 223 YEYIGEVINEPQFRKRMMQYDDEGIKHFYFMSLNKGEFVDATRKGNLGRFCNHSCNPNCY 282
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA + IK GE L ++Y + + D YC
Sbjct: 283 IDKWVVGEKLRMGIFAEKHIKAGEELVFNYNVDRYGADPQPCYC 326
>gi|46122361|ref|XP_385734.1| hypothetical protein FG05558.1 [Gibberella zeae PH-1]
Length = 1051
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 63/188 (33%), Positives = 100/188 (53%), Gaps = 6/188 (3%)
Query: 62 EDDGIFCSCTAS--PGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
++D + C C G + C D C + + G C C N+ FQ + +
Sbjct: 249 DNDALDCECRDEWHDGKNLACGEDSDCINRATKMECSAEGGNCAGGCQNQRFQRKQYANV 308
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+++TEK G G+ AD D++ +FV EY+GEVI++ T R+ + G +FY +N+
Sbjct: 309 SVIKTEKKGFGLRADSDLQPNDFVFEYIGEVINEPTFRRRMIQYDEEGIKHFYFMSLNKS 368
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
+DAT KGN R+ NHSC PN + KW++ + R+GIF +R I+ GE L ++Y + +
Sbjct: 369 EFVDATKKGNYGRFCNHSCNPNCYVDKWVVGDKLRMGIFTSRKIQSGEELVFNYNVDRYG 428
Query: 236 HDSLIAYC 243
D YC
Sbjct: 429 ADPQPCYC 436
>gi|302664281|ref|XP_003023773.1| hypothetical protein TRV_02085 [Trichophyton verrucosum HKI 0517]
gi|291187786|gb|EFE43155.1| hypothetical protein TRV_02085 [Trichophyton verrucosum HKI 0517]
Length = 973
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 94/164 (57%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG++C N+ FQ R + +++TEK G G+ A+ D++ +F+
Sbjct: 163 DSDCINRATKMECVDNCGCGDNCQNQRFQKRQYADVTVIKTEKKGFGLRANTDLRPNDFI 222
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GEVI++ +R+ + G +FY +N+ +DAT KGN R+ NHSC PN
Sbjct: 223 YEYIGEVINEPQFRKRMMQYDDEGIKHFYFMSLNKGEFVDATRKGNLGRFCNHSCNPNCY 282
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA + IK GE L ++Y + + D YC
Sbjct: 283 IDKWVVGEKLRMGIFAEKHIKAGEELVFNYNVDRYGADPQPCYC 326
>gi|290992999|ref|XP_002679121.1| set domain-containing protein [Naegleria gruberi]
gi|284092736|gb|EFC46377.1| set domain-containing protein [Naegleria gruberi]
Length = 153
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 55/126 (43%), Positives = 88/126 (69%), Gaps = 2/126 (1%)
Query: 108 FQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN 167
FQ + KK+++ + + G G+ A E I +G F+IEYVGEV+D++ ++R + + GE +
Sbjct: 1 FQKKSWKKIEVKRAGEKGFGVFAKEKISKGTFIIEYVGEVLDEKIYKQR--QEAYDGERH 58
Query: 168 FYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLT 227
+Y + + +IDA+ KGN +R+INHSC PN+ +QKW + ++RIG+FA RDI+KGE +T
Sbjct: 59 YYFLSVGTNQIIDASKKGNNARFINHSCDPNSVLQKWTVGHQSRIGVFALRDIEKGEEVT 118
Query: 228 YDYQYE 233
+DY E
Sbjct: 119 FDYAME 124
>gi|410516927|sp|Q4IB50.2|SET2_GIBZE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
Length = 911
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 63/188 (33%), Positives = 100/188 (53%), Gaps = 6/188 (3%)
Query: 62 EDDGIFCSCTAS--PGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
++D + C C G + C D C + + G C C N+ FQ + +
Sbjct: 109 DNDALDCECRDEWHDGKNLACGEDSDCINRATKMECSAEGGNCAGGCQNQRFQRKQYANV 168
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+++TEK G G+ AD D++ +FV EY+GEVI++ T R+ + G +FY +N+
Sbjct: 169 SVIKTEKKGFGLRADSDLQPNDFVFEYIGEVINEPTFRRRMIQYDEEGIKHFYFMSLNKS 228
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
+DAT KGN R+ NHSC PN + KW++ + R+GIF +R I+ GE L ++Y + +
Sbjct: 229 EFVDATKKGNYGRFCNHSCNPNCYVDKWVVGDKLRMGIFTSRKIQSGEELVFNYNVDRYG 288
Query: 236 HDSLIAYC 243
D YC
Sbjct: 289 ADPQPCYC 296
>gi|406697594|gb|EKD00852.1| histone-lysine N-methyltransferase [Trichosporon asahii var. asahii
CBS 8904]
Length = 843
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 60/149 (40%), Positives = 88/149 (59%), Gaps = 1/149 (0%)
Query: 83 DCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVI 141
DC ++ C +G CK G C N+ + + +VQTEK G G+ A EDI G V
Sbjct: 208 DCINRIMFIECLAGECKAGKRCRNQRLARKQYAPVDVVQTEKKGFGLRAREDIPAGALVY 267
Query: 142 EYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
EY+GEV+ ++T +R+ + G +FY + ++ IDAT KG R+ NHSC PN E+
Sbjct: 268 EYIGEVVVEKTFRKRMAEYADEGIKHFYFMMLQKEEYIDATKKGGIGRFANHSCNPNCEV 327
Query: 202 QKWIIDGETRIGIFATRDIKKGENLTYDY 230
QKW++ R+GIF RD+KK E +T++Y
Sbjct: 328 QKWVVGRRMRMGIFTKRDVKKDEEITFNY 356
>gi|50302715|ref|XP_451294.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|74607741|sp|Q6CXP5.1|SET2_KLULA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|49640425|emb|CAH02882.1| KLLA0A06600p [Kluyveromyces lactis]
Length = 702
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 61/153 (39%), Positives = 90/153 (58%), Gaps = 3/153 (1%)
Query: 81 DRDCHCGMLLSSCSSG-CK--CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRG 137
D DC L C +G CK CG C N+ FQ + + + +TE+ G G+ A+ DI+
Sbjct: 71 DSDCINRATLIECVNGLCKHSCGTDCQNQRFQKKAYADISVFKTERKGFGVRANSDIEPH 130
Query: 138 EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
F+ EY+GEVI ++ R+ K +G +FY + IDAT KG +R+ NHSC P
Sbjct: 131 NFIYEYIGEVIQEEEFRNRMVKYDQMGFKHFYFMMLQTGQFIDATLKGCIARFCNHSCNP 190
Query: 198 NTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
N + KW+++G+ ++GIFA R I KGE +T+DY
Sbjct: 191 NAYVNKWVVNGKLKMGIFANRHISKGEEVTFDY 223
>gi|408394323|gb|EKJ73531.1| hypothetical protein FPSE_06149 [Fusarium pseudograminearum CS3096]
Length = 911
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 63/188 (33%), Positives = 100/188 (53%), Gaps = 6/188 (3%)
Query: 62 EDDGIFCSCTAS--PGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
++D + C C G + C D C + + G C C N+ FQ + +
Sbjct: 109 DNDALDCECRDEWHDGKNLACGEDSDCINRATKMECSAEGGNCAGGCQNQRFQRKQYANV 168
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+++TEK G G+ AD D++ +FV EY+GEVI++ T R+ + G +FY +N+
Sbjct: 169 SVIKTEKKGFGLRADSDLQPNDFVFEYIGEVINEPTFRRRMIQYDEEGIKHFYFMSLNKS 228
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
+DAT KGN R+ NHSC PN + KW++ + R+GIF +R I+ GE L ++Y + +
Sbjct: 229 EFVDATKKGNYGRFCNHSCNPNCYVDKWVVGDKLRMGIFTSRKIQSGEELVFNYNVDRYG 288
Query: 236 HDSLIAYC 243
D YC
Sbjct: 289 ADPQPCYC 296
>gi|291225527|ref|XP_002732754.1| PREDICTED: Ash1l protein-like [Saccoglossus kowalevskii]
Length = 2643
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 78/218 (35%), Positives = 112/218 (51%), Gaps = 16/218 (7%)
Query: 28 VEFELPD---WFIKPKAIP-------YVFIKRNIYLTKRIKRRLEDDGIFCSCT-ASPGS 76
V+F LP W +P Y I+ NIY+ + LE + C+C + G
Sbjct: 1781 VDFSLPYDIWWLHDMDKLPPKNDDSKYKKIRSNIYVDVKPYSGLE--AVSCNCMKPTNGI 1838
Query: 77 SGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDI 134
C DC M+ + CS C CG+ C N Q + ++ + TE G GI + I
Sbjct: 1839 DRGCGDDCINKMVFTECSPNLCPCGDHCSNMSIQKHEWITTLERLPTENRGWGIRTKDAI 1898
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
K G+F++EYVGEV+ + +R K + + Y +N MVID G + R++NHS
Sbjct: 1899 KSGQFILEYVGEVVSENEFWQRAMK-NYQNRRHHYCLNLNSGMVIDGYRMGCEGRFVNHS 1957
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C PN EMQKW ++G RIG+FA +DI+ G LTYDY +
Sbjct: 1958 CEPNCEMQKWSVNGVYRIGLFALKDIQPGSELTYDYNF 1995
>gi|195377533|ref|XP_002047543.1| GJ11874 [Drosophila virilis]
gi|194154701|gb|EDW69885.1| GJ11874 [Drosophila virilis]
Length = 2343
Score = 121 bits (304), Expect = 2e-25, Method: Composition-based stats.
Identities = 69/214 (32%), Positives = 112/214 (52%), Gaps = 12/214 (5%)
Query: 27 PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC 86
P +P W Y I+ N+Y D C+C P +SG C +C
Sbjct: 1414 PTRHVVPSW-------NYRKIRTNVYAESVRPNLAGFDHPNCNC--KPQASGACLDNCLN 1464
Query: 87 GMLLSSCS-SGCKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADEDIKRGEFVIEYV 144
M+ + C+ S C G+ C N+ Q V ++ T G G+ I +G +++EYV
Sbjct: 1465 RMVYTECAPSNCPAGDKCRNQKIQRHEVAPGVERFMTADKGWGVRTKLPIAKGTYILEYV 1524
Query: 145 GEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKW 204
GEV+ ++ ++R+ + +L +T+ Y ++ +VID G+ R++NHSC PN EMQKW
Sbjct: 1525 GEVVTEREFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKW 1583
Query: 205 IIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
++G +R+ +FA R I++GE LTYDY + + S
Sbjct: 1584 SVNGLSRMVLFAKRAIEQGEELTYDYNFSLFNPS 1617
>gi|402217077|gb|EJT97159.1| SET domain-containing protein [Dacryopinax sp. DJM-731 SS1]
Length = 711
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 70/208 (33%), Positives = 112/208 (53%), Gaps = 8/208 (3%)
Query: 44 YVFIKRNIYLTKRIKR-RLEDDGIFCSC-----TASPGSSGVCDRDCHCGMLLSSC-SSG 96
++ + N Y + R R +DD + C C T P ++ DC + C +
Sbjct: 34 FIELSDNAYQYNSLGRSRQKDDCMECDCQFDYATDPPEAACGYGSDCINRLTQVECLADE 93
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C+C + C N+ FQ + +++V T+K G G+ A D++ F+ EY+GEVI+ T +R
Sbjct: 94 CRCRSHCQNQRFQRKQYADIEIVSTDKKGYGLRAATDLQPDSFIYEYIGEVINQTTFLKR 153
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + G +FY + R+ IDAT KG R+ NHSC PN + KW++ R+GIFA
Sbjct: 154 MHEYADEGIKHFYFMMLQREEYIDATKKGGIGRFANHSCNPNCYVAKWVVGPRVRMGIFA 213
Query: 217 TRDIKKGENLTYDYQYE-FLHDSLIAYC 243
R+I +GE LT++Y + + HD+ YC
Sbjct: 214 KREISRGEELTFNYNVDRYGHDAQPCYC 241
>gi|170587756|ref|XP_001898640.1| SET domain containing protein [Brugia malayi]
gi|158593910|gb|EDP32504.1| SET domain containing protein [Brugia malayi]
Length = 1449
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 76/219 (34%), Positives = 110/219 (50%), Gaps = 21/219 (9%)
Query: 20 LLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGV 79
+LK G PV++E D I + ++ ++ R FC T+ + G
Sbjct: 550 MLKSAGEPVQYEYLDENI-------ILCDESLIKEAKVVR------CFCEPTSVEIAEGR 596
Query: 80 -CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGE 138
C C L + C S C G C N+ F N+ K+++ G G+ A E ++ G
Sbjct: 597 GCSSGCINRELYTECGSRCPSGAGCANRRFHNKQYAKVEVFNAGIKGWGLRAAEPLEPGR 656
Query: 139 FVIEYVGEVIDDQTCEERLWKMKHLGE----TNFYLCEINRDMVIDATYKGNKSRYINHS 194
F+IEYVGEVID EE + + + G+ + YL + VIDAT KGN SR+INHS
Sbjct: 657 FIIEYVGEVID---AEEMIRRGRRYGKDPKHVHHYLMALKNGAVIDATAKGNVSRFINHS 713
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
C PN E QKW ++ + R+G F + I GE + +DYQ E
Sbjct: 714 CDPNCESQKWTVNRQLRVGFFVIKPIALGEEIVFDYQLE 752
>gi|452846178|gb|EME48111.1| hypothetical protein DOTSEDRAFT_167709 [Dothistroma septosporum
NZE10]
Length = 963
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 60/168 (35%), Positives = 95/168 (56%), Gaps = 3/168 (1%)
Query: 79 VCDRDCHCGMLLS--SCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKR 136
CD D C + C C CG+ C N+ FQ + + +++TEK G G+ AD++++
Sbjct: 172 ACDEDSDCINRATKMECVGDCGCGSKCQNRRFQRKKYADVTVIKTEKKGYGLRADKELRP 231
Query: 137 GEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCC 196
G+FV EY+GEVI + R+ + G +FY + + +DAT KGN R+ NHSC
Sbjct: 232 GDFVYEYIGEVIGENVFRRRMQQYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCN 291
Query: 197 PNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
PN + KW+++ + R+GIF R+I+ GE L ++Y + + D YC
Sbjct: 292 PNCYVDKWVVNDKLRMGIFVERNIQAGEELVFNYNVDRYGADPQPCYC 339
>gi|340515729|gb|EGR45981.1| predicted protein [Trichoderma reesei QM6a]
Length = 933
Score = 121 bits (304), Expect = 2e-25, Method: Composition-based stats.
Identities = 58/177 (32%), Positives = 95/177 (53%), Gaps = 5/177 (2%)
Query: 62 EDDGIFCSCTAS--PGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
+ D + C C G + C D C + + CG C N+ FQ + +
Sbjct: 123 DSDALDCDCREEWRDGENFACGEDSDCINRATKMECSADSANCGGGCQNRRFQRKQYADV 182
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+++TEK G G+ A+ ++ +F+ EY+GEVI++ T R+ + G +FY +N++
Sbjct: 183 TVIKTEKKGFGLRANSPLEPNDFIYEYIGEVINEPTFRRRMLQYDEEGIKHFYFMSLNKN 242
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
+DAT KGN R+ NHSC PN + KW++ + R+GIFA R I+ GE L ++Y +
Sbjct: 243 EFVDATKKGNLGRFCNHSCSPNCFVDKWVVGDKLRMGIFALRKIQAGEELVFNYNVD 299
>gi|296808927|ref|XP_002844802.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
gi|238844285|gb|EEQ33947.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
Length = 969
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 59/162 (36%), Positives = 93/162 (57%), Gaps = 1/162 (0%)
Query: 83 DCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIE 142
DC C C CG++C N+ FQ R + +++TEK G G+ A+ D++ +F+ E
Sbjct: 164 DCINRATKMECVQNCGCGDNCQNQRFQKRQYADVTVIKTEKKGFGLRANTDLQPNDFIFE 223
Query: 143 YVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQ 202
Y+GEVI++ +R+ + G +FY +N+ +DAT KGN R+ NHSC PN +
Sbjct: 224 YIGEVINEPQFRKRMIQYDEEGIKHFYFMSLNKGEFVDATRKGNLGRFCNHSCNPNCYID 283
Query: 203 KWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
KW++ + R+GIFA R +K GE L ++Y + + D YC
Sbjct: 284 KWVVGEKLRMGIFAERHVKAGEELVFNYNVDRYGADPQPCYC 325
>gi|219841942|gb|AAI43511.1| WHSC1L1 protein [Homo sapiens]
Length = 1426
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 72/199 (36%), Positives = 103/199 (51%), Gaps = 15/199 (7%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1069 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1127
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1128 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1187
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + N D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1188 IKRAHE-----------NSDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1236
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1237 LCDIPAGMELTFNYNLDCL 1255
>gi|397521375|ref|XP_003830772.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2 [Pan
paniscus]
Length = 1426
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 72/199 (36%), Positives = 103/199 (51%), Gaps = 15/199 (7%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1069 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1127
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1128 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1187
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + N D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1188 IKRAHE-----------NSDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1236
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1237 LCDIPAGMELTFNYNLDCL 1255
>gi|255943807|ref|XP_002562671.1| Pc20g01100 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211587406|emb|CAP85439.1| Pc20g01100 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 688
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 59/164 (35%), Positives = 93/164 (56%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C+ C CG C N+ FQ R + +++TEK G G+ A++++ GE +
Sbjct: 145 DSDCINRATKIECAGECGCGPDCQNQRFQKRQFAPVSVIKTEKKGFGLRAEKNLDPGELI 204
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EYVGEV+ +Q +R+ + G +FY +N+ +DAT +GN R+ NHSC PN
Sbjct: 205 YEYVGEVVGEQQFRKRMRQYDEEGIKHFYFMSLNKGEFVDATKRGNLGRFCNHSCNPNCY 264
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R ++ GE L ++Y + + D YC
Sbjct: 265 VDKWVVGEKLRMGIFAERAVQAGEELVFNYNVDRYGADPQPCYC 308
>gi|301629157|ref|XP_002943714.1| PREDICTED: hypothetical protein LOC100496979 [Xenopus (Silurana)
tropicalis]
Length = 1666
Score = 121 bits (303), Expect = 3e-25, Method: Composition-based stats.
Identities = 63/167 (37%), Positives = 94/167 (56%), Gaps = 4/167 (2%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRP-VKKMKLVQTEKCG 125
C C + G C DC M+ + CS + C CG C N+ Q V+ ++ + E G
Sbjct: 4 CKCPDNQSEKG-CVEDCLNRMIFAECSPNTCPCGEQCNNQCIQRHEWVQCLERFRAEGKG 62
Query: 126 AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKG 185
GI E +K +F+IEY+GEV+ + R + H ++ Y ++ MVID+ G
Sbjct: 63 WGIRTKEPLKASQFIIEYLGEVVSETEFRNRTIEQYH-NHSDHYCLSLDSGMVIDSYRMG 121
Query: 186 NKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
N++R+INHSC PN EMQKW ++G RIG++A +D+ G LTYDY +
Sbjct: 122 NEARFINHSCDPNCEMQKWSVNGVYRIGLYALKDMPAGTELTYDYNF 168
>gi|332825882|ref|XP_003311721.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Pan troglodytes]
Length = 1426
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 72/199 (36%), Positives = 103/199 (51%), Gaps = 15/199 (7%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1069 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1127
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1128 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1187
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + N D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1188 IKRAHE-----------NSDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1236
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1237 LCDIPAGMELTFNYNLDCL 1255
>gi|296222019|ref|XP_002757007.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Callithrix jacchus]
Length = 1427
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 72/199 (36%), Positives = 103/199 (51%), Gaps = 15/199 (7%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1070 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1128
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1129 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1188
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + N D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHE-----------NSDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1237
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1238 LCDIPAGMELTFNYNLDCL 1256
>gi|327300913|ref|XP_003235149.1| SET and WW domain-containing protein [Trichophyton rubrum CBS
118892]
gi|326462501|gb|EGD87954.1| SET and WW domain-containing protein [Trichophyton rubrum CBS
118892]
Length = 971
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 94/164 (57%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG++C N+ FQ R + +++TEK G G+ A+ D++ +F+
Sbjct: 162 DSDCINRATKMECVDNCGCGDNCQNQRFQKRQYADVTVIKTEKKGFGLRANTDLRPNDFI 221
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GEVI++ +R+ + G +FY +N+ +DAT KGN R+ NHSC PN
Sbjct: 222 FEYIGEVINEPQFRKRMMQYDDEGIKHFYFMSLNKGEFVDATRKGNLGRFCNHSCNPNCY 281
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA + IK GE L ++Y + + D YC
Sbjct: 282 IDKWVVGEKLRMGIFAEKYIKAGEELVFNYNVDRYGADPQPCYC 325
>gi|315048637|ref|XP_003173693.1| histone-lysine N-methyltransferase [Arthroderma gypseum CBS 118893]
gi|311341660|gb|EFR00863.1| histone-lysine N-methyltransferase [Arthroderma gypseum CBS 118893]
Length = 967
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 65/188 (34%), Positives = 102/188 (54%), Gaps = 7/188 (3%)
Query: 63 DDGIFCSCT----ASPGSSGVC--DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
D + C C + G + C D DC C C CG++C N+ FQ R +
Sbjct: 138 DHAMECDCAEEWNSKTGINEACGEDSDCINRATKMECVQNCGCGDNCQNQRFQKRQYADV 197
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+++TEK G G+ A+ D++ +F+ EY+GEVI++ +R+ + G +FY +N+
Sbjct: 198 TVIKTEKKGFGLRANADLRPNDFIFEYIGEVINEPQFRKRMIQYDDEGIKHFYFMSLNKG 257
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
+DAT KGN R+ NHSC PN + KW++ + R+GIFA + IK GE L ++Y + +
Sbjct: 258 EFVDATRKGNLGRFCNHSCNPNCYIDKWVVGEKLRMGIFAEKYIKAGEELVFNYNVDRYG 317
Query: 236 HDSLIAYC 243
D YC
Sbjct: 318 ADPQPCYC 325
>gi|58269858|ref|XP_572085.1| histone-lysine N-methyltransferase [Cryptococcus neoformans var.
neoformans JEC21]
gi|338819405|sp|P0CO28.1|SET2_CRYNJ RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|57228321|gb|AAW44778.1| histone-lysine N-methyltransferase, putative [Cryptococcus
neoformans var. neoformans JEC21]
Length = 834
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 69/205 (33%), Positives = 107/205 (52%), Gaps = 11/205 (5%)
Query: 45 VFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGV----CDRDCHCGMLLSSCSSG-CKC 99
V+ +++I L+K D+ + C C + D DC L C +G C+
Sbjct: 112 VYERKDIGLSKE-----NDEMMVCECVYNRHDPDADPCGPDSDCINRALYIECIAGECRA 166
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
G C N+ F R + +V TEK G G+ A I + EY+GEV+ ++T +R+ +
Sbjct: 167 GKHCHNQQFSKRQYANVDVVLTEKKGYGLRASSTIPANTLIYEYIGEVVAEKTFRKRMQQ 226
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
G +FY + ++ IDAT KG R+ NHSC PN E+QKW++ R+GIF RD
Sbjct: 227 YADEGIRHFYFMMLQKEEYIDATKKGGIGRFANHSCNPNCEVQKWVVGRRLRMGIFTKRD 286
Query: 220 IKKGENLTYDYQYE-FLHDSLIAYC 243
+ KGE +T++Y + + HD+ YC
Sbjct: 287 VIKGEEITFNYNVDRYGHDAQTCYC 311
>gi|134113727|ref|XP_774448.1| hypothetical protein CNBG0940 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|338819404|sp|P0CO29.1|SET2_CRYNB RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|50257086|gb|EAL19801.1| hypothetical protein CNBG0940 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 834
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 69/205 (33%), Positives = 107/205 (52%), Gaps = 11/205 (5%)
Query: 45 VFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGV----CDRDCHCGMLLSSCSSG-CKC 99
V+ +++I L+K D+ + C C + D DC L C +G C+
Sbjct: 112 VYERKDIGLSKE-----NDEMMVCECVYNRHDPDADPCGPDSDCINRALYIECIAGECRA 166
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
G C N+ F R + +V TEK G G+ A I + EY+GEV+ ++T +R+ +
Sbjct: 167 GKHCHNQQFSKRQYANVDVVLTEKKGYGLRASSTIPANTLIYEYIGEVVAEKTFRKRMQQ 226
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
G +FY + ++ IDAT KG R+ NHSC PN E+QKW++ R+GIF RD
Sbjct: 227 YADEGIRHFYFMMLQKEEYIDATKKGGIGRFANHSCNPNCEVQKWVVGRRLRMGIFTKRD 286
Query: 220 IKKGENLTYDYQYE-FLHDSLIAYC 243
+ KGE +T++Y + + HD+ YC
Sbjct: 287 VIKGEEITFNYNVDRYGHDAQTCYC 311
>gi|321479504|gb|EFX90460.1| hypothetical protein DAPPUDRAFT_39415 [Daphnia pulex]
Length = 799
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 67/202 (33%), Positives = 111/202 (54%), Gaps = 10/202 (4%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCT-----ASPGSSGVCDRDCHCGMLLSSCSSG-C 97
Y I+ N+Y +K +D C+CT + C DC M+ + CSS C
Sbjct: 95 YRKIRSNVYYD--VKPPFTNDAEACNCTLPQPDTEDAEAQCCGDDCLNRMVYTECSSQMC 152
Query: 98 KCGNSCLNKPFQ-NRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
G+ C+N+ Q +R + ++ T++ G G+ ++K G F++EYVGEV+ D+ +ER
Sbjct: 153 PVGDKCMNQRIQRHRWAQGLERFMTKEKGWGVRCRNELKTGVFILEYVGEVVSDKEFKER 212
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + ++ +T+ Y ++ +VID G R++NHSC PN EMQKW ++G R+ +FA
Sbjct: 213 MHTV-YVHDTHHYCLHLDGGLVIDGHRMGGDGRFVNHSCAPNCEMQKWSVNGLPRMALFA 271
Query: 217 TRDIKKGENLTYDYQYEFLHDS 238
RDI E L+YDY + + +
Sbjct: 272 LRDISPLEELSYDYNFSLFNPA 293
>gi|254566589|ref|XP_002490405.1| Histone methyltransferase [Komagataella pastoris GS115]
gi|238030201|emb|CAY68124.1| Histone methyltransferase [Komagataella pastoris GS115]
gi|328350800|emb|CCA37200.1| histone-lysine N-methyltransferase SETD2 [Komagataella pastoris CBS
7435]
Length = 722
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 67/174 (38%), Positives = 96/174 (55%), Gaps = 7/174 (4%)
Query: 64 DGIFCSCTAS----PGSSGVCDRDCHCGMLLSS--CSSG-CKCGNSCLNKPFQNRPVKKM 116
DG+ C C + G + C C L+S C SG C CG+ C N+ FQ + +
Sbjct: 65 DGMSCDCEENWDTVSGINNACGEYSECINRLTSIECISGTCSCGDDCQNQRFQKKQYAPI 124
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+ +TEK G G+ A DI++ F+ EY+GEVID+ T +R + G +FY + +
Sbjct: 125 AVFETEKKGYGVRAQADIRQDAFIYEYLGEVIDESTFRKRKENYDNQGLEHFYFMMLQKG 184
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
IDAT KG R+ NHSC PN + KW + + R+GIFA R+I KGE + +DY
Sbjct: 185 EFIDATAKGGLGRFCNHSCRPNAYVDKWEVGNKLRMGIFAKREIYKGEEICFDY 238
>gi|312072804|ref|XP_003139232.1| hypothetical protein LOAG_03647 [Loa loa]
gi|307765598|gb|EFO24832.1| hypothetical protein LOAG_03647 [Loa loa]
Length = 1422
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 67/172 (38%), Positives = 93/172 (54%), Gaps = 8/172 (4%)
Query: 67 FCSCTASPGSSGV-CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCG 125
FC T + + G C C L + C S C G C N+ F N+ K+++ G
Sbjct: 554 FCEPTLAEIAEGRGCSSGCINRELYTECGSRCPSGVGCANRRFHNKQYAKVEVFNAGVKG 613
Query: 126 AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGE----TNFYLCEINRDMVIDA 181
G+ A E ++ G F+IEY+GEVID EE + + + G+ + YL + VIDA
Sbjct: 614 WGLRAAEPLEPGRFIIEYIGEVID---AEEMIRRGRRYGKDPKHVHHYLMALKNGAVIDA 670
Query: 182 TYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
T KGN SR+INHSC PN E QKW +D + R+G F + I GE + +DYQ E
Sbjct: 671 TAKGNVSRFINHSCDPNCESQKWTVDRQLRVGFFVIKPIALGEEIVFDYQLE 722
>gi|74210080|dbj|BAE21322.1| unnamed protein product [Mus musculus]
Length = 484
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 74/199 (37%), Positives = 105/199 (52%), Gaps = 15/199 (7%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSS--CSSG- 96
K PY IK N + K ++ ++ D C PG C + C +S C
Sbjct: 128 KPPPYKHIKANKVIGK-VQVQVADLSEIPRCNCKPGDENPCGLESQCLNRMSQYECHPQV 186
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 187 CPAGDRCQNQCFTKRLYPNAEVIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 246
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ K H N D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 247 I-KRAHE----------NSDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 295
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 296 LCDIPAGMELTFNYNLDCL 314
>gi|297295821|ref|XP_001094467.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Macaca mulatta]
Length = 2329
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 66/169 (39%), Positives = 97/169 (57%), Gaps = 12/169 (7%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGA 126
C+C A+ + D +C MLL C + C G C N+ F R ++++ +T + G
Sbjct: 1539 CNCKATDENPCGIDSECINRMLLYECHPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGW 1598
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ DIK+GEFV EYVGE+ID++ C R+ + + D +IDA KGN
Sbjct: 1599 GLRTKTDIKKGEFVNEYVGELIDEEECRARI-----------RYAQEHEDRIIDAGPKGN 1647
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
+R++NH C PN E QKW ++G+TR+G+FA DIK G LT++Y E L
Sbjct: 1648 YARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECL 1696
>gi|389743273|gb|EIM84458.1| hypothetical protein STEHIDRAFT_82382 [Stereum hirsutum FP-91666
SS1]
Length = 1040
Score = 120 bits (302), Expect = 4e-25, Method: Composition-based stats.
Identities = 65/188 (34%), Positives = 97/188 (51%), Gaps = 8/188 (4%)
Query: 64 DGIFCSCTASPGSSG----VCDRDCHCGMLLSSCS---SGCKCGNSCLNKPFQNRPVKKM 116
+ + C CT P + C D C L+ C+C + C N+ FQ +
Sbjct: 241 ESMTCDCTYDPSAGDDPDIACGHDSDCINRLTQVECMPDDCRCKSYCRNQRFQRLQYAPI 300
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+VQTEK G G+ A D+++ F+ EYVG+VI T +R+ + G +FY + +D
Sbjct: 301 DIVQTEKKGFGLRAAADLRKDTFIYEYVGDVISHPTFVKRMREYAEEGIRHFYFMMLQKD 360
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
IDAT +G R+ NHSC PN + KW + R+GIFA R I+ E LT++Y + +
Sbjct: 361 EYIDATKRGGIGRFANHSCNPNCYVAKWTVGSRIRMGIFANRFIRANEELTFNYNVDRYG 420
Query: 236 HDSLIAYC 243
HD+ YC
Sbjct: 421 HDAQPCYC 428
>gi|366997071|ref|XP_003678298.1| hypothetical protein NCAS_0I02880 [Naumovozyma castellii CBS 4309]
gi|342304169|emb|CCC71956.1| hypothetical protein NCAS_0I02880 [Naumovozyma castellii CBS 4309]
Length = 726
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 68/193 (35%), Positives = 106/193 (54%), Gaps = 6/193 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSC--TASPGSSGVCDR--DCHCGMLLSSCSSGC-- 97
+ +K+ Y +K++ +D I C C ++ G + CD DC + L C +
Sbjct: 30 FTELKKCTYSSKKLGNSKNNDFIECDCYEDSTNGVNHACDENSDCINRLTLIECVNDLCY 89
Query: 98 KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
CGN C N+ FQ + + +T+ G G+ A+ DI+ EF+ EY GEVID++ +R+
Sbjct: 90 SCGNDCQNQRFQKSQYADISIFKTKMKGYGVRANADIETNEFIYEYTGEVIDEEIFRDRM 149
Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
+ +FY + IDAT KG+ +R+ NHSC PN + KW + G+ R+GIFA+
Sbjct: 150 IEYDEKKFKHFYFMMLQNCEFIDATLKGSLARFCNHSCNPNAYVNKWEVAGKLRMGIFAS 209
Query: 218 RDIKKGENLTYDY 230
R I KGE +T+DY
Sbjct: 210 RKIIKGEEITFDY 222
>gi|225562106|gb|EEH10386.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus G186AR]
Length = 1000
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 92/164 (56%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG+ C N+ FQ R + +++TEK G G+ A+ D++ +F+
Sbjct: 173 DSDCINRATKMECVGDCGCGDECENQRFQRRQYADVSVIKTEKKGYGLRAESDLRPNQFI 232
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GEVI++ +R+ G +FY +++ +DAT KGN R+ NHSC PN
Sbjct: 233 FEYIGEVINETLFRKRMLSYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCY 292
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R IK GE L ++Y + + D YC
Sbjct: 293 VDKWVVGEKLRMGIFAERHIKAGEELVFNYNVDRYGADPQPCYC 336
>gi|325091551|gb|EGC44861.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus H88]
Length = 1000
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 92/164 (56%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG+ C N+ FQ R + +++TEK G G+ A+ D++ +F+
Sbjct: 173 DSDCINRATKMECVGDCGCGDECENQRFQRRQYAHVSVIKTEKKGYGLRAESDLRPNQFI 232
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GEVI++ +R+ G +FY +++ +DAT KGN R+ NHSC PN
Sbjct: 233 FEYIGEVINETLFRKRMLSYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCY 292
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R IK GE L ++Y + + D YC
Sbjct: 293 VDKWVVGEKLRMGIFAERHIKAGEELVFNYNVDRYGADPQPCYC 336
>gi|410911836|ref|XP_003969396.1| PREDICTED: histone-lysine N-methyltransferase ASH1L-like [Takifugu
rubripes]
Length = 2782
Score = 120 bits (302), Expect = 4e-25, Method: Composition-based stats.
Identities = 78/227 (34%), Positives = 120/227 (52%), Gaps = 18/227 (7%)
Query: 19 KLLKQIGNPVEFELP---------DWFIKPKAIP-YVFIKRNIYLTKRIKRRLEDDGIFC 68
K L+Q ++F+LP D K +P Y I+ N+Y+ +K + C
Sbjct: 1875 KYLRQ--KRIDFQLPYDILWLWKHDQLYKRPDVPLYKKIRSNVYVD--VKPLSGYETTTC 1930
Query: 69 SC-TASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRP-VKKMKLVQTEKCG 125
+C T + C DC M + CS S C + C N+ Q V+ ++ +TE G
Sbjct: 1931 NCRTPNDRIEKSCLDDCLNRMSFAECSPSTCPSADQCDNQHIQRHDWVQCLERFRTEGKG 1990
Query: 126 AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKG 185
GI E ++ G+F+IEY+GEV+ +Q R+ + ++ + Y ++ MVID+ G
Sbjct: 1991 WGIRTKEPLRAGQFIIEYLGEVVSEQEFRSRMME-QYFSHSGNYCLNLDSGMVIDSYRMG 2049
Query: 186 NKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
N++R+INHSC PN EMQKW ++G RIG+FA +I G LTYDY +
Sbjct: 2050 NEARFINHSCEPNCEMQKWSVNGVYRIGLFALGEIPSGTELTYDYNF 2096
>gi|170029985|ref|XP_001842871.1| histone-lysine N-methyltransferase ash1 [Culex quinquefasciatus]
gi|167865331|gb|EDS28714.1| histone-lysine N-methyltransferase ash1 [Culex quinquefasciatus]
Length = 2119
Score = 120 bits (302), Expect = 4e-25, Method: Composition-based stats.
Identities = 69/214 (32%), Positives = 107/214 (50%), Gaps = 16/214 (7%)
Query: 27 PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC 86
P +P W Y I+ N+Y +K D C+C G C DC
Sbjct: 1723 PGRHSVPSW-------NYRKIRTNVYYD--VKPNPSSDHPQCNCKPDSG----CQDDCLN 1769
Query: 87 GMLLSSC-SSGCKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRGEFVIEYV 144
++ C C CG+ C N Q ++ TE G GI + E +K+G F++EY+
Sbjct: 1770 RLVYVECVPENCPCGDRCQNTKIQRHEYAPGLERFMTELKGWGIRSKEGVKKGSFIMEYL 1829
Query: 145 GEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKW 204
GEV+ ++ +ER+ + +L +T+ Y + +VID G+ R++NHSC PN EMQKW
Sbjct: 1830 GEVVTEKEFKERM-RTIYLNDTHHYCLNLTGGLVIDGHRMGSDCRFVNHSCAPNCEMQKW 1888
Query: 205 IIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
++G R+ ++A+RDI E L YDY + + S
Sbjct: 1889 SVNGLFRMALYASRDIPPHEELCYDYNFSLFNPS 1922
>gi|326481316|gb|EGE05326.1| histone-lysine N-methyltransferase [Trichophyton equinum CBS
127.97]
Length = 970
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 94/164 (57%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG++C N+ FQ R + +++TEK G G+ A+ D++ +F+
Sbjct: 163 DSDCINRATKMECVDNCGCGDNCQNQRFQKRQYADVTVIKTEKKGFGLRANTDLRPNDFI 222
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GEVI++ +R+ + G +FY +N+ +DAT KGN R+ NHSC PN
Sbjct: 223 FEYIGEVINEPQFRKRMIQYDDEGIKHFYFMSLNKGEFVDATRKGNLGRFCNHSCNPNCY 282
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA + IK GE L ++Y + + D YC
Sbjct: 283 IDKWVVGEKLRMGIFAEKYIKAGEELVFNYNVDRYGADPQPCYC 326
>gi|441621285|ref|XP_003269632.2| PREDICTED: histone-lysine N-methyltransferase NSD3 [Nomascus
leucogenys]
Length = 505
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 72/199 (36%), Positives = 103/199 (51%), Gaps = 15/199 (7%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 217 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 275
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 276 CPAGDRCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 335
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + N D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 336 IKRAHE-----------NSDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 384
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 385 LCDIPAGMELTFNYNLDCL 403
>gi|326468707|gb|EGD92716.1| histone-lysine N-methyltransferase [Trichophyton tonsurans CBS
112818]
Length = 970
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 94/164 (57%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG++C N+ FQ R + +++TEK G G+ A+ D++ +F+
Sbjct: 163 DSDCINRATKMECVDNCGCGDNCQNQRFQKRQYADVTVIKTEKKGFGLRANTDLRPNDFI 222
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GEVI++ +R+ + G +FY +N+ +DAT KGN R+ NHSC PN
Sbjct: 223 FEYIGEVINEPQFRKRMIQYDDEGIKHFYFMSLNKGEFVDATRKGNLGRFCNHSCNPNCY 282
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA + IK GE L ++Y + + D YC
Sbjct: 283 IDKWVVGEKLRMGIFAEKYIKAGEELVFNYNVDRYGADPQPCYC 326
>gi|395847339|ref|XP_003796336.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 3
[Otolemur garnettii]
Length = 1427
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 72/199 (36%), Positives = 102/199 (51%), Gaps = 15/199 (7%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ ++ D C P C + C ML C
Sbjct: 1070 KPPPYKHIKANKVIGK-VQIQVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 1128
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R ++++TE+ G G+ IK+GEFV EYVGE+ID++ C R
Sbjct: 1129 CPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLR 1188
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + N D +IDA KGN SR++NHSC PN E QKW ++G+ R+G+FA
Sbjct: 1189 IKRAHE-----------NSDRIIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1237
Query: 217 TRDIKKGENLTYDYQYEFL 235
DI G LT++Y + L
Sbjct: 1238 LCDIPAGMELTFNYNLDCL 1256
>gi|448089486|ref|XP_004196819.1| Piso0_004045 [Millerozyma farinosa CBS 7064]
gi|448093762|ref|XP_004197850.1| Piso0_004045 [Millerozyma farinosa CBS 7064]
gi|359378241|emb|CCE84500.1| Piso0_004045 [Millerozyma farinosa CBS 7064]
gi|359379272|emb|CCE83469.1| Piso0_004045 [Millerozyma farinosa CBS 7064]
Length = 744
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 61/155 (39%), Positives = 89/155 (57%), Gaps = 3/155 (1%)
Query: 79 VCDRDCHCGMLLSSCSSGCK---CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIK 135
CD D C L+S K CG C N+ FQ + + + QTE G G+ A++DI
Sbjct: 72 ACDDDSDCINRLTSVECTNKYGTCGKDCQNQRFQKKQYASVAVFQTEMKGYGLRAEKDIP 131
Query: 136 RGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSC 195
F+ EY GEVID++ +R+ + +FY + +D IDAT KG+ +R+ NHSC
Sbjct: 132 DSSFIYEYTGEVIDEEAFRKRMVQYDEKNIKHFYFMMLKQDSFIDATRKGSLARFCNHSC 191
Query: 196 CPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
PN + KW++ + R+GIFA R IK+GE +T+DY
Sbjct: 192 NPNAYVDKWVVGEKLRMGIFAKRTIKRGEEITFDY 226
>gi|260800140|ref|XP_002594994.1| hypothetical protein BRAFLDRAFT_99284 [Branchiostoma floridae]
gi|229280233|gb|EEN51005.1| hypothetical protein BRAFLDRAFT_99284 [Branchiostoma floridae]
Length = 1541
Score = 120 bits (301), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 70/202 (34%), Positives = 110/202 (54%), Gaps = 3/202 (1%)
Query: 37 IKPKAIPYVFIKRNIYL-TKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC-S 94
I K P+ IK N + + +I + C C + S + DC L+ C
Sbjct: 1057 IGKKPPPFRMIKTNKPVGSVQIHTADPSEIQRCECKVTDESPCGPESDCLNRNLMIECHP 1116
Query: 95 SGCKCGNSCLNKPFQNRPVKKMKLVQT-EKCGAGIVADEDIKRGEFVIEYVGEVIDDQTC 153
+GC G C N+ F R ++ +T + G G+ D+K+ +FV EYVGE+ID++
Sbjct: 1117 AGCPAGEKCQNQRFVKRQYPAVESFKTPDGRGWGLKTLVDVKKHDFVYEYVGELIDEEEV 1176
Query: 154 EERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIG 213
+ R+ K TNFY+ ++++ +IDA K N SR++NHSC PN E QKW+++G+ R+G
Sbjct: 1177 QRRIKKAHEDNVTNFYMLTLDKNRIIDAGPKANMSRFMNHSCQPNCETQKWMVNGDIRVG 1236
Query: 214 IFATRDIKKGENLTYDYQYEFL 235
+FA DI G LT++Y + L
Sbjct: 1237 LFAMDDIPTGSELTFNYNLDCL 1258
>gi|164655887|ref|XP_001729072.1| hypothetical protein MGL_3860 [Malassezia globosa CBS 7966]
gi|159102961|gb|EDP41858.1| hypothetical protein MGL_3860 [Malassezia globosa CBS 7966]
Length = 707
Score = 120 bits (301), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 62/148 (41%), Positives = 86/148 (58%), Gaps = 1/148 (0%)
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C+CG C N+ FQ K+ +V+T G G+ A D+ R EFV EY+GEVI+ T R
Sbjct: 29 CRCGVHCANQRFQQHAYAKVDIVKTPDKGYGLRAQTDLGRDEFVYEYIGEVINHDTFMRR 88
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + K +FY + RD IDAT +G +SR+INHSC PN + KW + R+GIFA
Sbjct: 89 MAQYKTEHIEHFYFMMLQRDEYIDATKRGGRSRFINHSCSPNCYVSKWHVGRHVRMGIFA 148
Query: 217 TRDIKKGENLTYDYQYE-FLHDSLIAYC 243
R I GE LT++Y + + +D YC
Sbjct: 149 KRRILAGEELTFNYNVDRYGNDPQPCYC 176
>gi|402594990|gb|EJW88916.1| SET domain-containing protein [Wuchereria bancrofti]
Length = 1425
Score = 120 bits (301), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 67/172 (38%), Positives = 94/172 (54%), Gaps = 8/172 (4%)
Query: 67 FCSCTASPGSSGV-CDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCG 125
FC T++ + G C C L + C S C G C N+ F N+ K+++ G
Sbjct: 557 FCEPTSAEIAEGRGCSSGCINRELYTECGSRCPSGAGCANRRFHNKQYAKVEVFNAGIKG 616
Query: 126 AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGE----TNFYLCEINRDMVIDA 181
G+ A E ++ G F+IEYVGEVID EE + + + G+ + YL + VIDA
Sbjct: 617 WGLRAAEPLEPGRFIIEYVGEVID---AEEMIRRGRRYGKDPKHVHHYLMALKNGAVIDA 673
Query: 182 TYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
T KGN SR+INHSC PN E QKW ++ + R+G F + I GE + +DYQ E
Sbjct: 674 TAKGNVSRFINHSCDPNCESQKWTVNRQLRVGFFVIKPIALGEEIVFDYQLE 725
>gi|302903800|ref|XP_003048936.1| hypothetical protein NECHADRAFT_106311 [Nectria haematococca mpVI
77-13-4]
gi|256729870|gb|EEU43223.1| hypothetical protein NECHADRAFT_106311 [Nectria haematococca mpVI
77-13-4]
Length = 928
Score = 120 bits (301), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 63/188 (33%), Positives = 99/188 (52%), Gaps = 6/188 (3%)
Query: 62 EDDGIFCSCTAS--PGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
++D + C C G + C D C + + CG C N+ FQ + +
Sbjct: 119 DNDSLDCECREEWRDGENLACGEDSDCINRATKMECSAEAGNCGGGCQNQRFQRKQYANV 178
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+++TEK G G+ AD D++ +FV EY+GEVI++ T R+ + G +FY +N+
Sbjct: 179 SVIKTEKKGFGLRADADLQPNDFVYEYIGEVINEPTFRRRMLQYDEEGIKHFYFMSLNKS 238
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
+DAT KGN R+ NHSC PN + KW++ + R+GIF R I+ GE L ++Y + +
Sbjct: 239 EFVDATKKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFTLRKIQSGEELVFNYNVDRYG 298
Query: 236 HDSLIAYC 243
D YC
Sbjct: 299 ADPQPCYC 306
>gi|195368593|ref|XP_002045796.1| GM13547 [Drosophila sechellia]
gi|194134946|gb|EDW56462.1| GM13547 [Drosophila sechellia]
Length = 419
Score = 120 bits (301), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 71/188 (37%), Positives = 104/188 (55%), Gaps = 7/188 (3%)
Query: 37 IKPKAIPYVFIKRNIYLTK-RIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSC 93
IK PYV IK N + R + LED +C P C + C ML + C
Sbjct: 221 IKLHPPPYVKIKTNKAVPPLRFSQNLED---LSTCNCLPVDEHPCGPEAGCLNRMLFNEC 277
Query: 94 SSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
+ CK G+ C N+ F+ R ++++V + G G+V E I G+FVIEYVGEVI+
Sbjct: 278 NPEYCKAGSLCENRMFEQRKSPRLEVVYMNERGFGLVNREPIAAGDFVIEYVGEVINHAE 337
Query: 153 CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
+ R+ + + + N+Y + +D +IDA KGN +R++NHSC PN E QKW ++ R+
Sbjct: 338 FQRRMEQKQRDRDENYYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRV 397
Query: 213 GIFATRDI 220
GIFA +DI
Sbjct: 398 GIFAIKDI 405
>gi|67903676|ref|XP_682094.1| hypothetical protein AN8825.2 [Aspergillus nidulans FGSC A4]
gi|74592565|sp|Q5ASA5.1|SET2_EMENI RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|40740923|gb|EAA60113.1| hypothetical protein AN8825.2 [Aspergillus nidulans FGSC A4]
gi|259482964|tpe|CBF77940.1| TPA: Histone-lysine N-methyltransferase, H3 lysine-36 specific (EC
2.1.1.43)(SET domain-containing protein 2)
[Source:UniProtKB/Swiss-Prot;Acc:Q5ASA5] [Aspergillus
nidulans FGSC A4]
Length = 980
Score = 120 bits (301), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 61/164 (37%), Positives = 92/164 (56%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG C N+ FQ R + +++TEK G G+ A+ED++ +F+
Sbjct: 208 DSDCINRATKIECMGDCGCGPDCQNQRFQRREYANVAVIKTEKKGYGLRAEEDLRPHQFI 267
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EYVGEVI++ R+ + G +FY +++ +DAT KGN R+ NHSC PN
Sbjct: 268 FEYVGEVINEGPFHRRMRQYDAEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCY 327
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R I+ GE L ++Y + + D YC
Sbjct: 328 VDKWVVGEKLRMGIFAERHIQAGEELVFNYNVDRYGADPQPCYC 371
>gi|428183837|gb|EKX52694.1| hypothetical protein GUITHDRAFT_65244 [Guillardia theta CCMP2712]
Length = 211
Score = 120 bits (300), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 68/190 (35%), Positives = 104/190 (54%), Gaps = 6/190 (3%)
Query: 46 FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCL 104
++RN YL ++ + DD C+C G D+ C + C+ G C CG+SC
Sbjct: 1 MLRRNTYLHRKYAKLSVDDIAVCNCLPIKGKVACADQTCALRQIYVECTPGFCPCGDSCQ 60
Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
N+ FQ + ++ + +K G + ED+++G FVIEY+GE+++ + E R K +
Sbjct: 61 NQRFQKCQYVRTEVKKVDKRGWALFTMEDVQQGTFVIEYMGEILNRRMYERR--KKAYAK 118
Query: 165 ETNFYLCEINRD---MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
E + Y +N VIDA+ K + R+INHSC PN ++ GE +GIFA RDI+
Sbjct: 119 EKHTYFMVLNTSPIFEVIDASRKSSMGRFINHSCDPNCHTHRYRSLGEVVVGIFAKRDIE 178
Query: 222 KGENLTYDYQ 231
KGE +T DYQ
Sbjct: 179 KGEEITIDYQ 188
>gi|406867230|gb|EKD20268.1| histone H3 lysine 36 (K36) methyltransferase [Marssonina brunnea f.
sp. 'multigermtubi' MB_m1]
Length = 1038
Score = 120 bits (300), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 75/223 (33%), Positives = 116/223 (52%), Gaps = 8/223 (3%)
Query: 27 PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTAS--PGSSGVC--DR 82
P+ ++PD + + + IK IY +K + E D + C C+ G + C D
Sbjct: 119 PLFNDVPDATEEATGV-FQKIKDCIYGSKGMGSS-EHDALDCDCSEEWRDGKNHACGEDS 176
Query: 83 DCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVI 141
DC C G C CG+ C N+ FQ + + + +TEK G G+ A+ D+ +F+
Sbjct: 177 DCINRATKMECVGGECNCGSGCQNQRFQQKQYADVSVFKTEKKGYGLRANVDLDANDFIF 236
Query: 142 EYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
EY+GEVI++ T R + G +FY + + +DAT KGN R+ NHSC PN +
Sbjct: 237 EYIGEVINEPTFRRRTVQYDQEGIKHFYFMSLTKHEFVDATKKGNLGRFCNHSCNPNCYV 296
Query: 202 QKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
KW++ + R+GIFA R+IK GE L ++Y + + D YC
Sbjct: 297 DKWVVGEKLRMGIFAERNIKAGEELVFNYNVDRYGADPQPCYC 339
>gi|157123795|ref|XP_001653916.1| set domain protein [Aedes aegypti]
gi|157123797|ref|XP_001653917.1| set domain protein [Aedes aegypti]
gi|108874200|gb|EAT38425.1| AAEL009666-PB, partial [Aedes aegypti]
gi|108874201|gb|EAT38426.1| AAEL009666-PA, partial [Aedes aegypti]
Length = 2091
Score = 120 bits (300), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 67/197 (34%), Positives = 105/197 (53%), Gaps = 9/197 (4%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSS-GCKCGNS 102
Y I+ N+Y +K D C+C G C DC ++ CS C CG
Sbjct: 1240 YRKIRTNVYYD--VKPNPSTDHPQCNCKPDSG----CQDDCLNRLVFVECSPENCPCGER 1293
Query: 103 CLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
C N Q ++ TE+ G GI + E +++G F++EY+GEV+ ++ +ER+ +
Sbjct: 1294 CKNTKIQRHEYAPGLERFMTEQKGWGIRSKEGVRKGLFIMEYLGEVVTEKEFKERMRTI- 1352
Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
+L +T+ Y + +VID G+ R++NHSC PN EMQKW ++G R+ +FA+RDI
Sbjct: 1353 YLNDTHHYCLNLTGGLVIDGHRMGSDCRFVNHSCAPNCEMQKWSVNGLFRMALFASRDIP 1412
Query: 222 KGENLTYDYQYEFLHDS 238
E LTYDY + + +
Sbjct: 1413 PYEELTYDYNFSLFNPT 1429
>gi|402087032|gb|EJT81930.1| histone-lysine N-methyltransferase [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 962
Score = 120 bits (300), Expect = 6e-25, Method: Composition-based stats.
Identities = 61/153 (39%), Positives = 85/153 (55%), Gaps = 1/153 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C G CG C N+ FQ + K+ +++TEK G G+ AD D+ +FV
Sbjct: 146 DSDCINRHTKIECVGG-NCGEGCQNQRFQAKQYAKVSVIKTEKKGFGLRADTDLDANDFV 204
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GEVI ++ RL K +FY + R +DAT KGN R+ NHSC PN
Sbjct: 205 FEYIGEVIGEELFRRRLMKYDDQRLEHFYFMSLTRTEYVDATKKGNLGRFCNHSCNPNCF 264
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
+ KW++ + R+GIFA R IK GE L ++Y +
Sbjct: 265 VDKWVVGDKLRMGIFALRAIKAGEELCFNYNVD 297
>gi|307167950|gb|EFN61316.1| Probable histone-lysine N-methyltransferase ASH1L [Camponotus
floridanus]
Length = 2136
Score = 120 bits (300), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 73/214 (34%), Positives = 106/214 (49%), Gaps = 16/214 (7%)
Query: 32 LPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLS 91
+P W Y I+ NIY +K + C C G C DC M+ S
Sbjct: 1282 VPSWN-------YKKIRTNIYYD--VKPTTLYEAQACECKPESG----CGDDCINRMVFS 1328
Query: 92 SCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVID 149
CS C G C N+ Q ++ TE G G+ + IK G+F++EYVGEV+
Sbjct: 1329 ECSPQLCPSGEKCENQKIQKHEWSPGLQKFMTEDKGWGVRTQQSIKSGDFILEYVGEVVS 1388
Query: 150 DQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGE 209
++ + R+ ++ +T+ Y ++ +VID G R++NHSC PN EMQKW + G
Sbjct: 1389 EREFKSRMA-TRYANDTHHYCLHLDGGLVIDGHRMGGDGRFVNHSCEPNCEMQKWSVLGL 1447
Query: 210 TRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
R+ +FA+RDIK GE LTYDY + + S C
Sbjct: 1448 PRMALFASRDIKPGEELTYDYNFALFNPSEGQQC 1481
>gi|402585708|gb|EJW79647.1| hypothetical protein WUBG_09444, partial [Wuchereria bancrofti]
Length = 511
Score = 120 bits (300), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 66/176 (37%), Positives = 100/176 (56%), Gaps = 4/176 (2%)
Query: 70 CTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKP-FQNRPVKKMKLVQT-EKCGAG 127
C+ S G G D +C ++L C S C C NK F+ +++++ QT CG G
Sbjct: 137 CSCSSGCCGESD-ECLNRVVLMECGSSCPRNAICTNKRLFRRECIERLRTFQTMNGCGIG 195
Query: 128 IVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNK 187
+ D +I +G+F+ EY+GEV+ +T R + + + N Y + V+DA +KGN
Sbjct: 196 VKTDVNIDKGQFICEYIGEVVSMETFNIRS-RTDYRYQRNHYALNLCPGFVVDAYHKGNI 254
Query: 188 SRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
+R+INHSC PN EMQ+W ++G RIG+FA R I +GE LTYDY ++ + C
Sbjct: 255 ARFINHSCAPNCEMQRWSVNGHYRIGLFALRGIHEGEELTYDYNWDAFEFDDVTIC 310
>gi|169596857|ref|XP_001791852.1| hypothetical protein SNOG_01198 [Phaeosphaeria nodorum SN15]
gi|111069727|gb|EAT90847.1| hypothetical protein SNOG_01198 [Phaeosphaeria nodorum SN15]
Length = 940
Score = 120 bits (300), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 73/240 (30%), Positives = 119/240 (49%), Gaps = 12/240 (5%)
Query: 12 RIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSC- 70
++ A ++ +K+ P+ +LPD K + + + Y K I D + C C
Sbjct: 102 KLSRAASRTVKR-APPLYLDLPDSTQDAKQT-FEVLPQCTYANKHIG--TTDPALECDCG 157
Query: 71 ---TASPGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKC 124
A + C D C + + GC CG C N+ F + + +++T K
Sbjct: 158 EEWDAVEKVNNACGEDSDCINRATKMECVADGCSCGTKCQNQRFFRKQYADVTVIKTAKK 217
Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
G G+ A++D+ G+FV EY+GEVID++T R+ + G +FY + + +DAT K
Sbjct: 218 GFGLRANQDMFPGDFVFEYIGEVIDERTFRRRMIQYDEEGIKHFYFMSLTKGEFVDATKK 277
Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
GN R+ NHSC PN + KW++ + R+GIF R +K GE L ++Y + + D YC
Sbjct: 278 GNLGRFCNHSCNPNCFVDKWVVGDKLRMGIFVERKVKAGEELVFNYNVDRYGADPQPCYC 337
>gi|425766575|gb|EKV05181.1| Histone-lysine N-methyltransferase, H3 lysine-36 specific
[Penicillium digitatum PHI26]
gi|425781728|gb|EKV19675.1| Histone-lysine N-methyltransferase, H3 lysine-36 specific
[Penicillium digitatum Pd1]
Length = 887
Score = 120 bits (300), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 58/164 (35%), Positives = 93/164 (56%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C+ C CG+ C N+ FQ + + +++TEK G G+ A+ ++ GE +
Sbjct: 144 DSDCINRATKIECAGDCGCGSDCQNQRFQKQQFAPVSVIKTEKKGFGLRAERNLDPGELI 203
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EYVGEV+ +Q +R+ + G +FY +N+ +DAT +GN R+ NHSC PN
Sbjct: 204 YEYVGEVVGEQQFRKRMRQYDEEGIKHFYFMSLNKGEFVDATKRGNLGRFCNHSCNPNCY 263
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R ++ GE L ++Y + + D YC
Sbjct: 264 VDKWVVGEKLRMGIFAERAVQAGEELVFNYNVDRYGADPQPCYC 307
>gi|336387462|gb|EGO28607.1| hypothetical protein SERLADRAFT_359892 [Serpula lacrymans var.
lacrymans S7.9]
Length = 780
Score = 120 bits (300), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 71/208 (34%), Positives = 104/208 (50%), Gaps = 8/208 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLED-DGIFCSCTASPG---SSGVCDRDCHCGMLLSSCS---SG 96
+ I N Y + R E + + C C PG S C C L+
Sbjct: 50 FTEIHENHYQYGTLGRSREALESMTCDCQFEPGVDRPSDACGHGSDCINRLTQVECLPDD 109
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C+C + C N+ FQ R + +VQTEK G G+ A D+ R F+ EYVG+V+ + +R
Sbjct: 110 CRCRSFCQNQRFQRREYANINIVQTEKKGYGLRAAADLHRDAFIYEYVGDVVSHPSFVKR 169
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + G +FY + +D IDAT +G R+ NHSC PN + KW I R+GIFA
Sbjct: 170 MREYGEQGIRHFYFMMLQKDEFIDATKRGGMGRFANHSCNPNCYVAKWTIGNHVRMGIFA 229
Query: 217 TRDIKKGENLTYDYQYE-FLHDSLIAYC 243
R+I K E LT++Y + + H++ YC
Sbjct: 230 NRNIVKDEELTFNYNVDRYGHEAQPCYC 257
>gi|296418221|ref|XP_002838740.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295634701|emb|CAZ82931.1| unnamed protein product [Tuber melanosporum]
Length = 892
Score = 120 bits (300), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 71/187 (37%), Positives = 99/187 (52%), Gaps = 8/187 (4%)
Query: 51 IYLTKRIKRRLEDDGIFCSCTASPGSSGVCD------RDCHCGMLLSSCSSGCK-CGNSC 103
IY TK I +D G SC G C+ DC M C+ + CG C
Sbjct: 120 IYSTKSIGDSGQD-GEVMSCECRTDWDGECNPACGDNSDCINRMTSMECTQDDRACGQDC 178
Query: 104 LNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHL 163
N+ FQ R + +++TEK G G+ A+ DI F+ EYVGEVID+ R+ K
Sbjct: 179 QNQRFQQRQYADVSVIKTEKKGFGLRANSDISINTFLYEYVGEVIDENKFRRRMEKYDSE 238
Query: 164 GETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKG 223
G +FY + ++ IDAT KG +R+ NHSC PN + KW++ + R+GIFA R++K G
Sbjct: 239 GIKHFYFMSLGKNEFIDATKKGGLARFCNHSCNPNCFIDKWVVGEKLRMGIFAKRNVKAG 298
Query: 224 ENLTYDY 230
E L +DY
Sbjct: 299 EELVFDY 305
>gi|145540026|ref|XP_001455703.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124423511|emb|CAK88306.1| unnamed protein product [Paramecium tetraurelia]
Length = 1371
Score = 120 bits (300), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 78/228 (34%), Positives = 111/228 (48%), Gaps = 18/228 (7%)
Query: 25 GNPVEFE-LPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCT-----------A 72
GNPV + L D F + +K+N YL ++ R D+ + CSC
Sbjct: 32 GNPVRLQGLKDSFKY-----FTLLKKNQYLERKQYRVSRDETLVCSCIMCPEDQIQNRPQ 86
Query: 73 SPGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVAD 131
P S C C + C C C C N+ FQ + ++ G G+ A
Sbjct: 87 GPQYSYNCGERCLNRFTCTECDVELCPCAEQCKNRRFQKHDDACVYPLRCGGKGMGLFAG 146
Query: 132 EDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYI 191
E I +G+F+++YVGE+ + R ++ T YL ++N VID T KGN +R+I
Sbjct: 147 ERILKGQFIMQYVGEIFQINSAFGRRRVQEYSKSTCTYLMKLNNQEVIDPTSKGNLARFI 206
Query: 192 NHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSL 239
NHSC PN +KW + GE IGIFA RDI + E LT+DYQ++ H L
Sbjct: 207 NHSCEPNCITEKWNVLGEVCIGIFAIRDINEDEELTFDYQFDVFHTPL 254
>gi|392579013|gb|EIW72140.1| hypothetical protein TREMEDRAFT_25563 [Tremella mesenterica DSM
1558]
Length = 869
Score = 119 bits (299), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 70/194 (36%), Positives = 102/194 (52%), Gaps = 14/194 (7%)
Query: 60 RLEDDGIFCSCT-------ASP-GSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQN 110
R +D+ + C C A P GS DC + C G C+ C N+ F
Sbjct: 139 REQDEMMVCDCVYDRDDPHAEPCGSQS----DCINRAIFIECLPGECRAREQCQNQRFGK 194
Query: 111 RPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYL 170
R + +V TE G G+ A +DI G + EY+GEVI + T +R+ + G +FY
Sbjct: 195 RQYAPIDVVLTEMKGFGLRAADDIPEGSLIYEYIGEVIKETTFRKRMAQYADEGIKHFYF 254
Query: 171 CEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
+ ++ IDAT KG R+ NHSC PN+E+QKW++ R+GIFA RDI K E +T++Y
Sbjct: 255 MMLQKEEYIDATKKGGIGRFANHSCNPNSEVQKWVVGRRLRMGIFAKRDIIKDEEITFNY 314
Query: 231 QYE-FLHDSLIAYC 243
+ + HD+ YC
Sbjct: 315 NVDRYGHDAQPCYC 328
>gi|363756006|ref|XP_003648219.1| hypothetical protein Ecym_8108 [Eremothecium cymbalariae
DBVPG#7215]
gi|356891419|gb|AET41402.1| Hypothetical protein Ecym_8108 [Eremothecium cymbalariae
DBVPG#7215]
Length = 682
Score = 119 bits (299), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 68/193 (35%), Positives = 104/193 (53%), Gaps = 6/193 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTAS--PGSSGVCDR--DCHCGMLLSSCSSG--C 97
++ ++++ Y KR+ + + C C G + C DC + L C +
Sbjct: 34 FIELEKSTYAHKRLGNSPSHEFMECDCFEEYRDGKNHACGETSDCINRLTLIECVNELCT 93
Query: 98 KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
CG++C N+ FQ R + + QTEK G G+ A++DI+ EF+ EY+GEVI + ER+
Sbjct: 94 SCGDNCQNQRFQGRQYADIAVFQTEKKGYGVRAEKDIEANEFIYEYIGEVISESEFRERM 153
Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
G +FY + IDAT KG +R+ NHSC PN + KW + G+ ++GIFA
Sbjct: 154 VDYDVRGYKHFYFMMLQTGEFIDATEKGCLARFCNHSCNPNAYVSKWDVAGKLKMGIFAN 213
Query: 218 RDIKKGENLTYDY 230
R I KGE +T+DY
Sbjct: 214 RKIFKGEEITFDY 226
>gi|85086731|ref|XP_957740.1| histone H3 lysine 36 (K36) methyltransferase [Neurospora crassa
OR74A]
gi|74614418|sp|Q7RZU4.1|SET2_NEUCR RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|28918835|gb|EAA28504.1| histone H3 lysine 36 (K36) methyltransferase [Neurospora crassa
OR74A]
Length = 954
Score = 119 bits (299), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 62/165 (37%), Positives = 93/165 (56%), Gaps = 2/165 (1%)
Query: 81 DRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
D DC C G C CG+ C N+ FQ + + +++TEK G G+ A+ D++ +F
Sbjct: 148 DSDCINRATKMECVDGDCNCGSGCQNQRFQRKQYADVSVIKTEKKGFGLRANTDLQVNDF 207
Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
+ EY+GEVI++ T R+ K G +FY + + +DAT KGN R+ NHSC PN
Sbjct: 208 IFEYIGEVINEPTFRSRMVKYDKEGIKHFYFMSLTKSEFVDATKKGNLGRFCNHSCDPNC 267
Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R IK GE L ++Y + + D YC
Sbjct: 268 YVDKWVVGDKLRMGIFAGRAIKAGEELVFNYNVDRYGADPQPCYC 312
>gi|336374580|gb|EGO02917.1| hypothetical protein SERLA73DRAFT_47203 [Serpula lacrymans var.
lacrymans S7.3]
Length = 776
Score = 119 bits (299), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 71/208 (34%), Positives = 104/208 (50%), Gaps = 8/208 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLED-DGIFCSCTASPG---SSGVCDRDCHCGMLLSSCS---SG 96
+ I N Y + R E + + C C PG S C C L+
Sbjct: 50 FTEIHENHYQYGTLGRSREALESMTCDCQFEPGVDRPSDACGHGSDCINRLTQVECLPDD 109
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C+C + C N+ FQ R + +VQTEK G G+ A D+ R F+ EYVG+V+ + +R
Sbjct: 110 CRCRSFCQNQRFQRREYANINIVQTEKKGYGLRAAADLHRDAFIYEYVGDVVSHPSFVKR 169
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + G +FY + +D IDAT +G R+ NHSC PN + KW I R+GIFA
Sbjct: 170 MREYGEQGIRHFYFMMLQKDEFIDATKRGGMGRFANHSCNPNCYVAKWTIGNHVRMGIFA 229
Query: 217 TRDIKKGENLTYDYQYE-FLHDSLIAYC 243
R+I K E LT++Y + + H++ YC
Sbjct: 230 NRNIVKDEELTFNYNVDRYGHEAQPCYC 257
>gi|336469811|gb|EGO57973.1| histone-lysine N-methyltransferase, H3 lysine-36 specific
[Neurospora tetrasperma FGSC 2508]
gi|350290514|gb|EGZ71728.1| histone-lysine N-methyltransferase, H3 lysine-36 specific, partial
[Neurospora tetrasperma FGSC 2509]
Length = 957
Score = 119 bits (299), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 62/165 (37%), Positives = 93/165 (56%), Gaps = 2/165 (1%)
Query: 81 DRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
D DC C G C CG+ C N+ FQ + + +++TEK G G+ A+ D++ +F
Sbjct: 148 DSDCINRATKMECVDGDCNCGSGCQNQRFQRKQYADVSVIKTEKKGFGLRANTDLQVNDF 207
Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
+ EY+GEVI++ T R+ K G +FY + + +DAT KGN R+ NHSC PN
Sbjct: 208 IFEYIGEVINEPTFRSRMVKYDKEGIKHFYFMSLTKSEFVDATKKGNLGRFCNHSCDPNC 267
Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R IK GE L ++Y + + D YC
Sbjct: 268 YVDKWVVGDKLRMGIFAGRAIKAGEELVFNYNVDRYGADPQPCYC 312
>gi|213406581|ref|XP_002174062.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
yFS275]
gi|212002109|gb|EEB07769.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
yFS275]
Length = 779
Score = 119 bits (299), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 64/185 (34%), Positives = 101/185 (54%), Gaps = 5/185 (2%)
Query: 51 IYLTKRIKRRLEDDGIFCSCTAS--PGSSGVCDRDCHCGMLLSS---CSSGCKCGNSCLN 105
+Y + +++ + C C G + C +C ++S C CG SC N
Sbjct: 90 LYANAHLGEPQQNEAMACDCKPEWVDGVNIACGHGSYCINRMTSIECTDENCYCGPSCQN 149
Query: 106 KPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGE 165
+ FQ + + ++QTEK G G+ A+ + +G FV EY+GEVI + +R+ + G
Sbjct: 150 QRFQKKMYADVDVIQTEKKGFGLRANSYLTKGTFVYEYIGEVIPEVRFRKRMREYDERGI 209
Query: 166 TNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGEN 225
+FY + + IDAT KG+ +R+ NHSC PN + KW++ + R+GIF RDI+KGE
Sbjct: 210 RHFYFMMLQKGEYIDATVKGSLARFCNHSCRPNCYVDKWVVGNKLRMGIFCKRDIQKGEE 269
Query: 226 LTYDY 230
LT+DY
Sbjct: 270 LTFDY 274
>gi|170097553|ref|XP_001879996.1| histone methyltransferase [Laccaria bicolor S238N-H82]
gi|164645399|gb|EDR09647.1| histone methyltransferase [Laccaria bicolor S238N-H82]
Length = 880
Score = 119 bits (299), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 60/148 (40%), Positives = 88/148 (59%), Gaps = 1/148 (0%)
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C+C + C N+ FQ R + +V TEK G G+ A+ED+ R F+ EYVG+V++ + ++R
Sbjct: 204 CRCRSYCQNQRFQKREYANIDIVLTEKKGFGLRAEEDLPRDTFIYEYVGDVVNPTSFKKR 263
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ G +FY + +D IDAT G R+ NHSC PN + KW I R+GIFA
Sbjct: 264 MRDYALEGIQHFYFMMLQKDEFIDATKGGGIGRFANHSCSPNCYVAKWTIGHHVRMGIFA 323
Query: 217 TRDIKKGENLTYDYQYE-FLHDSLIAYC 243
RDIK+ E LT++Y + + H + I YC
Sbjct: 324 KRDIKQYEELTFNYNVDRYGHQAQICYC 351
>gi|119498557|ref|XP_001266036.1| SET and WW domain protein [Neosartorya fischeri NRRL 181]
gi|119414200|gb|EAW24139.1| SET and WW domain protein [Neosartorya fischeri NRRL 181]
Length = 967
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 91/164 (55%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG C N+ FQ + + +++TEK G G+ A+ D++ +F+
Sbjct: 181 DSDCINRATKIECVGDCSCGAECQNQRFQRKEYANVAVIKTEKKGFGLRAETDLRPHQFI 240
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EYVGEVI++ R+ + G +FY ++R +DAT KGN R+ NHSC PN
Sbjct: 241 FEYVGEVINEAQFRRRMRQYDEEGIKHFYFMSLSRGEFVDATKKGNLGRFCNHSCNPNCY 300
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R I+ GE L ++Y + + D YC
Sbjct: 301 VDKWVVGEKLRMGIFAERTIQAGEELVFNYNVDRYGADPQPCYC 344
>gi|452985541|gb|EME85297.1| hypothetical protein MYCFIDRAFT_83386 [Pseudocercospora fijiensis
CIRAD86]
Length = 835
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 56/150 (37%), Positives = 85/150 (56%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG +C N+ FQ + + ++QTEK G G+ A+ D++ +F+
Sbjct: 96 DSDCINRATKMECVGDCNCGIACQNQRFQRKQYANVSVIQTEKKGYGLRANTDLRANDFI 155
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GEVI + ++ + G +FY + + +DAT KGN R+ NHSC PN
Sbjct: 156 FEYIGEVIGENVFRRKMQQYDEQGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCY 215
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDY 230
+ KW++ + R+GIFA R IK GE L ++Y
Sbjct: 216 VDKWVVGDKLRMGIFAERKIKAGEELVFNY 245
>gi|195020546|ref|XP_001985216.1| GH14625 [Drosophila grimshawi]
gi|193898698|gb|EDV97564.1| GH14625 [Drosophila grimshawi]
Length = 2406
Score = 119 bits (298), Expect = 1e-24, Method: Composition-based stats.
Identities = 60/173 (34%), Positives = 101/173 (58%), Gaps = 5/173 (2%)
Query: 69 SCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKK--MKLVQTEKCG 125
+C P ++G C +C M+ + C+ S C G+ C N+ Q V + + T+K G
Sbjct: 1519 NCNCKPQANGACLDNCLNRMVYTECAPSNCPAGDKCRNQKIQRHEVAPGVERFMTTDK-G 1577
Query: 126 AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKG 185
G+ I +G +++EYVGEV+ ++ ++R+ + +L +T+ Y ++ +VID G
Sbjct: 1578 WGVRTKLPIAKGTYILEYVGEVVTEREFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMG 1636
Query: 186 NKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+ R++NHSC PN EMQKW ++G +R+ +FA R I+ GE LTYDY + + S
Sbjct: 1637 SDCRFVNHSCEPNCEMQKWSVNGLSRMVLFAKRAIEDGEELTYDYNFSLFNPS 1689
>gi|343427717|emb|CBQ71244.1| related to SET2-Histone methyltransferase [Sporisorium reilianum
SRZ2]
Length = 971
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 68/177 (38%), Positives = 99/177 (55%), Gaps = 8/177 (4%)
Query: 63 DDGIFCSCTASPGSSGV---CDRDCHC--GMLLSSCSSG-CKCGNSCLNKPFQNRPVKKM 116
DD + C C +P + + C + C M CSS C+ G C N+ F R +
Sbjct: 184 DDFMICDC--NPNTENLDMACTDESGCINRMTQIECSSSKCRWGKQCRNQRFHRRQYVDV 241
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+VQTEK G G+ A +DI + FV EYVGEV++ T +R+ + + G +FY + +
Sbjct: 242 DIVQTEKKGFGLRAAQDIPKESFVYEYVGEVMNQPTFLDRMQQYRVEGIRHFYFMMLQPN 301
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
+DAT KG K R+INHSC PN + KW + R+GIFA R+I +GE LT++Y +
Sbjct: 302 EYLDATKKGGKGRFINHSCNPNCSVSKWQVGKHIRMGIFAKRNILQGEELTFNYNVD 358
>gi|91077840|ref|XP_971447.1| PREDICTED: similar to set domain protein [Tribolium castaneum]
Length = 1549
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 66/195 (33%), Positives = 104/195 (53%), Gaps = 10/195 (5%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSC 103
Y I+ N+Y K E C+ A+ C DC ++L+ C + KC N
Sbjct: 754 YRKIRTNVYNVKTTTGACEPQTCNCTPLAN------CGDDCINRLVLAECPASHKCQNQK 807
Query: 104 LNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHL 163
+ + + ++ TE G G+ IK GEF++EYVGEV+ DQ +ER+ + ++
Sbjct: 808 IQR---HEWAPGLEKFMTENKGWGVRTKLPIKSGEFILEYVGEVVSDQEFKERMATI-YV 863
Query: 164 GETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKG 223
+T+ Y ++ +VID G R++NHSC PN EMQKW ++G+ R+ +FA RDI+
Sbjct: 864 NDTHHYCLHLDGGLVIDGHRMGGDGRFVNHSCQPNCEMQKWSVNGQFRMALFALRDIESS 923
Query: 224 ENLTYDYQYEFLHDS 238
E LTYDY + + +
Sbjct: 924 EELTYDYNFSLFNPA 938
>gi|70998620|ref|XP_754032.1| SET and WW domain protein [Aspergillus fumigatus Af293]
gi|74672926|sp|Q4WTT2.1|SET2_ASPFU RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|66851668|gb|EAL91994.1| SET and WW domain protein [Aspergillus fumigatus Af293]
Length = 966
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 91/164 (55%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG C N+ FQ + + +++TEK G G+ A+ D++ +F+
Sbjct: 181 DSDCINRATKIECVGDCSCGAECQNQRFQRKEYANVAVIKTEKKGFGLRAETDLRPHQFI 240
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EYVGEVI++ R+ + G +FY ++R +DAT KGN R+ NHSC PN
Sbjct: 241 FEYVGEVINEAQFRRRMRQYDEEGIKHFYFMSLSRGEFVDATKKGNLGRFCNHSCNPNCY 300
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R I+ GE L ++Y + + D YC
Sbjct: 301 VDKWVVGEKLRMGIFAERAIQAGEELVFNYNVDRYGADPQPCYC 344
>gi|358253063|dbj|GAA51760.1| histone-lysine N-methyltransferase NSD1/2 [Clonorchis sinensis]
Length = 1596
Score = 119 bits (298), Expect = 1e-24, Method: Composition-based stats.
Identities = 75/210 (35%), Positives = 111/210 (52%), Gaps = 23/210 (10%)
Query: 33 PDWFIKPKAIPYVFIKRNIYLTKRIKRRLED--DGIFCSCTASPG------SSGVCDRDC 84
PD++ KP + + IY RL D + C C + G SSG +R+
Sbjct: 971 PDYY-KPIKVNWPLGSVRIY-------RLTDPSEAPRCECKPNSGGEPCGPSSGCINREL 1022
Query: 85 HCGMLLSSCSSGCKCGNSCLNKPFQNR--PVKKMKLVQTEKCGAGIVADEDIKRGEFVIE 142
H L S C +G ++C N+ F R P ++ +E+ G G+ IK G FV E
Sbjct: 1023 HYECLPSVCPNG----DACQNQRFTKRLYPRQRPFWTGSER-GWGLKTLVPIKAGSFVNE 1077
Query: 143 YVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQ 202
Y+G++ID++ RL TN+Y+ +++ +IDA KGN SR++NH C PN Q
Sbjct: 1078 YIGDLIDEEEANRRLRFAHENNVTNYYMMKLDAQRIIDAGPKGNLSRFMNHCCDPNLNTQ 1137
Query: 203 KWIIDGETRIGIFATRDIKKGENLTYDYQY 232
KW ++G+ RIG+FA RDI GE LT+DY +
Sbjct: 1138 KWTVNGDNRIGLFAVRDIAAGEELTFDYNF 1167
>gi|146419774|ref|XP_001485847.1| hypothetical protein PGUG_01518 [Meyerozyma guilliermondii ATCC
6260]
Length = 707
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 62/189 (32%), Positives = 103/189 (54%), Gaps = 11/189 (5%)
Query: 51 IYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCG---------MLLSSCSSGCKCGN 101
IY K++ +++ + C C + R+ CG + + C CG+
Sbjct: 38 IYANKQLGHSGQNEQMTCDCHEKWDQA--TQRNLACGEHSECINRATSVECVNKSCGCGD 95
Query: 102 SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
C N+ FQ + + ++QTE G G+ A+E+I F+ EYVGEVI++++ +R+ +
Sbjct: 96 DCQNQRFQKKEYANVSVIQTELKGYGLRANEEINESGFIYEYVGEVINEESFRKRMVEYD 155
Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
+FY + +D IDAT KG+ +R+ NHSC PN + KW++ + R+GIFA R I+
Sbjct: 156 EKKFPHFYFMMLKKDSFIDATIKGSLARFCNHSCSPNAYVDKWVVGDKLRMGIFAKRLIQ 215
Query: 222 KGENLTYDY 230
GE +T+DY
Sbjct: 216 AGEEITFDY 224
>gi|159126234|gb|EDP51350.1| SET and WW domain protein [Aspergillus fumigatus A1163]
Length = 966
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 91/164 (55%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG C N+ FQ + + +++TEK G G+ A+ D++ +F+
Sbjct: 181 DSDCINRATKIECVGDCSCGPECQNQRFQRKEYANVAVIKTEKKGFGLRAETDLRPHQFI 240
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EYVGEVI++ R+ + G +FY ++R +DAT KGN R+ NHSC PN
Sbjct: 241 FEYVGEVINEAQFRRRMRQYDEEGIKHFYFMSLSRGEFVDATKKGNLGRFCNHSCNPNCY 300
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R I+ GE L ++Y + + D YC
Sbjct: 301 VDKWVVGEKLRMGIFAERAIQAGEELVFNYNVDRYGADPQPCYC 344
>gi|270001477|gb|EEZ97924.1| hypothetical protein TcasGA2_TC000311 [Tribolium castaneum]
Length = 1647
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 66/195 (33%), Positives = 104/195 (53%), Gaps = 10/195 (5%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSC 103
Y I+ N+Y K E C+ A+ C DC ++L+ C + KC N
Sbjct: 852 YRKIRTNVYNVKTTTGACEPQTCNCTPLAN------CGDDCINRLVLAECPASHKCQNQK 905
Query: 104 LNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHL 163
+ + + ++ TE G G+ IK GEF++EYVGEV+ DQ +ER+ + ++
Sbjct: 906 IQR---HEWAPGLEKFMTENKGWGVRTKLPIKSGEFILEYVGEVVSDQEFKERMATI-YV 961
Query: 164 GETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKG 223
+T+ Y ++ +VID G R++NHSC PN EMQKW ++G+ R+ +FA RDI+
Sbjct: 962 NDTHHYCLHLDGGLVIDGHRMGGDGRFVNHSCQPNCEMQKWSVNGQFRMALFALRDIESS 1021
Query: 224 ENLTYDYQYEFLHDS 238
E LTYDY + + +
Sbjct: 1022 EELTYDYNFSLFNPA 1036
>gi|427798749|gb|JAA64826.1| Putative histone-lysine n-methyltransferase nsd2, partial
[Rhipicephalus pulchellus]
Length = 755
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 68/152 (44%), Positives = 93/152 (61%), Gaps = 3/152 (1%)
Query: 70 CTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGI 128
CT +P C +C +LL CS C G C N+ FQ R + LV+T G G+
Sbjct: 604 CTCTPQDP--CRANCLNRLLLYECSRDLCPAGPDCENQRFQKRTYARTSLVRTPGRGWGL 661
Query: 129 VADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKS 188
+ + + G+FV+EYVGE+ID++ CE RL ++ NFY + +D +IDA KGN S
Sbjct: 662 LTSQALAAGDFVMEYVGELIDEEECERRLEELHMENIQNFYFLTLEKDRIIDAGPKGNMS 721
Query: 189 RYINHSCCPNTEMQKWIIDGETRIGIFATRDI 220
R+INHSC PN E QKW ++G+TR+GIFA RDI
Sbjct: 722 RFINHSCEPNCETQKWTVNGDTRVGIFAIRDI 753
>gi|254586457|ref|XP_002498796.1| ZYRO0G18766p [Zygosaccharomyces rouxii]
gi|238941690|emb|CAR29863.1| ZYRO0G18766p [Zygosaccharomyces rouxii]
Length = 680
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 66/190 (34%), Positives = 101/190 (53%), Gaps = 6/190 (3%)
Query: 47 IKRNIYLTKRIKRRLEDDGIFCSCTAS--PGSSGVCDR--DCHCGMLLSSCSSGC--KCG 100
++ N Y KR+ ++ + C C G + CD DC + L C +G CG
Sbjct: 37 LEENHYANKRLGNSKHNEFMECDCYEEFEDGINRACDENSDCINRLTLIECVNGLCGSCG 96
Query: 101 NSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKM 160
C N+ FQ + + + QT+ G G+ A DI+ +F+ EY+G+VI + +RL
Sbjct: 97 EDCQNQRFQRKQYADIAVFQTKLKGYGVRAQSDIEPHQFIYEYMGDVIPEDVFRDRLVDY 156
Query: 161 KHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDI 220
G +FY + IDAT KG+ +R+ NHSC PN + KW++ G+ ++GIFA R I
Sbjct: 157 DQKGYQHFYFMMLQNGEFIDATVKGSLARFCNHSCNPNAYVNKWVVAGKLKMGIFAHRKI 216
Query: 221 KKGENLTYDY 230
KGE +T+DY
Sbjct: 217 LKGEEITFDY 226
>gi|190345522|gb|EDK37420.2| hypothetical protein PGUG_01518 [Meyerozyma guilliermondii ATCC
6260]
Length = 707
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 62/187 (33%), Positives = 103/187 (55%), Gaps = 7/187 (3%)
Query: 51 IYLTKRIKRRLEDDGIFCSC----TASPGSSGVCDRDCHCGMLLSSC---SSGCKCGNSC 103
IY K++ +++ + C C + + C C +S + C CG+ C
Sbjct: 38 IYANKQLGHSGQNEQMTCDCHEKWDQATQRNLACGEHSECINRATSVECVNKSCGCGDDC 97
Query: 104 LNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHL 163
N+ FQ + + ++QTE G G+ A+E+I F+ EYVGEVI++++ +R+ +
Sbjct: 98 QNQRFQKKEYANVSVIQTELKGYGLRANEEINESGFIYEYVGEVINEESFRKRMVEYDEK 157
Query: 164 GETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKG 223
+FY + +D IDAT KG+ +R+ NHSC PN + KW++ + R+GIFA R I+ G
Sbjct: 158 KFPHFYFMMLKKDSFIDATIKGSLARFCNHSCSPNAYVDKWVVGDKLRMGIFAKRLIQAG 217
Query: 224 ENLTYDY 230
E +T+DY
Sbjct: 218 EEITFDY 224
>gi|345781638|ref|XP_003432154.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Canis
lupus familiaris]
Length = 742
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 75/228 (32%), Positives = 109/228 (47%), Gaps = 33/228 (14%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG- 96
K PY IK N + K ++ + D C P C + C ML C
Sbjct: 410 KPPPYKHIKANKVIGK-VQIHVADLSEIPRCNCKPADENPCGLESECLNRMLQYECHPQV 468
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGI---------------------------- 128
C G C N+ F R ++++TE+ G G+
Sbjct: 469 CPAGERCQNQCFTKRLYPDAEIIKTERRGWGLRTKRSIKKVAWGTTLGIWKNSVKQDTFV 528
Query: 129 -VADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNK 187
+ + D K+GEFV EYVGE+ID++ C R+ + TNFY+ + +D +IDA KGN
Sbjct: 529 SLLETDNKQGEFVNEYVGELIDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNY 588
Query: 188 SRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
SR++NHSC PN E QKW ++G+ R+G+FA DI G LT++Y + L
Sbjct: 589 SRFMNHSCNPNCETQKWTVNGDIRVGLFALCDIPAGMELTFNYNLDCL 636
>gi|380492750|emb|CCF34378.1| SET domain-containing protein [Colletotrichum higginsianum]
Length = 951
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 65/188 (34%), Positives = 104/188 (55%), Gaps = 6/188 (3%)
Query: 62 EDDGIFCSCTAS--PGSSGVC--DRDCHCGMLLSSCSSGC-KCGNSCLNKPFQNRPVKKM 116
++D + C C + G + C D DC C +G CG+ C N+ FQ + +
Sbjct: 119 DNDALDCDCRSEWHDGKNLSCGEDSDCINRATKMECVAGAGNCGDGCQNQRFQRKEYANV 178
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+++TEK G G+ AD D++ +F+ EY+GEVI++ T R+ + G +FY + +
Sbjct: 179 SVIKTEKKGFGLRADVDLQANDFIFEYIGEVINEPTFRRRMVQYDDEGIKHFYFMSLTKH 238
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
+DAT KGN R+ NHSC PN + KW++ + R+GIF++R IK GE L ++Y + +
Sbjct: 239 EFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFSSRRIKAGEELVFNYNVDRYG 298
Query: 236 HDSLIAYC 243
D YC
Sbjct: 299 ADPQPCYC 306
>gi|366998103|ref|XP_003683788.1| hypothetical protein TPHA_0A02720 [Tetrapisispora phaffii CBS 4417]
gi|357522083|emb|CCE61354.1| hypothetical protein TPHA_0A02720 [Tetrapisispora phaffii CBS 4417]
Length = 707
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 66/185 (35%), Positives = 99/185 (53%), Gaps = 6/185 (3%)
Query: 52 YLTKRIKRRLEDDGIFCSCTAS--PGSSGVCDR--DCHCGMLLSSCSSGC--KCGNSCLN 105
Y +K I ++ + C C G + C DC + L C +G CG++C N
Sbjct: 46 YASKNIGNAKNNEFMECDCYEEFEDGRNKACSEYSDCINRLTLIECVNGLCNTCGDNCEN 105
Query: 106 KPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGE 165
+ FQ + + + +TE G G+ A+ D++ +F+ EY GEVI++ ERL + G
Sbjct: 106 QKFQKKEYADISIFKTELKGYGVRANSDLEENQFIYEYKGEVIEENEFRERLIEYDQRGF 165
Query: 166 TNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGEN 225
+FY + IDAT KG +R+ NHSC PN + KW + G+ R+GIFA R I KGE
Sbjct: 166 KHFYFMMLQSGEFIDATIKGALARFCNHSCNPNAYVNKWEVAGKLRMGIFAKRRISKGEE 225
Query: 226 LTYDY 230
+T+DY
Sbjct: 226 ITFDY 230
>gi|321260843|ref|XP_003195141.1| histone-lysine N-methyltransferase [Cryptococcus gattii WM276]
gi|317461614|gb|ADV23354.1| Histone-lysine N-methyltransferase, putative [Cryptococcus gattii
WM276]
Length = 833
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 68/205 (33%), Positives = 107/205 (52%), Gaps = 11/205 (5%)
Query: 45 VFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGV----CDRDCHCGMLLSSCSSG-CKC 99
V+ +++I L+K D+ + C C + D DC L C +G C+
Sbjct: 112 VYERKDIGLSKE-----NDEMMVCECVYNRHDPDTDPCGPDSDCINRALYIECIAGECRA 166
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
G C N+ F + + +V TEK G G+ A I + EY+GEV+ ++T +R+ +
Sbjct: 167 GKHCHNQQFSKKQYANVDVVLTEKKGYGLRASSIIPANTLIYEYIGEVVAEKTFRKRMQQ 226
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
G +FY + ++ IDAT KG R+ NHSC PN E+QKW++ R+GIF RD
Sbjct: 227 YADEGIRHFYFMMLQKEEYIDATKKGGIGRFANHSCNPNCEVQKWVVGRRLRMGIFTKRD 286
Query: 220 IKKGENLTYDYQYE-FLHDSLIAYC 243
+ KGE +T++Y + + HD+ YC
Sbjct: 287 VVKGEEITFNYNVDRYGHDAQTCYC 311
>gi|405121647|gb|AFR96415.1| histone-lysine N-methyltransferase [Cryptococcus neoformans var.
grubii H99]
Length = 836
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 68/205 (33%), Positives = 106/205 (51%), Gaps = 11/205 (5%)
Query: 45 VFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGV----CDRDCHCGMLLSSCSSG-CKC 99
V+ +++I L+K D+ + C C + D DC L C G C+
Sbjct: 112 VYERKDIGLSKE-----NDEMMVCECVYNRHDPDADPCGPDSDCINRALYIECIGGECRA 166
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
G C N+ F + + +V TEK G G+ A I + EY+GEV+ ++T +R+ +
Sbjct: 167 GKHCHNQQFSKKQYANVDVVLTEKKGYGLRASSTIPANTLIYEYIGEVVAEKTFRKRMQQ 226
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
G +FY + ++ IDAT KG R+ NHSC PN E+QKW++ R+GIF RD
Sbjct: 227 YADEGIRHFYFMMLQKEEYIDATKKGGIGRFANHSCNPNCEVQKWVVGRRLRMGIFTKRD 286
Query: 220 IKKGENLTYDYQYE-FLHDSLIAYC 243
+ KGE +T++Y + + HD+ YC
Sbjct: 287 VIKGEEITFNYNVDRYGHDAQTCYC 311
>gi|336266160|ref|XP_003347849.1| hypothetical protein SMAC_06682 [Sordaria macrospora k-hell]
gi|380091782|emb|CCC10510.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 907
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 62/165 (37%), Positives = 92/165 (55%), Gaps = 2/165 (1%)
Query: 81 DRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
D DC C G C CG C N+ FQ + + +++TEK G G+ A+ D++ +F
Sbjct: 148 DSDCINRATKMECVDGDCNCGAGCQNQRFQRKQYADVSVIKTEKKGFGLRANTDLQVNDF 207
Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
+ EY+GEVI++ T R+ K G +FY + + +DAT KGN R+ NHSC PN
Sbjct: 208 IFEYIGEVINEPTFRSRMIKYDKEGIKHFYFMSLTKSEFVDATKKGNLGRFCNHSCNPNC 267
Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R IK GE L ++Y + + D YC
Sbjct: 268 YVDKWVVGDKLRMGIFAGRAIKAGEELVFNYNVDRYGADPQPCYC 312
>gi|134083984|emb|CAK49139.1| unnamed protein product [Aspergillus niger]
Length = 734
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 91/164 (55%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG C N+ FQ R + +++TEK G G+ A+ D++ +F+
Sbjct: 187 DSDCINRATKIECMGDCGCGPECQNQRFQRREYAPVAVIKTEKKGFGLRAEADLRPHQFI 246
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EYVGEVI++ R+ + G +FY +++ +DAT KGN R+ NHSC PN
Sbjct: 247 FEYVGEVINEGQFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCY 306
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R I+ GE L ++Y + + D YC
Sbjct: 307 VDKWVVGEKLRMGIFAERHIQAGEELVFNYNVDRYGADPQPCYC 350
>gi|121806929|sp|Q2UTN6.1|SET2_ASPOR RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|83764935|dbj|BAE55079.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 965
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 59/164 (35%), Positives = 93/164 (56%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG C N+ FQ + ++ +++TEK G G+ A+ D++ +F+
Sbjct: 192 DSDCINRATKIECVGDCGCGPDCQNQRFQRKEYAQVAVIKTEKKGFGLRAEADLRPHQFI 251
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EYVGEVI++ R+ + G +FY +++ +DAT +GN R+ NHSC PN
Sbjct: 252 YEYVGEVINEGQFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKRGNLGRFCNHSCNPNCY 311
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA RDI+ GE L ++Y + + D YC
Sbjct: 312 VDKWVVGEKLRMGIFAERDIQAGEELVFNYNVDRYGADPQPCYC 355
>gi|389633989|ref|XP_003714647.1| histone-lysine N-methyltransferase [Magnaporthe oryzae 70-15]
gi|351646980|gb|EHA54840.1| histone-lysine N-methyltransferase [Magnaporthe oryzae 70-15]
gi|440474559|gb|ELQ43296.1| histone-lysine N-methyltransferase [Magnaporthe oryzae Y34]
gi|440479738|gb|ELQ60486.1| histone-lysine N-methyltransferase [Magnaporthe oryzae P131]
Length = 946
Score = 119 bits (297), Expect = 2e-24, Method: Composition-based stats.
Identities = 59/153 (38%), Positives = 87/153 (56%), Gaps = 1/153 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC + C SG CG+ C N+ FQ + + +++TE G G+ AD +++ +FV
Sbjct: 145 DSDCINRVTKIECVSG-NCGDGCQNQRFQRKQYANVSVIKTENKGYGLRADANLEPNDFV 203
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GEVI ++ RL K +FY + R +DAT KGN R+ NHSC PN
Sbjct: 204 FEYIGEVIGEELFRSRLMKYDTQRLEHFYFMSLTRTEYVDATKKGNLGRFCNHSCNPNCY 263
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
+ KW++ + R+GIFA R IK GE L ++Y +
Sbjct: 264 VDKWVVGDKLRMGIFAMRAIKAGEELCFNYNVD 296
>gi|317455359|pdb|3OPE|A Chain A, Structural Basis Of Auto-Inhibitory Mechanism Of Histone
Methyltransferase
gi|317455360|pdb|3OPE|B Chain B, Structural Basis Of Auto-Inhibitory Mechanism Of Histone
Methyltransferase
Length = 222
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 69/191 (36%), Positives = 105/191 (54%), Gaps = 4/191 (2%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS 102
Y I+ N+Y+ + E C G D DC M+ + CS + C CG
Sbjct: 3 YKKIRSNVYVDVKPLSGYEATTCNCKKPDDDTRKGCVD-DCLNRMIFAECSPNTCPCGEQ 61
Query: 103 CLNKPFQNRP-VKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
C N+ Q V+ ++ + E+ G GI E +K G+F+IEY+GEV+ +Q R+ +
Sbjct: 62 CCNQRIQRHEWVQCLERFRAEEKGWGIRTKEPLKAGQFIIEYLGEVVSEQEFRNRMIEQY 121
Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
H ++ Y ++ MVID+ GN++R+INHSC PN EMQKW ++G RIG++A +D+
Sbjct: 122 H-NHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGLYALKDMP 180
Query: 222 KGENLTYDYQY 232
G LTYDY +
Sbjct: 181 AGTELTYDYNF 191
>gi|238503484|ref|XP_002382975.1| SET and WW domain protein [Aspergillus flavus NRRL3357]
gi|220690446|gb|EED46795.1| SET and WW domain protein [Aspergillus flavus NRRL3357]
gi|391863367|gb|EIT72678.1| clathrin coat binding protein [Aspergillus oryzae 3.042]
Length = 947
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 59/164 (35%), Positives = 93/164 (56%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG C N+ FQ + ++ +++TEK G G+ A+ D++ +F+
Sbjct: 174 DSDCINRATKIECVGDCGCGPDCQNQRFQRKEYAQVAVIKTEKKGFGLRAEADLRPHQFI 233
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EYVGEVI++ R+ + G +FY +++ +DAT +GN R+ NHSC PN
Sbjct: 234 YEYVGEVINEGQFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKRGNLGRFCNHSCNPNCY 293
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA RDI+ GE L ++Y + + D YC
Sbjct: 294 VDKWVVGEKLRMGIFAERDIQAGEELVFNYNVDRYGADPQPCYC 337
>gi|121712644|ref|XP_001273933.1| SET and WW domain protein [Aspergillus clavatus NRRL 1]
gi|119402086|gb|EAW12507.1| SET and WW domain protein [Aspergillus clavatus NRRL 1]
Length = 968
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/174 (36%), Positives = 96/174 (55%), Gaps = 3/174 (1%)
Query: 73 SPGSSGVC--DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVA 130
S G + C D DC C C CG +C N+ FQ R + +++TEK G G+ A
Sbjct: 171 SAGKNLACGEDSDCINRATKIECVGDCSCGANCQNQRFQRREYANVAVIKTEKKGFGLRA 230
Query: 131 DEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRY 190
+ D++ +F+ EYVGEVI++ R+ + G +FY +++ +DAT KGN R+
Sbjct: 231 ETDLQPHQFIFEYVGEVINEGHFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKKGNLGRF 290
Query: 191 INHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
NHSC PN + KW++ + R+GIFA R I+ GE L ++Y + + D YC
Sbjct: 291 CNHSCNPNCYVDKWVVGEKLRMGIFAERAIRAGEELVFNYNVDRYGADPQPCYC 344
>gi|410080658|ref|XP_003957909.1| hypothetical protein KAFR_0F01770 [Kazachstania africana CBS 2517]
gi|372464496|emb|CCF58774.1| hypothetical protein KAFR_0F01770 [Kazachstania africana CBS 2517]
Length = 711
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 53/132 (40%), Positives = 83/132 (62%)
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
CG +C N+ FQ + + + QTE G G+ + D++ EF+ EY+GEVID+++ +RL
Sbjct: 99 CGENCQNQRFQKKQYADISVFQTEMKGYGVRCNTDLEANEFIYEYIGEVIDEESFRDRLI 158
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
K + +FY + IDAT KG R+ NHSC PN + KW+++G+ ++GIF+ R
Sbjct: 159 KYDEMHFKHFYFMMLQNGQFIDATRKGALGRFCNHSCNPNAYVNKWVVNGKLKMGIFSKR 218
Query: 219 DIKKGENLTYDY 230
I+KGE +T+DY
Sbjct: 219 KIQKGEEITFDY 230
>gi|400596232|gb|EJP64008.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 900
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 61/188 (32%), Positives = 97/188 (51%), Gaps = 6/188 (3%)
Query: 62 EDDGIFCSCTA--SPGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
E+D C C G + C D C + + CG C N+ FQ + +
Sbjct: 121 ENDAFDCDCREEWQDGVNMACGEDSDCINRATKMECSETAGNCGGGCQNQRFQRKHYADV 180
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+++T+K G G+ D D+ +F+ EY+GEVI++ T R+ + G +FY +N++
Sbjct: 181 SVIKTDKKGYGLRTDADLAANDFIFEYIGEVINEATFRRRMLQYDQEGIKHFYFMSLNKN 240
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
+DAT KGN R+ NHSC PN + KW++ + R+GIFA R + GE L ++Y + +
Sbjct: 241 EFVDATRKGNLGRFCNHSCVPNCYVDKWVVGDKLRMGIFALRAVSAGEELVFNYNVDRYG 300
Query: 236 HDSLIAYC 243
D YC
Sbjct: 301 ADPQPCYC 308
>gi|453087448|gb|EMF15489.1| hypothetical protein SEPMUDRAFT_161660 [Mycosphaerella populorum
SO2202]
Length = 966
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 59/164 (35%), Positives = 92/164 (56%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG C N+ FQ++ + +++TEK G G+ A+ +++ +F+
Sbjct: 173 DTDCINRATKMECVGDCGCGAGCQNRRFQHKQYANVSVIKTEKKGYGLRANTELQANDFI 232
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GEVI ++T RL + G +FY +++ +DAT KGN R+ NHSC PN
Sbjct: 233 FEYIGEVIGEKTFRNRLHQYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCY 292
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R I GE L ++Y + + D YC
Sbjct: 293 VDKWVVGDKLRMGIFAERKIHAGEELVFNYNVDRYGADPQPCYC 336
>gi|403419215|emb|CCM05915.1| predicted protein [Fibroporia radiculosa]
Length = 861
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 74/224 (33%), Positives = 112/224 (50%), Gaps = 19/224 (8%)
Query: 39 PKAIPYVF-----IKRNIYLTKRIKRRLED-DGIFCSCTASPGSSGVCD-----RDCHCG 87
P+A P I +N Y + R E + + C C PG D DC
Sbjct: 111 PRAEPQALLTFEEIPQNHYQYGTLGRSRESLESMTCDCQYDPGMDDPDDACGHGSDCINR 170
Query: 88 MLLSSC-SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGE------FV 140
+ C C+C + C N+ FQ + +++VQTEK G G+ A +D+ +G+ F+
Sbjct: 171 LTQVECLPDDCRCRSYCQNQRFQRKEYAPIEIVQTEKKGFGLRAAQDLHKGQPYYRDAFI 230
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+G+V+ + +R+ G +FY + +D IDAT +G R+ NHSC PN
Sbjct: 231 YEYLGDVVSQPSFLKRMRLYAEEGIRHFYFMMLQKDEFIDATKRGGIGRFANHSCNPNCY 290
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW + R+GIFA RDIKK E LT++Y + + HD+ YC
Sbjct: 291 VAKWTVGHHVRMGIFANRDIKKDEELTFNYNVDRYGHDAQPCYC 334
>gi|145479919|ref|XP_001425982.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124393054|emb|CAK58584.1| unnamed protein product [Paramecium tetraurelia]
Length = 1384
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 77/228 (33%), Positives = 110/228 (48%), Gaps = 18/228 (7%)
Query: 25 GNPVEFE-LPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCT-----------A 72
GNP + L D F + +K+N YL ++ R D+ + CSC
Sbjct: 32 GNPARLQGLKDSFK-----YFTLLKKNQYLERKQYRVSRDETLVCSCIMCPEDQIQSRPQ 86
Query: 73 SPGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVAD 131
P S C C + C C C C N+ FQ + ++ G G+ A
Sbjct: 87 GPQYSYNCGERCLNRFTCTECDVELCPCAEQCKNRRFQKHDDACVYPLRCGGKGMGLFAG 146
Query: 132 EDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYI 191
E I +G+F+++YVGE+ + R ++ T YL ++N VID T KGN +R+I
Sbjct: 147 ERILKGQFIMQYVGEIFQINSAFGRRRVQEYSKSTCTYLMKLNNQEVIDPTSKGNLARFI 206
Query: 192 NHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSL 239
NHSC PN +KW + GE IGIFA RDI + E LT+DYQ++ H L
Sbjct: 207 NHSCEPNCITEKWNVLGEVCIGIFAIRDINEDEELTFDYQFDVFHTPL 254
>gi|317138698|ref|XP_001817081.2| histone-lysine N-methyltransferase, H3 lysine-36 specific
[Aspergillus oryzae RIB40]
Length = 849
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 59/164 (35%), Positives = 93/164 (56%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG C N+ FQ + ++ +++TEK G G+ A+ D++ +F+
Sbjct: 174 DSDCINRATKIECVGDCGCGPDCQNQRFQRKEYAQVAVIKTEKKGFGLRAEADLRPHQFI 233
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EYVGEVI++ R+ + G +FY +++ +DAT +GN R+ NHSC PN
Sbjct: 234 YEYVGEVINEGQFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKRGNLGRFCNHSCNPNCY 293
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA RDI+ GE L ++Y + + D YC
Sbjct: 294 VDKWVVGEKLRMGIFAERDIQAGEELVFNYNVDRYGADPQPCYC 337
>gi|290985403|ref|XP_002675415.1| predicted protein [Naegleria gruberi]
gi|284089011|gb|EFC42671.1| predicted protein [Naegleria gruberi]
Length = 438
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 72/210 (34%), Positives = 110/210 (52%), Gaps = 17/210 (8%)
Query: 29 EFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGM 88
+ +LPD F V+I N T + + D C CT +G + C +
Sbjct: 186 KIKLPDDF--------VYIMSNERKTTKQSDKKHQD--HCHCT-----NGCIPKYCINYL 230
Query: 89 LLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADED-IKRGEFVIEYVGEV 147
C + C CG++C N+ F ++ Q++ G G+ ++D IK+G F+ EYVGEV
Sbjct: 231 SQIECPTDCNCGDNCQNRVFTKMAYPELICFQSDTKGIGVKCNQDVIKKGTFITEYVGEV 290
Query: 148 IDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIID 207
I E R K + + Y +N + +IDAT+ GN +R+INHSC PN Q W ++
Sbjct: 291 ISVDKFETRT-KRSYKKSLHHYCMNMNENEIIDATWMGNIARFINHSCAPNARTQTWDVN 349
Query: 208 GETRIGIFATRDIKKGENLTYDYQYEFLHD 237
G+ R+GIFA +DI KGE +TY+Y + +D
Sbjct: 350 GQNRVGIFAIKDIVKGEEITYNYNFLIYND 379
>gi|118348872|ref|XP_001007909.1| SET domain containing protein [Tetrahymena thermophila]
gi|89289676|gb|EAR87664.1| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 3527
Score = 118 bits (295), Expect = 2e-24, Method: Composition-based stats.
Identities = 63/155 (40%), Positives = 89/155 (57%), Gaps = 1/155 (0%)
Query: 80 CDRDC-HCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGE 138
C +C + + C C+CG C N+ FQ + V T G G+ A + I +G
Sbjct: 168 CGENCWNRAVCTECCDLSCRCGELCQNRRFQKHQDACVYPVPTRGKGWGLCAGQFIPKGT 227
Query: 139 FVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPN 198
F+I+Y GEV D + E + T YL +I+R+ VID TYKGN +R+INHSC PN
Sbjct: 228 FIIQYTGEVFDINSSEGIKRCKDYSRSTCTYLMKIDRNEVIDPTYKGNLARFINHSCDPN 287
Query: 199 TEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
QKW + GE IGIF+ +DI++ E LT+DYQ++
Sbjct: 288 CITQKWHVLGEICIGIFSIKDIQEDEELTFDYQFD 322
>gi|403214852|emb|CCK69352.1| hypothetical protein KNAG_0C02410 [Kazachstania naganishii CBS
8797]
Length = 708
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 58/153 (37%), Positives = 89/153 (58%), Gaps = 3/153 (1%)
Query: 81 DRDCHCGMLLSSCSSG---CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRG 137
D DC + L C G CG +C N+ FQ + + + QT+ G G++A EDI +
Sbjct: 75 DSDCINRLTLIECVDGLCESTCGKNCQNQRFQRKQYADVMVFQTKMKGYGVLAKEDIDQH 134
Query: 138 EFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCP 197
+F+ EY+GEVID++ +R+ +FY + IDAT KG +R+ NHSC P
Sbjct: 135 QFIYEYMGEVIDEEEFRDRMATYDEKKFKHFYFMMLQNGQFIDATMKGCLARFCNHSCSP 194
Query: 198 NTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
N + KW+++G+ ++GIFA+R I K E +T+DY
Sbjct: 195 NAYVNKWVVNGKLKMGIFASRKILKDEEITFDY 227
>gi|350634064|gb|EHA22428.1| hypothetical protein ASPNIDRAFT_51069 [Aspergillus niger ATCC 1015]
Length = 962
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 91/164 (55%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG C N+ FQ R + +++TEK G G+ A+ D++ +F+
Sbjct: 187 DSDCINRATKIECMGDCGCGPECQNQRFQRREYAPVAVIKTEKKGFGLRAEADLRPHQFI 246
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EYVGEVI++ R+ + G +FY +++ +DAT KGN R+ NHSC PN
Sbjct: 247 FEYVGEVINEGQFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCY 306
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R I+ GE L ++Y + + D YC
Sbjct: 307 VDKWVVGEKLRMGIFAERHIQAGEELVFNYNVDRYGADPQPCYC 350
>gi|358373289|dbj|GAA89888.1| SET and WW domain protein [Aspergillus kawachii IFO 4308]
Length = 963
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 91/164 (55%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG C N+ FQ R + +++TEK G G+ A+ D++ +F+
Sbjct: 188 DSDCINRATKIECMGDCGCGPECQNQRFQRREYAPVAVIKTEKKGFGLRAEADLRPHQFI 247
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EYVGEVI++ R+ + G +FY +++ +DAT KGN R+ NHSC PN
Sbjct: 248 FEYVGEVINEGQFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCY 307
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R I+ GE L ++Y + + D YC
Sbjct: 308 VDKWVVGEKLRMGIFAERHIQAGEELVFNYNVDRYGADPQPCYC 351
>gi|322697133|gb|EFY88916.1| histone H3 lysine 36 (K36) methyltransferase [Metarhizium acridum
CQMa 102]
Length = 895
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 59/174 (33%), Positives = 95/174 (54%), Gaps = 5/174 (2%)
Query: 62 EDDGIFCSCTA--SPGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
E+D + C C G + C D C + +SG CG C N+ FQ + +
Sbjct: 115 ENDALDCDCAEEFQDGENFACGEDSDCINRATKMECVASGSNCGGGCQNQRFQRKLWADV 174
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+++T+K G G+ + ++ +F+ EYVGEVI++ T R+ + G +FY +++
Sbjct: 175 AVIKTDKKGYGLRTESSLQPNDFIYEYVGEVINEPTFRRRMLQYDEEGIKHFYFMSLSKS 234
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
+DAT KGN R+ NHSC PN + KW++ + R+GIFA R I+ GE L ++Y
Sbjct: 235 EFVDATRKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFALRKIQAGEELVFNY 288
>gi|324500453|gb|ADY40214.1| Histone-lysine N-methyltransferase lin-59 [Ascaris suum]
Length = 1467
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 76/195 (38%), Positives = 109/195 (55%), Gaps = 10/195 (5%)
Query: 41 AIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCG 100
A Y+ IK N+ + R+E + CSC GS G D +C M+ C++ C G
Sbjct: 721 AKSYIKIKHNVIVDAYP--RIEQ--LQCSCQK--GSCGESD-ECLNRMVQMECNNSCGRG 773
Query: 101 NSCLNKP-FQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
C NK F+ V K+ L +T G G+ D +++G+FV EYVGEV+ +T + R
Sbjct: 774 VHCSNKRIFRRECVDKLSLFETSNGRGLGVRTDVPLQKGQFVCEYVGEVVSMETFDAR-N 832
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATR 218
+ N Y + VIDA KGN +R++NHSC PN EMQ+W ++G+ RIG+FA R
Sbjct: 833 AHSYRAFRNHYALNLCPGYVIDAYQKGNIARFVNHSCVPNCEMQRWSVNGQHRIGLFALR 892
Query: 219 DIKKGENLTYDYQYE 233
+ KGE LTYDY ++
Sbjct: 893 VVAKGEELTYDYNWD 907
>gi|317037088|ref|XP_001398411.2| histone-lysine N-methyltransferase, H3 lysine-36 specific
[Aspergillus niger CBS 513.88]
Length = 853
Score = 117 bits (293), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 91/164 (55%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG C N+ FQ R + +++TEK G G+ A+ D++ +F+
Sbjct: 187 DSDCINRATKIECMGDCGCGPECQNQRFQRREYAPVAVIKTEKKGFGLRAEADLRPHQFI 246
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EYVGEVI++ R+ + G +FY +++ +DAT KGN R+ NHSC PN
Sbjct: 247 FEYVGEVINEGQFRRRMRQYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCY 306
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R I+ GE L ++Y + + D YC
Sbjct: 307 VDKWVVGEKLRMGIFAERHIQAGEELVFNYNVDRYGADPQPCYC 350
>gi|346321276|gb|EGX90876.1| histone H3 lysine 36 (K36) methyltransferase [Cordyceps militaris
CM01]
Length = 905
Score = 117 bits (293), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 62/188 (32%), Positives = 98/188 (52%), Gaps = 6/188 (3%)
Query: 62 EDDGIFCSCTAS--PGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
E+D C C G + C D C + + CG C N+ FQ + +
Sbjct: 122 ENDAFDCECRGDWHDGVNMACGEDSDCINRATKMECSETAGNCGGGCQNQRFQRKQYADV 181
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+++TEK G G+ D ++ +F+ EY+GEVI++ T R+ + G +FY ++++
Sbjct: 182 CVIKTEKKGYGLRTDAELSAHDFIFEYIGEVINEATFRRRMLQYDQQGIKHFYFMSLSKN 241
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
+DAT KGN R+ NHSC PN + KW+I + R+GIFA R I+ GE L ++Y + +
Sbjct: 242 EFVDATRKGNLGRFCNHSCVPNCYVDKWVIGDKLRMGIFALRTIRAGEELVFNYNVDRYG 301
Query: 236 HDSLIAYC 243
D YC
Sbjct: 302 ADPQPCYC 309
>gi|212535752|ref|XP_002148032.1| SET and WW domain protein [Talaromyces marneffei ATCC 18224]
gi|210070431|gb|EEA24521.1| SET and WW domain protein [Talaromyces marneffei ATCC 18224]
Length = 900
Score = 117 bits (293), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 59/164 (35%), Positives = 92/164 (56%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C + C CG C N+ FQ + + +++T K G G+ A+ DI+ +F+
Sbjct: 185 DSDCINRATRIECLNDCSCGQDCQNQRFQRKEYANVTVIKTAKKGFGLRAESDIQPHQFI 244
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GEVI++ R+ + G +FY +N+ +DAT KGN +R+ NHSC PN
Sbjct: 245 YEYIGEVINEANFRRRMIQYDKEGIKHFYFMSLNKGEFVDATKKGNLARFCNHSCNPNCY 304
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R I+ GE L ++Y + + D YC
Sbjct: 305 VDKWVVGEKLRMGIFAERYIQAGEELVFNYNVDRYGADPQPCYC 348
>gi|256084142|ref|XP_002578291.1| SET domain protein [Schistosoma mansoni]
Length = 1746
Score = 117 bits (292), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 71/207 (34%), Positives = 104/207 (50%), Gaps = 12/207 (5%)
Query: 33 PDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC---GML 89
PD++ KP + + IY RL D C P S C +C +
Sbjct: 950 PDYY-KPIKVNWPLGSVRIY-------RLTDPSEAPRCECKPNSEDPCGPSSNCINRELH 1001
Query: 90 LSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQT-EKCGAGIVADEDIKRGEFVIEYVGEVI 148
S C G++C N+ F R + T ++ G G+ I+ GEFV EY+G++I
Sbjct: 1002 YECLPSICPNGDACRNQRFTKRLYPPQRPFWTGDQRGWGLKTMIAIRAGEFVNEYIGDLI 1061
Query: 149 DDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDG 208
D+ RL TN+Y+ +++ +IDA KGN SR++NHSC PN QKW ++G
Sbjct: 1062 DEDEANRRLRFAHENNITNYYMMKLDSQRIIDAGPKGNLSRFMNHSCDPNLNTQKWTVNG 1121
Query: 209 ETRIGIFATRDIKKGENLTYDYQYEFL 235
+ RIG+FA RDI GE LT++Y + L
Sbjct: 1122 DNRIGLFAVRDISVGEELTFNYNFVAL 1148
>gi|393219563|gb|EJD05050.1| hypothetical protein FOMMEDRAFT_105290 [Fomitiporia mediterranea
MF3/22]
Length = 872
Score = 117 bits (292), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 68/205 (33%), Positives = 105/205 (51%), Gaps = 8/205 (3%)
Query: 47 IKRNIYLTKRIKRRLED-DGIFCSCTASPGSSG---VCD--RDCHCGMLLSSCSSG-CKC 99
++ N Y + R E + + C C PG C DC + C G C+C
Sbjct: 100 LEENYYQYSTLGRSREALESMTCDCQFVPGRDAPEMACGYGSDCINRLTQVECEEGDCRC 159
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
C N+ Q + +++V TEK G G+ A DI + F+ EY+G+V+ + +R+ +
Sbjct: 160 RGYCRNQRLQRKEYANVEIVLTEKKGFGLRAGSDITKDAFIYEYIGDVVSQPSFAKRMRE 219
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
G +FY + +D IDAT +G K R+ NHSC PN + KW + R+GI+A R+
Sbjct: 220 YAEEGIRHFYFMMLQKDEFIDATKRGGKGRFANHSCNPNCYVAKWTVGDRVRMGIYAKRN 279
Query: 220 IKKGENLTYDYQYE-FLHDSLIAYC 243
IKK E LT++Y + + HD+ YC
Sbjct: 280 IKKDEELTFNYNVDRYGHDAQPCYC 304
>gi|353232109|emb|CCD79464.1| putative set domain protein [Schistosoma mansoni]
Length = 1503
Score = 117 bits (292), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 71/207 (34%), Positives = 104/207 (50%), Gaps = 12/207 (5%)
Query: 33 PDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC---GML 89
PD++ KP + + IY RL D C P S C +C +
Sbjct: 950 PDYY-KPIKVNWPLGSVRIY-------RLTDPSEAPRCECKPNSEDPCGPSSNCINRELH 1001
Query: 90 LSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQT-EKCGAGIVADEDIKRGEFVIEYVGEVI 148
S C G++C N+ F R + T ++ G G+ I+ GEFV EY+G++I
Sbjct: 1002 YECLPSICPNGDACRNQRFTKRLYPPQRPFWTGDQRGWGLKTMIAIRAGEFVNEYIGDLI 1061
Query: 149 DDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDG 208
D+ RL TN+Y+ +++ +IDA KGN SR++NHSC PN QKW ++G
Sbjct: 1062 DEDEANRRLRFAHENNITNYYMMKLDSQRIIDAGPKGNLSRFMNHSCDPNLNTQKWTVNG 1121
Query: 209 ETRIGIFATRDIKKGENLTYDYQYEFL 235
+ RIG+FA RDI GE LT++Y + L
Sbjct: 1122 DNRIGLFAVRDISVGEELTFNYNFVAL 1148
>gi|310793120|gb|EFQ28581.1| SET domain-containing protein [Glomerella graminicola M1.001]
Length = 940
Score = 116 bits (291), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 64/188 (34%), Positives = 104/188 (55%), Gaps = 6/188 (3%)
Query: 62 EDDGIFCSCTAS--PGSSGVC--DRDCHCGMLLSSCSSGC-KCGNSCLNKPFQNRPVKKM 116
++D + C C + G + C D DC C +G CG+ C N+ FQ + +
Sbjct: 112 DNDALDCDCRSEWHDGENLSCGEDSDCINRATKMECVAGAGNCGDGCQNQRFQRKQYANV 171
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+++TEK G G+ A+ D++ +F+ EY+GEVI++ T R+ + G +FY + +
Sbjct: 172 SVIKTEKKGFGLRANVDLQANDFIFEYIGEVINEPTFRRRMVQYDDEGIKHFYFMSLTKH 231
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
+DAT KGN R+ NHSC PN + KW++ + R+GIF++R IK GE L ++Y + +
Sbjct: 232 EFVDATKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFSSRLIKAGEELVFNYNVDRYG 291
Query: 236 HDSLIAYC 243
D YC
Sbjct: 292 ADPQPCYC 299
>gi|406606115|emb|CCH42475.1| Histone-lysine N-methyltransferase [Wickerhamomyces ciferrii]
Length = 815
Score = 116 bits (291), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 61/174 (35%), Positives = 98/174 (56%), Gaps = 8/174 (4%)
Query: 65 GIFCSCTASPG-------SSGVCDRDC-HCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
G F SC P + D DC + G + + CG C N+ FQ + +
Sbjct: 109 GEFMSCECHPDWDESNELNYACIDDDCINRGTKIECINGVSSCGEDCSNQRFQKKQYVDI 168
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
++QTEK G G+ + DI+ F+ EY+GEVID+++ ++R+ + L +FY + +
Sbjct: 169 NVIQTEKKGYGVRSQIDIEPDTFIYEYIGEVIDEKSFKKRMLEYDELNFKHFYFMMLQKG 228
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
+DAT KG+ +R+ NHSC PN ++KW++ + ++GIFA R I KGE +T+DY
Sbjct: 229 EFLDATRKGSLARFCNHSCNPNCYVEKWVVGEKLKMGIFAKRKILKGEEITFDY 282
>gi|195128035|ref|XP_002008472.1| GI13513 [Drosophila mojavensis]
gi|193920081|gb|EDW18948.1| GI13513 [Drosophila mojavensis]
Length = 2416
Score = 116 bits (291), Expect = 7e-24, Method: Composition-based stats.
Identities = 67/214 (31%), Positives = 110/214 (51%), Gaps = 12/214 (5%)
Query: 27 PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC 86
P +P W Y I+ N+Y D C+C P +G C +C
Sbjct: 1468 PTRHVVPSW-------NYRKIRTNVYAESVRPNLAGFDHPNCNCKPQP--NGACLDNCLN 1518
Query: 87 GMLLSSC-SSGCKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADEDIKRGEFVIEYV 144
M+ + C + C G+ C N+ Q V ++ T G G+ I +G +++EYV
Sbjct: 1519 RMVYTECVPNNCPAGDKCRNQKIQRHEVAPGVERFMTADKGWGVRTKLPIAKGTYILEYV 1578
Query: 145 GEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKW 204
GEV+ ++ ++R+ + +L +T+ Y ++ +VID G+ R++NHSC PN EMQKW
Sbjct: 1579 GEVVTEREFKQRMATI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKW 1637
Query: 205 IIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
++G +R+ +FA R I++GE LTYDY + + S
Sbjct: 1638 SVNGLSRMVLFAKRAIEEGEELTYDYNFSLFNPS 1671
>gi|242793892|ref|XP_002482258.1| SET and WW domain protein [Talaromyces stipitatus ATCC 10500]
gi|218718846|gb|EED18266.1| SET and WW domain protein [Talaromyces stipitatus ATCC 10500]
Length = 891
Score = 116 bits (291), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 59/164 (35%), Positives = 92/164 (56%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C + C CG C N+ FQ + + +++T K G G+ A+ DI +F+
Sbjct: 176 DSDCINRATRIECLNDCSCGQDCQNQRFQRKEYANVTVIKTAKKGFGLRAESDIHPHQFI 235
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GEVI++ + R+ + G +FY +N+ +DAT KGN +R+ NHSC PN
Sbjct: 236 YEYIGEVINEVSFRRRMIQYDKEGIKHFYFMSLNKGEFVDATKKGNLARFCNHSCNPNCY 295
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R I+ GE L ++Y + + D YC
Sbjct: 296 VDKWVVGEKLRMGIFAERYIQAGEELVFNYNVDRYGADPQPCYC 339
>gi|388581385|gb|EIM21694.1| SET domain-containing protein [Wallemia sebi CBS 633.66]
Length = 681
Score = 116 bits (291), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 67/190 (35%), Positives = 101/190 (53%), Gaps = 5/190 (2%)
Query: 56 RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVK 114
R K+ LE+ +C C + G G DC + C CK G C N+ FQ +
Sbjct: 30 RTKQNLEE--AYCDCYLTGGECG-NHSDCINRLTQVECLLDDCKTGPQCNNQRFQRKQWA 86
Query: 115 KMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEIN 174
+ +++TEK G G+ A+ D+ R F+IEY+GEV+ R+ G +FY +
Sbjct: 87 NIDIIKTEKKGYGLRANVDLDRDTFLIEYIGEVVTQTQFLRRMNTYSKEGIKHFYFMMLQ 146
Query: 175 RDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE- 233
+ IDAT +GN R+ NHSC PN + KW++ ++GIF R I+KGE LT++Y +
Sbjct: 147 NEEFIDATRRGNIGRFANHSCAPNCFVSKWVVGKYVKMGIFTKRKIEKGEELTFNYNVDR 206
Query: 234 FLHDSLIAYC 243
+ HD+ YC
Sbjct: 207 YGHDAQPCYC 216
>gi|328767162|gb|EGF77213.1| hypothetical protein BATDEDRAFT_91931 [Batrachochytrium
dendrobatidis JAM81]
Length = 779
Score = 116 bits (290), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 72/234 (30%), Positives = 110/234 (47%), Gaps = 30/234 (12%)
Query: 28 VEFELP-----------------DWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSC 70
V+FELP F + +A P++ I++N+++ +R D C C
Sbjct: 469 VDFELPYDLQHFVKICNGVEGMRTLFNEEEASPFIHIQKNVWVDRRPLDVPFDFESICQC 528
Query: 71 TASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQ-NRPVKKMKLVQTEKCGAGIV 129
S C+ C C G+ C N+ F+ N ++K+++ G G+
Sbjct: 529 KLPKDGSPPCE-----------CGDNCPMGDCCTNRAFRRNEQIEKLRVFYAPNRGFGLY 577
Query: 130 ADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSR 189
D IK G +IEY GE+I C ER + + G+ N Y E +V+D KG +R
Sbjct: 578 TDVPIKAGVLIIEYRGEIISTAKCIERNDTI-YSGQKNHYFLEYGNGLVLDGCRKGTIAR 636
Query: 190 YINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
+ NHSC PN ++KW + E R+GIFAT +I G LTYDY+++ YC
Sbjct: 637 FANHSCDPNCHVEKWYVGTEFRVGIFATNNISVGSELTYDYRFDSYGQMQPCYC 690
>gi|392563363|gb|EIW56542.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 785
Score = 116 bits (290), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 100/193 (51%), Gaps = 7/193 (3%)
Query: 58 KRRLEDDGIFCSCTASPG---SSGVCDRDCHCGMLLSSCS---SGCKCGNSCLNKPFQNR 111
K R + + C C G C D C L+ C+C + C N+ FQ +
Sbjct: 61 KSREALESMMCDCQYEHGIDDPDTACGHDSDCINRLTQVECLPEDCRCRSYCRNQRFQRK 120
Query: 112 PVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLC 171
+++V+TE G G+ A E+I + F+ EYVG+V+ + +R+ + G +FY
Sbjct: 121 QYAPIEIVKTEMKGFGLRAGENIPKDAFIYEYVGDVVSQPSFLKRMRQYAEEGIRHFYFM 180
Query: 172 EINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQ 231
+ +D IDAT +G R+ NHSC PN + KW + R+GIFA R IK+ E LT++Y
Sbjct: 181 MLQKDEYIDATKRGGIGRFANHSCNPNCYVAKWTVGDHVRMGIFANRTIKQHEELTFNYN 240
Query: 232 YE-FLHDSLIAYC 243
+ + HD+ I YC
Sbjct: 241 VDRYGHDAQICYC 253
>gi|324500837|gb|ADY40382.1| Histone-lysine N-methyltransferase SETD2 [Ascaris suum]
Length = 950
Score = 116 bits (290), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 71/198 (35%), Positives = 103/198 (52%), Gaps = 9/198 (4%)
Query: 43 PYVFIKRNIYL-TKRIKRRLEDDGIFCSCTASP-GSSGVCDRDCHCGMLLSSCSSGCKCG 100
P++ I+ N+ L + + + FC T G C +C ML + C S C G
Sbjct: 99 PFIAIEENVILCDESLIKEAAVVRCFCEPTEEELAVGGGCGENCINRMLYTECGSRCPSG 158
Query: 101 NSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKM 160
C N+ N+ K+++ G G+ A E ++ G+FV+EYVGEVI + R+ +
Sbjct: 159 ARCSNRRLHNKEYAKVEVFYAGVKGWGLRACEPLEPGQFVMEYVGEVISAEEMRRRVRRY 218
Query: 161 ----KHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
KH+ + YL + VIDAT +GN SR+INHSC PN +KW +D RIG FA
Sbjct: 219 GRDPKHV---HHYLMALKNGAVIDATIRGNISRFINHSCDPNCRSEKWTVDRRVRIGFFA 275
Query: 217 TRDIKKGENLTYDYQYEF 234
T+ + GE L +DYQ E
Sbjct: 276 TKKVAVGEELVFDYQLEL 293
>gi|198435574|ref|XP_002121834.1| PREDICTED: absent, small, or homeotic discs 1 homolog [Ciona
intestinalis]
Length = 2850
Score = 116 bits (290), Expect = 9e-24, Method: Composition-based stats.
Identities = 76/235 (32%), Positives = 121/235 (51%), Gaps = 21/235 (8%)
Query: 15 HAFNKLLKQIGNPVEFELP--DWF------IKPK---AIPYVFIKRNIYLTKRIKRRLED 63
HA N LLK + +F+LP W+ I P A ++ I++N+Y+ + E+
Sbjct: 2007 HAGNFLLK---SRKDFQLPFNIWWMYNRKLISPSQDLATQFIKIEKNVYVDSQPTCEQEE 2063
Query: 64 DGIFCSCTAS--PGSSGV--CDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPV-KKMK 117
C + SS V C ++C ++ CS C C + C N+ Q + K ++
Sbjct: 2064 HVCVCQTLSDIHSLSSDVHGCGKECLNRLMYIECSPDTCPCQDKCANRCIQKQQWWKDLE 2123
Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
+T G G+ + DI G+F++EYVGEV+ ++ R + + + Y ++
Sbjct: 2124 RFRTNDRGWGVRTNSDIPEGQFLLEYVGEVVSEREFRRRTIE-NYNAHNDHYCVQLEAGT 2182
Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
VID N+ R++NHSC PN EMQKW+++GE R+G+FA R I E LTYDY +
Sbjct: 2183 VIDGYRLANEGRFVNHSCQPNCEMQKWVVNGEYRVGLFAKRPIVSSEELTYDYNF 2237
>gi|195037347|ref|XP_001990122.1| GH19166 [Drosophila grimshawi]
gi|193894318|gb|EDV93184.1| GH19166 [Drosophila grimshawi]
Length = 1434
Score = 116 bits (290), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 67/196 (34%), Positives = 105/196 (53%), Gaps = 5/196 (2%)
Query: 43 PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC--GMLLSSCSSG-CKC 99
PY+ IK N + E+ C C P S C + C +L C+ C
Sbjct: 1162 PYIKIKANKAVPPVRFVYSEESQNICVC--KPDSEHPCGPESGCLNRILYHECNPKYCNA 1219
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
G C N+ F+ R ++++V + G G+V E I G+F+IEYVGEVI+ + R+ +
Sbjct: 1220 GKKCENQLFELRKSPRLEVVYMNERGFGLVCREAIAEGDFIIEYVGEVINHAEFQRRVAQ 1279
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
+ + NFY + +D +IDA KGN +R++NHSC PN QKW ++ R+G+FA +D
Sbjct: 1280 KTNDRDENFYFLGVEKDFIIDAGPKGNLARFMNHSCEPNCATQKWTVNCINRVGLFAIKD 1339
Query: 220 IKKGENLTYDYQYEFL 235
I + LT++Y ++ L
Sbjct: 1340 IPENTELTFNYLWDDL 1355
>gi|385301625|gb|EIF45804.1| putative histone lysine methyltransferase set2p [Dekkera
bruxellensis AWRI1499]
Length = 695
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 72/194 (37%), Positives = 102/194 (52%), Gaps = 8/194 (4%)
Query: 44 YVFIKRNIYLTKRIK--RRLEDDGIFCSCTAS--PGSSGVCDRDCHCGMLLSS--CSSG- 96
+ +K N Y+ R+K ++ + C CT G + C D C L++ C G
Sbjct: 42 FQLVKHNEYID-RLKGVYSAANEVMTCDCTERRVAGVNVACGADSDCINRLTNVECVDGE 100
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C CG C N+ FQ + + + TE G G+ A+ DI F+IEY GEVID+Q + R
Sbjct: 101 CSCGXRCQNQRFQRNQMADISIFLTEHKGYGMRANCDIPANTFIIEYKGEVIDEQAYKIR 160
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
G +FY I +IDAT KG+ R+ NHSC PN +KW+++ R+GIFA
Sbjct: 161 KEAYAKEGIKHFYFMMIQDGQIIDATKKGSLGRFCNHSCDPNAYXEKWVVNKRYRMGIFA 220
Query: 217 TRDIKKGENLTYDY 230
R I KGE +T+DY
Sbjct: 221 KRKIPKGEEITFDY 234
>gi|322709340|gb|EFZ00916.1| histone H3 lysine 36 (K36) methyltransferase [Metarhizium
anisopliae ARSEF 23]
Length = 894
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 58/174 (33%), Positives = 95/174 (54%), Gaps = 5/174 (2%)
Query: 62 EDDGIFCSCTA--SPGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
++D + C C G + C D C + +SG CG C N+ FQ + +
Sbjct: 115 DNDALDCDCAEEFQDGENFACGEDSDCINRATKMECVASGSNCGGGCQNQRFQRKLWADV 174
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+++T+K G G+ + ++ +F+ EYVGEVI++ T R+ + G +FY +++
Sbjct: 175 AVIKTDKKGYGLRTESPLQPNDFIYEYVGEVINEPTFRRRMLQYDEEGIKHFYFMSLSKS 234
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
+DAT KGN R+ NHSC PN + KW++ + R+GIFA R I+ GE L ++Y
Sbjct: 235 EFVDATRKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFALRKIQAGEELVFNY 288
>gi|171686378|ref|XP_001908130.1| hypothetical protein [Podospora anserina S mat+]
gi|170943150|emb|CAP68803.1| unnamed protein product [Podospora anserina S mat+]
Length = 919
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 61/186 (32%), Positives = 97/186 (52%), Gaps = 6/186 (3%)
Query: 64 DGIFCSCTAS--PGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKL 118
D + C CT + C D C + C CG C N+ FQ + + +
Sbjct: 122 DALDCDCTEEWRNDENHACGEDSDCINRATKIECVDGDCNCGPGCQNQRFQRKQYADVSV 181
Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
++T+K G G+ A+ +++ +F+ EY+GEVI++ T R+ K G +FY + +
Sbjct: 182 IKTDKKGFGLRANRNLQPNDFIFEYIGEVINEPTFRNRMIKYDREGIKHFYFMSLTKSEF 241
Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHD 237
+DAT KGN R+ NHSC PN + KW++ + R+GIFA R I+ GE L ++Y + + D
Sbjct: 242 VDATKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFAGRPIRAGEELVFNYNVDRYGAD 301
Query: 238 SLIAYC 243
YC
Sbjct: 302 PQPCYC 307
>gi|346971461|gb|EGY14913.1| histone-lysine N-methyltransferase [Verticillium dahliae VdLs.17]
Length = 913
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 61/188 (32%), Positives = 98/188 (52%), Gaps = 6/188 (3%)
Query: 62 EDDGIFCSCTA--SPGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
++D + C C G + CD D C + CG+ C N+ FQ + +
Sbjct: 112 DNDALDCDCREEWQNGENHSCDEDSDCINRATKMECVMGAGNCGDGCQNQRFQRKAYADV 171
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+++TEK G G+ A+ D++ +F+ EY+GEVI++ T R+ + G +FY + +
Sbjct: 172 SVIKTEKKGFGLRANVDMQANDFIFEYIGEVINEPTFRRRMVQYDDEGIKHFYFMSLTKH 231
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
+DAT KGN R+ NHSC PN + KW++ + R+GIF R I GE L ++Y + +
Sbjct: 232 EFVDATKKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFTLRGIAAGEELVFNYNVDRYG 291
Query: 236 HDSLIAYC 243
D YC
Sbjct: 292 ADPQPCYC 299
>gi|397568484|gb|EJK46160.1| hypothetical protein THAOC_35187 [Thalassiosira oceanica]
Length = 473
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 73/213 (34%), Positives = 109/213 (51%), Gaps = 10/213 (4%)
Query: 27 PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGV--CDRDC 84
PV+F+ + KP A Y + N Y RR G C C S G GV C C
Sbjct: 203 PVDFKE-EVHSKPPA--YAHVNSNRYDPDNRPRRSIFSGEKCRCKPS-GEDGVPSCGERC 258
Query: 85 HCGM-LLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEY 143
+ C++ C G +C N+ R K ++ + G G+++ + +K G+ VIEY
Sbjct: 259 DNRLNYFECCAANCDLGPNCGNRAMGRRRFAKCRVQREHGKGWGLISVDGVKSGDLVIEY 318
Query: 144 VGEVIDDQTCEERL--WKMKHLGETNFYLCEINR-DMVIDATYKGNKSRYINHSCCPNTE 200
GEVID+ T E RL W H + NFY+ + + IDA + N++R++NHSC PN
Sbjct: 319 AGEVIDESTKESRLAAWTRDHPTDPNFYVMALGQAGWYIDARHVANQARFVNHSCDPNCR 378
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
+ + G R+ I A RD++ GE L+YDYQ++
Sbjct: 379 LVPLNVAGHMRVAIVAVRDVRPGEFLSYDYQFD 411
>gi|380012561|ref|XP_003690348.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2-like [Apis florea]
Length = 1208
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 64/149 (42%), Positives = 82/149 (55%), Gaps = 1/149 (0%)
Query: 66 IFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKC 124
+ C C + DC +LL CS G C G C N+ F R M+ T
Sbjct: 788 VACDCDPEWENPCAPGTDCLNRILLVECSPGICPAGPKCNNQAFVRRQYPAMEPFHTVGR 847
Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
G G+ + E IK G+FVIEYVGEVID+ + RL + K L NFY I+ + IDA K
Sbjct: 848 GWGLRSLEHIKAGQFVIEYVGEVIDEAEYKRRLHRKKELKNENFYFLTIDNNRTIDAEPK 907
Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIG 213
GN SR++NHSC PN E QKW ++G+TRI
Sbjct: 908 GNLSRFMNHSCSPNCETQKWTVNGDTRIA 936
>gi|255725552|ref|XP_002547705.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240135596|gb|EER35150.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 782
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 58/155 (37%), Positives = 90/155 (58%), Gaps = 3/155 (1%)
Query: 79 VCDRDCHCGMLLSSC---SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIK 135
C D +C ++S + C CG C N+ FQ + K+ + QTE G G+ ++E I
Sbjct: 109 ACGEDSNCINRITSVECINKHCSCGEDCENQRFQKKLYSKVSVFQTELKGYGLKSEEIIN 168
Query: 136 RGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSC 195
+F+ EY+GEVI++ +R+ + +FY ++ D IDAT KG +R+ NHSC
Sbjct: 169 ENQFIYEYIGEVINETDFRKRMIEYDLENYKHFYFMMLDHDSFIDATKKGCLARFANHSC 228
Query: 196 CPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
PN + KW++ + R+GIFA R I+KGE +T+DY
Sbjct: 229 NPNAYVDKWVVGDKLRMGIFAKRRIEKGEEITFDY 263
>gi|345565974|gb|EGX48921.1| hypothetical protein AOL_s00079g142 [Arthrobotrys oligospora ATCC
24927]
Length = 1227
Score = 115 bits (287), Expect = 2e-23, Method: Composition-based stats.
Identities = 69/232 (29%), Positives = 115/232 (49%), Gaps = 24/232 (10%)
Query: 30 FELPDWFIKPKAIPYVFIKRNI--------YLTKRIKRRLEDDGIFCSCTASPGSSGVCD 81
F L W + PK + +KRNI Y ++ R+ C+CT SG C
Sbjct: 259 FNLDPW-VYPKPPGWKSLKRNIFVGEAAELYKQQKNSRKNAYQSTMCNCTPE---SGGCG 314
Query: 82 RDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVK---------KMKLVQTEKCGAGIVADE 132
C ++ C C N+PF++ ++ ++++ T+ G G+ A
Sbjct: 315 PGCLNRVMYYECDKNNCSLKDCQNRPFRDLAIRVENDTWFDDGIEIILTQDKGYGLRACR 374
Query: 133 DIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYIN 192
+ ++EY+GE+I CEERL E+ +YL E + ++IDAT +G+ +R++N
Sbjct: 375 SFGPNQIIVEYIGEIITQDECEERLHGPYKDNES-YYLMEFDNSLIIDAT-RGSLARFVN 432
Query: 193 HSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEFLHDSLIAYC 243
HSC PN +M+KW++ G+ R+ +FA + I+ GE LTYDY + + C
Sbjct: 433 HSCSPNCKMEKWMVAGQPRMALFAGDEGIEVGEELTYDYNFSWFSGGSTQLC 484
>gi|256074584|ref|XP_002573604.1| huntingtin interacting protein-related [Schistosoma mansoni]
Length = 1575
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 60/141 (42%), Positives = 82/141 (58%), Gaps = 1/141 (0%)
Query: 93 CSSGCKCGNSCLNKPFQNRPVKKMKLVQTEK-CGAGIVADEDIKRGEFVIEYVGEVIDDQ 151
C C G+ C N+ FQ R + K G G++A +++K+G FVIEYVGEVID
Sbjct: 431 CGLHCAAGDFCSNRQFQMRLYAPTRPFYAGKDKGWGLMATDNVKKGSFVIEYVGEVIDFS 490
Query: 152 TCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETR 211
R+ + + LG + Y + D IDA KGN +R++NHSC PN QKW ++GE R
Sbjct: 491 EFRRRIRRYERLGHAHHYFMAVESDRFIDAGSKGNWARFVNHSCEPNCVTQKWSVNGEIR 550
Query: 212 IGIFATRDIKKGENLTYDYQY 232
IG FA DI G+ +T DYQ+
Sbjct: 551 IGFFAKEDIPSGQEVTIDYQF 571
>gi|312374857|gb|EFR22332.1| hypothetical protein AND_15429 [Anopheles darlingi]
Length = 579
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 68/202 (33%), Positives = 101/202 (50%), Gaps = 18/202 (8%)
Query: 44 YVFIKRNIYL------TKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SG 96
Y I RN Y ++R + + E C+C+ G + C C+
Sbjct: 371 YTKIYRNRYTASLKESSRRSRIKEESMDSVCNCSMDDGDPCGANSSCINRFTFIECNPKT 430
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C C N+ F R +++ + E G G+V ED+ G+F+IEYVGEVI+ + R
Sbjct: 431 CPAKERCSNQRFAKRIYPDLEVRKFEDRGHGLVTKEDLTAGQFIIEYVGEVINKKEFGRR 490
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
L M+ E ++Y E++ +++IDA KGN +R++NHSC PN E QKW +D IG+FA
Sbjct: 491 LNYMQQQNEQHYYFLEVDSEIIIDAGPKGNLARFVNHSCEPNCETQKWTVDNMRVIGVFA 550
Query: 217 TRDIKKGENLTYDYQYEFLHDS 238
+DIK LHDS
Sbjct: 551 LKDIKA-----------MLHDS 561
>gi|210075797|ref|XP_002143056.1| YALI0D21684p [Yarrowia lipolytica]
gi|199425837|emb|CAG81317.4| YALI0D21684p [Yarrowia lipolytica CLIB122]
Length = 1284
Score = 115 bits (287), Expect = 2e-23, Method: Composition-based stats.
Identities = 77/232 (33%), Positives = 116/232 (50%), Gaps = 28/232 (12%)
Query: 29 EFELPDWFI-------KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCD 81
+F+LP W I KPK+ + IKRNI++ + I C+C CD
Sbjct: 152 DFQLP-WNIYCPTGNVKPKS--WKSIKRNIFIDVEPPKPSTLKSI-CACVDR------CD 201
Query: 82 RDCHCGMLLSSC-SSGCKCGN-SCLNKPF---QNRPVK----KMKLVQTEKCGAGIVADE 132
C ML C SS C G+ C N+ F +NR + ++++ TE G G+ A
Sbjct: 202 ESCLNKMLQFECDSSNCPFGDVDCGNREFGLLKNRTSRFYHQAIEIIHTESKGYGLRACR 261
Query: 133 DIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYIN 192
GE V EY G+VI + RL ++ N+Y + + VID+ +G+ +R++N
Sbjct: 262 SFAPGELVCEYTGDVISPSEIKRRL-DNEYADSDNYYFLNLEKGAVIDSGLRGSAARFVN 320
Query: 193 HSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEFLHDSLIAYC 243
HSC PN EMQKW + G R+G+FA D ++ G+ LTYDY + + + C
Sbjct: 321 HSCMPNCEMQKWFVKGLPRVGLFAGEDGVQNGQELTYDYNFVWFEGAKAQTC 372
>gi|50285531|ref|XP_445194.1| hypothetical protein [Candida glabrata CBS 138]
gi|74638008|sp|Q6FX50.1|SET2_CANGA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|49524498|emb|CAG58098.1| unnamed protein product [Candida glabrata]
Length = 716
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 63/174 (36%), Positives = 95/174 (54%), Gaps = 6/174 (3%)
Query: 63 DDGIFCSCTA--SPGSSGVC--DRDCHCGMLLSSCSSGC--KCGNSCLNKPFQNRPVKKM 116
+D + C C G + C D DC + L C + CGN C N+ FQ + +
Sbjct: 81 NDFMECDCYEDFQNGKNHACGEDSDCINRLTLIECVNDLCGTCGNDCANQRFQKKEYANI 140
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+ +T+ G G+ A+ DI+ +F+ EY GEVI+++ +RL +FY +
Sbjct: 141 AVFKTKMKGYGVRAESDIEINDFIYEYKGEVIEEEEFRDRLVDYDQKKFRHFYFMMLQSG 200
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
IDAT KG+ +R+ NHSC PN + KW++ G+ R+GIFA R I KGE +T+DY
Sbjct: 201 EFIDATIKGSLARFCNHSCNPNAYVNKWVVAGKLRMGIFAKRKILKGEEITFDY 254
>gi|358381100|gb|EHK18776.1| hypothetical protein TRIVIDRAFT_57632 [Trichoderma virens Gv29-8]
Length = 924
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 57/172 (33%), Positives = 92/172 (53%), Gaps = 5/172 (2%)
Query: 64 DGIFCSCTAS--PGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKL 118
D + C C G + C D C + + CG C N+ FQ + + +
Sbjct: 121 DALDCDCREEWRDGENFACGEDSDCINRATKMECSADAGNCGGGCQNRRFQRKQYADVTV 180
Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
++TEK G G+ + ++ +F+ EY+GEVI++ T R+ + G +FY +N++
Sbjct: 181 IKTEKKGFGLRTNSALEPNDFIYEYIGEVINEPTFRRRMLQYDEEGIKHFYFMSLNKNEF 240
Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
+DAT KGN R+ NHSC PN + KW++ + R+GIFA R I+ GE L ++Y
Sbjct: 241 VDATKKGNLGRFCNHSCSPNCFVDKWVVGDKLRMGIFALRKIRAGEELVFNY 292
>gi|19115892|ref|NP_594980.1| histone lysine methyltransferase Set2 [Schizosaccharomyces pombe
972h-]
gi|74626626|sp|O14026.1|SET2_SCHPO RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=Lysine N-methyltransferase 3;
AltName: Full=SET domain-containing protein 2
gi|2408044|emb|CAB16247.1| histone lysine methyltransferase Set2 [Schizosaccharomyces pombe]
Length = 798
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 64/184 (34%), Positives = 99/184 (53%), Gaps = 5/184 (2%)
Query: 52 YLTKRIKRRLEDDGIFCSCTAS--PGSSGVCDRDCHCGMLLSS---CSSGCKCGNSCLNK 106
Y + I + E++ + C C G + C +C ++S CG SC N+
Sbjct: 113 YASANIGKPPENEAMICDCRPHWVDGVNVACGHGSNCINRMTSIECTDEDNVCGPSCQNQ 172
Query: 107 PFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGET 166
FQ K+ + TEK G G+ AD ++ + FV EY+GEVI +Q +R+ + G
Sbjct: 173 RFQRHEFAKVDVFLTEKKGFGLRADANLPKDTFVYEYIGEVIPEQKFRKRMRQYDSEGIK 232
Query: 167 NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENL 226
+FY + + IDAT +G+ +R+ NHSC PN + KW++ + R+GIF RDI +GE L
Sbjct: 233 HFYFMMLQKGEYIDATKRGSLARFCNHSCRPNCYVDKWMVGDKLRMGIFCKRDIIRGEEL 292
Query: 227 TYDY 230
T+DY
Sbjct: 293 TFDY 296
>gi|302406220|ref|XP_003000946.1| histone-lysine N-methyltransferase [Verticillium albo-atrum
VaMs.102]
gi|261360204|gb|EEY22632.1| histone-lysine N-methyltransferase [Verticillium albo-atrum
VaMs.102]
Length = 936
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 63/188 (33%), Positives = 100/188 (53%), Gaps = 6/188 (3%)
Query: 62 EDDGIFCSCTA--SPGSSGVCDRDCHCGMLLS--SCSSGC-KCGNSCLNKPFQNRPVKKM 116
++D + C C G + CD D C + C G CG+ C N+ FQ + +
Sbjct: 112 DNDALDCDCREEWQNGENHSCDEDSDCINRATKMECVMGTGNCGDGCQNQRFQRKAYADV 171
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+++TEK G G+ A+ D++ +F+ EY+GEVI++ T R+ + G +FY + +
Sbjct: 172 SVIKTEKKGFGLRANVDMQANDFIFEYIGEVINEPTFRRRMVQYDDEGIKHFYFMSLTKH 231
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FL 235
+DAT KGN R+ NHSC PN + KW++ + R+GIF R I GE L ++Y + +
Sbjct: 232 EFVDATKKGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFTLRGIAAGEELVFNYNVDRYG 291
Query: 236 HDSLIAYC 243
D YC
Sbjct: 292 ADPQPCYC 299
>gi|398407533|ref|XP_003855232.1| histone methyltransferase, partial [Zymoseptoria tritici IPO323]
gi|339475116|gb|EGP90208.1| histone methyltransferase [Zymoseptoria tritici IPO323]
Length = 799
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 58/172 (33%), Positives = 91/172 (52%), Gaps = 6/172 (3%)
Query: 65 GIFCSCT----ASPGSSGVC--DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKL 118
+ C C+ A S+ C D DC C + C CG+ C N FQ + + +
Sbjct: 68 ALECDCSEEWDAPSRSNHACGEDSDCINRATKMECVADCNCGSKCQNMRFQRKKYANVDV 127
Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
++TEK G G+ D++ +F+ EY+GEVI + R+ + G +FY + +
Sbjct: 128 IKTEKKGYGLRTQTDLRPNDFIFEYIGEVIGENVFRRRMQQYDEDGIKHFYFMSLTKGEF 187
Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
+DAT +GN R+ NHSC PN + KW++ + R+GIFA R I+ GE L ++Y
Sbjct: 188 VDATKRGNLGRFCNHSCNPNCYVDKWVVGDKLRMGIFAERAIQAGEELVFNY 239
>gi|336264616|ref|XP_003347084.1| hypothetical protein SMAC_05383 [Sordaria macrospora k-hell]
gi|380093778|emb|CCC08742.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 1224
Score = 114 bits (286), Expect = 3e-23, Method: Composition-based stats.
Identities = 67/213 (31%), Positives = 111/213 (52%), Gaps = 23/213 (10%)
Query: 38 KPKAIPYVFIKRNIYLTK-----RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSS 92
+PK Y + +N ++ + + ED C CT G C +DC ++L
Sbjct: 679 QPKPAAYRTMTKNRFIGQAAAIWKKTPHFEDFASKCVCTPEDG----CAQDCQNRVMLYE 734
Query: 93 C-SSGCKCGNS-CLNKPFQ---------NRPVKKMKLVQTEKCGAGIVADEDIKRGEFVI 141
C + C G C N+ FQ R +++ +TE G G+ ++ + + ++
Sbjct: 735 CDETNCNVGKEFCQNRAFQMLTERTKQGGRYRVGVEVFKTEDRGYGVRSNRCFEPHQIIM 794
Query: 142 EYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
EY GE+I D+ CE R+ ++ +YL +++M+IDAT G+ +R++NHSC PN M
Sbjct: 795 EYTGEIITDEECERRM-NEEYKNNECYYLMSFDQNMIIDAT-TGSIARFVNHSCSPNCRM 852
Query: 202 QKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
KWI+ G+ R+ +FA R I+ GE LTYDY ++
Sbjct: 853 IKWIVSGQPRMALFAGDRPIQTGEELTYDYNFD 885
>gi|350855153|emb|CCD58126.1| huntingtin interacting protein-related [Schistosoma mansoni]
Length = 887
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 60/141 (42%), Positives = 82/141 (58%), Gaps = 1/141 (0%)
Query: 93 CSSGCKCGNSCLNKPFQNRPVKKMKLVQTEK-CGAGIVADEDIKRGEFVIEYVGEVIDDQ 151
C C G+ C N+ FQ R + K G G++A +++K+G FVIEYVGEVID
Sbjct: 259 CGLHCAAGDFCSNRQFQMRLYAPTRPFYAGKDKGWGLMATDNVKKGSFVIEYVGEVIDFS 318
Query: 152 TCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETR 211
R+ + + LG + Y + D IDA KGN +R++NHSC PN QKW ++GE R
Sbjct: 319 EFRRRIRRYERLGHAHHYFMAVESDRFIDAGSKGNWARFVNHSCEPNCVTQKWSVNGEIR 378
Query: 212 IGIFATRDIKKGENLTYDYQY 232
IG FA DI G+ +T DYQ+
Sbjct: 379 IGFFAKEDIPSGQEVTIDYQF 399
>gi|328856222|gb|EGG05344.1| hypothetical protein MELLADRAFT_78094 [Melampsora larici-populina
98AG31]
Length = 1098
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 69/211 (32%), Positives = 108/211 (51%), Gaps = 25/211 (11%)
Query: 44 YVFIKRNIYLTKRI-KRRLEDDGIFCSCTASPGSS----------GVCDRD---CHCGML 89
+ +++N Y K + K ++ DD + C C+ S G S + +RD CG L
Sbjct: 190 FDLLEQNWYQFKGLGKTKMLDDMMVCECSWSLGKSCAIALIDPSHMIQERDDPNLACG-L 248
Query: 90 LSSC----------SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEF 139
S C C+C C N+ FQ R +++V TE+ G G+ D+ F
Sbjct: 249 NSGCINYLTQVECLKKECRCLQMCQNQRFQKRQYAPIEIVATERKGFGVRLKSDVPADSF 308
Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
V EY+GEV+ ++ + R+ + G +FY ++ R+ IDAT KG R++NHSC PN
Sbjct: 309 VYEYIGEVVGEKAFQRRIKEYAQEGLKHFYFMQLQREEYIDATKKGGLGRFLNHSCNPNC 368
Query: 200 EMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
+ KW++ R+GIF R +K GE LT++Y
Sbjct: 369 YIGKWVVGRHLRMGIFTKRAVKGGEELTFNY 399
>gi|195435682|ref|XP_002065808.1| GK20252 [Drosophila willistoni]
gi|194161893|gb|EDW76794.1| GK20252 [Drosophila willistoni]
Length = 2294
Score = 114 bits (285), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 67/215 (31%), Positives = 114/215 (53%), Gaps = 18/215 (8%)
Query: 27 PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLED-DGIFCSCTASPGSSGVCDRDCH 85
P+ ++P W Y I+ N+Y + ++ L D C+C S C +C
Sbjct: 1378 PLRHQVPSWN-------YRKIRTNVY-AESVRPNLAGFDHPVCNCKES------CMDNCL 1423
Query: 86 CGMLLSSCS-SGCKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADEDIKRGEFVIEY 143
M+ + CS + C G+ C N+ V ++ T G G+ I +G +++EY
Sbjct: 1424 NRMVYTECSATNCPSGDKCHNQKIHRHEVAPGVERFMTADKGWGVRTKLPIAKGTYILEY 1483
Query: 144 VGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQK 203
VGEV+ ++ ++R+ + +L +T+ Y ++ +VID G+ R++NHSC PN EMQK
Sbjct: 1484 VGEVVTEREFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQK 1542
Query: 204 WIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
W ++G +R+ +FA R I++GE LTYDY + + S
Sbjct: 1543 WSVNGLSRMVLFAKRQIRQGEELTYDYNFSLFNPS 1577
>gi|323451560|gb|EGB07437.1| hypothetical protein AURANDRAFT_27977 [Aureococcus anophagefferens]
Length = 224
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 60/158 (37%), Positives = 92/158 (58%), Gaps = 5/158 (3%)
Query: 80 CDRDC-HCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGE 138
CD DC + M + + C G+ C N+ F R K+ + + +CG G+ A E I G
Sbjct: 37 CDGDCLNRNMEIECDPATCPMGDKCQNRCFAARLGSKVSVEKAGRCGRGLFAREPIPEGA 96
Query: 139 FVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEIN----RDMVIDATYKGNKSRYINHS 194
FV+E +GE+I ++ +ERL + G+ ++Y+ + + +VIDAT KGN+ R+ NHS
Sbjct: 97 FVVEALGELISEEEAQERLATARANGDEHYYMLAASDAATKGLVIDATRKGNEFRWANHS 156
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
C P+ ++KW + R GI A R IK GE LTYDY++
Sbjct: 157 CDPSCRLEKWRCGSQDRYGIVALRSIKPGEELTYDYRW 194
>gi|321460437|gb|EFX71479.1| Mes-4-like protein [Daphnia pulex]
Length = 706
Score = 114 bits (284), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 66/180 (36%), Positives = 94/180 (52%), Gaps = 14/180 (7%)
Query: 72 ASPGSSGVCD-------------RDCHCGMLLSSCSSGCKCGNS-CLNKPFQNRPVKKMK 117
+SP SS +CD +C +LL C G C N+ FQ R +
Sbjct: 349 SSPYSSAICDCEVKEGQPTCGPDSECINRILLFECDPNVCAGRKLCQNQRFQQRLSPPLY 408
Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
T G G+ + DIK+G+FV+EYVGE+ID+ RL + + +Y ++ +
Sbjct: 409 PFNTGGKGWGLKSHADIKKGDFVVEYVGEIIDNNEFRRRLKAKQDAQDEAYYFLTLDNNR 468
Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHD 237
+IDA GN +R++NHSC PN E QKW + G+TRIG+FA DI LT++YQ E D
Sbjct: 469 MIDAGPSGNFARFMNHSCQPNCETQKWTVLGDTRIGLFAVVDIPAHAELTFNYQLECAQD 528
>gi|302307921|ref|NP_984733.2| AEL128Cp [Ashbya gossypii ATCC 10895]
gi|442570024|sp|Q757Y8.2|SET2_ASHGO RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|299789232|gb|AAS52557.2| AEL128Cp [Ashbya gossypii ATCC 10895]
gi|374107951|gb|AEY96858.1| FAEL128Cp [Ashbya gossypii FDAG1]
Length = 684
Score = 114 bits (284), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 102/193 (52%), Gaps = 6/193 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTAS--PGSSGVCDR--DCHCGMLLSSCSSG--C 97
++ +++ Y KR+ ++ + C C + CD DC + L C +
Sbjct: 34 FIELEQCTYTHKRLGDSRSNEFMECDCFEDYKDEQNHACDENSDCINRLTLIECVNDLCT 93
Query: 98 KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
CG+ C N+ FQ + + + QTEK G G+ A+ DI+ EF+ EY+GEVI + +R+
Sbjct: 94 SCGDDCQNQRFQKKEYADIAVFQTEKKGYGVRAERDIEANEFIYEYIGEVISEADFRDRM 153
Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
G +FY + IDAT +G +R+ NHSC PN + KW + G+ ++GIFA
Sbjct: 154 VDYDMRGFKHFYFMMLQAGEFIDATERGCLARFCNHSCNPNAYVSKWDVAGKLKMGIFAH 213
Query: 218 RDIKKGENLTYDY 230
R I KGE +T+DY
Sbjct: 214 RKILKGEEITFDY 226
>gi|195496208|ref|XP_002095596.1| GE22484 [Drosophila yakuba]
gi|194181697|gb|EDW95308.1| GE22484 [Drosophila yakuba]
Length = 2215
Score = 113 bits (283), Expect = 5e-23, Method: Composition-based stats.
Identities = 71/225 (31%), Positives = 113/225 (50%), Gaps = 17/225 (7%)
Query: 28 VEFELPD---WFIKPKAIP---------YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPG 75
++FELP W +P Y I+ N+Y D C+C
Sbjct: 1281 MDFELPYDIWWAYTNSKLPTRNVVPSWNYRKIRTNVYAESVRPNLAGFDHPTCNCKNQGE 1340
Query: 76 SSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADED 133
S C +C M+ + CS S C G C N+ Q V ++ T G G+
Sbjct: 1341 KS--CLDNCLNRMVYTECSPSNCPAGEKCRNQKIQRHAVAPGVERFMTADKGWGVRTKLP 1398
Query: 134 IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
I +G +++EYVGEV+ ++ ++R+ + +L +T+ Y ++ +VID G+ R++NH
Sbjct: 1399 IAKGTYILEYVGEVVTEKEFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNH 1457
Query: 194 SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
SC PN EMQKW ++G +R+ +FA R I++GE LTYDY + + S
Sbjct: 1458 SCEPNCEMQKWSVNGLSRMVLFAKRAIEEGEELTYDYNFSLFNPS 1502
>gi|429858497|gb|ELA33313.1| set and ww domain protein [Colletotrichum gloeosporioides Nara gc5]
Length = 833
Score = 113 bits (283), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 60/172 (34%), Positives = 95/172 (55%), Gaps = 5/172 (2%)
Query: 64 DGIFCSCTA--SPGSSGVC--DRDCHCGMLLSSCSSGC-KCGNSCLNKPFQNRPVKKMKL 118
D + C C + G + C D DC C G CG+ C N+ FQ + + +
Sbjct: 113 DALDCDCRSEWQDGKNYSCGEDSDCINRATRMECVVGTGNCGDGCQNQRFQRKQYANVSV 172
Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
++T+K G G+ A+ D++ +F+ EY+GEVI++ T R+ + G +FY + +
Sbjct: 173 IKTDKKGFGLRANVDMQANDFIFEYIGEVINEPTFRRRMSQYDDEGIKHFYFMSLTKHEF 232
Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
+DAT KGN R+ NHSC PN + KW++ + R+GIF +R IK GE L ++Y
Sbjct: 233 VDATKKGNLGRFCNHSCNPNCYVDKWVVGEKLRMGIFTSRKIKAGEELVFNY 284
>gi|194874189|ref|XP_001973356.1| GG13392 [Drosophila erecta]
gi|190655139|gb|EDV52382.1| GG13392 [Drosophila erecta]
Length = 2215
Score = 113 bits (283), Expect = 6e-23, Method: Composition-based stats.
Identities = 71/225 (31%), Positives = 113/225 (50%), Gaps = 17/225 (7%)
Query: 28 VEFELPD---WFIKPKAIP---------YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPG 75
++FELP W +P Y I+ N+Y D C+C
Sbjct: 1281 MDFELPYDIWWAYTNSKLPTRNVVPSWNYRKIRTNVYAESVRPNLAGFDHPTCNCKNQGE 1340
Query: 76 SSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADED 133
S C +C M+ + CS S C G C N+ Q V ++ T G G+
Sbjct: 1341 KS--CLDNCLNRMVYTECSPSNCPAGEKCRNQKIQRHAVAPGVERFMTADKGWGVRTKLP 1398
Query: 134 IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
I +G +++EYVGEV+ ++ ++R+ + +L +T+ Y ++ +VID G+ R++NH
Sbjct: 1399 IAKGTYILEYVGEVVTEKEFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNH 1457
Query: 194 SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
SC PN EMQKW ++G +R+ +FA R I++GE LTYDY + + S
Sbjct: 1458 SCEPNCEMQKWSVNGLSRMVLFAKRAIEEGEELTYDYNFSLFNPS 1502
>gi|426193608|gb|EKV43541.1| hypothetical protein AGABI2DRAFT_76781 [Agaricus bisporus var.
bisporus H97]
Length = 851
Score = 113 bits (283), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 58/163 (35%), Positives = 94/163 (57%), Gaps = 2/163 (1%)
Query: 83 DCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVI 141
DC + C +G C+C N C N+ Q + +++V TEK G G+ A+ED+ + F+
Sbjct: 133 DCINRLTQVECLAGDCRCRNYCQNQRLQRKEYAPIEIVLTEKKGYGLRAEEDLPKDAFIY 192
Query: 142 EYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
EYVG+V++ + ++R+ + G +FY + +D IDAT G R+ NHSC PN +
Sbjct: 193 EYVGDVVNPISFKKRMREYGQEGIRHFYFMMLQKDEFIDATKNGGIGRFANHSCNPNCYV 252
Query: 202 QKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
KW + + R+GIFA R +KK E LT++Y + + H + +C
Sbjct: 253 AKWTVGDKVRMGIFAKRTVKKHEELTFNYNVDRYGHQAQTCFC 295
>gi|403333784|gb|EJY66014.1| hypothetical protein OXYTRI_13823 [Oxytricha trifallax]
Length = 1936
Score = 113 bits (283), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 72/208 (34%), Positives = 111/208 (53%), Gaps = 19/208 (9%)
Query: 44 YVFIKRNIYLTKR---IKRRLEDDGIFCSCT---------ASPGSSGVCDRDCHCGMLLS 91
++ IK+N YL ++ + + + C+C +P S C C +S
Sbjct: 75 FIPIKKNKYLERKRYVLNEKEASSLLVCNCQKATYKKNEKTNPLQSFNCGERC-INRCVS 133
Query: 92 S--CSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEV-- 147
+ C+ C G C N+ FQ + +TEK G G+ A E I +G F+++YVGEV
Sbjct: 134 TECCAQTCPSGAFCKNRRFQLHQNAYVFPAKTEKKGYGLFAGEFIPKGTFIMQYVGEVFS 193
Query: 148 IDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIID 207
+D ++R+ +++ T YL + VID TY GN +R+INHSC PN E QKW +
Sbjct: 194 VDTDLGQQRV--LEYRKSTCTYLMRTTNNEVIDPTYVGNVARFINHSCEPNCETQKWNVL 251
Query: 208 GETRIGIFATRDIKKGENLTYDYQYEFL 235
GE +GIF+ RDI + E L++DYQ++F
Sbjct: 252 GEVCVGIFSLRDIHENEELSFDYQFDFF 279
>gi|409074525|gb|EKM74921.1| hypothetical protein AGABI1DRAFT_123473 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 881
Score = 113 bits (283), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 58/163 (35%), Positives = 94/163 (57%), Gaps = 2/163 (1%)
Query: 83 DCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVI 141
DC + C +G C+C N C N+ Q + +++V TEK G G+ A+ED+ + F+
Sbjct: 155 DCINRLTQVECLAGDCRCRNYCQNQRLQRKEYAPIEIVLTEKKGYGLRAEEDLPKDAFIY 214
Query: 142 EYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
EYVG+V++ + ++R+ + G +FY + +D IDAT G R+ NHSC PN +
Sbjct: 215 EYVGDVVNPISFKKRMREYGQEGIRHFYFMMLQKDEFIDATKNGGIGRFANHSCNPNCYV 274
Query: 202 QKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
KW + + R+GIFA R +KK E LT++Y + + H + +C
Sbjct: 275 AKWTVGDKVRMGIFAKRTVKKHEELTFNYNVDRYGHQAQTCFC 317
>gi|395325049|gb|EJF57478.1| hypothetical protein DICSQDRAFT_111705 [Dichomitus squalens
LYAD-421 SS1]
Length = 866
Score = 113 bits (283), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 72/212 (33%), Positives = 106/212 (50%), Gaps = 16/212 (7%)
Query: 44 YVFIKRNIYLTKRI-KRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC--------- 93
+V I N Y + K R + + C C GV D D CG L S C
Sbjct: 109 FVEIPNNHYQYGTLGKSREALESMTCDCQYE---HGVDDPDMACG-LSSDCINLMTQVEC 164
Query: 94 -SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
C+C C N+ FQ + +++V+TE G G+ A DI + F+ EYVG+V+ +
Sbjct: 165 LPDDCRCRGYCRNQRFQRKQYAPIEIVKTEMKGFGLRAAADIPKDTFIYEYVGDVVSQPS 224
Query: 153 CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
+R+ + G +FY + +D IDAT +G R+ NHSC PN + KW + R+
Sbjct: 225 FIKRMRQYAEEGIRHFYFMMLQKDEFIDATKRGGIGRFANHSCNPNCYVAKWTVGSHVRM 284
Query: 213 GIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
GIFA R IKK E LT++Y + + H++ YC
Sbjct: 285 GIFANRVIKKDEELTFNYNVDRYGHEAQPCYC 316
>gi|323451944|gb|EGB07819.1| hypothetical protein AURANDRAFT_6525 [Aureococcus anophagefferens]
Length = 160
Score = 113 bits (283), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 53/140 (37%), Positives = 87/140 (62%), Gaps = 4/140 (2%)
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G+ C N+ F + K+ + + ++CG G+ A E I + FV+E +GE+I ++ +ER
Sbjct: 8 CPMGDKCQNRCFAAKLGSKVSVEKADQCGWGLFAGEPIAKRAFVVEALGELISEEEAQER 67
Query: 157 LWKMKHLGETNFYLCEIN----RDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
L + G+ ++Y+ + + +VIDAT KGN++R+ NHSC P+ ++KW + R
Sbjct: 68 LATARANGDEDYYMLAASDAATKGLVIDATRKGNEARFANHSCDPSCRLEKWRCGSQDRY 127
Query: 213 GIFATRDIKKGENLTYDYQY 232
GIFA R +K GE LTYDY++
Sbjct: 128 GIFALRSVKPGEQLTYDYRW 147
>gi|194751483|ref|XP_001958056.1| GF10724 [Drosophila ananassae]
gi|190625338|gb|EDV40862.1| GF10724 [Drosophila ananassae]
Length = 2257
Score = 113 bits (283), Expect = 7e-23, Method: Composition-based stats.
Identities = 70/225 (31%), Positives = 111/225 (49%), Gaps = 17/225 (7%)
Query: 28 VEFELPD---WFIKPKAIP---------YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPG 75
++FELP W +P Y I+ N+Y D C+C
Sbjct: 1310 IDFELPYDIWWAYTNSKLPTRHVVPSWNYRKIRTNVYAESVRPNLAGFDHPTCNCKNH-- 1367
Query: 76 SSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADED 133
C +C M+ + CS S C C N+ Q V ++ TE G G+
Sbjct: 1368 GEIACQDNCLNRMVYTECSPSNCPAAEKCRNQKIQRHEVAPGVERFMTEDKGWGVRTKLP 1427
Query: 134 IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
I +G +++EYVGEV+ ++ ++R+ + +L +T+ Y ++ +VID G+ R++NH
Sbjct: 1428 IAKGTYILEYVGEVVTEREFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNH 1486
Query: 194 SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
SC PN EMQKW ++G +R+ +FA R I +GE LTYDY + + S
Sbjct: 1487 SCEPNCEMQKWSVNGLSRMVLFAKRAIAQGEELTYDYNFSLFNPS 1531
>gi|195354202|ref|XP_002043588.1| GM17444 [Drosophila sechellia]
gi|194127756|gb|EDW49799.1| GM17444 [Drosophila sechellia]
Length = 2218
Score = 113 bits (283), Expect = 7e-23, Method: Composition-based stats.
Identities = 71/225 (31%), Positives = 113/225 (50%), Gaps = 17/225 (7%)
Query: 28 VEFELPD---WFIKPKAIP---------YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPG 75
++FELP W +P Y I+ N+Y D C+C
Sbjct: 1283 MDFELPYDIWWAYTNSKLPTRNVVPSWNYRKIRTNVYAESVRPNLAGFDHPTCNCKDQGE 1342
Query: 76 SSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADED 133
S C +C M+ + CS S C G C N+ Q V ++ T G G+
Sbjct: 1343 KS--CLDNCLNRMVYTECSPSNCPAGEKCRNQKIQRHAVAPGVERFMTADKGWGVRTKLP 1400
Query: 134 IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
I +G +++EYVGEV+ ++ ++R+ + +L +T+ Y ++ +VID G+ R++NH
Sbjct: 1401 IAKGTYILEYVGEVVTEKEFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNH 1459
Query: 194 SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
SC PN EMQKW ++G +R+ +FA R I++GE LTYDY + + S
Sbjct: 1460 SCEPNCEMQKWSVNGLSRMVLFAKRAIEEGEELTYDYNFSLFNPS 1504
>gi|386771427|ref|NP_524160.2| absent, small, or homeotic discs 1, isoform B [Drosophila
melanogaster]
gi|386771429|ref|NP_001246834.1| absent, small, or homeotic discs 1, isoform C [Drosophila
melanogaster]
gi|118572535|sp|Q9VW15.3|ASH1_DROME RecName: Full=Histone-lysine N-methyltransferase ash1; AltName:
Full=Absent small and homeotic disks protein 1; AltName:
Full=Lysine N-methyltransferase 2H
gi|383292012|gb|AAF49140.3| absent, small, or homeotic discs 1, isoform B [Drosophila
melanogaster]
gi|383292013|gb|AFH04505.1| absent, small, or homeotic discs 1, isoform C [Drosophila
melanogaster]
Length = 2226
Score = 113 bits (283), Expect = 7e-23, Method: Composition-based stats.
Identities = 71/225 (31%), Positives = 113/225 (50%), Gaps = 17/225 (7%)
Query: 28 VEFELPD---WFIKPKAIP---------YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPG 75
++FELP W +P Y I+ N+Y D C+C
Sbjct: 1292 MDFELPYDIWWAYTNSKLPTRNVVPSWNYRKIRTNVYAESVRPNLAGFDHPTCNCKNQGE 1351
Query: 76 SSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADED 133
S C +C M+ + CS S C G C N+ Q V ++ T G G+
Sbjct: 1352 KS--CLDNCLNRMVYTECSPSNCPAGEKCRNQKIQRHAVAPGVERFMTADKGWGVRTKLP 1409
Query: 134 IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
I +G +++EYVGEV+ ++ ++R+ + +L +T+ Y ++ +VID G+ R++NH
Sbjct: 1410 IAKGTYILEYVGEVVTEKEFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNH 1468
Query: 194 SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
SC PN EMQKW ++G +R+ +FA R I++GE LTYDY + + S
Sbjct: 1469 SCEPNCEMQKWSVNGLSRMVLFAKRAIEEGEELTYDYNFSLFNPS 1513
>gi|449300569|gb|EMC96581.1| hypothetical protein BAUCODRAFT_481348 [Baudoinia compniacensis
UAMH 10762]
Length = 992
Score = 113 bits (283), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 53/150 (35%), Positives = 85/150 (56%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C++ C CG +C N+ F + + +++TEK G G+ + ++K +F+
Sbjct: 177 DSDCINRATKMECAADCGCGAACQNQRFLRKQFADVTVIKTEKKGYGLRTNTNLKANDFI 236
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GEVI + R+ + G +FY + + +DAT KGN R+ NHSC PN
Sbjct: 237 FEYIGEVIGENVFRRRMQQYDEEGIKHFYFMSLTKGEFVDATKKGNLGRFCNHSCNPNCY 296
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDY 230
+ KW++ + R+GIFA R I+ GE L ++Y
Sbjct: 297 VDKWVVGDKLRMGIFAERKIQAGEELVFNY 326
>gi|356495442|ref|XP_003516586.1| PREDICTED: histone-lysine N-methyltransferase SUVR5-like [Glycine
max]
Length = 1492
Score = 113 bits (283), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 68/166 (40%), Positives = 93/166 (56%), Gaps = 19/166 (11%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
G L+ C+ C+C SC N+ QN K+++ +TEK G + A E I RG FV EY+GE
Sbjct: 1310 GYLVYECNHMCRCNKSCPNRVLQNGVRVKLEVFKTEKKGWAVRAGEAILRGTFVCEYIGE 1369
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEIN---RDM----------VIDATYKGNKSRYINH 193
V+D Q +R + ++ E YL +I+ DM VIDAT GN SR+INH
Sbjct: 1370 VLDVQEARDR--RKRYGAEHCSYLYDIDARVNDMGRLIEEQAQYVIDATKFGNVSRFINH 1427
Query: 194 SCCPNTEMQKWIIDG----ETRIGIFATRDIKKGENLTYDYQYEFL 235
SC PN + +++ IG +A+RDI GE LTYDYQYE +
Sbjct: 1428 SCSPNLVNHQVLVESMDCERAHIGFYASRDIALGEELTYDYQYELM 1473
>gi|1335892|gb|AAB01100.1| ASH1 [Drosophila melanogaster]
Length = 2210
Score = 113 bits (282), Expect = 8e-23, Method: Composition-based stats.
Identities = 71/225 (31%), Positives = 113/225 (50%), Gaps = 17/225 (7%)
Query: 28 VEFELPD---WFIKPKAIP---------YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPG 75
++FELP W +P Y I+ N+Y D C+C
Sbjct: 1276 MDFELPYDIWWAYTNSKLPTRNVVPSWNYRKIRTNVYAESVRPNLAGFDHPTCNCKNQGE 1335
Query: 76 SSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADED 133
S C +C M+ + CS S C G C N+ Q V ++ T G G+
Sbjct: 1336 KS--CLDNCLNRMVYTECSPSNCPAGEKCRNQKIQRHAVAPGVERFMTADKGWGVRTKLP 1393
Query: 134 IKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINH 193
I +G +++EYVGEV+ ++ ++R+ + +L +T+ Y ++ +VID G+ R++NH
Sbjct: 1394 IAKGTYILEYVGEVVTEKEFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNH 1452
Query: 194 SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
SC PN EMQKW ++G +R+ +FA R I++GE LTYDY + + S
Sbjct: 1453 SCEPNCEMQKWSVNGLSRMVLFAKRAIEEGEELTYDYNFSLFNPS 1497
>gi|367016577|ref|XP_003682787.1| hypothetical protein TDEL_0G02090 [Torulaspora delbrueckii]
gi|359750450|emb|CCE93576.1| hypothetical protein TDEL_0G02090 [Torulaspora delbrueckii]
Length = 687
Score = 113 bits (282), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 70/193 (36%), Positives = 108/193 (55%), Gaps = 6/193 (3%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTA--SPGSSGVCDRDCHC--GMLLSSCSSGC-- 97
+V + +Y +KR+ ++ + C C S G + CD D C + L C +G
Sbjct: 35 FVSLNECLYASKRLGSAKNNEFMECDCFEDFSEGVNRACDEDSDCINRLTLIECVNGLCG 94
Query: 98 KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
CG+ C N+ FQ R + + QTE G G+ A+ DI+ +F+ EY+GEVI+++ ERL
Sbjct: 95 SCGDDCQNQRFQKRQYADIAVFQTEMKGYGVRAETDIEAHQFIYEYMGEVIEEEEFRERL 154
Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFAT 217
+ +FY + IDAT KG+ +R+ NHSC PN + KW++ G+ ++GIFA
Sbjct: 155 VEYDQKKLKHFYFMMLQNGEFIDATMKGSLARFCNHSCNPNAYVNKWVVAGKLKMGIFAN 214
Query: 218 RDIKKGENLTYDY 230
R I KGE +T+DY
Sbjct: 215 RKILKGEEITFDY 227
>gi|325180900|emb|CCA15310.1| histonelysine Nmethyltransferase putative [Albugo laibachii Nc14]
Length = 1258
Score = 113 bits (282), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 72/222 (32%), Positives = 110/222 (49%), Gaps = 26/222 (11%)
Query: 30 FELP-DWFIK--PKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC 86
F +P D+ ++ + PY + RN+Y K+++ ED C CT C C
Sbjct: 542 FRIPADYLVRVHTRPPPYRKLTRNLYTFKQLRPPFEDVPT-CVCTER------CGDGCIN 594
Query: 87 GMLLSSC--------------SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADE 132
+ + C + C G C N+ R + +L + G + A E
Sbjct: 595 RLSFTECFGPAPTEESTRNNQTFNCHVGEHCGNRALHQRVYPRTELFHSFDKGFALRAKE 654
Query: 133 DIKRGEFVIEYVGEVIDDQTCEERLWK--MKHLGETNFYLCEINRDMVIDATYKGNKSRY 190
I+ G+ V+EYVGEVI++Q + RL + KH + N Y+ E+ + IDA ++G+ SR+
Sbjct: 655 SIQAGQLVMEYVGEVINEQEKDRRLEEHARKHPQDRNMYIMELGNQIYIDARFRGSVSRF 714
Query: 191 INHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
INHSC PN + KW + RI I A RDIK E L+YDY++
Sbjct: 715 INHSCDPNCHLVKWRVCDLDRIAISALRDIKPEEELSYDYRF 756
>gi|299741036|ref|XP_001834167.2| histone-lysine N-methyltransferase [Coprinopsis cinerea
okayama7#130]
gi|298404525|gb|EAU87663.2| histone-lysine N-methyltransferase [Coprinopsis cinerea
okayama7#130]
Length = 826
Score = 113 bits (282), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 55/148 (37%), Positives = 87/148 (58%), Gaps = 1/148 (0%)
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C+C + C N+ FQ R +++V TEK G G+ A++DI + F+ EYVG+V+ + ++R
Sbjct: 175 CRCRSHCQNQRFQKRQYANIEIVLTEKKGYGLRAEDDIPKDSFIYEYVGDVVSPNSFKKR 234
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
+ + G +FY + +D IDAT G R+ NHSC PN + KW + R+GIF+
Sbjct: 235 MREYAEEGIQHFYFMMLQKDEFIDATKSGGIGRFANHSCNPNCYVAKWTVGDHVRMGIFS 294
Query: 217 TRDIKKGENLTYDYQYE-FLHDSLIAYC 243
R I+K E LT++Y + + H + YC
Sbjct: 295 KRLIRKHEELTFNYNVDRYGHQAQTCYC 322
>gi|358396308|gb|EHK45689.1| hypothetical protein TRIATDRAFT_241288 [Trichoderma atroviride IMI
206040]
Length = 931
Score = 112 bits (281), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 57/174 (32%), Positives = 96/174 (55%), Gaps = 5/174 (2%)
Query: 62 EDDGIFCSCTAS--PGSSGVCDRDCHC---GMLLSSCSSGCKCGNSCLNKPFQNRPVKKM 116
++D + C C G + C + C + ++ CG C N+ FQ + +
Sbjct: 119 DNDSLDCDCREEWRDGENFACGEESDCINRATKMECSANAGNCGGGCQNRRFQRKQYADV 178
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+++TEK G G+ A+ ++ +F+ EY+GEVI++ T R+ + G +FY +N++
Sbjct: 179 TVIKTEKKGFGLRANAMLEPNDFIYEYIGEVINEPTFRRRMLQYDDEGIKHFYFMSLNKN 238
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
+DAT KGN R+ NHSC PN + KW++ + R+GIFA R I+ GE L ++Y
Sbjct: 239 EFVDATKKGNLGRFCNHSCNPNCFVDKWVVGDKLRMGIFALRKIQAGEELVFNY 292
>gi|407926722|gb|EKG19683.1| WW/Rsp5/WWP [Macrophomina phaseolina MS6]
Length = 972
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 59/164 (35%), Positives = 90/164 (54%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C C CG C N+ F + + + +T+K G G+ A+ D+K +FV
Sbjct: 175 DSDCINRATKMECVGDCGCGPDCQNQRFLRKQYADVTVFRTDKKGYGLRANTDLKPNDFV 234
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GEVI+++ R+ + G +FY +++ +DAT KGN R+ NHSC PN
Sbjct: 235 YEYIGEVINERAFRGRMVQYDEEGIKHFYFMSLSKGEFVDATKKGNLGRFCNHSCNPNCF 294
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R IK GE L ++Y + + D YC
Sbjct: 295 VDKWVVGEKLRMGIFAERHIKAGEELVFNYNVDRYGADPQPCYC 338
>gi|378732919|gb|EHY59378.1| histone-lysine N-methyltransferase SETD2 [Exophiala dermatitidis
NIH/UT8656]
Length = 1034
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 59/164 (35%), Positives = 91/164 (55%), Gaps = 1/164 (0%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D DC C+ C CG +C N+ F R + +++TEK G G+ + D++ +F+
Sbjct: 193 DSDCINRACKMECAKDCGCGPACQNQRFLRRLWANVSVIKTEKKGYGLRTNVDLRPHDFI 252
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EY+GE I + T +R+ + G +FY +++ IDAT KGN R+ NHSC PN
Sbjct: 253 FEYIGETIPENTFRKRMRQYDEEGIKHFYFMSLSKGEFIDATKKGNLGRFCNHSCNPNCY 312
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
+ KW++ + R+GIFA R IK GE L ++Y + + D YC
Sbjct: 313 VDKWVVGDKLRMGIFAERYIKAGEELVFNYNVDRYGADPQPCYC 356
>gi|242813250|ref|XP_002486129.1| histone-lysine N-methyltransferase (Ash1), putative [Talaromyces
stipitatus ATCC 10500]
gi|218714468|gb|EED13891.1| histone-lysine N-methyltransferase (Ash1), putative [Talaromyces
stipitatus ATCC 10500]
Length = 865
Score = 112 bits (281), Expect = 1e-22, Method: Composition-based stats.
Identities = 64/210 (30%), Positives = 110/210 (52%), Gaps = 20/210 (9%)
Query: 38 KPKAIPYVFIKRNIYLTKRIKRRLEDDGI---FCSCTASPGSSGVCDRDCHCGMLLSSC- 93
+PK + + RN+++ E+ I C C+ G CD DC + C
Sbjct: 420 QPKPDEWKKVNRNVFVGDAASFWRENKKIELSTCLCSEETG----CDEDCQNRFMFYECD 475
Query: 94 SSGCKCGNSCLNKPFQNRPVKK---------MKLVQTEKCGAGIVADEDIKRGEFVIEYV 144
SS C+ G +C N+ F+ + +++++T G G+ ++ + + ++EY
Sbjct: 476 SSNCRVGPNCGNRSFEELKQRTKAGGKYNVGVEVIKTADRGYGVRSNRTFEPNQVIVEYT 535
Query: 145 GEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKW 204
GE+I CE R+ + E +YL +++M+IDAT +G+ +R++NHSC PN M+KW
Sbjct: 536 GEIITQSECERRMRTVYKNNEC-YYLMYFDQNMIIDAT-RGSIARFVNHSCAPNCRMEKW 593
Query: 205 IIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
+ G+ R+ +FA R I GE LTYDY ++
Sbjct: 594 TVGGKPRMALFAGDRGIMTGEELTYDYNFD 623
>gi|260830013|ref|XP_002609956.1| hypothetical protein BRAFLDRAFT_124382 [Branchiostoma floridae]
gi|229295318|gb|EEN65966.1| hypothetical protein BRAFLDRAFT_124382 [Branchiostoma floridae]
Length = 902
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 76/230 (33%), Positives = 114/230 (49%), Gaps = 18/230 (7%)
Query: 16 AFNKLLKQIGNPVEFELPD---WFIKP-KAIP-------YVFIKRNIYLTKRIKRRLEDD 64
A K L+Q ++F++P W K K IP Y I+ NI++ +K +
Sbjct: 6 AAGKYLRQ--KRIDFQMPYDIWWQYKHNKLIPKIDERTKYKKIRSNIFVD--VKPLSGCE 61
Query: 65 GIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK-MKLVQTE 122
+ C+C + C DC M CS C + C N+ Q ++ + T+
Sbjct: 62 PVVCTCVRPAQADKACQEDCLNRMSFMECSPNTCPYADQCANQRIQRHEWDPGLERIVTK 121
Query: 123 KCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDAT 182
G G+ + I +G F++EYVGEV+ +Q R ++ H + Y ++ VID
Sbjct: 122 DRGYGVRSKTPIPQGNFILEYVGEVVSEQEFRRRTVEIYH-DHNHHYCLNLHSGAVIDGY 180
Query: 183 YKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
G + R++NHSC PN EMQKW ++G RIG+FA RDI GE LTYDY +
Sbjct: 181 KYGCEGRFVNHSCEPNCEMQKWSVNGVYRIGLFALRDIPAGEELTYDYNF 230
>gi|93003038|tpd|FAA00102.1| TPA: zinc finger protein [Ciona intestinalis]
Length = 883
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 76/235 (32%), Positives = 120/235 (51%), Gaps = 21/235 (8%)
Query: 15 HAFNKLLKQIGNPVEFELP--DWF------IKPK---AIPYVFIKRNIYLTKRIKRRLED 63
HA N LLK + F+LP W+ I P A ++ I++N+Y+ + E+
Sbjct: 7 HAGNFLLKSRKD---FQLPFNIWWMYNRKLISPSQDLATQFIKIEKNVYVDSQPTCEQEE 63
Query: 64 DGIFCSCTAS--PGSSGV--CDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPV-KKMK 117
C + SS V C ++C ++ CS C C + C N+ Q + K ++
Sbjct: 64 HVCVCQTLSDIHSLSSDVHGCGKECLNRLMYIECSPDTCPCQDKCANRCIQKQQWWKDLE 123
Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
+T G G+ + DI G+F++EYVGEV+ ++ R + + + Y ++
Sbjct: 124 RFRTNDRGWGVRTNSDIPEGQFLLEYVGEVVSEREFRRRTIE-NYNAHNDHYCVQLEAGT 182
Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
VID N+ R++NHSC PN EMQKW+++GE R+G+FA R I E LTYDY +
Sbjct: 183 VIDGYRLANEGRFVNHSCQPNCEMQKWVVNGEYRVGLFAKRPIVSSEELTYDYNF 237
>gi|299116101|emb|CBN74517.1| Possible Cdc2-related protein kinase [Ectocarpus siliculosus]
Length = 1898
Score = 112 bits (279), Expect = 2e-22, Method: Composition-based stats.
Identities = 52/172 (30%), Positives = 91/172 (52%), Gaps = 6/172 (3%)
Query: 68 CSCT---ASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKC 124
C+CT P + D CH + + C +C C N+ Q R ++++ + +
Sbjct: 225 CNCTPVGGDPNAPTCYDDQCHNYVTMQECRKD-QCHRGCRNQRIQKRENAQVEVFKADGK 283
Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
G G+ E + +G+F+ EYVGE+I + +R+ + G Y+ ++ D +DA K
Sbjct: 284 GMGLKVVEPVSKGQFIAEYVGEIITRKELNKRM--ISSAGTRKLYMMQLGDDTYLDAKRK 341
Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLH 236
G +R++NHSC P +++W G+ R +F+ R +K GE L++DYQ+E H
Sbjct: 342 GGIARFVNHSCEPTCRLEQWTAMGQPRCAVFSLRAMKAGEELSFDYQWEAHH 393
>gi|426249978|ref|XP_004018720.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Ovis aries]
Length = 1669
Score = 112 bits (279), Expect = 2e-22, Method: Composition-based stats.
Identities = 66/183 (36%), Positives = 98/183 (53%), Gaps = 11/183 (6%)
Query: 36 FIKPKAIPYVF--IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSG-------VCDRDCHC 86
+ K +P F I+ N+YLT+R K + D C +P S C DC
Sbjct: 1447 YAKQGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCECTPLSKDERAQGEIACGEDCLN 1506
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
+L+ CSS C G+ C N+ FQ + ++++ TEK G G+ A +D+ FV+EY GE
Sbjct: 1507 RLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPSNTFVLEYCGE 1566
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII 206
V+D + + R+ + ++Y + D +IDAT KGN SR++NHSC PN E QK +
Sbjct: 1567 VLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQK--V 1624
Query: 207 DGE 209
DGE
Sbjct: 1625 DGE 1627
>gi|440633321|gb|ELR03240.1| hypothetical protein GMDG_01223 [Geomyces destructans 20631-21]
Length = 685
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 69/210 (32%), Positives = 113/210 (53%), Gaps = 19/210 (9%)
Query: 46 FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS-C 103
FI L K+ K D C C+ S G C DC+ M+L C G C G C
Sbjct: 452 FIGDAAALWKKTKH-FSDSDSRCICSPSTG----CGEDCYNRMMLYECDDGNCPLGAELC 506
Query: 104 LNKPFQNRPVKKMK---------LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCE 154
N+ F + ++ K +++TE CG G+ A+ + G+ ++EY GE+I + C+
Sbjct: 507 GNRAFADLHERRAKGGKYRVGVEVIKTEDCGYGVRANRCFQEGQIIVEYTGEIITEPECQ 566
Query: 155 ERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGI 214
R+ + + +YL +++M+IDAT +G+ +R++NHSC PN EM KWI+ G+ + +
Sbjct: 567 RRM-REDYKNNECYYLMLFDQNMIIDAT-RGSIARFVNHSCEPNCEMVKWIVGGKPHMAL 624
Query: 215 FATRD-IKKGENLTYDYQYEFLHDSLIAYC 243
FA ++ I GE LTYDY+++ + + C
Sbjct: 625 FAGKNPIMTGEELTYDYKFDPISTRNVQEC 654
>gi|348667102|gb|EGZ06928.1| hypothetical protein PHYSODRAFT_565808 [Phytophthora sojae]
Length = 1540
Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats.
Identities = 61/154 (39%), Positives = 88/154 (57%), Gaps = 5/154 (3%)
Query: 93 CSSGCKCGNSCLNKPFQN-RPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQ 151
CS C G C N+ Q+ +++ +TE G G+ + I + V EYVGE+ID +
Sbjct: 114 CSPSCAAGQYCRNQRLQHPERYPQLEPFKTEHKGYGVRTRQHIGQLSIVGEYVGEIIDQK 173
Query: 152 TCEERLWKM-KHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGET 210
RL + +H E NFY + + IDA KG+ +R++NHSC PN + +KW + GET
Sbjct: 174 ELARRLKSVPRH--ELNFYYLLLAPGVYIDARNKGSFTRFVNHSCEPNCKTEKWTVKGET 231
Query: 211 RIGIFATRDIKKGENLTYDYQYEFLHDSLI-AYC 243
RI + A RDI+ GE LT+DYQ++ L I YC
Sbjct: 232 RIAVIALRDIEVGEELTFDYQWKALGSRQIKCYC 265
>gi|212544714|ref|XP_002152511.1| histone-lysine N-methyltransferase (Ash1), putative [Talaromyces
marneffei ATCC 18224]
gi|210065480|gb|EEA19574.1| histone-lysine N-methyltransferase (Ash1), putative [Talaromyces
marneffei ATCC 18224]
Length = 865
Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats.
Identities = 64/210 (30%), Positives = 108/210 (51%), Gaps = 20/210 (9%)
Query: 38 KPKAIPYVFIKRNIYLTKRIKRRLEDDGI---FCSCTASPGSSGVCDRDCHCGMLLSSC- 93
+PK + RN+++ E+ I C C G CD DC + C
Sbjct: 423 QPKPDEWKKTNRNVFVGDAASFWRENKKIELSTCLCEEETG----CDEDCQNRFMFYECD 478
Query: 94 SSGCKCGNSCLNKPFQNRPVKK---------MKLVQTEKCGAGIVADEDIKRGEFVIEYV 144
SS C+ G +C N+ F+ + +++++T G G+ ++ + + ++EY
Sbjct: 479 SSNCRVGPNCGNRSFEELKQRTKAGGKYNIGVEVIKTADRGYGVRSNRTFEPNQIIVEYT 538
Query: 145 GEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKW 204
GE+I CE R+ + E +YL +++M+IDAT +G+ +R++NHSC PN M+KW
Sbjct: 539 GEIITQNECERRMRSVYKNNEC-YYLMYFDQNMIIDAT-RGSIARFVNHSCAPNCRMEKW 596
Query: 205 IIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
+ G+ R+ +FA R I GE LTYDY ++
Sbjct: 597 TVGGKPRMALFAGDRGIMTGEELTYDYNFD 626
>gi|449544167|gb|EMD35141.1| hypothetical protein CERSUDRAFT_54162 [Ceriporiopsis subvermispora
B]
Length = 788
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 72/214 (33%), Positives = 108/214 (50%), Gaps = 20/214 (9%)
Query: 44 YVFIKRNIYLTKRIKRRLED-DGIFCSCTASPGSSGVCDRDCHCG---------MLLSSC 93
+V IK N Y + R E + + C C GV D + CG +
Sbjct: 45 FVEIKGNHYQYGTLGRSREALESMTCDCQYE---HGVDDEEMACGHSSDCINRLTQVECL 101
Query: 94 SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTC 153
C+C + C N+ FQ + +++V+TEK G G+ A D+++ F+ EYVG+V+ +
Sbjct: 102 PDDCRCRSYCQNQRFQRKEYADIEVVKTEKKGFGLRAGADLRKDTFIYEYVGDVVSQPSF 161
Query: 154 EERLWKMKHLGETN---FYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGET 210
L +M+ E N FY + +D IDAT +G R+ NHSC PN + KW I
Sbjct: 162 ---LKRMRQYAEENIRHFYFMMLQKDEFIDATKRGGIGRFANHSCNPNCYVAKWTIGEHV 218
Query: 211 RIGIFATRDIKKGENLTYDYQYE-FLHDSLIAYC 243
R+GIFA R IKK E LT++Y + + +D+ YC
Sbjct: 219 RMGIFANRYIKKDEELTFNYNVDRYGNDAQPCYC 252
>gi|356540712|ref|XP_003538829.1| PREDICTED: histone-lysine N-methyltransferase SUVR5-like [Glycine
max]
Length = 1545
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 66/164 (40%), Positives = 87/164 (53%), Gaps = 15/164 (9%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
G L+ C+ C+C SC N+ QN K+++ +TEK G + A E I RG FV EY+GE
Sbjct: 1363 GYLVYECNHMCRCNKSCPNRVLQNGVRVKLEVFKTEKKGWAVRAGEAILRGTFVCEYIGE 1422
Query: 147 VIDDQTCEERLWKMKHLGETNFY-----------LCEINRDMVIDATYKGNKSRYINHSC 195
V+D Q R + + FY L E VID+T GN SR+INHSC
Sbjct: 1423 VLDVQEARNRRKRYGTEHCSYFYDIDARVNDIGRLIEGQAQYVIDSTKFGNVSRFINHSC 1482
Query: 196 CPNTEMQKWIIDG----ETRIGIFATRDIKKGENLTYDYQYEFL 235
PN + I++ IG +A+RDI GE LTYDYQYE +
Sbjct: 1483 SPNLVNHQVIVESMDCERAHIGFYASRDITLGEELTYDYQYELM 1526
>gi|299469716|emb|CBN76570.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 2317
Score = 111 bits (277), Expect = 3e-22, Method: Composition-based stats.
Identities = 65/193 (33%), Positives = 98/193 (50%), Gaps = 8/193 (4%)
Query: 47 IKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSS-----GCK--C 99
I N+YL+K+ + +G C C S C C C+ G K C
Sbjct: 391 ITSNVYLSKKPPAKDVGEGHRCGCVPPSDGSAGCTEACSNRATFEECTKRTCGLGGKGVC 450
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
GN + + Q +K+ + G G+VA I G + EYVGEV+ + + R
Sbjct: 451 GNRKIQRYKQEYMRRKVCIKDVGPKGIGLVAKTRIPAGALIGEYVGEVLSEDDWQAR-QA 509
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
+ H E + ++ ++ VIDA+ KG+ R++NHSC PN E QKW+I G+ RIG+FAT
Sbjct: 510 LLHANEKHKFVMDLGESEVIDASQKGSILRFVNHSCGPNAETQKWMIQGKRRIGLFATEV 569
Query: 220 IKKGENLTYDYQY 232
I+KG +T++Y Y
Sbjct: 570 IEKGVEVTFNYCY 582
>gi|406606267|emb|CCH42258.1| Histone-lysine N-methyltransferase [Wickerhamomyces ciferrii]
Length = 1074
Score = 111 bits (277), Expect = 3e-22, Method: Composition-based stats.
Identities = 83/246 (33%), Positives = 122/246 (49%), Gaps = 30/246 (12%)
Query: 19 KLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNI----YLTKRIKRRLEDDGI-------F 67
K LK + +F+LP W I P + +RNI + KR + LE+ I
Sbjct: 138 KGLKVLNTVQDFKLP-WNI---YCPTIDGERNIVKWKHGLKRNQWLLENPFIAKKQVNEV 193
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNS-CLNKPFQN-----RPVKKM----K 117
CSCT S +R + + CK G+ C N+ F + R K +
Sbjct: 194 CSCTGVRCGSTCLNRSVQ----IECTPNNCKFGDQDCGNRAFADLMNAYRDHSKYAFGCE 249
Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
++QT+K G G+++ G V+EY GEVI E RL + ++ +Y + +
Sbjct: 250 VLQTDKKGCGLLSIRSFNAGSLVVEYTGEVIHLDEVEHRLNTIYKESDS-YYFLGLEEEY 308
Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHD 237
VIDA KG+ +R+ NHSC PN EMQKW ++GE RIG+FA R I+ GE +TYDY +E+ +
Sbjct: 309 VIDAGQKGSVARFANHSCDPNAEMQKWYVNGEPRIGLFAKRSIEAGEEITYDYNFEWFEN 368
Query: 238 SLIAYC 243
C
Sbjct: 369 GEPQKC 374
>gi|328709442|ref|XP_001950412.2| PREDICTED: hypothetical protein LOC100165448 isoform 1
[Acyrthosiphon pisum]
gi|328709444|ref|XP_003243962.1| PREDICTED: hypothetical protein LOC100165448 isoform 2
[Acyrthosiphon pisum]
Length = 1506
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 60/171 (35%), Positives = 94/171 (54%), Gaps = 4/171 (2%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRP-VKKMKLVQTEKCG 125
C+C P C DC M+ + CS C C C N+ Q + + TE+ G
Sbjct: 734 CNCVV-PTEGKGCTDDCINRMIFAECSPELCPCKEKCSNQRLQTQQWAPGLVKFMTEEKG 792
Query: 126 AGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKG 185
GI E++K GE ++EYVGEV+ +QT ++R+ + + + + Y +++ VID G
Sbjct: 793 WGIKTTEEVKSGELLLEYVGEVVSEQTFKDRMTSI-YKNDVHHYCLKLDGGSVIDGHRMG 851
Query: 186 NKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLH 236
++R++NHSC PN EMQKW ++G R+ +FA R I+ E L YDY + +
Sbjct: 852 GEARFVNHSCEPNCEMQKWSVNGLFRMALFALRKIQHDEELCYDYNFSLFN 902
>gi|195173304|ref|XP_002027432.1| GL20883 [Drosophila persimilis]
gi|194113284|gb|EDW35327.1| GL20883 [Drosophila persimilis]
Length = 2266
Score = 110 bits (275), Expect = 5e-22, Method: Composition-based stats.
Identities = 66/214 (30%), Positives = 106/214 (49%), Gaps = 12/214 (5%)
Query: 27 PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC 86
P +P W Y I+ N+Y D C+C C +C
Sbjct: 1350 PTRHIVPSW-------NYRKIRTNVYAESVRPNLAGFDHPTCNCKNQ--GEKACLDNCLN 1400
Query: 87 GMLLSSCS-SGCKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADEDIKRGEFVIEYV 144
M+ + CS S C C N+ Q V ++ T G G+ I +G +++EYV
Sbjct: 1401 RMVYTECSPSNCPAAEKCRNQKIQRHEVAPGVERFMTLDKGWGVRTKLPIAKGTYILEYV 1460
Query: 145 GEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKW 204
GEV+ ++ ++R+ + +L +T+ Y ++ +VID G+ R++NHSC PN EMQKW
Sbjct: 1461 GEVVTEREFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKW 1519
Query: 205 IIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
++G +R+ +FA R I++GE LTYDY + + S
Sbjct: 1520 SVNGLSRMVLFAKRPIEQGEELTYDYNFSLFNPS 1553
>gi|198463835|ref|XP_001352956.2| GA21391 [Drosophila pseudoobscura pseudoobscura]
gi|198151432|gb|EAL30457.2| GA21391 [Drosophila pseudoobscura pseudoobscura]
Length = 2266
Score = 110 bits (275), Expect = 5e-22, Method: Composition-based stats.
Identities = 66/214 (30%), Positives = 106/214 (49%), Gaps = 12/214 (5%)
Query: 27 PVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHC 86
P +P W Y I+ N+Y D C+C C +C
Sbjct: 1350 PTRHIVPSW-------NYRKIRTNVYAESVRPNLAGFDHPTCNCKNQ--GEKACLDNCLN 1400
Query: 87 GMLLSSCS-SGCKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADEDIKRGEFVIEYV 144
M+ + CS S C C N+ Q V ++ T G G+ I +G +++EYV
Sbjct: 1401 RMVYTECSPSNCPAAEKCRNQKIQRHEVAPGVERFMTLDKGWGVRTKLPIAKGTYILEYV 1460
Query: 145 GEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKW 204
GEV+ ++ ++R+ + +L +T+ Y ++ +VID G+ R++NHSC PN EMQKW
Sbjct: 1461 GEVVTEREFKQRMASI-YLNDTHHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKW 1519
Query: 205 IIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
++G +R+ +FA R I++GE LTYDY + + S
Sbjct: 1520 SVNGLSRMVLFAKRPIEQGEELTYDYNFSLFNPS 1553
>gi|313221636|emb|CBY36121.1| unnamed protein product [Oikopleura dioica]
Length = 207
Score = 110 bits (275), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 54/132 (40%), Positives = 80/132 (60%), Gaps = 6/132 (4%)
Query: 112 PVKKMKLVQTEKCGAGIVADEDIK------RGEFVIEYVGEVIDDQTCEERLWKMKHLGE 165
P+ K L T I A+ +K +G F+IEY+GE+I RL + +G
Sbjct: 11 PIHKAPLSTTPSSSRQIGAETALKAKKLIPKGSFIIEYIGEIISHDESRIRLEESAKIGV 70
Query: 166 TNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGEN 225
TN+Y+ E++ +IDA +GN +R+INHSC PN + WI+ G+TRIGIF+ RDI++GE
Sbjct: 71 TNYYILELDNLRMIDAGPRGNIARFINHSCDPNCGIDPWIVQGDTRIGIFSKRDIQEGEE 130
Query: 226 LTYDYQYEFLHD 237
LT++YQ + D
Sbjct: 131 LTFNYQLQQSSD 142
>gi|320592445|gb|EFX04875.1| histone-lysine n-methyltransferase [Grosmannia clavigera kw1407]
Length = 1246
Score = 110 bits (275), Expect = 6e-22, Method: Composition-based stats.
Identities = 71/213 (33%), Positives = 108/213 (50%), Gaps = 23/213 (10%)
Query: 38 KPKAIPYVFIKRNIYL--TKRIKRRLEDDGIFCS-CTASPGSSGVCDRDCHCGMLLSSCS 94
+PK + Y +N ++ I R+ D F S C P + CD DC ++L C
Sbjct: 528 QPKPVKYGRFSKNRFVGDAHAIWRKNPDFDDFSSKCVCKPETG--CDEDCQNRIMLYECD 585
Query: 95 S-GCKCG-NSCLNKPFQ---NRPVKK------MKLVQTEKCGAGIVADEDIKRGEFVIEY 143
C G C N+ FQ R K +++ +T G GI A K + ++EY
Sbjct: 586 DMNCNVGPERCHNREFQRLAERTASKNPYHVGVEVFKTPDRGHGIRASRSFKPSQIIMEY 645
Query: 144 VGEVIDDQTCEERLWKMKHLGETN--FYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
+GE+I ++ + R M L + N +YL ++ ++ID T G+ +R++NHSC PN M
Sbjct: 646 IGEIITEEESDRR---MNELYKNNACYYLMSFDQSLIIDGT-SGSIARFVNHSCSPNCRM 701
Query: 202 QKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
KWI+ G+ RI +FA R I GE LTYDY ++
Sbjct: 702 IKWIVSGQPRIALFAGDRPIMTGEELTYDYNFD 734
>gi|405966542|gb|EKC31816.1| Putative histone-lysine N-methyltransferase ASH1L [Crassostrea gigas]
Length = 2162
Score = 110 bits (275), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 62/174 (35%), Positives = 94/174 (54%), Gaps = 6/174 (3%)
Query: 63 DDGIFCSCTAS--PGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRP-VKKMKL 118
D+ C+C P G C +C M+ + C S C C CLN+ F V +++
Sbjct: 1346 DEAHPCTCKRPYDPEVKG-CGEECLNRMMYTECDISTCPCQEQCLNQRFHKHEWVSGLEV 1404
Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
+ T+ G GI + I G+F++EY+GEV+ + R+ + ++ E + Y ++ V
Sbjct: 1405 IVTKDRGYGIRTSDSISNGQFILEYLGEVVSEAEFRRRMTE-EYSQERHHYCLNLDSGAV 1463
Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
ID GN RY+NHSC PN EMQKW ++G R+G+FA +DI LTYDY +
Sbjct: 1464 IDGYRMGNIGRYVNHSCEPNCEMQKWNVNGVYRMGLFALKDISPNMELTYDYNF 1517
>gi|325187666|emb|CCA22203.1| histonelysine Nmethyltransferase putative [Albugo laibachii Nc14]
Length = 1414
Score = 110 bits (274), Expect = 6e-22, Method: Composition-based stats.
Identities = 60/152 (39%), Positives = 90/152 (59%), Gaps = 6/152 (3%)
Query: 93 CSSGCKCGNSCLNKPFQNRPVK--KMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDD 150
CS+ C+ C NK Q +P K ++ +T K G GI E+I++G V EYVGE+ID
Sbjct: 326 CSAACEAREFCQNKRLQ-QPEKFPSLEAFKTLKKGFGIRTKENIRQGTIVGEYVGEIIDQ 384
Query: 151 QTCEERLWKM-KHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGE 209
RL + +H E NFY + + IDA +G+ +R++NHSC PN + +KW ++G+
Sbjct: 385 NELNRRLCAIGRH--ELNFYYLLLRPGVYIDARNRGSLTRFVNHSCDPNCKTEKWTVEGD 442
Query: 210 TRIGIFATRDIKKGENLTYDYQYEFLHDSLIA 241
TRI + A R I+ G LT+DYQ++ L +A
Sbjct: 443 TRIAVVALRHIEFGTELTFDYQWKSLGSRQLA 474
>gi|113470951|gb|ABI34877.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Danio rerio]
Length = 129
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 49/104 (47%), Positives = 71/104 (68%)
Query: 132 EDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYI 191
+D+K+G+FV+EYVGE+ID + C++R+ TNFY+ + +D VIDA KGN SR++
Sbjct: 5 QDLKKGDFVMEYVGELIDSEECKQRIRTANENHVTNFYMLTLTKDRVIDAGPKGNLSRFM 64
Query: 192 NHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFL 235
NHSC PN E QKW ++G+ RIG+F DI LT++Y + L
Sbjct: 65 NHSCSPNCETQKWTVNGDVRIGLFTLCDISADTELTFNYNLDCL 108
>gi|380006423|gb|AFD29602.1| NSD-1 [Schmidtea mediterranea]
Length = 914
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 66/184 (35%), Positives = 98/184 (53%), Gaps = 12/184 (6%)
Query: 57 IKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKC-------GNSCLNKPFQ 109
++R++ + C+C + S DC G + +S C+C G+ C N+ F
Sbjct: 595 LRRKVVPETPHCNCISFDPKS-----DCSIGSGCFNVASKCECDPQICNLGDKCKNQNFV 649
Query: 110 NRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFY 169
R K G G+ ADE IK +FV EY+GE+I + E+R+ T+FY
Sbjct: 650 KRIYPKQYTFWANDRGWGLKADEFIKTKQFVNEYIGEIITMEESEKRILWANENNITDFY 709
Query: 170 LCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYD 229
E++ +IDA N SR+IN+SC PN +KWI++ E RIG+FA RDI+KGE LT+
Sbjct: 710 FMELDNGRLIDARQFSNLSRFINNSCDPNLVAEKWIVNREHRIGLFALRDIQKGEELTFQ 769
Query: 230 YQYE 233
Y +
Sbjct: 770 YNLQ 773
>gi|428176462|gb|EKX45346.1| hypothetical protein GUITHDRAFT_61291, partial [Guillardia theta
CCMP2712]
Length = 215
Score = 110 bits (274), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 64/193 (33%), Positives = 101/193 (52%), Gaps = 7/193 (3%)
Query: 47 IKRNIYL-TKRIKRRLEDDGIFCSCTASPG-SSGVCDRDCHCGM--LLSSCSSGCKCGNS 102
I +N+Y+ +R + + C C PG C +C + L C C C +
Sbjct: 1 ISKNVYVGEERAALKSMESRSACQCIYVPGVPETACGEHSNCMLRDLYIECDYRCPCSSH 60
Query: 103 CLNKPFQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
CLNK Q R K + + G + DED+++G+ V+EY+GEVI + R+ +
Sbjct: 61 CLNKRLQKRQWAKCSIFLAKNGRGWALRNDEDLRQGQLVMEYIGEVISGEEVSRRM--EE 118
Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
+ G+ + Y+ ++N++ ID+T K N +R+INH C PN EMQKW + + + +FA I
Sbjct: 119 YAGKRHTYMLKLNQEEFIDSTRKANLARFINHCCEPNCEMQKWYVGNKQCVALFAKYFIP 178
Query: 222 KGENLTYDYQYEF 234
G LT+DY EF
Sbjct: 179 SGSELTFDYDMEF 191
>gi|159113654|ref|XP_001707053.1| Histone methyltransferase HMT1 [Giardia lamblia ATCC 50803]
gi|52857644|gb|AAU89075.1| histone methyltransferase HMT1 [Giardia intestinalis]
gi|157435155|gb|EDO79379.1| Histone methyltransferase HMT1 [Giardia lamblia ATCC 50803]
Length = 298
Score = 110 bits (274), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 72/228 (31%), Positives = 107/228 (46%), Gaps = 43/228 (18%)
Query: 41 AIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCG 100
++ Y +KRNIY+ KR FC+CT G+ C C + C C G
Sbjct: 52 SLHYTHVKRNIYVG--CKRPSAARKTFCTCTCKEGTG--CGTSCELRKVHLECYKECCAG 107
Query: 101 NSCLNKPFQNRPV-----------------------------KKMKLVQTE-----KCGA 126
+ C +K F RP+ ++M+ +T + G
Sbjct: 108 SPC-SKQFIVRPLFGNSIDLPKDATTLHKDNSRLADCGNQRLQRMQYARTAVYPAGRKGY 166
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A I+RG V EY+GEVI + C R K + + Y ++R++ IDA +KGN
Sbjct: 167 GLFALTSIQRGALVTEYIGEVITREECMRR----KKSAKGHLYFLALDRELYIDAAHKGN 222
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEF 234
+SR+INHSC PN E+Q W + E R I A R I E L++DY+++F
Sbjct: 223 ESRFINHSCDPNCEVQLWYVGEEPRAAIVALRSIAPHEELSFDYKFDF 270
>gi|330792328|ref|XP_003284241.1| hypothetical protein DICPUDRAFT_27300 [Dictyostelium purpureum]
gi|325085814|gb|EGC39214.1| hypothetical protein DICPUDRAFT_27300 [Dictyostelium purpureum]
Length = 151
Score = 110 bits (274), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 52/109 (47%), Positives = 74/109 (67%), Gaps = 2/109 (1%)
Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
G G+++ E+I +G+FV+EY GEVI TC R+ ++ E FY +N +DA+ +
Sbjct: 2 GWGLISCENINKGDFVMEYCGEVISKTTCLNRM--QENENEKFFYFLTLNSKECLDASRR 59
Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
GN +R+INHSC PN E QKWI+ GE +IGIF+ + I+KG LT+DY YE
Sbjct: 60 GNLARFINHSCDPNCETQKWIVGGEVKIGIFSIKPIEKGTELTFDYNYE 108
>gi|320036571|gb|EFW18510.1| histone-lysine N-methyltransferase [Coccidioides posadasii str.
Silveira]
Length = 742
Score = 110 bits (274), Expect = 7e-22, Method: Composition-based stats.
Identities = 62/178 (34%), Positives = 99/178 (55%), Gaps = 18/178 (10%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQ--NRPVKK-------M 116
C+CT G CD +C + C + CK G+ C N+PF R K +
Sbjct: 346 CTCTPETG----CDENCQNRYMFYECDDTNCKLGSELCRNRPFSALRRRAKAGGKFNIGV 401
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
++++TE G G+ ++ + ++EY GE++ + CE R+ + E +YL +++
Sbjct: 402 EVIKTEDRGYGVRSNRSFDPNQIIVEYTGEILTQEECERRMRTVYKKNEC-YYLMYFDQN 460
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYE 233
MVIDAT +G+ +R+INHSC PN M+KW + G+ R+ +FA D I GE LTYDY ++
Sbjct: 461 MVIDAT-RGSIARFINHSCEPNCRMEKWTVAGKPRMALFAGEDGIMTGEELTYDYNFD 517
>gi|303313071|ref|XP_003066547.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
gi|240106209|gb|EER24402.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
Length = 742
Score = 110 bits (274), Expect = 7e-22, Method: Composition-based stats.
Identities = 62/178 (34%), Positives = 99/178 (55%), Gaps = 18/178 (10%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQ--NRPVKK-------M 116
C+CT G CD +C + C + CK G+ C N+PF R K +
Sbjct: 346 CTCTPETG----CDENCQNRYMFYECDDTNCKLGSELCRNRPFSALRRRAKAGGKFNIGV 401
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
++++TE G G+ ++ + ++EY GE++ + CE R+ + E +YL +++
Sbjct: 402 EVIKTEDRGYGVRSNRSFDPNQIIVEYTGEILTQEECERRMRTVYKKNEC-YYLMYFDQN 460
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYE 233
MVIDAT +G+ +R+INHSC PN M+KW + G+ R+ +FA D I GE LTYDY ++
Sbjct: 461 MVIDAT-RGSIARFINHSCEPNCRMEKWTVAGKPRMALFAGEDGIMTGEELTYDYNFD 517
>gi|119192128|ref|XP_001246670.1| hypothetical protein CIMG_00441 [Coccidioides immitis RS]
gi|392864092|gb|EAS35106.2| histone-lysine N-methyltransferase [Coccidioides immitis RS]
Length = 742
Score = 110 bits (274), Expect = 7e-22, Method: Composition-based stats.
Identities = 62/178 (34%), Positives = 99/178 (55%), Gaps = 18/178 (10%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQ--NRPVKK-------M 116
C+CT G CD +C + C + CK G+ C N+PF R K +
Sbjct: 346 CTCTPETG----CDENCQNRYMFYECDDTNCKLGSELCRNRPFSALRRRAKAGGKFNIGV 401
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
++++TE G G+ ++ + ++EY GE++ + CE R+ + E +YL +++
Sbjct: 402 EVIKTEDRGYGVRSNRSFDPNQIIVEYTGEILTQEECERRMRTVYKKNEC-YYLMYFDQN 460
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYE 233
MVIDAT +G+ +R+INHSC PN M+KW + G+ R+ +FA D I GE LTYDY ++
Sbjct: 461 MVIDAT-RGSIARFINHSCEPNCRMEKWTVAGKPRMALFAGEDGIMTGEELTYDYNFD 517
>gi|449017981|dbj|BAM81383.1| similar to histone methyltransferase Set2p [Cyanidioschyzon merolae
strain 10D]
Length = 1354
Score = 110 bits (274), Expect = 7e-22, Method: Composition-based stats.
Identities = 68/199 (34%), Positives = 103/199 (51%), Gaps = 14/199 (7%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPG---SSGVCDRDCH---CGMLLSSC 93
K P++ + N YL D C PG +S VC +C+ C + S
Sbjct: 787 KPSPFIHLTANEYLVPPSVAPAHADVCQCIGACVPGVCLNSTVC-VECNPATCPVARSRS 845
Query: 94 SSGCKCGNSCLNKPFQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVGEVIDDQT 152
S+ +CGN FQ + ++L + G GI + +K+G+F++EY+GEVI
Sbjct: 846 STDPQCGN----MRFQRQAYAPVELFFSPNGRGCGIRSRAPLKKGDFIVEYMGEVIGPTE 901
Query: 153 CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
R K H E + Y +++ +DA+ KG R++NHSC PN QKW++ G+ R+
Sbjct: 902 LARR--KRDHALERHVYFMTLDQSTFLDASRKGTWGRFLNHSCEPNCHTQKWLVLGKVRV 959
Query: 213 GIFATRDIKKGENLTYDYQ 231
GIFA+RDI GE LT+DY+
Sbjct: 960 GIFASRDIAAGEELTFDYR 978
>gi|388852139|emb|CCF54145.1| uncharacterized protein [Ustilago hordei]
Length = 1338
Score = 109 bits (273), Expect = 8e-22, Method: Composition-based stats.
Identities = 74/205 (36%), Positives = 106/205 (51%), Gaps = 18/205 (8%)
Query: 38 KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SG 96
K K Y I RN Y ++ +L+ + CSC PGS C DC ML+ C
Sbjct: 648 KRKPPRYQQISRNKYYSRP---KLQGEVPLCSC--QPGSG--CGSDCINRMLMFICDPKT 700
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G++C N RP K + + G G+ E IK+ +F+ EY GEVID +R
Sbjct: 701 CPSGSNCTNISLGRRPTVKTAVHYYGRRGFGLKTLEPIKKDDFIDEYRGEVIDLHEASKR 760
Query: 157 LWKMKHLGETNFYLCEINRDM--VIDATYKGNKSRYINHSCCPNTEMQKWIIDG------ 208
+ ++ NFYL + + ++D KGN +R+ NHSC PN ++K+II G
Sbjct: 761 VTD-EYKATGNFYLLDYDTAAGELLDGGRKGNITRFANHSCEPNCRIEKFIICGTDEALS 819
Query: 209 -ETRIGIFATRDIKKGENLTYDYQY 232
E +IG+FA RDI+ GE LTY+Y +
Sbjct: 820 AEFQIGLFALRDIEAGEELTYNYGW 844
>gi|308485102|ref|XP_003104750.1| CRE-MET-1 protein [Caenorhabditis remanei]
gi|308257448|gb|EFP01401.1| CRE-MET-1 protein [Caenorhabditis remanei]
Length = 1582
Score = 109 bits (273), Expect = 9e-22, Method: Composition-based stats.
Identities = 69/197 (35%), Positives = 103/197 (52%), Gaps = 16/197 (8%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSC 103
Y+ I N Y + ++ + D + C C S D +C +L+ C S C +C
Sbjct: 632 YITIAENKYFNRNANKK-KTDSLICEC--SRLGLTCSDNNCVNRAMLTECPSSCPA--NC 686
Query: 104 LNKPFQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
N+ F + ++ T G G+ A +DIK+G F+IEYVGEV++ E+R K K+
Sbjct: 687 KNQRFAKKKYASVEAFHTGTAKGCGLRALKDIKKGRFIIEYVGEVVERDDYEKR--KKKY 744
Query: 163 LGE---TNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGE----TRIGIF 215
+ + YLC+ IDAT GN SR++NHSC PN +KW + +RIG F
Sbjct: 745 AADEKHKHHYLCDTGV-YTIDATVYGNPSRFVNHSCEPNAVCEKWSVPKTPGDISRIGFF 803
Query: 216 ATRDIKKGENLTYDYQY 232
A + IK GE +T+DYQ+
Sbjct: 804 AKKSIKAGEEITFDYQF 820
>gi|307106603|gb|EFN54848.1| hypothetical protein CHLNCDRAFT_35816, partial [Chlorella
variabilis]
Length = 239
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 56/122 (45%), Positives = 74/122 (60%), Gaps = 2/122 (1%)
Query: 116 MKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINR 175
M++ T G G+ A I RG F++EY GEVIDD+ C R K E +FY+ E+
Sbjct: 1 MEVYLTPDKGWGVRAMAFIPRGTFIVEYAGEVIDDKECSRRAEDAKARNEPHFYMMEMAP 60
Query: 176 DMVIDATYKGNKSRYINHSCCPNTEMQKWIIDG--ETRIGIFATRDIKKGENLTYDYQYE 233
++IDA KGN +R +N SC PN E QKW G E R+GIF+ RD+ GE LTYDYQ++
Sbjct: 61 GLIIDARSKGNLARLLNSSCDPNCETQKWHDAGNSEVRVGIFSLRDVLPGEELTYDYQFQ 120
Query: 234 FL 235
Sbjct: 121 HF 122
>gi|345566121|gb|EGX49068.1| hypothetical protein AOL_s00079g289 [Arthrobotrys oligospora ATCC
24927]
Length = 980
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 61/172 (35%), Positives = 91/172 (52%), Gaps = 3/172 (1%)
Query: 62 EDDGIFCSCTAS--PGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKKMKL 118
E++ + C C G + C ++C C G CG C N+ FQ R + +
Sbjct: 161 EEELMSCDCRPEYDDGVNHACSQNCINAETFVECVDGDSNCGGQCQNQRFQKREYANVSV 220
Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
+QTE G G+ A+ ++ G F+ EYVGEVI + +R +FY I
Sbjct: 221 IQTEMKGYGLRANTSMEPGTFIYEYVGEVIGESQFRKRRELYGKEDIKHFYFMSIKVGEY 280
Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDY 230
IDAT +G +R+ NHSC PN+ ++KW++ G+ R+GIFA I+ GE LT+DY
Sbjct: 281 IDATKRGCLARFCNHSCNPNSMVEKWVVGGKLRMGIFAKVKIEAGEELTFDY 332
>gi|340506525|gb|EGR32648.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 978
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/142 (41%), Positives = 90/142 (63%), Gaps = 2/142 (1%)
Query: 93 CSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
C+ C+CG+ C N+ FQ + T G G+ A + I +G F+I+Y+GEV D +
Sbjct: 27 CNLSCRCGDLCQNRRFQKHEDACVYPYPTSGKGWGLCAGQYIPKGTFIIQYIGEVFDINS 86
Query: 153 CEERLWKMKHLGETN-FYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETR 211
EE + ++K + YL +I+++ VID T+KGN +R+INHSC PN QKW + GE
Sbjct: 87 -EEGIKRVKDYSRSTCTYLMKIDKNEVIDPTFKGNLARFINHSCDPNCITQKWHVLGEIC 145
Query: 212 IGIFATRDIKKGENLTYDYQYE 233
IGIFA ++IK+ + LT+DYQ++
Sbjct: 146 IGIFAIKNIKEDDELTFDYQFD 167
>gi|301115083|ref|XP_002999311.1| histone-lysine N-methyltransferase, putative [Phytophthora
infestans T30-4]
gi|262111405|gb|EEY69457.1| histone-lysine N-methyltransferase, putative [Phytophthora
infestans T30-4]
Length = 905
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 80/252 (31%), Positives = 115/252 (45%), Gaps = 39/252 (15%)
Query: 1 MPAAKKNSDNSRIGHAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRR 60
+P ++ N H F +LLK I V + P Y + R+IY K+ K
Sbjct: 422 LPKRDPDASNPNDCHHF-RLLKNIIEDVRLQPPT---------YRKLGRSIYTFKQPKVS 471
Query: 61 LEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC----------------SSGCKCGNSCL 104
LED C C C DC + + C S C G +C
Sbjct: 472 LED-APTCVCKER------CGDDCINRLSFTECFGPAPTPGMKFNKHNRESNCMLGENCG 524
Query: 105 NKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLG 164
N+ + + + T + G + E K G+ VIEYVGEVI+++ E RL + H
Sbjct: 525 NRALHQKVYPRFQKFHTVEKGWALRLLEPAKAGQLVIEYVGEVINEEEKERRL--LDHAK 582
Query: 165 ET----NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDI 220
+ N Y+ E+ + IDA +KG+ SR+INHSC PN + KW + G RI I A +DI
Sbjct: 583 NSPEDKNMYIMELGKGEYIDARFKGSVSRFINHSCDPNCHLLKWRVKGVNRIAITALKDI 642
Query: 221 KKGENLTYDYQY 232
+ G L+YDYQ+
Sbjct: 643 EPGTELSYDYQF 654
>gi|336463582|gb|EGO51822.1| hypothetical protein NEUTE1DRAFT_125465 [Neurospora tetrasperma
FGSC 2508]
Length = 1162
Score = 108 bits (271), Expect = 1e-21, Method: Composition-based stats.
Identities = 67/204 (32%), Positives = 105/204 (51%), Gaps = 26/204 (12%)
Query: 38 KPKAIPYVFIKRNIYLTK-----RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSS 92
+PK Y + +N ++ + + ED C CT G C +DC ++L
Sbjct: 630 QPKPAAYRTMTKNRFIGQAAAIWKKTPHFEDFASKCVCTPEDG----CAQDCQNRVMLYE 685
Query: 93 CS-SGCKCGNS-CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDD 150
C + C G C N+ FQ M +T+K G + E ++EY GE+I D
Sbjct: 686 CDDTNCNVGKEFCQNRAFQ------MLTERTKKGGRYRIGVE------IIMEYTGEIITD 733
Query: 151 QTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGET 210
+ CE R+ ++ +YL +++M+IDAT G+ +R++NHSC PN M KWI+ G+
Sbjct: 734 EECERRM-NEEYKNNECYYLMSFDQNMIIDAT-TGSIARFVNHSCSPNCRMIKWIVSGQP 791
Query: 211 RIGIFA-TRDIKKGENLTYDYQYE 233
R+ +FA R I+ GE LTYDY ++
Sbjct: 792 RMALFAGDRPIQTGEELTYDYNFD 815
>gi|350297194|gb|EGZ78171.1| hypothetical protein NEUTE2DRAFT_101826 [Neurospora tetrasperma
FGSC 2509]
Length = 1162
Score = 108 bits (271), Expect = 1e-21, Method: Composition-based stats.
Identities = 67/204 (32%), Positives = 105/204 (51%), Gaps = 26/204 (12%)
Query: 38 KPKAIPYVFIKRNIYLTK-----RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSS 92
+PK Y + +N ++ + + ED C CT G C +DC ++L
Sbjct: 630 QPKPAAYRTMTKNRFIGQAAAIWKKTPHFEDFASKCVCTPEDG----CAQDCQNRVMLYE 685
Query: 93 CS-SGCKCGNS-CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDD 150
C + C G C N+ FQ M +T+K G + E ++EY GE+I D
Sbjct: 686 CDDTNCNVGKEFCQNRAFQ------MLTERTKKGGRYRIGVE------IIMEYTGEIITD 733
Query: 151 QTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGET 210
+ CE R+ ++ +YL +++M+IDAT G+ +R++NHSC PN M KWI+ G+
Sbjct: 734 EECERRM-NEEYKNNECYYLMSFDQNMIIDAT-TGSIARFVNHSCSPNCRMIKWIVSGQP 791
Query: 211 RIGIFA-TRDIKKGENLTYDYQYE 233
R+ +FA R I+ GE LTYDY ++
Sbjct: 792 RMALFAGDRPIQTGEELTYDYNFD 815
>gi|348683753|gb|EGZ23568.1| hypothetical protein PHYSODRAFT_487680 [Phytophthora sojae]
Length = 868
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 76/238 (31%), Positives = 111/238 (46%), Gaps = 39/238 (16%)
Query: 15 HAFNKLLKQIGNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASP 74
H F +L++ I V + P Y + R+IY K+ K LED C C
Sbjct: 442 HHF-RLMRNIIEDVRLQPPT---------YRKLGRSIYTFKQPKVSLED-APMCECKLQ- 489
Query: 75 GSSGVCDRDCHCGMLLSSC----------------SSGCKCGNSCLNKPFQNRPVKKMKL 118
C DC + + C S C G +C N+ + + +
Sbjct: 490 -----CGDDCINRLSFTECFGPAPTPGMKFNKQNRESNCMLGENCGNRALHQKIYPRFEK 544
Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGET----NFYLCEIN 174
T + G + E +K G+ VIEYVGEVI+++ E RL + H + N Y+ E+
Sbjct: 545 FHTVEKGWALRVLEPVKAGQLVIEYVGEVINEEEKERRL--LDHAKNSPEDKNMYIMELG 602
Query: 175 RDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
+ IDA +KG+ SR+INHSC PN + KW + G RI I A +DI+ G L+YDYQ+
Sbjct: 603 KGEYIDARFKGSVSRFINHSCDPNCHLLKWRVKGVNRIAITALKDIEPGTELSYDYQF 660
>gi|242011020|ref|XP_002426255.1| protein MLP1, putative [Pediculus humanus corporis]
gi|212510318|gb|EEB13517.1| protein MLP1, putative [Pediculus humanus corporis]
Length = 2688
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 66/209 (31%), Positives = 108/209 (51%), Gaps = 5/209 (2%)
Query: 31 ELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGV-CDRDCHCGML 89
+LP I P + Y I+ NIY + + + C+CT + C +C M+
Sbjct: 1861 QLPGRDIVP-SWNYKKIRTNIYYDVKPQMTGTYEAQACNCTVPLNTKQKGCGEECINRMV 1919
Query: 90 LSSCSSG-CKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADEDIKRGEFVIEYVGEV 147
+ CS C C C N+ Q ++ T+ G GI + IK G+F++EYVGEV
Sbjct: 1920 FAECSPQLCPCKEKCSNQRIQKHEFAPGLQKFMTKNKGWGIRTKQPIKAGDFILEYVGEV 1979
Query: 148 IDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIID 207
+ D+ ++R+ + ++ + + Y ++ +VID G R++NHSC PN EMQKW ++
Sbjct: 1980 VSDKEFKDRMASI-YVNDKHHYCLHLDGGLVIDGHRMGGDGRFVNHSCNPNCEMQKWSVN 2038
Query: 208 GETRIGIFATRDIKKGENLTYDYQYEFLH 236
G R+ +FA R+I + LTYDY + +
Sbjct: 2039 GLFRMALFALRNIPAHQELTYDYNFSLFN 2067
>gi|164429563|ref|XP_964116.2| hypothetical protein NCU01932 [Neurospora crassa OR74A]
gi|157073530|gb|EAA34880.2| predicted protein [Neurospora crassa OR74A]
Length = 1162
Score = 108 bits (271), Expect = 2e-21, Method: Composition-based stats.
Identities = 67/204 (32%), Positives = 105/204 (51%), Gaps = 26/204 (12%)
Query: 38 KPKAIPYVFIKRNIYLTK-----RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSS 92
+PK Y + +N ++ + + ED C CT G C +DC ++L
Sbjct: 630 QPKPAAYRTMTKNRFIGQAAAIWKKTPHFEDFASKCVCTPEDG----CAQDCQNRVMLYE 685
Query: 93 CS-SGCKCGNS-CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDD 150
C + C G C N+ FQ M +T+K G + E ++EY GE+I D
Sbjct: 686 CDDTNCNVGKEFCQNRAFQ------MLTERTKKGGRYRIGVE------IIMEYTGEIITD 733
Query: 151 QTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGET 210
+ CE R+ ++ +YL +++M+IDAT G+ +R++NHSC PN M KWI+ G+
Sbjct: 734 EECERRM-NEEYKNNECYYLMSFDQNMIIDAT-TGSIARFVNHSCSPNCRMIKWIVSGQP 791
Query: 211 RIGIFA-TRDIKKGENLTYDYQYE 233
R+ +FA R I+ GE LTYDY ++
Sbjct: 792 RMALFAGDRPIQTGEELTYDYNFD 815
>gi|401884695|gb|EJT48845.1| hypothetical protein A1Q1_02180 [Trichosporon asahii var. asahii
CBS 2479]
Length = 1192
Score = 108 bits (270), Expect = 2e-21, Method: Composition-based stats.
Identities = 71/200 (35%), Positives = 103/200 (51%), Gaps = 12/200 (6%)
Query: 38 KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGC 97
K + Y I N +L +R K + D + C C G C + SC +G
Sbjct: 471 KKRPPQYEMITSNQFL-ERPKIPVSDKEV-CHCVVGTGCGLSCTNRLQGILCGKSCPNGP 528
Query: 98 KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
CGN L + RP K + + + G G+ A EDI GEFV++Y GEVI T +R+
Sbjct: 529 DCGNQALCR----RPAKAIIVALSSLHGYGVFAAEDIAAGEFVVDYRGEVISVDTFIDRI 584
Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKW--IIDG--ETRIG 213
H + + + +RD +ID++ KGN +R+INHSC PN ++K+ + DG E G
Sbjct: 585 G--SHSEDDSVFAIAYDRDEIIDSSSKGNSARFINHSCDPNLVLRKFDTLGDGHEEHEFG 642
Query: 214 IFATRDIKKGENLTYDYQYE 233
+++ R IK GE LTYDY E
Sbjct: 643 LWSRRPIKAGEELTYDYNAE 662
>gi|451852041|gb|EMD65336.1| hypothetical protein COCSADRAFT_35392 [Cochliobolus sativus ND90Pr]
Length = 787
Score = 108 bits (270), Expect = 2e-21, Method: Composition-based stats.
Identities = 66/196 (33%), Positives = 98/196 (50%), Gaps = 17/196 (8%)
Query: 59 RRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGN-SCLNKPFQ------- 109
+R + D C C G C CH ++ C S+ C C N+PF
Sbjct: 366 KRDKQDTSQCYCDVDDG----CGESCHNRIMAYECDSTNCPLAKEQCNNRPFAELRKRAK 421
Query: 110 -NRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNF 168
NR ++++ TE G G+ A + + ++EY GE+I CE R+ K + + +
Sbjct: 422 GNRYDYGVEVLNTEDRGFGVRAMRTFEPHQIIVEYAGEIITQSECERRM-KQVYKKDKCY 480
Query: 169 YLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLT 227
YL + M+IDAT +G +R++NHSC PN EM KW + GE R+ +FA +R I GE LT
Sbjct: 481 YLMSFDNKMIIDAT-RGTIARFVNHSCEPNCEMIKWTVGGEPRMALFAGSRGIMTGEELT 539
Query: 228 YDYQYEFLHDSLIAYC 243
YDY ++ I C
Sbjct: 540 YDYNFDPFSQKNIQEC 555
>gi|451997634|gb|EMD90099.1| hypothetical protein COCHEDRAFT_1225655 [Cochliobolus
heterostrophus C5]
Length = 787
Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats.
Identities = 66/196 (33%), Positives = 98/196 (50%), Gaps = 17/196 (8%)
Query: 59 RRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC-SSGCK-CGNSCLNKPFQ------- 109
+R + D C C G C CH ++ C S+ C C N+PF
Sbjct: 366 KRDKQDTSQCYCDVDDG----CGESCHNRIMAYECDSTNCPLTKEQCNNRPFAELKKRAK 421
Query: 110 -NRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNF 168
NR ++++ TE G G+ A + + ++EY GE+I CE R+ K + + +
Sbjct: 422 GNRYDYGVEVLNTEDRGFGVRAMRTFEPHQIIVEYAGEIITQSECERRM-KQVYKKDKCY 480
Query: 169 YLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLT 227
YL + M+IDAT +G +R++NHSC PN EM KW + GE R+ +FA +R I GE LT
Sbjct: 481 YLMSFDNKMIIDAT-RGTIARFVNHSCEPNCEMIKWTVGGEPRMALFAGSRGIMTGEELT 539
Query: 228 YDYQYEFLHDSLIAYC 243
YDY ++ I C
Sbjct: 540 YDYNFDPFSQKNIQEC 555
>gi|116193789|ref|XP_001222707.1| hypothetical protein CHGG_06612 [Chaetomium globosum CBS 148.51]
gi|88182525|gb|EAQ89993.1| hypothetical protein CHGG_06612 [Chaetomium globosum CBS 148.51]
Length = 907
Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats.
Identities = 60/178 (33%), Positives = 97/178 (54%), Gaps = 18/178 (10%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQNRPVKKMK-------- 117
C C + G CD DC ++L C + C G + C N+ FQ+ + K
Sbjct: 449 CVCQPADG----CDEDCQNRIMLYECDDTNCNFGKAHCQNRAFQDLQERTKKGGRYRVGV 504
Query: 118 -LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+V+T G G+ ++ + + ++EY GE+I + CE R+ ++ +YL +++
Sbjct: 505 EVVKTGDRGYGVRSNRCFEANQIIMEYTGEIITEAECERRM-NEEYKDNECYYLMSFDQN 563
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
M+IDAT G+ +R++NHSC PN M KWI+ G+ R+ +FA R I GE LTYDY ++
Sbjct: 564 MIIDAT-TGSIARFVNHSCSPNCRMIKWIVAGQPRMALFAGDRPIMTGEELTYDYNFD 620
>gi|429862858|gb|ELA37465.1| histone-lysine n-methyltransferase [Colletotrichum gloeosporioides
Nara gc5]
Length = 805
Score = 108 bits (269), Expect = 3e-21, Method: Composition-based stats.
Identities = 61/188 (32%), Positives = 97/188 (51%), Gaps = 18/188 (9%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCG-NSCLNKPFQNRPVKK---------M 116
C CT G C C ++L C C G +C N+ F + +K +
Sbjct: 394 CVCTPDDG----CGESCQNRIMLYECDEKNCNVGRENCTNRAFADLQERKAGGGKYRVGV 449
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
++++T G GI A+ + ++EY GE+I D+ C R+ + K+ +YL +++
Sbjct: 450 EVIKTADRGYGIRANRCFAPNQIIMEYTGEIITDEECSNRM-ETKYKDNKCYYLMSFDQN 508
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYEFL 235
M+IDAT G+ +R++NHSC PN M KWI+ G+ R+ +FA + I GE LTYDY +
Sbjct: 509 MIIDAT-TGSIARFVNHSCAPNCRMIKWIVSGQPRMALFAGDKPIMTGEELTYDYNFSPF 567
Query: 236 HDSLIAYC 243
D + C
Sbjct: 568 SDENVQKC 575
>gi|301100332|ref|XP_002899256.1| histone-lysine N-methyltransferase, putative [Phytophthora
infestans T30-4]
gi|262104173|gb|EEY62225.1| histone-lysine N-methyltransferase, putative [Phytophthora
infestans T30-4]
Length = 1501
Score = 108 bits (269), Expect = 3e-21, Method: Composition-based stats.
Identities = 59/154 (38%), Positives = 89/154 (57%), Gaps = 6/154 (3%)
Query: 93 CSSGCKCGNSCLNKPFQNRPVKK--MKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDD 150
CS+ C C N+ Q+ P + ++ +TE+ G G+ + I + V EYVGE+ID
Sbjct: 103 CSASCPAAQYCRNQRLQH-PERYPLLEPFKTEQKGYGVRTRQHIPQLSIVGEYVGEIIDQ 161
Query: 151 QTCEERLWKM-KHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGE 209
+ RL + +H E NFY + + IDA KG+ +R++NHSC PN + +KW + GE
Sbjct: 162 KELARRLKSVPRH--ELNFYYLLLAPGVYIDARNKGSFTRFVNHSCEPNCKTEKWTVKGE 219
Query: 210 TRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
TRI + A RDI+ GE LT+DY+++ L I C
Sbjct: 220 TRIAVSALRDIEVGEELTFDYKWKALGSRQITCC 253
>gi|357483173|ref|XP_003611873.1| Histone-lysine N-methyltransferase SUVR5 [Medicago truncatula]
gi|355513208|gb|AES94831.1| Histone-lysine N-methyltransferase SUVR5 [Medicago truncatula]
Length = 1507
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/167 (40%), Positives = 91/167 (54%), Gaps = 21/167 (12%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
G L+ C+ C+C SC N+ QN K+++ +TEK G G+ A E I RG FV EY+GE
Sbjct: 1325 GYLVYECNRMCRCNKSCPNRILQNGVRVKLEVFKTEKKGWGVRAGEAILRGTFVCEYIGE 1384
Query: 147 VIDDQTCEERLWKMKHLGETN-FYLCEIN---RDM----------VIDATYKGNKSRYIN 192
V+D Q R K G N Y +IN DM VIDA+ GN SR+IN
Sbjct: 1385 VLDVQEAHNR---RKRYGTGNCSYFYDINARVNDMSRMIEEKAQYVIDASKNGNVSRFIN 1441
Query: 193 HSCCPNTEMQKWIIDG----ETRIGIFATRDIKKGENLTYDYQYEFL 235
HSC PN + +++ + IG +A++DI GE LTY +QYE +
Sbjct: 1442 HSCSPNLVSHQVLVESMDCERSHIGFYASQDIALGEELTYGFQYELV 1488
>gi|258573585|ref|XP_002540974.1| predicted protein [Uncinocarpus reesii 1704]
gi|237901240|gb|EEP75641.1| predicted protein [Uncinocarpus reesii 1704]
Length = 727
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 72/225 (32%), Positives = 116/225 (51%), Gaps = 27/225 (12%)
Query: 33 PDWFIKPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSS 92
PD + K +V +I+ +IK + C+CT G CD +CH +
Sbjct: 333 PDEWRKANKNVFVGDATSIWKANKIK-----EHSTCTCTPETG----CDENCHNRYMFYE 383
Query: 93 CS-SGCKCGNS-CLNKPFQ--NRPVKK-------MKLVQTEKCGAGIVADEDIKRGEFVI 141
C + CK G C N+PF R K +++++TE G G+ ++ + ++
Sbjct: 384 CDDTNCKLGPELCQNRPFSELRRRSKAGGKFNIGVEVIKTEDRGYGVRSNRAFNPNQIIV 443
Query: 142 EYVGEVIDDQTCEERLWKMKHLGETN--FYLCEINRDMVIDATYKGNKSRYINHSCCPNT 199
EY GE++ + CE R M+ + + N +YL +++MVIDAT +G+ +R+INHSC PN
Sbjct: 444 EYTGEILTQEECERR---MRTVYKKNDCYYLMYFDQNMVIDAT-RGSIARFINHSCEPNC 499
Query: 200 EMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEFLHDSLIAYC 243
M+KW + G+ R+ +FA D I GE LTYDY ++ + C
Sbjct: 500 RMEKWTVAGKPRMALFAGEDGIMTGEELTYDYNFDPYSQKNVQEC 544
>gi|350631656|gb|EHA20027.1| hypothetical protein ASPNIDRAFT_179076 [Aspergillus niger ATCC
1015]
Length = 853
Score = 107 bits (268), Expect = 4e-21, Method: Composition-based stats.
Identities = 57/177 (32%), Positives = 95/177 (53%), Gaps = 17/177 (9%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK---------MK 117
C CT G CD +C + C G C G C N+ F+ + ++
Sbjct: 427 CMCTPETG----CDEECQNRYMFYECDEGNCGVGEECGNRSFEELKQRTKAGGKYNIGVE 482
Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
+++T G G+ ++ + + ++EY GE+I CE+R+ + E +YL +++M
Sbjct: 483 VIKTADRGYGVRSNRTFEPNQIIVEYTGEIITQTECEKRMRTIYKHNEC-YYLMYFDQNM 541
Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
+IDAT +G+ +R++NHSC PN M+KW + G+ R+ +FA R I GE LTYDY ++
Sbjct: 542 IIDAT-RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDRGIMTGEELTYDYNFD 597
>gi|301624151|ref|XP_002941371.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Xenopus
(Silurana) tropicalis]
Length = 3106
Score = 107 bits (268), Expect = 4e-21, Method: Composition-based stats.
Identities = 64/204 (31%), Positives = 103/204 (50%), Gaps = 20/204 (9%)
Query: 46 FIKRNIYLTKRIKRRLEDD--GIFCSC---TASPGSSG--VCDRDCHCGMLLSSCSSGCK 98
I+ N+YLT+R K + D + C C + + G C DC +L+ CSS C
Sbjct: 1989 LIEENVYLTERKKSKSHRDIKRMQCECPVLSKEERAQGQVACGEDCLNRLLMIECSSRCP 2048
Query: 99 CGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLW 158
G+ C N+ FQ + ++ + TEK G G+ A +D+K FV+EY GEV+D + + R+
Sbjct: 2049 NGDYCSNRSFQKKQHAGVEAILTEKKGWGLRAAKDLKSNTFVLEYCGEVLDHKEFKSRVK 2108
Query: 159 KMKHLGETNFYLCEINRDMVIDATYKG---------NKSRYINHSCCPNTEMQKWIIDGE 209
+ ++Y + D V + + S ++N PNT QKW ++G+
Sbjct: 2109 EYARNKNIHYYFMALKNDEVSSSGQRNLSPTNLPCPRSSSWVN----PNTVSQKWTVNGQ 2164
Query: 210 TRIGIFATRDIKKGENLTYDYQYE 233
R+G F TR + G LT+DYQ++
Sbjct: 2165 VRVGFFTTRVVPAGSELTFDYQFQ 2188
>gi|358369137|dbj|GAA85752.1| histone-lysine N-methyltransferase [Aspergillus kawachii IFO 4308]
Length = 852
Score = 107 bits (267), Expect = 4e-21, Method: Composition-based stats.
Identities = 57/177 (32%), Positives = 95/177 (53%), Gaps = 17/177 (9%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK---------MK 117
C CT G CD +C + C G C G C N+ F+ + ++
Sbjct: 426 CMCTPETG----CDEECQNRYMFYECDEGNCGVGEECGNRSFEELKQRTKAGGKYNIGVE 481
Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
+++T G G+ ++ + + ++EY GE+I CE+R+ + E +YL +++M
Sbjct: 482 VIKTADRGYGVRSNRTFEPNQIIVEYTGEIITQTECEKRMRTIYKHNEC-YYLMYFDQNM 540
Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
+IDAT +G+ +R++NHSC PN M+KW + G+ R+ +FA R I GE LTYDY ++
Sbjct: 541 IIDAT-RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDRGIMTGEELTYDYNFD 596
>gi|253743884|gb|EET00168.1| Histone methyltransferase HMT1 [Giardia intestinalis ATCC 50581]
Length = 297
Score = 107 bits (267), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 70/226 (30%), Positives = 104/226 (46%), Gaps = 40/226 (17%)
Query: 41 AIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCG 100
++ YV IKRN Y+ KR FC+CT G+ C C + C C G
Sbjct: 52 SLHYVHIKRNTYVG--CKRPSTARKTFCTCTCREGAG--CGPGCELRRVHLECYKECLAG 107
Query: 101 NSCLNKPF---------------------------QNRPVKKMKLVQTE-----KCGAGI 128
C +P N+ +++++ +T K G G+
Sbjct: 108 PLCFQRPIVKPLFSSIMDSPRLATFHKDNNKHGDCGNQRLQRLQYARTAVYPAGKKGYGL 167
Query: 129 VADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKS 188
A +I+RG V EY+GEVI + C R K + Y +++++ IDA KGN+S
Sbjct: 168 FALTNIQRGTLVTEYIGEVITKEECMRR----KKDATGHLYFLALDKELYIDAARKGNES 223
Query: 189 RYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEF 234
R+INHSC PN E+Q W + E R I A R I E L++DY+++F
Sbjct: 224 RFINHSCDPNCEVQLWYVGEEPRAAIVALRSIVPYEELSFDYKFDF 269
>gi|317033153|ref|XP_001394944.2| histone-lysine N-methyltransferase (Ash1) [Aspergillus niger CBS
513.88]
Length = 825
Score = 107 bits (267), Expect = 4e-21, Method: Composition-based stats.
Identities = 57/177 (32%), Positives = 95/177 (53%), Gaps = 17/177 (9%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK---------MK 117
C CT G CD +C + C G C G C N+ F+ + ++
Sbjct: 399 CMCTPETG----CDEECQNRYMFYECDEGNCGVGEECGNRSFEELKQRTKAGGKYNIGVE 454
Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
+++T G G+ ++ + + ++EY GE+I CE+R+ + E +YL +++M
Sbjct: 455 VIKTADRGYGVRSNRTFEPNQIIVEYTGEIITQTECEKRMRTIYKHNEC-YYLMYFDQNM 513
Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
+IDAT +G+ +R++NHSC PN M+KW + G+ R+ +FA R I GE LTYDY ++
Sbjct: 514 IIDAT-RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDRGIMTGEELTYDYNFD 569
>gi|299116669|emb|CBN74814.1| Whsc1 protein [Ectocarpus siliculosus]
Length = 1551
Score = 107 bits (266), Expect = 6e-21, Method: Composition-based stats.
Identities = 71/207 (34%), Positives = 103/207 (49%), Gaps = 19/207 (9%)
Query: 43 PYVFIKRNIYLTKRIKRRLEDDGI-FCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKC-- 99
PY +K N+Y +K + D I C+C G CD C +LL C+ G +C
Sbjct: 260 PYRKLKANLYEDSSLKGLVPVDEIPLCNCRPEDG----CDASCINRLLLMECAPG-RCPT 314
Query: 100 ----GNSCLNKPFQNRPVKKMKLVQT-EKCGAGIVADED--IKRGEFVIEYVGEVIDDQT 152
C N Q + ++ +T E G G+ E+ + G + EY+GEVI
Sbjct: 315 LRGASKYCNNNAIQTKTFPATEVFRTFEGRGWGLRLAEERGAEAGTLLHEYLGEVIMMDE 374
Query: 153 CEERLWKMKHLG----ETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDG 208
C RL K+ G +FY ++ ++V+D G ++R+ NHSC PN MQKW + G
Sbjct: 375 CRRRLRKVGRKGVEGSSGDFYFASLDGNLVLDGGPMGGEARFANHSCSPNCLMQKWSVLG 434
Query: 209 ETRIGIFATRDIKKGENLTYDYQYEFL 235
ETR+ + A RDI GE LTY+YQ + L
Sbjct: 435 ETRVVLVAARDISVGEELTYNYQADTL 461
>gi|378733973|gb|EHY60432.1| histone-lysine N-methyltransferase ASH1L [Exophiala dermatitidis
NIH/UT8656]
Length = 780
Score = 107 bits (266), Expect = 6e-21, Method: Composition-based stats.
Identities = 75/242 (30%), Positives = 127/242 (52%), Gaps = 34/242 (14%)
Query: 15 HAFNKLLKQIGNPVEFELP-DWFI-----KPKAIPYVFIKRNIYLTK-----RIKRRLED 63
+A +LLKQ +F+LP D F +PK + + +N+++ R+ + E
Sbjct: 350 YAGERLLKQ---GRDFKLPFDVFSPLPAGQPKPDEWRKVNKNVFVGDAAQAWRVSKFEEH 406
Query: 64 DGIFCSCTASPGSSGVCDRDCHCGMLLSSCSS-GCKCGNS-CLNKPFQ--NRPVKK---- 115
C C S G CD +C + C C C N+ F+ + VKK
Sbjct: 407 SR--CMCKPSTG----CDHNCMNRYMFYECDDRNCNLSEELCGNRQFEALRQRVKKGGKY 460
Query: 116 ---MKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCE 172
+++++TE G G+ ++ + + ++EY GE+I + CE R+ M + G +YL
Sbjct: 461 NVGVEVIKTEDRGYGVRSNRTFEPNQIIVEYTGEIITQEECERRMNTM-YKGNECYYLML 519
Query: 173 INRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQ 231
+++M+IDAT +G+ +R++NHSC PN M+KW ++G+ R+ +FA R I GE LTYDY
Sbjct: 520 FDQNMIIDAT-RGSIARFVNHSCAPNCRMEKWTVNGKPRMALFAGDRGIMTGEELTYDYN 578
Query: 232 YE 233
++
Sbjct: 579 FD 580
>gi|449664137|ref|XP_002169363.2| PREDICTED: histone-lysine N-methyltransferase SETD2-like [Hydra
magnipapillata]
Length = 655
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 67/197 (34%), Positives = 95/197 (48%), Gaps = 24/197 (12%)
Query: 43 PYVF--IKRNIYLTKRIKRRLEDD--GIFCSCTASPGSSGV--CDRDCHCGMLLSSCSSG 96
P F + NIYL + K R + + C C P CD +C +L+ C++
Sbjct: 229 PLTFDSTETNIYLIDKEKSRSSKEVRRMICECVFEPDDPFFVGCDENCLNRLLMIECNNR 288
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C C N+ FQ K+ + E G FV EY GEV+D R
Sbjct: 289 CPTREFCTNRNFQ----KQEDFYENE--------------GSFVSEYCGEVVDYTEFHRR 330
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA 216
G ++Y + + +IDAT KG+KSR+INHSC PN QKW ++G R+G FA
Sbjct: 331 TKLYNAEGMNHYYFMTLKTNEIIDATKKGSKSRFINHSCDPNCITQKWTVNGFLRVGFFA 390
Query: 217 TRDIKKGENLTYDYQYE 233
R I+ GE L++DYQ++
Sbjct: 391 LRYIEAGEELSFDYQFQ 407
>gi|121704806|ref|XP_001270666.1| histone-lysine N-methyltransferase (Ash1), putative [Aspergillus
clavatus NRRL 1]
gi|119398812|gb|EAW09240.1| histone-lysine N-methyltransferase (Ash1), putative [Aspergillus
clavatus NRRL 1]
Length = 847
Score = 106 bits (264), Expect = 9e-21, Method: Composition-based stats.
Identities = 59/179 (32%), Positives = 97/179 (54%), Gaps = 21/179 (11%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK---------MK 117
C C G CD +C + C G C G C N+ F+ + ++
Sbjct: 420 CMCIEETG----CDENCQNRYMFYECDEGNCGLGPDCGNRSFEELKQRTKVGGKYNIGVE 475
Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN--FYLCEINR 175
+++TE G G+ ++ + + ++EY GE+I CE+R M+ L + N +YL ++
Sbjct: 476 VIKTEDRGYGVRSNRTFEPNQVIVEYTGEIITQVECEKR---MRTLYKNNECYYLMYFDQ 532
Query: 176 DMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
+M+IDAT +G+ +R++NHSC PN M+KW + G+ R+ +FA R I GE LTYDY ++
Sbjct: 533 NMIIDAT-RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDRGIMTGEELTYDYNFD 590
>gi|15488418|gb|AAL01110.1|AF408059_1 ASH1-like protein 1 [Arabidopsis thaliana]
Length = 229
Score = 106 bits (264), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 55/127 (43%), Positives = 77/127 (60%)
Query: 108 FQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN 167
FQ K KL++ E G G+VA E+IK G+F++EY GEVI + ++R + G +
Sbjct: 1 FQKCEYAKTKLIKCEGRGWGLVALEEIKAGQFIMEYCGEVISWKEAKKRAQTYETHGVKD 60
Query: 168 FYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLT 227
Y+ +N IDAT KG+ +R+INHSC PN E +KW + GE R+GIFA I L
Sbjct: 61 AYIISLNASEAIDATKKGSLARFINHSCRPNCETRKWNVLGEVRVGIFAKESISPRTELA 120
Query: 228 YDYQYEF 234
YDY +E+
Sbjct: 121 YDYNFEW 127
>gi|380493696|emb|CCF33689.1| SET domain-containing protein [Colletotrichum higginsianum]
Length = 833
Score = 106 bits (264), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 63/176 (35%), Positives = 96/176 (54%), Gaps = 14/176 (7%)
Query: 80 CDRDCHCGMLLSSCS-SGCKCGNS-CLNKPF---QNRPVKKMK------LVQTEKCGAGI 128
C DC ++L C C G C N+ F Q+R V K +++T G GI
Sbjct: 433 CGEDCQNRIMLYECDDKNCNVGREHCTNRAFADLQDRRVGGGKYRVGVEVIKTPDRGYGI 492
Query: 129 VADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKS 188
A+ + + ++EY GE+I D+ C ER+ + K+ +YL +++M+IDAT KG+ +
Sbjct: 493 RANRCFESNQIIMEYTGEIITDEECSERM-ENKYKNNKCYYLMSFDQNMIIDAT-KGSIA 550
Query: 189 RYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYEFLHDSLIAYC 243
R++NHSC PN M KWI+ G+ R+ +FA + I GE LTYDY + D + C
Sbjct: 551 RFVNHSCAPNCRMIKWIVSGQPRMALFAGDKPIMTGEELTYDYNFSPFSDENVQTC 606
>gi|449497711|ref|XP_004160490.1| PREDICTED: histone-lysine N-methyltransferase SUVR5-like [Cucumis
sativus]
Length = 1419
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 60/166 (36%), Positives = 90/166 (54%), Gaps = 19/166 (11%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
G L+ C+ C C +C N+ QN K+++ TE G + A E I RG FV EYVGE
Sbjct: 1245 GYLVYECNERCSCSRACPNRVLQNGVHVKLEVFMTETKGWAVRAGEAIMRGTFVCEYVGE 1304
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDM-------------VIDATYKGNKSRYINH 193
V+D+Q R + K+ E N Y +++ + +IDAT+ GN SR+INH
Sbjct: 1305 VLDEQEANRR--RDKYNSEGNCYFLDVDAHINDISRLVDGSARYIIDATHYGNVSRFINH 1362
Query: 194 SCCPNTEMQKWIIDG----ETRIGIFATRDIKKGENLTYDYQYEFL 235
SC PN + +++ + IG++A R+I GE LT++Y+ E L
Sbjct: 1363 SCSPNLVTYQVLVESMEYQRSHIGLYANRNIATGEELTFNYRRELL 1408
>gi|310794022|gb|EFQ29483.1| SET domain-containing protein [Glomerella graminicola M1.001]
Length = 828
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 60/176 (34%), Positives = 96/176 (54%), Gaps = 14/176 (7%)
Query: 80 CDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQNRPVKK---------MKLVQTEKCGAGI 128
C DC ++L C C G C N+ F + +K +++++T G GI
Sbjct: 419 CGEDCQNRIMLYECDDKNCNVGREHCTNRAFADLQERKSGGGKYRVGVEVIKTSDRGYGI 478
Query: 129 VADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKS 188
A+ + + ++EY GE+I D+ C ER+ + K+ +YL +++M+IDAT KG+ +
Sbjct: 479 RANRCFEPNQIIMEYTGEIITDEECSERM-ENKYKDSKCYYLMSFDQNMIIDAT-KGSIA 536
Query: 189 RYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYEFLHDSLIAYC 243
R++NHSC PN M KWI+ G+ R+ +FA + I GE LTYDY + D + C
Sbjct: 537 RFVNHSCAPNCRMIKWIVSGQPRMALFAGDKPIMTGEELTYDYNFSPFSDENVQKC 592
>gi|145533713|ref|XP_001452601.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420300|emb|CAK85204.1| unnamed protein product [Paramecium tetraurelia]
Length = 1137
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 66/181 (36%), Positives = 94/181 (51%), Gaps = 7/181 (3%)
Query: 63 DDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKKMKLVQT 121
+D I S P S C C + C C C C N+ FQ + V
Sbjct: 12 EDCIKFSRIQGPQYSYNCGDRCLNKFTCTECDVELCPCAELCKNRRFQKH---EDACVYP 68
Query: 122 EKCGA---GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMV 178
KCG G+ A + I++G+F+++Y+GE+ + R ++ T YL ++N V
Sbjct: 69 LKCGGKGMGLYAGDRIQKGQFIMQYIGEIFQINSALGRRRVQEYSKSTCTYLMKLNNQEV 128
Query: 179 IDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
ID T KGN +R+INHSC PN +KW + GE IGIFA+RDI + E LT+DYQ++ H
Sbjct: 129 IDPTTKGNLARFINHSCEPNCITEKWNVLGEVCIGIFASRDINEDEELTFDYQFDVFHTP 188
Query: 239 L 239
L
Sbjct: 189 L 189
>gi|367041896|ref|XP_003651328.1| hypothetical protein THITE_2111453 [Thielavia terrestris NRRL 8126]
gi|346998590|gb|AEO64992.1| hypothetical protein THITE_2111453 [Thielavia terrestris NRRL 8126]
Length = 950
Score = 105 bits (262), Expect = 2e-20, Method: Composition-based stats.
Identities = 58/178 (32%), Positives = 94/178 (52%), Gaps = 18/178 (10%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNS-CLNKPFQNRPVKKMK-------- 117
C C G C DC ++L C + C G C N+ FQ+ + K
Sbjct: 453 CVCKPEDG----CGEDCQNRIMLYECDETNCNIGREYCTNRAFQDLQERTKKGGRYRVGV 508
Query: 118 -LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+ +T G G+ ++ + + ++EY GE+I ++ CE R+ ++ +YL +++
Sbjct: 509 EVFKTPDRGYGVRSNRCFEPNQIIMEYTGEIITEEECERRM-NEEYKDNECYYLMSFDQN 567
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
M+IDAT G+ +R++NHSC PN M KWI+ G+ R+ +FA R I GE LTYDY ++
Sbjct: 568 MIIDAT-TGSIARFVNHSCSPNCRMIKWIVSGQPRMALFAGDRPIMTGEELTYDYNFD 624
>gi|71003762|ref|XP_756547.1| hypothetical protein UM00400.1 [Ustilago maydis 521]
gi|46095711|gb|EAK80944.1| hypothetical protein UM00400.1 [Ustilago maydis 521]
Length = 1367
Score = 105 bits (262), Expect = 2e-20, Method: Composition-based stats.
Identities = 71/205 (34%), Positives = 106/205 (51%), Gaps = 18/205 (8%)
Query: 38 KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SG 96
K K Y I +N Y+T+ +L+ + C+C PGS C DC ML+ C
Sbjct: 654 KRKPPRYQQINKNKYVTRA---KLQGEVPLCNC--KPGSG--CGHDCINRMLMFICDPKT 706
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C ++C N RP K + + G G+ E IKR +F+ EY GEVI+ +R
Sbjct: 707 CPSASNCTNISLGRRPHVKTAVAYYGRRGFGLKTLEAIKRDDFIDEYRGEVINLSEAAKR 766
Query: 157 LWKMKHLGETNFYLCEINRDM--VIDATYKGNKSRYINHSCCPNTEMQKWIIDG------ 208
+ + ++ N+YL + + ++D KGN +R+ NHSC PN ++K+II G
Sbjct: 767 VTE-EYKATGNYYLLDYDSAAGELLDGGRKGNITRFANHSCDPNCRIEKFIICGTDEALS 825
Query: 209 -ETRIGIFATRDIKKGENLTYDYQY 232
E +IG+FA RDI GE LTY+Y +
Sbjct: 826 AEFQIGLFANRDIAAGEELTYNYGW 850
>gi|323507954|emb|CBQ67825.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 1423
Score = 105 bits (262), Expect = 2e-20, Method: Composition-based stats.
Identities = 70/205 (34%), Positives = 107/205 (52%), Gaps = 18/205 (8%)
Query: 38 KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SG 96
K K Y I +N Y+++ +L+ + C+C PGS C DC ML+ C
Sbjct: 679 KRKPPRYQQISKNKYVSRA---KLQGETPLCNC--KPGSG--CGADCINRMLMFICDPRT 731
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C ++C N RP K + + G G+ E IK+ +F+ EY GEVI+ +R
Sbjct: 732 CPSASNCTNVSLGRRPTVKTTVAYYGRRGFGLKTLEAIKKDDFIDEYRGEVINLGEAAKR 791
Query: 157 LWKMKHLGETNFYLCEINRDM--VIDATYKGNKSRYINHSCCPNTEMQKWIIDG------ 208
+ + ++ NFYL + + ++D KGN +R+ NHSC PN ++K+II G
Sbjct: 792 VTE-EYKATGNFYLLDYDSAAGELLDGGRKGNITRFANHSCDPNCRIEKFIICGTDEALS 850
Query: 209 -ETRIGIFATRDIKKGENLTYDYQY 232
E +IG+FA RDI +GE LTY+Y +
Sbjct: 851 AEFQIGLFANRDIAEGEELTYNYGW 875
>gi|406694205|gb|EKC97537.1| hypothetical protein A1Q2_08152 [Trichosporon asahii var. asahii
CBS 8904]
Length = 1192
Score = 105 bits (262), Expect = 2e-20, Method: Composition-based stats.
Identities = 71/200 (35%), Positives = 101/200 (50%), Gaps = 12/200 (6%)
Query: 38 KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGC 97
K + Y I N +L +R K D + C C G C + SC +G
Sbjct: 471 KKRPPQYEMITSNQFL-ERPKIPASDKEV-CHCVVGTGCGLSCTNRLQGILCGKSCPNGP 528
Query: 98 KCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL 157
CGN L + RP K + + + G G A EDI GEFV++Y GEVI T +R+
Sbjct: 529 DCGNQALCR----RPAKAIIVALSSLHGYGEFAAEDIAAGEFVVDYRGEVISVDTFIDRI 584
Query: 158 WKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKW--IIDG--ETRIG 213
H + + + +RD +ID++ KGN +R+INHSC PN ++K+ + DG E G
Sbjct: 585 G--SHSEDDSVFAIAYDRDEIIDSSSKGNSARFINHSCDPNLVLRKFDTLGDGHEEHEFG 642
Query: 214 IFATRDIKKGENLTYDYQYE 233
+++ R IK GE LTYDY E
Sbjct: 643 LWSRRPIKAGEELTYDYNAE 662
>gi|308162821|gb|EFO65192.1| Histone methyltransferase HMT1 [Giardia lamblia P15]
Length = 298
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 69/227 (30%), Positives = 102/227 (44%), Gaps = 41/227 (18%)
Query: 41 AIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCG 100
++ Y IKRNIY+ KR FC+C G+ C C + C C G
Sbjct: 52 SLHYTHIKRNIYVG--CKRPSAARKTFCTCACKEGAG--CGASCELRKVHLECYKECCAG 107
Query: 101 NSCLNKPF----------------------------QNRPVKKMKLVQTE-----KCGAG 127
C + N+ +++M+ +T + G G
Sbjct: 108 TVCFKQAIVNPLFNSLIDPPRDIAESYKDSSKFIDCGNQRLQRMQYARTAVYPAGRKGYG 167
Query: 128 IVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNK 187
+ A I+RG V EYVGEVI + C R K + + Y ++ ++ IDA +KGN+
Sbjct: 168 LFALNSIQRGALVTEYVGEVITREECLRR----KRSAKGHLYFLALDGELYIDAAHKGNE 223
Query: 188 SRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEF 234
SR+INHSC PN E+Q W + E R I A R I E L++DY+++F
Sbjct: 224 SRFINHSCDPNCEVQLWYVGEEPRAAIVALRSIAPHEELSFDYKFDF 270
>gi|443714650|gb|ELU06966.1| hypothetical protein CAPTEDRAFT_176480 [Capitella teleta]
Length = 936
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 69/216 (31%), Positives = 108/216 (50%), Gaps = 10/216 (4%)
Query: 33 PDWFIKPKAIPYVFIKRNIYL-TKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLS 91
P PK Y I+ N+Y+ TK + E C + P C C M+ +
Sbjct: 141 PQKLASPK---YKKIRHNVYVDTKPQCKGWEPHPCSCRIPSDPNEPA-CGDYCLNRMVYT 196
Query: 92 SCSSG-CKCGNSCLNKPFQ-NRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVID 149
C++G C CG+ C N+ Q + + ++ T G G+ + + G+++ EY+GEV+
Sbjct: 197 ECNAGACPCGDRCSNQRIQRHHHAEGLEKFVTADRGHGVRSKHPLVNGQYICEYLGEVVS 256
Query: 150 DQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGE 209
++ R+ + + Y ++ VID G+ SR+INHSC PN EMQKW I+G
Sbjct: 257 EEEFRRRMAD-DYSAAPHHYCLNLDSGTVIDGYRMGSISRFINHSCEPNCEMQKWNINGV 315
Query: 210 TRIGIFATRDIKKGENLTYDYQYEF--LHDSLIAYC 243
RI +F+ +DI GE LTYDY ++ +H I C
Sbjct: 316 YRIALFSLKDIPPGEELTYDYNFQSYNVHSQQICKC 351
>gi|330920531|ref|XP_003299046.1| hypothetical protein PTT_09957 [Pyrenophora teres f. teres 0-1]
gi|311327461|gb|EFQ92879.1| hypothetical protein PTT_09957 [Pyrenophora teres f. teres 0-1]
Length = 791
Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats.
Identities = 65/196 (33%), Positives = 97/196 (49%), Gaps = 17/196 (8%)
Query: 59 RRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCG-NSCLNKPFQ------- 109
+R + D C C G C CH ++ C ++ C C N+PF
Sbjct: 364 KRDKQDSSQCYCDVEDG----CGEACHNRIMAYECDNTNCPLPPEQCNNRPFAELKRRAK 419
Query: 110 -NRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNF 168
NR ++++ TE G G+ A + + ++EY GE+I CE R+ K + + +
Sbjct: 420 GNRYDYGVEVLDTEDRGYGVRAMRTFEPHQIIVEYAGEIITQSECERRM-KQVYKKDKCY 478
Query: 169 YLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLT 227
YL + M+IDAT +G +R++NHSC PN EM KW + GE R+ +FA R I GE LT
Sbjct: 479 YLMSFDNKMIIDAT-RGTIARFVNHSCEPNCEMIKWTVGGEPRMALFAGPRGIMTGEELT 537
Query: 228 YDYQYEFLHDSLIAYC 243
YDY ++ I C
Sbjct: 538 YDYNFDPFSQKNIQEC 553
>gi|296413387|ref|XP_002836395.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295630213|emb|CAZ80586.1| unnamed protein product [Tuber melanosporum]
Length = 901
Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats.
Identities = 57/180 (31%), Positives = 96/180 (53%), Gaps = 18/180 (10%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCG-NSCLNKPFQN---------RPVKKM 116
C C G CD +C C G C G C N+ F + + + +
Sbjct: 395 CVCAEEVG----CDENCLNMCTWVECDEGNCNVGVERCTNRAFADLKERVKSGTKFAEGV 450
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
++V+T CG G+ A + ++EY GE+I + E R+ ++ + N+YL +++
Sbjct: 451 EVVKTANCGHGLRATRGFMPNQIIVEYTGEIITQEESERRMVEV-YKDNKNYYLMLFHQN 509
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEFL 235
M++DAT +G+ +R++NHSC PN M+KW+++G R+ +FA D I+ GE LTYDY + +
Sbjct: 510 MILDAT-RGSVARFVNHSCDPNCRMEKWLVEGRPRMALFAGDDGIEAGEELTYDYNFNWF 568
>gi|346977253|gb|EGY20705.1| hypothetical protein VDAG_10334 [Verticillium dahliae VdLs.17]
Length = 787
Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats.
Identities = 58/178 (32%), Positives = 97/178 (54%), Gaps = 18/178 (10%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNS-CLNKPFQNRPVKKM--------- 116
C C G CD DC ++L C + C G + C N+ F + +K+
Sbjct: 337 CVCKPQDG----CDEDCQNRIMLYECDENNCNVGKAFCTNRAFADLHERKVAGGKYRTGV 392
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
++++T G GI ++ + ++EY GE+I ++ CE R+ K+ +YL +++
Sbjct: 393 EVIKTSDRGHGIRSNRCFDANQIIMEYTGEIITEEECENRM-NTKYKNNDCYYLMSFDQN 451
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
M+IDAT G+ +R++NHSC PN +M KWI+ G+ R+ +FA I G+ LTYDY ++
Sbjct: 452 MIIDAT-TGSIARFVNHSCKPNCKMIKWIVGGQPRMALFAGDAPIMTGDELTYDYNFD 508
>gi|115402571|ref|XP_001217362.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114189208|gb|EAU30908.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 790
Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats.
Identities = 56/177 (31%), Positives = 93/177 (52%), Gaps = 17/177 (9%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK---------MK 117
C CT G CD +C + C G C G C N+ F + ++
Sbjct: 390 CMCTEETG----CDENCQNRYMFYECDDGNCSLGPGCGNRNFNELKQRTKAGGKYNIGVE 445
Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
+++T G G+ ++ + ++EY GE+I CE+R+ + E +YL +++M
Sbjct: 446 VIKTADRGYGVRSNRTFDPNQIIVEYTGEIITQSECEKRMRTIYKNNEC-YYLMYFDQNM 504
Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
+IDAT +G+ +R++NHSC PN M+KW + G+ R+ +FA R + GE LTYDY ++
Sbjct: 505 IIDAT-RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDRGVMTGEELTYDYNFD 560
>gi|358346394|ref|XP_003637253.1| Histone-lysine N-methyltransferase SETDB1, partial [Medicago
truncatula]
gi|355503188|gb|AES84391.1| Histone-lysine N-methyltransferase SETDB1, partial [Medicago
truncatula]
Length = 338
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/164 (37%), Positives = 90/164 (54%), Gaps = 15/164 (9%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
G L+ C+ CKC +C N+ QN K+++ +TEK G G+ A E I RG FV EY+GE
Sbjct: 156 GYLVYECNEECKCDKTCPNRILQNGIHVKLEVFKTEKKGWGVRACEAISRGTFVCEYIGE 215
Query: 147 VIDDQTCEERLWKMKHLGETNFY-----------LCEINRDMVIDATYKGNKSRYINHSC 195
V+D+Q R + FY L E VID+T GN SR+IN+SC
Sbjct: 216 VLDEQEARNRRERYGKEHCDYFYDVDARVNDMSRLIEREARYVIDSTRYGNVSRFINNSC 275
Query: 196 CPNTEMQKWIIDG----ETRIGIFATRDIKKGENLTYDYQYEFL 235
PN + +++ + IG++A++DI KG+ LTY+Y YE +
Sbjct: 276 SPNLVNYQVLVESMDCKRSHIGLYASQDIAKGDELTYNYHYELV 319
>gi|171676169|ref|XP_001903038.1| hypothetical protein [Podospora anserina S mat+]
gi|170936150|emb|CAP60810.1| unnamed protein product [Podospora anserina S mat+]
Length = 894
Score = 104 bits (260), Expect = 3e-20, Method: Composition-based stats.
Identities = 59/185 (31%), Positives = 96/185 (51%), Gaps = 18/185 (9%)
Query: 61 LEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQNRPVKKMK- 117
ED C C G C DC ++L C + C G C N+ FQ+ + K
Sbjct: 433 FEDFSSKCVCKPEDG----CAEDCQNRIMLYECDDTNCNAGREFCQNRAFQDLQERTKKG 488
Query: 118 --------LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFY 169
+++T G G+ + + + ++EY GE+I ++ CE R+ K+ +Y
Sbjct: 489 GRFRVGVEVLKTSDRGYGVRSTRCFEPNQIIMEYTGEIITEEECERRM-NEKYKDNECYY 547
Query: 170 LCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTY 228
L +++M+IDAT G+ +R++NHSC PN M KWI+ G+ R+ +FA + I G+ LTY
Sbjct: 548 LMSFDQNMIIDAT-TGSMARFVNHSCSPNCRMIKWIVSGQPRMALFAGDKPIMTGDELTY 606
Query: 229 DYQYE 233
DY ++
Sbjct: 607 DYNFD 611
>gi|359476736|ref|XP_002269759.2| PREDICTED: histone-lysine N-methyltransferase SUVR5-like [Vitis
vinifera]
Length = 1517
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 64/166 (38%), Positives = 91/166 (54%), Gaps = 18/166 (10%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
G L+ C+ C C +C N+ QN K+++ +TE+ G + A E I RG F+ EY+GE
Sbjct: 1334 GYLVYECNGKCSCNRTCQNRVLQNGVRVKLEVFRTEEKGWAVRAGEAILRGTFICEYIGE 1393
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEIN---RDM----------VIDATYKGNKSRYINH 193
V+ +Q ++R +H E Y +I+ DM VIDAT GN SR+INH
Sbjct: 1394 VLSEQEADKR-GNNRHGEEGCSYFYDIDSHINDMSRLVEGQVPYVIDATRYGNVSRFINH 1452
Query: 194 SCCPNTEMQKWIIDGE----TRIGIFATRDIKKGENLTYDYQYEFL 235
SC PN + +++ IG+FA RDI GE LTYDY+Y+ L
Sbjct: 1453 SCSPNLINHQVLVESMDCQLAHIGLFANRDISLGEELTYDYRYKPL 1498
>gi|320168697|gb|EFW45596.1| Setd1a protein [Capsaspora owczarzaki ATCC 30864]
Length = 1312
Score = 104 bits (259), Expect = 4e-20, Method: Composition-based stats.
Identities = 51/132 (38%), Positives = 79/132 (59%), Gaps = 3/132 (2%)
Query: 115 KMKLVQTEKCGA---GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLC 171
+ KL++ ++ G G+ + E+I + VIEYVGEVI + R + G + Y
Sbjct: 1171 RKKLLKFQRSGIHAFGLFSQENISANDLVIEYVGEVIRQSISDIREHHYERRGIGSSYFF 1230
Query: 172 EINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQ 231
I+ D V+DATYKGN +R++NH C PN + ++DG RI I++ RDIKKGE +TYDY+
Sbjct: 1231 RIDEDHVVDATYKGNLARFMNHCCEPNCYAKIIMVDGHQRIVIYSKRDIKKGEEITYDYK 1290
Query: 232 YEFLHDSLIAYC 243
+ + + + C
Sbjct: 1291 FPYEENKIPCLC 1302
>gi|268566865|ref|XP_002639833.1| C. briggsae CBR-MET-1 protein [Caenorhabditis briggsae]
Length = 2074
Score = 104 bits (259), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 68/190 (35%), Positives = 103/190 (54%), Gaps = 18/190 (9%)
Query: 52 YLTKRI-KRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSCLNKPFQN 110
YLT+ K+R E + C C + G+ D C +++ C S C C N+ F
Sbjct: 672 YLTRSANKKRTE--SLLCECGRTGGT--CSDNTCVNRAMMTECPSSCTA--KCKNQRFAK 725
Query: 111 RPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGET--- 166
+ + ++ T G G+ A +DIK+G F+IEY+GEV++ E+R K+K+ +
Sbjct: 726 KKIASVEAYHTGTAKGCGLRALKDIKKGRFIIEYIGEVVERDDYEKR--KLKYAADKKHK 783
Query: 167 NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGE----TRIGIFATRDIKK 222
+ YLC+ IDAT GN SR++NHSC PN +KW + +RIG FA + IK
Sbjct: 784 HHYLCDTG-IYTIDATVYGNASRFVNHSCDPNAVCEKWSVPKTPGDISRIGFFAKKFIKS 842
Query: 223 GENLTYDYQY 232
GE +T+DYQ+
Sbjct: 843 GEEITFDYQF 852
>gi|396469794|ref|XP_003838493.1| hypothetical protein LEMA_P114290.1 [Leptosphaeria maculans JN3]
gi|312215061|emb|CBX95014.1| hypothetical protein LEMA_P114290.1 [Leptosphaeria maculans JN3]
Length = 880
Score = 104 bits (259), Expect = 4e-20, Method: Composition-based stats.
Identities = 63/193 (32%), Positives = 96/193 (49%), Gaps = 17/193 (8%)
Query: 62 EDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCG-NSCLNKPFQ--------NR 111
+ D C C G C CH ++ C S+ C+ C N+PF NR
Sbjct: 371 KQDSSQCYCGPEDG----CGEACHNRIMAYECDSTNCRLSPEQCGNRPFAELKRRAKGNR 426
Query: 112 PVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLC 171
+++V T G G+ A + + ++EY GE+I CE R+ K + + +YL
Sbjct: 427 YDYGVEVVDTHDRGFGVRAMRTFEPHQIIVEYAGEIITQWECERRM-KQVYKKDKCYYLM 485
Query: 172 EINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDY 230
+ M+IDAT +G +R++NHSC PN EM KW + GE R+ +FA +R + G+ LTYDY
Sbjct: 486 SFDNKMIIDAT-RGTIARFVNHSCEPNCEMIKWTVGGEPRMALFAGSRGVMTGDELTYDY 544
Query: 231 QYEFLHDSLIAYC 243
++ I C
Sbjct: 545 NFDPFSQKNIQVC 557
>gi|297735229|emb|CBI17591.3| unnamed protein product [Vitis vinifera]
Length = 1315
Score = 104 bits (259), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 64/166 (38%), Positives = 91/166 (54%), Gaps = 18/166 (10%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
G L+ C+ C C +C N+ QN K+++ +TE+ G + A E I RG F+ EY+GE
Sbjct: 1132 GYLVYECNGKCSCNRTCQNRVLQNGVRVKLEVFRTEEKGWAVRAGEAILRGTFICEYIGE 1191
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEIN---RDM----------VIDATYKGNKSRYINH 193
V+ +Q ++R +H E Y +I+ DM VIDAT GN SR+INH
Sbjct: 1192 VLSEQEADKR-GNNRHGEEGCSYFYDIDSHINDMSRLVEGQVPYVIDATRYGNVSRFINH 1250
Query: 194 SCCPNTEMQKWIIDGE----TRIGIFATRDIKKGENLTYDYQYEFL 235
SC PN + +++ IG+FA RDI GE LTYDY+Y+ L
Sbjct: 1251 SCSPNLINHQVLVESMDCQLAHIGLFANRDISLGEELTYDYRYKPL 1296
>gi|392560212|gb|EIW53395.1| hypothetical protein TRAVEDRAFT_154887 [Trametes versicolor FP-101664
SS1]
Length = 1014
Score = 104 bits (259), Expect = 4e-20, Method: Composition-based stats.
Identities = 53/117 (45%), Positives = 69/117 (58%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A E I RGE VIEYVGEVI Q ++R + G + YL I+ D+V+DAT KGN
Sbjct: 888 GLYAMEKISRGEMVIEYVGEVIRAQVADKREKAYERQGIGSSYLFRIDEDLVVDATKKGN 947
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
R INHSC PN + I GE +I I+A +DI+ G +TYDY + D + C
Sbjct: 948 LGRLINHSCDPNCTAKIITISGEKKIVIYAKQDIELGSEITYDYHFPIEQDKIPCLC 1004
>gi|407918366|gb|EKG11637.1| hypothetical protein MPH_11130 [Macrophomina phaseolina MS6]
Length = 980
Score = 103 bits (258), Expect = 4e-20, Method: Composition-based stats.
Identities = 64/204 (31%), Positives = 100/204 (49%), Gaps = 26/204 (12%)
Query: 58 KRRLEDDGIFCSCTASPGSSGVCDRD-CHCGMLLSSCSS-GCKCGNSCLNKPFQN----- 110
K++ + + C+C G C+RD C L C C G +C N+ F N
Sbjct: 436 KKQRQMEASTCNCV------GRCERDGCFNASLFFECDDRSCNLGPNCGNRQFNNLQKRY 489
Query: 111 ----RPVKKMK-------LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
R K K +++T G G+ A + + V+EY+GE+I Q + R+ +
Sbjct: 490 KDDTRQGKYFKSFNVGVEVMETPDRGHGVRAMRPFHQDQIVVEYIGEIITQQESDRRVNE 549
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD 219
+ + FYL ++ID Y+GN +R++NHSC PN M+KW ++GE R+ +FA R+
Sbjct: 550 V-YKDHKAFYLMNFYDKLIIDG-YRGNVARFVNHSCDPNCRMEKWTVNGEQRMALFANRN 607
Query: 220 IKKGENLTYDYQYEFLHDSLIAYC 243
I GE LT+ Y +E YC
Sbjct: 608 IMTGEELTWHYNFESYGKEQPCYC 631
>gi|389646281|ref|XP_003720772.1| hypothetical protein MGG_02937 [Magnaporthe oryzae 70-15]
gi|351638164|gb|EHA46029.1| hypothetical protein MGG_02937 [Magnaporthe oryzae 70-15]
Length = 1015
Score = 103 bits (258), Expect = 4e-20, Method: Composition-based stats.
Identities = 66/200 (33%), Positives = 105/200 (52%), Gaps = 20/200 (10%)
Query: 46 FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS-C 103
F+ ++ L K K D+ C CT G C DC +L C+ + C G C
Sbjct: 448 FVGESLSLWK--KNYYYDNRSTCVCTKDDG----CGEDCLNRSVLYECNDTNCNVGREHC 501
Query: 104 LNKPFQNRPVKKMK---------LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCE 154
N+ FQ+ + K +V T G G+ A + G+ ++EY GE+I ++ CE
Sbjct: 502 QNRAFQDLQDRNKKGGSYRVGVEVVHTGPRGFGVRASRCFEPGQIIMEYAGEIITEEECE 561
Query: 155 ERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGI 214
R+ ++ E +YL +++M++DAT G+ +R++NHSC PN M KWI+ G+ R+ +
Sbjct: 562 RRMNEVYKDNEA-YYLMSFDQNMILDAT-TGSIARFVNHSCSPNCRMIKWIVCGKPRMAL 619
Query: 215 FATRD-IKKGENLTYDYQYE 233
FA + I GE LTYDY ++
Sbjct: 620 FAGDNPIMTGEELTYDYNFD 639
>gi|449448546|ref|XP_004142027.1| PREDICTED: histone-lysine N-methyltransferase SUVR5-like [Cucumis
sativus]
Length = 1406
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 58/163 (35%), Positives = 89/163 (54%), Gaps = 19/163 (11%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
G L+ C+ C C +C N+ QN K+++ TE G + A E I RG FV EYVGE
Sbjct: 1245 GYLVYECNERCSCSRACPNRVLQNGVHVKLEVFMTETKGWAVRAGEAIMRGTFVCEYVGE 1304
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDM-------------VIDATYKGNKSRYINH 193
V+D+Q R + K+ E N Y +++ + +IDAT+ GN SR+INH
Sbjct: 1305 VLDEQEANRR--RDKYNSEGNCYFLDVDAHINDISRLVDGSARYIIDATHYGNVSRFINH 1362
Query: 194 SCCPNTEMQKWIIDG----ETRIGIFATRDIKKGENLTYDYQY 232
SC PN + +++ + IG++A R+I GE LT++Y++
Sbjct: 1363 SCSPNLVTYQVLVESMEYQRSHIGLYANRNIATGEELTFNYRW 1405
>gi|425778490|gb|EKV16615.1| Histone-lysine N-methyltransferase (Ash1), putative [Penicillium
digitatum PHI26]
gi|425784214|gb|EKV22005.1| Histone-lysine N-methyltransferase (Ash1), putative [Penicillium
digitatum Pd1]
Length = 770
Score = 103 bits (258), Expect = 5e-20, Method: Composition-based stats.
Identities = 55/187 (29%), Positives = 99/187 (52%), Gaps = 17/187 (9%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK---------MK 117
C CT G CD +C + C +G C+ G C N+ F+ + ++
Sbjct: 389 CMCTEDTG----CDENCQNRYMFYECDNGNCRLGPECGNRNFEGLKHRTKAGGKYNIGVE 444
Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
+++T G G+ ++ + ++EY GE++ CE+R+ + E +YL +++M
Sbjct: 445 VIKTADRGYGVRSNRSFDPNQIIVEYTGEILTQLECEKRMRTVYKNNEC-YYLMYFDQNM 503
Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYEFLH 236
+IDAT +G+ +R++NH+C PN M+KW + G+ R+ +FA R I GE L+YDY ++
Sbjct: 504 IIDAT-RGSIARFVNHACEPNCRMEKWTVAGKPRMALFAGDRGISTGEELSYDYNFDPYS 562
Query: 237 DSLIAYC 243
+ + C
Sbjct: 563 NKNVQQC 569
>gi|452836869|gb|EME38812.1| hypothetical protein DOTSEDRAFT_83490 [Dothistroma septosporum
NZE10]
Length = 1236
Score = 103 bits (258), Expect = 5e-20, Method: Composition-based stats.
Identities = 60/184 (32%), Positives = 105/184 (57%), Gaps = 15/184 (8%)
Query: 68 CSCTASPGSSGV-CDRDCHCGMLLSSCSS-GCKC-GNSCLNKPFQ--NRPVKK------- 115
C C+A + CD C ++ C+ C +C N+PF +R +KK
Sbjct: 550 CVCSAPTEEDDIGCDETCLNRVMQYECNEDNCALDAATCSNRPFAEIDRRLKKGGSFDIG 609
Query: 116 MKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINR 175
+++V+T+K G GI + + + ++EY GE+I + C+ R+ + ++ + N+YL E+ R
Sbjct: 610 VEVVKTDKRGFGIRSTRSFRPDQIIMEYTGEIISEGECQRRM-REEYKDKPNYYLMELER 668
Query: 176 DMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEF 234
+VID T KG+ +R+INH+C PN ++ + ++G R+G+FA + I GE LTYDY ++
Sbjct: 669 GLVIDGT-KGSMARFINHACEPNCTVKMFRVNGVARMGVFAGKSGIMTGEELTYDYNFDN 727
Query: 235 LHDS 238
+S
Sbjct: 728 FGES 731
>gi|170095481|ref|XP_001878961.1| histone methyltransferase [Laccaria bicolor S238N-H82]
gi|164646265|gb|EDR10511.1| histone methyltransferase [Laccaria bicolor S238N-H82]
Length = 144
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 54/117 (46%), Positives = 72/117 (61%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A E I +GE VIEYVGEVI Q E+R + G + YL I+ D+V+DAT KGN
Sbjct: 18 GLYAMEKISKGEMVIEYVGEVIRAQVAEKREKTYERQGIGSSYLFRIDEDLVVDATKKGN 77
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
R INHSC PN + I GE +I I+A +DI+ G+ +TYDY + F D ++ C
Sbjct: 78 LGRLINHSCDPNCTAKIITISGEKKIVIYAKQDIELGDEITYDYHFPFEQDKILCLC 134
>gi|71000549|ref|XP_754958.1| histone-lysine N-methyltransferase (Ash1) [Aspergillus fumigatus
Af293]
gi|66852595|gb|EAL92920.1| histone-lysine N-methyltransferase (Ash1), putative [Aspergillus
fumigatus Af293]
gi|159127971|gb|EDP53086.1| histone-lysine N-methyltransferase (Ash1), putative [Aspergillus
fumigatus A1163]
Length = 845
Score = 103 bits (257), Expect = 6e-20, Method: Composition-based stats.
Identities = 56/177 (31%), Positives = 94/177 (53%), Gaps = 17/177 (9%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK---------MK 117
C CT G CD +C + C G C G C N+ F+ + ++
Sbjct: 418 CMCTEETG----CDENCQNRYMFYECDDGNCGLGPGCGNRSFEELKQRTKAGGKYNIGVE 473
Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
+++T G G+ ++ + + ++EY GE+I CE+R+ + E +YL +++M
Sbjct: 474 VIKTADRGYGVRSNRTFEPNQIIVEYTGEIITQAECEKRMRTIYKNNEC-YYLMYFDQNM 532
Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
+IDAT +G+ +R++NHSC PN M+KW + G+ R+ +FA I GE LTYDY ++
Sbjct: 533 IIDAT-RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDHGIMTGEELTYDYNFD 588
>gi|255945051|ref|XP_002563293.1| Pc20g07700 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211588028|emb|CAP86099.1| Pc20g07700 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 788
Score = 103 bits (257), Expect = 6e-20, Method: Composition-based stats.
Identities = 53/187 (28%), Positives = 100/187 (53%), Gaps = 17/187 (9%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK---------MK 117
C CT G CD +C + C +G C+ G C N+ F+ + ++
Sbjct: 389 CMCTEDTG----CDENCQNRYMFYECDNGNCRLGPECGNRNFEGLKQRTKAGGKYNIGVE 444
Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
+++T G G+ ++ + + ++EY GE++ CE+R+ + + +YL +++M
Sbjct: 445 VIKTADRGYGVRSNRSFEPNQIIVEYTGEILTQLECEKRM-RTVYKNNDCYYLMYFDQNM 503
Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYEFLH 236
+IDAT +G+ +R++NH+C PN M+KW + G+ R+ +FA R + GE L+YDY ++
Sbjct: 504 IIDAT-RGSIARFVNHACEPNCRMEKWTVAGKPRMALFAGDRGVSTGEELSYDYNFDPYS 562
Query: 237 DSLIAYC 243
+ + C
Sbjct: 563 NKNVQEC 569
>gi|356502205|ref|XP_003519911.1| PREDICTED: histone-lysine N-methyltransferase SUVR5-like [Glycine
max]
Length = 1494
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 62/164 (37%), Positives = 86/164 (52%), Gaps = 15/164 (9%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
G L+ C+ CKC +C N+ QN K+++ +TEK G + A E I RG FV EY+GE
Sbjct: 1312 GYLVYECNQMCKCNKTCPNRILQNGIRIKLEVFKTEKKGWAVRAGEAILRGTFVCEYIGE 1371
Query: 147 VIDDQTCEERLWKMKHLGETNFY-----------LCEINRDMVIDATYKGNKSRYINHSC 195
V+D Q + R + + FY L E VID T GN SR+IN+SC
Sbjct: 1372 VLDKQEAQNRRKRYGKEHCSYFYDVDDHVNDMGRLIEGQAHYVIDTTRFGNVSRFINNSC 1431
Query: 196 CPNTEMQKWIIDG----ETRIGIFATRDIKKGENLTYDYQYEFL 235
PN + +++ IG++A RDI GE LTY+Y Y+ L
Sbjct: 1432 SPNLVSYQVLVESMDCERAHIGLYANRDIALGEELTYNYHYDLL 1475
>gi|238505954|ref|XP_002384179.1| histone-lysine N-methyltransferase (Ash1), putative [Aspergillus
flavus NRRL3357]
gi|220690293|gb|EED46643.1| histone-lysine N-methyltransferase (Ash1), putative [Aspergillus
flavus NRRL3357]
gi|391868614|gb|EIT77824.1| hypothetical protein Ao3042_05901 [Aspergillus oryzae 3.042]
Length = 789
Score = 103 bits (257), Expect = 7e-20, Method: Composition-based stats.
Identities = 71/240 (29%), Positives = 123/240 (51%), Gaps = 33/240 (13%)
Query: 16 AFNKLLKQIGNPVEFELP-DWFI-----KPKAIPYVFIKRNIYLTK-----RIKRRLEDD 64
A ++LLKQ +F+LP D F +PK + +N+++ + R + LE
Sbjct: 322 AGDRLLKQ---GRDFQLPFDIFSPLPSGQPKPNEWRKTNKNVFVGEASSIWRANKPLELS 378
Query: 65 GIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKK-------- 115
C C G CD +C + C + C G C N+ F+ +
Sbjct: 379 K--CMCAEETG----CDEECQNRYMFYECDDTNCGVGPECGNRNFEELKQRTKAGGKYNI 432
Query: 116 -MKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEIN 174
+++++TE G G+ ++ + + ++EY GE+I CE+R+ + E +YL +
Sbjct: 433 GVEVIKTEDRGYGVRSNRTFEPNQVIVEYTGEIITQAECEKRMRTIYKNNEC-YYLMYFD 491
Query: 175 RDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
++M+IDAT +G+ +R++NHSC PN M+KW + G+ R+ +FA R I G+ LTYDY ++
Sbjct: 492 QNMIIDAT-RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDRGIMTGDELTYDYNFD 550
>gi|317151143|ref|XP_001824470.2| histone-lysine N-methyltransferase (Ash1) [Aspergillus oryzae
RIB40]
Length = 796
Score = 103 bits (257), Expect = 7e-20, Method: Composition-based stats.
Identities = 70/238 (29%), Positives = 122/238 (51%), Gaps = 33/238 (13%)
Query: 18 NKLLKQIGNPVEFELP-DWFI-----KPKAIPYVFIKRNIYLTK-----RIKRRLEDDGI 66
++LLKQ +F+LP D F +PK + +N+++ + R + LE
Sbjct: 331 DRLLKQ---GRDFQLPFDIFSPLPSGQPKPNEWRKTNKNVFVGEASSIWRANKPLELSK- 386
Query: 67 FCSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNSCLNKPFQNRPVKK---------M 116
C C G CD +C + C + C G C N+ F+ + +
Sbjct: 387 -CMCAEETG----CDEECQNRYMFYECDDTNCGVGPECGNRNFEELKQRTKAGGKYNIGV 441
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
++++TE G G+ ++ + + ++EY GE+I CE+R+ + E +YL +++
Sbjct: 442 EVIKTEDRGYGVRSNRTFEPNQVIVEYTGEIITQAECEKRMRTIYKNNEC-YYLMYFDQN 500
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
M+IDAT +G+ +R++NHSC PN M+KW + G+ R+ +FA R I G+ LTYDY ++
Sbjct: 501 MIIDAT-RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDRGIMTGDELTYDYNFD 557
>gi|357143271|ref|XP_003572863.1| PREDICTED: histone-lysine N-methyltransferase SUVR5-like
[Brachypodium distachyon]
Length = 1625
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 66/171 (38%), Positives = 90/171 (52%), Gaps = 16/171 (9%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
G + C+S C C SC NK Q + K++L TE G + A + I RG FV EYVGE
Sbjct: 1447 GFPVYECNSLCTCDASCQNKVLQQGLLVKLELFSTENKGWAVRAADPIPRGTFVCEYVGE 1506
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDM-------------VIDATYKGNKSRYINH 193
V+ D R + + GE + YL +IN + +IDAT GN SR+INH
Sbjct: 1507 VVKDDEAM-RNTEREAKGECS-YLLQINSHIDQERAKTLGTIPYMIDATRYGNVSRFINH 1564
Query: 194 SCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLH-DSLIAYC 243
SC PN + ++D +G+FA +DI GE L+YDY+ + L D YC
Sbjct: 1565 SCSPNLNTRLVLVDQLAHVGLFANQDIAVGEELSYDYRQKLLSGDGCPCYC 1615
>gi|113470945|gb|ABI34874.1| nuclear receptor binding SET domain protein 1b [Danio rerio]
Length = 119
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 68/102 (66%)
Query: 137 GEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCC 196
G F+ EYVGEVID++ C R+ + NFY+ +++D +IDA KGN++R++NH C
Sbjct: 1 GGFISEYVGEVIDEEECRARIKHAQENNIGNFYMLTLDKDRIIDAGPKGNEARFMNHCCQ 60
Query: 197 PNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
PN E QKW ++G+TR+G+F+ DI G LT++Y E L +
Sbjct: 61 PNRETQKWTVNGDTRVGLFSLTDIPAGTELTFNYNLECLGNG 102
>gi|357480379|ref|XP_003610475.1| Histone-lysine N-methyltransferase SETDB1 [Medicago truncatula]
gi|357497957|ref|XP_003619267.1| Histone-lysine N-methyltransferase SETDB1 [Medicago truncatula]
gi|355494282|gb|AES75485.1| Histone-lysine N-methyltransferase SETDB1 [Medicago truncatula]
gi|355511530|gb|AES92672.1| Histone-lysine N-methyltransferase SETDB1 [Medicago truncatula]
Length = 1047
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 62/167 (37%), Positives = 92/167 (55%), Gaps = 21/167 (12%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
G L+ C+ C+C +C N+ QN K+++ +TEK G G+ A E I RG FV EY+GE
Sbjct: 865 GYLVYECNDKCRCDKTCPNRILQNGIRVKLEVFKTEKKGWGVRAGEAISRGTFVCEYIGE 924
Query: 147 VIDDQTCEERLWKMKHLGETN---FY-----------LCEINRDMVIDATYKGNKSRYIN 192
V+++Q R K GE + FY L E +ID+T GN SR++N
Sbjct: 925 VLEEQEAHNRC---KSYGEEHCSYFYVVDARVNDMSRLIERQAQYIIDSTRYGNVSRFVN 981
Query: 193 HSCCPNTEMQKWIIDG----ETRIGIFATRDIKKGENLTYDYQYEFL 235
+SC PN + +++ +RIG++A+RDI GE LT +Y YE +
Sbjct: 982 NSCSPNLLSYQVLVESMDCKRSRIGLYASRDIAFGEELTCNYHYELV 1028
>gi|224005002|ref|XP_002296152.1| SET-domain containing protein [Thalassiosira pseudonana CCMP1335]
gi|209586184|gb|ACI64869.1| SET-domain containing protein [Thalassiosira pseudonana CCMP1335]
Length = 173
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 52/153 (33%), Positives = 84/153 (54%), Gaps = 2/153 (1%)
Query: 81 DRDCHCGMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFV 140
D C C + C G C NK Q R KK+++ G G++ E+ K+G+F+
Sbjct: 5 DETCALFACQEECGNNCAAGPLCGNKRIQRREWKKLQVFDAGLKGRGLMVGEECKKGDFI 64
Query: 141 IEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTE 200
EYVG + + + + K E Y+ ++ D+ +DA ++G +RYINHSC PN
Sbjct: 65 CEYVGVAVKRRYLDGLFARYKS--ERMLYIMALDGDIYLDARHRGGIARYINHSCEPNCA 122
Query: 201 MQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
+ +W + G R G+FA R+I +GE L++DYQ++
Sbjct: 123 VHRWKVRGIIRAGVFALREILEGEELSFDYQWD 155
>gi|119493172|ref|XP_001263805.1| histone-lysine N-methyltransferase (Ash1), putative [Neosartorya
fischeri NRRL 181]
gi|119411965|gb|EAW21908.1| histone-lysine N-methyltransferase (Ash1), putative [Neosartorya
fischeri NRRL 181]
Length = 839
Score = 103 bits (256), Expect = 8e-20, Method: Composition-based stats.
Identities = 56/177 (31%), Positives = 93/177 (52%), Gaps = 17/177 (9%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK---------MK 117
C CT G CD +C + C G C G C N+ F + ++
Sbjct: 418 CMCTEETG----CDENCQNRYMFYECDDGNCGLGPDCGNRSFDELKQRTKAGGKYNIGVE 473
Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDM 177
+++T G G+ ++ + + ++EY GE+I CE+R+ + E +YL +++M
Sbjct: 474 VIKTADRGYGVRSNRTFEPNQIIVEYTGEIITQAECEKRMRTIYKNNEC-YYLMYFDQNM 532
Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
+IDAT +G+ +R++NHSC PN M+KW + G+ R+ +FA I GE LTYDY ++
Sbjct: 533 IIDAT-RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDHGIMTGEELTYDYNFD 588
>gi|296804338|ref|XP_002843021.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
gi|238845623|gb|EEQ35285.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
Length = 719
Score = 103 bits (256), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 78/241 (32%), Positives = 128/241 (53%), Gaps = 35/241 (14%)
Query: 20 LLKQIGNPVEFELP-DWFI-----KPKAIPYVFIKRNIYLTKRI---KRRLEDDGIFCSC 70
LL + G P F+LP D F +PK + I +NI++ K + + C C
Sbjct: 318 LLLEKGRP--FKLPYDIFSPLPRGQPKPDEWRKINKNIFVGDAAGIWKATMPTEQSTCLC 375
Query: 71 TASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS-CLNKPF---QNRPVKK-------MKL 118
T G C +C +L C + CK G C N+ F +NR +K +++
Sbjct: 376 TPEMG----CGENCQNRHMLYECDDNNCKLGEDLCRNRNFAKLRNR-IKTGGKYNIGVEV 430
Query: 119 VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN--FYLCEINRD 176
++TE G G+ ++ + + ++EY GE++ + R MK + + N FYL + +++
Sbjct: 431 IKTESRGYGVRSNRTFEPNQIIVEYTGEILTQIEAQRR---MKTIYKKNECFYLMDFDQN 487
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEFL 235
M+IDAT +G+ +R++NHSC PN +M+KWI+ G+ RI +FA + I GE LTYDY +E+
Sbjct: 488 MIIDAT-RGSIARFVNHSCEPNCKMEKWIVAGKPRIALFAGENGIMTGEELTYDYNFEYY 546
Query: 236 H 236
Sbjct: 547 Q 547
>gi|395329295|gb|EJF61682.1| hypothetical protein DICSQDRAFT_85722 [Dichomitus squalens LYAD-421
SS1]
Length = 1095
Score = 103 bits (256), Expect = 9e-20, Method: Composition-based stats.
Identities = 52/117 (44%), Positives = 69/117 (58%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A E I RG+ VIEYVGEVI Q ++R + G + YL I+ D+V+DAT KGN
Sbjct: 969 GLYAMEKINRGDLVIEYVGEVIRAQVADKREKAYERQGIGSSYLFRIDEDLVVDATKKGN 1028
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
R INHSC PN + I GE +I I+A +DI+ G +TYDY + D + C
Sbjct: 1029 LGRLINHSCDPNCTAKIITISGEKKIVIYAKQDIELGSEITYDYHFPIEQDKIPCLC 1085
>gi|302682536|ref|XP_003030949.1| hypothetical protein SCHCODRAFT_77129 [Schizophyllum commune H4-8]
gi|300104641|gb|EFI96046.1| hypothetical protein SCHCODRAFT_77129 [Schizophyllum commune H4-8]
Length = 171
Score = 102 bits (255), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 54/117 (46%), Positives = 72/117 (61%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A E I RGE VIEYVGEVI Q ++R + G + YL I+ ++V+DAT KGN
Sbjct: 45 GLYAMERIARGEMVIEYVGEVIRAQVADKREATYERQGIGSSYLFRIDEEIVVDATKKGN 104
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
R INHSC PN + I+GE +I I+A RDI+ G+ +TYDY + F D + C
Sbjct: 105 LGRLINHSCDPNCTAKIITINGEKKIVIYAKRDIELGDEITYDYHFPFEQDKIPCLC 161
>gi|443894422|dbj|GAC71770.1| histone H3 (Lys4) methyltransferase complex, subunit SET1 [Pseudozyma
antarctica T-34]
Length = 1366
Score = 102 bits (255), Expect = 1e-19, Method: Composition-based stats.
Identities = 49/120 (40%), Positives = 75/120 (62%)
Query: 114 KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
K++K ++ G+ A E I G+ VIEYVGEV+ Q +ER + + G + YL +
Sbjct: 1225 KQLKFAKSPIHDWGLYAMELIPAGDMVIEYVGEVVRQQVADEREKQYERQGNFSTYLFRV 1284
Query: 174 NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
+ D+V+DAT+KGN +R +NH C PN + ++GE RI +FA I+ GE LTYDY+++
Sbjct: 1285 DDDLVVDATHKGNIARLMNHCCTPNCNAKILTLNGEKRIVLFAKSPIRPGEELTYDYKFQ 1344
>gi|336372757|gb|EGO01096.1| hypothetical protein SERLA73DRAFT_50848 [Serpula lacrymans var.
lacrymans S7.3]
Length = 260
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 54/117 (46%), Positives = 71/117 (60%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A E I RGE VIEYVGEVI Q ++R + G + YL I+ D+V+DAT KGN
Sbjct: 134 GLYAMERISRGEMVIEYVGEVIRAQVADKREKVYERQGIGSSYLFRIDEDLVVDATKKGN 193
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
R INHSC PN + I+GE +I I+A +DI+ GE +TYDY + D + C
Sbjct: 194 LGRLINHSCDPNCTAKIITINGEKKIVIYAKQDIELGEEITYDYHFPIEQDKIPCLC 250
>gi|307194684|gb|EFN76943.1| Histone-lysine N-methyltransferase ash1 [Harpegnathos saltator]
Length = 129
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 47/114 (41%), Positives = 70/114 (61%), Gaps = 1/114 (0%)
Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
G G+ IK G F++EYVGEV+ ++ + R+ ++ +T+ Y ++ +VID
Sbjct: 3 GLGVRTQLAIKSGNFILEYVGEVVSEREFKSRM-ATRYASDTHHYCLHLDGGLVIDGHRM 61
Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
G R++NHSC PN EMQKW + G R+ +FA+RDIK GE LTYDY + + S
Sbjct: 62 GGDGRFVNHSCEPNCEMQKWSVHGLPRMALFASRDIKPGEELTYDYNFALFNPS 115
>gi|403223606|dbj|BAM41736.1| uncharacterized protein TOT_040000118 [Theileria orientalis strain
Shintoku]
Length = 944
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 65/177 (36%), Positives = 96/177 (54%), Gaps = 13/177 (7%)
Query: 63 DDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSS-GCKCGN-SCLNKPFQNRPVKKMKLVQ 120
D C CT S C ++C M C+S C + +C NK FQN + K+KLV
Sbjct: 592 DTNTRCVCTKS------CGKECQNVMKNVECTSKNCALHDVNCGNKRFQNFLLPKLKLVY 645
Query: 121 TEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHL-----GETNFYLCEINR 175
E G G VA E+I+ E V EYVGEVI + L + ++Y+ ++++
Sbjct: 646 FEGKGIGAVATEEIRENELVCEYVGEVITQTDFHKSLASSSFAEIDDDNQCHWYVMKVHK 705
Query: 176 DMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
++ ID+T+ GN +R+INHSC PN + G R+G+FA+R I KGE +TY+Y +
Sbjct: 706 EVYIDSTHLGNVARFINHSCDPNCSSIPINVRGSYRMGVFASRKILKGEEVTYNYGF 762
>gi|46130624|ref|XP_389092.1| hypothetical protein FG08916.1 [Gibberella zeae PH-1]
Length = 786
Score = 102 bits (254), Expect = 1e-19, Method: Composition-based stats.
Identities = 55/178 (30%), Positives = 95/178 (53%), Gaps = 18/178 (10%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNS-CLNKPFQNRPVKK---------M 116
C C G C C ++L C C G C N+ F N ++ +
Sbjct: 407 CVCKPEDG----CGESCQNRIMLYECDEQNCNAGKKYCTNRAFANLTARRNRGGKYRVGV 462
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
++++T G G+ ++ + + ++EY GE+I ++ CE R+ ++ E +YL +++
Sbjct: 463 EVIKTSDRGYGVRSNRCFRPNQIIMEYAGEIITEEECERRMTEVYKDNEC-YYLMSFDQN 521
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
M+IDAT G+ +R++NHSC PN M KWI+ G+ R+ +FA + I G+ LTYDY ++
Sbjct: 522 MIIDAT-TGSIARFVNHSCNPNCRMIKWIVSGQPRMALFAGDKPIMTGDELTYDYNFD 578
>gi|408391575|gb|EKJ70949.1| hypothetical protein FPSE_08917 [Fusarium pseudograminearum CS3096]
Length = 786
Score = 102 bits (254), Expect = 2e-19, Method: Composition-based stats.
Identities = 55/178 (30%), Positives = 95/178 (53%), Gaps = 18/178 (10%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNS-CLNKPFQNRPVKK---------M 116
C C G C C ++L C C G C N+ F N ++ +
Sbjct: 407 CVCKPEDG----CAESCQNRIMLYECDEQNCNAGKKYCTNRAFANLTARRNRGGKYRVGV 462
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
++++T G G+ ++ + + ++EY GE+I ++ CE R+ ++ E +YL +++
Sbjct: 463 EVIKTSDRGYGVRSNRCFRPNQIIMEYAGEIITEEECERRMTEVYKDNEC-YYLMSFDQN 521
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
M+IDAT G+ +R++NHSC PN M KWI+ G+ R+ +FA + I G+ LTYDY ++
Sbjct: 522 MIIDAT-TGSIARFVNHSCNPNCRMIKWIVSGQPRMALFAGDKPIMTGDELTYDYNFD 578
>gi|169595256|ref|XP_001791052.1| hypothetical protein SNOG_00365 [Phaeosphaeria nodorum SN15]
gi|160701054|gb|EAT91860.2| hypothetical protein SNOG_00365 [Phaeosphaeria nodorum SN15]
Length = 950
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 70/199 (35%), Positives = 99/199 (49%), Gaps = 21/199 (10%)
Query: 46 FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNS-C 103
FI L KR K+ D C C A G C CH ++ C ++ C G C
Sbjct: 762 FIGEASALWKRDKQ----DASQCYCDAEDG----CGEACHNRIMAYECDNTNCPLGPELC 813
Query: 104 LNKPFQ--------NRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEE 155
N+PF NR +++ T G G+ A + + ++EY GE+I CE
Sbjct: 814 GNRPFAELKRRAKGNRYDYGVEVTDTPDRGYGVRAMRMFEPHQIIVEYAGEIITQSECER 873
Query: 156 RLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIF 215
R+ K + + +YL + M+IDAT +G +R++NHSC PN EM KW + GE R+ +F
Sbjct: 874 RM-KQVYKKDKCYYLMSFDNKMIIDAT-RGTIARFVNHSCEPNCEMIKWTVGGEPRMALF 931
Query: 216 A-TRDIKKGENLTYDYQYE 233
A R I GE LTYDY +E
Sbjct: 932 AGPRGIMTGEELTYDYNFE 950
>gi|453080036|gb|EMF08088.1| hypothetical protein SEPMUDRAFT_152375 [Mycosphaerella populorum
SO2202]
Length = 881
Score = 102 bits (253), Expect = 2e-19, Method: Composition-based stats.
Identities = 60/199 (30%), Positives = 106/199 (53%), Gaps = 19/199 (9%)
Query: 58 KRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCG-NSCLNKPFQNRPVKK 115
K + +D +C+C G CD C ++ C S C+ ++C N+PF ++
Sbjct: 369 KNKNQDYVSYCTCLPETG----CDESCLNAVMGYECDESNCRLEPDNCSNRPFSELERRR 424
Query: 116 MK---------LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGET 166
K +V+T+ G G+ A + G+ ++EY GE+I + C+ER+ +L T
Sbjct: 425 TKGGRYDIGVEVVKTKNRGHGVRAARPFQPGQLIMEYTGEIITEDECQERM-ATTYLNAT 483
Query: 167 NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGEN 225
N+++ E+ +++D KG+++R+INHSC PN E++ + +R+G++A I G+
Sbjct: 484 NYFVMEMENGLILDGN-KGSEARFINHSCDPNCEVKMTRVGQVSRLGVYAGPAGIMTGQE 542
Query: 226 LTYDYQYEFLHDSLIA-YC 243
LTYDY ++ D A YC
Sbjct: 543 LTYDYNFQNFSDHRQACYC 561
>gi|409047697|gb|EKM57176.1| hypothetical protein PHACADRAFT_142398 [Phanerochaete carnosa
HHB-10118-sp]
Length = 1389
Score = 102 bits (253), Expect = 2e-19, Method: Composition-based stats.
Identities = 51/117 (43%), Positives = 69/117 (58%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A E I RGE VIEYVGE+I Q ++R + G + YL I+ D+V+DAT KGN
Sbjct: 1263 GLYAMEKISRGEMVIEYVGEIIRAQVADKREKAYERQGIGSSYLFRIDEDLVVDATKKGN 1322
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
R INHSC PN + I+ E +I I+A +DI+ G +TYDY + D + C
Sbjct: 1323 LGRLINHSCDPNCTAKIITINSEKKIVIYAKQDIELGSEITYDYHFPIEQDKIPCLC 1379
>gi|367021898|ref|XP_003660234.1| hypothetical protein MYCTH_2298281 [Myceliophthora thermophila ATCC
42464]
gi|347007501|gb|AEO54989.1| hypothetical protein MYCTH_2298281 [Myceliophthora thermophila ATCC
42464]
Length = 942
Score = 102 bits (253), Expect = 2e-19, Method: Composition-based stats.
Identities = 64/213 (30%), Positives = 106/213 (49%), Gaps = 23/213 (10%)
Query: 38 KPKAIPYVFIKRNIYLTK-----RIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSS 92
+PK Y + +N ++ + +ED C C G C DC ++L
Sbjct: 422 QPKPAAYRTMTKNRFVGDAASYWKKTPHIEDFASKCVCKPEDG----CGEDCQNRIMLYE 477
Query: 93 C-SSGCKCGNS-CLNKPFQNRPVKKMK---------LVQTEKCGAGIVADEDIKRGEFVI 141
C + C G C N+ FQ+ + K + +T+ G G+ ++ + + ++
Sbjct: 478 CDETNCGIGKEYCQNRAFQDLQERTKKGGRYRIGVEVFKTKDRGYGVRSNRCFEPNQIIM 537
Query: 142 EYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM 201
EY GE+I CE R+ ++ +YL +++M+IDAT G+ +R++NHSC PN M
Sbjct: 538 EYTGEIITVAECERRM-NEEYKDNECYYLMSFDQNMIIDAT-TGSIARFVNHSCSPNCRM 595
Query: 202 QKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
KWI+ G+ R+ +FA R I GE LTYDY ++
Sbjct: 596 IKWIVAGQPRMALFAGDRPIMTGEELTYDYNFD 628
>gi|343429488|emb|CBQ73061.1| related to regulatory protein SET1 [Sporisorium reilianum SRZ2]
Length = 1453
Score = 102 bits (253), Expect = 2e-19, Method: Composition-based stats.
Identities = 49/120 (40%), Positives = 75/120 (62%)
Query: 114 KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
K++K ++ G+ A E I G+ VIEYVGEV+ Q +ER + + G + YL +
Sbjct: 1312 KQLKFAKSPIHDWGLYAMEFIPAGDMVIEYVGEVVRQQVADEREKQYERQGNFSTYLFRV 1371
Query: 174 NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
+ D+V+DAT+KGN +R +NH C PN + ++GE RI +FA I+ GE LTYDY+++
Sbjct: 1372 DDDLVVDATHKGNIARLMNHCCTPNCNAKILTLNGEKRIVLFAKSPIRAGEELTYDYKFQ 1431
>gi|134079644|emb|CAK97070.1| unnamed protein product [Aspergillus niger]
Length = 885
Score = 102 bits (253), Expect = 2e-19, Method: Composition-based stats.
Identities = 57/178 (32%), Positives = 92/178 (51%), Gaps = 28/178 (15%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNSCLNKPFQNRPVKK---------MK 117
C CT G CD +C + C G C G C N+ F+ + ++
Sbjct: 395 CMCTPETG----CDEECQNRYMFYECDEGNCGVGEECGNRSFEELKQRTKAGGKYNIGVE 450
Query: 118 LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKM-KHLGETNFYLCEINRD 176
+++T G G+ ++ + + ++EY GE+I CE+R+ + KH N +
Sbjct: 451 VIKTADRGYGVRSNRTFEPNQIIVEYTGEIITQTECEKRMRTIYKH-----------NEN 499
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
M+IDAT +G+ +R++NHSC PN M+KW + G+ R+ +FA R I GE LTYDY ++
Sbjct: 500 MIIDAT-RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDRGIMTGEELTYDYNFD 556
>gi|302505008|ref|XP_003014725.1| histone-lysine N-methyltransferase (Ash1), putative [Arthroderma
benhamiae CBS 112371]
gi|291178031|gb|EFE33822.1| histone-lysine N-methyltransferase (Ash1), putative [Arthroderma
benhamiae CBS 112371]
Length = 688
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 76/237 (32%), Positives = 124/237 (52%), Gaps = 33/237 (13%)
Query: 20 LLKQIGNPVEFELP-DWFI-----KPKAIPYVFIKRNIYLTKRI---KRRLEDDGIFCSC 70
LL + G P F LP D F +PK + I +NI++ K + + C C
Sbjct: 320 LLLEKGRP--FRLPYDIFSPLPRGQPKPDEWRKINKNIFIGDAAGIWKATMPTEQSTCLC 377
Query: 71 TASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNS-CLNKPFQN--RPVKK-------MKLV 119
T G C+ +C + C + CK G C N+ F + +K ++++
Sbjct: 378 TPEMG----CEENCQNRHMFYECDENNCKLGEDLCRNRNFSELRKRIKTGGKYNIGVEVI 433
Query: 120 QTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN--FYLCEINRDM 177
+TE G G+ ++ + + ++EY GE++ + R MK + + N FYL + ++DM
Sbjct: 434 KTESRGYGVRSNRTFEPNQIIVEYTGEILTQIEAQRR---MKTIYKKNECFYLMDFDQDM 490
Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYE 233
+IDAT +G+ +R++NHSC PN +M+KWI+ G+ RI +FA + I GE LTYDY +E
Sbjct: 491 IIDAT-RGSIARFVNHSCEPNCKMEKWIVAGKPRIALFAGDNGIMTGEELTYDYNFE 546
>gi|71015569|ref|XP_758824.1| hypothetical protein UM02677.1 [Ustilago maydis 521]
gi|74702458|sp|Q4PB36.1|SET1_USTMA RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|46098614|gb|EAK83847.1| hypothetical protein UM02677.1 [Ustilago maydis 521]
Length = 1468
Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats.
Identities = 49/120 (40%), Positives = 75/120 (62%)
Query: 114 KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
K++K ++ G+ A E I G+ VIEYVGEV+ Q +ER + + G + YL +
Sbjct: 1327 KQLKFAKSPIHDWGLYAMELIPAGDMVIEYVGEVVRQQVADEREKQYERQGNFSTYLFRV 1386
Query: 174 NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
+ D+V+DAT+KGN +R +NH C PN + ++GE RI +FA I+ GE LTYDY+++
Sbjct: 1387 DDDLVVDATHKGNIARLMNHCCTPNCNAKILTLNGEKRIVLFAKTAIRAGEELTYDYKFQ 1446
>gi|328714763|ref|XP_003245446.1| PREDICTED: hypothetical protein LOC100571907 [Acyrthosiphon pisum]
Length = 2204
Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats.
Identities = 56/134 (41%), Positives = 80/134 (59%), Gaps = 13/134 (9%)
Query: 114 KKMK--------LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCE--ERLWKMKHL 163
KKMK L +++ G G+ A D+++ VIEY+GE+I Q C+ E+L++ K+
Sbjct: 2056 KKMKQEWRNNVYLARSKIQGLGLYAARDLEKHTMVIEYIGEIIRSQLCDYREKLYEAKNR 2115
Query: 164 GETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKG 223
G Y+ ++ D V+DAT G +RYINHSC PN +K +D E RI IFA R I +G
Sbjct: 2116 G---IYMFRLDDDRVVDATISGGLARYINHSCNPNCVTEKVEVDRELRIIIFAKRRIARG 2172
Query: 224 ENLTYDYQYEFLHD 237
E L YDYQ++ D
Sbjct: 2173 EELAYDYQFDIEDD 2186
>gi|402084863|gb|EJT79881.1| hypothetical protein GGTG_04964 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 1014
Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats.
Identities = 60/200 (30%), Positives = 105/200 (52%), Gaps = 28/200 (14%)
Query: 46 FIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS-C 103
+ K+N Y + + C CT + G C C +L C+ G C G C
Sbjct: 460 YWKKNTYYDRSV----------CVCTPADG----CGEHCLNRSVLYECNEGNCNVGRELC 505
Query: 104 LNKPFQNRPVKK---------MKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCE 154
N+ FQ+ + +++ T G G+ A+ + G+ ++EY GE+I ++ C+
Sbjct: 506 KNRAFQDLQDRTKTGGSYRVGVEVYHTGDRGFGVRANRCFEPGQIIMEYAGEIITEEECD 565
Query: 155 ERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGI 214
R+ ++ + + +YL +++M++DAT G+ +R++NHSC PN M KWI+ G R+ +
Sbjct: 566 RRMNEV-YKDKQCYYLMSFDQNMILDAT-TGSIARFVNHSCSPNCRMIKWIVSGVPRMAL 623
Query: 215 FA-TRDIKKGENLTYDYQYE 233
FA R I+ G+ LTYDY ++
Sbjct: 624 FAGDRQIQTGDELTYDYNFD 643
>gi|358385026|gb|EHK22623.1| hypothetical protein TRIVIDRAFT_212935 [Trichoderma virens Gv29-8]
Length = 738
Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats.
Identities = 56/178 (31%), Positives = 96/178 (53%), Gaps = 18/178 (10%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQNRPVKK---------M 116
C C G C C ++L C + C G + C N+ F ++ +
Sbjct: 396 CVCKPEDG----CAETCQNRIMLYECDDTNCNIGRAHCTNRAFAELTARRNRGGKYRVGV 451
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
++++T G G+ ++ + + ++EY GE+I ++ CE R+ ++ E +YL +++
Sbjct: 452 EVIKTSDRGYGVRSNRCFEPHQIIMEYAGEIITEEECERRMNEIYKNNEC-YYLMSFDQN 510
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
M+IDAT G+ +R++NHSC PN M KWI+ G+ R+ +FA R I GE LTYDY ++
Sbjct: 511 MIIDAT-TGSIARFVNHSCNPNCRMIKWIVSGQPRMALFAGDRPIMTGEELTYDYNFD 567
>gi|315047188|ref|XP_003172969.1| SUV39H [Arthroderma gypseum CBS 118893]
gi|311343355|gb|EFR02558.1| SUV39H [Arthroderma gypseum CBS 118893]
Length = 703
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 76/247 (30%), Positives = 128/247 (51%), Gaps = 33/247 (13%)
Query: 20 LLKQIGNPVEFELP-DWFI-----KPKAIPYVFIKRNIYLTKRI---KRRLEDDGIFCSC 70
LL + G P F LP D F +PK + I +NI++ + + + C C
Sbjct: 318 LLLERGRP--FRLPYDIFSPLPRGQPKPDEWRKINKNIFIGDAAGIWRATMPTEQSTCLC 375
Query: 71 TASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNS-CLNKPF-----QNRPVKK----MKLV 119
T G C+ +C + C + CK G C N+ F +N+ K ++++
Sbjct: 376 TPEMG----CEENCQNRHMFYECDENNCKLGEDICRNRNFSELRRRNKTGGKYNIGVEVI 431
Query: 120 QTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN--FYLCEINRDM 177
+TE G G+ ++ + + ++EY GE++ + R MK + + N FYL + ++DM
Sbjct: 432 KTESRGYGVRSNRTFEPNQIIVEYTGEILTQIEAQRR---MKTIYKKNECFYLMDFDQDM 488
Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEFLH 236
+IDAT +G+ +R++NHSC PN +M+KWI+ G+ RI +FA + I GE LTYDY ++
Sbjct: 489 IIDAT-RGSIARFVNHSCEPNCKMEKWIVAGKPRIALFAGDNGIMTGEELTYDYNFDPYS 547
Query: 237 DSLIAYC 243
+ + C
Sbjct: 548 NKNVQEC 554
>gi|84997445|ref|XP_953444.1| hypothetical protein [Theileria annulata strain Ankara]
gi|65304440|emb|CAI76819.1| hypothetical protein, conserved [Theileria annulata]
Length = 1083
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 63/160 (39%), Positives = 89/160 (55%), Gaps = 7/160 (4%)
Query: 80 CDRDCHCGMLLSSCS-SGCKCGN-SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRG 137
C DC M + C+ C + +C N+ F N K+KL + G G VA EDI G
Sbjct: 732 CGSDCSNVMKNTECTVKNCNLMDENCGNRRFLNFTGPKLKLNYVDGKGVGTVATEDINEG 791
Query: 138 EFVIEYVGEVI---DDQTC--EERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYIN 192
E V EYVGEVI D Q C ++ ++++Y+ +I RD ID+T+ GN +R+IN
Sbjct: 792 ELVCEYVGEVISQADFQRCLASASFAEIDDGNQSHWYVMKIQRDTYIDSTHLGNVARFIN 851
Query: 193 HSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
HSC PN + G R+G+FA R IK+GE +TY+Y +
Sbjct: 852 HSCDPNCASVPINVRGTYRMGVFAQRKIKQGEEVTYNYGF 891
>gi|302664384|ref|XP_003023822.1| histone-lysine N-methyltransferase (Ash1), putative [Trichophyton
verrucosum HKI 0517]
gi|291187840|gb|EFE43204.1| histone-lysine N-methyltransferase (Ash1), putative [Trichophyton
verrucosum HKI 0517]
Length = 709
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 76/247 (30%), Positives = 127/247 (51%), Gaps = 33/247 (13%)
Query: 20 LLKQIGNPVEFELP-DWFI-----KPKAIPYVFIKRNIYLTKRI---KRRLEDDGIFCSC 70
LL + G P F LP D F +PK + I +NI++ K + + C C
Sbjct: 316 LLLEKGRP--FRLPYDIFSPLPRGQPKPDEWRKINKNIFIGDAAGIWKATMPTEQSTCLC 373
Query: 71 TASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNS-CLNKPFQN--RPVKK-------MKLV 119
T G C+ +C + C + CK G C N+ F + +K ++++
Sbjct: 374 TPEMG----CEENCQNRHMFYECDENNCKLGEDLCRNRNFSELRKRIKTGGKYNIGVEVI 429
Query: 120 QTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN--FYLCEINRDM 177
+TE G G+ ++ + + ++EY GE++ + R MK + + N FYL + ++DM
Sbjct: 430 KTESRGYGVRSNRTFEPNQIIVEYTGEILTQIEAQRR---MKTIYKKNECFYLMDFDQDM 486
Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEFLH 236
+IDAT +G+ +R++NHSC PN +M+KWI+ G+ RI +FA + I GE LTYDY ++
Sbjct: 487 IIDAT-RGSIARFVNHSCEPNCKMEKWIVAGKPRIALFAGDNGIMTGEELTYDYNFDPYS 545
Query: 237 DSLIAYC 243
+ + C
Sbjct: 546 NKNVQEC 552
>gi|224084157|ref|XP_002307227.1| SET domain protein [Populus trichocarpa]
gi|222856676|gb|EEE94223.1| SET domain protein [Populus trichocarpa]
Length = 196
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 57/164 (34%), Positives = 86/164 (52%), Gaps = 15/164 (9%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
G L+ C+S C C +C N+ QN K+++ +T+ G + A E I RG F+ EY GE
Sbjct: 15 GYLVYECNSMCNCNKTCPNRVLQNGIRVKLEVFKTDNKGWAVRAGEPILRGTFICEYTGE 74
Query: 147 VIDDQTCEER-----------LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSC 195
++++Q R ++K+ + E IDAT GN SR+INHSC
Sbjct: 75 ILNEQEASNRRDRYGKEVCSYMYKIDAHTNDMSRMVEGQAHYFIDATKYGNVSRFINHSC 134
Query: 196 CPNTEMQKWIIDG----ETRIGIFATRDIKKGENLTYDYQYEFL 235
PN + ++D IG++A++DI GE LTY+Y+YE L
Sbjct: 135 MPNLVNHQVLVDSMDSQRAHIGLYASQDIAFGEELTYNYRYELL 178
>gi|302809376|ref|XP_002986381.1| hypothetical protein SELMODRAFT_123930 [Selaginella moellendorffii]
gi|300145917|gb|EFJ12590.1| hypothetical protein SELMODRAFT_123930 [Selaginella moellendorffii]
Length = 206
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 57/170 (33%), Positives = 89/170 (52%), Gaps = 21/170 (12%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
G L+ C++ C C C N+ Q K+++ +T G + A ++I RG FV EY+GE
Sbjct: 18 GYLVYECNTSCMCSEECPNRVLQRGVKVKLEVFKTRHKGWAVRAAQNISRGTFVCEYLGE 77
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEIN-----------------RDMVIDATYKGNKSR 189
V++DQ R + +G + Y +++ + VIDAT GN +R
Sbjct: 78 VLNDQEANRRGERYDQVGCSYLYDIDVHLNTGGRSRRGPSRVPRIKPFVIDATKHGNVAR 137
Query: 190 YINHSCCPNTEMQKWIIDGE----TRIGIFATRDIKKGENLTYDYQYEFL 235
+INHSC PN + +++ IG+FA+RDI GE L+YDY+Y+ L
Sbjct: 138 FINHSCSPNLVNYQVLVESMDYQLAHIGLFASRDILCGEELSYDYRYKLL 187
>gi|25141373|ref|NP_491340.2| Protein MET-1, isoform a [Caenorhabditis elegans]
gi|351050564|emb|CCD65166.1| Protein MET-1, isoform a [Caenorhabditis elegans]
Length = 1604
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 66/197 (33%), Positives = 103/197 (52%), Gaps = 16/197 (8%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSC 103
+ I + YLT+ ++ + + + C C + G D C +L+ C S C+ C
Sbjct: 618 FELISESKYLTRNANKK-KTESLTCECHRTGG--NCSDNTCVNRAMLTECPSSCQV--KC 672
Query: 104 LNKPFQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
N+ F + ++ T G G+ A +DIK+G F+IEY+GEV++ E+R K K+
Sbjct: 673 KNQRFAKKKYAAVEAFHTGTAKGCGLRAVKDIKKGRFIIEYIGEVVERDDYEKR--KTKY 730
Query: 163 LGET---NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGE----TRIGIF 215
+ + YLC+ IDAT GN SR++NHSC PN +KW + R+G F
Sbjct: 731 AADKKHKHHYLCDTGV-YTIDATVYGNPSRFVNHSCDPNAICEKWSVPRTPGDVNRVGFF 789
Query: 216 ATRDIKKGENLTYDYQY 232
+ R IK GE +T+DYQ+
Sbjct: 790 SKRFIKAGEEITFDYQF 806
>gi|327306067|ref|XP_003237725.1| hypothetical protein TERG_02440 [Trichophyton rubrum CBS 118892]
gi|326460723|gb|EGD86176.1| hypothetical protein TERG_02440 [Trichophyton rubrum CBS 118892]
Length = 711
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 76/247 (30%), Positives = 127/247 (51%), Gaps = 33/247 (13%)
Query: 20 LLKQIGNPVEFELP-DWFI-----KPKAIPYVFIKRNIYLTKRI---KRRLEDDGIFCSC 70
LL + G P F LP D F +PK + I +NI++ K + + C C
Sbjct: 319 LLLEKGRP--FRLPYDIFSPLPRGQPKPDEWRKINKNIFIGDAAGIWKATMPTEQSTCLC 376
Query: 71 TASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNS-CLNKPFQN--RPVKK-------MKLV 119
T G C+ +C + C + CK G C N+ F + +K ++++
Sbjct: 377 TPEMG----CEENCQNRHMFYECDENNCKLGEDLCRNRNFSELRKRIKTGGKYNIGVEVI 432
Query: 120 QTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN--FYLCEINRDM 177
+TE G G+ ++ + + ++EY GE++ + R MK + + N FYL + ++DM
Sbjct: 433 KTESRGYGVRSNRTFEPNQIIVEYTGEILTQIEAQRR---MKTIYKKNECFYLMDFDQDM 489
Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEFLH 236
+IDAT +G+ +R++NHSC PN +M+KWI+ G+ RI +FA + I GE LTYDY ++
Sbjct: 490 IIDAT-RGSIARFVNHSCEPNCKMEKWIVAGKPRIALFAGDNGIMTGEELTYDYNFDPYS 548
Query: 237 DSLIAYC 243
+ + C
Sbjct: 549 NKNVQEC 555
>gi|260833262|ref|XP_002611576.1| hypothetical protein BRAFLDRAFT_117164 [Branchiostoma floridae]
gi|229296947|gb|EEN67586.1| hypothetical protein BRAFLDRAFT_117164 [Branchiostoma floridae]
Length = 734
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 55/147 (37%), Positives = 80/147 (54%), Gaps = 3/147 (2%)
Query: 88 MLLSSCSSG-CKCGNSCLNKPFQNRPVKK-MKLVQTEKCGAGIVADEDIKRGEFVIEYVG 145
M CS C + C N+ Q ++ + T+ G G+ + I +G F++EYVG
Sbjct: 57 MSFMECSPNTCPYADQCANQRIQRHEWDPGLERIVTKDRGYGVRSKTPIPQGNFILEYVG 116
Query: 146 EVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWI 205
EV+ +Q R ++ H + Y ++ VID G + R++NHSC PN EMQKW
Sbjct: 117 EVVSEQEFRRRTVEIYH-DHNHHYCLNLHSGAVIDGYKYGCEGRFVNHSCEPNCEMQKWS 175
Query: 206 IDGETRIGIFATRDIKKGENLTYDYQY 232
++G RIG+FA RDI GE LTYDY +
Sbjct: 176 VNGVYRIGLFALRDIPAGEELTYDYNF 202
>gi|407921620|gb|EKG14761.1| hypothetical protein MPH_08036 [Macrophomina phaseolina MS6]
Length = 1167
Score = 101 bits (251), Expect = 3e-19, Method: Composition-based stats.
Identities = 54/127 (42%), Positives = 75/127 (59%), Gaps = 3/127 (2%)
Query: 110 NRPVKKMKLVQTEKCGA---GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGET 166
N+ K+ KLV+ ++ G+ A EDI + +IEYVGE + + + R K G
Sbjct: 1018 NQLKKRKKLVKFDRSAIHNWGLYAQEDISANDMIIEYVGEKVRQKVADIREIKYDKQGVG 1077
Query: 167 NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENL 226
+ YL I+ D V+DAT KG +R+INHSC PN + +DG RI I+A RDIK E L
Sbjct: 1078 SSYLFRIDEDSVVDATKKGGIARFINHSCSPNCTAKIIRVDGTKRIVIYALRDIKTNEEL 1137
Query: 227 TYDYQYE 233
TYDY++E
Sbjct: 1138 TYDYKFE 1144
>gi|193202392|ref|NP_871842.2| Protein MET-1, isoform b [Caenorhabditis elegans]
gi|25395254|pir||B87754 protein C43E11.3 [imported] - Caenorhabditis elegans
gi|351050565|emb|CCD65167.1| Protein MET-1, isoform b [Caenorhabditis elegans]
Length = 1590
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 66/197 (33%), Positives = 103/197 (52%), Gaps = 16/197 (8%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSC 103
+ I + YLT+ ++ + + + C C + G D C +L+ C S C+ C
Sbjct: 604 FELISESKYLTRNANKK-KTESLTCECHRTGG--NCSDNTCVNRAMLTECPSSCQV--KC 658
Query: 104 LNKPFQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
N+ F + ++ T G G+ A +DIK+G F+IEY+GEV++ E+R K K+
Sbjct: 659 KNQRFAKKKYAAVEAFHTGTAKGCGLRAVKDIKKGRFIIEYIGEVVERDDYEKR--KTKY 716
Query: 163 LGET---NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGE----TRIGIF 215
+ + YLC+ IDAT GN SR++NHSC PN +KW + R+G F
Sbjct: 717 AADKKHKHHYLCDTGV-YTIDATVYGNPSRFVNHSCDPNAICEKWSVPRTPGDVNRVGFF 775
Query: 216 ATRDIKKGENLTYDYQY 232
+ R IK GE +T+DYQ+
Sbjct: 776 SKRFIKAGEEITFDYQF 792
>gi|346320473|gb|EGX90073.1| histone-lysine N-methyltransferase (Ash1), putative [Cordyceps
militaris CM01]
Length = 962
Score = 101 bits (251), Expect = 3e-19, Method: Composition-based stats.
Identities = 55/178 (30%), Positives = 95/178 (53%), Gaps = 18/178 (10%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCSS-GCKCGN-SCLNKPFQNRPVKKMK-------- 117
C CT G C +C ++L C+ C G +C N+ F ++ K
Sbjct: 546 CVCTPEDG----CGDNCQNRIMLYECNDINCNAGKATCTNRAFATLTARRAKGGKYRVGV 601
Query: 118 -LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+++T G G+ ++ + + ++EY GE+I + C+ R+ K ++ +YL +++
Sbjct: 602 EVIKTSDRGYGVRSNRCFRPHQIIMEYAGEIITEDECDRRM-KNEYKNNECYYLMSFDQN 660
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
M+IDAT G+ +R++NHSC PN M KWI+ G+ R+ +FA I G+ LTYDY ++
Sbjct: 661 MIIDAT-TGSIARFVNHSCNPNCRMIKWIVSGQPRMALFAGDSPIMTGDELTYDYNFD 717
>gi|302814007|ref|XP_002988688.1| hypothetical protein SELMODRAFT_128419 [Selaginella moellendorffii]
gi|300143509|gb|EFJ10199.1| hypothetical protein SELMODRAFT_128419 [Selaginella moellendorffii]
Length = 206
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 57/170 (33%), Positives = 89/170 (52%), Gaps = 21/170 (12%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
G L+ C++ C C C N+ Q K+++ +T G + A ++I RG FV EY+GE
Sbjct: 18 GYLVYECNTSCMCSEECPNRVLQRGVNVKLEVFKTRHKGWAVRAAQNISRGTFVCEYLGE 77
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEIN-----------------RDMVIDATYKGNKSR 189
V++DQ R + +G + Y +++ + VIDAT GN +R
Sbjct: 78 VLNDQEANRRGERYDQVGCSYLYDIDVHLNTGGRSRRGPSRVPRIKPFVIDATKHGNVAR 137
Query: 190 YINHSCCPNTEMQKWIIDGE----TRIGIFATRDIKKGENLTYDYQYEFL 235
+INHSC PN + +++ IG+FA+RDI GE L+YDY+Y+ L
Sbjct: 138 FINHSCSPNLVNYQVLVESMDYQLAHIGLFASRDILCGEELSYDYRYKLL 187
>gi|219111565|ref|XP_002177534.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217412069|gb|EEC51997.1| predicted protein, partial [Phaeodactylum tricornutum CCAP 1055/1]
Length = 144
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 51/119 (42%), Positives = 75/119 (63%), Gaps = 2/119 (1%)
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERL--WKMKHLGETNFYLCEIN 174
K ++ + G G+V + I +G+ V+EYVG VID + E+RL W+ H + NFY+ +
Sbjct: 6 KPLREQGKGWGLVPCDKIGKGDLVLEYVGNVIDAKEKEDRLSEWERDHPNDPNFYIMSLR 65
Query: 175 RDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
IDA +K N SR+INHSC PN + + ++G R GIFA RDI+ GE L+YDY ++
Sbjct: 66 DQWYIDARHKANLSRFINHSCAPNCFLTQINVNGYARNGIFAKRDIQAGEFLSYDYHFD 124
>gi|326476825|gb|EGE00835.1| histone-lysine N-methyltransferase [Trichophyton tonsurans CBS
112818]
Length = 628
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 76/247 (30%), Positives = 127/247 (51%), Gaps = 33/247 (13%)
Query: 20 LLKQIGNPVEFELP-DWFI-----KPKAIPYVFIKRNIYLTKRI---KRRLEDDGIFCSC 70
LL + G P F LP D F +PK + I +NI++ K + + C C
Sbjct: 235 LLLEKGRP--FRLPYDIFSPLPRGQPKPDEWRKINKNIFIGDAAGIWKATMPTEQSTCLC 292
Query: 71 TASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNS-CLNKPFQ--NRPVKK-------MKLV 119
T G C+ +C + C + CK G C N+ F + +K ++++
Sbjct: 293 TPEMG----CEENCQNRHMFYECDENNCKLGEDLCRNRNFSELRKRIKTGGKYNIGVEVI 348
Query: 120 QTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETN--FYLCEINRDM 177
+TE G G+ ++ + + ++EY GE++ + R MK + + N FYL + ++DM
Sbjct: 349 KTESRGYGVRSNRTFEPNQIIVEYTGEILTQIEAQRR---MKTIYKKNECFYLMDFDQDM 405
Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEFLH 236
+IDAT +G+ +R++NHSC PN +M+KWI+ G+ RI +FA + I GE LTYDY ++
Sbjct: 406 IIDAT-RGSIARFVNHSCEPNCKMEKWIVAGKPRIALFAGDNGIMTGEELTYDYNFDPYS 464
Query: 237 DSLIAYC 243
+ + C
Sbjct: 465 NKNVQEC 471
>gi|356561269|ref|XP_003548905.1| PREDICTED: histone-lysine N-methyltransferase SUVR5-like [Glycine
max]
Length = 1496
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 61/164 (37%), Positives = 86/164 (52%), Gaps = 15/164 (9%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
G L+ C+ CKC +C N+ QN K+++ +TEK G + A E I RG FV EY+GE
Sbjct: 1314 GYLVYECNQMCKCYKTCPNRILQNGLRVKLEVFKTEKKGWALRAGEAILRGTFVCEYIGE 1373
Query: 147 VIDDQTCEERLWKMKHLGETNFY-----------LCEINRDMVIDATYKGNKSRYINHSC 195
V+D + + R + + FY L E VID T GN SR+IN+SC
Sbjct: 1374 VLDTREAQNRRKRYGKEHCSYFYDVDDHVNDMSRLIEGQAHYVIDTTRFGNVSRFINNSC 1433
Query: 196 CPNTEMQKWIIDG----ETRIGIFATRDIKKGENLTYDYQYEFL 235
PN + +++ IG++A RDI GE LTY+Y YE +
Sbjct: 1434 SPNLVSYQVLVESMDCERAHIGLYANRDIALGEELTYNYHYELV 1477
>gi|302921594|ref|XP_003053314.1| hypothetical protein NECHADRAFT_123121 [Nectria haematococca mpVI
77-13-4]
gi|256734254|gb|EEU47601.1| hypothetical protein NECHADRAFT_123121 [Nectria haematococca mpVI
77-13-4]
Length = 776
Score = 100 bits (250), Expect = 4e-19, Method: Composition-based stats.
Identities = 55/178 (30%), Positives = 94/178 (52%), Gaps = 18/178 (10%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCGNS-CLNKPFQNRPVKK---------M 116
C C G C C ++L C C G + C N+ F ++ +
Sbjct: 405 CVCKPEDG----CGESCQNRIMLYECDEQNCNVGKAHCTNRAFATLTARRNQGGKYRVGV 460
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
++++T G G+ ++ + + ++EY GE+I + CE R+ ++ E +YL +++
Sbjct: 461 EVIKTSDRGYGVRSNRCFRPNQIIMEYAGEIITEAECERRMTEVYKDNEC-YYLMSFDQN 519
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
M+IDAT G+ +R++NHSC PN M KWI+ G+ R+ +FA + I GE LTYDY ++
Sbjct: 520 MIIDAT-TGSIARFVNHSCNPNCRMIKWIVSGQPRMALFAGDKPITTGEELTYDYNFD 576
>gi|443896289|dbj|GAC73633.1| putative transcription factor ASH1/LIN-59 [Pseudozyma antarctica
T-34]
Length = 1319
Score = 100 bits (250), Expect = 5e-19, Method: Composition-based stats.
Identities = 71/205 (34%), Positives = 105/205 (51%), Gaps = 18/205 (8%)
Query: 38 KPKAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SG 96
K K Y I +N Y ++ +L+ + C+C PGS C DC ML C
Sbjct: 600 KRKPPRYQQISKNKYYSRP---KLQGEVPLCNC--KPGSG--CGSDCINRMLQFICDPRT 652
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C GNSC N R K + + G G+ E IK+ +F+ EY GEVI+ +R
Sbjct: 653 CPNGNSCTNISLGRRTGIKTAVAYYGRRGFGLKTLEAIKQHDFIDEYRGEVINLSEAAKR 712
Query: 157 LWKMKHLGETNFYLCEINRDM--VIDATYKGNKSRYINHSCCPNTEMQKWIIDG------ 208
+ + ++ N+YL + + ++D KGN +R+ NHSC PN ++K+II G
Sbjct: 713 VTE-EYKATGNYYLLDYDSAAGELLDGGRKGNITRFANHSCDPNCRIEKFIICGTDEALS 771
Query: 209 -ETRIGIFATRDIKKGENLTYDYQY 232
E +IG+FA RDI+ GE LTY+Y +
Sbjct: 772 AEFQIGLFANRDIEAGEELTYNYGW 796
>gi|261193681|ref|XP_002623246.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
SLH14081]
gi|239588851|gb|EEQ71494.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
SLH14081]
Length = 776
Score = 100 bits (249), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 61/188 (32%), Positives = 98/188 (52%), Gaps = 24/188 (12%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQN-RPVKKM-------- 116
C CT G CD +C + C + CK G C N+ F+ R K+
Sbjct: 383 CMCTPESG----CDENCQNRYMFYECDDNNCKLGAELCGNRNFEGLRQRSKLGGKYNIGV 438
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
++++T G G+ ++ + ++EY GE++ + CE R+ T +YL +R+
Sbjct: 439 EVIKTADRGYGVRSNRTFAPNQIIVEYTGEIVTQEECERRM-------RTCYYLMYFDRN 491
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYEFL 235
M+IDAT +G+ +R++NHSC PN +M+KW + G+ R+ +FA I GE LTYDY +E+
Sbjct: 492 MIIDAT-RGSIARFVNHSCEPNCKMEKWTVAGKPRMALFAGEHGIMTGEELTYDYNFEYS 550
Query: 236 HDSLIAYC 243
L C
Sbjct: 551 PLHLAPQC 558
>gi|388853505|emb|CCF52904.1| related to regulatory protein SET1 [Ustilago hordei]
Length = 1489
Score = 100 bits (249), Expect = 5e-19, Method: Composition-based stats.
Identities = 48/120 (40%), Positives = 74/120 (61%)
Query: 114 KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
K++K ++ G+ A E I G+ VIEYVGE++ Q + R + + G + YL +
Sbjct: 1348 KQLKFAKSPIHDWGLYAMEYIPAGDMVIEYVGEMVRQQVADNREKQYERQGNFSTYLFRV 1407
Query: 174 NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
+ D+V+DAT+KGN +R +NH C PN + ++GE RI +FA IK GE LTYDY+++
Sbjct: 1408 DDDLVVDATHKGNIARLMNHCCTPNCNAKILTVNGEKRIVLFAKSPIKAGEELTYDYKFQ 1467
>gi|225555248|gb|EEH03540.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 803
Score = 100 bits (249), Expect = 6e-19, Method: Composition-based stats.
Identities = 65/201 (32%), Positives = 104/201 (51%), Gaps = 21/201 (10%)
Query: 45 VFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS- 102
VF+ + K IK R C CT G CD +C + C + CK G
Sbjct: 355 VFVGDAACIWKAIKLRERST---CMCTPELG----CDENCQNRYMFYECDDNNCKLGAEL 407
Query: 103 CLNKPFQN-RPVKKM--------KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTC 153
C N+ F+ R KM ++++T G G+ ++ + ++EY GE+I + C
Sbjct: 408 CGNRSFEGLRQRIKMGGRYNIGVEVIKTADRGYGVRSNRTFAPNQIIVEYTGEIITQEEC 467
Query: 154 EERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIG 213
E R+ + E +YL +++M+IDAT +G+ +R++NHSC PN M+KW + G+ R+
Sbjct: 468 ERRMRTVYKDNEC-YYLMYFDQNMIIDAT-RGSIARFVNHSCEPNCRMEKWTVAGKPRMA 525
Query: 214 IFATRD-IKKGENLTYDYQYE 233
+FA + I GE LTYDY ++
Sbjct: 526 LFAGENGIMTGEELTYDYNFD 546
>gi|390605099|gb|EIN14490.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
SS5]
Length = 164
Score = 100 bits (249), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 53/117 (45%), Positives = 71/117 (60%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A E I RGE VIEYVGEVI Q ++R + G + YL I+ D+V+DAT KGN
Sbjct: 38 GLYAMERIARGEMVIEYVGEVIRAQVADKREKAYERQGIGSSYLFRIDEDLVVDATKKGN 97
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
R INHSC PN + I+GE +I I+A +DI+ G+ +TYDY + D + C
Sbjct: 98 LGRLINHSCDPNCTAKIITINGEKKIVIYAKQDIELGDEITYDYHFPIEQDKIPCLC 154
>gi|255562775|ref|XP_002522393.1| set domain protein, putative [Ricinus communis]
gi|223538471|gb|EEF40077.1| set domain protein, putative [Ricinus communis]
Length = 1516
Score = 100 bits (248), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 61/166 (36%), Positives = 91/166 (54%), Gaps = 19/166 (11%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
G L+ C+ C C +C N+ QN K+++ +T+ G + A E I G FV EY+GE
Sbjct: 1334 GYLVYECNQMCSCSKTCPNRVLQNGIRVKLEVYKTKNKGWAVRAGEPILSGTFVCEYIGE 1393
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEIN---RDM----------VIDATYKGNKSRYINH 193
V+D+ +R + ++ E+ Y+ +I+ DM VIDAT GN SR+INH
Sbjct: 1394 VLDEVEANQR--RGRYSEESCSYMYDIDAHTNDMSRLMEGQVKYVIDATKHGNVSRFINH 1451
Query: 194 SCCPNTEMQKWIIDG----ETRIGIFATRDIKKGENLTYDYQYEFL 235
SC PN + II+ IG++A+RDI GE LTY+Y+Y +
Sbjct: 1452 SCLPNLVNHQVIINSMDAQRAHIGLYASRDIAFGEELTYNYRYNLV 1497
>gi|322694149|gb|EFY85986.1| histone-lysine N-methyltransferase (Ash1), putative [Metarhizium
acridum CQMa 102]
Length = 760
Score = 100 bits (248), Expect = 7e-19, Method: Composition-based stats.
Identities = 52/158 (32%), Positives = 90/158 (56%), Gaps = 14/158 (8%)
Query: 88 MLLSSCSSG-CKCGNS-CLNKPFQNRPVKKMK---------LVQTEKCGAGIVADEDIKR 136
++L C +G C G C N+ F + ++ K +++T G G+ ++ K
Sbjct: 391 IMLYECDAGNCNIGKELCTNRSFSDLAARRSKGGKYRVGVEVIKTPDRGYGVRSNRCFKA 450
Query: 137 GEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCC 196
+ ++EY GE+I ++ CE R+ ++ +YL +++M+IDAT G+ +R++NHSC
Sbjct: 451 NQIIMEYTGEIITEEECERRM-NEEYKNNECYYLMSFDQNMIIDAT-TGSIARFVNHSCN 508
Query: 197 PNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYE 233
PN M KWI+ G+ R+ +FA R I G+ LTYDY ++
Sbjct: 509 PNCRMIKWIVSGQPRMALFAGDRPIMTGDELTYDYNFD 546
>gi|168031194|ref|XP_001768106.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680544|gb|EDQ66979.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 740
Score = 100 bits (248), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 91/177 (51%), Gaps = 29/177 (16%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
G ++ C+S C+C +SC N+ Q K+++ ++ G G+ A E I RG FV EY+GE
Sbjct: 546 GYMVYECNSSCQCKDSCRNRVLQKGVRLKLEVFKSRHKGWGVRAAEPISRGTFVCEYIGE 605
Query: 147 VIDDQTCEERLWKMKHLGETNFYLC------------------------EINRDMVIDAT 182
V++D+ ER + H T+ ++C +++ VIDAT
Sbjct: 606 VLNDKEANERGKREPH-EYTDTFVCSRYDQVGCSYLYNIDAHLDVIGSKSVSKPFVIDAT 664
Query: 183 YKGNKSRYINHSCCPNTEMQKWIIDGE----TRIGIFATRDIKKGENLTYDYQYEFL 235
GN +R+INHSC PN + +++ IG FA RDI GE L YDY+Y+ L
Sbjct: 665 KYGNVARFINHSCEPNLINYEVLVESMDCQLAHIGFFANRDIAIGEELAYDYRYKLL 721
>gi|189202688|ref|XP_001937680.1| SET domain containing protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187984779|gb|EDU50267.1| SET domain containing protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 646
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 65/196 (33%), Positives = 97/196 (49%), Gaps = 17/196 (8%)
Query: 59 RRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSC-SSGCKCG-NSCLNKPFQ------- 109
+R + D C C G C CH ++ C ++ C C N+PF
Sbjct: 334 KRDKQDSSQCYCDVEDG----CGEACHNRIMAYECDNTNCPLPPEQCNNRPFAELKRRSK 389
Query: 110 -NRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNF 168
NR ++++ TE G G+ A + + ++EY GE+I CE R+ K + + +
Sbjct: 390 GNRYDYGVEVLDTEDRGYGVRAMRTFEPHQIIVEYAGEIITQSECERRM-KQVYKKDKCY 448
Query: 169 YLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLT 227
YL + M+IDAT +G +R++NHSC PN EM KW + GE R+ +FA R I GE LT
Sbjct: 449 YLMSFDNKMIIDAT-RGTIARFVNHSCEPNCEMIKWTVGGEPRMALFAGPRGIMTGEELT 507
Query: 228 YDYQYEFLHDSLIAYC 243
YDY ++ I C
Sbjct: 508 YDYNFDPFSQKNIQEC 523
>gi|84310015|emb|CAJ18337.1| putative H3K9 histone methyltransferase [Araneus diadematus]
Length = 467
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 56/152 (36%), Positives = 83/152 (54%), Gaps = 7/152 (4%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEK-CGAGIVADEDIKRGEFVIEYVG 145
G + C+ CKC NSC+N+ Q+ P K+ + +T CG G+ E ++RG+FV+EY+G
Sbjct: 284 GYPIYECNRRCKCDNSCINRVVQHGPKVKVAIFRTTNGCGWGLKTLELVQRGQFVLEYLG 343
Query: 146 EVIDDQTCEERLWKMKHLGETNFYLCEINRD--MVIDATYKGNKSRYINHSCCPNTEMQK 203
E+I + EER HLG T + + +D +D+ GN S +INHSC PN
Sbjct: 344 EIITSEHAEERGEVYDHLGRTYLFDMDWEKDCKYTVDSMLFGNASHFINHSCDPNLATYT 403
Query: 204 WIIDGE----TRIGIFATRDIKKGENLTYDYQ 231
I+ + RI FA + I E LT+DY+
Sbjct: 404 VWINQQDPMLPRIAFFAKKKINPDEELTFDYK 435
>gi|339253208|ref|XP_003371827.1| histone-lysine N-methyltransferase, H3 lysine-36 specific
[Trichinella spiralis]
gi|316967862|gb|EFV52228.1| histone-lysine N-methyltransferase, H3 lysine-36 specific
[Trichinella spiralis]
Length = 187
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 68/108 (62%)
Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
G G+ A +DI + EF++E+VGEVI + R + T+ Y ++ +VIDAT
Sbjct: 76 GIGLKARQDILKDEFIVEFVGEVITSKENCRRNKQQMRWANTHQYPMQLTSGLVIDATSF 135
Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
GN +R+INHSC PN +KW+++G+ R+ I A RDI KGE LT DY Y
Sbjct: 136 GNAARFINHSCDPNCRAEKWMVEGKHRLAIVANRDISKGEELTLDYGY 183
>gi|224094656|ref|XP_002310194.1| SET domain protein [Populus trichocarpa]
gi|222853097|gb|EEE90644.1| SET domain protein [Populus trichocarpa]
Length = 174
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 56/154 (36%), Positives = 82/154 (53%), Gaps = 15/154 (9%)
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C C +C N+ QN K+++ +T+ G + A E I RG F+ EY+GEV+D+Q +R
Sbjct: 2 CSCNKTCPNRVLQNGIRVKLEVFKTDNKGWAVRAGEPILRGTFICEYIGEVLDEQEANDR 61
Query: 157 LWKMKHLGETNFY-----------LCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWI 205
+ G + Y + E IDAT GN SR+INHSC PN + +
Sbjct: 62 RDRYGKEGCSYMYKIDAHTNDMSRMVEGQSHYFIDATKYGNVSRFINHSCMPNLANHQVL 121
Query: 206 IDG----ETRIGIFATRDIKKGENLTYDYQYEFL 235
++ IG++A+RDI GE LTY+Y+YE L
Sbjct: 122 VNSMDSQRAHIGLYASRDISFGEELTYNYRYELL 155
>gi|392590566|gb|EIW79895.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 160
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 55/142 (38%), Positives = 79/142 (55%), Gaps = 2/142 (1%)
Query: 104 LNKPFQNRPVKK--MKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMK 161
LN F +K ++ ++ G+ A E + RGE VIEYVGEV+ Q ++R +
Sbjct: 9 LNVKFNQLQTRKKHLRFARSPIHDWGLYAMERVSRGEMVIEYVGEVVRAQVADKREKAYE 68
Query: 162 HLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIK 221
G + YL I+ D+V+DAT KGN R INHSC PN + I GE +I I+A +DI+
Sbjct: 69 RQGIGSSYLFRIDEDLVVDATKKGNLGRLINHSCDPNCTARIITISGEKKIVIYAKQDIE 128
Query: 222 KGENLTYDYQYEFLHDSLIAYC 243
G+ +TYDY + D + C
Sbjct: 129 LGDEITYDYHFPIEQDKIPCLC 150
>gi|341874300|gb|EGT30235.1| hypothetical protein CAEBREN_24272 [Caenorhabditis brenneri]
Length = 249
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 66/194 (34%), Positives = 98/194 (50%), Gaps = 12/194 (6%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSC 103
Y I + I T K++ DD C C S D C L C CK N C
Sbjct: 43 YTHIDKCINRTGNNKQKRSDDPDECDCHKS--RKRCSDVTCLNRALKIECPKSCKARN-C 99
Query: 104 LNKPFQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
N+ + K+ + T G G+ A ++IK G+F+ E++GE+I + EE + K
Sbjct: 100 ANQRIAKQMYAKVHVQHTNSNKGFGLYAKKEIKAGQFITEFMGEIITE---EEEFRRRKA 156
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGET----RIGIFATR 218
Y+ E + +++ DAT GN++R+INHSC PNT ++WI+ G T RI FA
Sbjct: 157 KPGVYRYILE-SANLIFDATKYGNEARFINHSCAPNTRCERWIVPGRTSNIPRIAFFANT 215
Query: 219 DIKKGENLTYDYQY 232
IK G+ LT+DY++
Sbjct: 216 TIKAGQELTFDYKF 229
>gi|398388287|ref|XP_003847605.1| hypothetical protein MYCGRDRAFT_97419 [Zymoseptoria tritici IPO323]
gi|339467478|gb|EGP82581.1| hypothetical protein MYCGRDRAFT_97419 [Zymoseptoria tritici IPO323]
Length = 1069
Score = 99.0 bits (245), Expect = 1e-18, Method: Composition-based stats.
Identities = 63/190 (33%), Positives = 100/190 (52%), Gaps = 17/190 (8%)
Query: 68 CSCTASPGSSGV-CDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQNRPVKKMK------- 117
C C PG + CD DC ++ C+ C+ S C N+ F + K
Sbjct: 661 CICR-DPGQGDLGCDYDCINRVMQYECNDQNCRLPASVCSNRAFAQLAARTKKGGLFDVG 719
Query: 118 --LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINR 175
+V+T + G GI + G+ ++EY GE+I + C+ R+ ++ H + +YL E+ R
Sbjct: 720 VEVVKTHERGFGIRSCRSFNAGQIIMEYTGEIISEGECQRRMREVYH-DKACYYLMELER 778
Query: 176 DMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYE- 233
++VID T KG+ +R+INHSC PN E++ ++G R+ +FA I GE LTYDY ++
Sbjct: 779 NLVIDGT-KGSMARFINHSCEPNCEVRMVKVNGTPRMAVFAGESGIMTGEELTYDYNFDN 837
Query: 234 FLHDSLIAYC 243
F + YC
Sbjct: 838 FGESAQKCYC 847
>gi|400595512|gb|EJP63307.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 824
Score = 99.0 bits (245), Expect = 1e-18, Method: Composition-based stats.
Identities = 54/178 (30%), Positives = 95/178 (53%), Gaps = 18/178 (10%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCSS-GCKCGN-SCLNKPFQNRPVKKMK-------- 117
C C G C +C ++L C+ C G +C N+ F ++ K
Sbjct: 416 CVCKPEDG----CGDNCQNRIMLYECNDINCNAGKETCTNRAFATLTARRAKGGKYRVGV 471
Query: 118 -LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+++T G G+ ++ + + ++EY GE+I ++ CE R+ ++ +YL +++
Sbjct: 472 EVIKTSDRGYGVRSNRCFRPHQIIMEYAGEIITEEECERRM-NNEYKNNECYYLMSFDQN 530
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYE 233
M+IDAT G+ +R++NHSC PN M KWI+ G+ R+ +FA + I G+ LTYDY ++
Sbjct: 531 MIIDAT-TGSIARFVNHSCNPNCRMIKWIVSGQPRMALFAGDNPIMTGDELTYDYNFD 587
>gi|328875267|gb|EGG23632.1| hypothetical protein DFA_05766 [Dictyostelium fasciculatum]
Length = 1603
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 47/106 (44%), Positives = 72/106 (67%), Gaps = 1/106 (0%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G A + I GEF+IEYVGE+I + +ER K ++ G + ++ I +D+++DAT+KGN
Sbjct: 1478 GAFAKKSIVAGEFIIEYVGEIIRQKIADERERKYQNDG-VDCFMFAIEKDVIVDATFKGN 1536
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
++R+ NHSC PN + + IDG +I I A++DI K E +TYDYQ+
Sbjct: 1537 RARFANHSCEPNAKTKIISIDGVKKIIIVASKDIAKNEEITYDYQF 1582
>gi|389746109|gb|EIM87289.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 191
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 53/117 (45%), Positives = 70/117 (59%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A E I RGE VIEYVGEVI Q ++R + G + YL I+ D+V+DAT KGN
Sbjct: 65 GLYAMERIARGEMVIEYVGEVIRAQIADKREKAYERQGIGSSYLFRIDEDLVVDATKKGN 124
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
R INHSC PN + I GE +I I+A +DI+ G+ +TYDY + D + C
Sbjct: 125 LGRLINHSCDPNCTAKIITILGEKKIVIYAKQDIELGDEITYDYHFPIEQDKIPCLC 181
>gi|213402529|ref|XP_002172037.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
yFS275]
gi|212000084|gb|EEB05744.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
yFS275]
Length = 977
Score = 99.0 bits (245), Expect = 2e-18, Method: Composition-based stats.
Identities = 48/117 (41%), Positives = 73/117 (62%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A E+I + + VIEYVGE++ + + R K G + YL I++D ++DAT KGN
Sbjct: 851 GLFAMENIDKNDMVIEYVGEIVRQRVADTRERKYVREGIGDSYLFRIDKDAIVDATKKGN 910
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
+R+INHSC PN + ++G +I I+A RDI++GE LTYDY++ D + C
Sbjct: 911 IARFINHSCAPNCIAKIIRVEGHQKIVIYADRDIEEGEELTYDYKFPEEVDKIPCLC 967
>gi|403176252|ref|XP_003334956.2| hypothetical protein PGTG_16563 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375172169|gb|EFP90537.2| hypothetical protein PGTG_16563 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 765
Score = 99.0 bits (245), Expect = 2e-18, Method: Composition-based stats.
Identities = 74/228 (32%), Positives = 113/228 (49%), Gaps = 28/228 (12%)
Query: 25 GNPVEFELPDWFIKPKAIPYVFIKRNIYLTKRIKRRLED-DGIFCSCTASPGSSGVCDRD 83
G+P + +W +P A Y+ IK N+Y + K + D C+CT G C
Sbjct: 431 GDP-NLDAENW--EPPA--YIKIKSNVYPFR--KPEISDFPPAICNCT-----DGGCGEK 478
Query: 84 CHCGMLLSSCSSG-CKCGNSCLNKPFQNRPV----------KKMKLVQTEKC-GAGIVAD 131
C + C C+ G +C N PF R K +K+ T G G+ +
Sbjct: 479 CLNRAMFYLCDPKLCQLGQACKNLPFNQRKDAIDESSPRLGKGLKVFYTGPIKGWGLKTE 538
Query: 132 EDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYI 191
+K+ F+I+Y GE+I C +R+ ++ G +FYL + + VIDA KGN +R+I
Sbjct: 539 IKLKKDRFLIDYRGEIISRDACYKRVLN-EYSGCKHFYLMDYDGSDVIDAGLKGNCARFI 597
Query: 192 NHSCCPNTEMQKWIIDG--ETRIGIFATRDIKKGENLTYDYQYEFLHD 237
NHSC PN ++++ + G E + GIF RDI+ GE L+YDY + D
Sbjct: 598 NHSCTPNLRVERFKLSGLEEYQFGIFTLRDIEIGEELSYDYGWRNFSD 645
>gi|225685245|gb|EEH23529.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 756
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 59/188 (31%), Positives = 101/188 (53%), Gaps = 18/188 (9%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCSS-GCKCGNS-CLNKPFQN--RPVKK-------M 116
C CT G CD +C + C CK G C N+ F++ + +K +
Sbjct: 312 CMCTPESG----CDENCQNRCMFYECDDDNCKLGAELCRNRSFEDLRQRIKAGGKYNIGV 367
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
++++T G G+ ++ + ++EY GE+I + CE R+ + E +YL +++
Sbjct: 368 EVIKTADRGYGVRSNRTFAPNQIIVEYTGEIITQKECERRMRTVYKNNEC-YYLMYFDQN 426
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEFL 235
M+IDAT +G+ +R++NHSC PN EM+KW + G+ R+ +FA ++ I GE LTYDY ++
Sbjct: 427 MIIDAT-RGSIARFVNHSCEPNCEMEKWTVAGKPRMALFAGKNGITTGEELTYDYNFDPY 485
Query: 236 HDSLIAYC 243
+ C
Sbjct: 486 SQKNVQEC 493
>gi|239613825|gb|EEQ90812.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ER-3]
Length = 776
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/188 (31%), Positives = 98/188 (52%), Gaps = 24/188 (12%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQN-RPVKKM-------- 116
C CT G CD +C + C + CK G C N+ F+ R K+
Sbjct: 383 CMCTPESG----CDENCQNRYMFYECDDNNCKLGAELCGNRNFEGLRQRSKLGGKYNIGV 438
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
++++T G G+ ++ + ++EY GE++ + CE R+ T +YL +R+
Sbjct: 439 EVIKTADRGYGVRSNRTFAPNQIIVEYTGEIVTQEECERRM-------RTCYYLMYFDRN 491
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYEFL 235
M+IDAT +G+ +R++NHSC PN +++KW + G+ R+ +FA I GE LTYDY +E+
Sbjct: 492 MIIDAT-RGSIARFVNHSCEPNCKIEKWTVAGKPRMALFAGEHGIMTGEELTYDYNFEYS 550
Query: 236 HDSLIAYC 243
L C
Sbjct: 551 PLHLAPQC 558
>gi|328768995|gb|EGF79040.1| hypothetical protein BATDEDRAFT_35515 [Batrachochytrium dendrobatidis
JAM81]
Length = 1367
Score = 98.6 bits (244), Expect = 2e-18, Method: Composition-based stats.
Identities = 49/119 (41%), Positives = 72/119 (60%), Gaps = 4/119 (3%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCE--ERLWKMKHLGETNFYLCEINRDMVIDATYK 184
G+ A E I + VIEY+GE+I + + E+L++ +G + YL ++ D +IDAT
Sbjct: 1241 GLFAMEPIDANDMVIEYIGEIIRQKVADHREKLYEASGIGSS--YLFRVDEDTIIDATKT 1298
Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
GN +R+INH C PN + +DG RI I+A RDIK+GE LTYDY++ D + C
Sbjct: 1299 GNLARFINHCCEPNCNAKVISVDGTKRIVIYANRDIKEGEELTYDYKFPIEEDKIPCLC 1357
>gi|295663144|ref|XP_002792125.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226279300|gb|EEH34866.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 816
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/188 (30%), Positives = 103/188 (54%), Gaps = 18/188 (9%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQN--RPVKK-------M 116
C CT G CD +C + C + CK G C N+ F++ + +K +
Sbjct: 373 CMCTPENG----CDENCQNRCMFYECDDNNCKLGAELCRNRSFEDLRQRIKAGGKYNIGV 428
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
++++T G G+ ++ + + ++EY GE++ + CE R+ + E +YL +++
Sbjct: 429 EVIKTADRGYGVRSNRTFEPNQIIVEYTGEIVTQKECERRMRTVYKNNEC-YYLMYFDQN 487
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEFL 235
M+IDAT +G+ +R++NHSC PN EM+KW + G+ R+ +FA ++ I GE LTYDY ++
Sbjct: 488 MIIDAT-RGSIARFVNHSCEPNCEMEKWTVAGKPRMALFAGKNGITTGEELTYDYNFDPY 546
Query: 236 HDSLIAYC 243
+ C
Sbjct: 547 SQKNVQEC 554
>gi|358253208|dbj|GAA52505.1| histone-lysine N-methyltransferase SETD2 [Clonorchis sinensis]
Length = 757
Score = 98.6 bits (244), Expect = 2e-18, Method: Composition-based stats.
Identities = 47/97 (48%), Positives = 61/97 (62%)
Query: 136 RGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSC 195
+ F+IEYVGEVID R+ + + LG + Y + D IDA KGN +R++NHSC
Sbjct: 4 KSTFIIEYVGEVIDFAEFRRRIRRYERLGHAHHYFMALESDRFIDAGAKGNWARFVNHSC 63
Query: 196 CPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
PN QKW +DGE RIG FA DI+ GE +T DYQ+
Sbjct: 64 EPNCVTQKWSVDGEIRIGFFAREDIEAGEEITIDYQF 100
>gi|301114847|ref|XP_002999193.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262111287|gb|EEY69339.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 295
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 51/119 (42%), Positives = 74/119 (62%), Gaps = 2/119 (1%)
Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
G G+ EDIK G FV EY+GE++ +Q E + ++ + E + Y+ ++ VIDAT
Sbjct: 13 GLGMKLLEDIKAGSFVGEYMGEIVTEQ--EYYMRRVLYHNEKHRYMMVLSGGEVIDATRM 70
Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
G +R+INHSC PN ++KW ++GE R IFA RDI GE LT+DY++E + I C
Sbjct: 71 GGWARFINHSCNPNCGVEKWDVNGEERCAIFALRDIVAGEELTFDYKFESFSKAEITEC 129
>gi|340519773|gb|EGR50011.1| predicted protein [Trichoderma reesei QM6a]
Length = 654
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 55/176 (31%), Positives = 95/176 (53%), Gaps = 14/176 (7%)
Query: 80 CDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQNRPVKKMK---------LVQTEKCGAGI 128
C C ++L C + C G + C N+ F + ++ + +++T G G+
Sbjct: 416 CGESCQNRIMLYECDDTNCNIGKARCTNRAFADLTARRARGGKYRVGVEVIKTADRGYGV 475
Query: 129 VADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKS 188
++ K + ++EY GE+I ++ CE R+ ++ E +YL +++M+IDAT G+ +
Sbjct: 476 RSNRCFKPHQIIMEYAGEIITEEECERRMNEVYKNNEC-YYLMSFDQNMIIDATT-GSIA 533
Query: 189 RYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYEFLHDSLIAYC 243
R++NHSC PN M KWI+ G+ R+ +FA R I GE LTYDY ++ + C
Sbjct: 534 RFVNHSCNPNCRMIKWIVSGQPRMALFAGDRPIMTGEELTYDYNFDPFSAKNVQKC 589
>gi|359492362|ref|XP_002284621.2| PREDICTED: uncharacterized protein LOC100245350 [Vitis vinifera]
Length = 2184
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 61/190 (32%), Positives = 87/190 (45%), Gaps = 40/190 (21%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
++ I+ N++L + + + D+ + C C C +C ML C G C CG+
Sbjct: 1217 WMLIRSNLFLHRSRRTQTIDEVMVCHCKRPVEGRFGCGDECLNRMLNIECVQGTCPCGDL 1276
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C N+ +V+D QT E R +
Sbjct: 1277 CSNQ---------------------------------------QVLDLQTYEARQKEYAS 1297
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
G +FY +N VIDA KGN R+INHSC PN +KW+++GE IG+FA RDIKK
Sbjct: 1298 RGHKHFYFMTLNGSEVIDACAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKK 1357
Query: 223 GENLTYDYQY 232
GE +T+DY Y
Sbjct: 1358 GEEVTFDYNY 1367
>gi|341896622|gb|EGT52557.1| hypothetical protein CAEBREN_29129 [Caenorhabditis brenneri]
Length = 873
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 67/194 (34%), Positives = 96/194 (49%), Gaps = 14/194 (7%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSGCKCGNSC 103
Y I N Y TK + + C C + G + C +++ C S CK C
Sbjct: 182 YEHIVENQY-TKELMSEERTESAHCECHEN-GKDCSSEATCTNMGMMTECPSTCK-AEGC 238
Query: 104 LNKPFQNRPVKKMKL-VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
N+ Q R ++ + +E+ G G++ + I+RG+FVIEYVGEV E+ +H
Sbjct: 239 RNQRIQRREYAELAIKFISEEVGHGVITLKPIRRGQFVIEYVGEVTQIGDEEKATSDSRH 298
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGE----TRIGIFATR 218
F+ E +R IDAT KGN SR+INH C PN E KW + G R+ FA
Sbjct: 299 ---RYFFDSESHR---IDATRKGNISRFINHHCSPNLECHKWSVPGTPDNLQRLVFFAKN 352
Query: 219 DIKKGENLTYDYQY 232
DI+ GE + +DYQ+
Sbjct: 353 DIEAGEEINFDYQF 366
>gi|168060592|ref|XP_001782279.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666292|gb|EDQ52951.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 690
Score = 97.4 bits (241), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 55/163 (33%), Positives = 84/163 (51%), Gaps = 14/163 (8%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
G ++ C+S C C C N+ Q K+++ ++ G + + + I G FV EY+GE
Sbjct: 509 GYMVYECNSSCLCREDCQNRVLQKGVRVKLEVFKSRHKGWAVRSAQPIPSGTFVCEYIGE 568
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDM----------VIDATYKGNKSRYINHSCC 196
V++D+ +R + G + Y + + DM VIDAT GN +R+INHSC
Sbjct: 569 VVNDREANQRGVRYDQDGCSYLYDIDAHLDMSISRAGAKPFVIDATKHGNVARFINHSCA 628
Query: 197 PNTEMQKWIIDGE----TRIGIFATRDIKKGENLTYDYQYEFL 235
PN + +++ IG FA RDI GE L YDY+Y+ L
Sbjct: 629 PNLINYEVLVESMDCQLAHIGFFANRDISAGEELAYDYRYKLL 671
>gi|299746032|ref|XP_002910994.1| Setd1a protein [Coprinopsis cinerea okayama7#130]
gi|298406870|gb|EFI27500.1| Setd1a protein [Coprinopsis cinerea okayama7#130]
Length = 1614
Score = 97.4 bits (241), Expect = 4e-18, Method: Composition-based stats.
Identities = 51/114 (44%), Positives = 68/114 (59%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A E I +GE VIEYVGEVI Q ++R + G + YL I+ + V+DAT KGN
Sbjct: 1487 GLYAMEKISKGEMVIEYVGEVIRAQVADKREKTYEKQGIGSSYLFRIDEEFVVDATKKGN 1546
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLI 240
R INHSC PN + I G +I I+A +DI+ GE +TYDY + D+ I
Sbjct: 1547 LGRLINHSCDPNCTAKIITISGVKKIVIYAKQDIELGEEITYDYHFPIEQDNKI 1600
>gi|409078063|gb|EKM78427.1| hypothetical protein AGABI1DRAFT_41599 [Agaricus bisporus var.
burnettii JB137-S8]
gi|426194069|gb|EKV44001.1| histone methyltransferase [Agaricus bisporus var. bisporus H97]
Length = 163
Score = 97.4 bits (241), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 53/130 (40%), Positives = 74/130 (56%)
Query: 114 KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
K ++ ++ G+ A E I RGE VIEYVGEVI ++R + G + YL I
Sbjct: 24 KHLRFAKSPIHDWGLYAMERIARGEMVIEYVGEVIRAAVADKREKAYERQGIGSSYLFRI 83
Query: 174 NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
+ D+V+DAT KGN R INHSC PN + I G +I I+A +DI+ G+ +TYDY +
Sbjct: 84 DEDLVVDATKKGNLGRLINHSCDPNCTAKIITISGVKKIVIYAKQDIELGDEITYDYHFP 143
Query: 234 FLHDSLIAYC 243
F D + C
Sbjct: 144 FEQDKIPCLC 153
>gi|326429534|gb|EGD75104.1| set-domain-containing protein [Salpingoeca sp. ATCC 50818]
Length = 508
Score = 97.4 bits (241), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 59/157 (37%), Positives = 84/157 (53%), Gaps = 11/157 (7%)
Query: 84 CHCGMLLSSCSS----GCKCGN---SCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKR 136
C C + C C G+ SC N Q R K + L ++ G GI EDI++
Sbjct: 338 CPCAAAVRECDPDLCISCGAGDEEPSCKNCGIQRRQHKHLLLAPSDVAGWGIFTKEDIQK 397
Query: 137 GEFVIEYVGEVIDDQTCEERLWKM-KHLGETNFYLCEINRDMVIDATYKGNKSRYINHSC 195
G F+ EY GEVI + E R KH+ +L +N + V+DAT KGNK R+ NH+
Sbjct: 398 GAFISEYCGEVISQEEAERRGKVYDKHMCS---FLFNLNAEYVVDATRKGNKIRFANHAN 454
Query: 196 CPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
PN + +++GE RIGI+A R+I G+ L +DY+Y
Sbjct: 455 DPNCNARVMLVNGEHRIGIYAKRNIPAGKELFFDYRY 491
>gi|242066244|ref|XP_002454411.1| hypothetical protein SORBIDRAFT_04g030350 [Sorghum bicolor]
gi|241934242|gb|EES07387.1| hypothetical protein SORBIDRAFT_04g030350 [Sorghum bicolor]
Length = 633
Score = 97.1 bits (240), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 60/166 (36%), Positives = 85/166 (51%), Gaps = 18/166 (10%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGE 146
G + C+S C C +SC NK Q + K++L +TE G I A E I +G FV EY+GE
Sbjct: 450 GYPIYECNSSCTCDSSCQNKVLQKGLLVKLELFRTENKGWAIRAAEPIPQGTFVCEYIGE 509
Query: 147 VIDDQTCEERLWKMKHLGETNFYLCEINRDM-------------VIDATYKGNKSRYINH 193
V+ + + G + YL +I + +IDAT GN SRYINH
Sbjct: 510 VVKADKTMKNAESVSSKGGCS-YLFDIASQIDMERVRTVGAIEYLIDATRSGNVSRYINH 568
Query: 194 SCCPNTEMQKWIIDGE----TRIGIFATRDIKKGENLTYDYQYEFL 235
SC PN + +++ + IG+FA RDI GE L YDY+ + +
Sbjct: 569 SCSPNLSTRLVLVESKDCQLAHIGLFANRDIAVGEELAYDYRQKLV 614
>gi|302416827|ref|XP_003006245.1| histone-lysine N-methyltransferase [Verticillium albo-atrum VaMs.102]
gi|261355661|gb|EEY18089.1| histone-lysine N-methyltransferase [Verticillium albo-atrum VaMs.102]
Length = 1135
Score = 97.1 bits (240), Expect = 5e-18, Method: Composition-based stats.
Identities = 49/120 (40%), Positives = 71/120 (59%)
Query: 114 KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
K +K ++ G+ +E+I + + +IEYVGE + Q E R + G + YL I
Sbjct: 993 KPVKFARSAIHNWGLYTEENINKDDMIIEYVGEQVRQQISEIREVRYLKQGMGSSYLFRI 1052
Query: 174 NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
+ + VIDAT KG +R+INHSC PN + +DG RI I+A RDI + E LTYDY++E
Sbjct: 1053 DENTVIDATKKGGIARFINHSCMPNCTAKIIKVDGSKRIVIYALRDIARTEELTYDYKFE 1112
>gi|345315868|ref|XP_001516412.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2,
partial [Ornithorhynchus anatinus]
Length = 951
Score = 97.1 bits (240), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 60/158 (37%), Positives = 84/158 (53%), Gaps = 4/158 (2%)
Query: 40 KAIPYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVC--DRDCHCGMLLSSCSSG- 96
K PY IK N K ++ D C P C D +C ML+ C
Sbjct: 795 KPPPYKHIKVNKPCGK-VQIYTADISEIPKCNCKPTDESPCGFDSECLNRMLMYECHPQV 853
Query: 97 CKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEER 156
C G C N+ F R + K+V+T+ G G++A DIK+GEFV EYVGE+ID++ C R
Sbjct: 854 CPAGEQCQNQCFTKRQYPETKIVKTDGKGWGLLAKRDIKKGEFVNEYVGELIDEEECMAR 913
Query: 157 LWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
+ + T+FY+ I++D +IDA KGN SR++NHS
Sbjct: 914 IKHAQENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHS 951
>gi|384499027|gb|EIE89518.1| hypothetical protein RO3G_14229 [Rhizopus delemar RA 99-880]
Length = 1674
Score = 97.1 bits (240), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 48/119 (40%), Positives = 74/119 (62%)
Query: 114 KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
K++K ++ G+ A+E I + VIEYVGE+I Q EER + + G + YL +
Sbjct: 1535 KQLKFAKSPIHDWGLFAEEHIDVNDMVIEYVGEMIRQQVAEEREKQYERCGIGSSYLFRV 1594
Query: 174 NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
+ D VIDAT +G+ +R+INH C PN + +D + +I I+A RDI+ GE +TYDY++
Sbjct: 1595 DDDTVIDATKRGSIARFINHCCSPNCSAKIITVDKQKKIVIYANRDIEPGEEITYDYKF 1653
>gi|346974289|gb|EGY17741.1| histone-lysine N-methyltransferase [Verticillium dahliae VdLs.17]
Length = 1148
Score = 97.1 bits (240), Expect = 6e-18, Method: Composition-based stats.
Identities = 49/120 (40%), Positives = 71/120 (59%)
Query: 114 KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
K +K ++ G+ +E+I + + +IEYVGE + Q E R + G + YL I
Sbjct: 1006 KPVKFARSAIHNWGLYTEENINKDDMIIEYVGEQVRQQISEIREVRYLKQGMGSSYLFRI 1065
Query: 174 NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
+ + VIDAT KG +R+INHSC PN + +DG RI I+A RDI + E LTYDY++E
Sbjct: 1066 DENTVIDATKKGGIARFINHSCMPNCTAKIIKVDGSKRIVIYALRDIARTEELTYDYKFE 1125
>gi|389740685|gb|EIM81875.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 397
Score = 96.7 bits (239), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 71/202 (35%), Positives = 102/202 (50%), Gaps = 18/202 (8%)
Query: 49 RNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG--CKCG------ 100
RN + +KR KRR C+C+ S + DC C C +CG
Sbjct: 183 RNCHCSKRCKRRWRG----CTCSKSKRRNLCRTNDCPCWRAHRECDPELCTRCGAKYLFD 238
Query: 101 NSCLNKPFQNRPVKKMKL-VQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
+ C N Q +K+ +L T G + ADE G++++EYVGE++ D+T E R
Sbjct: 239 DECKNIAIQQNHIKRTELRSATYGLGLFLAADETALEGDYIMEYVGELVYDETAESRQDV 298
Query: 160 MKHLGETNFYLCEINRDMVIDATYKGNKSRYINH--SCCPNTEMQKWIIDGETRIGIFAT 217
K+LG FY +N + IDA+ GN++R+INH S N + +++GE RIGIFA
Sbjct: 299 AKYLGRNYFY--RLNETLNIDASRVGNEARFINHAPSRLANCNAEVLLVNGEHRIGIFAA 356
Query: 218 RDIKKGENLTYDYQYE-FLHDS 238
R I+ E L DY F+HDS
Sbjct: 357 RYIESNEELFLDYGPGFFIHDS 378
>gi|429329896|gb|AFZ81655.1| hypothetical protein BEWA_010720 [Babesia equi]
Length = 1153
Score = 96.7 bits (239), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 62/172 (36%), Positives = 89/172 (51%), Gaps = 13/172 (7%)
Query: 74 PGSSGVCDRDCHCGMLLSSCSSGCKCG--------NSCLNKPFQNRPVKKMKLVQTEKCG 125
P S+ C + CG S+ +C +C N+ F N K+KL E G
Sbjct: 587 PDSNTRCTCEISCGPDCSNILKNVECTPKNCSFHEKNCGNRRFTNISAPKLKLGFVEGKG 646
Query: 126 AGIVADEDIKRGEFVIEYVGEVIDDQTCEERL--WKMKHLGETN---FYLCEINRDMVID 180
G A EDI+ E V EYVGEVI + L W + + N +Y+ +I++D+ ID
Sbjct: 647 IGAFATEDIEIDELVCEYVGEVITHSDFQRSLSSWSFAEIDDNNQCHWYIMKIHKDIYID 706
Query: 181 ATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
+T+ GN +R+INHSC PN + G R+G+FA R I KGE +TY+Y +
Sbjct: 707 STHLGNVARFINHSCDPNCSSIPINVRGIYRMGVFAQRKIIKGEEVTYNYGF 758
>gi|406868443|gb|EKD21480.1| SET domain-containing protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 796
Score = 96.7 bits (239), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 58/188 (30%), Positives = 101/188 (53%), Gaps = 18/188 (9%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCG-NSCLNKPFQNRPVKK---------M 116
C C G CD DC +L C G C G + C N+ F + ++ +
Sbjct: 420 CVCKPEEG----CDEDCMNRTMLYECDDGNCASGRDRCTNRAFADLQERRKAGGKYRIGV 475
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
++++T G G+ ++ + + ++EY GE+I ++ C+ R+ + E +YL +++
Sbjct: 476 EVIKTADRGYGVRSNRCFEANQIIVEYTGEIITEEECDRRMNEDYKNNEC-YYLMSFDQN 534
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYEFL 235
M+IDAT KG+ +R++NHSC PN +M KWI+ G+ R+ +FA + I G+ LTYDY ++
Sbjct: 535 MIIDAT-KGSIARFVNHSCNPNCKMVKWIVGGKPRMALFAGDNPIMTGDELTYDYNFDPF 593
Query: 236 HDSLIAYC 243
+ C
Sbjct: 594 SAKNVQEC 601
>gi|219130356|ref|XP_002185333.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403248|gb|EEC43202.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 164
Score = 96.7 bits (239), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 45/141 (31%), Positives = 76/141 (53%), Gaps = 2/141 (1%)
Query: 93 CSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQT 152
C C C N+ N + +++V+ + G G+V +ED+ G+ ++EYVG + +
Sbjct: 1 CPKTCPGDMFCGNRRITNGEFRTVRVVEAGRKGRGLVVEEDVDVGDMILEYVGRAVPQKQ 60
Query: 153 CEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRI 212
+ + +H + Y+ + + IDA KG +RYINHSC PN ++Q+W + G R
Sbjct: 61 LAKYFRRYQH--DRRLYIMSLGDGIYIDARSKGGLARYINHSCEPNCQVQRWKVKGVLRA 118
Query: 213 GIFATRDIKKGENLTYDYQYE 233
+ TR + G LT+DYQ+E
Sbjct: 119 VVVPTRSLSAGTELTFDYQWE 139
>gi|226294579|gb|EEH49999.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
Length = 808
Score = 96.7 bits (239), Expect = 9e-18, Method: Composition-based stats.
Identities = 56/178 (31%), Positives = 95/178 (53%), Gaps = 27/178 (15%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCSS-GCKCGNS-CLNKPFQN--RPVKK-------M 116
C CT G CD +C + C CK G C N+ F++ + +K +
Sbjct: 373 CMCTPESG----CDENCQNRCMFYECDDDNCKLGAELCRNRSFEDLRQRIKAGGKYNIGV 428
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
++++T G G+ ++ + ++EY GE+I + CE R+ + + N +
Sbjct: 429 EVIKTADRGYGVRSNRTFAPNQIIVEYTGEIITQKECERRMRTVY----------KNNEN 478
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGENLTYDYQYE 233
M+IDAT +G+ +R++NHSC PN EM+KW + G+ R+ +FA ++ I GE LTYDY ++
Sbjct: 479 MIIDAT-RGSIARFVNHSCEPNCEMEKWTVAGKPRMALFAGKNGITTGEELTYDYNFD 535
>gi|405968248|gb|EKC33333.1| Histone-lysine N-methyltransferase SUV39H2 [Crassostrea gigas]
Length = 628
Score = 96.3 bits (238), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 62/152 (40%), Positives = 83/152 (54%), Gaps = 8/152 (5%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVG 145
G + C+ CKCG C N+ Q K+ L +T G G+ + IK G FV+EYVG
Sbjct: 312 GTPIYECNKRCKCGPECPNRVVQQGRKFKVCLFRTANGRGWGVKTLQKIKEGSFVVEYVG 371
Query: 146 EVIDDQTCEERLWKMKHLGETNFYLCEINR---DMVIDATYKGNKSRYINHSCCPNTEM- 201
EVI D+ E R + +G T + + N +DA Y GN S +INHSC PN E+
Sbjct: 372 EVITDKEAERRGKQYDAVGRTYLFDLDYNPGDCPFTVDAGYYGNVSHFINHSCDPNLEVF 431
Query: 202 QKWI--IDGE-TRIGIFATRDIKKGENLTYDY 230
WI +D RI +F+ RDI+KGE LT+DY
Sbjct: 432 AVWINTLDPRLPRIALFSKRDIEKGEELTFDY 463
>gi|254565269|ref|XP_002489745.1| Histone methyltransferase with a role in transcriptional elongation
[Komagataella pastoris GS115]
gi|238029541|emb|CAY67464.1| Histone methyltransferase with a role in transcriptional elongation
[Komagataella pastoris GS115]
gi|328350161|emb|CCA36561.1| histone-lysine N-methyltransferase ASH1L [Komagataella pastoris CBS
7435]
Length = 589
Score = 96.3 bits (238), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 59/184 (32%), Positives = 92/184 (50%), Gaps = 34/184 (18%)
Query: 91 SSCSSGCKCGNSCLNKPFQNRPVKKM-KLVQTEKCGA----------------------- 126
+SC+ CG+ CLN+ Q + KL +T CG
Sbjct: 184 NSCNCRVNCGDLCLNRILQIECDSTICKLSETRSCGNTQFTTLELGISSLFREGTRVCRI 243
Query: 127 ------GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVID 180
G+VA + E + EY GEVI+ + +ERL K K ++Y + + + ID
Sbjct: 244 DDKKGYGLVAIREFAPYELICEYTGEVINQEVVKERLSKKKVF---HYYHLSLEQGLSID 300
Query: 181 ATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYEFLHDSL 239
+T KG+ +R++NHSC PN E+QKW + E RIG+FA ++ I G+ +TYDY + +L ++
Sbjct: 301 STVKGSVARFVNHSCAPNAEVQKWYVQDEPRIGLFAGSKGIIPGDEITYDYNFIWLENAE 360
Query: 240 IAYC 243
C
Sbjct: 361 PQLC 364
>gi|348683877|gb|EGZ23692.1| hypothetical protein PHYSODRAFT_462592 [Phytophthora sojae]
Length = 292
Score = 96.3 bits (238), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 51/127 (40%), Positives = 78/127 (61%), Gaps = 2/127 (1%)
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
+LV + G G+ E++K G FV EY+GE++ +Q E + ++ + E + Y+ ++
Sbjct: 5 QLVDCGRKGLGLKLLENVKAGSFVGEYMGEIVTEQ--EYYMRRVLYHNEKHRYMMVLSGG 62
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLH 236
VIDAT G +R+INHSC PN ++KW ++GE R IFA RDI GE LT+DY++E
Sbjct: 63 EVIDATRMGGWARFINHSCDPNCGVEKWDVNGEERCAIFALRDIVVGEELTFDYKFESFS 122
Query: 237 DSLIAYC 243
+ I C
Sbjct: 123 KAEITEC 129
>gi|339244151|ref|XP_003378001.1| putative SET domain protein [Trichinella spiralis]
gi|316973127|gb|EFV56754.1| putative SET domain protein [Trichinella spiralis]
Length = 1416
Score = 96.3 bits (238), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 56/141 (39%), Positives = 85/141 (60%), Gaps = 10/141 (7%)
Query: 101 NSCLNKPFQNRPVKK-----MKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCE- 154
S + K +Q R +K+ + L +++ G G+ A+ D++ VIEYVGEVI ++ E
Sbjct: 1258 QSLVAKCYQYRKLKQEWKSNVYLARSKIQGLGLFANRDVEMNAMVIEYVGEVIRNEVAER 1317
Query: 155 -ERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIG 213
E+ ++ ++ G Y+ ++ D VIDAT G +RYINHSC PN ++ D E+RI
Sbjct: 1318 REKSYQKRNRG---VYMFRLDSDHVIDATVAGGPARYINHSCDPNCIAERIDFDRESRIV 1374
Query: 214 IFATRDIKKGENLTYDYQYEF 234
I + R I KGE LTYDYQ++F
Sbjct: 1375 IMSCRPICKGEELTYDYQFDF 1395
>gi|62858103|ref|NP_001016508.1| histone-lysine N-methyltransferase SUV39H2 [Xenopus (Silurana)
tropicalis]
gi|143586876|sp|Q28CQ7.2|SUV92_XENTR RecName: Full=Histone-lysine N-methyltransferase SUV39H2; AltName:
Full=Suppressor of variegation 3-9 homolog 2;
Short=Su(var)3-9 homolog 2
Length = 406
Score = 96.3 bits (238), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 60/163 (36%), Positives = 86/163 (52%), Gaps = 10/163 (6%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVG 145
G + C+S CKCG C N+ Q P + + +T+ G G+ + IK+ FV+EYVG
Sbjct: 219 GRPIYECNSRCKCGPDCPNRVVQKGPPYSLCIFRTDNGRGWGVKTLQKIKKNSFVMEYVG 278
Query: 146 EVIDDQTCEERLWKMKHLGETNFYLCEINRD-MVIDATYKGNKSRYINHSCCPNTEMQKW 204
EVI + E R + G T + + D +DA GN S ++NHSC PN ++
Sbjct: 279 EVITSEEAERRGQQYDSRGITYLFDLDYEADEFTVDAARYGNVSHFVNHSCDPNLQVFNV 338
Query: 205 IIDGE----TRIGIFATRDIKKGENLTYDYQY----EFLHDSL 239
ID RI +F+TR+IK GE LT+DYQ +F DS+
Sbjct: 339 FIDNLDVRLPRIALFSTRNIKAGEELTFDYQMKGSGDFSTDSI 381
>gi|391331299|ref|XP_003740087.1| PREDICTED: uncharacterized protein LOC100899404 [Metaseiulus
occidentalis]
Length = 2686
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 50/121 (41%), Positives = 73/121 (60%), Gaps = 5/121 (4%)
Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCE--ERLWKMKHLGETNFYLCEINRDMVIDAT 182
G G+ +DI +GE +IEY GEVI C+ E+ ++ + LG Y+ ++ D V+DAT
Sbjct: 2559 GRGLYCKKDIAKGEMIIEYAGEVIRASLCDRREKYYEGRGLG---CYMFRMDNDEVVDAT 2615
Query: 183 YKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAY 242
KGN +R+INHSC PN + +D + I I+A R+I+ GE LTYDY++ D L
Sbjct: 2616 VKGNAARFINHSCDPNCYSKMITVDNKKHIVIYALREIRTGEELTYDYKFPIEDDKLHCT 2675
Query: 243 C 243
C
Sbjct: 2676 C 2676
>gi|242786320|ref|XP_002480782.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
gi|218720929|gb|EED20348.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
Length = 1155
Score = 96.3 bits (238), Expect = 1e-17, Method: Composition-based stats.
Identities = 49/112 (43%), Positives = 68/112 (60%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q + R + G + YL I+ + VIDAT +G
Sbjct: 1026 GLYAEENIAANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENAVIDATKRGG 1085
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI K E LTYDY++E DS
Sbjct: 1086 IARFINHSCTPNCTAKIIRVDGSKRIVIYALRDISKDEELTYDYKFEREWDS 1137
>gi|165971516|gb|AAI58300.1| suppressor of variegation 3-9 homolog 2 [Xenopus (Silurana)
tropicalis]
Length = 406
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/151 (37%), Positives = 81/151 (53%), Gaps = 6/151 (3%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVG 145
G + C+S CKCG C N+ Q P + + +T+ G G+ + IK+ FV+EYVG
Sbjct: 219 GRPIYECNSRCKCGPDCPNRVVQKGPPYSLCIFRTDNGRGWGVKTLQKIKKNSFVMEYVG 278
Query: 146 EVIDDQTCEERLWKMKHLGETNFYLCEINRD-MVIDATYKGNKSRYINHSCCPNTEMQKW 204
EVI + E R + G T + + D +DA GN S ++NHSC PN ++
Sbjct: 279 EVITSEEAERRGQQYDSRGITYLFDLDYEADEFTVDAARYGNVSHFVNHSCDPNLQVFNV 338
Query: 205 IIDGE----TRIGIFATRDIKKGENLTYDYQ 231
ID RI +F+TR+IK GE LT+DYQ
Sbjct: 339 FIDNLDVRLPRIALFSTRNIKAGEELTFDYQ 369
>gi|341876597|gb|EGT32532.1| hypothetical protein CAEBREN_21874 [Caenorhabditis brenneri]
Length = 1048
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 60/160 (37%), Positives = 90/160 (56%), Gaps = 14/160 (8%)
Query: 83 DCHC--GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEF 139
D HC +++ C S C+ C N+ F + ++ T G G+ A +DIK G+F
Sbjct: 654 DSHCVNRAMMTECPSSCR-AKKCKNQRFAKKKYAAVEPFHTGTAKGCGLRAVKDIKAGKF 712
Query: 140 VIEYVGEVIDDQTCEERLWKMKHLGET---NFYLCEINRDMVIDATYKGNKSRYINHSCC 196
+IEY+GEV++ + E+R K+K+ + + YLC+ IDAT GN SR++NHSC
Sbjct: 713 IIEYIGEVLEREDYEKR--KVKYAADKKHKHHYLCDTGV-YTIDATEFGNPSRFVNHSCD 769
Query: 197 PNTEMQKWIIDGE----TRIGIFATRDIKKGENLTYDYQY 232
PN +KW + +RIG FA R IK GE + +DYQ+
Sbjct: 770 PNAVCEKWSVPKTPGDISRIGFFAKRFIKAGEEICFDYQF 809
>gi|353243391|emb|CCA74938.1| related to regulatory protein SET1 [Piriformospora indica DSM 11827]
Length = 1224
Score = 96.3 bits (238), Expect = 1e-17, Method: Composition-based stats.
Identities = 51/130 (39%), Positives = 74/130 (56%)
Query: 114 KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
K+++ ++ G+ A E I +GE VIEYVGEVI Q E+R + G + YL I
Sbjct: 1085 KQLQFARSPIHDWGLYALERIPKGEMVIEYVGEVIRQQVAEKRERAYERSGIGSSYLFRI 1144
Query: 174 NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
+ D+V+DAT GN R INHSC PN + I G+ +I I+A DI G+ +TYDY +
Sbjct: 1145 DDDLVVDATKIGNLGRLINHSCDPNCTAKIITIGGQKKIVIYAKVDIHPGDEVTYDYHFP 1204
Query: 234 FLHDSLIAYC 243
++ + C
Sbjct: 1205 IENEKIPCLC 1214
>gi|347840183|emb|CCD54755.1| hypothetical protein [Botryotinia fuckeliana]
Length = 806
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 58/198 (29%), Positives = 101/198 (51%), Gaps = 22/198 (11%)
Query: 64 DGIFCSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCG-NSCLNKPFQ------------ 109
D F C P + CD DC ++L C + C G +SC N+ F
Sbjct: 399 DSFFSKCICKPDAG--CDEDCQNKIMLYECDDTNCGAGRDSCTNRAFAELVNRRKGNSFR 456
Query: 110 ---NRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGET 166
N+ +++++T G G+ ++ + ++EY GE+I + C+ R+ +
Sbjct: 457 KGGNKYEIGVEVIKTADRGYGVRSNRCFNANQIIVEYTGEIITEDECDRRM-NEDYKDND 515
Query: 167 NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRD-IKKGEN 225
+YL +++M+IDAT +G+ +R++NHSC PN M KWI++G+ R+ +FA + I G+
Sbjct: 516 CYYLMSFDQNMIIDAT-RGSIARFVNHSCRPNCRMVKWIVEGKPRMALFAGDNPIMTGDE 574
Query: 226 LTYDYQYEFLHDSLIAYC 243
LTYDY ++ + C
Sbjct: 575 LTYDYNFDPFSAKNVQAC 592
>gi|89267215|emb|CAJ81421.1| suppressor of variegation 3-9 homolog 2 [Xenopus (Silurana)
tropicalis]
Length = 410
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/151 (37%), Positives = 81/151 (53%), Gaps = 6/151 (3%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVG 145
G + C+S CKCG C N+ Q P + + +T+ G G+ + IK+ FV+EYVG
Sbjct: 223 GRPIYECNSRCKCGPDCPNRVVQKGPPYSLCIFRTDNGRGWGVKTLQKIKKNSFVMEYVG 282
Query: 146 EVIDDQTCEERLWKMKHLGETNFYLCEINRD-MVIDATYKGNKSRYINHSCCPNTEMQKW 204
EVI + E R + G T + + D +DA GN S ++NHSC PN ++
Sbjct: 283 EVITSEEAERRGQQYDSRGITYLFDLDYEADEFTVDAARYGNVSHFVNHSCDPNLQVFNV 342
Query: 205 IIDGE----TRIGIFATRDIKKGENLTYDYQ 231
ID RI +F+TR+IK GE LT+DYQ
Sbjct: 343 FIDNLDVRLPRIALFSTRNIKAGEELTFDYQ 373
>gi|358056651|dbj|GAA97314.1| hypothetical protein E5Q_03992 [Mixia osmundae IAM 14324]
Length = 844
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 72/209 (34%), Positives = 101/209 (48%), Gaps = 14/209 (6%)
Query: 43 PYVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGN 101
PY I N Y ++ + + FC A PG G C +C ++ C C C +
Sbjct: 305 PYKIIASNAYPERQKQPSVLQAICFCKPPA-PGEVG-CGDNCINRVMAYMCDPKLCPCKD 362
Query: 102 SCLNKPFQNRPVKKMK-------LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCE 154
C N P Q R K + T G G+ A IK G F++EY GEVI
Sbjct: 363 KCTNGPLQTRKSAAGKGDKEGVAVFYTGSRGFGLKATLPIKSGAFIMEYKGEVISINESY 422
Query: 155 ERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEM--QKWIIDGETRI 212
R+ K+ + G N+YL + + V+DA KGN +R+INHSC PN E+ K+ E +I
Sbjct: 423 RRV-KLIYPG-PNYYLLSYDDEEVLDAGLKGNATRFINHSCDPNCEVVRLKFADYDEFQI 480
Query: 213 GIFATRDIKKGENLTYDYQYEFLHDSLIA 241
G+FA RDI E +TY+Y ++ S A
Sbjct: 481 GLFALRDISPEEEITYNYGWQSFSSSSTA 509
>gi|340959767|gb|EGS20948.1| hypothetical protein CTHT_0027870 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 1295
Score = 95.9 bits (237), Expect = 1e-17, Method: Composition-based stats.
Identities = 51/120 (42%), Positives = 71/120 (59%)
Query: 114 KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
K +K ++ G+ A E+I + E +IEYVGE + Q E R + G + YL I
Sbjct: 1153 KPVKFARSAIHNWGLYAMENIPKDEMIIEYVGEEVRQQIAELRESRYLKSGIGSSYLFRI 1212
Query: 174 NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
+ + VIDAT KG +R+INHSC PN + ++G RI I+A RDI K E LTYDY++E
Sbjct: 1213 DDNTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAKNEELTYDYKFE 1272
>gi|212543321|ref|XP_002151815.1| SET domain protein [Talaromyces marneffei ATCC 18224]
gi|210066722|gb|EEA20815.1| SET domain protein [Talaromyces marneffei ATCC 18224]
Length = 1188
Score = 95.9 bits (237), Expect = 1e-17, Method: Composition-based stats.
Identities = 49/112 (43%), Positives = 68/112 (60%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q + R + G + YL I+ + VIDAT +G
Sbjct: 1059 GLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENAVIDATKRGG 1118
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI K E LTYDY++E DS
Sbjct: 1119 IARFINHSCTPNCTAKIIRVDGSKRIVIYALRDISKDEELTYDYKFEREWDS 1170
>gi|156351239|ref|XP_001622422.1| predicted protein [Nematostella vectensis]
gi|156208959|gb|EDO30322.1| predicted protein [Nematostella vectensis]
Length = 688
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 54/138 (39%), Positives = 76/138 (55%), Gaps = 12/138 (8%)
Query: 100 GNSCLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWK 159
+C N Q K M L ++ G GI + +K+ EF+ EY GEVI + R
Sbjct: 538 SKTCKNVSLQRGQRKHMLLAPSDVAGWGIYIKQSVKKNEFISEYCGEVISQDEADRR--- 594
Query: 160 MKHLGET-NFYLCE----INRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGI 214
G+ + Y+C +N D V+DAT KGNK R+ NHS PN + +++G+ RIGI
Sbjct: 595 ----GKVYDKYMCSFLFNLNNDFVVDATRKGNKIRFANHSISPNCYAKVMMVNGDHRIGI 650
Query: 215 FATRDIKKGENLTYDYQY 232
FA RDI+ GE L +DY+Y
Sbjct: 651 FAKRDIEAGEELFFDYRY 668
>gi|147901855|ref|NP_001091337.1| suppressor of variegation 3-9 homolog 2 [Xenopus laevis]
gi|125858896|gb|AAI29544.1| LOC100037174 protein [Xenopus laevis]
Length = 406
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 58/157 (36%), Positives = 83/157 (52%), Gaps = 6/157 (3%)
Query: 87 GMLLSSCSSGCKCGNSCLNKPFQNRPVKKMKLVQTEKC-GAGIVADEDIKRGEFVIEYVG 145
G + C+S CKCG C N+ Q P + + +T+ G G+ + IK+ FV+EYVG
Sbjct: 219 GRPIFECNSRCKCGPDCPNRVVQKGPPYSLCIFRTDNGRGWGVKTLQKIKKNSFVMEYVG 278
Query: 146 EVIDDQTCEERLWKMKHLGETNFYLCEINRD-MVIDATYKGNKSRYINHSCCPNTEMQKW 204
EVI + E R + G T + + D +DA GN S ++NHSC PN ++
Sbjct: 279 EVITSEEAERRGQQYDSKGITYLFDLDYEADEFTVDAARYGNVSHFVNHSCDPNLQVFNV 338
Query: 205 IIDGE----TRIGIFATRDIKKGENLTYDYQYEFLHD 237
ID RI +F+TR+IK GE LT+DYQ + D
Sbjct: 339 FIDNLDVRLPRIALFSTRNIKAGEELTFDYQMKGYGD 375
>gi|296422581|ref|XP_002840838.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295637063|emb|CAZ85029.1| unnamed protein product [Tuber melanosporum]
Length = 1200
Score = 95.9 bits (237), Expect = 1e-17, Method: Composition-based stats.
Identities = 50/120 (41%), Positives = 71/120 (59%)
Query: 114 KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
K +K ++ G+ A E+I G+ +IEYVGE+I Q + R K G + YL I
Sbjct: 1058 KPVKFARSAIHNWGLYAMENISAGDMIIEYVGEIIRQQVADMREKKYLKSGIGSSYLFRI 1117
Query: 174 NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
+ VIDAT G +R+INHSC PN + ++G RI I+A RDI++ E LTYDY++E
Sbjct: 1118 DDTTVIDATKAGGIARFINHSCTPNCTAKIIKVEGSKRIVIYALRDIRENEELTYDYKFE 1177
>gi|357139465|ref|XP_003571302.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-4
specific-like [Brachypodium distachyon]
Length = 319
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 81/130 (62%), Gaps = 1/130 (0%)
Query: 114 KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
KK+++ +++ G++A DI +GE ++EY+GE + + R + + G+ +++ +I
Sbjct: 181 KKLRVERSKIHELGVMAVTDIMKGELIVEYIGERMPKWVADLRGLRYEKAGKGDYFF-KI 239
Query: 174 NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
+ +VIDAT +G +RYINHSC PN E + + +G+ RI I+A + IK G LTYDY++
Sbjct: 240 DAGLVIDATLRGGIARYINHSCEPNCETRVILSNGQRRIFIYANQKIKAGTELTYDYKFP 299
Query: 234 FLHDSLIAYC 243
F + + C
Sbjct: 300 FEENKIPCSC 309
>gi|384484496|gb|EIE76676.1| hypothetical protein RO3G_01380 [Rhizopus delemar RA 99-880]
Length = 565
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/117 (40%), Positives = 70/117 (59%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A E I + VIEY+GEVI Q E R + +G + YL ++ DMVIDAT KG
Sbjct: 439 GLYAGESIDAHDIVIEYIGEVIRQQVAEIREKHYERIGIGSSYLFRVDDDMVIDATKKGG 498
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
+R+INH C PN + +D + ++ I+A RDI+ GE +TYDY++ + + +C
Sbjct: 499 MARFINHCCTPNCSAKIITVDKQKKVVIYANRDIEPGEEITYDYKFPIEAEKIPCFC 555
>gi|195438882|ref|XP_002067361.1| GK16377 [Drosophila willistoni]
gi|194163446|gb|EDW78347.1| GK16377 [Drosophila willistoni]
Length = 2510
Score = 95.5 bits (236), Expect = 2e-17, Method: Composition-based stats.
Identities = 62/179 (34%), Positives = 93/179 (51%), Gaps = 25/179 (13%)
Query: 82 RDCHCGMLLSSCSSGCKCGNSC-----LNKPFQNRPV-------KKMK--------LVQT 121
R H SCSS + NS + P+ + V KKMK L ++
Sbjct: 2318 RKPHTQRTAGSCSSTQRMANSAAIAGEVACPYSKQFVHSKSSQYKKMKQEWRNNVYLARS 2377
Query: 122 EKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCE--ERLWKMKHLGETNFYLCEINRDMVI 179
+ G G+ A DI++ +IEY+GEVI + E E+ ++ K+ G Y+ ++ D V+
Sbjct: 2378 KIQGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQYEAKNRG---IYMFRLDEDRVV 2434
Query: 180 DATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
DAT G +RYINHSC PN + +D + RI IFA R I +GE L+YDY+++ +S
Sbjct: 2435 DATLSGGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDES 2493
>gi|317037780|ref|XP_001399137.2| histone-lysine N-methyltransferase, H3 lysine-4 specific [Aspergillus
niger CBS 513.88]
Length = 1239
Score = 95.1 bits (235), Expect = 2e-17, Method: Composition-based stats.
Identities = 48/112 (42%), Positives = 69/112 (61%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q + R + G + YL I+ + VIDAT +G
Sbjct: 1110 GLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1169
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI++ E LTYDY++E DS
Sbjct: 1170 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1221
>gi|327349992|gb|EGE78849.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ATCC
18188]
Length = 775
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 58/188 (30%), Positives = 98/188 (52%), Gaps = 18/188 (9%)
Query: 68 CSCTASPGSSGVCDRDCHCGMLLSSCS-SGCKCGNS-CLNKPFQN-RPVKKM-------- 116
C CT G CD +C + C + CK G C N+ F+ R K+
Sbjct: 376 CMCTPESG----CDENCQNRYMFYECDDNNCKLGAELCGNRNFEGLRQRSKLGGKYNIGV 431
Query: 117 KLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRD 176
++++T G G+ ++ + ++EY GE++ + CE R+ + E +YL +R+
Sbjct: 432 EVIKTADRGYGVRSNRTFAPNQIIVEYTGEIVTQEECERRMRTVYKNNEC-YYLMYFDRN 490
Query: 177 MVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFA-TRDIKKGENLTYDYQYEFL 235
M+IDAT +G+ +R++NHSC PN +++KW + G+ R+ +FA I GE LTYDY ++
Sbjct: 491 MIIDAT-RGSIARFVNHSCEPNCKIEKWTVAGKPRMALFAGEHGIMTGEELTYDYNFDPY 549
Query: 236 HDSLIAYC 243
+ C
Sbjct: 550 SQKNVQEC 557
>gi|261201264|ref|XP_002627032.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
SLH14081]
gi|239592091|gb|EEQ74672.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
SLH14081]
gi|239611745|gb|EEQ88732.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ER-3]
Length = 1259
Score = 95.1 bits (235), Expect = 2e-17, Method: Composition-based stats.
Identities = 48/112 (42%), Positives = 69/112 (61%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q + R + G + YL I+ + VIDAT +G
Sbjct: 1130 GLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1189
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI++ E LTYDY++E DS
Sbjct: 1190 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1241
>gi|154278862|ref|XP_001540244.1| hypothetical protein HCAG_04084 [Ajellomyces capsulatus NAm1]
gi|150412187|gb|EDN07574.1| hypothetical protein HCAG_04084 [Ajellomyces capsulatus NAm1]
Length = 1266
Score = 95.1 bits (235), Expect = 2e-17, Method: Composition-based stats.
Identities = 48/112 (42%), Positives = 69/112 (61%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q + R + G + YL I+ + VIDAT +G
Sbjct: 1137 GLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1196
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI++ E LTYDY++E DS
Sbjct: 1197 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1248
>gi|350630881|gb|EHA19253.1| hypothetical protein ASPNIDRAFT_56859 [Aspergillus niger ATCC 1015]
Length = 1101
Score = 95.1 bits (235), Expect = 2e-17, Method: Composition-based stats.
Identities = 48/112 (42%), Positives = 69/112 (61%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q + R + G + YL I+ + VIDAT +G
Sbjct: 972 GLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1031
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI++ E LTYDY++E DS
Sbjct: 1032 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1083
>gi|255938628|ref|XP_002560084.1| Pc14g00900 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211584705|emb|CAP74231.1| Pc14g00900 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 1202
Score = 95.1 bits (235), Expect = 2e-17, Method: Composition-based stats.
Identities = 48/112 (42%), Positives = 69/112 (61%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q + R + G + YL I+ + VIDAT +G
Sbjct: 1073 GLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1132
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI++ E LTYDY++E DS
Sbjct: 1133 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1184
>gi|358338843|dbj|GAA57433.1| histone-lysine N-methyltransferase trithorax, partial [Clonorchis
sinensis]
Length = 328
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 50/119 (42%), Positives = 69/119 (57%), Gaps = 1/119 (0%)
Query: 125 GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYK 184
G G+ A + E VIEY+GE+I + CE R + + G + Y+ I+ D+VIDATY
Sbjct: 201 GRGLFALRGFREDEMVIEYMGELIRNFVCETREIRYRSAG-VDCYMFRIDSDLVIDATYA 259
Query: 185 GNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDSLIAYC 243
GN +R+INHSC PN + +D + I I A R I GE LTYDY++ D L+ C
Sbjct: 260 GNAARFINHSCDPNCYAKVVTVDDKKHIVILAQRRIYPGEELTYDYRFPKESDKLLCNC 318
>gi|396457972|ref|XP_003833599.1| hypothetical protein LEMA_P063600.1 [Leptosphaeria maculans JN3]
gi|312210147|emb|CBX90234.1| hypothetical protein LEMA_P063600.1 [Leptosphaeria maculans JN3]
Length = 1372
Score = 95.1 bits (235), Expect = 2e-17, Method: Composition-based stats.
Identities = 51/127 (40%), Positives = 74/127 (58%), Gaps = 3/127 (2%)
Query: 110 NRPVKKMKLVQTEKCGA---GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGET 166
N+ K+ KLV+ ++ G+ A+E+I + +IEYVGE + + + R K G
Sbjct: 1223 NQLKKRKKLVKFDRSAIHNWGLYAEENIAANDMIIEYVGEKVRQRVADLREAKYDQQGVG 1282
Query: 167 NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENL 226
+ YL I+ D VIDAT G +R+INHSC PN + +D RI I+A RDI + E L
Sbjct: 1283 SSYLFRIDEDTVIDATKMGGIARFINHSCTPNCTAKIIRVDTSKRIVIYALRDISQDEEL 1342
Query: 227 TYDYQYE 233
TYDY++E
Sbjct: 1343 TYDYKFE 1349
>gi|358373521|dbj|GAA90119.1| SET domain protein [Aspergillus kawachii IFO 4308]
Length = 1239
Score = 95.1 bits (235), Expect = 2e-17, Method: Composition-based stats.
Identities = 48/112 (42%), Positives = 69/112 (61%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q + R + G + YL I+ + VIDAT +G
Sbjct: 1110 GLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1169
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI++ E LTYDY++E DS
Sbjct: 1170 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1221
>gi|115400872|ref|XP_001216024.1| hypothetical protein ATEG_07403 [Aspergillus terreus NIH2624]
gi|114189965|gb|EAU31665.1| hypothetical protein ATEG_07403 [Aspergillus terreus NIH2624]
Length = 1230
Score = 95.1 bits (235), Expect = 2e-17, Method: Composition-based stats.
Identities = 48/112 (42%), Positives = 69/112 (61%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q + R + G + YL I+ + VIDAT +G
Sbjct: 1101 GLYAEENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGSSYLFRIDENTVIDATKRGG 1160
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI++ E LTYDY++E DS
Sbjct: 1161 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1212
>gi|134084734|emb|CAK43391.1| unnamed protein product [Aspergillus niger]
Length = 1079
Score = 95.1 bits (235), Expect = 3e-17, Method: Composition-based stats.
Identities = 48/112 (42%), Positives = 69/112 (61%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q + R + G + YL I+ + VIDAT +G
Sbjct: 950 GLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1009
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI++ E LTYDY++E DS
Sbjct: 1010 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1061
>gi|345567899|gb|EGX50801.1| hypothetical protein AOL_s00054g887 [Arthrobotrys oligospora ATCC
24927]
Length = 1338
Score = 94.7 bits (234), Expect = 3e-17, Method: Composition-based stats.
Identities = 50/112 (44%), Positives = 66/112 (58%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A E+I E +IEYVGE++ Q + R G + YL I+ VIDAT KG
Sbjct: 1209 GLYAMENISNNEMIIEYVGEIVRQQVADLREKNYLRSGIGSSYLFRIDETTVIDATKKGG 1268
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + ++G RI I+A RDI K E LTYDY++E DS
Sbjct: 1269 IARFINHSCTPNCTAKIIKVEGTKRIVIYALRDIHKDEELTYDYKFEREIDS 1320
>gi|299117155|emb|CBN75119.1| histone methyltransferase [Ectocarpus siliculosus]
Length = 2067
Score = 94.7 bits (234), Expect = 3e-17, Method: Composition-based stats.
Identities = 43/119 (36%), Positives = 75/119 (63%)
Query: 114 KKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEI 173
KK++ ++ G+ ADE I G+ VIEY GE+I + ++R + + + + Y+ +
Sbjct: 1925 KKLRFGRSSVHAWGVFADEPIAAGDLVIEYRGEIIGNAVADKREKQYEDMQIGSDYMFRV 1984
Query: 174 NRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
+ D V+DAT+KG+ +R+INHSC P+ + ++G +I I+A RD+ GE L+YDY++
Sbjct: 1985 DEDTVVDATFKGSLARFINHSCDPSCTTRIITVEGSKKIVIYAERDVAMGEELSYDYKF 2043
>gi|341896557|gb|EGT52492.1| hypothetical protein CAEBREN_15183 [Caenorhabditis brenneri]
Length = 1112
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 60/146 (41%), Positives = 84/146 (57%), Gaps = 9/146 (6%)
Query: 93 CSSGCKCGNSCLNKPFQNRPVKKMK--LVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDD 150
C CK N C N+ F + K + + EK G G+V+DEDI G+F+ EY+G +D
Sbjct: 290 CPRKCKAAN-CQNQRFTEKKYAKTRRAFMGPEK-GYGLVSDEDIPVGKFIYEYLGRATND 347
Query: 151 QTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWII---D 207
+ +E K + + YL + VIDAT KGN+SR+INHSC PN ++WI+
Sbjct: 348 EGVKEIQKKNDADSKRHRYLLKSG-TYVIDATDKGNESRFINHSCDPNAVCERWIVPKTP 406
Query: 208 GE-TRIGIFATRDIKKGENLTYDYQY 232
GE +RIG FA + IKKGE + + Y Y
Sbjct: 407 GELSRIGFFAIKPIKKGEEILFKYNY 432
>gi|119185079|ref|XP_001243361.1| hypothetical protein CIMG_07257 [Coccidioides immitis RS]
gi|121936913|sp|Q1DR06.1|SET1_COCIM RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|392866240|gb|EAS28850.2| histone-lysine N-methyltransferase, H3 lysine-4 specific
[Coccidioides immitis RS]
Length = 1271
Score = 94.7 bits (234), Expect = 3e-17, Method: Composition-based stats.
Identities = 48/112 (42%), Positives = 68/112 (60%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q + R + G + YL I+ + VIDAT +G
Sbjct: 1142 GLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1201
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI + E LTYDY++E DS
Sbjct: 1202 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREWDS 1253
>gi|326477398|gb|EGE01408.1| histone-lysine N-methyltransferase [Trichophyton equinum CBS 127.97]
Length = 1331
Score = 94.7 bits (234), Expect = 3e-17, Method: Composition-based stats.
Identities = 48/112 (42%), Positives = 68/112 (60%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q + R + G + YL I+ + VIDAT G
Sbjct: 1202 GLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGG 1261
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI++ E LTYDY++E DS
Sbjct: 1262 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1313
>gi|326472906|gb|EGD96915.1| histone-lysine N-methyltransferase [Trichophyton tonsurans CBS
112818]
Length = 1330
Score = 94.7 bits (234), Expect = 3e-17, Method: Composition-based stats.
Identities = 48/112 (42%), Positives = 68/112 (60%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q + R + G + YL I+ + VIDAT G
Sbjct: 1201 GLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGG 1260
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI++ E LTYDY++E DS
Sbjct: 1261 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1312
>gi|315045626|ref|XP_003172188.1| histone-lysine N-methyltransferase [Arthroderma gypseum CBS 118893]
gi|311342574|gb|EFR01777.1| histone-lysine N-methyltransferase [Arthroderma gypseum CBS 118893]
Length = 1334
Score = 94.7 bits (234), Expect = 3e-17, Method: Composition-based stats.
Identities = 48/112 (42%), Positives = 68/112 (60%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q + R + G + YL I+ + VIDAT G
Sbjct: 1205 GLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGG 1264
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI++ E LTYDY++E DS
Sbjct: 1265 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1316
>gi|296805347|ref|XP_002843498.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
gi|238844800|gb|EEQ34462.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
Length = 1344
Score = 94.7 bits (234), Expect = 3e-17, Method: Composition-based stats.
Identities = 48/112 (42%), Positives = 68/112 (60%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q + R + G + YL I+ + VIDAT G
Sbjct: 1215 GLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGG 1274
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI++ E LTYDY++E DS
Sbjct: 1275 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1326
>gi|320032561|gb|EFW14513.1| histone-lysine N-methyltransferase [Coccidioides posadasii str.
Silveira]
Length = 1271
Score = 94.7 bits (234), Expect = 3e-17, Method: Composition-based stats.
Identities = 48/112 (42%), Positives = 68/112 (60%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q + R + G + YL I+ + VIDAT +G
Sbjct: 1142 GLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1201
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI + E LTYDY++E DS
Sbjct: 1202 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREWDS 1253
>gi|449017355|dbj|BAM80757.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 529
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 42/99 (42%), Positives = 62/99 (62%), Gaps = 2/99 (2%)
Query: 135 KRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGNKSRYINHS 194
+ +F++EY+GEVI R K H E + Y +++ +DA+ KG R++NHS
Sbjct: 66 QESDFIVEYMGEVIGPTELARR--KRDHALERHVYFMTLDQSTFLDASRKGTWGRFLNHS 123
Query: 195 CCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYE 233
C PN QKW++ G+ R+GIFA+RDI GE LT+DY+ E
Sbjct: 124 CEPNCHTQKWLVLGKVRVGIFASRDIAAGEELTFDYRME 162
>gi|303313714|ref|XP_003066866.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
gi|240106533|gb|EER24721.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
Length = 1271
Score = 94.7 bits (234), Expect = 3e-17, Method: Composition-based stats.
Identities = 48/112 (42%), Positives = 68/112 (60%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q + R + G + YL I+ + VIDAT +G
Sbjct: 1142 GLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1201
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI + E LTYDY++E DS
Sbjct: 1202 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREWDS 1253
>gi|255558564|ref|XP_002520307.1| huntingtin interacting protein, putative [Ricinus communis]
gi|223540526|gb|EEF42093.1| huntingtin interacting protein, putative [Ricinus communis]
Length = 1746
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/190 (32%), Positives = 83/190 (43%), Gaps = 40/190 (21%)
Query: 44 YVFIKRNIYLTKRIKRRLEDDGIFCSCTASPGSSGVCDRDCHCGMLLSSCSSG-CKCGNS 102
+ I N +L + K + D+ + C C C +C ML C G C CG+
Sbjct: 801 FTRISTNQFLHRSRKTQTIDEIMVCHCKLPLDGRLGCRDECLNRMLNIECVRGTCPCGDL 860
Query: 103 CLNKPFQNRPVKKMKLVQTEKCGAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKH 162
C N+ +V+D T E R +
Sbjct: 861 CSNQ---------------------------------------QVLDMHTYEARQREYAF 881
Query: 163 LGETNFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKK 222
G +FY +N VIDA KGN R+INHSC PN +KW+++GE IG+FA RDIKK
Sbjct: 882 QGHKHFYFMTLNGSEVIDACAKGNLGRFINHSCDPNCRTEKWVVNGEICIGLFALRDIKK 941
Query: 223 GENLTYDYQY 232
GE LT+DY Y
Sbjct: 942 GEELTFDYNY 951
>gi|302666919|ref|XP_003025054.1| hypothetical protein TRV_00712 [Trichophyton verrucosum HKI 0517]
gi|291189136|gb|EFE44443.1| hypothetical protein TRV_00712 [Trichophyton verrucosum HKI 0517]
Length = 1376
Score = 94.4 bits (233), Expect = 3e-17, Method: Composition-based stats.
Identities = 48/112 (42%), Positives = 68/112 (60%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q + R + G + YL I+ + VIDAT G
Sbjct: 1247 GLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGG 1306
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI++ E LTYDY++E DS
Sbjct: 1307 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1358
>gi|327348240|gb|EGE77097.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ATCC
18188]
Length = 1280
Score = 94.4 bits (233), Expect = 3e-17, Method: Composition-based stats.
Identities = 48/112 (42%), Positives = 68/112 (60%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q R + G + YL I+ + VIDAT +G
Sbjct: 1151 GLYAEENISANDMIIEYVGEKVRQQVANMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1210
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI++ E LTYDY++E DS
Sbjct: 1211 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1262
>gi|121709862|ref|XP_001272547.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119400697|gb|EAW11121.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 1232
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 48/112 (42%), Positives = 69/112 (61%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q + R + G + YL I+ + VIDAT +G
Sbjct: 1103 GLYAEENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGSSYLFRIDENTVIDATKRGG 1162
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI++ E LTYDY++E DS
Sbjct: 1163 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1214
>gi|325089235|gb|EGC42545.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus H88]
Length = 1267
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 48/112 (42%), Positives = 69/112 (61%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q + R + G + YL I+ + VIDAT +G
Sbjct: 1138 GLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1197
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI++ E LTYDY++E DS
Sbjct: 1198 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1249
>gi|169769549|ref|XP_001819244.1| histone-lysine N-methyltransferase, H3 lysine-4 specific [Aspergillus
oryzae RIB40]
gi|121933328|sp|Q2UMH3.1|SET1_ASPOR RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|83767103|dbj|BAE57242.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 1229
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 48/112 (42%), Positives = 69/112 (61%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q + R + G + YL I+ + VIDAT +G
Sbjct: 1100 GLYAEENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGSSYLFRIDENTVIDATKRGG 1159
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI++ E LTYDY++E DS
Sbjct: 1160 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1211
>gi|327304525|ref|XP_003236954.1| histone-lysine N-methyltransferase [Trichophyton rubrum CBS 118892]
gi|326459952|gb|EGD85405.1| histone-lysine N-methyltransferase [Trichophyton rubrum CBS 118892]
Length = 1337
Score = 94.4 bits (233), Expect = 4e-17, Method: Composition-based stats.
Identities = 48/112 (42%), Positives = 68/112 (60%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q + R + G + YL I+ + VIDAT G
Sbjct: 1208 GLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKHGG 1267
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI++ E LTYDY++E DS
Sbjct: 1268 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1319
>gi|391863483|gb|EIT72791.1| histone H3 (Lys4) methyltransferase complex, subunit SET1
[Aspergillus oryzae 3.042]
Length = 1223
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 48/112 (42%), Positives = 69/112 (61%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q + R + G + YL I+ + VIDAT +G
Sbjct: 1094 GLYAEENISANDMIIEYVGEKVRQQVADMRERQYLKSGIGSSYLFRIDENTVIDATKRGG 1153
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI++ E LTYDY++E DS
Sbjct: 1154 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1205
>gi|225554361|gb|EEH02660.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus G186AR]
Length = 1267
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 48/112 (42%), Positives = 69/112 (61%)
Query: 127 GIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGETNFYLCEINRDMVIDATYKGN 186
G+ A+E+I + +IEYVGE + Q + R + G + YL I+ + VIDAT +G
Sbjct: 1138 GLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSGIGSSYLFRIDENTVIDATKRGG 1197
Query: 187 KSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQYEFLHDS 238
+R+INHSC PN + +DG RI I+A RDI++ E LTYDY++E DS
Sbjct: 1198 IARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDS 1249
>gi|449295340|gb|EMC91362.1| hypothetical protein BAUCODRAFT_80239 [Baudoinia compniacensis UAMH
10762]
Length = 1279
Score = 94.4 bits (233), Expect = 4e-17, Method: Composition-based stats.
Identities = 49/127 (38%), Positives = 77/127 (60%), Gaps = 3/127 (2%)
Query: 110 NRPVKKMKLVQTEKC---GAGIVADEDIKRGEFVIEYVGEVIDDQTCEERLWKMKHLGET 166
N+ K+ KLV+ ++ G G+ A+E+I + +IEYVGE + + + R + + G
Sbjct: 1130 NQLKKRKKLVKFDRSAIHGWGLYAEENITVNDLIIEYVGEKVRQKVADLRELRYEKQGVG 1189
Query: 167 NFYLCEINRDMVIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENL 226
+ YL + D ++DAT KG +R+INHSC PN + ++G RI I+A +DI K E L
Sbjct: 1190 SSYLFRMMDDEIVDATKKGGIARFINHSCSPNCTAKIIKVEGTPRIVIYALKDIGKNEEL 1249
Query: 227 TYDYQYE 233
TYDY++E
Sbjct: 1250 TYDYKFE 1256
>gi|71029610|ref|XP_764448.1| hypothetical protein [Theileria parva strain Muguga]
gi|68351402|gb|EAN32165.1| hypothetical protein TP04_0811 [Theileria parva]
Length = 995
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 60/175 (34%), Positives = 92/175 (52%), Gaps = 13/175 (7%)
Query: 71 TASPGSSGVCDRDCHCGMLLSSCSSGCKCGN--------SCLNKPFQNRPVKKMKLVQTE 122
+P + C D CG S+ + +C +C N+ F + K++L +
Sbjct: 655 VVAPEAEMKCHCDKKCGSDCSNVTKNIECTVKNCGLADVNCGNRRFAHFSGPKLRLNYVD 714
Query: 123 KCGAGIVADEDIKRGEFVIEYVGEVI---DDQTC--EERLWKMKHLGETNFYLCEINRDM 177
G G VA E+I GE V EYVGEVI D Q C ++ ++++Y+ +I+RD
Sbjct: 715 GKGVGAVATEEIGEGELVCEYVGEVISQADFQRCLASASFAEIDDGNQSHWYVMKIHRDT 774
Query: 178 VIDATYKGNKSRYINHSCCPNTEMQKWIIDGETRIGIFATRDIKKGENLTYDYQY 232
ID+T+ GN +R+INHSC PN + G R+G+FA R IK+ E +TY+Y +
Sbjct: 775 YIDSTHLGNVARFINHSCDPNCASVPINVKGTYRMGVFALRKIKQDEEVTYNYGF 829
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.322 0.139 0.444
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,957,617,097
Number of Sequences: 23463169
Number of extensions: 163495975
Number of successful extensions: 343004
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 3845
Number of HSP's successfully gapped in prelim test: 1338
Number of HSP's that attempted gapping in prelim test: 331477
Number of HSP's gapped (non-prelim): 7075
length of query: 243
length of database: 8,064,228,071
effective HSP length: 138
effective length of query: 105
effective length of database: 9,121,278,045
effective search space: 957734194725
effective search space used: 957734194725
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 75 (33.5 bits)