BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy8426
(278 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|91081535|ref|XP_974904.1| PREDICTED: similar to Suv4-20 CG13363-PA [Tribolium castaneum]
gi|270006169|gb|EFA02617.1| hypothetical protein TcasGA2_TC008337 [Tribolium castaneum]
Length = 700
Score = 263 bits (672), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 123/165 (74%), Positives = 138/165 (83%), Gaps = 3/165 (1%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++PCYRYSLEG +GA+I AT+KW KND I LVGCIAELT+EEEK+LLH G NDFSVMYS
Sbjct: 118 IEPCYRYSLEGQKGAKISATKKWYKNDKIECLVGCIAELTEEEEKQLLHPGKNDFSVMYS 177
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+FVAT RDTACVKVLRDIE GEEITCFYGEDFFGDK
Sbjct: 178 CRKNCAQLWLGPAAYINHDCRANCKFVATGRDTACVKVLRDIEAGEEITCFYGEDFFGDK 237
Query: 186 NCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRK 230
NC CEC+TCE R GAFA+ S + + Y LR+TD R+N+ K
Sbjct: 238 NCYCECETCERRGMGAFAKES---DKKEENGYRLRETDNRINRTK 279
>gi|242020905|ref|XP_002430891.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212516102|gb|EEB18153.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 551
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 125/189 (66%), Positives = 146/189 (77%), Gaps = 6/189 (3%)
Query: 50 GTQHHRYSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELT 105
G + H Y Y + + ++PCYRYSLEG +GA+I AT+KW KND I+ LVGCIAELT
Sbjct: 91 GLEEHIYRYLRVFDQDSGFVIEPCYRYSLEGQKGAKISATKKWFKNDKISRLVGCIAELT 150
Query: 106 KEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLR 165
+EEE +LLH G NDFSVMYSCRKNCAQLWLGPAA+INHDCR NC+FVAT RDTACVKVLR
Sbjct: 151 EEEEGQLLHPGKNDFSVMYSCRKNCAQLWLGPAAFINHDCRANCKFVATGRDTACVKVLR 210
Query: 166 DIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMR 225
DIE GEEITCFYGEDFFGD NC CEC TCE R GAF + + T + Y LR+TD R
Sbjct: 211 DIEVGEEITCFYGEDFFGDGNCYCECRTCERRETGAFKKGTKHDDLT--TGYKLRETDNR 268
Query: 226 LNKRKLHKK 234
+N+ K+HK+
Sbjct: 269 INRTKIHKE 277
>gi|383852111|ref|XP_003701572.1| PREDICTED: uncharacterized protein LOC100875180 [Megachile
rotundata]
Length = 1050
Score = 255 bits (652), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 123/179 (68%), Positives = 142/179 (79%), Gaps = 3/179 (1%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++PCYRYSLEG +GA+I ATRKW K+D I+ LVGCIAEL+++EE LLH G NDFSVM+S
Sbjct: 143 IEPCYRYSLEGQKGAKICATRKWLKHDKISCLVGCIAELSEKEEAALLHPGKNDFSVMFS 202
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+FVAT RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 203 CRKNCAQLWLGPAAYINHDCRANCKFVATGRDTACVKVLRDIEVGEEITCFYGEDFFGDG 262
Query: 186 NCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLHKK-LNRLLLASD 243
NC CEC+TCE R GAFA +P S Y LR+TD R+N+ K ++ LNR +D
Sbjct: 263 NCYCECETCERRGTGAFANQ--KPGEELSSGYRLRETDNRINRTKHRQQPLNRSKQQAD 319
>gi|307172382|gb|EFN63848.1| Histone-lysine N-methyltransferase Suv4-20 [Camponotus floridanus]
Length = 1161
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 122/173 (70%), Positives = 141/173 (81%), Gaps = 3/173 (1%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++PCYRYSLEG +GA+I ATRKW K++ I+ LVGCIAEL+++EE LLH G NDFSVM+S
Sbjct: 143 IEPCYRYSLEGQKGAKICATRKWMKHEKISCLVGCIAELSEKEEAALLHPGKNDFSVMFS 202
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+FV T RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 203 CRKNCAQLWLGPAAYINHDCRANCKFVPTGRDTACVKVLRDIEVGEEITCFYGEDFFGDG 262
Query: 186 NCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLHKK-LNR 237
NC CEC+TCE R KGAFA S +P S Y LR+TD R+N+ K ++ LNR
Sbjct: 263 NCYCECETCERRGKGAFA--SQKPGEELSSGYRLRETDNRINRTKHRQQPLNR 313
>gi|332027439|gb|EGI67522.1| Histone-lysine N-methyltransferase Suv4-20 [Acromyrmex echinatior]
Length = 1287
Score = 254 bits (650), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 121/173 (69%), Positives = 140/173 (80%), Gaps = 3/173 (1%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++PCYRYSLEG +GA+I AT+KW K+D I+ LVGCIAEL+++EE LLH G NDFSVM+S
Sbjct: 238 IEPCYRYSLEGQKGAKICATKKWMKHDKISCLVGCIAELSEKEEAALLHPGKNDFSVMFS 297
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+FVAT RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 298 CRKNCAQLWLGPAAYINHDCRANCKFVATGRDTACVKVLRDIEVGEEITCFYGEDFFGDG 357
Query: 186 NCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLHKK-LNR 237
NC CEC+TCE R G FA S +P S Y LR+TD R+N+ K ++ LNR
Sbjct: 358 NCYCECETCERRGTGTFA--SQKPGEEMSSGYRLRETDNRINRTKHRQQPLNR 408
>gi|340728733|ref|XP_003402671.1| PREDICTED: hypothetical protein LOC100644080 [Bombus terrestris]
Length = 1136
Score = 254 bits (649), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 118/165 (71%), Positives = 135/165 (81%), Gaps = 2/165 (1%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++PC+RYSLEG +GA+I ATRKW K+D I+ LVGCIAEL+++EE LLH G NDFSVM+S
Sbjct: 143 IEPCFRYSLEGQKGAKICATRKWLKHDKISCLVGCIAELSEKEEAALLHPGKNDFSVMFS 202
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+FVAT RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 203 CRKNCAQLWLGPAAYINHDCRANCKFVATGRDTACVKVLRDIEVGEEITCFYGEDFFGDA 262
Query: 186 NCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRK 230
NC CEC+TCE R GAFA +P S Y LR+TD R+N+ K
Sbjct: 263 NCYCECETCERRATGAFANQ--KPGEELSSGYRLRETDNRINRTK 305
>gi|380026293|ref|XP_003696886.1| PREDICTED: uncharacterized protein LOC100864139 [Apis florea]
Length = 1140
Score = 254 bits (648), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 119/165 (72%), Positives = 135/165 (81%), Gaps = 2/165 (1%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++PCYRYSLEG +GA+I ATRKW K+D I+ LVGCIAEL+++EE LLH G NDFSVM+S
Sbjct: 143 IEPCYRYSLEGQKGAKICATRKWLKHDKISCLVGCIAELSEKEEAALLHPGKNDFSVMFS 202
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+FVAT RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 203 CRKNCAQLWLGPAAYINHDCRANCKFVATGRDTACVKVLRDIEVGEEITCFYGEDFFGDG 262
Query: 186 NCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRK 230
NC CEC+TCE R GAFA +P S Y LR+TD R+N+ K
Sbjct: 263 NCYCECETCERRGTGAFANQ--KPGEELSSGYRLRETDNRINRTK 305
>gi|347963046|ref|XP_566387.4| AGAP000042-PA [Anopheles gambiae str. PEST]
gi|333467385|gb|EAL41306.4| AGAP000042-PA [Anopheles gambiae str. PEST]
Length = 1865
Score = 253 bits (647), Expect = 5e-65, Method: Composition-based stats.
Identities = 118/182 (64%), Positives = 132/182 (72%), Gaps = 17/182 (9%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++ CYRYSLEG +GA+I +TR+W KN+ I LVGCIAELT+ EE LL+ G NDFSVMYS
Sbjct: 143 IEACYRYSLEGQKGAKICSTRRWMKNEKIECLVGCIAELTEREEADLLYPGRNDFSVMYS 202
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+FVAT RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 203 CRKNCAQLWLGPAAYINHDCRANCKFVATGRDTACVKVLRDIEVGEEITCFYGEDFFGDN 262
Query: 186 NCLCECDTCEIRMKGAFAQSS-----------------VEPQTTQPSRYCLRDTDMRLNK 228
NC CEC+TCE R GAFA + V RY LR+TD RLN+
Sbjct: 263 NCYCECETCERRGTGAFAARARLLEEGGSSALGGVVVGVVGTANGGVRYRLRETDNRLNR 322
Query: 229 RK 230
K
Sbjct: 323 MK 324
>gi|307206288|gb|EFN84353.1| Histone-lysine N-methyltransferase Suv4-20 [Harpegnathos saltator]
Length = 1181
Score = 253 bits (645), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 121/173 (69%), Positives = 139/173 (80%), Gaps = 3/173 (1%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++PCYRYSLE +GA+I ATRKW KN+ I+ LVGCIAEL+++EE LLH G NDFSVM+S
Sbjct: 143 IEPCYRYSLEAQKGAKICATRKWLKNEKISCLVGCIAELSEKEEAALLHPGKNDFSVMFS 202
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+FV T RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 203 CRKNCAQLWLGPAAYINHDCRANCKFVPTGRDTACVKVLRDIEVGEEITCFYGEDFFGDN 262
Query: 186 NCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLHKK-LNR 237
NC CEC+TCE R GAFA S +P S Y LR+TD R+N+ K ++ LNR
Sbjct: 263 NCYCECETCERRGTGAFA--SQKPGEELSSGYRLRETDNRINRTKHRQQPLNR 313
>gi|345487802|ref|XP_003425758.1| PREDICTED: hypothetical protein LOC100122935 [Nasonia vitripennis]
Length = 1086
Score = 253 bits (645), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 121/169 (71%), Positives = 136/169 (80%), Gaps = 3/169 (1%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
+K CYRYSLEG +GA+I ATRKW KN+ I+ LVGCIAEL+ +EE LLH G NDFSVM+S
Sbjct: 140 IKACYRYSLEGQKGAKICATRKWLKNEKISCLVGCIAELSDKEEAALLHPGKNDFSVMFS 199
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+FVAT RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 200 CRKNCAQLWLGPAAYINHDCRANCKFVATGRDTACVKVLRDIEVGEEITCFYGEDFFGDG 259
Query: 186 NCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLHKK 234
NC CEC+TCE R GAFA +P S Y LR+TD R+N+ K HK+
Sbjct: 260 NCYCECETCERRGTGAFANQ--KPGEELESGYRLRETDNRINRHK-HKQ 305
>gi|350415771|ref|XP_003490746.1| PREDICTED: hypothetical protein LOC100742553 [Bombus impatiens]
Length = 1140
Score = 252 bits (644), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 116/165 (70%), Positives = 134/165 (81%), Gaps = 2/165 (1%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++PC+RYSLEG +GA+I ATRKW K+D I+ LVGCIAEL+++EE LLH G NDFSVM+S
Sbjct: 143 IEPCFRYSLEGQKGAKICATRKWLKHDKISCLVGCIAELSEKEEAALLHPGKNDFSVMFS 202
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+FV T RDTAC+KVLRDIE GEEITCFYGEDFFGD
Sbjct: 203 CRKNCAQLWLGPAAYINHDCRANCKFVPTGRDTACIKVLRDIEVGEEITCFYGEDFFGDG 262
Query: 186 NCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRK 230
NC CEC+TCE R GAFA +P S Y LR+TD R+N+ K
Sbjct: 263 NCFCECETCERRATGAFANQ--KPGEELSSGYRLRETDNRINRMK 305
>gi|405975339|gb|EKC39912.1| hypothetical protein CGI_10007764 [Crassostrea gigas]
Length = 1358
Score = 249 bits (637), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 115/173 (66%), Positives = 136/173 (78%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
+L PC+RYS+EG GA+I AT+ W KND I LVGCIAEL+KEEE+++L TG+NDFSVMY
Sbjct: 649 KLFPCHRYSMEGRVGAKICATKHWYKNDKIPMLVGCIAELSKEEEEKVLKTGVNDFSVMY 708
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
SCRKNCAQLWLGPA++INHDCR NCRFV+T RDTACVKVLRDI+ GEEITCFYGEDFFG+
Sbjct: 709 SCRKNCAQLWLGPASFINHDCRANCRFVSTGRDTACVKVLRDIDPGEEITCFYGEDFFGE 768
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLHKKLNR 237
NCLCEC+TCE R GAF ++ Y LRDTD RL + K+ + R
Sbjct: 769 NNCLCECETCERRKTGAFKVEESPSVDSEEKGYRLRDTDDRLTRLKVDTEKTR 821
>gi|157117566|ref|XP_001658829.1| hypothetical protein AaeL_AAEL008027 [Aedes aegypti]
gi|108876000|gb|EAT40225.1| AAEL008027-PA [Aedes aegypti]
Length = 937
Score = 248 bits (634), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 119/166 (71%), Positives = 134/166 (80%), Gaps = 1/166 (0%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++ CYRYSLEG +GA+I +TRKW KN+ I LVGCIAELT++EE LL G NDFSVMYS
Sbjct: 141 IEACYRYSLEGQKGAKICSTRKWLKNEKIECLVGCIAELTEKEEDALLQPGKNDFSVMYS 200
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+FVAT RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 201 CRKNCAQLWLGPAAYINHDCRANCKFVATGRDTACVKVLRDIEIGEEITCFYGEDFFGDN 260
Query: 186 NCLCECDTCEIRMKGAFAQ-SSVEPQTTQPSRYCLRDTDMRLNKRK 230
N CEC+TCE R GAFA+ + E Q+ RY LR+TD RLN+ K
Sbjct: 261 NRYCECETCERRGTGAFAKDKNGEEQSPNGVRYRLRETDNRLNRLK 306
>gi|427784953|gb|JAA57928.1| hypothetical protein [Rhipicephalus pulchellus]
Length = 899
Score = 247 bits (631), Expect = 4e-63, Method: Composition-based stats.
Identities = 115/172 (66%), Positives = 133/172 (77%), Gaps = 2/172 (1%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ C RYS+EGN GA+I AT+KW KN+ I +LVGCIAELT+EEE +LL G NDFSVMY
Sbjct: 113 EVMRCDRYSMEGNVGAKICATKKWYKNEKIPHLVGCIAELTEEEECQLLCPGKNDFSVMY 172
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
SCRKNCAQLWLGPAA+INHDCRPNC+FV+T RD+ACVKVLRDIE+GEEITCFYGEDFFGD
Sbjct: 173 SCRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDSACVKVLRDIEEGEEITCFYGEDFFGD 232
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQPSR--YCLRDTDMRLNKRKLHKK 234
N CEC+TCE R GAF + +R Y R+TD RLN+ K K
Sbjct: 233 GNSYCECETCERRGTGAFTTKKGRQPSLMETRLAYSFRETDNRLNRWKQQVK 284
>gi|321458304|gb|EFX69374.1| hypothetical protein DAPPUDRAFT_329141 [Daphnia pulex]
Length = 674
Score = 246 bits (628), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 123/234 (52%), Positives = 155/234 (66%), Gaps = 11/234 (4%)
Query: 1 MKSIVHNLILHVPIQLACHLLNKLNPSFWAFCVQALLAQRFNIGR--ILDMGTQHHRYSY 58
+K V + I H + A L + W F Q+ N+ + +G H +
Sbjct: 61 LKQAVIDFIEHQDYEKAYKQLTSSD-YVWNFLSNKNKQQQMNVKEHIVRYLGMFHESAGF 119
Query: 59 RIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLN 118
IQ CYRYS+EG +GA++ T+KW KN+ I L+GCIAEL+ +EE +LLH G N
Sbjct: 120 TIQA------CYRYSMEGQKGAKLCTTKKWYKNEQITLLMGCIAELSPDEEAQLLHPGKN 173
Query: 119 DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYG 178
DFSVMYSCRKNCAQLWLGPAA+INHDCR NC+FVAT R TACV++LRDIE GEEITCFYG
Sbjct: 174 DFSVMYSCRKNCAQLWLGPAAFINHDCRSNCKFVATGRGTACVRILRDIEAGEEITCFYG 233
Query: 179 EDFFGDKNCLCECDTCEIRMKGAFAQS--SVEPQTTQPSRYCLRDTDMRLNKRK 230
EDFFGD NC CEC+TCE R GAFA++ + T+ RY LR+T +RL++ K
Sbjct: 234 EDFFGDNNCFCECETCERRGTGAFAKNKKNTPEAATKEERYRLRETHLRLSRVK 287
>gi|170067844|ref|XP_001868640.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167863903|gb|EDS27286.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 1031
Score = 243 bits (619), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 115/167 (68%), Positives = 134/167 (80%), Gaps = 2/167 (1%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++ CYRYSLEG +GA+I +T++W K++ I LVGCIAELT++EE LL G NDFSVMYS
Sbjct: 25 IEACYRYSLEGQKGAKICSTKRWLKHEKIECLVGCIAELTEKEEAALLQPGRNDFSVMYS 84
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+FVAT RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 85 CRKNCAQLWLGPAAYINHDCRANCKFVATGRDTACVKVLRDIEIGEEITCFYGEDFFGDN 144
Query: 186 NCLCECDTCEIRMKGAFA--QSSVEPQTTQPSRYCLRDTDMRLNKRK 230
N CEC+TCE R GAFA ++ + Q+ RY LR+TD RLN+ K
Sbjct: 145 NRYCECETCERRGTGAFAKDKNGEDQQSPNGVRYRLRETDNRLNRLK 191
>gi|198470207|ref|XP_002133391.1| GA22871 [Drosophila pseudoobscura pseudoobscura]
gi|198145341|gb|EDY72019.1| GA22871 [Drosophila pseudoobscura pseudoobscura]
Length = 1488
Score = 241 bits (614), Expect = 3e-61, Method: Composition-based stats.
Identities = 112/185 (60%), Positives = 128/185 (69%), Gaps = 20/185 (10%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++ CYRYSLE RGA+I +T++W KN+ I LVGCIAEL++ EE LLH+G NDFSVMYS
Sbjct: 257 IEACYRYSLEEQRGAKISSTKRWSKNEKIECLVGCIAELSETEEAALLHSGKNDFSVMYS 316
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+F+AT RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 317 CRKNCAQLWLGPAAYINHDCRANCKFLATGRDTACVKVLRDIEVGEEITCFYGEDFFGDS 376
Query: 186 NCLCECDTCEIRMKGAFAQSS--------------------VEPQTTQPSRYCLRDTDMR 225
N CEC+TCE R GAFA Y LR+TD R
Sbjct: 377 NRYCECETCERRGMGAFAGKHNRHFDDATGLVLGLTSGLGLTSGGAGNGGGYRLRETDNR 436
Query: 226 LNKRK 230
+N+ K
Sbjct: 437 INRIK 441
>gi|195168772|ref|XP_002025204.1| GL13360 [Drosophila persimilis]
gi|194108660|gb|EDW30703.1| GL13360 [Drosophila persimilis]
Length = 1497
Score = 240 bits (613), Expect = 4e-61, Method: Composition-based stats.
Identities = 112/185 (60%), Positives = 128/185 (69%), Gaps = 20/185 (10%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++ CYRYSLE RGA+I +T++W KN+ I LVGCIAEL++ EE LLH+G NDFSVMYS
Sbjct: 256 IEACYRYSLEEQRGAKISSTKRWSKNEKIECLVGCIAELSETEEAALLHSGKNDFSVMYS 315
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+F+AT RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 316 CRKNCAQLWLGPAAYINHDCRANCKFLATGRDTACVKVLRDIEVGEEITCFYGEDFFGDS 375
Query: 186 NCLCECDTCEIRMKGAFAQSS--------------------VEPQTTQPSRYCLRDTDMR 225
N CEC+TCE R GAFA Y LR+TD R
Sbjct: 376 NRYCECETCERRGMGAFAGKHNRHFDDATGLVLGLTSGLGLTNGGAGNGGGYRLRETDNR 435
Query: 226 LNKRK 230
+N+ K
Sbjct: 436 INRIK 440
>gi|357607272|gb|EHJ65427.1| hypothetical protein KGM_05424 [Danaus plexippus]
Length = 684
Score = 238 bits (607), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 111/165 (67%), Positives = 132/165 (80%), Gaps = 1/165 (0%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++PCYRYSLEG GA+I T+K+ K++ I +LVGCIAE+T+EEEK+LLH G NDFSVMYS
Sbjct: 123 IEPCYRYSLEGRIGAKISTTKKFFKHERIDFLVGCIAEMTEEEEKQLLHPGKNDFSVMYS 182
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCRP C F AT+R A V+VLRDIE GEEITCFYGEDFFG+
Sbjct: 183 CRKNCAQLWLGPAAYINHDCRPTCTFEATDRGKAFVRVLRDIEVGEEITCFYGEDFFGNG 242
Query: 186 NCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRK 230
NC CEC+TCE R KGAF+ + Q +RY R+TD R+N+ K
Sbjct: 243 NCYCECETCERRGKGAFSVQNAH-NDEQATRYRFRETDNRINRTK 286
>gi|195059025|ref|XP_001995547.1| GH17704 [Drosophila grimshawi]
gi|193896333|gb|EDV95199.1| GH17704 [Drosophila grimshawi]
Length = 1572
Score = 237 bits (604), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 105/138 (76%), Positives = 117/138 (84%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++ CYRYSLE RGA+I +T++W KN+ I LVGCIAELT+ EE LLH+G NDFSVMYS
Sbjct: 287 IEACYRYSLEEQRGAKISSTKRWSKNEKIECLVGCIAELTEAEETALLHSGKNDFSVMYS 346
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+F+AT RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 347 CRKNCAQLWLGPAAYINHDCRANCKFLATGRDTACVKVLRDIEVGEEITCFYGEDFFGDS 406
Query: 186 NCLCECDTCEIRMKGAFA 203
N CEC+TCE R GAFA
Sbjct: 407 NRYCECETCERRGTGAFA 424
>gi|312385974|gb|EFR30356.1| hypothetical protein AND_00112 [Anopheles darlingi]
Length = 2330
Score = 235 bits (600), Expect = 1e-59, Method: Composition-based stats.
Identities = 100/135 (74%), Positives = 115/135 (85%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++PCYRYSLEG +GA+I +TR+W +++ I LVGCIAEL++ EE LLH G NDFSVMYS
Sbjct: 152 IEPCYRYSLEGQKGAKICSTRRWLRHEKIVCLVGCIAELSEREEAMLLHPGRNDFSVMYS 211
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+FV T RDTAC++VLRDIE GEEITCFYGEDFFGD
Sbjct: 212 CRKNCAQLWLGPAAYINHDCRANCKFVPTGRDTACIEVLRDIEAGEEITCFYGEDFFGDN 271
Query: 186 NCLCECDTCEIRMKG 200
NC CEC+TCE R G
Sbjct: 272 NCYCECETCERRGTG 286
>gi|195439066|ref|XP_002067452.1| GK16179 [Drosophila willistoni]
gi|194163537|gb|EDW78438.1| GK16179 [Drosophila willistoni]
Length = 1573
Score = 234 bits (596), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 105/138 (76%), Positives = 116/138 (84%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++ CYRYSLE RGA+I +T++W KN+ I LVGCIAELT+ EE LLH G NDFSVMYS
Sbjct: 279 IEACYRYSLEEQRGAKINSTKRWSKNEKIECLVGCIAELTESEEAALLHAGKNDFSVMYS 338
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+F+AT RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 339 CRKNCAQLWLGPAAYINHDCRANCKFLATGRDTACVKVLRDIEVGEEITCFYGEDFFGDS 398
Query: 186 NCLCECDTCEIRMKGAFA 203
N CEC+TCE R GAFA
Sbjct: 399 NRYCECETCERRGTGAFA 416
>gi|390355128|ref|XP_800216.3| PREDICTED: uncharacterized protein LOC583266 [Strongylocentrotus
purpuratus]
Length = 772
Score = 231 bits (589), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 107/170 (62%), Positives = 128/170 (75%), Gaps = 1/170 (0%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+KPCYRYSLEG G +IV T W K++ I LVGCIAELT+ EE L +G NDFS+M+
Sbjct: 122 EIKPCYRYSLEGE-GGKIVTTCHWSKHEKIDNLVGCIAELTEFEENTFLKSGENDFSIMF 180
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FVAT RDTACV++LRDIE EEITC+YGEDFFG+
Sbjct: 181 SMRKNCAQLWLGPAAFINHDCRPNCKFVATGRDTACVQILRDIEPEEEITCYYGEDFFGE 240
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLHKK 234
NC CEC+TCE R GAF + + +Y LRDT+ R+ + K KK
Sbjct: 241 NNCYCECETCERRQSGAFTPKDSPDKKKETPKYSLRDTNKRIMRLKKQKK 290
>gi|291224457|ref|XP_002732220.1| PREDICTED: Suv4-20-like [Saccoglossus kowalevskii]
Length = 930
Score = 230 bits (587), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 109/167 (65%), Positives = 133/167 (79%), Gaps = 2/167 (1%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+K C RYSLEG +GA+IV+T++WEKN+ I+ LVGCI+ELT EE +LL G NDFSVMY
Sbjct: 119 EVKSCCRYSLEG-QGAKIVSTKQWEKNEKISLLVGCISELTPLEENQLLRAGENDFSVMY 177
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPA++INHDCRPNC+FV+T RDTACV+VLRDIE EEITCFYG+ FFG+
Sbjct: 178 STRKNCAQLWLGPASFINHDCRPNCKFVSTGRDTACVQVLRDIEPEEEITCFYGDGFFGE 237
Query: 185 KNCLCECDTCEIRMKGAFA-QSSVEPQTTQPSRYCLRDTDMRLNKRK 230
NC CEC+TCE R GAF Q+ E + + Y LR+TD RL++ K
Sbjct: 238 DNCYCECETCERRKTGAFTPQNPSEKDVLKETSYRLRETDNRLSRMK 284
>gi|260827234|ref|XP_002608570.1| hypothetical protein BRAFLDRAFT_235988 [Branchiostoma floridae]
gi|229293921|gb|EEN64580.1| hypothetical protein BRAFLDRAFT_235988 [Branchiostoma floridae]
Length = 312
Score = 229 bits (585), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 116/203 (57%), Positives = 141/203 (69%), Gaps = 12/203 (5%)
Query: 37 LAQRFNIGRILDMGTQHHRY---SYRIQYLP--------ELKPCYRYSLEGNRGARIVAT 85
L Q I + L TQH + S+ ++YL E+ PC RYSLE GA+I+AT
Sbjct: 108 LTQGEWIWKYLQHKTQHQKKIFKSHVVRYLHMFHPDAGFEVVPCDRYSLE-KHGAKILAT 166
Query: 86 RKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDC 145
+ W KN+ I LVGCIAE+T EEE +LL G+NDFSVM+S RKNCAQLWLGPAA+INHDC
Sbjct: 167 KAWTKNEKIPMLVGCIAEITAEEENQLLQPGVNDFSVMFSTRKNCAQLWLGPAAFINHDC 226
Query: 146 RPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQS 205
R NC+FV+T RDTACVKVLRDIE GEEITC+YGE FFGD N CEC+TCE R GAFA
Sbjct: 227 RSNCKFVSTGRDTACVKVLRDIEAGEEITCYYGEGFFGDDNSYCECETCERRGTGAFASR 286
Query: 206 SVEPQTTQPSRYCLRDTDMRLNK 228
+ ++Y LR+TD R+ +
Sbjct: 287 EGNKEKVHNTKYGLRETDKRIQR 309
>gi|410974658|ref|XP_003993760.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1 [Felis
catus]
Length = 880
Score = 229 bits (585), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 116/206 (56%), Positives = 143/206 (69%), Gaps = 9/206 (4%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSS--VEPQTTQPSRYCLRDTDMRLNKRKLHKKLNRLLLAS 242
N CEC TCE R GAF + P S+Y LR+TD RLN +L +L +S
Sbjct: 315 NNEFCECYTCERRGTGAFKSRAGLPAPAPVINSKYGLRETDKRLN------RLKKLGDSS 368
Query: 243 DKNDTNSSDNSSKYSLVQWNNNSKEN 268
+D+ S +++ Q NN+ N
Sbjct: 369 KNSDSQSVSSNTDADTTQEKNNATPN 394
>gi|344295780|ref|XP_003419589.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1 [Loxodonta
africana]
Length = 883
Score = 229 bits (583), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 116/206 (56%), Positives = 143/206 (69%), Gaps = 9/206 (4%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSS--VEPQTTQPSRYCLRDTDMRLNKRKLHKKLNRLLLAS 242
N CEC TCE R GAF + P S+Y LR+TD RLN +L +L +S
Sbjct: 315 NNEFCECYTCERRGTGAFKSRAGLPAPAPVINSKYGLRETDKRLN------RLKKLGDSS 368
Query: 243 DKNDTNSSDNSSKYSLVQWNNNSKEN 268
+D+ S +++ Q NN+ N
Sbjct: 369 KNSDSQSVSSNTDADTTQEKNNATSN 394
>gi|335281381|ref|XP_003122479.2| PREDICTED: histone-lysine N-methyltransferase SUV420H1-like isoform
1 [Sus scrofa]
Length = 888
Score = 229 bits (583), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 118/207 (57%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 195 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 253
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 254 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 313
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 314 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 366
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q NN+ N
Sbjct: 367 SKNSDSQSVSSNTDADTTQEKNNATSN 393
>gi|426252140|ref|XP_004019776.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1 [Ovis aries]
Length = 883
Score = 229 bits (583), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 118/207 (57%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 315 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 367
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q NN+ N
Sbjct: 368 SKNSDSQSVSSNTDADTTQEKNNTTSN 394
>gi|73982734|ref|XP_851524.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1 isoform 2
[Canis lupus familiaris]
Length = 885
Score = 229 bits (583), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 118/207 (57%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 315 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 367
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q NN+ N
Sbjct: 368 SKNSDSQSVSSNTDADTTQEKNNATPN 394
>gi|351709796|gb|EHB12715.1| Histone-lysine N-methyltransferase SUV420H1 [Heterocephalus glaber]
Length = 879
Score = 229 bits (583), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 118/207 (57%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 315 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 367
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q NN+ N
Sbjct: 368 SKNSDSQSVSSNTDADTTQEKNNATSN 394
>gi|198418355|ref|XP_002128107.1| PREDICTED: similar to suppressor of variegation 4-20 homolog 1
(Drosophila) [Ciona intestinalis]
Length = 597
Score = 229 bits (583), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 117/214 (54%), Positives = 152/214 (71%), Gaps = 8/214 (3%)
Query: 52 QHHRYSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
++H + Y + PE ++PC RY+LE + GA+IV+TR W KND + LVGCIAELTK+
Sbjct: 110 KNHVFRYLGMFDPECGFSIQPCNRYTLE-DGGAKIVSTRDWCKNDKMNLLVGCIAELTKD 168
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDI 167
+E LL G NDFSVM+S RKNCAQLWLGPA++INHDCRPNC FV+T RDTACV+VLRDI
Sbjct: 169 DESSLLVPGRNDFSVMFSTRKNCAQLWLGPASFINHDCRPNCCFVSTGRDTACVRVLRDI 228
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQ---SSVEPQTTQPSRYCLRDTDM 224
GEEITCFYGE FFGDKNC CEC+TCE R GA+ S + T ++Y LR+TD
Sbjct: 229 LPGEEITCFYGESFFGDKNCFCECETCERRKTGAYRPKDGSKSDVPMTFSNKYGLRETDK 288
Query: 225 RLNKRKLHKKLNRLLLASDKNDTNSSDNSSKYSL 258
RL++ + +K N+L + + S + S++S+
Sbjct: 289 RLHRLQKERKSNKLPSSHKLVKRDPSPSCSEFSM 322
>gi|335307894|ref|XP_003361020.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1-like [Sus
scrofa]
Length = 888
Score = 229 bits (583), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 118/207 (57%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 195 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 253
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 254 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 313
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 314 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 366
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q NN+ N
Sbjct: 367 SKNSDSQSVSSNTDADTTQEKNNATSN 393
>gi|332278247|sp|Q4FZB7.4|SV421_HUMAN RecName: Full=Histone-lysine N-methyltransferase SUV420H1; AltName:
Full=Lysine N-methyltransferase 5B; AltName:
Full=Suppressor of variegation 4-20 homolog 1;
Short=Su(var)4-20 homolog 1; Short=Suv4-20h1
Length = 885
Score = 229 bits (583), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 118/207 (57%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 315 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 367
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q NN+ N
Sbjct: 368 SKNSDSQSVSSNTDADTTQEKNNATSN 394
>gi|296218974|ref|XP_002755674.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1 [Callithrix
jacchus]
Length = 883
Score = 229 bits (583), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 118/207 (57%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 315 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 367
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q NN+ N
Sbjct: 368 SKNSDSQSVSSNTDADTTQEQNNATSN 394
>gi|50659082|ref|NP_060105.3| histone-lysine N-methyltransferase SUV420H1 isoform 1 [Homo
sapiens]
gi|71043475|gb|AAH99714.1| Suppressor of variegation 4-20 homolog 1 (Drosophila) [Homo
sapiens]
gi|73695394|gb|AAI03499.1| Suppressor of variegation 4-20 homolog 1, isoform 1 [Homo sapiens]
gi|119595105|gb|EAW74699.1| suppressor of variegation 4-20 homolog 1 (Drosophila), isoform
CRA_c [Homo sapiens]
gi|168277680|dbj|BAG10818.1| histone-lysine N-methyltransferase SUV420H1 [synthetic construct]
Length = 885
Score = 228 bits (582), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 118/207 (57%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 315 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 367
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q NN+ N
Sbjct: 368 SKNSDSQSVSSNTDADTTQEKNNATSN 394
>gi|403301082|ref|XP_003941228.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1 [Saimiri
boliviensis boliviensis]
Length = 882
Score = 228 bits (582), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 118/207 (57%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 315 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 367
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q NN+ N
Sbjct: 368 SKNSDSQSVSSNTDADTTQEKNNATSN 394
>gi|395544582|ref|XP_003774187.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1 [Sarcophilus
harrisii]
Length = 899
Score = 228 bits (582), Expect = 1e-57, Method: Composition-based stats.
Identities = 117/207 (56%), Positives = 144/207 (69%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V T P S+Y LR+TD RLN +L +L
Sbjct: 315 NNEYCECYTCERRGTGAF-KSRVGLPTPTPVINSKYGLRETDKRLN------RLKKLGDG 367
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q ++ N
Sbjct: 368 SKNSDSQSVSSNTDADTTQEKADAAAN 394
>gi|332249714|ref|XP_003274002.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1 [Nomascus
leucogenys]
Length = 885
Score = 228 bits (582), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 118/207 (57%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 315 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 367
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q NN+ N
Sbjct: 368 SKNSDSQSVSSNTDADTTQEKNNATSN 394
>gi|386869558|ref|NP_001247722.1| histone-lysine N-methyltransferase SUV420H1 [Macaca mulatta]
gi|402892592|ref|XP_003909494.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1 [Papio
anubis]
gi|355566227|gb|EHH22606.1| Histone-lysine N-methyltransferase SUV420H1 [Macaca mulatta]
gi|355751897|gb|EHH56017.1| Histone-lysine N-methyltransferase SUV420H1 [Macaca fascicularis]
gi|380813342|gb|AFE78545.1| histone-lysine N-methyltransferase SUV420H1 isoform 1 [Macaca
mulatta]
gi|383410283|gb|AFH28355.1| histone-lysine N-methyltransferase SUV420H1 isoform 1 [Macaca
mulatta]
gi|384941870|gb|AFI34540.1| histone-lysine N-methyltransferase SUV420H1 isoform 1 [Macaca
mulatta]
Length = 885
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 118/207 (57%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 315 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 367
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q NN+ N
Sbjct: 368 SKNSDSQSVSSNTDADTTQEKNNATSN 394
>gi|426369487|ref|XP_004051719.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1 [Gorilla
gorilla gorilla]
Length = 860
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 118/207 (57%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 171 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 229
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 230 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 289
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 290 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 342
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q NN+ N
Sbjct: 343 SKNSDSQSVSSNTDADTTQEKNNATSN 369
>gi|395851619|ref|XP_003798350.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1 [Otolemur
garnettii]
Length = 880
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 118/207 (57%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 315 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 367
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q NN+ N
Sbjct: 368 SKNSDSQSVSSNTDADTTQEKNNATSN 394
>gi|301624805|ref|XP_002941687.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1-A [Xenopus
(Silurana) tropicalis]
Length = 844
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 118/204 (57%), Positives = 144/204 (70%), Gaps = 11/204 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAELT+ EE LL G NDFSVMY
Sbjct: 172 EILPCNRYSSERN-GAKIVATKEWKRNDKIELLVGCIAELTEAEENMLLRHGENDFSVMY 230
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 231 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 290
Query: 185 KNCLCECDTCEIRMKGAFAQSSV---EPQTTQPSRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V EP S+Y LR+TD RLN +L +L +
Sbjct: 291 NNEYCECYTCERRATGAF-KSRVGLNEPGPVINSKYGLRETDKRLN------RLKKLGDS 343
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNS 265
S +D+ S +++ Q + S
Sbjct: 344 SKNSDSQSVSSNTDADTSQEKDTS 367
>gi|301771874|ref|XP_002921341.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SUV420H1-like [Ailuropoda melanoleuca]
Length = 881
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 118/207 (57%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 315 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 367
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q NN+ N
Sbjct: 368 SKNSDSQSVSSNTDADTTQEKNNATPN 394
>gi|195469551|ref|XP_002099701.1| GE16574 [Drosophila yakuba]
gi|194187225|gb|EDX00809.1| GE16574 [Drosophila yakuba]
Length = 1330
Score = 228 bits (582), Expect = 2e-57, Method: Composition-based stats.
Identities = 115/178 (64%), Positives = 132/178 (74%), Gaps = 13/178 (7%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++ CYRYSLE RGA+I +T++W KND I LVGCIAELT+ EE LLH+G NDFSVMYS
Sbjct: 252 IEACYRYSLEEQRGAKISSTKRWSKNDKIECLVGCIAELTEAEEAALLHSGKNDFSVMYS 311
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+F+AT RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 312 CRKNCAQLWLGPAAYINHDCRANCKFLATGRDTACVKVLRDIEVGEEITCFYGEDFFGDS 371
Query: 186 NCLCECDTCEIRMKGAFA----------QSSVEPQTTQPSR---YCLRDTDMRLNKRK 230
N CEC+TCE R GAFA + ++ P Y LR+TD R+N+ K
Sbjct: 372 NRYCECETCERRGTGAFAGKDDGLMLGLSMGLGLASSGPGNNGGYRLRETDNRINRIK 429
>gi|397517177|ref|XP_003828795.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1 [Pan
paniscus]
gi|410223832|gb|JAA09135.1| suppressor of variegation 4-20 homolog 1 [Pan troglodytes]
gi|410251048|gb|JAA13491.1| suppressor of variegation 4-20 homolog 1 [Pan troglodytes]
gi|410306186|gb|JAA31693.1| suppressor of variegation 4-20 homolog 1 [Pan troglodytes]
gi|410352267|gb|JAA42737.1| suppressor of variegation 4-20 homolog 1 [Pan troglodytes]
Length = 885
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 118/207 (57%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 315 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 367
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q NN+ N
Sbjct: 368 SKNSDSQSVSSNTDADTTQEKNNATSN 394
>gi|297687986|ref|XP_002821477.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1 [Pongo
abelii]
Length = 885
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 118/207 (57%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 315 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 367
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q NN+ N
Sbjct: 368 SKNSDSQSVSSNTDADTTQEKNNATSN 394
>gi|114638902|ref|XP_001173525.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1 isoform 6
[Pan troglodytes]
Length = 885
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 118/207 (57%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 315 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 367
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q NN+ N
Sbjct: 368 SKNSDSQSVSSNTDADTTQEKNNATSN 394
>gi|348565143|ref|XP_003468363.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1-like isoform
2 [Cavia porcellus]
Length = 859
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 118/207 (57%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 173 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 231
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 232 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 291
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 292 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 344
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q NN+ N
Sbjct: 345 SKNSDSQSVSSNTDADTTQEKNNATSN 371
>gi|348565141|ref|XP_003468362.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1-like isoform
1 [Cavia porcellus]
Length = 882
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 118/207 (57%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 315 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 367
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q NN+ N
Sbjct: 368 SKNSDSQSVSSNTDADTTQEKNNATSN 394
>gi|410912598|ref|XP_003969776.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1-like
[Takifugu rubripes]
Length = 843
Score = 228 bits (581), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 112/169 (66%), Positives = 130/169 (76%), Gaps = 5/169 (2%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT+ W++ND I YLVGCIAEL++ EE LL G NDFSVMY
Sbjct: 174 EILPCNRYSSEQN-GAKIVATKAWKRNDKIEYLVGCIAELSESEENMLLRHGENDFSVMY 232
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVKVLRDIE GEEI+C+YG+ FFG+
Sbjct: 233 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKVLRDIEPGEEISCYYGDGFFGE 292
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRK 230
N CEC TCE R GAF +S Q P S+Y LR+TD RLN+ K
Sbjct: 293 NNEFCECYTCERRGTGAF-KSKAGLQVEVPVINSKYGLRETDKRLNRLK 340
>gi|348542104|ref|XP_003458526.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1-like
[Oreochromis niloticus]
Length = 867
Score = 228 bits (581), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 112/175 (64%), Positives = 132/175 (75%), Gaps = 3/175 (1%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I YLVGCIAEL++ EE LL G NDFSVMY
Sbjct: 216 EILPCNRYSSEQN-GAKIVATKEWKRNDKIEYLVGCIAELSESEENLLLRHGENDFSVMY 274
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVKVLRDIE GEEI+C+YG+ FFG+
Sbjct: 275 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKVLRDIEPGEEISCYYGDGFFGE 334
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP--SRYCLRDTDMRLNKRKLHKKLNR 237
N CEC TCE R GAF + P S+Y LR+TD RLN+ K + NR
Sbjct: 335 NNEFCECYTCERRGTGAFKSKAGLPVEVPVINSKYGLRETDKRLNRLKKLGEGNR 389
>gi|149725495|ref|XP_001498718.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1 isoform 1
[Equus caballus]
Length = 877
Score = 228 bits (581), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 117/204 (57%), Positives = 144/204 (70%), Gaps = 11/204 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 315 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 367
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNS 265
S +D+ S +++ Q NN+
Sbjct: 368 SKNSDSQSVSSNTDADTTQEKNNA 391
>gi|432851152|ref|XP_004066881.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1-like
[Oryzias latipes]
Length = 837
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 117/202 (57%), Positives = 142/202 (70%), Gaps = 8/202 (3%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I +LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 172 EILPCNRYSSEQN-GAKIVATKEWKRNDKIEFLVGCIAELSESEENMLLRHGENDFSVMY 230
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVKVLRDIE GEEI+C+YG+ FFG+
Sbjct: 231 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKVLRDIEPGEEISCYYGDGFFGE 290
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF + P P S+Y LR+TD RLN+ K + NR +
Sbjct: 291 SNEFCECYTCERRGTGAFKSKAGLP-VEAPVINSKYGLRETDKRLNRLKKLGEGNR---S 346
Query: 242 SDKNDTNSSDNSSKYSLVQWNN 263
SD + S + + Q N
Sbjct: 347 SDSHSVGSHTDLDSQEISQKEN 368
>gi|417405104|gb|JAA49277.1| Putative histone tail methylase [Desmodus rotundus]
Length = 883
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 117/204 (57%), Positives = 144/204 (70%), Gaps = 11/204 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 315 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 367
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNS 265
S +D+ S +++ Q NN+
Sbjct: 368 SKNSDSQSVSSNTDADTTQEKNNA 391
>gi|115530374|emb|CAL49379.1| novel protein similar to suv420h1 [Xenopus (Silurana) tropicalis]
Length = 645
Score = 227 bits (579), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 118/204 (57%), Positives = 144/204 (70%), Gaps = 11/204 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAELT+ EE LL G NDFSVMY
Sbjct: 172 EILPCNRYSSERN-GAKIVATKEWKRNDKIELLVGCIAELTEAEENMLLRHGENDFSVMY 230
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 231 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 290
Query: 185 KNCLCECDTCEIRMKGAFAQSSV---EPQTTQPSRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V EP S+Y LR+TD RLN +L +L +
Sbjct: 291 NNEYCECYTCERRATGAF-KSRVGLNEPGPVINSKYGLRETDKRLN------RLKKLGDS 343
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNS 265
S +D+ S +++ Q + S
Sbjct: 344 SKNSDSQSVSSNTDADTSQEKDTS 367
>gi|46560161|gb|AAT00539.1| Suv4-20h1 [Mus musculus]
Length = 874
Score = 227 bits (578), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 117/207 (56%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 188 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 246
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 247 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 306
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 307 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 359
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q +N+ N
Sbjct: 360 SKNSDSQSVSSNTDADTTQEKDNATSN 386
>gi|291414233|ref|XP_002723365.1| PREDICTED: suppressor of variegation 4-20 homolog 1 isoform 1
[Oryctolagus cuniculus]
Length = 873
Score = 227 bits (578), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 117/207 (56%), Positives = 144/207 (69%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 195 EILPCNRYSSERN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 253
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 254 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 313
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L
Sbjct: 314 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDG 366
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q +N+ N
Sbjct: 367 SKHSDSQSVSSNTDADTAQEKSNATSN 393
>gi|440894216|gb|ELR46722.1| Histone-lysine N-methyltransferase SUV420H1 [Bos grunniens mutus]
Length = 866
Score = 227 bits (578), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 117/207 (56%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 315 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 367
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q N++ N
Sbjct: 368 SKNSDSQSVSSNTDADTTQGRNDATSN 394
>gi|268607619|ref|NP_001161357.1| histone-lysine N-methyltransferase SUV420H1 isoform a [Mus
musculus]
gi|268607625|ref|NP_001161359.1| histone-lysine N-methyltransferase SUV420H1 isoform a [Mus
musculus]
gi|143586443|sp|Q3U8K7.2|SUV41_MOUSE RecName: Full=Histone-lysine N-methyltransferase SUV420H1; AltName:
Full=Suppressor of variegation 4-20 homolog 1;
Short=Su(var)4-20 homolog 1; Short=Suv4-20h1
gi|148701008|gb|EDL32955.1| suppressor of variegation 4-20 homolog 1 (Drosophila), isoform
CRA_d [Mus musculus]
Length = 883
Score = 227 bits (578), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 117/207 (56%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 197 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 255
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 256 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 315
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 316 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 368
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q +N+ N
Sbjct: 369 SKNSDSQSVSSNTDADTTQEKDNATSN 395
>gi|159884215|gb|ABX00786.1| RE62653p [Drosophila melanogaster]
Length = 1121
Score = 227 bits (578), Expect = 5e-57, Method: Composition-based stats.
Identities = 114/178 (64%), Positives = 132/178 (74%), Gaps = 13/178 (7%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++ CYRY+LE RGA+I +T++W KND I LVGCIAELT+ EE LLH+G NDFSVMYS
Sbjct: 75 IEACYRYTLEEQRGAKISSTKRWSKNDKIECLVGCIAELTEAEEAALLHSGRNDFSVMYS 134
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+F+AT RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 135 CRKNCAQLWLGPAAYINHDCRANCKFLATGRDTACVKVLRDIEVGEEITCFYGEDFFGDS 194
Query: 186 NCLCECDTCEIRMKGAFA----------QSSVEPQTTQPSR---YCLRDTDMRLNKRK 230
N CEC+TCE R GAFA + ++ P Y LR+TD R+N+ K
Sbjct: 195 NRYCECETCERRGTGAFAGKDDGLMLGLSMGLGLASSGPGNNGGYRLRETDNRINRIK 252
>gi|268607615|ref|NP_659120.3| histone-lysine N-methyltransferase SUV420H1 isoform b [Mus
musculus]
gi|268607623|ref|NP_001161358.1| histone-lysine N-methyltransferase SUV420H1 isoform b [Mus
musculus]
gi|148701005|gb|EDL32952.1| suppressor of variegation 4-20 homolog 1 (Drosophila), isoform
CRA_a [Mus musculus]
Length = 860
Score = 227 bits (578), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 117/207 (56%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 174 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 232
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 233 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 292
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 293 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 345
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q +N+ N
Sbjct: 346 SKNSDSQSVSSNTDADTTQEKDNATSN 372
>gi|291414235|ref|XP_002723366.1| PREDICTED: suppressor of variegation 4-20 homolog 1 isoform 2
[Oryctolagus cuniculus]
Length = 850
Score = 226 bits (577), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 117/207 (56%), Positives = 144/207 (69%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 172 EILPCNRYSSERN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 230
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 231 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 290
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L
Sbjct: 291 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDG 343
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q +N+ N
Sbjct: 344 SKHSDSQSVSSNTDADTAQEKSNATSN 370
>gi|20128811|ref|NP_569853.1| histone methyltransferase 4-20, isoform A [Drosophila melanogaster]
gi|386763550|ref|NP_001245453.1| histone methyltransferase 4-20, isoform B [Drosophila melanogaster]
gi|74872836|sp|Q9W5E0.1|SUV42_DROME RecName: Full=Histone-lysine N-methyltransferase Suv4-20; AltName:
Full=Suppressor of variegation 4-20; Short=Su(var)4-20
gi|7290071|gb|AAF45537.1| histone methyltransferase 4-20, isoform A [Drosophila melanogaster]
gi|383293123|gb|AFH07169.1| histone methyltransferase 4-20, isoform B [Drosophila melanogaster]
Length = 1300
Score = 226 bits (577), Expect = 6e-57, Method: Composition-based stats.
Identities = 114/178 (64%), Positives = 132/178 (74%), Gaps = 13/178 (7%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++ CYRY+LE RGA+I +T++W KND I LVGCIAELT+ EE LLH+G NDFSVMYS
Sbjct: 254 IEACYRYTLEEQRGAKISSTKRWSKNDKIECLVGCIAELTEAEEAALLHSGKNDFSVMYS 313
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+F+AT RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 314 CRKNCAQLWLGPAAYINHDCRANCKFLATGRDTACVKVLRDIEVGEEITCFYGEDFFGDS 373
Query: 186 NCLCECDTCEIRMKGAFA----------QSSVEPQTTQPSR---YCLRDTDMRLNKRK 230
N CEC+TCE R GAFA + ++ P Y LR+TD R+N+ K
Sbjct: 374 NRYCECETCERRGTGAFAGKDDGLMLGLSMGLGLASSGPGNNGGYRLRETDNRINRIK 431
>gi|326920217|ref|XP_003206371.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1-like isoform
2 [Meleagris gallopavo]
Length = 880
Score = 226 bits (577), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 116/206 (56%), Positives = 142/206 (68%), Gaps = 9/206 (4%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 195 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEMEENMLLRHGENDFSVMY 253
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 254 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 313
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQ--PSRYCLRDTDMRLNKRKLHKKLNRLLLAS 242
N CEC TCE R GAF P T S+Y LR+TD RLN +L +L +S
Sbjct: 314 NNEYCECYTCERRGTGAFKSRVGLPAPTPVINSKYGLRETDKRLN------RLKKLGDSS 367
Query: 243 DKNDTNSSDNSSKYSLVQWNNNSKEN 268
+D+ S +++ Q N+ N
Sbjct: 368 KNSDSQSVSSNTDADTTQEKANATSN 393
>gi|26350355|dbj|BAC38817.1| unnamed protein product [Mus musculus]
Length = 860
Score = 226 bits (577), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 117/207 (56%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 174 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 232
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 233 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 292
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 293 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 345
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q +N+ N
Sbjct: 346 SKNSDSQSVSSNTDADTTQEKDNATSN 372
>gi|224050969|ref|XP_002199524.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1 [Taeniopygia
guttata]
Length = 853
Score = 226 bits (577), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 116/211 (54%), Positives = 143/211 (67%), Gaps = 9/211 (4%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 171 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEMEENMLLRHGENDFSVMY 229
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 230 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 289
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQ--PSRYCLRDTDMRLNKRKLHKKLNRLLLAS 242
N CEC TCE R GAF P T S+Y LR+TD RLN +L +L +S
Sbjct: 290 NNEYCECYTCERRGTGAFKSRVGLPAPTPVINSKYGLRETDKRLN------RLKKLGDSS 343
Query: 243 DKNDTNSSDNSSKYSLVQWNNNSKENIFDLG 273
+D+ S +++ Q N+ +G
Sbjct: 344 KNSDSQSVSSNTDADTTQEKGNATNRKSSIG 374
>gi|355722719|gb|AES07664.1| suppressor of variegation 4-20-like protein 1 [Mustela putorius
furo]
Length = 413
Score = 226 bits (577), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 118/207 (57%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 71 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 129
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 130 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 189
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 190 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 242
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q NN+ N
Sbjct: 243 SKNSDSQSVSSNTDADTTQEKNNATPN 269
>gi|74226530|dbj|BAE23934.1| unnamed protein product [Mus musculus]
Length = 481
Score = 226 bits (577), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 117/207 (56%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 188 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 246
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 247 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 306
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 307 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 359
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q +N+ N
Sbjct: 360 SKNSDSQSVSSNTDADTTQEKDNATSN 386
>gi|148238277|ref|NP_001085777.1| histone-lysine N-methyltransferase SUV420H1-A [Xenopus laevis]
gi|82184341|sp|Q6GP17.1|SU41A_XENLA RecName: Full=Histone-lysine N-methyltransferase SUV420H1-A;
AltName: Full=Suppressor of variegation 4-20 homolog
1-A; Short=Su(var)4-20 homolog 1-A; Short=Suv4-20h1-A
gi|49118323|gb|AAH73331.1| MGC80741 protein [Xenopus laevis]
Length = 855
Score = 226 bits (577), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 111/169 (65%), Positives = 130/169 (76%), Gaps = 5/169 (2%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 172 EILPCNRYSSERN-GAKIVATKEWKRNDKIELLVGCIAELSEAEENMLLRHGENDFSVMY 230
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 231 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 290
Query: 185 KNCLCECDTCEIRMKGAFAQSSV---EPQTTQPSRYCLRDTDMRLNKRK 230
N CEC TCE R GAF +S V EP S+Y LR+TD RLN+ K
Sbjct: 291 NNEFCECYTCERRATGAF-KSRVGLNEPGPLINSKYGLRETDKRLNRLK 338
>gi|49898826|gb|AAH75709.1| Suv420h1 protein [Mus musculus]
Length = 817
Score = 226 bits (576), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 117/207 (56%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 174 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 232
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 233 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 292
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 293 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 345
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q +N+ N
Sbjct: 346 SKNSDSQSVSSNTDADTTQEKDNATSN 372
>gi|326920215|ref|XP_003206370.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1-like isoform
1 [Meleagris gallopavo]
Length = 848
Score = 226 bits (576), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 116/211 (54%), Positives = 143/211 (67%), Gaps = 9/211 (4%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 164 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEMEENMLLRHGENDFSVMY 222
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 223 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 282
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQ--PSRYCLRDTDMRLNKRKLHKKLNRLLLAS 242
N CEC TCE R GAF P T S+Y LR+TD RLN +L +L +S
Sbjct: 283 NNEYCECYTCERRGTGAFKSRVGLPAPTPVINSKYGLRETDKRLN------RLKKLGDSS 336
Query: 243 DKNDTNSSDNSSKYSLVQWNNNSKENIFDLG 273
+D+ S +++ Q N+ +G
Sbjct: 337 KNSDSQSVSSNTDADTTQEKANATNRKSSIG 367
>gi|256000841|gb|ACU51761.1| MIP06250p [Drosophila melanogaster]
Length = 1328
Score = 226 bits (576), Expect = 9e-57, Method: Composition-based stats.
Identities = 114/178 (64%), Positives = 132/178 (74%), Gaps = 13/178 (7%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++ CYRY+LE RGA+I +T++W KND I LVGCIAELT+ EE LLH+G NDFSVMYS
Sbjct: 282 IEACYRYTLEEQRGAKISSTKRWSKNDKIECLVGCIAELTEAEEAALLHSGKNDFSVMYS 341
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+F+AT RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 342 CRKNCAQLWLGPAAYINHDCRANCKFLATGRDTACVKVLRDIEVGEEITCFYGEDFFGDS 401
Query: 186 NCLCECDTCEIRMKGAFA----------QSSVEPQTTQPSR---YCLRDTDMRLNKRK 230
N CEC+TCE R GAFA + ++ P Y LR+TD R+N+ K
Sbjct: 402 NRYCECETCERRGTGAFAGKGDGLMLGLSMGLGLASSGPGNNGGYRLRETDNRINRIK 459
>gi|374253865|gb|AEZ00750.1| FI18405p1 [Drosophila melanogaster]
Length = 1328
Score = 226 bits (575), Expect = 1e-56, Method: Composition-based stats.
Identities = 114/178 (64%), Positives = 132/178 (74%), Gaps = 13/178 (7%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++ CYRY+LE RGA+I +T++W KND I LVGCIAELT+ EE LLH+G NDFSVMYS
Sbjct: 282 IEACYRYTLEEQRGAKISSTKRWSKNDKIECLVGCIAELTEAEEAALLHSGKNDFSVMYS 341
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+F+AT RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 342 CRKNCAQLWLGPAAYINHDCRANCKFLATGRDTACVKVLRDIEVGEEITCFYGEDFFGDS 401
Query: 186 NCLCECDTCEIRMKGAFA----------QSSVEPQTTQPSR---YCLRDTDMRLNKRK 230
N CEC+TCE R GAFA + ++ P Y LR+TD R+N+ K
Sbjct: 402 NRYCECETCERRGTGAFAGKDDGLMLGLSMGLGLASSGPGNNGGYRLRETDNRINRIK 459
>gi|363734592|ref|XP_422910.3| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SUV420H1 [Gallus gallus]
Length = 848
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 116/211 (54%), Positives = 143/211 (67%), Gaps = 9/211 (4%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 164 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEMEENMLLRHGENDFSVMY 222
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 223 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 282
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQ--PSRYCLRDTDMRLNKRKLHKKLNRLLLAS 242
N CEC TCE R GAF P T S+Y LR+TD RLN +L +L +S
Sbjct: 283 NNEYCECYTCERRGTGAFKSRVGLPAPTPVINSKYGLRETDKRLN------RLKKLGDSS 336
Query: 243 DKNDTNSSDNSSKYSLVQWNNNSKENIFDLG 273
+D+ S +++ Q N+ +G
Sbjct: 337 KNSDSQSVSSNTDADTTQEKANATNRKSSIG 367
>gi|148229543|ref|NP_001088653.1| histone-lysine N-methyltransferase SUV420H1-B [Xenopus laevis]
gi|82179806|sp|Q5RJX8.1|SU41B_XENLA RecName: Full=Histone-lysine N-methyltransferase SUV420H1-B;
AltName: Full=Suppressor of variegation 4-20 homolog
1-B; Short=Su(var)4-20 homolog 1-B; Short=Suv4-20h1-B
gi|55778134|gb|AAH86459.1| LOC495826 protein [Xenopus laevis]
Length = 785
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 111/169 (65%), Positives = 129/169 (76%), Gaps = 5/169 (2%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT+ W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 102 EILPCCRYSSERN-GAKIVATKDWKRNDKIELLVGCIAELSEAEENMLLRHGENDFSVMY 160
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 161 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 220
Query: 185 KNCLCECDTCEIRMKGAFAQSSV---EPQTTQPSRYCLRDTDMRLNKRK 230
N CEC TCE R GAF +S V EP S+Y LR+TD RLN+ K
Sbjct: 221 NNEFCECYTCERRATGAF-KSRVGLNEPGPVINSKYGLRETDKRLNRLK 268
>gi|391342673|ref|XP_003745640.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1-like
[Metaseiulus occidentalis]
Length = 648
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 111/176 (63%), Positives = 126/176 (71%), Gaps = 12/176 (6%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS EG+ GAR+ ATR W KND I LVGCIAEL+ EE+++L G NDFSVMY
Sbjct: 117 EILPCDRYSTEGHMGARLCATRDWRKNDTIRNLVGCIAELSPIEERQILVAGRNDFSVMY 176
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCR NCRFV+T R +ACVK LRDI GEEITCFYGEDFFGD
Sbjct: 177 STRKNCAQLWLGPAAFINHDCRANCRFVSTGRGSACVKALRDIGIGEEITCFYGEDFFGD 236
Query: 185 KNCLCECDTCEIRMKGAF------AQSSVEPQTTQPS------RYCLRDTDMRLNK 228
NC CEC TCE R GAF AQS V ++ S +Y R+TD RL +
Sbjct: 237 SNCFCECHTCERRGSGAFRVDASDAQSDVGDSSSSQSEVDGARKYSFRETDNRLRR 292
>gi|432091033|gb|ELK24245.1| Histone-lysine N-methyltransferase SUV420H1 [Myotis davidii]
Length = 874
Score = 225 bits (574), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 116/204 (56%), Positives = 144/204 (70%), Gaps = 11/204 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 187 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 245
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 246 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 305
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 306 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 358
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNS 265
S +D+ S +++ Q +N+
Sbjct: 359 SKNSDSQSVSSNTDADTTQEKHNA 382
>gi|387018894|gb|AFJ51565.1| Histone-lysine N-methyltransferase SUV420H1-like [Crotalus
adamanteus]
Length = 860
Score = 225 bits (574), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 116/211 (54%), Positives = 143/211 (67%), Gaps = 9/211 (4%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 173 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSELEENMLLRHGENDFSVMY 231
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 232 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 291
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQ--PSRYCLRDTDMRLNKRKLHKKLNRLLLAS 242
N CEC TCE R GAF P T S+Y LR+TD RLN +L +L +S
Sbjct: 292 NNEYCECYTCERRGTGAFKSRVGLPAPTPVINSKYGLRETDKRLN------RLKKLGDSS 345
Query: 243 DKNDTNSSDNSSKYSLVQWNNNSKENIFDLG 273
+D+ S +++ Q N+ +G
Sbjct: 346 KNSDSQSVSSNTDADTTQEKANATTRKSSIG 376
>gi|74212540|dbj|BAE31010.1| unnamed protein product [Mus musculus]
Length = 883
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 116/207 (56%), Positives = 145/207 (70%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 197 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 255
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 256 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 315
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE + GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 316 NNEFCECYTCERQGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 368
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q +N+ N
Sbjct: 369 SKNSDSQSVSSNTDADTTQEKDNATSN 395
>gi|194912010|ref|XP_001982418.1| GG12748 [Drosophila erecta]
gi|190648094|gb|EDV45387.1| GG12748 [Drosophila erecta]
Length = 1309
Score = 225 bits (573), Expect = 2e-56, Method: Composition-based stats.
Identities = 113/178 (63%), Positives = 132/178 (74%), Gaps = 13/178 (7%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++ C+RYSLE RGA+I +T++W KND I LVGCIAELT+ EE LLH+G NDFSVMYS
Sbjct: 252 IEACFRYSLEEQRGAKISSTKRWSKNDKIECLVGCIAELTEAEEAALLHSGKNDFSVMYS 311
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+F+AT RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 312 CRKNCAQLWLGPAAYINHDCRANCKFLATGRDTACVKVLRDIEVGEEITCFYGEDFFGDS 371
Query: 186 NCLCECDTCEIRMKGAFA----------QSSVEPQTTQPSR---YCLRDTDMRLNKRK 230
N CEC+TCE R GA+A + ++ P Y LR+TD R+N+ K
Sbjct: 372 NRYCECETCERRGTGAYAGKDDGLMLGLSMGLGLASSGPGNSGGYRLRETDNRINRIK 429
>gi|4929639|gb|AAD34080.1|AF151843_1 CGI-85 protein [Homo sapiens]
Length = 384
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 117/204 (57%), Positives = 144/204 (70%), Gaps = 11/204 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 187 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 245
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 246 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 305
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 306 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 358
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNS 265
S +D+ S +++ Q NN+
Sbjct: 359 SKNSDSQSVSSNTDADTTQEKNNA 382
>gi|417400149|gb|JAA47040.1| Putative histone tail methylase [Desmodus rotundus]
Length = 393
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 117/204 (57%), Positives = 144/204 (70%), Gaps = 11/204 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 315 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 367
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNS 265
S +D+ S +++ Q NN+
Sbjct: 368 SKNSDSQSVSSNTDADTTQEKNNA 391
>gi|380813344|gb|AFE78546.1| histone-lysine N-methyltransferase SUV420H1 isoform 2 [Macaca
mulatta]
gi|383410281|gb|AFH28354.1| histone-lysine N-methyltransferase SUV420H1 isoform 2 [Macaca
mulatta]
gi|384941868|gb|AFI34539.1| histone-lysine N-methyltransferase SUV420H1 isoform 2 [Macaca
mulatta]
Length = 393
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 117/204 (57%), Positives = 144/204 (70%), Gaps = 11/204 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 315 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 367
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNS 265
S +D+ S +++ Q NN+
Sbjct: 368 SKNSDSQSVSSNTDADTTQEKNNA 391
>gi|50659084|ref|NP_057112.3| histone-lysine N-methyltransferase SUV420H1 isoform 2 [Homo
sapiens]
gi|40807014|gb|AAH65287.1| Suppressor of variegation 4-20 homolog 1 (Drosophila) [Homo
sapiens]
gi|51574062|gb|AAH02522.2| Suppressor of variegation 4-20 homolog 1 (Drosophila) [Homo
sapiens]
gi|56789794|gb|AAH87834.1| Suppressor of variegation 4-20 homolog 1 (Drosophila) [Homo
sapiens]
gi|119595104|gb|EAW74698.1| suppressor of variegation 4-20 homolog 1 (Drosophila), isoform
CRA_b [Homo sapiens]
gi|410223834|gb|JAA09136.1| suppressor of variegation 4-20 homolog 1 [Pan troglodytes]
gi|410251046|gb|JAA13490.1| suppressor of variegation 4-20 homolog 1 [Pan troglodytes]
gi|410306188|gb|JAA31694.1| suppressor of variegation 4-20 homolog 1 [Pan troglodytes]
gi|410352265|gb|JAA42736.1| suppressor of variegation 4-20 homolog 1 [Pan troglodytes]
Length = 393
Score = 224 bits (572), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 117/204 (57%), Positives = 144/204 (70%), Gaps = 11/204 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 315 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 367
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNS 265
S +D+ S +++ Q NN+
Sbjct: 368 SKNSDSQSVSSNTDADTTQEKNNA 391
>gi|126343072|ref|XP_001362478.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1 isoform 1
[Monodelphis domestica]
Length = 902
Score = 224 bits (571), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 116/207 (56%), Positives = 144/207 (69%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 315 NNEYCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 367
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q ++ N
Sbjct: 368 SKNSDSQSVSSNTDADTTQEKTDATSN 394
>gi|126343074|ref|XP_001362560.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1 isoform 2
[Monodelphis domestica]
Length = 879
Score = 224 bits (571), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 116/207 (56%), Positives = 144/207 (69%), Gaps = 11/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 173 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 231
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 232 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 291
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 292 NNEYCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 344
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q ++ N
Sbjct: 345 SKNSDSQSVSSNTDADTTQEKTDATSN 371
>gi|221042582|dbj|BAH12968.1| unnamed protein product [Homo sapiens]
Length = 370
Score = 224 bits (571), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 117/204 (57%), Positives = 144/204 (70%), Gaps = 11/204 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 173 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 231
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 232 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 291
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 292 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 344
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNS 265
S +D+ S +++ Q NN+
Sbjct: 345 SKNSDSQSVSSNTDADTTQEKNNA 368
>gi|327260017|ref|XP_003214832.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1-like isoform
1 [Anolis carolinensis]
Length = 861
Score = 223 bits (569), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 115/199 (57%), Positives = 141/199 (70%), Gaps = 11/199 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 173 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSELEENMLLRHGENDFSVMY 231
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 232 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 291
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 292 NNEFCECYTCERRGTGAF-KSRVGLPVPAPVINSKYGLRETDKRLN------RLKKLGDS 344
Query: 242 SDKNDTNSSDNSSKYSLVQ 260
S +D+ S +++ Q
Sbjct: 345 SKNSDSQSVSSNTDADTTQ 363
>gi|327260019|ref|XP_003214833.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1-like isoform
2 [Anolis carolinensis]
Length = 860
Score = 223 bits (569), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 115/199 (57%), Positives = 141/199 (70%), Gaps = 11/199 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 173 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSELEENMLLRHGENDFSVMY 231
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 232 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 291
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 292 NNEFCECYTCERRGTGAF-KSRVGLPVPAPVINSKYGLRETDKRLN------RLKKLGDS 344
Query: 242 SDKNDTNSSDNSSKYSLVQ 260
S +D+ S +++ Q
Sbjct: 345 SKNSDSQSVSSNTDADTTQ 363
>gi|157817845|ref|NP_001101982.1| histone-lysine N-methyltransferase SUV420H1 [Rattus norvegicus]
gi|143586502|sp|P0C2N5.1|SUV41_RAT RecName: Full=Histone-lysine N-methyltransferase SUV420H1; AltName:
Full=Suppressor of variegation 4-20 homolog 1;
Short=Su(var)4-20 homolog 1; Short=Suv4-20h1
gi|149061877|gb|EDM12300.1| suppressor of variegation 4-20 homolog 1 (Drosophila) (predicted),
isoform CRA_a [Rattus norvegicus]
Length = 883
Score = 223 bits (568), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 122/223 (54%), Positives = 147/223 (65%), Gaps = 13/223 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNK-RKLHKKLNRLLL 240
N CEC TCE R GAF +S V P S+Y LR+TD RLN+ +KL
Sbjct: 315 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLNRLKKLGDSSKSSDS 373
Query: 241 A-----SDKNDTNSSDN--SSKYSLVQWNNNSKENIFDLGTFP 276
+D + T DN S++ S V NSK + P
Sbjct: 374 QSVSSNTDADTTQEKDNATSNRKSSVGVKKNSKSRALTRQSMP 416
>gi|268607629|ref|NP_001161361.1| histone-lysine N-methyltransferase SUV420H1 isoform c [Mus
musculus]
gi|26349851|dbj|BAC38565.1| unnamed protein product [Mus musculus]
Length = 394
Score = 223 bits (568), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 116/204 (56%), Positives = 144/204 (70%), Gaps = 11/204 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 197 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 255
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 256 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 315
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 316 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 368
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNS 265
S +D+ S +++ Q +N+
Sbjct: 369 SKNSDSQSVSSNTDADTTQEKDNA 392
>gi|149061878|gb|EDM12301.1| suppressor of variegation 4-20 homolog 1 (Drosophila) (predicted),
isoform CRA_b [Rattus norvegicus]
Length = 860
Score = 223 bits (568), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 122/223 (54%), Positives = 147/223 (65%), Gaps = 13/223 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 173 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 231
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 232 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 291
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNK-RKLHKKLNRLLL 240
N CEC TCE R GAF +S V P S+Y LR+TD RLN+ +KL
Sbjct: 292 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLNRLKKLGDSSKSSDS 350
Query: 241 A-----SDKNDTNSSDN--SSKYSLVQWNNNSKENIFDLGTFP 276
+D + T DN S++ S V NSK + P
Sbjct: 351 QSVSSNTDADTTQEKDNATSNRKSSVGVKKNSKSRALTRQSMP 393
>gi|443689530|gb|ELT91903.1| hypothetical protein CAPTEDRAFT_228696 [Capitella teleta]
Length = 747
Score = 223 bits (568), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 101/169 (59%), Positives = 129/169 (76%), Gaps = 6/169 (3%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E++PC+RYS+EG G +I AT+KW K + I LVGCI EL+ EEE +LL G+ND+SVMY
Sbjct: 124 EIQPCHRYSMEGQVGGKICATKKWMKGEKITKLVGCIGELSAEEEAQLLRPGVNDYSVMY 183
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
SCRKNCAQLWLGPAA+INHDCR NC+FV+T RDTAC++VL DI G+EITC+YG+DFFG+
Sbjct: 184 SCRKNCAQLWLGPAAFINHDCRSNCKFVSTGRDTACLQVLLDIRVGQEITCYYGDDFFGE 243
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYC----LRDTDMRLNKR 229
NC CEC+TCE ++ GAF S P + C LRDT+ RL+++
Sbjct: 244 GNCYCECETCERQLTGAFTPKS--PGAVDRKKLCEGYRLRDTNDRLDRQ 290
>gi|115496590|ref|NP_001070019.1| histone-lysine N-methyltransferase SUV420H1 [Bos taurus]
gi|122135297|sp|Q29RP8.1|SUV41_BOVIN RecName: Full=Histone-lysine N-methyltransferase SUV420H1; AltName:
Full=Suppressor of variegation 4-20 homolog 1;
Short=Su(var)4-20 homolog 1; Short=Suv4-20h1
gi|88954113|gb|AAI14080.1| Suppressor of variegation 4-20 homolog 1 (Drosophila) [Bos taurus]
gi|296471485|tpg|DAA13600.1| TPA: histone-lysine N-methyltransferase SUV420H1 [Bos taurus]
Length = 393
Score = 223 bits (568), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 116/204 (56%), Positives = 144/204 (70%), Gaps = 11/204 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 315 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 367
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNS 265
S +D+ S +++ Q N++
Sbjct: 368 SKNSDSQSVSSNTDADTTQGRNDA 391
>gi|143586162|sp|Q5U3H2.2|SUV41_DANRE RecName: Full=Histone-lysine N-methyltransferase SUV420H1; AltName:
Full=Suppressor of variegation 4-20 homolog 1;
Short=Su(var)4-20 homolog 1; Short=Suv4-20h1
Length = 808
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 117/201 (58%), Positives = 140/201 (69%), Gaps = 10/201 (4%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ C RYS E N GA+IVAT+ W++ND I +LVGCIAEL+ EE+ LL G NDFSVMY
Sbjct: 171 EILSCNRYSSEQN-GAKIVATKDWKRNDKIEHLVGCIAELSPSEERMLLRHGENDFSVMY 229
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVKVLRDIE GEEI+C+YG+ FFG+
Sbjct: 230 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKVLRDIEPGEEISCYYGDGFFGE 289
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF P P S+Y LR+TD RLN+ KKL
Sbjct: 290 NNEFCECYTCERRGTGAFKSKPGLP-VEAPVINSKYGLRETDKRLNR---LKKLGESCRN 345
Query: 242 SDKN--DTNSSDNSSKYSLVQ 260
SD +N+ +S + + VQ
Sbjct: 346 SDSQSVSSNAEADSQEPTTVQ 366
>gi|55925233|ref|NP_001007338.1| histone-lysine N-methyltransferase SUV420H1 [Danio rerio]
gi|55250879|gb|AAH85544.1| Suppressor of variegation 4-20 homolog 1 (Drosophila) [Danio rerio]
Length = 808
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 117/201 (58%), Positives = 140/201 (69%), Gaps = 10/201 (4%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ C RYS E N GA+IVAT+ W++ND I +LVGCIAEL+ EE+ LL G NDFSVMY
Sbjct: 171 EILSCNRYSSEQN-GAKIVATKDWKRNDKIEHLVGCIAELSPSEERMLLRHGENDFSVMY 229
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVKVLRDIE GEEI+C+YG+ FFG+
Sbjct: 230 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKVLRDIEPGEEISCYYGDGFFGE 289
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF P P S+Y LR+TD RLN+ KKL
Sbjct: 290 NNEFCECYTCERRGTGAFKSKPGLP-VEAPVINSKYGLRETDKRLNR---LKKLGESCRN 345
Query: 242 SDKN--DTNSSDNSSKYSLVQ 260
SD +N+ +S + + VQ
Sbjct: 346 SDSQSVSSNAEADSQEPTTVQ 366
>gi|195347432|ref|XP_002040257.1| GM19029 [Drosophila sechellia]
gi|194121685|gb|EDW43728.1| GM19029 [Drosophila sechellia]
Length = 1294
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 114/178 (64%), Positives = 132/178 (74%), Gaps = 13/178 (7%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++ CYRY+LE RGA+I +T++W KND I LVGCIAELT+ EE LLH+G NDFSVMYS
Sbjct: 251 IEACYRYTLEEQRGAKISSTKRWSKNDKIECLVGCIAELTEAEEAALLHSGKNDFSVMYS 310
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+F+AT RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 311 CRKNCAQLWLGPAAYINHDCRANCKFLATGRDTACVKVLRDIEVGEEITCFYGEDFFGDS 370
Query: 186 NCLCECDTCEIRMKGAFAQS----------SVEPQTTQPSR---YCLRDTDMRLNKRK 230
N CEC+TCE R GAFA + ++ P Y LR+TD R+N+ K
Sbjct: 371 NRYCECETCERRGTGAFAGKDDGLMLGLSMGLGLASSGPGNNGGYRLRETDNRINRIK 428
>gi|4688662|emb|CAA20894.2| EG:115C2.10 [Drosophila melanogaster]
Length = 1291
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 115/180 (63%), Positives = 131/180 (72%), Gaps = 17/180 (9%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++ CYRY+LE RGA+I +T++W KND I LVGCIAELT+ EE LLH+G NDFSVMYS
Sbjct: 254 IEACYRYTLEEQRGAKISSTKRWSKNDKIECLVGCIAELTEAEEAALLHSGKNDFSVMYS 313
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+F+AT RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 314 CRKNCAQLWLGPAAYINHDCRANCKFLATGRDTACVKVLRDIEVGEEITCFYGEDFFGDS 373
Query: 186 NCLCECDTCEIRMKGAFAQ---------------SSVEPQTTQPSRYCLRDTDMRLNKRK 230
N CEC+TCE R GAFA +S P Y LR+TD R+N+ K
Sbjct: 374 NRYCECETCERRGTGAFAGKDDGLMLGLSMGLGLASSGPGNN--GGYRLRETDNRINRIK 431
>gi|194768707|ref|XP_001966453.1| GF21991 [Drosophila ananassae]
gi|190617217|gb|EDV32741.1| GF21991 [Drosophila ananassae]
Length = 1437
Score = 220 bits (561), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 115/178 (64%), Positives = 130/178 (73%), Gaps = 13/178 (7%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++ CYRYSLE RGA+I +T++W KND I LVGCIAELT+ EE LLH+G NDFSVMYS
Sbjct: 266 IEACYRYSLEEQRGAKISSTKRWSKNDKIECLVGCIAELTEAEEAALLHSGKNDFSVMYS 325
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+F+AT RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 326 CRKNCAQLWLGPAAYINHDCRANCKFLATGRDTACVKVLRDIEVGEEITCFYGEDFFGDS 385
Query: 186 NCLCECDTCEIRMKGAFA-------------QSSVEPQTTQPSRYCLRDTDMRLNKRK 230
N CEC+TCE R GAFA T + Y LR+TD R+N+ K
Sbjct: 386 NRYCECETCERRGTGAFAGKEDGLMLGLSSGLGLGLGLATGGTGYRLRETDNRINRIK 443
>gi|17862544|gb|AAL39749.1| LD36415p [Drosophila melanogaster]
Length = 782
Score = 220 bits (560), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 114/178 (64%), Positives = 132/178 (74%), Gaps = 13/178 (7%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++ CYRY+LE RGA+I +T++W KND I LVGCIAELT+ EE LLH+G NDFSVMYS
Sbjct: 254 IEACYRYTLEEQRGAKISSTKRWSKNDKIECLVGCIAELTEAEEAALLHSGKNDFSVMYS 313
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+F+AT RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 314 CRKNCAQLWLGPAAYINHDCRANCKFLATGRDTACVKVLRDIEVGEEITCFYGEDFFGDS 373
Query: 186 NCLCECDTCEIRMKGAFAQS----------SVEPQTTQPSR---YCLRDTDMRLNKRK 230
N CEC+TCE R GAFA + ++ P Y LR+TD R+N+ K
Sbjct: 374 NRYCECETCERRGTGAFAGKDDGLMLGLSMGLGLASSGPGNNGGYRLRETDNRINRIK 431
>gi|195398701|ref|XP_002057959.1| GJ15764 [Drosophila virilis]
gi|194150383|gb|EDW66067.1| GJ15764 [Drosophila virilis]
Length = 1589
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 115/187 (61%), Positives = 130/187 (69%), Gaps = 22/187 (11%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++ CYRYSLE RGA+I +T++W KND I LVGCIAELT+ EE LLH+G NDFSVMYS
Sbjct: 298 IEACYRYSLEEQRGAKISSTKRWSKNDKIECLVGCIAELTEAEEAALLHSGKNDFSVMYS 357
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRKNCAQLWLGPAAYINHDCR NC+F+AT RDTACVKVLRDIE GEEITCFYGEDFFGD
Sbjct: 358 CRKNCAQLWLGPAAYINHDCRANCKFLATGRDTACVKVLRDIEVGEEITCFYGEDFFGDS 417
Query: 186 NCLCECDTCEIRMKGAFAQSSVEPQTTQP----------------------SRYCLRDTD 223
N CEC+TCE R GAFA Q + Y LR+TD
Sbjct: 418 NRYCECETCERRGTGAFAGKHSANQLNEAMLGLTNGLGLGLGLGAAGGANGGGYRLRETD 477
Query: 224 MRLNKRK 230
R+N+ K
Sbjct: 478 NRINRIK 484
>gi|187607046|ref|NP_001120591.1| uncharacterized protein LOC100145748 [Xenopus (Silurana)
tropicalis]
gi|171847150|gb|AAI61612.1| LOC100145748 protein [Xenopus (Silurana) tropicalis]
Length = 772
Score = 217 bits (552), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 112/181 (61%), Positives = 134/181 (74%), Gaps = 9/181 (4%)
Query: 54 HRYSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE 109
H + Y +LPE + C RYSLE N GA++V+T+ W KN+ I LVGCIAEL+K +E
Sbjct: 92 HIFRYLRMFLPESGFMILSCCRYSLETN-GAKVVSTKAWCKNEKIELLVGCIAELSKADE 150
Query: 110 KRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQ 169
LL G NDFSVMYS RK CAQLWLGPAA+INHDCRPNC+FV TE +TACVKVLR+I+
Sbjct: 151 T-LLRFGDNDFSVMYSTRKKCAQLWLGPAAFINHDCRPNCKFVPTEGNTACVKVLREIKS 209
Query: 170 GEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQTTQPS--RYCLRDTDMRLN 227
GEEITCFYG+ FFG+KN LCEC TCE + +GAF Q E QT S +Y LR+TD RLN
Sbjct: 210 GEEITCFYGDSFFGEKNELCECCTCERKGEGAFKQQKTE-QTVSTSLEKYQLRETDGRLN 268
Query: 228 K 228
+
Sbjct: 269 R 269
>gi|148233390|ref|NP_001090519.1| histone-lysine N-methyltransferase SUV420H2 [Xenopus laevis]
gi|143586742|sp|A0JMZ4.1|SUV42_XENLA RecName: Full=Histone-lysine N-methyltransferase SUV420H2; AltName:
Full=Suppressor of variegation 4-20 homolog 2;
Short=Su(var)4-20 homolog 2; Short=Suv4-20h2
gi|116487712|gb|AAI26061.1| Suv420h2 protein [Xenopus laevis]
Length = 761
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 110/181 (60%), Positives = 133/181 (73%), Gaps = 9/181 (4%)
Query: 54 HRYSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE 109
H + Y +LPE + C RYSLE N GA++V+T+ W KN+ I LVGCIAEL+K +E
Sbjct: 92 HIFRYLRMFLPESGFMILSCSRYSLEMN-GAKVVSTKSWSKNEKIELLVGCIAELSKADE 150
Query: 110 KRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQ 169
LL G NDFSVMYS RK CAQLWLGPAA+INHDCRPNC+FV TE +TACVKVLR+I+
Sbjct: 151 T-LLRFGDNDFSVMYSTRKKCAQLWLGPAAFINHDCRPNCKFVPTEGNTACVKVLREIKT 209
Query: 170 GEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQTTQPS--RYCLRDTDMRLN 227
GEEITCFYG+ FFG+KN +CEC TCE + GAF Q + E QT S +Y LR+TD RL
Sbjct: 210 GEEITCFYGDSFFGEKNEMCECCTCERKGDGAFKQQNTE-QTVSTSLEKYQLRETDGRLK 268
Query: 228 K 228
+
Sbjct: 269 R 269
>gi|47212905|emb|CAF90795.1| unnamed protein product [Tetraodon nigroviridis]
Length = 839
Score = 214 bits (544), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 113/190 (59%), Positives = 133/190 (70%), Gaps = 10/190 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT+ W++ND I YLVGCIAEL++ EE LL G NDFSVMY
Sbjct: 165 EILPCNRYSSEQN-GAKIVATKAWKRNDKIEYLVGCIAELSESEENMLLRHGENDFSVMY 223
Query: 125 SCRKNCAQLWLGPAAYINHDCRP--NCRFVATERDTACVKVLRDIEQGEEITCFYGEDFF 182
S RKNCAQLWLGPAA+INH RP +FV+T RDTACVKVLRDIE GEEI+C+YG+ FF
Sbjct: 224 STRKNCAQLWLGPAAFINHGTRPPGQLQFVSTGRDTACVKVLRDIEPGEEISCYYGDGFF 283
Query: 183 GDKNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLL 239
G+ N CEC TCE R GAF +S Q P S+Y LR+TD RLN+ KK+
Sbjct: 284 GENNEFCECYTCERRGTGAF-KSKAGLQVEVPVINSKYGLRETDKRLNR---LKKMEEGS 339
Query: 240 LASDKNDTNS 249
+SD N S
Sbjct: 340 KSSDGNSVGS 349
>gi|157818959|ref|NP_001100945.1| histone-lysine N-methyltransferase SUV420H2 [Rattus norvegicus]
gi|143586731|sp|P0C2N6.1|SUV42_RAT RecName: Full=Histone-lysine N-methyltransferase SUV420H2; AltName:
Full=Suppressor of variegation 4-20 homolog 2;
Short=Su(var)4-20 homolog 2; Short=Suv4-20h2
gi|149016665|gb|EDL75851.1| suppressor of variegation 4-20 homolog 2 (Drosophila) (predicted)
[Rattus norvegicus]
Length = 470
Score = 207 bits (526), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 111/212 (52%), Positives = 145/212 (68%), Gaps = 9/212 (4%)
Query: 24 LNPSFWAFCVQALLAQRF-NIGRILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGNR 78
L +F A + +A F N + ++H + Y +LPE + PC RYS+E N
Sbjct: 61 LEAAFRALTLGGWMAHYFQNRAPRQEAALKNHIFCYLRAFLPESGFTILPCTRYSMETN- 119
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA+IV+TR W+KN+ + LVGCIAEL +EE++ LL G NDFSVMYS RK AQLWLGPA
Sbjct: 120 GAKIVSTRAWKKNEKLELLVGCIAEL-REEDEYLLRAGENDFSVMYSTRKRSAQLWLGPA 178
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
A+INHDC+PNC+FV ++ +TACVKVLRDIE G+E+TCFYGE FFG+KN CEC TCE +
Sbjct: 179 AFINHDCKPNCKFVPSDGNTACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECYTCERKG 238
Query: 199 KGAFAQSSVEPQT-TQP-SRYCLRDTDMRLNK 228
+GAF EP+ +P +Y LR+T RL +
Sbjct: 239 EGAFRLQPREPELRPRPLDKYELRETKRRLQQ 270
>gi|148701006|gb|EDL32953.1| suppressor of variegation 4-20 homolog 1 (Drosophila), isoform
CRA_b [Mus musculus]
Length = 880
Score = 206 bits (524), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 112/207 (54%), Positives = 140/207 (67%), Gaps = 12/207 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 195 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 253
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INH P+ FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 254 STRKNCAQLWLGPAAFINHGKLPS-EFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 312
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 313 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 365
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNSKEN 268
S +D+ S +++ Q +N+ N
Sbjct: 366 SKNSDSQSVSSNTDADTTQEKDNATSN 392
>gi|397471123|ref|XP_003807153.1| PREDICTED: histone-lysine N-methyltransferase SUV420H2 [Pan
paniscus]
Length = 464
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 110/214 (51%), Positives = 140/214 (65%), Gaps = 13/214 (6%)
Query: 24 LNPSFWAFCVQALLAQRFNI-GRILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGNR 78
L ++ A + A+ F G + + H Y Y +LPE + PC RYS+E N
Sbjct: 61 LEAAYRALTLGGWTARYFQSRGPRQEAALKTHVYRYLRAFLPESGFTILPCTRYSMETN- 119
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA+IV+TR W+KN+ + LVGCIAEL +E ++ LL G NDFS+MYS RK AQLWLGPA
Sbjct: 120 GAKIVSTRAWKKNEKLELLVGCIAEL-READEGLLRAGENDFSIMYSTRKRSAQLWLGPA 178
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
A+INHDC+PNC+FV + + ACVKVLRDIE G+E+TCFYGE FFG+KN CEC TCE +
Sbjct: 179 AFINHDCKPNCKFVPADGNAACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECHTCERKG 238
Query: 199 KGAFAQSSVEPQTTQPSR----YCLRDTDMRLNK 228
+GAF EP T P R Y LR+T RL +
Sbjct: 239 EGAFRTRPREP--TLPPRPLDKYQLRETKRRLQQ 270
>gi|410982245|ref|XP_003997470.1| PREDICTED: histone-lysine N-methyltransferase SUV420H2 [Felis
catus]
Length = 337
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 109/218 (50%), Positives = 142/218 (65%), Gaps = 11/218 (5%)
Query: 24 LNPSFWAFCVQALLAQRFNI-GRILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGNR 78
L ++ A + +A F G + + H Y Y +LPE + PC RYS+E N
Sbjct: 61 LEAAYRALTLGGWMAHYFQSRGPRQEAALKTHIYRYLRAFLPESGFTILPCTRYSMETN- 119
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA+IV+TR W+KN+ + LVGCIAEL +E ++ LL G NDFS+MYS RK AQLWLGPA
Sbjct: 120 GAKIVSTRAWKKNEKLELLVGCIAEL-READEGLLRAGENDFSIMYSTRKRSAQLWLGPA 178
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
A+INHDC+PNC+FV + + ACVKVLRDIE G+E+TCFYGE FFG+KN CEC TCE R
Sbjct: 179 AFINHDCKPNCKFVPADGNAACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECYTCERRG 238
Query: 199 KGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLHKKLN 236
+GAF + P+ P R L ++R KR+L + L
Sbjct: 239 EGAF---RLRPREPAPPRL-LDKYELRETKRRLQQGLE 272
>gi|169234795|ref|NP_666289.2| histone-lysine N-methyltransferase SUV420H2 [Mus musculus]
gi|169234797|ref|NP_001108490.1| histone-lysine N-methyltransferase SUV420H2 [Mus musculus]
gi|81892754|sp|Q6Q783.1|SUV42_MOUSE RecName: Full=Histone-lysine N-methyltransferase SUV420H2; AltName:
Full=Suppressor of variegation 4-20 homolog 2;
Short=Su(var)4-20 homolog 2; Short=Suv4-20h2
gi|46560163|gb|AAT00540.1| Suv4-20h2 [Mus musculus]
gi|74191847|dbj|BAE32874.1| unnamed protein product [Mus musculus]
gi|148699305|gb|EDL31252.1| suppressor of variegation 4-20 homolog 2 (Drosophila), isoform
CRA_a [Mus musculus]
gi|148699306|gb|EDL31253.1| suppressor of variegation 4-20 homolog 2 (Drosophila), isoform
CRA_a [Mus musculus]
Length = 468
Score = 204 bits (519), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 109/212 (51%), Positives = 143/212 (67%), Gaps = 9/212 (4%)
Query: 24 LNPSFWAFCVQALLAQRFNI-GRILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGNR 78
L +F A + +A F + + H + Y +LPE + PC RYS+E N
Sbjct: 61 LEAAFRALTLGGWMAHYFQSRAPRQEAALKTHIFCYLRAFLPESGFTILPCTRYSMETN- 119
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA+IV+TR W+KN+ + LVGCIAEL +EE++ LL G NDFS+MYS RK AQLWLGPA
Sbjct: 120 GAKIVSTRAWKKNEKLELLVGCIAEL-REEDEDLLRAGENDFSIMYSTRKRSAQLWLGPA 178
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
A+INHDC+PNC+FV ++ +TACVKVLRDIE G+E+TCFYGE FFG+KN CEC TCE +
Sbjct: 179 AFINHDCKPNCKFVPSDGNTACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECYTCERKG 238
Query: 199 KGAFAQSSVEPQT-TQP-SRYCLRDTDMRLNK 228
+GAF EP+ +P +Y LR+T RL +
Sbjct: 239 EGAFRLQPREPELRPKPLDKYELRETKRRLQQ 270
>gi|193603706|ref|XP_001949874.1| PREDICTED: histone-lysine N-methyltransferase Suv4-20-like
[Acyrthosiphon pisum]
Length = 276
Score = 204 bits (519), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 100/172 (58%), Positives = 124/172 (72%), Gaps = 4/172 (2%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++PCYRYSLE GA++ AT W K++ I LVGC+AELT++EE +LH G NDFSVMYS
Sbjct: 101 IEPCYRYSLENKVGAKVNATNHWYKDEIIKQLVGCVAELTEKEEDEILHFGKNDFSVMYS 160
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRK CAQLWLGPAAYINHDC NC+FV ++ TACVKVLRDI+ G+EITCFY ++FFG+
Sbjct: 161 CRKKCAQLWLGPAAYINHDCLANCKFVPNDKGTACVKVLRDIKPGDEITCFYSDNFFGEN 220
Query: 186 NCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNK-RKLHKKLN 236
N CEC +CE KGAF + +P+ RY R T+ R N R L+K N
Sbjct: 221 NVNCECYSCEKLCKGAFKTNEEQPKL---RRYNFRQTNGRTNSYRSLYKSEN 269
>gi|336391073|ref|NP_001229572.1| histone-lysine N-methyltransferase Suv4-20-like [Acyrthosiphon
pisum]
gi|336391168|ref|NP_001229571.1| histone-lysine N-methyltransferase Suv4-20-like [Acyrthosiphon
pisum]
Length = 327
Score = 204 bits (519), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 96/162 (59%), Positives = 118/162 (72%), Gaps = 3/162 (1%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++PCYRYSLE GA++ AT W K+ I LVGC+AELT++EE +LH G NDFSVMYS
Sbjct: 101 IEPCYRYSLENKVGAKVNATSHWHKDGIIKRLVGCVAELTEKEEDEILHVGKNDFSVMYS 160
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
CRK CAQLWLGPAAYINHDC+ NC+FV ++ TACVKVLRDI+ G+EITCFY +FFG+
Sbjct: 161 CRKKCAQLWLGPAAYINHDCQANCKFVPNDKGTACVKVLRDIKPGDEITCFYSANFFGEN 220
Query: 186 NCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLN 227
N CEC +CE KGAF + +P+ RY R T+ R N
Sbjct: 221 NVNCECCSCEKLCKGAFKTNEEQPKL---GRYNFRQTNGRTN 259
>gi|410210720|gb|JAA02579.1| suppressor of variegation 4-20 homolog 2 [Pan troglodytes]
Length = 462
Score = 204 bits (518), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 110/214 (51%), Positives = 140/214 (65%), Gaps = 13/214 (6%)
Query: 24 LNPSFWAFCVQALLAQRFNI-GRILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGNR 78
L ++ A + A+ F G + + H Y Y +LPE + PC RYS+E N
Sbjct: 61 LEAAYRALTLGGWTARYFQSRGPRQEAALKTHVYRYLRAFLPESGFTILPCTRYSMETN- 119
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA+IV+TR W+KN+ + LVGCIAEL +E ++ LL G NDFS+MYS RK AQLWLGPA
Sbjct: 120 GAKIVSTRAWKKNEKLELLVGCIAEL-READEGLLRAGENDFSIMYSTRKRSAQLWLGPA 178
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
A+INHDC+PNC+FV + + ACVKVLRDIE G+E+TCFYGE FFG+KN CEC TCE +
Sbjct: 179 AFINHDCKPNCKFVPADGNAACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECHTCERKG 238
Query: 199 KGAFAQSSVEPQTTQPSR----YCLRDTDMRLNK 228
+GAF EP T P R Y LR+T RL +
Sbjct: 239 EGAFRTRPREP--TLPPRPLDKYQLRETKRRLQQ 270
>gi|410054587|ref|XP_003953682.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SUV420H2 [Pan troglodytes]
Length = 464
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 105/183 (57%), Positives = 129/183 (70%), Gaps = 12/183 (6%)
Query: 54 HRYSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE 109
H Y Y +LPE + PC RYS+E N GA+IV+TR W+KN+ + LVGCIAEL +E +
Sbjct: 94 HVYRYLRAFLPESGFTILPCTRYSMETN-GAKIVSTRAWKKNEKLELLVGCIAEL-READ 151
Query: 110 KRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQ 169
+ LL G NDFS+MYS RK AQLWLGPAA+INHDC+PNC+FV + + ACVKVLRDIE
Sbjct: 152 EGLLRAGENDFSIMYSTRKRSAQLWLGPAAFINHDCKPNCKFVPADGNAACVKVLRDIEP 211
Query: 170 GEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQTTQPSR----YCLRDTDMR 225
G+E+TCFYGE FFG+KN CEC TCE + +GAF EP T P R Y LR+T R
Sbjct: 212 GDEVTCFYGEGFFGEKNEHCECHTCERKGEGAFRTRPREP--TLPPRPLDKYQLRETKRR 269
Query: 226 LNK 228
L +
Sbjct: 270 LQQ 272
>gi|148701007|gb|EDL32954.1| suppressor of variegation 4-20 homolog 1 (Drosophila), isoform
CRA_c [Mus musculus]
Length = 414
Score = 203 bits (517), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 111/204 (54%), Positives = 139/204 (68%), Gaps = 12/204 (5%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 218 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 276
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INH P+ FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 277 STRKNCAQLWLGPAAFINHGKLPS-EFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 335
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLA 241
N CEC TCE R GAF +S V P S+Y LR+TD RLN +L +L +
Sbjct: 336 NNEFCECYTCERRGTGAF-KSRVGLPAPAPVINSKYGLRETDKRLN------RLKKLGDS 388
Query: 242 SDKNDTNSSDNSSKYSLVQWNNNS 265
S +D+ S +++ Q +N+
Sbjct: 389 SKNSDSQSVSSNTDADTTQEKDNA 412
>gi|339717664|pdb|3S8P|A Chain A, Crystal Structure Of The Set Domain Of Human
Histone-Lysine N- Methyltransferase Suv420h1 In Complex
With S-Adenosyl-L-Methionine
gi|339717665|pdb|3S8P|B Chain B, Crystal Structure Of The Set Domain Of Human
Histone-Lysine N- Methyltransferase Suv420h1 In Complex
With S-Adenosyl-L-Methionine
Length = 273
Score = 203 bits (516), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 96/138 (69%), Positives = 111/138 (80%), Gaps = 1/138 (0%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSV Y
Sbjct: 134 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENXLLRHGENDFSVXY 192
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 193 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 252
Query: 185 KNCLCECDTCEIRMKGAF 202
N CEC TCE R GAF
Sbjct: 253 NNEFCECYTCERRGTGAF 270
>gi|426390244|ref|XP_004061516.1| PREDICTED: histone-lysine N-methyltransferase SUV420H2 [Gorilla
gorilla gorilla]
Length = 473
Score = 203 bits (516), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 108/212 (50%), Positives = 140/212 (66%), Gaps = 9/212 (4%)
Query: 24 LNPSFWAFCVQALLAQRFNI-GRILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGNR 78
L ++ A + A+ F G + + H Y Y +LPE + PC RYS+E N
Sbjct: 61 LEAAYRALTLGGWTARYFQSRGPRQEAALKTHVYRYLRAFLPESGFTILPCTRYSMETN- 119
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA+IV+TR W+KN+ + LVGCIAEL +E ++ LL G NDFS+MYS RK AQLWLGPA
Sbjct: 120 GAKIVSTRAWKKNEKLELLVGCIAEL-READEGLLRAGENDFSIMYSTRKRSAQLWLGPA 178
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
A+INHDC+PNC+FV + + ACVKVLRDIE G+E+TCFYGE FFG+KN CEC TCE +
Sbjct: 179 AFINHDCKPNCKFVPADGNAACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECHTCERKG 238
Query: 199 KGAFAQSSVEPQ-TTQP-SRYCLRDTDMRLNK 228
+GAF EP +P +Y LR+T RL +
Sbjct: 239 EGAFRTRPREPALPPRPLDKYQLRETKRRLQQ 270
>gi|31543169|ref|NP_116090.2| histone-lysine N-methyltransferase SUV420H2 [Homo sapiens]
gi|74727906|sp|Q86Y97.1|SV422_HUMAN RecName: Full=Histone-lysine N-methyltransferase SUV420H2; AltName:
Full=Lysine N-methyltransferase 5C; AltName:
Full=Suppressor of variegation 4-20 homolog 2;
Short=Su(var)4-20 homolog 2; Short=Suv4-20h2
gi|27924375|gb|AAH44889.1| Suppressor of variegation 4-20 homolog 2 (Drosophila) [Homo
sapiens]
gi|119592771|gb|EAW72365.1| suppressor of variegation 4-20 homolog 2 (Drosophila), isoform
CRA_b [Homo sapiens]
gi|307686221|dbj|BAJ21041.1| suppressor of variegation 4-20 homolog 2 [synthetic construct]
Length = 462
Score = 202 bits (515), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 108/212 (50%), Positives = 140/212 (66%), Gaps = 9/212 (4%)
Query: 24 LNPSFWAFCVQALLAQRFNI-GRILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGNR 78
L ++ A + A+ F G + + H Y Y +LPE + PC RYS+E N
Sbjct: 61 LEAAYRALTLGGWTARYFQSRGPRQEAALKTHVYRYLRAFLPESGFTILPCTRYSMETN- 119
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA+IV+TR W+KN+ + LVGCIAEL +E ++ LL G NDFS+MYS RK AQLWLGPA
Sbjct: 120 GAKIVSTRAWKKNEKLELLVGCIAEL-READEGLLRAGENDFSIMYSTRKRSAQLWLGPA 178
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
A+INHDC+PNC+FV + + ACVKVLRDIE G+E+TCFYGE FFG+KN CEC TCE +
Sbjct: 179 AFINHDCKPNCKFVPADGNAACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECHTCERKG 238
Query: 199 KGAFAQSSVEPQ-TTQP-SRYCLRDTDMRLNK 228
+GAF EP +P +Y LR+T RL +
Sbjct: 239 EGAFRTRPREPALPPRPLDKYQLRETKRRLQQ 270
>gi|395861391|ref|XP_003802972.1| PREDICTED: histone-lysine N-methyltransferase SUV420H2 [Otolemur
garnettii]
Length = 462
Score = 202 bits (514), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 109/214 (50%), Positives = 140/214 (65%), Gaps = 13/214 (6%)
Query: 24 LNPSFWAFCVQALLAQRFNI-GRILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGNR 78
L+ ++ A + +A F G + + H Y Y +LPE + PC RYS+E N
Sbjct: 61 LDAAYRALTLGGWMAHYFQSRGPRQEAALKTHVYRYLRAFLPESGFTILPCTRYSMETN- 119
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA+IV+TR W+KN+ + LVGCIAEL +E ++ LL G NDFS+MYS RK AQLWLGPA
Sbjct: 120 GAKIVSTRAWKKNEKLELLVGCIAEL-READEGLLRAGENDFSIMYSTRKRSAQLWLGPA 178
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
A+INHDC+PNC+FV + + ACVKVLRDIE G+E+TCFYGE FFG+KN CEC TCE +
Sbjct: 179 AFINHDCKPNCKFVPADGNAACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECYTCERKG 238
Query: 199 KGAFAQSSVEPQTTQPSR----YCLRDTDMRLNK 228
+GAF EP P R Y LR+T RL +
Sbjct: 239 EGAFRLQPREP--VLPPRLQDKYQLRETKRRLQQ 270
>gi|403308558|ref|XP_003944725.1| PREDICTED: histone-lysine N-methyltransferase SUV420H2 [Saimiri
boliviensis boliviensis]
Length = 463
Score = 201 bits (512), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 103/181 (56%), Positives = 129/181 (71%), Gaps = 8/181 (4%)
Query: 54 HRYSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE 109
H Y Y +LPE + PC RYS+E N GA+IV+TR W+KN+ + LVGCIAEL +E +
Sbjct: 92 HVYRYLRAFLPESGFTILPCTRYSMETN-GAKIVSTRAWKKNEKLELLVGCIAEL-READ 149
Query: 110 KRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQ 169
+ LL G NDFS+MYS RK AQLWLGPAA+INHDC+PNC+FV + + ACVKVLRDIE
Sbjct: 150 EGLLRAGENDFSIMYSTRKRSAQLWLGPAAFINHDCKPNCKFVPADGNAACVKVLRDIEP 209
Query: 170 GEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQ-TTQP-SRYCLRDTDMRLN 227
G+E+TCFYGE FFG+KN CEC TCE + +GAF EP +P +Y LR+T RL
Sbjct: 210 GDEVTCFYGEGFFGEKNEHCECYTCERKGEGAFRTRPREPALPPRPLDKYQLRETKRRLQ 269
Query: 228 K 228
+
Sbjct: 270 Q 270
>gi|297705971|ref|XP_002829827.1| PREDICTED: histone-lysine N-methyltransferase SUV420H2-like,
partial [Pongo abelii]
Length = 300
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 107/212 (50%), Positives = 139/212 (65%), Gaps = 9/212 (4%)
Query: 24 LNPSFWAFCVQALLAQRFNI-GRILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGNR 78
L ++ A + A+ F G + + H Y Y +LPE + PC RYS+E
Sbjct: 61 LEAAYRALTLGGWTARYFQSRGPRQEAALKTHVYRYLRAFLPESGFTILPCTRYSME-TL 119
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA+IV+TR W+KN+ + LVGCIAEL +E ++ LL G NDFS+MYS RK AQLWLGPA
Sbjct: 120 GAKIVSTRAWKKNEKLELLVGCIAEL-READEGLLRAGENDFSIMYSTRKRSAQLWLGPA 178
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
A+INHDC+PNC+FV + + ACVKVLRDIE G+E+TCFYGE FFG+KN CEC TCE +
Sbjct: 179 AFINHDCKPNCKFVPADGNAACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECHTCERKG 238
Query: 199 KGAFAQSSVEPQ-TTQP-SRYCLRDTDMRLNK 228
+GAF EP +P +Y LR+T RL +
Sbjct: 239 EGAFRTRPREPALPPRPLDKYQLRETKRRLQQ 270
>gi|268607617|ref|NP_001161356.1| histone-lysine N-methyltransferase SUV420H1 isoform d [Mus
musculus]
gi|268607627|ref|NP_001161360.1| histone-lysine N-methyltransferase SUV420H1 isoform d [Mus
musculus]
Length = 327
Score = 200 bits (508), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 93/131 (70%), Positives = 108/131 (82%), Gaps = 1/131 (0%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 197 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 255
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 256 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 315
Query: 185 KNCLCECDTCE 195
N CEC TCE
Sbjct: 316 NNEFCECYTCE 326
>gi|15029962|gb|AAH11214.1| Suv420h1 protein [Mus musculus]
Length = 318
Score = 200 bits (508), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 93/131 (70%), Positives = 108/131 (82%), Gaps = 1/131 (0%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 188 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 246
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 247 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 306
Query: 185 KNCLCECDTCE 195
N CEC TCE
Sbjct: 307 NNEFCECYTCE 317
>gi|355703915|gb|EHH30406.1| hypothetical protein EGK_11071 [Macaca mulatta]
Length = 461
Score = 200 bits (508), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 102/180 (56%), Positives = 128/180 (71%), Gaps = 8/180 (4%)
Query: 54 HRYSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE 109
H Y Y +LPE + PC RYS+E N GA+IV+TR W+KN+ + LVGCIAEL +E +
Sbjct: 92 HVYRYLRAFLPESGFTILPCTRYSMETN-GAKIVSTRAWKKNEKLELLVGCIAEL-READ 149
Query: 110 KRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQ 169
+ LL G NDFS+MYS RK AQLWLGPAA+INHDC+PNC+FV + + ACVKVLRDIE
Sbjct: 150 EGLLRAGENDFSIMYSTRKRSAQLWLGPAAFINHDCKPNCKFVPADGNAACVKVLRDIEP 209
Query: 170 GEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQ-TTQP-SRYCLRDTDMRLN 227
G+E+TCFYG+ FFG+KN CEC TCE + +GAF EP +P +Y LR+T RL
Sbjct: 210 GDEVTCFYGDGFFGEKNEHCECYTCERKGEGAFRTRPREPALPPRPLDKYQLRETKRRLQ 269
>gi|402906805|ref|XP_003916173.1| PREDICTED: histone-lysine N-methyltransferase SUV420H2 [Papio
anubis]
Length = 461
Score = 199 bits (507), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 102/180 (56%), Positives = 128/180 (71%), Gaps = 8/180 (4%)
Query: 54 HRYSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE 109
H Y Y +LPE + PC RYS+E N GA+IV+TR W+KN+ + LVGCIAEL +E +
Sbjct: 92 HVYRYLRAFLPESGFTILPCTRYSMETN-GAKIVSTRAWKKNEKLELLVGCIAEL-READ 149
Query: 110 KRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQ 169
+ LL G NDFS+MYS RK AQLWLGPAA+INHDC+PNC+FV + + ACVKVLRDIE
Sbjct: 150 EGLLRAGENDFSIMYSTRKRSAQLWLGPAAFINHDCKPNCKFVPADGNAACVKVLRDIEP 209
Query: 170 GEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQ-TTQP-SRYCLRDTDMRLN 227
G+E+TCFYG+ FFG+KN CEC TCE + +GAF EP +P +Y LR+T RL
Sbjct: 210 GDEVTCFYGDGFFGEKNEHCECYTCERKGEGAFRTRPREPALPPRPLDKYQLRETKRRLQ 269
>gi|383422249|gb|AFH34338.1| histone-lysine N-methyltransferase SUV420H2 [Macaca mulatta]
Length = 461
Score = 199 bits (507), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 102/180 (56%), Positives = 128/180 (71%), Gaps = 8/180 (4%)
Query: 54 HRYSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE 109
H Y Y +LPE + PC RYS+E N GA+IV+TR W+KN+ + LVGCIAEL +E +
Sbjct: 92 HVYRYLRAFLPESGFTILPCTRYSMETN-GAKIVSTRAWKKNEKLELLVGCIAEL-READ 149
Query: 110 KRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQ 169
+ LL G NDFS+MYS RK AQLWLGPAA+INHDC+PNC+FV + + ACVKVLRDIE
Sbjct: 150 EGLLRAGENDFSIMYSTRKRSAQLWLGPAAFINHDCKPNCKFVPADGNAACVKVLRDIEP 209
Query: 170 GEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQ-TTQP-SRYCLRDTDMRLN 227
G+E+TCFYG+ FFG+KN CEC TCE + +GAF EP +P +Y LR+T RL
Sbjct: 210 GDEVTCFYGDGFFGEKNEHCECYTCERKGEGAFRTRPREPALPPRPLDKYQLRETKRRLQ 269
>gi|109126087|ref|XP_001086620.1| PREDICTED: histone-lysine N-methyltransferase SUV420H2 isoform 2
[Macaca mulatta]
Length = 461
Score = 199 bits (507), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 102/180 (56%), Positives = 128/180 (71%), Gaps = 8/180 (4%)
Query: 54 HRYSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE 109
H Y Y +LPE + PC RYS+E N GA+IV+TR W+KN+ + LVGCIAEL +E +
Sbjct: 92 HVYRYLRAFLPESGFTILPCTRYSMETN-GAKIVSTRAWKKNEKLELLVGCIAEL-READ 149
Query: 110 KRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQ 169
+ LL G NDFS+MYS RK AQLWLGPAA+INHDC+PNC+FV + + ACVKVLRDIE
Sbjct: 150 EGLLRAGENDFSIMYSTRKRSAQLWLGPAAFINHDCKPNCKFVPADGNAACVKVLRDIEP 209
Query: 170 GEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQ-TTQP-SRYCLRDTDMRLN 227
G+E+TCFYG+ FFG+KN CEC TCE + +GAF EP +P +Y LR+T RL
Sbjct: 210 GDEVTCFYGDGFFGEKNEHCECYTCERKGEGAFRTRPREPALPPRPLDKYQLRETKRRLQ 269
>gi|354496623|ref|XP_003510425.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1-like
[Cricetulus griseus]
Length = 326
Score = 199 bits (507), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 93/131 (70%), Positives = 108/131 (82%), Gaps = 1/131 (0%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+
Sbjct: 255 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGE 314
Query: 185 KNCLCECDTCE 195
N CEC TCE
Sbjct: 315 NNEFCECYTCE 325
>gi|76642512|ref|XP_598002.2| PREDICTED: histone-lysine N-methyltransferase SUV420H2 [Bos taurus]
Length = 457
Score = 199 bits (507), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 106/211 (50%), Positives = 137/211 (64%), Gaps = 8/211 (3%)
Query: 24 LNPSFWAFCVQALLAQRFNI-GRILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGNR 78
L ++ A + +A F G + + H Y Y +LPE + PC RYS+E N
Sbjct: 61 LEAAYRALTLGGWMAHYFQSRGPRQEAALKTHIYRYLRAFLPESGFTILPCTRYSMETN- 119
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA+IV+TR W+KN+ + LVGCIAEL +E ++ LL G NDFS+MYS RK AQLWLGPA
Sbjct: 120 GAKIVSTRAWKKNEKLELLVGCIAEL-READEELLRAGENDFSIMYSTRKRSAQLWLGPA 178
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
A+INHDC+PNC+FV + + ACVKVLRDIE G+E+TCFYGE FFG+KN CEC TCE R
Sbjct: 179 AFINHDCKPNCKFVPADGNAACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECYTCERRG 238
Query: 199 KGAF-AQSSVEPQTTQPSRYCLRDTDMRLNK 228
+GAF + +Y LR+T RL +
Sbjct: 239 EGAFRLRPREPLPPRPLDKYELRETKRRLQQ 269
>gi|297486135|ref|XP_002695433.1| PREDICTED: histone-lysine N-methyltransferase SUV420H2 [Bos taurus]
gi|296477284|tpg|DAA19399.1| TPA: Suv4-20-like [Bos taurus]
Length = 457
Score = 199 bits (507), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 106/211 (50%), Positives = 137/211 (64%), Gaps = 8/211 (3%)
Query: 24 LNPSFWAFCVQALLAQRFNI-GRILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGNR 78
L ++ A + +A F G + + H Y Y +LPE + PC RYS+E N
Sbjct: 61 LEAAYRALTLGGWMAHYFQSRGPRQEAALKTHIYRYLRAFLPESGFTILPCTRYSMETN- 119
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA+IV+TR W+KN+ + LVGCIAEL +E ++ LL G NDFS+MYS RK AQLWLGPA
Sbjct: 120 GAKIVSTRAWKKNEKLELLVGCIAEL-READEELLRAGENDFSIMYSTRKRSAQLWLGPA 178
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
A+INHDC+PNC+FV + + ACVKVLRDIE G+E+TCFYGE FFG+KN CEC TCE R
Sbjct: 179 AFINHDCKPNCKFVPADGNAACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECYTCERRG 238
Query: 199 KGAF-AQSSVEPQTTQPSRYCLRDTDMRLNK 228
+GAF + +Y LR+T RL +
Sbjct: 239 EGAFRLRPREPLPPRPLDKYELRETKRRLQQ 269
>gi|431910177|gb|ELK13250.1| Histone-lysine N-methyltransferase SUV420H1 [Pteropus alecto]
Length = 937
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 102/180 (56%), Positives = 125/180 (69%), Gaps = 10/180 (5%)
Query: 89 EKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPN 148
++ND I LVGCIAEL++ EE LL G NDFSVMYS RKNCAQLWLGPAA+INHDCRPN
Sbjct: 275 KRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMYSTRKNCAQLWLGPAAFINHDCRPN 334
Query: 149 CRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVE 208
C+FV+T RDTACVK LRDIE GEEI+C+YG+ FFG+ N CEC TCE R GAF +S V
Sbjct: 335 CKFVSTGRDTACVKALRDIEPGEEISCYYGDGFFGENNEFCECYTCERRGTGAF-KSRVG 393
Query: 209 PQTTQP---SRYCLRDTDMRLNKRKLHKKLNRLLLASDKNDTNSSDNSSKYSLVQWNNNS 265
P S+Y LR+TD RLN +L +L +S +D+ S +++ Q NN+
Sbjct: 394 LPAPAPVINSKYGLRETDKRLN------RLKKLGDSSKNSDSQSVSSNTDADTTQEKNNA 447
>gi|147900696|ref|NP_001091656.1| suppressor of variegation 4-20 protein-like 2 [Danio rerio]
gi|146218412|gb|AAI39887.1| Suv420h2 protein [Danio rerio]
Length = 522
Score = 199 bits (505), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 99/188 (52%), Positives = 129/188 (68%), Gaps = 11/188 (5%)
Query: 52 QHHRYSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
+ H Y Y +L E ++PC RYS E N GA+I +TR W + + L GCIAEL+ E
Sbjct: 91 KQHMYRYMTAFLLESGVNIEPCNRYSSETN-GAKITSTRHWLVGERVEVLQGCIAELSPE 149
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDI 167
+ +L G+NDFSVMYS RK CAQLWLGPAA+INHDCRPNC+FV +++ ACVKV+R I
Sbjct: 150 DSA-VLRAGVNDFSVMYSTRKRCAQLWLGPAAFINHDCRPNCKFVPGDKNGACVKVVRPI 208
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQ---TTQPS--RYCLRDT 222
GEEITC+YG+ FFG+KN +CEC TCE R +G+F + P T PS +Y R+T
Sbjct: 209 SPGEEITCYYGDSFFGEKNEMCECCTCERRGEGSFKKRDQSPDFACTGDPSGQKYEFRET 268
Query: 223 DMRLNKRK 230
D+RLN+ +
Sbjct: 269 DLRLNRSR 276
>gi|348559416|ref|XP_003465512.1| PREDICTED: histone-lysine N-methyltransferase SUV420H2-like [Cavia
porcellus]
Length = 497
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 96/182 (52%), Positives = 128/182 (70%), Gaps = 9/182 (4%)
Query: 54 HRYSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE 109
H + Y +LPE + PC RYS+E N GA+IV+TR W+KN+ + LVGCIAEL +E +
Sbjct: 113 HVFRYLRAFLPESGFTILPCTRYSMETN-GAKIVSTRAWKKNEKLELLVGCIAEL-READ 170
Query: 110 KRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQ 169
+ LL G NDFS+MYS RK QLWLGPAA+INHDC+PNC+FV + + ACVK LRDIE
Sbjct: 171 EDLLRAGENDFSIMYSTRKRSMQLWLGPAAFINHDCKPNCKFVPADGNAACVKALRDIEP 230
Query: 170 GEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQ---TTQPSRYCLRDTDMRL 226
G+E+TC+YG+ FFG+KN CEC TCE + +GAF+ +P+ + P +Y R+T RL
Sbjct: 231 GDEVTCYYGDGFFGEKNEHCECYTCERKGEGAFSLKPRDPEMPPSQTPDKYAFRETKCRL 290
Query: 227 NK 228
+
Sbjct: 291 QR 292
>gi|26352408|dbj|BAC39834.1| unnamed protein product [Mus musculus]
Length = 318
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 92/131 (70%), Positives = 108/131 (82%), Gaps = 1/131 (0%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 188 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 246
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RKNCAQLWLGPAA+INHDCRPNC+FV+T RDTAC+K LRDIE GEEI+C+YG+ FFG+
Sbjct: 247 STRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACLKALRDIEPGEEISCYYGDGFFGE 306
Query: 185 KNCLCECDTCE 195
N CEC TCE
Sbjct: 307 NNEFCECYTCE 317
>gi|194216042|ref|XP_001489839.2| PREDICTED: histone-lysine N-methyltransferase SUV420H2 [Equus
caballus]
Length = 669
Score = 197 bits (502), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 107/211 (50%), Positives = 140/211 (66%), Gaps = 8/211 (3%)
Query: 24 LNPSFWAFCVQALLAQRFNI-GRILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGNR 78
L ++ A + +A F G + + H Y Y +LPE + PC RYS+E N
Sbjct: 61 LEAAYRALTLGGWMAHYFQSRGPRQEAALKTHIYRYLRAFLPESGFTILPCTRYSMEKN- 119
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA+IV+TR W+KN+ + LVGCIAEL +E ++ LL G NDFS+MYS RK AQLWLGPA
Sbjct: 120 GAKIVSTRAWKKNEKLELLVGCIAEL-READEGLLRAGENDFSIMYSTRKRSAQLWLGPA 178
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
A+INHDC+PNC+FV + + ACVKVLRDIE G+E+TCFYG+ FFG+KN CEC TCE +
Sbjct: 179 AFINHDCKPNCKFVPADGNAACVKVLRDIEPGDEVTCFYGDGFFGEKNEHCECYTCERKG 238
Query: 199 KGAFAQSSVEPQTTQPS-RYCLRDTDMRLNK 228
+GAF EP +P +Y LR+T RL +
Sbjct: 239 EGAFRLRPREPVPPRPLDKYELRETKRRLQQ 269
>gi|345785418|ref|XP_541411.3| PREDICTED: histone-lysine N-methyltransferase SUV420H2 [Canis lupus
familiaris]
Length = 496
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 105/211 (49%), Positives = 137/211 (64%), Gaps = 8/211 (3%)
Query: 24 LNPSFWAFCVQALLAQRFNI-GRILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGNR 78
L ++ A + +A F G + + H Y Y +LPE + PC RYS+E N
Sbjct: 61 LEAAYRALTLGGWMAHYFQSRGPRQEAALKTHIYRYLRAFLPESGFTILPCTRYSMETN- 119
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA+IV+TR W+KN+ + LVGCIAEL +E ++ LL G NDFS+MYS RK AQLWLGPA
Sbjct: 120 GAKIVSTRAWKKNEKLELLVGCIAEL-READEGLLRAGENDFSIMYSTRKRIAQLWLGPA 178
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
A+INHDC+PNC+FV + + ACVKVLRDIE G+E+TCFYGE FFG+KN CEC TCE +
Sbjct: 179 AFINHDCKPNCKFVPADGNAACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECYTCERKG 238
Query: 199 KGAF-AQSSVEPQTTQPSRYCLRDTDMRLNK 228
+GAF + +Y LR+T RL +
Sbjct: 239 EGAFRLRPREPLPPRPLDKYELRETKRRLQQ 269
>gi|301782041|ref|XP_002926449.1| PREDICTED: histone-lysine N-methyltransferase SUV420H2-like
[Ailuropoda melanoleuca]
Length = 461
Score = 196 bits (499), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 105/211 (49%), Positives = 137/211 (64%), Gaps = 8/211 (3%)
Query: 24 LNPSFWAFCVQALLAQRFNI-GRILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGNR 78
L ++ A + +A F G + + H Y Y +LPE + PC RYS+E N
Sbjct: 61 LEAAYRALTLGGWMAHYFQSRGPRQEAALKTHIYRYLRAFLPESGFTILPCTRYSMETN- 119
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA+IV+TR W+KN+ + LVGCIAEL +E ++ LL G NDFS+MYS RK AQLWLGPA
Sbjct: 120 GAKIVSTRAWKKNEKLELLVGCIAEL-READEGLLRAGENDFSIMYSTRKRSAQLWLGPA 178
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
A+INHDC+PNC+FV + + ACVKVLRDIE G+E+TCFYGE FFG+KN CEC TCE +
Sbjct: 179 AFINHDCKPNCKFVPADGNAACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECYTCERKG 238
Query: 199 KGAF-AQSSVEPQTTQPSRYCLRDTDMRLNK 228
+GAF + +Y LR+T RL +
Sbjct: 239 EGAFRLRPREPLPPRPLDKYELRETKRRLQQ 269
>gi|281344140|gb|EFB19724.1| hypothetical protein PANDA_016084 [Ailuropoda melanoleuca]
Length = 459
Score = 196 bits (499), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 105/211 (49%), Positives = 137/211 (64%), Gaps = 8/211 (3%)
Query: 24 LNPSFWAFCVQALLAQRFNI-GRILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGNR 78
L ++ A + +A F G + + H Y Y +LPE + PC RYS+E N
Sbjct: 61 LEAAYRALTLGGWMAHYFQSRGPRQEAALKTHIYRYLRAFLPESGFTILPCTRYSMETN- 119
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA+IV+TR W+KN+ + LVGCIAEL +E ++ LL G NDFS+MYS RK AQLWLGPA
Sbjct: 120 GAKIVSTRAWKKNEKLELLVGCIAEL-READEGLLRAGENDFSIMYSTRKRSAQLWLGPA 178
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
A+INHDC+PNC+FV + + ACVKVLRDIE G+E+TCFYGE FFG+KN CEC TCE +
Sbjct: 179 AFINHDCKPNCKFVPADGNAACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECYTCERKG 238
Query: 199 KGAF-AQSSVEPQTTQPSRYCLRDTDMRLNK 228
+GAF + +Y LR+T RL +
Sbjct: 239 EGAFRLRPREPLPPRPLDKYELRETKRRLQQ 269
>gi|327282378|ref|XP_003225920.1| PREDICTED: histone-lysine N-methyltransferase SUV420H2-like [Anolis
carolinensis]
Length = 569
Score = 196 bits (499), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 103/195 (52%), Positives = 130/195 (66%), Gaps = 8/195 (4%)
Query: 42 NIGRILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYL 97
N R + + H + Y +LPE +K C RYSLE N GAR+V+ + W KND + L
Sbjct: 80 NRSRQQEAAFKTHIFRYLRIFLPESGFVIKRCNRYSLEIN-GARVVSAKSWRKNDKLELL 138
Query: 98 VGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERD 157
G IA+LT+++E LL TG NDFSVMYS RK AQLWLGPA++INHDCRPNC+FV + +
Sbjct: 139 EGYIADLTEQDES-LLRTGENDFSVMYSTRKQSAQLWLGPASFINHDCRPNCKFVPMDGN 197
Query: 158 TACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAF--AQSSVEPQTTQPS 215
ACVKVLRDIE +EITCFYG+ FFG+ N LCEC TCE + GAF + P +
Sbjct: 198 IACVKVLRDIEPDDEITCFYGDGFFGENNELCECYTCERKGGGAFRLLNQTPSPSISDNE 257
Query: 216 RYCLRDTDMRLNKRK 230
+Y LR+TD RL + K
Sbjct: 258 KYLLRETDGRLQRWK 272
>gi|156388978|ref|XP_001634769.1| predicted protein [Nematostella vectensis]
gi|156221856|gb|EDO42706.1| predicted protein [Nematostella vectensis]
Length = 247
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 101/178 (56%), Positives = 120/178 (67%), Gaps = 17/178 (9%)
Query: 18 CHLLNKLNPSFWAFCVQALLAQRFNIGRILDMGTQHHRYSYRIQYLPELKPCYRYSLEGN 77
C+ LNK AF L R+ +G H ++I C RYS E N
Sbjct: 86 CYFLNKSKNQIAAFKDHML---RY-------IGIFHRDAGFKIH------KCNRYSGESN 129
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGP 137
GA++V+T W K D + L GCIAE+++EEEK LL G NDFS+M+S RKN +QLWLGP
Sbjct: 130 -GAKVVSTGHWAKGDKLPNLCGCIAEMSEEEEKALLRAGENDFSIMFSTRKNLSQLWLGP 188
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCE 195
AAYINHDCRPNC+FV+T RDTACVKVLRD+E G+EITCFYGEDFFGD NC CEC TCE
Sbjct: 189 AAYINHDCRPNCKFVSTGRDTACVKVLRDLEVGDEITCFYGEDFFGDDNCNCECVTCE 246
>gi|334359552|pdb|3RQ4|A Chain A, Crystal Structure Of Suppressor Of Variegation 4-20
Homolog 2
Length = 247
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 99/184 (53%), Positives = 127/184 (69%), Gaps = 7/184 (3%)
Query: 24 LNPSFWAFCVQALLAQRFNI-GRILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGNR 78
L ++ A + A+ F G + + H Y Y +LPE + PC RYS+E N
Sbjct: 60 LEAAYRALTLGGWTARYFQSRGPRQEAALKTHVYRYLRAFLPESGFTILPCTRYSMETN- 118
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA+IV+TR W+KN+ + LVGCIAEL +E ++ LL G NDFS+MYS RK AQLWLGPA
Sbjct: 119 GAKIVSTRAWKKNEKLELLVGCIAEL-READEGLLRAGENDFSIMYSTRKRSAQLWLGPA 177
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
A+INHDC+PNC+FV + + ACVKVLRDIE G+E+TCFYGE FFG+KN CEC TCE +
Sbjct: 178 AFINHDCKPNCKFVPADGNAACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECHTCERKG 237
Query: 199 KGAF 202
+GAF
Sbjct: 238 EGAF 241
>gi|355722725|gb|AES07666.1| suppressor of variegation 4-20-like protein 2 [Mustela putorius
furo]
Length = 244
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 99/184 (53%), Positives = 127/184 (69%), Gaps = 7/184 (3%)
Query: 24 LNPSFWAFCVQALLAQRFNI-GRILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGNR 78
L ++ A + +A F G + + H Y Y +LPE + PC RYS+E N
Sbjct: 61 LEAAYRALTLGGWMAHYFQSRGPRQEAALKTHIYRYLRAFLPESGFTILPCTRYSMETN- 119
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA+IV+TR W+KN+ + LVGCIAEL +E ++ LL G NDFS+MYS RK AQLWLGPA
Sbjct: 120 GAKIVSTRAWKKNEKLELLVGCIAEL-READEGLLRAGENDFSIMYSTRKRSAQLWLGPA 178
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
A+INHDC+PNC+FV + + ACVKVLRDIE G+E+TCFYGE FFG+KN CEC TCE +
Sbjct: 179 AFINHDCKPNCKFVPADGNAACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECYTCERKG 238
Query: 199 KGAF 202
+GAF
Sbjct: 239 EGAF 242
>gi|426244242|ref|XP_004015935.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SUV420H2 [Ovis aries]
Length = 373
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 98/169 (57%), Positives = 123/169 (72%), Gaps = 11/169 (6%)
Query: 46 ILD--MGTQHHR---YSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAY 96
+LD +G + H+ Y Y +LPE + PC RYS+E N GA+IV+TR W+KN+ +
Sbjct: 22 VLDPYLGFRTHKMNIYRYLRAFLPESGFTILPCTRYSMETN-GAKIVSTRAWKKNEKLEL 80
Query: 97 LVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATER 156
LVGCIAEL +E ++ LL G NDFS+MYS RK AQLWLGPAA+INHDC+PNC+FV +
Sbjct: 81 LVGCIAEL-READEELLRAGENDFSIMYSTRKRSAQLWLGPAAFINHDCKPNCKFVPADG 139
Query: 157 DTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQS 205
+ ACVKVLRDIE G+E+TCFYGE FFG+KN CEC TCE KG A+S
Sbjct: 140 NAACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECYTCERWAKGLGARS 188
>gi|449683871|ref|XP_002166241.2| PREDICTED: histone-lysine N-methyltransferase SUV420H1-A-like
[Hydra magnipapillata]
Length = 487
Score = 191 bits (484), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 98/166 (59%), Positives = 123/166 (74%), Gaps = 5/166 (3%)
Query: 68 PCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCR 127
PC RY+ E GA++++TRKWE I YLVGCIAELT+EEE +LL G NDFSVM+S R
Sbjct: 110 PCTRYTTE-EVGAKVLSTRKWESGAKIKYLVGCIAELTEEEETKLLVPGENDFSVMFSTR 168
Query: 128 KNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
KNC+QLWLGPAAYINHDCRPNC+FV+T RDTAC+KVLRD+E GEE+ C+YG+D+FGD N
Sbjct: 169 KNCSQLWLGPAAYINHDCRPNCKFVSTGRDTACIKVLRDVEIGEELACYYGDDYFGDNNI 228
Query: 188 LCECDTCEIRMKG----AFAQSSVEPQTTQPSRYCLRDTDMRLNKR 229
CEC+TCE R G +S + +Y LR+T RL+++
Sbjct: 229 YCECETCERRKTGFFVNFNNVNSNNNKEKSSGKYTLRETVKRLSRK 274
>gi|351710539|gb|EHB13458.1| Histone-lysine N-methyltransferase SUV420H2, partial
[Heterocephalus glaber]
Length = 297
Score = 191 bits (484), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 102/212 (48%), Positives = 136/212 (64%), Gaps = 9/212 (4%)
Query: 24 LNPSFWAFCVQALLAQRFNI-GRILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGNR 78
L ++ A + +A F G + + H + Y +LP+ + PC RYS+E N
Sbjct: 61 LEAAYRALTLGDWMAHYFQSRGPRQEAALKTHVFRYLRAFLPDSGFTILPCTRYSMETN- 119
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA+IV+TR W+KN+ + LVGCIAEL +E ++ LL G NDFS+MYS RK AQLWLGPA
Sbjct: 120 GAKIVSTRAWKKNEKLELLVGCIAEL-READEDLLRAGENDFSIMYSTRKRSAQLWLGPA 178
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
A+INHDC+PNC+FV + + ACVK LRDIE G+E+TCFYG+ FFG+KN CEC TCE +
Sbjct: 179 AFINHDCKPNCKFVPADGNAACVKALRDIEPGDEVTCFYGDGFFGEKNEHCECYTCERKG 238
Query: 199 KGAF--AQSSVEPQTTQPSRYCLRDTDMRLNK 228
+GAF E P +Y R+T RL +
Sbjct: 239 EGAFRLQPREPELPPPPPDKYAFRETKRRLQR 270
>gi|332256552|ref|XP_003277380.1| PREDICTED: histone-lysine N-methyltransferase SUV420H2 [Nomascus
leucogenys]
Length = 462
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 99/181 (54%), Positives = 125/181 (69%), Gaps = 8/181 (4%)
Query: 54 HRYSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE 109
H Y Y +LPE + PC RYS+E N GA+IV+TR W+KN+ + LVGCIAEL +E +
Sbjct: 92 HVYRYLRAFLPESGFTILPCTRYSMETN-GAKIVSTRAWKKNEKLELLVGCIAEL-READ 149
Query: 110 KRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQ 169
+ LL G NDFS+MYS RK AQLWLGPAA+INH+C +FV + + ACVKVLRDIE
Sbjct: 150 EGLLRAGENDFSIMYSTRKRSAQLWLGPAAFINHECGLGPQFVPADGNAACVKVLRDIEP 209
Query: 170 GEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQ-TTQP-SRYCLRDTDMRLN 227
G+E+TCFYGE FFG+KN CEC TCE + +GAF EP +P +Y LR+T RL
Sbjct: 210 GDEVTCFYGEGFFGEKNEHCECHTCERKGEGAFRTRPREPALPPRPLDKYQLRETKRRLQ 269
Query: 228 K 228
+
Sbjct: 270 Q 270
>gi|431902995|gb|ELK09177.1| Histone-lysine N-methyltransferase SUV420H2 [Pteropus alecto]
Length = 571
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 100/186 (53%), Positives = 125/186 (67%), Gaps = 15/186 (8%)
Query: 54 HRYSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE 109
H Y Y +LPE + PC RYS+E N GA+IV+TR W+KN+ + LVGCIAEL +E +
Sbjct: 199 HIYRYLRAFLPESGFTILPCTRYSMETN-GAKIVSTRAWKKNEKLELLVGCIAEL-READ 256
Query: 110 KRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINH--------DCRPNCRFVATERDTACV 161
+ LL G NDFS+MYS RK AQLWLGPAA+INH DC+PNC+FV + + ACV
Sbjct: 257 EGLLRAGENDFSIMYSTRKRSAQLWLGPAAFINHGAVLRAPGDCKPNCKFVPADGNAACV 316
Query: 162 KVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAF-AQSSVEPQTTQPSRYCLR 220
KVLRDIE G+E+TCFYGE FFG+KN CEC TCE + +GAF + +Y LR
Sbjct: 317 KVLRDIEPGDEVTCFYGEGFFGEKNEHCECYTCERKGEGAFRLRPRDPLPPRPLDKYQLR 376
Query: 221 DTDMRL 226
+T RL
Sbjct: 377 ETKRRL 382
>gi|19353490|gb|AAH24816.1| Suppressor of variegation 4-20 homolog 2 (Drosophila) [Mus
musculus]
gi|133777083|gb|AAH85473.2| Suppressor of variegation 4-20 homolog 2 (Drosophila) [Mus
musculus]
Length = 353
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 92/152 (60%), Positives = 117/152 (76%), Gaps = 3/152 (1%)
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA+IV+TR W+KN+ + LVGCIAEL +EE++ LL G NDFS+MYS RK AQLWLGPA
Sbjct: 5 GAKIVSTRAWKKNEKLELLVGCIAEL-REEDEDLLRAGENDFSIMYSTRKRSAQLWLGPA 63
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
A+INHDC+PNC+FV ++ +TACVKVLRDIE G+E+TCFYGE FFG+KN CEC TCE +
Sbjct: 64 AFINHDCKPNCKFVPSDGNTACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECYTCERKG 123
Query: 199 KGAFAQSSVEPQT-TQP-SRYCLRDTDMRLNK 228
+GAF EP+ +P +Y LR+T RL +
Sbjct: 124 EGAFRLQPREPELRPKPLDKYELRETKRRLQQ 155
>gi|195564467|ref|XP_002105839.1| GD16466 [Drosophila simulans]
gi|194203201|gb|EDX16777.1| GD16466 [Drosophila simulans]
Length = 1021
Score = 188 bits (477), Expect = 3e-45, Method: Composition-based stats.
Identities = 98/147 (66%), Positives = 109/147 (74%), Gaps = 13/147 (8%)
Query: 97 LVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATER 156
LVGCIAELT+ EE LLH+G NDFSVMYSCRKNCAQLWLGPAAYINHDCR NC+F+AT R
Sbjct: 4 LVGCIAELTEAEEAALLHSGKNDFSVMYSCRKNCAQLWLGPAAYINHDCRANCKFLATGR 63
Query: 157 DTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFA----------QSS 206
DTACVKVLRDIE GEEITCFYGEDFFGD N CEC+TCE R GAFA
Sbjct: 64 DTACVKVLRDIEVGEEITCFYGEDFFGDSNRYCECETCERRGTGAFAGKDDGLMLGLSMG 123
Query: 207 VEPQTTQPSR---YCLRDTDMRLNKRK 230
+ ++ P Y LR+TD R+N+ K
Sbjct: 124 LGLASSGPGNNGGYRLRETDNRINRIK 150
>gi|13543361|gb|AAH05842.1| SUV420H2 protein [Homo sapiens]
gi|17939508|gb|AAH19313.1| SUV420H2 protein [Homo sapiens]
gi|119592772|gb|EAW72366.1| suppressor of variegation 4-20 homolog 2 (Drosophila), isoform
CRA_c [Homo sapiens]
Length = 347
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 90/152 (59%), Positives = 113/152 (74%), Gaps = 3/152 (1%)
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA+IV+TR W+KN+ + LVGCIAEL +E ++ LL G NDFS+MYS RK AQLWLGPA
Sbjct: 5 GAKIVSTRAWKKNEKLELLVGCIAEL-READEGLLRAGENDFSIMYSTRKRSAQLWLGPA 63
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
A+INHDC+PNC+FV + + ACVKVLRDIE G+E+TCFYGE FFG+KN CEC TCE +
Sbjct: 64 AFINHDCKPNCKFVPADGNAACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECHTCERKG 123
Query: 199 KGAFAQSSVEPQ-TTQP-SRYCLRDTDMRLNK 228
+GAF EP +P +Y LR+T RL +
Sbjct: 124 EGAFRTRPREPALPPRPLDKYQLRETKRRLQQ 155
>gi|320168043|gb|EFW44942.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 980
Score = 183 bits (465), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 81/144 (56%), Positives = 105/144 (72%)
Query: 68 PCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCR 127
P RYS + +GA++ A+R+W K + I L GCIAE+T +EEK LL G NDFSVMYS R
Sbjct: 127 PTNRYSQDSRQGAKLCASRRWHKGEIIPLLCGCIAEITPQEEKSLLRAGQNDFSVMYSTR 186
Query: 128 KNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
K AQLWLGPAAYINHDC P+C+F+ T ++TACV V +DI+ GEEITC+YG FFG++N
Sbjct: 187 KEIAQLWLGPAAYINHDCSPSCKFIPTSKNTACVLVCKDIDIGEEITCYYGRHFFGEENE 246
Query: 188 LCECDTCEIRMKGAFAQSSVEPQT 211
CEC +CEI G F+++ + T
Sbjct: 247 HCECHSCEIGQNGKFSRTDSQSST 270
>gi|348510088|ref|XP_003442578.1| PREDICTED: hypothetical protein LOC100691614 [Oreochromis
niloticus]
Length = 1724
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 98/219 (44%), Positives = 132/219 (60%), Gaps = 12/219 (5%)
Query: 24 LNPSFWAFCVQALLAQRFN-IGRILDMGTQHHRYSYRIQYL----PELKPCYRYSLEGNR 78
N +F A V L FN +G + H Y Y +L +++ C RYS E N
Sbjct: 62 FNATFEALTVGELAGDYFNALGSHRQELLRQHVYRYLSAFLLDSGVKIESCDRYSSETN- 120
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA+I +TR W + + L+GCIAEL+ + +L G+NDFSVMYS RK CAQLWLGPA
Sbjct: 121 GAKITSTRHWFVGERVEVLLGCIAELSPADSA-VLRAGVNDFSVMYSTRKRCAQLWLGPA 179
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
A+INHDC+PNC+FV E++ ACV+V+R I GEEITC+YG FFG+ N +CEC TCE
Sbjct: 180 AFINHDCKPNCKFVPGEKNIACVEVIRPIAPGEEITCYYGASFFGEGNEMCECCTCERNG 239
Query: 199 KGAFAQSSVEP---QTTQP--SRYCLRDTDMRLNKRKLH 232
+G F +P +T P +Y LR+ +R + + H
Sbjct: 240 EGHFKHRGKQPEIEETKDPVGQKYRLRERYLRHQRERGH 278
>gi|326432429|gb|EGD77999.1| hypothetical protein PTSG_09637 [Salpingoeca sp. ATCC 50818]
Length = 967
Score = 177 bits (450), Expect = 3e-42, Method: Composition-based stats.
Identities = 83/165 (50%), Positives = 110/165 (66%), Gaps = 5/165 (3%)
Query: 43 IGRILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLV 98
+ R++D + H Y +LP + PCYRY++ RGA+++ATR W + D IA L
Sbjct: 475 LARLIDPMFKEHAMRYIQCFLPHAGFVIVPCYRYAIASRRGAKVLATRPWRRGDTIALLC 534
Query: 99 GCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDT 158
GCIA+++ EEK L G+NDFSVM+S RK+C+QLWLGPAA+INHDC P C+F A
Sbjct: 535 GCIADMSDGEEKAYLRPGVNDFSVMFSNRKDCSQLWLGPAAFINHDCHPTCKFTAAGTTA 594
Query: 159 ACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFA 203
V VLRDI+ EITC+YG++FFG N C C+TCE +GAFA
Sbjct: 595 T-VTVLRDIDVNMEITCYYGDNFFGPNNSHCMCETCERLGRGAFA 638
>gi|432867633|ref|XP_004071279.1| PREDICTED: uncharacterized protein LOC101165555 [Oryzias latipes]
Length = 1938
Score = 177 bits (448), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 95/219 (43%), Positives = 131/219 (59%), Gaps = 12/219 (5%)
Query: 24 LNPSFWAFCVQALLAQRFN-IGRILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGNR 78
N +F A V F +G + H Y Y +L + ++PC RYS E N
Sbjct: 62 FNAAFEALTVGEPAGDFFGALGSHRQELLRQHVYRYLSAFLLDSGIKIEPCDRYSSETN- 120
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA+I +TR W + + L+GCIAEL+ + + G+NDFSVMYS RK CAQLWLGPA
Sbjct: 121 GAKITSTRHWFIGERVEVLLGCIAELSPADSC-FVTAGVNDFSVMYSTRKRCAQLWLGPA 179
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
A++NHDC+PNC+FV E++ ACV+V+R I GEEITC+YG FFG+ N +CEC TCE
Sbjct: 180 AFLNHDCKPNCKFVPCEKNVACVEVIRPISPGEEITCYYGASFFGEGNEMCECCTCERNG 239
Query: 199 KGAFAQSSVEP---QTTQPS--RYCLRDTDMRLNKRKLH 232
+G F +P +T +P +Y LR+ +R + + H
Sbjct: 240 EGHFKHRGKQPECEETKEPGGQKYRLRERYLRHQRERGH 278
>gi|402593845|gb|EJW87772.1| SET domain-containing protein [Wuchereria bancrofti]
Length = 655
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 89/171 (52%), Positives = 110/171 (64%), Gaps = 11/171 (6%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++ CYRY E G +VAT+ W+K D I LVG I +L+ +EE +LL +NDFSVMYS
Sbjct: 123 VQACYRYKTENRLGGMLVATKSWQKGDVIGSLVGVIGDLSHDEESQLLRKDINDFSVMYS 182
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
RK AQLWLGPAAYINHDC PNC+FV A V+VLRDI G+EIT +YG++FFGD
Sbjct: 183 TRKQRAQLWLGPAAYINHDCYPNCKFVPNSY-AAVVQVLRDIAIGDEITAYYGDNFFGDN 241
Query: 186 NCLCECDTCEIRMKGAF------AQSSVEPQTTQP----SRYCLRDTDMRL 226
NC CEC TCE KG F A +S+E + + + Y R TD RL
Sbjct: 242 NCRCECLTCERACKGFFTSLVVTAAASLEKNSAESKESGNSYKFRITDSRL 292
>gi|324508486|gb|ADY43581.1| Histone-lysine N-methyltransferase SUV420H2 [Ascaris suum]
Length = 522
Score = 175 bits (443), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 95/187 (50%), Positives = 115/187 (61%), Gaps = 17/187 (9%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++ C RY E GA +V T+ W K + I LVG I +L+ EEE LL +NDFSVMYS
Sbjct: 123 IQRCTRYKAERRHGAMLVVTKPWRKGEVIESLVGVIGDLSAEEEVELLKKDVNDFSVMYS 182
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
RK AQLWLGP AYINHDCRPNC FV TA ++VLRDI GEEITCFYGE+FFGD
Sbjct: 183 TRKKRAQLWLGPGAYINHDCRPNCTFVPNG-PTAVIQVLRDIATGEEITCFYGENFFGDN 241
Query: 186 NCLCECDTCEIRMKGAFAQSSVEPQ-----TTQPSRYCLRDTDMRLNKRKLHKKLNRLLL 240
N CEC TCE R +GAF ++ + ++ RY LR+T+ R R +
Sbjct: 242 NERCECYTCERRNEGAFKKAKRTRRRNGNAGSERQRYVLRETEWR-----------RERV 290
Query: 241 ASDKNDT 247
AS NDT
Sbjct: 291 ASVDNDT 297
>gi|196011419|ref|XP_002115573.1| hypothetical protein TRIADDRAFT_29474 [Trichoplax adhaerens]
gi|190581861|gb|EDV21936.1| hypothetical protein TRIADDRAFT_29474 [Trichoplax adhaerens]
Length = 241
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 79/128 (61%), Positives = 102/128 (79%), Gaps = 1/128 (0%)
Query: 68 PCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCR 127
P RYS E A+I T +W K+ I L GCIAE+T++EEKRLL G NDFS+M+S R
Sbjct: 105 PVRRYSAE-KLEAKISVTTEWLKSSKIHDLCGCIAEMTEDEEKRLLLPGENDFSIMFSTR 163
Query: 128 KNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
K C+QLWLGPAAYINHDCRP+C+F++T R+TACVKVLRD++ G+E+TC+YG++FFGD N
Sbjct: 164 KKCSQLWLGPAAYINHDCRPSCKFISTGRNTACVKVLRDLKAGDEVTCYYGDNFFGDNNK 223
Query: 188 LCECDTCE 195
C+C+TCE
Sbjct: 224 DCKCETCE 231
>gi|170574229|ref|XP_001892719.1| SET domain containing protein [Brugia malayi]
gi|158601529|gb|EDP38422.1| SET domain containing protein [Brugia malayi]
Length = 645
Score = 172 bits (436), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 89/171 (52%), Positives = 108/171 (63%), Gaps = 11/171 (6%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++ CYRY E G +VAT+ W+K D I LVG I +L+ EE +LL +NDFSVMYS
Sbjct: 123 VQACYRYKTENRLGGMLVATKSWQKGDVIGSLVGVIGDLSHNEESQLLRKDINDFSVMYS 182
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
RK AQLWLGPAAYINHDC PNC+FV A V+VLRDI G+EIT +YG++FFGD
Sbjct: 183 TRKQRAQLWLGPAAYINHDCYPNCKFVPNSY-AAVVQVLRDIAIGDEITAYYGDNFFGDN 241
Query: 186 NCLCECDTCEIRMKGAF------AQSSVEPQTTQP----SRYCLRDTDMRL 226
NC CEC TCE KG F A +S+E + + Y R TD RL
Sbjct: 242 NCRCECLTCERSCKGFFTSLVVTAAASLEKNCAESKESGNSYKFRVTDSRL 292
>gi|312072227|ref|XP_003138969.1| SET domain-containing protein [Loa loa]
gi|307765865|gb|EFO25099.1| SET domain-containing protein [Loa loa]
Length = 652
Score = 172 bits (436), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 89/171 (52%), Positives = 108/171 (63%), Gaps = 11/171 (6%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++ C+RY E G +VAT+ W+K D I LVG I +L+ EEE +LL +NDFSVMYS
Sbjct: 123 VQACHRYKTENRLGGMLVATKSWQKGDVIGSLVGVIGDLSHEEESQLLRKDVNDFSVMYS 182
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
RK AQLWLGPAAYINHDC PNC+FV A V+VLRDI GEEIT +YG++FFGD
Sbjct: 183 TRKQRAQLWLGPAAYINHDCYPNCKFVPNSY-AAVVQVLRDIALGEEITAYYGDNFFGDN 241
Query: 186 NCLCECDTCEIRMKGAF------AQSSVEPQTTQP----SRYCLRDTDMRL 226
N CEC TCE KG F A +S+E + + Y R TD RL
Sbjct: 242 NSRCECLTCERACKGFFTSLMVTAAASLEKNNGESKESGNSYKFRVTDSRL 292
>gi|410903021|ref|XP_003964992.1| PREDICTED: uncharacterized protein LOC101069597 [Takifugu rubripes]
Length = 1750
Score = 171 bits (433), Expect = 3e-40, Method: Composition-based stats.
Identities = 80/147 (54%), Positives = 106/147 (72%), Gaps = 5/147 (3%)
Query: 52 QH-HRY--SYRIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEE 108
QH HRY ++ + +++ C RYS E N GA+I +TR W + + L+GCIAEL+ +
Sbjct: 92 QHVHRYLSAFLLDSGIKIESCDRYSSETN-GAKITSTRHWFAGERVEVLLGCIAELSPAD 150
Query: 109 EKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIE 168
+L G+NDFSVMYS RK CAQLWLGPAA+INHDC+PNC+F+ E++ ACV+V+R I
Sbjct: 151 SA-VLRAGVNDFSVMYSTRKRCAQLWLGPAAFINHDCKPNCKFLPGEKNIACVEVIRPIS 209
Query: 169 QGEEITCFYGEDFFGDKNCLCECDTCE 195
GEEITC+YG FFG+ N +CEC TCE
Sbjct: 210 PGEEITCYYGASFFGEGNEMCECCTCE 236
>gi|268531558|ref|XP_002630905.1| C. briggsae CBR-SET-4 protein [Caenorhabditis briggsae]
gi|302595854|sp|A8WTV9.1|SUV42_CAEBR RecName: Full=Histone-lysine N-methyltransferase Suv4-20; AltName:
Full=SET domain-containing protein 4
Length = 326
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 85/163 (52%), Positives = 106/163 (65%), Gaps = 8/163 (4%)
Query: 53 HHRYSYRIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRL 112
HH Y IQ C RYSLEGN+GA++VATR W + D I L G + L+ ++E +
Sbjct: 155 HHDSGYTIQ------ECTRYSLEGNQGAKLVATRAWYRGDKIQRLSGVVCLLSTQDEDTI 208
Query: 113 LHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEE 172
L +DFSVMYS RK C+ LWLGP AYINHDCRP C FV+ TA ++VLRD+ G+E
Sbjct: 209 LQPEGSDFSVMYSNRKRCSTLWLGPGAYINHDCRPTCEFVS-HGSTAHIRVLRDMVAGDE 267
Query: 173 ITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQTTQPS 215
ITCFYG +FFG KN CEC TCE +G F+ S E + +PS
Sbjct: 268 ITCFYGSEFFGPKNMDCECLTCEKTKRGKFSTSD-EEENDEPS 309
>gi|195133622|ref|XP_002011238.1| GI16112 [Drosophila mojavensis]
gi|193907213|gb|EDW06080.1| GI16112 [Drosophila mojavensis]
Length = 869
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 73/86 (84%), Positives = 76/86 (88%)
Query: 118 NDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
NDFSVMYS RKNCAQLWLGPAAYINHDCR NC+F+AT RDTACVKVLRDIE GEEITCFY
Sbjct: 25 NDFSVMYSGRKNCAQLWLGPAAYINHDCRANCKFLATGRDTACVKVLRDIEVGEEITCFY 84
Query: 178 GEDFFGDKNCLCECDTCEIRMKGAFA 203
GEDFFGD N CEC+TCE R GAFA
Sbjct: 85 GEDFFGDSNRYCECETCERRGTGAFA 110
>gi|17532239|ref|NP_495272.1| Protein SET-4 [Caenorhabditis elegans]
gi|2496921|sp|Q09265.1|SUV42_CAEEL RecName: Full=Histone-lysine N-methyltransferase Suv4-20; AltName:
Full=SET domain-containing protein 4; AltName:
Full=Suppressor of variegation 4-20 homolog;
Short=Su(var)4-20 homolog
gi|351058570|emb|CCD66031.1| Protein SET-4 [Caenorhabditis elegans]
Length = 288
Score = 163 bits (413), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 86/192 (44%), Positives = 115/192 (59%), Gaps = 16/192 (8%)
Query: 12 VPIQLACHLLNKLNPSFWAFCVQALLAQRFNIGRILDMGTQHHRYSYRIQYLPELKPCYR 71
+ ++ H L+KL P+ L R +I R L+M Y+ ++ C R
Sbjct: 88 LTLRSVSHFLSKLPPN-------KLFEFRDHIVRFLNMFILDSGYT--------IQECKR 132
Query: 72 YSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCA 131
YS EG++GA++V+T W + D I L G + L+ E+E +L +DFSVMYS RK C+
Sbjct: 133 YSQEGHQGAKLVSTGVWSRGDKIERLSGVVCLLSSEDEDSILAQEGSDFSVMYSTRKRCS 192
Query: 132 QLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCEC 191
LWLGP AYINHDCRP C FV+ TA ++VLRD+ G+EITCFYG +FFG N CEC
Sbjct: 193 TLWLGPGAYINHDCRPTCEFVS-HGSTAHIRVLRDMVPGDEITCFYGSEFFGPNNIDCEC 251
Query: 192 DTCEIRMKGAFA 203
TCE M GAF+
Sbjct: 252 CTCEKNMNGAFS 263
>gi|341892185|gb|EGT48120.1| CBN-SET-4 protein [Caenorhabditis brenneri]
Length = 324
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 82/164 (50%), Positives = 105/164 (64%), Gaps = 9/164 (5%)
Query: 53 HHRYSYRIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRL 112
HH Y I C RYSLE NRGA++VATR W + D I L G + L++ EEK L
Sbjct: 153 HHDAGYTIV------ECNRYSLEQNRGAKLVATRNWFRGDKIERLSGVVCLLSQSEEKVL 206
Query: 113 LHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEE 172
L +DFSVMYS RK C+ LWLGP AYINHDC+P C FV + TA ++VLRDIE G E
Sbjct: 207 LQPEGSDFSVMYSTRKRCSTLWLGPGAYINHDCKPTCEFV-SHGSTAHIRVLRDIEPGHE 265
Query: 173 ITCFYGEDFFGDKNCLCECDTCE--IRMKGAFAQSSVEPQTTQP 214
ITC+YG +FFG N CEC TC+ R + A +++++ + +P
Sbjct: 266 ITCYYGSEFFGPGNADCECSTCDDKKREQQAALEAALDTENNEP 309
>gi|354492305|ref|XP_003508289.1| PREDICTED: histone-lysine N-methyltransferase SUV420H2-like
[Cricetulus griseus]
Length = 296
Score = 161 bits (408), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 81/142 (57%), Positives = 100/142 (70%), Gaps = 3/142 (2%)
Query: 54 HRYSYRIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLL 113
H + Y LPE + + + R + RK KN+ + LVGCIAEL +E+E LL
Sbjct: 92 HIFCYLRALLPETQTGSLWVRQVGPRGRGASERK--KNEKLELLVGCIAELRQEDE-HLL 148
Query: 114 HTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEI 173
G NDFS+MYS RK AQLWLGPAA+INHDC+PNC+FV ++ +TACVKVLRDIE G+E+
Sbjct: 149 RAGENDFSIMYSTRKRSAQLWLGPAAFINHDCKPNCKFVPSDGNTACVKVLRDIEPGDEV 208
Query: 174 TCFYGEDFFGDKNCLCECDTCE 195
TCFYGE FFG+KN CEC TCE
Sbjct: 209 TCFYGEGFFGEKNEHCECYTCE 230
>gi|341896744|gb|EGT52679.1| hypothetical protein CAEBREN_31192 [Caenorhabditis brenneri]
Length = 324
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 82/164 (50%), Positives = 105/164 (64%), Gaps = 9/164 (5%)
Query: 53 HHRYSYRIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRL 112
HH Y I C RYSLE NRGA++VATR W + D I L G + L++ EEK L
Sbjct: 153 HHDAGYTIV------ECNRYSLEQNRGAKLVATRNWYRGDKIERLSGVVCLLSQSEEKVL 206
Query: 113 LHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEE 172
L +DFSVMYS RK C+ LWLGP AYINHDC+P C FV + TA ++VLRDIE G E
Sbjct: 207 LQPEGSDFSVMYSTRKRCSTLWLGPGAYINHDCKPTCEFV-SHGSTAHIRVLRDIEPGHE 265
Query: 173 ITCFYGEDFFGDKNCLCECDTCE--IRMKGAFAQSSVEPQTTQP 214
ITC+YG +FFG N CEC TC+ R + A +++++ + +P
Sbjct: 266 ITCYYGSEFFGPGNADCECCTCDDKKREQQAALEAALDTENNEP 309
>gi|47217792|emb|CAG07206.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1726
Score = 160 bits (404), Expect = 7e-37, Method: Composition-based stats.
Identities = 87/189 (46%), Positives = 118/189 (62%), Gaps = 11/189 (5%)
Query: 52 QH-HRY--SYRIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEE 108
QH HRY ++ + +++ C RYS E N GA+I +TR W + + L+GCIAEL+ +
Sbjct: 104 QHVHRYLSAFLLDSGIKIESCDRYSSETN-GAKITSTRHWFAGERVEVLLGCIAELSPAD 162
Query: 109 EKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIE 168
+L G+NDFSVMYS RK CAQLWLGPAA+INH RF+ E++ ACV+V+R I
Sbjct: 163 SA-VLRAGVNDFSVMYSTRKRCAQLWLGPAAFINHG-EFAARFLPGEKNIACVEVIRPIS 220
Query: 169 QGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQ---TTQP--SRYCLRDTD 223
GEEITC+YG FFG+ N +CEC TCE +G F +P+ T P +Y LR+
Sbjct: 221 PGEEITCYYGASFFGEGNEMCECCTCERNGEGHFKHRGKQPECEDTKDPVGQKYRLRERY 280
Query: 224 MRLNKRKLH 232
+R + K H
Sbjct: 281 LRHQREKGH 289
>gi|308503460|ref|XP_003113914.1| CRE-SET-4 protein [Caenorhabditis remanei]
gi|308263873|gb|EFP07826.1| CRE-SET-4 protein [Caenorhabditis remanei]
Length = 301
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 72/141 (51%), Positives = 95/141 (67%), Gaps = 1/141 (0%)
Query: 66 LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
++ C RYS EG++GA++VATR W + D I L+G + + EE LL +DFSVMYS
Sbjct: 135 IEECNRYSAEGHKGAKLVATRAWSRGDKIDRLIGVLCSMKNNEETTLLSAEGSDFSVMYS 194
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
RK C+ LWLGP AYINHDC+P C+F + +A ++VLRD+ G+EITCFYG +FFG
Sbjct: 195 TRKRCSTLWLGPGAYINHDCKPTCQF-QSNGTSAHIRVLRDMVAGDEITCFYGNEFFGPG 253
Query: 186 NCLCECDTCEIRMKGAFAQSS 206
N CEC TCE +G F +S
Sbjct: 254 NVDCECVTCEKNKRGKFQLNS 274
>gi|256082581|ref|XP_002577533.1| suppressor of variegation 4-20-related [Schistosoma mansoni]
gi|353232247|emb|CCD79602.1| suppressor of variegation 4-20-related [Schistosoma mansoni]
Length = 613
Score = 153 bits (386), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 74/142 (52%), Positives = 93/142 (65%), Gaps = 2/142 (1%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E++PC+RY+ E + GA I ATR W K I+ LVGCIAEL EE L NDFSVMY
Sbjct: 127 EIRPCWRYASENHMGASIFATRDWAKGSRISTLVGCIAELKHHEEAAFLKHHKNDFSVMY 186
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRF-VATERDT-ACVKVLRDIEQGEEITCFYGEDFF 182
S RKNC+QLWLGPAAY+NHDC PNC F + + D ++ DI+ G+EI +YG FF
Sbjct: 187 SSRKNCSQLWLGPAAYVNHDCHPNCEFTINCDVDARMSLEAKMDIKCGDEIFIYYGTHFF 246
Query: 183 GDKNCLCECDTCEIRMKGAFAQ 204
N CEC TCE+ +G F++
Sbjct: 247 DTNNRACECFTCELLGRGYFSK 268
>gi|358335348|dbj|GAA53884.1| histone-lysine N-methyltransferase SUV420H, partial [Clonorchis
sinensis]
Length = 527
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 74/144 (51%), Positives = 93/144 (64%), Gaps = 6/144 (4%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E++PC RY+ E + GA I A++ W K I LVGCIAE+ EE L NDFSVMY
Sbjct: 68 EIRPCSRYASEDHVGAAIFASKDWPKGTRIHTLVGCIAEMKHAEEASFLKHRKNDFSVMY 127
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFV----ATERDTACVKVLRDIEQGEEITCFYGED 180
S RKNC+QLWLGPAAY+NHDC+PNC F A R + K+ DI +GEEI +YG+
Sbjct: 128 SSRKNCSQLWLGPAAYVNHDCQPNCEFTINCDADARMSLEAKI--DIRKGEEIFIYYGKH 185
Query: 181 FFGDKNCLCECDTCEIRMKGAFAQ 204
FF N CEC TCE+ +G F++
Sbjct: 186 FFDVNNSACECFTCELLGRGYFSK 209
>gi|313216250|emb|CBY37593.1| unnamed protein product [Oikopleura dioica]
Length = 392
Score = 150 bits (380), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 84/228 (36%), Positives = 127/228 (55%), Gaps = 16/228 (7%)
Query: 52 QHHRYSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
Q H + Y + P ++ C RYS E +RG + AT +++ ++I L GCIA ++ E
Sbjct: 117 QKHIFLYLKMWDPNSGYTIRECDRYSKE-HRGGALYATIDYKRGEDIRELEGCIARMSPE 175
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDI 167
+ ++++ G ND+S+ YS K C+QLWLGPAA++NHDC PN ++T +TA V+ +DI
Sbjct: 176 QHEQIVVEGQNDYSIQYSTSKRCSQLWLGPAAFLNHDCCPNSEILSTGPNTASVRATKDI 235
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLN 227
+ EE+ FYG+ FFG +NC CEC TCE R GAF S P++ +R+
Sbjct: 236 KSNEELLIFYGKGFFGPENCECECRTCERRGAGAFTDKSRSADRRSPAKLSSGKYGLRMT 295
Query: 228 KRKLHKKLNRLLLASDK----NDTNSSDNSSKYSLVQWNNNSKENIFD 271
NR +A DK + N+S SS + +NS E+ F+
Sbjct: 296 S-------NRQRIADDKTRKNTELNTSVGSSNTADSSIASNSDESFFE 336
>gi|313233122|emb|CBY24234.1| unnamed protein product [Oikopleura dioica]
Length = 392
Score = 150 bits (380), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 84/228 (36%), Positives = 127/228 (55%), Gaps = 16/228 (7%)
Query: 52 QHHRYSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
Q H + Y + P ++ C RYS E +RG + AT +++ ++I L GCIA ++ E
Sbjct: 117 QKHIFLYLKMWDPNSGYTIRECDRYSKE-HRGGALYATIDYKRGEDIRELEGCIARMSPE 175
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDI 167
+ ++++ G ND+S+ YS K C+QLWLGPAA++NHDC PN ++T +TA V+ +DI
Sbjct: 176 QHEQIVVEGQNDYSIQYSTSKRCSQLWLGPAAFLNHDCCPNSEILSTGPNTASVRATKDI 235
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLN 227
+ EE+ FYG+ FFG +NC CEC TCE R GAF S P++ +R+
Sbjct: 236 KSNEELLIFYGKGFFGPENCECECRTCERRGAGAFTDKSRSADRRSPAKLSSGKYGLRMT 295
Query: 228 KRKLHKKLNRLLLASDK----NDTNSSDNSSKYSLVQWNNNSKENIFD 271
NR +A DK + N+S SS + +NS E+ F+
Sbjct: 296 S-------NRQRIADDKTRKNTELNTSVGSSNTADSSIASNSDESFFE 336
>gi|313212401|emb|CBY36385.1| unnamed protein product [Oikopleura dioica]
Length = 409
Score = 140 bits (352), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 75/177 (42%), Positives = 102/177 (57%), Gaps = 8/177 (4%)
Query: 52 QHHRYSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
Q H +S+ + P+ ++ C RYS EGNRGA I ATR ++ + I L GC A +T E
Sbjct: 138 QDHIFSFLKFWDPKSGFTVRECARYSKEGNRGAAIYATRSFKTGEEIKGLKGCSAPMTDE 197
Query: 108 EEKRLLHTGLNDFSVM--YSCRK--NCAQLWLGPAAYINHDCRPNCRFVATERDTACVKV 163
+ ++L+ G ND+S++ + RK N LWLGPAAYINHDCR NC + ++
Sbjct: 198 QYEQLVTPGKNDYSILIGFPRRKGPNRHSLWLGPAAYINHDCRANCEIDHRINNKVVIRA 257
Query: 164 LRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLR 220
RDIE EE+ FYG FFG NC CEC TCE++ GAF ++ R+ LR
Sbjct: 258 TRDIEANEELLMFYGSQFFGADNCECECRTCELQRTGAFENAATCQMNPTVGRFNLR 314
>gi|313236491|emb|CBY11806.1| unnamed protein product [Oikopleura dioica]
Length = 405
Score = 139 bits (351), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 75/177 (42%), Positives = 102/177 (57%), Gaps = 8/177 (4%)
Query: 52 QHHRYSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
Q H +S+ + P+ ++ C RYS EGNRGA I ATR ++ + I L GC A +T E
Sbjct: 138 QDHIFSFLKFWDPKSGFTVRECARYSKEGNRGAAIYATRSFKTGEEIKGLKGCSAPMTDE 197
Query: 108 EEKRLLHTGLNDFSVM--YSCRK--NCAQLWLGPAAYINHDCRPNCRFVATERDTACVKV 163
+ ++L+ G ND+S++ + RK N LWLGPAAYINHDCR NC + ++
Sbjct: 198 QYEQLVTPGKNDYSILIGFPRRKGPNRHSLWLGPAAYINHDCRANCEIDHRINNKVVIRA 257
Query: 164 LRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLR 220
RDIE EE+ FYG FFG NC CEC TCE++ GAF ++ R+ LR
Sbjct: 258 TRDIEANEELLMFYGSQFFGADNCECECRTCELQRTGAFENAATCQMNPTVGRFNLR 314
>gi|344251394|gb|EGW07498.1| Histone-lysine N-methyltransferase SUV420H2 [Cricetulus griseus]
Length = 225
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 80/175 (45%), Positives = 103/175 (58%), Gaps = 19/175 (10%)
Query: 39 QRFNIGRILDMGTQHHRYSYRIQYL--PELKPCYRYSLEGNRGARIVATR---------- 86
+ N+ + + QHH S +L +L+ YR G+ A +R
Sbjct: 32 HKMNVSPVPTLRRQHHLRSALEAFLRQQDLEATYRALTLGSWMAHYFQSRAPRQEAAFKT 91
Query: 87 ---KWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINH 143
+KN+ + LVGCIAEL +E+E LL G NDFS+MYS RK AQLWLGPAA+INH
Sbjct: 92 HPVSRKKNEKLELLVGCIAELRQEDE-HLLRAGENDFSIMYSTRKRSAQLWLGPAAFINH 150
Query: 144 DCRPN---CRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCE 195
+FV ++ +TACVKVLRDIE G+E+TCFYGE FFG+KN CEC TCE
Sbjct: 151 GWAIEWAWLQFVPSDGNTACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECYTCE 205
>gi|345569216|gb|EGX52084.1| hypothetical protein AOL_s00043g474 [Arthrobotrys oligospora ATCC
24927]
Length = 577
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 64/141 (45%), Positives = 88/141 (62%), Gaps = 1/141 (0%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ RY+L+ + A ++A K D I YL G + +TKEEE DFS+++
Sbjct: 115 EVTTTNRYTLKTHE-ASVLARVPIPKGDTIRYLSGVMVTMTKEEEAEFNEASNRDFSIIH 173
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RK AQ++LGPA ++NHDC PNCRF + + T K LRDIE GEE+T +Y +D+FG
Sbjct: 174 SSRKAGAQIFLGPARFVNHDCSPNCRFQSASKSTVMFKALRDIEPGEELTVYYSDDYFGY 233
Query: 185 KNCLCECDTCEIRMKGAFAQS 205
KN C C TCE M+G +A+S
Sbjct: 234 KNAECLCHTCEREMRGGWAKS 254
>gi|384498927|gb|EIE89418.1| hypothetical protein RO3G_14129 [Rhizopus delemar RA 99-880]
Length = 227
Score = 127 bits (318), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 69/166 (41%), Positives = 94/166 (56%), Gaps = 2/166 (1%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ RYS E A ++AT+ W D I L G IA L +++ L G DFSVM+
Sbjct: 7 EVADTKRYS-ERKLEACLIATKNWHVGDEIKLLTGVIATLDPKDDAEL-KKGNRDFSVMW 64
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RK C L+LGPA + NHDC NCRF+A ++ KV+++I+ G+EIT FYG+ +FG+
Sbjct: 65 SSRKKCTCLFLGPARFANHDCNSNCRFIAHGLNSIIFKVVKEIKCGDEITVFYGQHYFGE 124
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRK 230
NC C+C TCE R G + SS P T ++ KRK
Sbjct: 125 NNCECKCVTCEKRNTGYYTLSSTSPPTIATKESAVKTNTRPTRKRK 170
>gi|390370989|ref|XP_003731940.1| PREDICTED: uncharacterized protein LOC100893305 [Strongylocentrotus
purpuratus]
Length = 794
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 54/91 (59%), Positives = 66/91 (72%)
Query: 144 DCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFA 203
DCRPNC+FVAT RDTACV++LRDIE EEITC+YGEDFFG+ NC CEC+TCE R GAF
Sbjct: 18 DCRPNCKFVATGRDTACVQILRDIEPEEEITCYYGEDFFGENNCYCECETCERRQSGAFT 77
Query: 204 QSSVEPQTTQPSRYCLRDTDMRLNKRKLHKK 234
+ + +Y LRDT+ R+ + K KK
Sbjct: 78 PKDSPDKKKETPKYSLRDTNKRIMRLKKQKK 108
>gi|296234638|ref|XP_002762550.1| PREDICTED: histone-lysine N-methyltransferase SUV420H2 [Callithrix
jacchus]
Length = 382
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 60/101 (59%), Positives = 75/101 (74%), Gaps = 6/101 (5%)
Query: 54 HRYSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE 109
H Y Y +LPE + PC RYS+E N GA+IV+TR W+KN+ + LVGCIAEL +E +
Sbjct: 92 HVYRYLRAFLPESGFAILPCTRYSMETN-GAKIVSTRAWKKNEKLELLVGCIAEL-READ 149
Query: 110 KRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCR 150
+ LL G NDFS+MYS RK AQLWLGPAA+INHDC+PNC+
Sbjct: 150 EGLLRAGENDFSIMYSTRKRSAQLWLGPAAFINHDCKPNCK 190
>gi|345318700|ref|XP_001517837.2| PREDICTED: histone-lysine N-methyltransferase SUV420H1-like,
partial [Ornithorhynchus anatinus]
Length = 253
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 57/84 (67%), Positives = 66/84 (78%), Gaps = 1/84 (1%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 15 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEAEENLLLRHGDNDFSVMY 73
Query: 125 SCRKNCAQLWLGPAAYINHDCRPN 148
S RKNCAQLWLGPAA+INH RP
Sbjct: 74 STRKNCAQLWLGPAAFINHGERPG 97
>gi|119595107|gb|EAW74701.1| suppressor of variegation 4-20 homolog 1 (Drosophila), isoform
CRA_d [Homo sapiens]
Length = 298
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 55/79 (69%), Positives = 64/79 (81%), Gaps = 1/79 (1%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA+IVAT++W++ND I LVGCIAEL++ EE LL G NDFSVMY
Sbjct: 196 EILPCNRYSSEQN-GAKIVATKEWKRNDKIELLVGCIAELSEIEENMLLRHGENDFSVMY 254
Query: 125 SCRKNCAQLWLGPAAYINH 143
S RKNCAQLWLGPAA+INH
Sbjct: 255 STRKNCAQLWLGPAAFINH 273
>gi|242776483|ref|XP_002478845.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
gi|218722464|gb|EED21882.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
Length = 634
Score = 113 bits (283), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 72/193 (37%), Positives = 102/193 (52%), Gaps = 16/193 (8%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
+H R +I YLP E+ RY++ + A +VA R +K + I YL G + +T++
Sbjct: 106 RHFRKYIQI-YLPDCPFEISTTNRYTIIEHEAA-VVARRNIKKGEEIKYLCGTLVSMTRQ 163
Query: 108 EEKRLLHTGLN--DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLR 165
EE L G N DFS++ S R+ L+LGPA + NHDC N R D + R
Sbjct: 164 EE---LDLGFNRKDFSIVMSSRRKSPSLFLGPARFANHDCNANGRLSTRGSDGMQIIAAR 220
Query: 166 DIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMR 225
DIE GEEIT YG+D+FG NC C C+TCE ++ ++ S P + P+ RD R
Sbjct: 221 DIELGEEITVSYGDDYFGIDNCECLCETCERAIRNGWSPSVSAPASQAPT--PTRDDSNR 278
Query: 226 ---LNKRKLHKKL 235
KR+L K+
Sbjct: 279 SPQPKKRRLDTKI 291
>gi|328774172|gb|EGF84209.1| hypothetical protein BATDEDRAFT_8511, partial [Batrachochytrium
dendrobatidis JAM81]
Length = 150
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 57/117 (48%), Positives = 74/117 (63%), Gaps = 4/117 (3%)
Query: 80 ARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAA 139
ARIVATR+W+ D I G +L EEE+ L DFSVM+S +++C L+LGPA
Sbjct: 36 ARIVATREWKTGDEIYLCSGIFTKLNSEEERSLAQ---RDFSVMFSTKRHCMGLFLGPAR 92
Query: 140 YINHDCRPNCRFVATERDTAC-VKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCE 195
+ NHDC NC+F R A KV RDI GEEITC+Y +++FG NC C C++CE
Sbjct: 93 FANHDCCANCKFNPVGRTNAIYFKVTRDINIGEEITCYYSDNYFGQGNCECLCESCE 149
>gi|395528528|ref|XP_003766381.1| PREDICTED: histone-lysine N-methyltransferase SUV420H2 [Sarcophilus
harrisii]
Length = 350
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 60/125 (48%), Positives = 79/125 (63%), Gaps = 7/125 (5%)
Query: 24 LNPSFWAFCVQALLAQRFNI-GRILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGNR 78
L ++ A + +A F G + + H Y Y +LPE + PC RYSLE N
Sbjct: 61 LEAAYRALTLGGWMAHYFQSRGPQQEAALKTHIYRYLRAFLPESGFAILPCTRYSLETN- 119
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA++V+TR W+KN+ + LVGCIAEL +E ++ LL G NDFS+MYS RK AQLWLGPA
Sbjct: 120 GAKVVSTRSWKKNEKLELLVGCIAEL-READEGLLRAGENDFSIMYSTRKRSAQLWLGPA 178
Query: 139 AYINH 143
A+INH
Sbjct: 179 AFINH 183
>gi|340515476|gb|EGR45730.1| predicted protein [Trichoderma reesei QM6a]
Length = 652
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 61/158 (38%), Positives = 88/158 (55%), Gaps = 7/158 (4%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
QH R RI Y+P E+ RY++ + ++A R ++N+ I YL G +T E
Sbjct: 106 QHLRRYLRI-YMPDCPWEVSTTNRYTIVTHEAC-VIARRYIKRNETIKYLSGIQINITPE 163
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDI 167
EEK + DFS++ S RK C L++GPA + NHDCR N R + T + T + R I
Sbjct: 164 EEKEIT-VRKKDFSIVVSSRKKCTSLFMGPARFANHDCRANARLMTTSQSTIEIVASRPI 222
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQS 205
+ GEEIT Y + +FGD NC C C TCE ++ +A +
Sbjct: 223 DVGEEITVTYSDCYFGDNNCECLCRTCEKELRNGWAPA 260
>gi|429853922|gb|ELA28965.1| histone-lysine n-methyltransferase set9 [Colletotrichum
gloeosporioides Nara gc5]
Length = 792
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 60/150 (40%), Positives = 85/150 (56%), Gaps = 10/150 (6%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
+ H Y YLP E+ RY++ + A I A R +N+NI YL G ++ E
Sbjct: 102 KRHMRRYTQVYLPDCPFEVNSTNRYTIVSHEAA-ITARRFIRRNENIKYLAGTQVNISPE 160
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVL--R 165
EE R + DFS++ S R CA L++GPA + NHDC N + + T TA ++++ R
Sbjct: 161 EE-REMTARKKDFSIIISSRSKCASLFMGPARFANHDCGANAKLMTT--GTAGIEIIATR 217
Query: 166 DIEQGEEITCFYGEDFFGDKNCLCECDTCE 195
+IE GEEIT Y E++FG+ NC C C TCE
Sbjct: 218 NIEAGEEITVTYAENYFGEDNCECLCRTCE 247
>gi|407917573|gb|EKG10877.1| hypothetical protein MPH_11879 [Macrophomina phaseolina MS6]
Length = 705
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 60/151 (39%), Positives = 80/151 (52%), Gaps = 6/151 (3%)
Query: 62 YLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGL 117
Y+P E+ RY++ + A + A R KN+ I YL G +TKEEE+ L
Sbjct: 117 YMPDCPWEVSTTNRYTITQHEAA-VTARRDIRKNEVIKYLCGIQVAMTKEEEE-TLDLNK 174
Query: 118 NDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
DFS++ S RK L+LGPA + NHDC N R + + +DIE GEEIT Y
Sbjct: 175 RDFSIVMSSRKKAPSLFLGPARFANHDCDANARLTTAGPNGMAIVSKKDIEVGEEITVSY 234
Query: 178 GEDFFGDKNCLCECDTCEIRMKGAFAQSSVE 208
GED+FG+ NC C C TCE + +A E
Sbjct: 235 GEDYFGEDNCECLCSTCERLTRNGWAPQKKE 265
>gi|440897008|gb|ELR48791.1| Histone-lysine N-methyltransferase SUV420H2, partial [Bos grunniens
mutus]
Length = 338
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 79/215 (36%), Positives = 111/215 (51%), Gaps = 27/215 (12%)
Query: 56 YSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKR 111
Y Y +LPE + PC RYS+E N GA+IV+TR D L + R
Sbjct: 2 YRYLRAFLPESGFTILPCTRYSMETN-GAKIVSTRACPLGDET--------PLQRPRLPR 52
Query: 112 LLHTGLNDFSVMYSCRKNCAQLWLGPAAYINH-------DCRPNCRFVATERDTACVKVL 164
GL ++ C + P++ +H +C+PNC+FV + + ACVKVL
Sbjct: 53 AAGKGLTLCPRVHCCPLSHLH-QTPPSSSSSHCWVRGQDNCKPNCKFVPADGNAACVKVL 111
Query: 165 RDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQTTQPS-RYCLRDTD 223
RDIE G+E+TCFYGE FFG+KN CEC TCE R +GAF EP +P +Y LR+T
Sbjct: 112 RDIEPGDEVTCFYGEGFFGEKNEHCECYTCERRGEGAFRLRPREPLPPRPLDKYELRET- 170
Query: 224 MRLNKRKLHKKLNRLLLASDKNDTNSSDNSSKYSL 258
KR+L + L+ + + S S+ ++L
Sbjct: 171 ----KRRLQQGLDGPRSCAPRRAGTPSAVSTGWAL 201
>gi|212532789|ref|XP_002146551.1| SET domain protein [Talaromyces marneffei ATCC 18224]
gi|210071915|gb|EEA26004.1| SET domain protein [Talaromyces marneffei ATCC 18224]
Length = 640
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/156 (39%), Positives = 86/156 (55%), Gaps = 10/156 (6%)
Query: 62 YLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGL 117
YLP E+ RY++ + A +VA R +K + I YL G + +T++EE L G
Sbjct: 115 YLPDCPFEISTTNRYTIIEHE-ASVVARRNIKKGEEIKYLCGTLVSMTRQEE---LDLGF 170
Query: 118 N--DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITC 175
N DFS++ S R+ L+LGPA + NHDC N R D V RDIE GEEIT
Sbjct: 171 NRKDFSIVMSSRRKNPSLFLGPARFANHDCNANGRLSTRGNDGMQVIAARDIELGEEITV 230
Query: 176 FYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQT 211
YGED+FG NC C C+TCE ++ ++ + P +
Sbjct: 231 SYGEDYFGIDNCECLCETCERTVRNGWSPAVSVPTS 266
>gi|119592770|gb|EAW72364.1| suppressor of variegation 4-20 homolog 2 (Drosophila), isoform
CRA_a [Homo sapiens]
Length = 204
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 60/125 (48%), Positives = 79/125 (63%), Gaps = 7/125 (5%)
Query: 24 LNPSFWAFCVQALLAQRFNI-GRILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGNR 78
L ++ A + A+ F G + + H Y Y +LPE + PC RYS+E N
Sbjct: 61 LEAAYRALTLGGWTARYFQSRGPRQEAALKTHVYRYLRAFLPESGFTILPCTRYSMETN- 119
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA+IV+TR W+KN+ + LVGCIAEL +E ++ LL G NDFS+MYS RK AQLWLGPA
Sbjct: 120 GAKIVSTRAWKKNEKLELLVGCIAEL-READEGLLRAGENDFSIMYSTRKRSAQLWLGPA 178
Query: 139 AYINH 143
A+INH
Sbjct: 179 AFINH 183
>gi|310801077|gb|EFQ35970.1| histone-lysine N-methyltransferase SET9 [Glomerella graminicola
M1.001]
Length = 794
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 59/158 (37%), Positives = 84/158 (53%), Gaps = 6/158 (3%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
+ H Y YLP E+ RY++ + A I A R +N+ I YL G ++ E
Sbjct: 105 KRHMRRYTQVYLPDCPFEVNSTNRYTIVSHEAA-ITARRFIRRNETIKYLAGTQVNISPE 163
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDI 167
EE R + DFS++ S R CA L++GPA + NHDC N + + T T + R+I
Sbjct: 164 EE-REMTARKKDFSIIISARSKCASLFMGPARFANHDCGANAKLMTTSSATIEIIATRNI 222
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQS 205
+ GEEIT Y E++FG+ NC C C TCE R +A +
Sbjct: 223 DVGEEITVTYAENYFGEDNCECLCKTCEDRCTNGWAPA 260
>gi|156048416|ref|XP_001590175.1| hypothetical protein SS1G_08939 [Sclerotinia sclerotiorum 1980]
gi|154693336|gb|EDN93074.1| hypothetical protein SS1G_08939 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 842
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 63/167 (37%), Positives = 88/167 (52%), Gaps = 6/167 (3%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
+ H Y YLP E+ RY++ + A IVA R+ K + + YL G +T E
Sbjct: 101 RRHLKKYINIYLPDCPFEVSSTNRYTVVTHEAA-IVARREIRKGEVVKYLCGIQVVMTPE 159
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDI 167
EE + + DFS++ S R A L+LGPA + NHDC N R + T + +RDI
Sbjct: 160 EEAHI-NNSRRDFSIVMSSRNKAASLFLGPARFANHDCGANARLMTTGSAGMEIIAVRDI 218
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQTTQP 214
E GEEIT YG+ +FG+ NC C C TCE + +AQ + + T P
Sbjct: 219 EIGEEITVTYGDSYFGEDNCECLCKTCEDNRENGWAQDTDDSNETIP 265
>gi|402074335|gb|EJT69864.1| histone-lysine N-methyltransferase SET9 [Gaeumannomyces graminis
var. tritici R3-111a-1]
Length = 916
Score = 107 bits (268), Expect = 4e-21, Method: Composition-based stats.
Identities = 61/159 (38%), Positives = 84/159 (52%), Gaps = 6/159 (3%)
Query: 54 HRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE 109
H Y YLP E+ RY++ + A + A R KN+ I L G +T EEE
Sbjct: 102 HLRRYLAIYLPDCPFEVNATNRYTIFTHE-ASVTARRPISKNETIKGLCGIQVVITPEEE 160
Query: 110 KRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQ 169
+L DFS++ S R C L++GPA + NHDC N R + V+ LRDI+
Sbjct: 161 AQL-SLRKKDFSLVVSSRNKCTSLFMGPARFANHDCGANARLKTAGQAGMEVQALRDIDI 219
Query: 170 GEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVE 208
GEEIT YG+++FG+ NC C C TCE+R +A + E
Sbjct: 220 GEEITVTYGDNYFGESNCECLCRTCELRRVNGWAAAGRE 258
>gi|380491576|emb|CCF35220.1| histone-lysine N-methyltransferase SET9 [Colletotrichum
higginsianum]
Length = 815
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 58/150 (38%), Positives = 81/150 (54%), Gaps = 6/150 (4%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
+ H Y YLP E+ RY++ + A I A R +N+ I YL G ++ E
Sbjct: 105 KRHMRRYTQVYLPDCPFEVNSTNRYTIVSHEAA-ITARRFIRRNETIKYLAGTQVNISPE 163
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDI 167
EE R + DFS++ S R CA L++GPA + NHDC N + + T T + R+I
Sbjct: 164 EE-REMTARKKDFSIIISARNKCASLFMGPARFANHDCGANAKLMTTSSATIEIIATRNI 222
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCEIR 197
+ GEEIT Y E++FG+ NC C C TCE R
Sbjct: 223 DVGEEITVTYAENYFGEDNCECLCRTCENR 252
>gi|392338316|ref|XP_003753498.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SUV420H2-like [Rattus norvegicus]
gi|392355136|ref|XP_003751951.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
SUV420H2-like [Rattus norvegicus]
Length = 430
Score = 107 bits (266), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 57/99 (57%), Positives = 70/99 (70%), Gaps = 3/99 (3%)
Query: 68 PCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCR 127
PC R+S E N G +IV+T KN+ + LVGCIAEL +E+E LL G NDFS+MYS
Sbjct: 171 PCTRHSTETN-GTKIVSTHA-GKNEKLELLVGCIAELCEEDE-YLLRAGKNDFSIMYSTX 227
Query: 128 KNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRD 166
K A L LGPAA INHDC+PNC+FV ++ +TAC KVL D
Sbjct: 228 KRIAHLRLGPAAIINHDCKPNCKFVPSDGNTACRKVLLD 266
>gi|358392150|gb|EHK41554.1| hypothetical protein TRIATDRAFT_295421 [Trichoderma atroviride IMI
206040]
Length = 756
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 84/155 (54%), Gaps = 5/155 (3%)
Query: 52 QHHRYSYRIQYLP---ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEE 108
QH R RI L E+ RY++ + ++A R ++N+ I YL G +T EE
Sbjct: 105 QHLRRYLRIYMLDCPWEVSTTNRYTIVTHEAC-VIARRYIKRNETIKYLSGIQINITPEE 163
Query: 109 EKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIE 168
EK + DFS++ S RK C L++GPA + NHDCR N + + T T + R IE
Sbjct: 164 EKEIT-VRKKDFSIVVSSRKKCTSLFMGPARFANHDCRANAKLMTTGHSTIEIVASRPIE 222
Query: 169 QGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFA 203
GEEIT Y + +FGD NC C C TCE ++ +A
Sbjct: 223 VGEEITVTYSDCYFGDNNCECLCRTCEQELRNGWA 257
>gi|444724132|gb|ELW64750.1| Histone-lysine N-methyltransferase SUV420H2 [Tupaia chinensis]
Length = 235
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 70/195 (35%), Positives = 92/195 (47%), Gaps = 46/195 (23%)
Query: 24 LNPSFWAFCVQALLAQRFNI-GRILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGNR 78
L ++ A + +A F G + + H Y Y +LPE + PC RYS+E N
Sbjct: 61 LEAAYRALTLGGWMAHYFQSRGPRQESALKTHIYRYLRAFLPESGFAILPCSRYSMETN- 119
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
GA+IV+TR W+KN+ + LVGC+AEL + +E L G
Sbjct: 120 GAKIVSTRAWKKNEKLELLVGCVAELREADEGLLRADG---------------------- 157
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
+ ACVKVLRDIE G+E+TCFYGE FFG+KN CEC TCE
Sbjct: 158 ------------------NAACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECHTCERWG 199
Query: 199 KGAFAQSSVEPQTTQ 213
+G S Q Q
Sbjct: 200 RGEAQASRHRAQRLQ 214
>gi|345497522|ref|XP_003428010.1| PREDICTED: histone-lysine N-methyltransferase SUV420H2-like
[Nasonia vitripennis]
Length = 372
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 55/128 (42%), Positives = 77/128 (60%), Gaps = 4/128 (3%)
Query: 69 CYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRK 128
CYRYS E N G +I AT+KW +N I L G L +E + L L DFS++ +
Sbjct: 240 CYRYSYEKNVGVKICATKKWHRNAVIHELNGNKINLNDQELQYLTENRL-DFSIIKNSSN 298
Query: 129 NCAQLWLGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
N Q+WLGPA+++N+DC N RF + T C +++ +++ G+EITC YGE+ F D
Sbjct: 299 NKMQIWLGPASFVNNDCNSNTRFNCLKSKKTVCFEIISEVDCGDEITCSYGENCFSDDE- 357
Query: 188 LCECDTCE 195
CEC TCE
Sbjct: 358 -CECITCE 364
>gi|384493545|gb|EIE84036.1| hypothetical protein RO3G_08741 [Rhizopus delemar RA 99-880]
Length = 397
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 67/185 (36%), Positives = 102/185 (55%), Gaps = 12/185 (6%)
Query: 57 SYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRL 112
S + +++P E+ RY+ G A I+A + W+ + + Y G + LTK+ ++ L
Sbjct: 53 SEKARFMPTAGFEVSSTKRYT--GEIEACIIANKSWQAGEYLKYCTGSVCGLTKDVDQML 110
Query: 113 LHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEE 172
G DFSVM S R A L+LGPA ++NHDC NC FV + + LR IE GEE
Sbjct: 111 RSEG-RDFSVMLSQRYKNAFLFLGPARFMNHDCNANCAFVQQGTEVT-FRTLRAIEPGEE 168
Query: 173 ITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLH 232
+T YG+ +FG N C C TCE KG +A++ EPQT + ++ + + N K+H
Sbjct: 169 LTVKYGDHYFGVNNTECRCFTCERSGKGYYAKNK-EPQTEENNKI---EEKKQTNPTKMH 224
Query: 233 KKLNR 237
++L +
Sbjct: 225 QRLRK 229
>gi|343959028|dbj|BAK63369.1| suppressor of variegation 4-20 homolog 2 [Pan troglodytes]
Length = 145
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/92 (58%), Positives = 67/92 (72%), Gaps = 6/92 (6%)
Query: 56 YSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKR 111
Y Y +LPE + PC RYS+E N GA+IV+TR W+KN+ + LVGCIAEL +E ++
Sbjct: 29 YRYLRAFLPESGFTILPCTRYSMETN-GAKIVSTRAWKKNEKLELLVGCIAEL-READEG 86
Query: 112 LLHTGLNDFSVMYSCRKNCAQLWLGPAAYINH 143
LL G NDFS+MYS RK AQLWLGPAA+INH
Sbjct: 87 LLRAGENDFSIMYSTRKRSAQLWLGPAAFINH 118
>gi|154300642|ref|XP_001550736.1| hypothetical protein BC1G_10909 [Botryotinia fuckeliana B05.10]
gi|347841322|emb|CCD55894.1| hypothetical protein [Botryotinia fuckeliana]
Length = 807
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 60/149 (40%), Positives = 82/149 (55%), Gaps = 10/149 (6%)
Query: 62 YLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGL 117
YLP E+ RY++ + A +VA R+ K + + YL G +T EEE+ H L
Sbjct: 111 YLPDCPFEVSSTNRYTVVTHEAA-VVARREIRKGEVVKYLSGIQVVMTPEEEE---HINL 166
Query: 118 --NDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITC 175
DFS++ S R A L+LGPA + NHDC N R + + + +RDIE GEEIT
Sbjct: 167 MRRDFSIVMSSRNKSASLFLGPARFANHDCGANARLMTSGTAGMEIIAVRDIEIGEEITV 226
Query: 176 FYGEDFFGDKNCLCECDTCEIRMKGAFAQ 204
YGE +FG+ NC C C TCE + +AQ
Sbjct: 227 TYGESYFGEDNCECLCKTCEDNRENGWAQ 255
>gi|358377580|gb|EHK15263.1| hypothetical protein TRIVIDRAFT_38211 [Trichoderma virens Gv29-8]
Length = 681
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 60/156 (38%), Positives = 86/156 (55%), Gaps = 7/156 (4%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
QH R RI Y+P E+ RY++ + ++A R ++N+ I YL G +T E
Sbjct: 106 QHLRRYLRI-YMPDCPWEVSTTNRYTIVTHEAC-VIARRYIKRNETIKYLSGIQINITPE 163
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDI 167
EEK + DFS++ S RK L++GPA + NHDCR N + + T + T + R I
Sbjct: 164 EEKEIT-VRKKDFSIVVSSRKKRTSLFMGPARFANHDCRANAKLMTTSQSTIEIVASRPI 222
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFA 203
E GEEIT Y + +FGD NC C C TCE ++ +A
Sbjct: 223 EVGEEITVTYSDCYFGDDNCECLCRTCEKELRNGWA 258
>gi|396480355|ref|XP_003840976.1| similar to gi|121919568|sp|Q0U3A4.1|SET9_PHANO RecName:
Full=Histone-lysine N-methyltransferase SET9; AltName:
Full=SET domain protein 9 [Leptosphaeria maculans JN3]
gi|312217549|emb|CBX97497.1| similar to gi|121919568|sp|Q0U3A4.1|SET9_PHANO RecName:
Full=Histone-lysine N-methyltransferase SET9; AltName:
Full=SET domain protein 9 [Leptosphaeria maculans JN3]
Length = 670
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/139 (38%), Positives = 77/139 (55%), Gaps = 2/139 (1%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ RY++ + A + A R + I YL G +T+E+EK L DFS++Y
Sbjct: 121 EVTTTNRYTITEHE-ASVTARRDISAREEIKYLTGVQVAMTEEQEK-TLELQRKDFSLVY 178
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RK L+LGPA + NHDC PN + D + ++ I++GEEIT YG+D+FGD
Sbjct: 179 SSRKKTRSLFLGPARFANHDCDPNAKLATKGYDGMQIVAVKPIDEGEEITVSYGDDYFGD 238
Query: 185 KNCLCECDTCEIRMKGAFA 203
N C C TCE R + +A
Sbjct: 239 DNEECLCRTCEERQQNGWA 257
>gi|156407214|ref|XP_001641439.1| predicted protein [Nematostella vectensis]
gi|156228578|gb|EDO49376.1| predicted protein [Nematostella vectensis]
Length = 191
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 56/119 (47%), Positives = 73/119 (61%), Gaps = 2/119 (1%)
Query: 147 PNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSS 206
P FV+T RDTACVKVLRD+E G+EITCFYGEDFFGD NC CEC TCE R +G F S
Sbjct: 27 PRVLFVSTGRDTACVKVLRDLEVGDEITCFYGEDFFGDDNCNCECVTCERRGEGTF--KS 84
Query: 207 VEPQTTQPSRYCLRDTDMRLNKRKLHKKLNRLLLASDKNDTNSSDNSSKYSLVQWNNNS 265
+ + + +Y LR+TD RL + + K + L + S ++ Y +V+ NS
Sbjct: 85 KQKENVKKQKYSLRETDKRLKRLNIMKGVPLLDVTGQATCPASMHHTEVYLVVRLQQNS 143
>gi|322698787|gb|EFY90554.1| hypothetical protein MAC_03332 [Metarhizium acridum CQMa 102]
Length = 1175
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 65/181 (35%), Positives = 91/181 (50%), Gaps = 16/181 (8%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
+ H Y YLP E+ RY++ + A I A R ++N++I YL G +T E
Sbjct: 101 KRHLRRYLQIYLPDCPWEVNSTNRYTIVSHE-ASITARRYIKRNESIKYLSGIQVVITPE 159
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDI 167
EEK + DFS++ S R L++GPA + NHDC N + + T + +R I
Sbjct: 160 EEKEIAARK-KDFSMVVSSRSKSTSLFMGPARFANHDCDANAKLMTTSNAGIEIVAIRPI 218
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCEIRMK--------GAFAQSSVEPQTTQPSRYCL 219
E GEEIT YGE +FG+ NC C C TCE ++ Q+SVE +P Y L
Sbjct: 219 ESGEEITVTYGESYFGEDNCECLCGTCEKGLRNGWEPEEGAVIIQTSVE--EDRPETYAL 276
Query: 220 R 220
R
Sbjct: 277 R 277
>gi|328719153|ref|XP_003246678.1| PREDICTED: zinc finger protein 808-like [Acyrthosiphon pisum]
Length = 593
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 61/146 (41%), Positives = 82/146 (56%), Gaps = 5/146 (3%)
Query: 67 KPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSC 126
+PC RY+ E GA+IV+ +KN I L G + + + L G+NDFSV
Sbjct: 263 EPCMRYTNEKGFGAKIVSNVLIKKNKIIPGLGGQSFFIHEND----LKAGVNDFSVFTRS 318
Query: 127 RKNCAQLWLGPAAYINHDCRPNCRFVAT-ERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
R ++LGPAAYINHDC N F + E +K +++I GEEIT FYG D+FGD
Sbjct: 319 RSLKQHVYLGPAAYINHDCESNAVFSSIGEPSYVQIKSVKEILPGEEITVFYGHDYFGDN 378
Query: 186 NCLCECDTCEIRMKGAFAQSSVEPQT 211
N C+C TCE KG F++ V +T
Sbjct: 379 NAKCQCMTCENNRKGFFSKKVVTVET 404
>gi|342881559|gb|EGU82448.1| hypothetical protein FOXB_07034 [Fusarium oxysporum Fo5176]
Length = 666
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 92/171 (53%), Gaps = 9/171 (5%)
Query: 39 QRFNIGRILDMGTQHHRYSYR--IQ-YLP----ELKPCYRYSLEGNRGARIVATRKWEKN 91
+RF+ G D ++ R R +Q YLP E+ RY++ + A + A R +N
Sbjct: 86 KRFHNGLKTDNEKENFRRHLRRYVQIYLPDCPWEVSSTNRYTIVTHEAA-VTARRPIRRN 144
Query: 92 DNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRF 151
+ I YL G +T EEEK + + DFS++ S R C L++GPA + NHDC N +
Sbjct: 145 EAIKYLSGVQVIITPEEEK-AISSQKKDFSIVVSSRSKCTSLFMGPARFANHDCDANAKL 203
Query: 152 VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAF 202
+ T + R IE GEEIT YG+++FG+ NC C C TCE ++ A+
Sbjct: 204 MRTSHAGIEIVATRPIEAGEEITVTYGDNYFGENNCECLCRTCEDLLRNAW 254
>gi|296419058|ref|XP_002839141.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295635137|emb|CAZ83332.1| unnamed protein product [Tuber melanosporum]
Length = 593
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 52/130 (40%), Positives = 77/130 (59%), Gaps = 2/130 (1%)
Query: 80 ARIVATRK-WEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
R + TRK + + + YL G + +++ EE+ L +DFS++YS R L LGPA
Sbjct: 112 GRCLRTRKHLHRGEMVKYLTGVLVKMSDNEEQSLSRV-QSDFSIIYSSRVGAMSLLLGPA 170
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
+INHDC+PNC+F ++ + V RDI+ GEEIT Y +D+FG+ N C C TCE
Sbjct: 171 RFINHDCQPNCKFTTAGKEHVNLYVERDIKAGEEITVKYADDYFGEGNRECLCRTCEGLG 230
Query: 199 KGAFAQSSVE 208
+ +A +VE
Sbjct: 231 RSGWASGAVE 240
>gi|452843723|gb|EME45658.1| hypothetical protein DOTSEDRAFT_71379 [Dothistroma septosporum
NZE10]
Length = 823
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/148 (39%), Positives = 80/148 (54%), Gaps = 7/148 (4%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
+ H Y YLP E+ RY++E A + A R +K + I YL G E++++
Sbjct: 106 ERHLRKYVNMYLPDCPFEVTTTNRYTIETAEAA-VTARRYIKKGEAIKYLTGIQVEMSEQ 164
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDI 167
+EK + G DFSV+ S R+ L+LGPA + NHDC+ N T + RDI
Sbjct: 165 DEKEM--KGRTDFSVVVSSRRKRPSLFLGPARFANHDCQSNAGLNTTGPHGIHIVARRDI 222
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCE 195
+ GEEIT YGED+FG NC C C TCE
Sbjct: 223 DVGEEITVTYGEDYFGIDNCECLCGTCE 250
>gi|50553664|ref|XP_504243.1| YALI0E21802p [Yarrowia lipolytica]
gi|74633451|sp|Q6C519.1|SET9_YARLI RecName: Full=Histone-lysine N-methyltransferase SET9; AltName:
Full=SET domain protein 9
gi|49650112|emb|CAG79838.1| YALI0E21802p [Yarrowia lipolytica CLIB122]
Length = 866
Score = 104 bits (259), Expect = 4e-20, Method: Composition-based stats.
Identities = 55/140 (39%), Positives = 79/140 (56%), Gaps = 6/140 (4%)
Query: 65 ELKPCYRYSLEGNRGARIVATR-KWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVM 123
E+ RY NRG V +R ++ + D I L GC+A++TK+EE L NDFSV+
Sbjct: 114 EINVSMRYKCRSNRGESCVISRVRYNRGDEIVGLSGCLAKMTKDEELALA----NDFSVL 169
Query: 124 YSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFG 183
+S R+ L LGPA ++NHDC N RFV ++ ++ I G+EIT Y E++FG
Sbjct: 170 HSSRRGGNCLMLGPARFVNHDCSANARFVPVPSGMV-IQAVKPINVGDEITVKYAENYFG 228
Query: 184 DKNCLCECDTCEIRMKGAFA 203
+N C C TCE +G +
Sbjct: 229 RRNKECLCQTCEENSRGLYG 248
>gi|322705888|gb|EFY97471.1| hypothetical protein MAA_07113 [Metarhizium anisopliae ARSEF 23]
Length = 1048
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 63/179 (35%), Positives = 92/179 (51%), Gaps = 12/179 (6%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
+ H Y YLP E+ RY++ + A I A R ++N++I YL G +T E
Sbjct: 101 KRHLRRYLQIYLPDCPWEVSSTNRYTIVSHE-ASITARRYIKRNESIKYLSGIQVVITPE 159
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDI 167
EE + DFS++ S R L++GPA + NHDC N + + T + +R I
Sbjct: 160 EETEIAARK-KDFSMVISSRSKSTSLFMGPARFANHDCNANAKLMTTSNAGIEIVAIRPI 218
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAF--AQSSVEPQTT----QPSRYCLR 220
E GEEIT YGE +FG+ NC C C TCE ++ + + +V QT+ +P Y LR
Sbjct: 219 ESGEEITVTYGESYFGEDNCECLCGTCEKGLRNGWEPEEGAVSLQTSAEEDRPETYALR 277
>gi|323508085|emb|CBQ67956.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 1368
Score = 103 bits (258), Expect = 6e-20, Method: Composition-based stats.
Identities = 62/155 (40%), Positives = 82/155 (52%), Gaps = 22/155 (14%)
Query: 82 IVATRKWEKNDNIAYLVGCIAELTKEE------------EKR-------LLHTGLNDFSV 122
++A R ++ + I + G + +LTK E EKR +L G DFSV
Sbjct: 183 VLAMRTFKPGELITFCKGGLKDLTKSEDDALREEATASREKRKDAEYKGVLGPG-RDFSV 241
Query: 123 MYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFF 182
+ S RK C+QL LGPA ++NHDC PN F T KV+R I + EEIT FYGE++F
Sbjct: 242 IRSARKGCSQLLLGPARFVNHDCNPNTEFYRMGA-TMVFKVIRPIHRNEEITTFYGENYF 300
Query: 183 GDKNCLCECDTCEIRMKGAFAQSSVEPQTTQPSRY 217
N C C TCE R GAF+ S P +PS +
Sbjct: 301 EWGNSECMCATCESRGTGAFSDPSA-PSAVEPSEH 334
>gi|296812771|ref|XP_002846723.1| histone-lysine N-methyltransferase set9 [Arthroderma otae CBS
113480]
gi|238841979|gb|EEQ31641.1| histone-lysine N-methyltransferase set9 [Arthroderma otae CBS
113480]
Length = 629
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 71/207 (34%), Positives = 101/207 (48%), Gaps = 11/207 (5%)
Query: 5 VHNLILHVPIQLACHLLNKLNPSFWAFCVQALLAQRFNIGRILDMGTQHHRYSYRIQYLP 64
V +++LH I +K F A R R + +H R I YLP
Sbjct: 57 VTSILLHDVI--VAKDASKAEKEFLALPGLKKFTDRLRSAREKEWFRRHLRKYISI-YLP 113
Query: 65 ----ELKPCYRYSLEGNRGARIVATRKW-EKNDNIAYLVGCIAELTKEEEKRLLHTGLND 119
E+ RY++ + A VA RK+ + YL G + +TKEEE L T D
Sbjct: 114 DCPFEVTTTNRYTILTHEAA--VAARKFISAGHPVKYLSGTLVSITKEEETDLDLTK-RD 170
Query: 120 FSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGE 179
FS++ S RK A L+LGPA + NHDC N + V ++ V RDIE G+EIT YG+
Sbjct: 171 FSIVMSTRKKTASLFLGPARFANHDCNANAKLVTRGFESMEVVATRDIEVGDEITVSYGD 230
Query: 180 DFFGDKNCLCECDTCEIRMKGAFAQSS 206
+FG+ NC C C +CEI + ++ +
Sbjct: 231 SYFGEDNCECLCHSCEIAQRNGWSSPT 257
>gi|378733690|gb|EHY60149.1| histone-lysine N-methyltransferase SUV420H [Exophiala dermatitidis
NIH/UT8656]
Length = 771
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 60/170 (35%), Positives = 92/170 (54%), Gaps = 6/170 (3%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ RY++ A + A R+ ++ D + YL+G + LT EE L + N FS++
Sbjct: 121 EVSATNRYTITTYEAA-VTARRRIKQGDTVKYLIGTLVPLTTEESAELDLSNRN-FSIVV 178
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
S RK + ++LGPA + NHDC N R V D V +++IE G+EIT YG+D+FG
Sbjct: 179 SSRKKNSSIFLGPARFANHDCDANGRLVTRGPDGMEVVAMKNIEVGDEITVSYGDDYFGP 238
Query: 185 KNCLCECDTCEIRMKGAFAQSS--VEPQT--TQPSRYCLRDTDMRLNKRK 230
N C C TCE+ + + + +EP + + PSR + + R KRK
Sbjct: 239 GNIDCLCHTCEVLERNGWTSKAAVLEPSSHASTPSRELIDNHRSRSLKRK 288
>gi|71004048|ref|XP_756690.1| hypothetical protein UM00543.1 [Ustilago maydis 521]
gi|46095762|gb|EAK80995.1| hypothetical protein UM00543.1 [Ustilago maydis 521]
Length = 1353
Score = 103 bits (257), Expect = 8e-20, Method: Composition-based stats.
Identities = 63/153 (41%), Positives = 82/153 (53%), Gaps = 22/153 (14%)
Query: 82 IVATRKWEKNDNIAYLVGCIAELTKEE------------EKR-------LLHTGLNDFSV 122
++A R ++ + I + G + +LTK E EKR +L G DFSV
Sbjct: 183 VLAMRTFKPGELIHFCKGGLKDLTKSEDDALREEATASREKRKDSEYKGVLGPG-RDFSV 241
Query: 123 MYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFF 182
+ S RK C+QL LGPA ++NHDC PN F T KV+R I + EEIT FYGE++F
Sbjct: 242 IRSARKGCSQLLLGPARFVNHDCNPNTEFYRMGA-TMVFKVIRPIHRNEEITTFYGENYF 300
Query: 183 GDKNCLCECDTCEIRMKGAFAQSSVEPQTTQPS 215
N C C TCE R GAF+ SV P +PS
Sbjct: 301 EWGNAECMCATCESRGTGAFSDPSVPP-PIEPS 332
>gi|398405962|ref|XP_003854447.1| hypothetical protein MYCGRDRAFT_85175 [Zymoseptoria tritici IPO323]
gi|339474330|gb|EGP89423.1| hypothetical protein MYCGRDRAFT_85175 [Zymoseptoria tritici IPO323]
Length = 556
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 53/116 (45%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 80 ARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAA 139
A I+A + K ++I YL+G E+T EEEK L + NDFSV+ S R+ L+LGPA
Sbjct: 137 AAIIARKPIRKGESIKYLIGIQVEMTVEEEKDL--SSRNDFSVVMSSRRKRPSLFLGPAR 194
Query: 140 YINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCE 195
+ NHDC N + T + + I GEEIT YGED+FGD NC C C TCE
Sbjct: 195 FANHDCDSNAKLTTTGPHGLHIVACKAIALGEEITVTYGEDYFGDDNCECLCGTCE 250
>gi|320033677|gb|EFW15624.1| histone-lysine N-methyltransferase [Coccidioides posadasii str.
Silveira]
Length = 625
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 50/113 (44%), Positives = 66/113 (58%), Gaps = 1/113 (0%)
Query: 91 NDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCR 150
D I +L G + +T EEEK L T DFS++ S RK L+LGPA + NHDC PN R
Sbjct: 143 GDTIKHLSGTLVAITPEEEKTLDLT-RRDFSIVMSSRKKTPSLFLGPARFSNHDCNPNAR 201
Query: 151 FVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFA 203
V + + +RDI GEEIT YGE++FG NC C C TCE+ + ++
Sbjct: 202 LVTKGSEGMEIVAIRDISIGEEITVSYGENYFGVDNCECLCHTCELSLSNGWS 254
>gi|119192932|ref|XP_001247072.1| hypothetical protein CIMG_00843 [Coccidioides immitis RS]
gi|121771280|sp|Q1E9C0.1|SET9_COCIM RecName: Full=Histone-lysine N-methyltransferase SET9; AltName:
Full=SET domain protein 9
gi|392863692|gb|EAS35539.2| histone-lysine N-methyltransferase SET9 [Coccidioides immitis RS]
Length = 624
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 50/113 (44%), Positives = 66/113 (58%), Gaps = 1/113 (0%)
Query: 91 NDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCR 150
D I +L G + +T EEEK L T DFS++ S RK L+LGPA + NHDC PN R
Sbjct: 143 GDTIKHLSGTLVAITPEEEKTLDLT-RRDFSIVMSSRKKTPSLFLGPARFSNHDCNPNAR 201
Query: 151 FVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFA 203
V + + +RDI GEEIT YGE++FG NC C C TCE+ + ++
Sbjct: 202 LVTKGSEGMEIVAIRDISIGEEITVSYGENYFGVDNCECLCHTCELSLSNGWS 254
>gi|400594869|gb|EJP62698.1| histone-lysine N-methyltransferase set9 [Beauveria bassiana ARSEF
2860]
Length = 627
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 70/202 (34%), Positives = 101/202 (50%), Gaps = 17/202 (8%)
Query: 20 LLNKLNPSFWAFCVQALLA----QRFNIGRILDMGTQHHRYSYRIQ---YLP----ELKP 68
+LNK NP A LLA +RF+ G D + R R YLP E+
Sbjct: 66 VLNK-NP---AAAEAKLLATDGLRRFSAGLKTDKEREDFRRHLRRYINIYLPDCPWEVNS 121
Query: 69 CYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRK 128
RY++ + A I A R +N++I YL G +T EEE+ L+ DFS++ S R
Sbjct: 122 TNRYTIVTHE-ASITARRPIRRNESIKYLSGMKVHITPEEEE-LISNRKKDFSIVVSSRS 179
Query: 129 NCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCL 188
L++GPA + NHDC N + T + +R+I GEEIT YG+++FG+ NC
Sbjct: 180 KSTSLFMGPARFANHDCDANAALITTSHTGIEIVAVRNIAVGEEITVTYGDNYFGEDNCE 239
Query: 189 CECDTCEIRMKGAFAQSSVEPQ 210
C C TCE + + + +PQ
Sbjct: 240 CLCKTCEDGFRNGWKDAEGKPQ 261
>gi|406859839|gb|EKD12902.1| set domain containing protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 693
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 58/163 (35%), Positives = 86/163 (52%), Gaps = 6/163 (3%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
+ H Y YLP E+ RY++ + A + A R +K + + YL G +T+E
Sbjct: 101 RRHLRKYVNIYLPDCPFEVSSTNRYTIVTHEAA-VTARRYIKKGEVVKYLCGIQVIMTEE 159
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDI 167
EE+ L+ + DFS++ S R A L+LGPA + NHDC N R + + + DI
Sbjct: 160 EEE-LIKSSRRDFSIVVSSRNRTASLFLGPARFANHDCGANARLMTAGTSGMEIIAVADI 218
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQ 210
E G+EIT YG+++FG+ NC C C TCE +A EP+
Sbjct: 219 EIGDEITVSYGDNYFGEDNCECLCQTCENLGHSGWAAGQAEPE 261
>gi|303312461|ref|XP_003066242.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
gi|240105904|gb|EER24097.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
Length = 625
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 50/113 (44%), Positives = 66/113 (58%), Gaps = 1/113 (0%)
Query: 91 NDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCR 150
D I +L G + +T EEEK L T DFS++ S RK L+LGPA + NHDC PN R
Sbjct: 143 GDTIKHLSGTLVAITPEEEKTLDLT-RRDFSIVMSSRKKTPSLFLGPARFSNHDCNPNAR 201
Query: 151 FVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFA 203
V + + +RDI GEEIT YGE++FG NC C C TCE+ + ++
Sbjct: 202 LVTKGSEGMEIVAIRDISIGEEITVSYGENYFGVDNCECLCHTCELSLSNGWS 254
>gi|225559408|gb|EEH07691.1| SET domain-containing protein [Ajellomyces capsulatus G186AR]
Length = 721
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 65/178 (36%), Positives = 90/178 (50%), Gaps = 14/178 (7%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKW-EKNDNIAYLVGCIAELTK 106
+ H Y YLP E+ RY++ + A V+ RK+ I L G + +T+
Sbjct: 102 RRHLRKYISIYLPDCPFEVMTTNRYTITIHEAA--VSARKYIASGTTIRGLSGTLVPMTR 159
Query: 107 EEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRD 166
EEE L T DFS++ S RK ++LGPA + NHDC N R V ++ V +RD
Sbjct: 160 EEEIDLDLT-RKDFSIVMSSRKKTPSIFLGPARFANHDCNANARLVMRGPESMEVVAIRD 218
Query: 167 IEQGEEITCFYGEDFFGDKNCLCECDTCEIRMK------GAFAQSSVEPQTTQPSRYC 218
I GEEIT YGE++FG+ NC C C TCE+ ++ GA S + PS C
Sbjct: 219 IVVGEEITVSYGENYFGEDNCECLCHTCELALRNGWSPDGAMISSGTNSGASTPSELC 276
>gi|452985054|gb|EME84811.1| hypothetical protein MYCFIDRAFT_152998 [Pseudocercospora fijiensis
CIRAD86]
Length = 672
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/153 (36%), Positives = 83/153 (54%), Gaps = 7/153 (4%)
Query: 54 HRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE 109
H Y YLP E+ RY++ I+A ++ + + I YL G E+T++EE
Sbjct: 109 HMRKYINIYLPDCPFEVGTTNRYTITTAEAC-IIARKRVRRGEPIKYLTGIQVEMTEKEE 167
Query: 110 KRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQ 169
K L + DFS++ S RK L+LGPA + NHDC N + T ++ +DIE
Sbjct: 168 KML--SSRTDFSIVISSRKKRPSLFLGPARFANHDCDSNAKLTTTGAHGIHIQARKDIEI 225
Query: 170 GEEITCFYGEDFFGDKNCLCECDTCEIRMKGAF 202
G+EIT YGE++FG+ NC C C TCE+ + +
Sbjct: 226 GDEITVTYGENYFGEGNCECLCHTCEVATRNGW 258
>gi|156540534|ref|XP_001599251.1| PREDICTED: histone-lysine N-methyltransferase SUV420H1-like
[Nasonia vitripennis]
Length = 253
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 66/188 (35%), Positives = 102/188 (54%), Gaps = 17/188 (9%)
Query: 62 YLPELKP--------CYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLL 113
YL +KP C RYS N+G I AT+ W KN + YL G +LTKEE + L+
Sbjct: 42 YLLIVKPDSGFAVQICNRYSKNINQGVTICATKFWPKNAILKYLNGYTTKLTKEELEYLI 101
Query: 114 HTGLNDFSVMYSCRKNCAQ-LWLGPAAYINHDCRPNCRFVATERDTA-CVKVLRDIEQGE 171
+ L DFS+M + Q +WLGP +Y+NHDC PN + + + C++ ++I G+
Sbjct: 102 NANL-DFSIMSGTSTSENQRMWLGPGSYVNHDCEPNAKLYTLKMNGELCLQATKNIHVGD 160
Query: 172 EITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQS-SVEPQTTQPSRYCLRDTDMRLNKRK 230
EIT YG ++FG CEC TC+ K A A + ++ T + + D RL++++
Sbjct: 161 EITWNYGLEYFGTGE--CECMTCKKENKDAKAPALQIKTPATLKNAVVVLD---RLSRQQ 215
Query: 231 LHKKLNRL 238
++K N +
Sbjct: 216 MNKSNNSM 223
>gi|67539954|ref|XP_663751.1| hypothetical protein AN6147.2 [Aspergillus nidulans FGSC A4]
gi|74594617|sp|Q5AZY3.1|SET9_EMENI RecName: Full=Histone-lysine N-methyltransferase set9; AltName:
Full=SET domain protein 9
gi|40738743|gb|EAA57933.1| hypothetical protein AN6147.2 [Aspergillus nidulans FGSC A4]
gi|259479661|tpe|CBF70089.1| TPA: Histone-lysine N-methyltransferase set9 (EC 2.1.1.43)(SET
domain protein 9)
[Source:UniProtKB/Swiss-Prot;Acc:Q5AZY3] [Aspergillus
nidulans FGSC A4]
Length = 641
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 65/183 (35%), Positives = 91/183 (49%), Gaps = 6/183 (3%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
+ H Y YLP E+ RY++ + A I A + + I YL G + +TKE
Sbjct: 106 RKHLRKYIQMYLPDCPFEVTTTNRYTITEHEAA-ICARKFIPQGQEIKYLSGTLVPMTKE 164
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDI 167
EE R L DFS++ S R+ +LGPA + NHDC N R V + V RDI
Sbjct: 165 EE-RDLDLKRKDFSIVMSSRRKTPSFFLGPARFANHDCSANGRLVTRGSEGMQVVATRDI 223
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLN 227
GEEIT YGED+FG NC C C +CE + ++Q+ ++PS + ++ L
Sbjct: 224 YIGEEITVSYGEDYFGIDNCECLCLSCERVPRNGWSQNLAPGPQSKPSTPEPKASEDHLT 283
Query: 228 KRK 230
RK
Sbjct: 284 PRK 286
>gi|408399142|gb|EKJ78267.1| hypothetical protein FPSE_01728 [Fusarium pseudograminearum CS3096]
Length = 661
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 53/146 (36%), Positives = 79/146 (54%), Gaps = 6/146 (4%)
Query: 62 YLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGL 117
YLP E+ RY++ + A + A R +N+ I YL G +T EEE + +
Sbjct: 111 YLPDCPWEVSSTNRYTIVSHEAA-VTARRAIRRNEAIKYLSGVQVVITPEEEM-AISSQK 168
Query: 118 NDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
DFS++ S R C L++GPA + NHDC N + + T + R I+ GEEIT Y
Sbjct: 169 KDFSIVVSSRSKCTSLFMGPARFANHDCDANAKLMRTSHAGIEIIATRPIDAGEEITVTY 228
Query: 178 GEDFFGDKNCLCECDTCEIRMKGAFA 203
G+++FG+ NC C C TCE ++ A+
Sbjct: 229 GDNYFGENNCECLCKTCEDLLRNAWG 254
>gi|255935645|ref|XP_002558849.1| Pc13g04130 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211583469|emb|CAP91482.1| Pc13g04130 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 644
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 63/166 (37%), Positives = 87/166 (52%), Gaps = 11/166 (6%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
+ H Y YLP E+ RY + + A I A R +K + I +L G + +T E
Sbjct: 107 RRHLRKYIQMYLPDCPFEVTTTNRYIITQHEAA-ICARRFIKKGEEIKHLSGTLVSMTHE 165
Query: 108 EEKRLLHTGLN--DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLR 165
EE L GL DFS++ S R+ +LGPA + NHDC N + V +R
Sbjct: 166 EE---LDLGLTRKDFSIVMSSRRRAPSFFLGPARFANHDCDANGSLTTRGNEGMSVLAMR 222
Query: 166 DIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQT 211
DI +GEEIT YGED+FG NC C C TCEI ++ ++ S V+ +T
Sbjct: 223 DIHEGEEITVSYGEDYFGIDNCECLCHTCEIAVRNGWS-SHVDTET 267
>gi|46124303|ref|XP_386705.1| hypothetical protein FG06529.1 [Gibberella zeae PH-1]
gi|121813515|sp|Q4I8C9.1|SET9_GIBZE RecName: Full=Histone-lysine N-methyltransferase SET9; AltName:
Full=SET domain protein 9
Length = 662
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 53/145 (36%), Positives = 79/145 (54%), Gaps = 6/145 (4%)
Query: 62 YLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGL 117
YLP E+ RY++ + A + A R +N+ I YL G +T EEE + +
Sbjct: 111 YLPDCPWEVSSTNRYTIVSHEAA-VTARRAIRRNEAIKYLSGVQVVITPEEEM-AISSQK 168
Query: 118 NDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
DFS++ S R C L++GPA + NHDC N + + T + R I+ GEEIT Y
Sbjct: 169 KDFSIVVSSRSKCTSLFMGPARFANHDCDANAKLMRTSHAGIEIVATRPIDAGEEITVTY 228
Query: 178 GEDFFGDKNCLCECDTCEIRMKGAF 202
G+++FG+ NC C C TCE ++ A+
Sbjct: 229 GDNYFGENNCECLCKTCEDLLRNAW 253
>gi|89267105|emb|CAJ41970.1| hypothetical protein UHO_0294 [Ustilago hordei]
gi|388856443|emb|CCF49992.1| uncharacterized protein [Ustilago hordei]
Length = 1392
Score = 102 bits (253), Expect = 2e-19, Method: Composition-based stats.
Identities = 63/151 (41%), Positives = 79/151 (52%), Gaps = 22/151 (14%)
Query: 82 IVATRKWEKNDNIAYLVGCIAELTKEE------------EKR-------LLHTGLNDFSV 122
++A R ++ D I Y G + +LTK E EKR +L G DFSV
Sbjct: 183 VLAMRTFKPGDLINYCKGGLKDLTKSEDDALREEATASREKRKDTEYKGVLGPG-RDFSV 241
Query: 123 MYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFF 182
+ S RK C+QL LGPA ++NHDC PN F T KV+R I EEIT FYGE++F
Sbjct: 242 IRSARKGCSQLLLGPARFVNHDCNPNTEFYRMGA-TMVFKVIRPIHPNEEITTFYGENYF 300
Query: 183 GDKNCLCECDTCEIRMKGAFAQSSVEPQTTQ 213
N C C TCE R G F+ SV PQ +
Sbjct: 301 EWGNSECMCATCESRGTGVFSDPSV-PQPME 330
>gi|115443006|ref|XP_001218310.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|121733706|sp|Q0C9E6.1|SET9_ASPTN RecName: Full=Histone-lysine N-methyltransferase set9; AltName:
Full=SET domain protein 9
gi|114188179|gb|EAU29879.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 629
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 60/156 (38%), Positives = 82/156 (52%), Gaps = 6/156 (3%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
+ H Y YLP E+ RY++ + A I A + ++ I YL G + +T+E
Sbjct: 106 RRHLRKYIQMYLPDCPFEVTTTNRYTITEHEAA-ICARKFIKQGQEIKYLSGTLVPMTRE 164
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDI 167
EE+ L DFS++ S R+ +LGPA + NHDC N R V + V RDI
Sbjct: 165 EEQDL-DLKRKDFSIVMSSRRKTPSFFLGPARFANHDCDANGRLVTRGSEGMQVVATRDI 223
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFA 203
E GEEIT YGED+FG NC C C TCE ++ +A
Sbjct: 224 EIGEEITVSYGEDYFGIDNCECLCLTCERAVRNGWA 259
>gi|261190130|ref|XP_002621475.1| histone-lysine N-methyltransferase set9 [Ajellomyces dermatitidis
SLH14081]
gi|239591303|gb|EEQ73884.1| histone-lysine N-methyltransferase set9 [Ajellomyces dermatitidis
SLH14081]
Length = 718
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 60/161 (37%), Positives = 87/161 (54%), Gaps = 8/161 (4%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKW-EKNDNIAYLVGCIAELTK 106
+ H Y + YLP E+ RY++ + A V+ RK+ I L G + +T+
Sbjct: 102 RRHLRKYIVIYLPDCPFEVMTTNRYTITTHEAA--VSARKFIASGTTIRGLSGTLVPMTR 159
Query: 107 EEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRD 166
EEE L T DFS++ S RK ++LGPA + NHDC N R V ++ V RD
Sbjct: 160 EEEMDLDLT-RKDFSIVMSSRKKTPSIFLGPARFANHDCNANGRLVMRGPESMEVIATRD 218
Query: 167 IEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSV 207
I GEEIT YGE++FG+ NC C C TCE+ ++ ++ + V
Sbjct: 219 IVVGEEITVSYGENYFGEDNCECLCHTCELALRNGWSPNGV 259
>gi|327353061|gb|EGE81918.1| histone-lysine N-methyltransferase set9 [Ajellomyces dermatitidis
ATCC 18188]
Length = 718
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 60/161 (37%), Positives = 87/161 (54%), Gaps = 8/161 (4%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKW-EKNDNIAYLVGCIAELTK 106
+ H Y + YLP E+ RY++ + A V+ RK+ I L G + +T+
Sbjct: 102 RRHLRKYIVIYLPDCPFEVMTTNRYTITTHEAA--VSARKFIASGTTIRGLSGTLVPMTR 159
Query: 107 EEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRD 166
EEE L T DFS++ S RK ++LGPA + NHDC N R V ++ V RD
Sbjct: 160 EEEMDLDLT-RKDFSIVMSSRKKTPSIFLGPARFANHDCNANGRLVMRGPESMEVIATRD 218
Query: 167 IEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSV 207
I GEEIT YGE++FG+ NC C C TCE+ ++ ++ + V
Sbjct: 219 IVVGEEITVSYGENYFGEDNCECLCHTCELALRNGWSPNGV 259
>gi|239606363|gb|EEQ83350.1| histone-lysine N-methyltransferase set9 [Ajellomyces dermatitidis
ER-3]
Length = 718
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 60/161 (37%), Positives = 87/161 (54%), Gaps = 8/161 (4%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKW-EKNDNIAYLVGCIAELTK 106
+ H Y + YLP E+ RY++ + A V+ RK+ I L G + +T+
Sbjct: 102 RRHLRKYIVIYLPDCPFEVMTTNRYTITTHEAA--VSARKFIASGTTIRGLSGTLVPMTR 159
Query: 107 EEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRD 166
EEE L T DFS++ S RK ++LGPA + NHDC N R V ++ V RD
Sbjct: 160 EEEMDLDLT-RKDFSIVMSSRKKTPSIFLGPARFANHDCNANGRLVMRGPESMEVIATRD 218
Query: 167 IEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSV 207
I GEEIT YGE++FG+ NC C C TCE+ ++ ++ + V
Sbjct: 219 IVVGEEITVSYGENYFGEDNCECLCHTCELALRNGWSPNGV 259
>gi|302417114|ref|XP_003006388.1| histone-lysine N-methyltransferase SET9 [Verticillium albo-atrum
VaMs.102]
gi|261353990|gb|EEY16418.1| histone-lysine N-methyltransferase SET9 [Verticillium albo-atrum
VaMs.102]
Length = 494
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 53/140 (37%), Positives = 77/140 (55%), Gaps = 6/140 (4%)
Query: 62 YLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGL 117
YLP E+ RY++ + A I A R +N+ + YL G +T EEE R +
Sbjct: 112 YLPDCPFEVNSTNRYTIVSHEAA-ITARRFIARNETVKYLSGVQVMITPEEE-REMTARK 169
Query: 118 NDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
DFS++ S R CA L++GPA + NHDC+ N + V T + + +I+ G+EIT Y
Sbjct: 170 KDFSIIISSRNKCASLFMGPARFANHDCKANAKLVTTGQAGIEIIATTNIDVGDEITVTY 229
Query: 178 GEDFFGDKNCLCECDTCEIR 197
+ +FG+ NC C C TCE R
Sbjct: 230 ADSYFGEDNCECLCSTCERR 249
>gi|302893224|ref|XP_003045493.1| hypothetical protein NECHADRAFT_12117 [Nectria haematococca mpVI
77-13-4]
gi|256726419|gb|EEU39780.1| hypothetical protein NECHADRAFT_12117 [Nectria haematococca mpVI
77-13-4]
Length = 620
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 54/145 (37%), Positives = 81/145 (55%), Gaps = 6/145 (4%)
Query: 62 YLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGL 117
Y+P E+ RY++ + A + A R ++N++I YL G +T EEEK + +
Sbjct: 112 YMPDCPWEVSSTNRYTIVSHEAA-VHARRYIKRNESIKYLSGVQVIITPEEEK-AISSQK 169
Query: 118 NDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
DFS++ S R C L++GPA + NHDC N + + T + R I GEEIT Y
Sbjct: 170 KDFSIVVSSRSKCTSLFMGPARFANHDCDANAKLMRTSHAGIEIIATRAIGPGEEITVTY 229
Query: 178 GEDFFGDKNCLCECDTCEIRMKGAF 202
G+++FGD NC C C TCE ++ A+
Sbjct: 230 GDNYFGDNNCECLCKTCEDLLRNAW 254
>gi|451993217|gb|EMD85691.1| hypothetical protein COCHEDRAFT_1148087 [Cochliobolus
heterostrophus C5]
Length = 660
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 70/205 (34%), Positives = 100/205 (48%), Gaps = 18/205 (8%)
Query: 11 HVPIQLACHLLNKLNPSFWAFCVQALLA----QRFNIG----RILDMGTQHHRYSYRIQY 62
HV + H++ +P A VQ LL +RF +G R + +H R I Y
Sbjct: 51 HVADIIRQHVIWDKDP---AEAVQKLLQLPRLRRFLVGLKDERDQEQFRRHLRRYINI-Y 106
Query: 63 LPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLN 118
+P+ + RY++ + A I A R + I YL G +T+E+EK L
Sbjct: 107 MPDCQFDVTTTNRYTITEHE-ASITARRDINPREEIKYLTGVQVAMTEEQEK-TLELQRK 164
Query: 119 DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYG 178
DFS++ S RK L+LGPA + NHDC N + D + ++ I +GEEIT YG
Sbjct: 165 DFSLVISSRKKTRSLFLGPARFANHDCDANAKLTTKGYDGMQIVAIKPISEGEEITVSYG 224
Query: 179 EDFFGDKNCLCECDTCEIRMKGAFA 203
ED+FGD N C C TCE + +A
Sbjct: 225 EDYFGDDNEECLCSTCESLQQNGWA 249
>gi|119480659|ref|XP_001260358.1| SET domain protein [Neosartorya fischeri NRRL 181]
gi|119408512|gb|EAW18461.1| SET domain protein [Neosartorya fischeri NRRL 181]
Length = 622
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 65/188 (34%), Positives = 93/188 (49%), Gaps = 8/188 (4%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKW-EKNDNIAYLVGCIAELTK 106
+ H Y YLP E+ RY++ + A V RK+ ++ I YL G + +T+
Sbjct: 106 RRHLRKYIQMYLPDSPFEITTTNRYTITEHEAA--VCARKFIKQGQEIKYLSGTLVPMTR 163
Query: 107 EEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRD 166
EEE R L DFS++ S RK +LGPA + NHDC N + V + V RD
Sbjct: 164 EEE-RDLDLKRKDFSIVMSSRKKTPSFFLGPARFANHDCNANGKLVTRGSEGMQVVATRD 222
Query: 167 IEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRL 226
I GEEIT YG+D+FG NC C C TCE ++ +A ++ S L D ++ +
Sbjct: 223 IYIGEEITVSYGDDYFGIDNCECLCLTCERLVRNGWAPHVPSGPQSKASTPALNDDNLSI 282
Query: 227 NKRKLHKK 234
+ KK
Sbjct: 283 DSHVSSKK 290
>gi|169621111|ref|XP_001803966.1| hypothetical protein SNOG_13760 [Phaeosphaeria nodorum SN15]
gi|160704179|gb|EAT78784.2| hypothetical protein SNOG_13760 [Phaeosphaeria nodorum SN15]
Length = 673
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 55/146 (37%), Positives = 77/146 (52%), Gaps = 6/146 (4%)
Query: 62 YLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGL 117
Y+P E+ RY++ + A I A R + I YL G +T+E+EK L
Sbjct: 121 YMPDCPFEVTTTNRYTITDHE-ASITARRDINPREEIKYLTGVQVAMTEEQEK-TLELAR 178
Query: 118 NDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
DFS++ S RK L+LGPA + NHDC N R D + ++ I +G+EIT Y
Sbjct: 179 KDFSLVISSRKKTRSLFLGPARFANHDCDANARLSTKGYDGMQIVAVKPINEGDEITVSY 238
Query: 178 GEDFFGDKNCLCECDTCEIRMKGAFA 203
G+D+FGD N C C TCE R + +A
Sbjct: 239 GDDYFGDNNEECLCHTCEDRQQNGWA 264
>gi|121919568|sp|Q0U3A4.1|SET9_PHANO RecName: Full=Histone-lysine N-methyltransferase SET9; AltName:
Full=SET domain protein 9
Length = 662
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 55/146 (37%), Positives = 77/146 (52%), Gaps = 6/146 (4%)
Query: 62 YLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGL 117
Y+P E+ RY++ + A I A R + I YL G +T+E+EK L
Sbjct: 110 YMPDCPFEVTTTNRYTI-TDHEASITARRDINPREEIKYLTGVQVAMTEEQEK-TLELAR 167
Query: 118 NDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
DFS++ S RK L+LGPA + NHDC N R D + ++ I +G+EIT Y
Sbjct: 168 KDFSLVISSRKKTRSLFLGPARFANHDCDANARLSTKGYDGMQIVAVKPINEGDEITVSY 227
Query: 178 GEDFFGDKNCLCECDTCEIRMKGAFA 203
G+D+FGD N C C TCE R + +A
Sbjct: 228 GDDYFGDNNEECLCHTCEDRQQNGWA 253
>gi|302509332|ref|XP_003016626.1| SET domain protein [Arthroderma benhamiae CBS 112371]
gi|291180196|gb|EFE35981.1| SET domain protein [Arthroderma benhamiae CBS 112371]
Length = 535
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 58/161 (36%), Positives = 87/161 (54%), Gaps = 8/161 (4%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKW-EKNDNIAYLVGCIAELTK 106
+ H Y YLP E+ RY++ + A V+ RK+ + YL G + +TK
Sbjct: 101 RRHLRKYISIYLPDCPFEVTTTNRYTVLTHEAA--VSARKFIPSGHPVKYLSGTLVSITK 158
Query: 107 EEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRD 166
EEE L T DFS++ S RK A L+LGPA + NHDC N + V ++ V +D
Sbjct: 159 EEENDLDLTK-RDFSIVMSTRKKTASLFLGPARFANHDCNANAKLVTRGFESMEVVATQD 217
Query: 167 IEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSV 207
I+ G+EIT YG+++FG+ NC C C +CEI + ++ +
Sbjct: 218 IDIGDEITVSYGDNYFGEDNCECLCHSCEIAQRNGWSSPTA 258
>gi|443896440|dbj|GAC73784.1| histone tail methylase [Pseudozyma antarctica T-34]
Length = 1790
Score = 100 bits (250), Expect = 6e-19, Method: Composition-based stats.
Identities = 55/143 (38%), Positives = 74/143 (51%), Gaps = 19/143 (13%)
Query: 82 IVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGL------------------NDFSVM 123
++A R ++ + I++ G + +LTK E+ L DFSV+
Sbjct: 618 VLAMRTFKPGELISHCKGGLKDLTKSEDDALREEATASREIRKDSEYKGVLGPGRDFSVI 677
Query: 124 YSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFG 183
S RK C+QL LGPA ++NHDC PN F T KV+R I + EEIT FYGE++F
Sbjct: 678 RSARKGCSQLLLGPARFVNHDCNPNTEFYRMG-STMVFKVIRPIHRNEEITTFYGENYFE 736
Query: 184 DKNCLCECDTCEIRMKGAFAQSS 206
N C C TCE R GAF+ S
Sbjct: 737 WANAECMCATCEARGTGAFSDPS 759
>gi|145246426|ref|XP_001395462.1| histone-lysine N-methyltransferase set9 [Aspergillus niger CBS
513.88]
gi|134080178|emb|CAK46158.1| unnamed protein product [Aspergillus niger]
Length = 635
Score = 100 bits (249), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 67/188 (35%), Positives = 94/188 (50%), Gaps = 11/188 (5%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
+ H Y YLP E+ RY++ + A I A + ++ I YL G + +TKE
Sbjct: 106 RRHLRKYIQMYLPDSPFEVTTTNRYTVTEHEAA-ICARKFIKQGQEIKYLSGTLVPMTKE 164
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDI 167
EE+ L DFS++ S R+ +LGPA + NHDC N R V + V +R+I
Sbjct: 165 EEQDL-DLKRKDFSIVMSSRRKTPSFFLGPARFANHDCNANGRLVTRGMEGMQVVAIRNI 223
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFA---QSSVEPQTTQPS--RYCLRDT 222
GEEIT YGED+FG NC C C TCE ++ +A S E + P+ L +
Sbjct: 224 HIGEEITVSYGEDYFGIDNCECLCLTCERAVRNGWAPQLDSDQESPASTPAYDEPALSEA 283
Query: 223 DMRLNKRK 230
+ NKRK
Sbjct: 284 LLSPNKRK 291
>gi|346320259|gb|EGX89860.1| SET domain protein [Cordyceps militaris CM01]
Length = 1039
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 55/152 (36%), Positives = 80/152 (52%), Gaps = 6/152 (3%)
Query: 62 YLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGL 117
YLP E+ RY++ + A I A R +N++I YL G +T EEE L+
Sbjct: 114 YLPDCPWEVNSTNRYTIVTHE-ASITARRPIRRNESIKYLSGMKVHITPEEED-LISNRK 171
Query: 118 NDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
DFS++ S R L++GPA + NHDC N + T + + +R+I GEEIT Y
Sbjct: 172 KDFSIVVSSRSKSTSLFMGPARFANHDCNANAALMTTSQAGIEIVAMRNIAVGEEITVTY 231
Query: 178 GEDFFGDKNCLCECDTCEIRMKGAFAQSSVEP 209
G+++FG NC C C TCE K + + +P
Sbjct: 232 GDNYFGQDNCECLCKTCEDERKNGWQNAQGKP 263
>gi|325088481|gb|EGC41791.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus H88]
Length = 715
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 60/161 (37%), Positives = 87/161 (54%), Gaps = 8/161 (4%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKW-EKNDNIAYLVGCIAELTK 106
+ H Y YLP E+ RY++ + A V+ RK+ I L G + +T+
Sbjct: 102 RRHLRKYISIYLPDCPFEVMTTNRYTITIHEAA--VSARKYIASGTTIRGLSGTLVPMTR 159
Query: 107 EEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRD 166
EEE L T DFS++ S RK ++LGPA + NHDC N R V ++ V +RD
Sbjct: 160 EEEIDLDLT-RKDFSIVMSSRKKTPSIFLGPARFANHDCNANARLVMRGPESMEVVAIRD 218
Query: 167 IEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSV 207
I GEEIT YGE++FG+ NC C C TCE+ ++ ++ +S
Sbjct: 219 IVVGEEITVSYGENYFGEDNCECLCHTCELALRNGWSPNSA 259
>gi|346975811|gb|EGY19263.1| histone-lysine N-methyltransferase SET9 [Verticillium dahliae
VdLs.17]
Length = 686
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 53/141 (37%), Positives = 77/141 (54%), Gaps = 6/141 (4%)
Query: 62 YLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGL 117
YLP E+ RY++ + A I A R +N+ + YL G +T EEE R +
Sbjct: 112 YLPDCPFEVNSTNRYTIVSHEAA-ITARRFIARNETVKYLSGVQVMITPEEE-REMTARK 169
Query: 118 NDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
DFS++ S R CA L++GPA + NHDC+ N + V T + + +I+ G+EIT Y
Sbjct: 170 KDFSIIISSRNKCASLFMGPARFANHDCKANAKLVTTGQAGIEIIATTNIDVGDEITVTY 229
Query: 178 GEDFFGDKNCLCECDTCEIRM 198
+ +FG+ NC C C TCE R
Sbjct: 230 ADSYFGEDNCECLCSTCERRQ 250
>gi|440477489|gb|ELQ58535.1| histone-lysine N-methyltransferase SET9 [Magnaporthe oryzae P131]
Length = 885
Score = 100 bits (249), Expect = 8e-19, Method: Composition-based stats.
Identities = 61/181 (33%), Positives = 88/181 (48%), Gaps = 16/181 (8%)
Query: 54 HRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE 109
H Y YLP E+ RY++ + A + A R KN+ I L G ++ EE
Sbjct: 101 HLRRYLSIYLPDCPFEVNSTNRYTIFTHE-ASVTARRPIRKNEIIKALCGIQVVISPAEE 159
Query: 110 KRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQ 169
+ DFS++ S R L++GPA + NHDC N R ++ V+ LR+IE
Sbjct: 160 AEIAKRK-KDFSIVISSRSKSTSLFMGPARFANHDCGANARLKTADQSIMEVQALRNIEV 218
Query: 170 GEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFA----------QSSVEPQTTQPSRYCL 219
GEEIT YG+++FG+ NC C C TCE+ +A + S+E P+ Y L
Sbjct: 219 GEEITVTYGDNYFGEDNCECLCRTCELGRVNGWAGDEGDGNGGFEKSIEEDQGTPAGYSL 278
Query: 220 R 220
R
Sbjct: 279 R 279
>gi|350636815|gb|EHA25173.1| hypothetical protein ASPNIDRAFT_56722 [Aspergillus niger ATCC 1015]
Length = 591
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 67/188 (35%), Positives = 94/188 (50%), Gaps = 11/188 (5%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
+ H Y YLP E+ RY++ + A I A + ++ I YL G + +TKE
Sbjct: 106 RRHLRKYIQMYLPDSPFEVTTTNRYTVTEHEAA-ICARKFIKQGQEIKYLSGTLVPMTKE 164
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDI 167
EE+ L DFS++ S R+ +LGPA + NHDC N R V + V +R+I
Sbjct: 165 EEQDL-DLKRKDFSIVMSSRRKTPSFFLGPARFANHDCNANGRLVTRGMEGMQVVAIRNI 223
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFA---QSSVEPQTTQPS--RYCLRDT 222
GEEIT YGED+FG NC C C TCE ++ +A S E + P+ L +
Sbjct: 224 HIGEEITVSYGEDYFGIDNCECLCLTCERAVRNGWAPQLDSDQESPASTPAYDEPALSEA 283
Query: 223 DMRLNKRK 230
+ NKRK
Sbjct: 284 LLSPNKRK 291
>gi|440462615|gb|ELQ32622.1| histone-lysine N-methyltransferase SET9 [Magnaporthe oryzae Y34]
Length = 909
Score = 100 bits (248), Expect = 8e-19, Method: Composition-based stats.
Identities = 61/181 (33%), Positives = 88/181 (48%), Gaps = 16/181 (8%)
Query: 54 HRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE 109
H Y YLP E+ RY++ + A + A R KN+ I L G ++ EE
Sbjct: 125 HLRRYLSIYLPDCPFEVNSTNRYTIFTHE-ASVTARRPIRKNEIIKALCGIQVVISPAEE 183
Query: 110 KRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQ 169
+ DFS++ S R L++GPA + NHDC N R ++ V+ LR+IE
Sbjct: 184 AEIAKRK-KDFSIVISSRSKSTSLFMGPARFANHDCGANARLKTADQSIMEVQALRNIEV 242
Query: 170 GEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFA----------QSSVEPQTTQPSRYCL 219
GEEIT YG+++FG+ NC C C TCE+ +A + S+E P+ Y L
Sbjct: 243 GEEITVTYGDNYFGEDNCECLCRTCELGRVNGWAGDEGDGNGGFEKSIEEDQGTPAGYSL 302
Query: 220 R 220
R
Sbjct: 303 R 303
>gi|240282346|gb|EER45849.1| SET domain-containing protein [Ajellomyces capsulatus H143]
Length = 636
Score = 100 bits (248), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 60/161 (37%), Positives = 87/161 (54%), Gaps = 8/161 (4%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKW-EKNDNIAYLVGCIAELTK 106
+ H Y YLP E+ RY++ + A V+ RK+ I L G + +T+
Sbjct: 77 RRHLRKYISIYLPDCPFEVMTTNRYTITIHEAA--VSARKYIASGTTIRGLSGTLVPMTR 134
Query: 107 EEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRD 166
EEE L T DFS++ S RK ++LGPA + NHDC N R V ++ V +RD
Sbjct: 135 EEEIDLDLT-RKDFSIVMSSRKKTPSIFLGPARFANHDCNANARLVMRGPESMEVVAIRD 193
Query: 167 IEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSV 207
I GEEIT YGE++FG+ NC C C TCE+ ++ ++ +S
Sbjct: 194 IVVGEEITVSYGENYFGEDNCECLCHTCELALRNGWSPNSA 234
>gi|389627236|ref|XP_003711271.1| histone-lysine N-methyltransferase SET9 [Magnaporthe oryzae 70-15]
gi|351643603|gb|EHA51464.1| histone-lysine N-methyltransferase SET9 [Magnaporthe oryzae 70-15]
Length = 885
Score = 100 bits (248), Expect = 8e-19, Method: Composition-based stats.
Identities = 61/181 (33%), Positives = 88/181 (48%), Gaps = 16/181 (8%)
Query: 54 HRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE 109
H Y YLP E+ RY++ + A + A R KN+ I L G ++ EE
Sbjct: 101 HLRRYLSIYLPDCPFEVNSTNRYTIFTHE-ASVTARRPIRKNEIIKALCGIQVVISPAEE 159
Query: 110 KRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQ 169
+ DFS++ S R L++GPA + NHDC N R ++ V+ LR+IE
Sbjct: 160 AEIAKRK-KDFSIVISSRSKSTSLFMGPARFANHDCGANARLKTADQSIMEVQALRNIEV 218
Query: 170 GEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFA----------QSSVEPQTTQPSRYCL 219
GEEIT YG+++FG+ NC C C TCE+ +A + S+E P+ Y L
Sbjct: 219 GEEITVTYGDNYFGEDNCECLCRTCELGRVNGWAGDEGDGNGGFEKSIEEDQGTPAGYSL 278
Query: 220 R 220
R
Sbjct: 279 R 279
>gi|353245147|emb|CCA76218.1| hypothetical protein PIIN_10211 [Piriformospora indica DSM 11827]
Length = 1229
Score = 100 bits (248), Expect = 8e-19, Method: Composition-based stats.
Identities = 68/187 (36%), Positives = 92/187 (49%), Gaps = 17/187 (9%)
Query: 36 LLAQRFNIGRILDMGTQH-------HRYSYRIQYLP----ELKPCYRYSLE-GNRGARIV 83
+L Q N+ R + +Q+ H Y YLP E+ RYS + G ++
Sbjct: 134 ILLQLRNVQRYIWHKSQYQINAFATHMSRYLELYLPGGSIEVAQTSRYSHKTGKSELCVI 193
Query: 84 ATRKWEKNDNIAYLVGCIAELTKEEEKRLLHT----GLNDFSVMYSCRKNCAQLWLGPAA 139
A + + I L G +A+LT EE++ L DFSV++S +N L+LGPA
Sbjct: 194 AVKPLKPGQTITDLKGSMADLTAEEDEELKEIRGAGARRDFSVIFSHSRNVNHLFLGPAR 253
Query: 140 YINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMK 199
++NHDC NC R KVLR I GEEIT YGED+FG N C C TCE +
Sbjct: 254 FVNHDCENNCELFRDGR-YITFKVLRPIAAGEEITAHYGEDYFGKNNRRCLCATCEHKGI 312
Query: 200 GAFAQSS 206
G +A S
Sbjct: 313 GGYAIES 319
>gi|344269661|ref|XP_003406667.1| PREDICTED: histone-lysine N-methyltransferase SUV420H2-like
[Loxodonta africana]
Length = 319
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 47/91 (51%), Positives = 62/91 (68%), Gaps = 3/91 (3%)
Query: 141 INHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKG 200
+ DC+PNC+FV + + ACVKVLRDIE G+E+TCFYG+ FFG+KN CEC TCE + +G
Sbjct: 155 LAPDCKPNCKFVPADGNAACVKVLRDIEPGDEVTCFYGDGFFGEKNEHCECYTCERKGEG 214
Query: 201 AFAQSSVEPQTTQP---SRYCLRDTDMRLNK 228
AF EP+ P +Y LR+T RL +
Sbjct: 215 AFRLRPKEPRLPTPRALDKYELRETKRRLQE 245
>gi|315050540|ref|XP_003174644.1| histone-lysine N-methyltransferase set9 [Arthroderma gypseum CBS
118893]
gi|311339959|gb|EFQ99161.1| histone-lysine N-methyltransferase set9 [Arthroderma gypseum CBS
118893]
Length = 638
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 68/208 (32%), Positives = 103/208 (49%), Gaps = 11/208 (5%)
Query: 5 VHNLILHVPIQLACHLLNKLNPSFWAFCVQALLAQRFNIGRILDMGTQHHRYSYRIQYLP 64
V +++LH I ++K F A R R + +H R I YLP
Sbjct: 57 VTSILLHDVI--VAKDVSKAEKEFLALPGIKKFTDRLRSDREKEWFRRHLRKYISI-YLP 113
Query: 65 ----ELKPCYRYSLEGNRGARIVATRKW-EKNDNIAYLVGCIAELTKEEEKRLLHTGLND 119
E+ RY++ + A V+ RK+ + YL G + +TKEEE L T D
Sbjct: 114 DCPFEVATTNRYTVLTHEAA--VSARKFIPSGHPVKYLSGTLVSITKEEENDLDLTK-RD 170
Query: 120 FSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGE 179
FS++ S RK A L+LGPA + NHDC N + V ++ V RDI+ G+EIT YG+
Sbjct: 171 FSIVMSTRKKTASLFLGPARFANHDCNANAKLVTRGFESMEVVATRDIDVGDEITVSYGD 230
Query: 180 DFFGDKNCLCECDTCEIRMKGAFAQSSV 207
++FG+ NC C C +CE+ + ++ +
Sbjct: 231 NYFGEDNCECLCHSCEMAQRNGWSSPTA 258
>gi|358369805|dbj|GAA86418.1| SET domain protein [Aspergillus kawachii IFO 4308]
Length = 635
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/156 (37%), Positives = 82/156 (52%), Gaps = 6/156 (3%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
+ H Y YLP E+ RY++ + A I A + ++ I YL G + +TKE
Sbjct: 106 RRHLRKYIQMYLPDSPFEVTTTNRYTVTEHEAA-ICARKFIKQGQEIKYLSGTLVPMTKE 164
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDI 167
EE+ L DFS++ S R+ +LGPA + NHDC N R V + V +R+I
Sbjct: 165 EEQDL-DLKRKDFSIVMSSRRKTPSFFLGPARFANHDCNANGRLVTRGMEGMQVVAIRNI 223
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFA 203
GEEIT YGED+FG NC C C TCE ++ +A
Sbjct: 224 HIGEEITVSYGEDYFGIDNCECLCLTCERAVRNGWA 259
>gi|85109675|ref|XP_963033.1| hypothetical protein NCU08551 [Neurospora crassa OR74A]
gi|74628935|sp|Q7SBJ9.1|SET9_NEUCR RecName: Full=Histone-lysine N-methyltransferase set-9; AltName:
Full=SET domain protein 9
gi|28924681|gb|EAA33797.1| hypothetical protein NCU08551 [Neurospora crassa OR74A]
Length = 783
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 79/260 (30%), Positives = 116/260 (44%), Gaps = 49/260 (18%)
Query: 54 HRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE 109
H Y YLP E+ RY++ A I A R ++N+ I YL G +T EEE
Sbjct: 105 HLRRYMSIYLPDCPFEVNATNRYTIVTYE-ASITARRFIQRNETIKYLAGIQVVITPEEE 163
Query: 110 KRLLHTGL--NDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVL--R 165
L L DFS++ S R L++GPA + NHDC N R + R A ++++ R
Sbjct: 164 ---LEMSLRKKDFSLIVSSRSKSTSLFMGPARFANHDCNANARLIT--RGQAGIEIIACR 218
Query: 166 DIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKG--------AFAQSSVEPQTT----- 212
+IE GEEIT Y E +FG+ NC C C TCE ++ A Q S+E +
Sbjct: 219 NIEVGEEITVTYSESYFGENNCDCLCATCESNLRNGWRPVDGEAAVQKSIEDEQPTESST 278
Query: 213 ---------------QPSRYCLRDTDMRLNKRKLHKKLNRLLLASDKNDTNSSDNSSKYS 257
Q SR DMR R L K ++++L + T+S+ +
Sbjct: 279 PYSFRRKRRYGSTALQASRTPSVTPDMR--PRVLRKSQSQMMLGERTSTTDSAAQGAGA- 335
Query: 258 LVQWNNNSKENIFDLGTFPL 277
+ S++ ++GT P
Sbjct: 336 ----DGQSRKRALEMGTPPF 351
>gi|451850087|gb|EMD63389.1| hypothetical protein COCSADRAFT_37170 [Cochliobolus sativus ND90Pr]
Length = 660
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 66/186 (35%), Positives = 92/186 (49%), Gaps = 15/186 (8%)
Query: 30 AFCVQALLA----QRFNIG----RILDMGTQHHRYSYRIQYLPE----LKPCYRYSLEGN 77
A VQ LL +RF +G R + +H R I Y+P+ + RY++ +
Sbjct: 67 AEAVQKLLQLPRLRRFLVGLKDERDQEQFRRHLRRYVNI-YMPDCQFDVTTTNRYTITEH 125
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGP 137
A I A R + I YL G +T+E+EK L DFS++ S RK L+LGP
Sbjct: 126 E-ASITARRDINPREEIKYLTGVQVAMTEEQEK-TLELQRKDFSLVISSRKKTRSLFLGP 183
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIR 197
A + NHDC N + D + ++ I +GEEIT YGED+FGD N C C TCE
Sbjct: 184 ARFANHDCDANAKLTTKGYDGMQIVAIKPISEGEEITVSYGEDYFGDDNEECLCSTCESL 243
Query: 198 MKGAFA 203
+ +A
Sbjct: 244 QQNGWA 249
>gi|367023212|ref|XP_003660891.1| hypothetical protein MYCTH_73315 [Myceliophthora thermophila ATCC
42464]
gi|347008158|gb|AEO55646.1| hypothetical protein MYCTH_73315 [Myceliophthora thermophila ATCC
42464]
Length = 667
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 58/148 (39%), Positives = 75/148 (50%), Gaps = 6/148 (4%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
+ H Y YLP E+ RY++E A I A R +N+ I YL G +T E
Sbjct: 102 RSHLRRYMSIYLPDCPFEVNATNRYTIESYE-ASITARRPIRRNEAIKYLAGIQVTVTPE 160
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDI 167
EE +L DFS++ S R L++GPA + NHDC N R V + + RDI
Sbjct: 161 EEAQLALRK-KDFSLVVSSRSKLTSLFMGPARFANHDCSANARLVTGGQAGIQIFACRDI 219
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCE 195
GEEIT Y E +FG+ NC C C TCE
Sbjct: 220 AVGEEITVTYSESYFGENNCECLCQTCE 247
>gi|317157355|ref|XP_001826424.2| hypothetical protein AOR_1_1434054 [Aspergillus oryzae RIB40]
Length = 1115
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/156 (37%), Positives = 81/156 (51%), Gaps = 6/156 (3%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
+ H Y YLP E+ RY++ + A I A + ++ I YL G + +T+E
Sbjct: 106 RRHLRKYIQMYLPDSPFEVTTTNRYTITEHEAA-ICARKFIKQGQEIKYLSGTLVPMTRE 164
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDI 167
EE+ L DFS++ S R+ +LGPA + NHDC N R V + V RDI
Sbjct: 165 EEQEL-DLKRKDFSIVMSSRRKTPSFFLGPARFANHDCNANGRLVTRGSEGMQVVATRDI 223
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFA 203
GEEIT YG+D+FG NC C C TCE ++ +A
Sbjct: 224 YIGEEITVSYGDDYFGIDNCECLCLTCERAVRNGWA 259
>gi|238493749|ref|XP_002378111.1| SET domain protein [Aspergillus flavus NRRL3357]
gi|121797842|sp|Q2TZH4.1|SET9_ASPOR RecName: Full=Histone-lysine N-methyltransferase set9; AltName:
Full=SET domain protein 9
gi|83775168|dbj|BAE65291.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220696605|gb|EED52947.1| SET domain protein [Aspergillus flavus NRRL3357]
Length = 625
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/156 (37%), Positives = 81/156 (51%), Gaps = 6/156 (3%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
+ H Y YLP E+ RY++ + A I A + ++ I YL G + +T+E
Sbjct: 106 RRHLRKYIQMYLPDSPFEVTTTNRYTITEHEAA-ICARKFIKQGQEIKYLSGTLVPMTRE 164
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDI 167
EE+ L DFS++ S R+ +LGPA + NHDC N R V + V RDI
Sbjct: 165 EEQEL-DLKRKDFSIVMSSRRKTPSFFLGPARFANHDCNANGRLVTRGSEGMQVVATRDI 223
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFA 203
GEEIT YG+D+FG NC C C TCE ++ +A
Sbjct: 224 YIGEEITVSYGDDYFGIDNCECLCLTCERAVRNGWA 259
>gi|171685382|ref|XP_001907632.1| hypothetical protein [Podospora anserina S mat+]
gi|170942652|emb|CAP68304.1| unnamed protein product [Podospora anserina S mat+]
Length = 674
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 59/157 (37%), Positives = 78/157 (49%), Gaps = 6/157 (3%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
+ H Y YLP E+ RY++ A IVA R +N+ I YL G +T E
Sbjct: 101 KAHMRRYMSTYLPDCPFEVNATNRYTIVTYE-ASIVARRFIRRNETIKYLAGIQVTITPE 159
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDI 167
EE + + DFS++ S R L++GPA + NHDC N R V + + RDI
Sbjct: 160 EEADMA-SRKKDFSLVVSSRSKSTSLFMGPARFANHDCDANARLVTCGQAGIEIIACRDI 218
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQ 204
GEEIT Y E +FG+ NC C C TCE R + Q
Sbjct: 219 GVGEEITVSYSESYFGEDNCECLCQTCEARQVNGWQQ 255
>gi|391869503|gb|EIT78700.1| histone tail methylase [Aspergillus oryzae 3.042]
Length = 625
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/156 (37%), Positives = 81/156 (51%), Gaps = 6/156 (3%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
+ H Y YLP E+ RY++ + A I A + ++ I YL G + +T+E
Sbjct: 106 RRHLRKYIQMYLPDSPFEVTTTNRYTITEHEAA-ICARKFIKQGQEIKYLSGTLVPMTRE 164
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDI 167
EE+ L DFS++ S R+ +LGPA + NHDC N R V + V RDI
Sbjct: 165 EEQEL-DLKRKDFSIVMSSRRKTPSFFLGPARFANHDCNANGRLVTRGSEGMQVVATRDI 223
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFA 203
GEEIT YG+D+FG NC C C TCE ++ +A
Sbjct: 224 YIGEEITVSYGDDYFGIDNCECLCLTCERAVRNGWA 259
>gi|440634968|gb|ELR04887.1| hypothetical protein GMDG_07112 [Geomyces destructans 20631-21]
Length = 661
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 97/193 (50%), Gaps = 19/193 (9%)
Query: 44 GRILDMGTQHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVG 99
G+ D +H R I YLP E+ RY++ + A + A R +K + + YL G
Sbjct: 94 GKERDDFRKHMRRYVNI-YLPDCPFEITSTNRYTVVTHE-ASVTARRLIKKGETVKYLCG 151
Query: 100 CIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTA 159
+TKEEE + + DFS++ S R A L+LGPA + NHDC N + T
Sbjct: 152 VQVIMTKEEEADI-NQRRRDFSIVVSSRNKSASLFLGPARFANHDCGANAELMRTGAAGM 210
Query: 160 CVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAF--------AQSSVEPQT 211
+ +RDIE G+EIT YG+++FG+ NC C C TCE ++ + + + +P
Sbjct: 211 EIIAVRDIEVGDEITVTYGDNYFGEDNCECLCKTCEDNLENGWGLEDDDGSSTPATKPSI 270
Query: 212 TQPS----RYCLR 220
QP+ Y LR
Sbjct: 271 EQPTVVIGSYSLR 283
>gi|71001008|ref|XP_755185.1| SET domain protein [Aspergillus fumigatus Af293]
gi|74675744|sp|Q4X1W8.1|SET9_ASPFU RecName: Full=Histone-lysine N-methyltransferase set9; AltName:
Full=SET domain protein 9
gi|66852823|gb|EAL93147.1| SET domain protein [Aspergillus fumigatus Af293]
gi|159129275|gb|EDP54389.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 622
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 63/166 (37%), Positives = 85/166 (51%), Gaps = 9/166 (5%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKW-EKNDNIAYLVGCIAELTK 106
+ H Y YLP E+ RY++ A V RK+ ++ I YL G + +T+
Sbjct: 106 RRHLRKYIQMYLPDSPFEITTTNRYTITEYEAA--VCARKFIKQGQEIKYLSGTLVPMTR 163
Query: 107 EEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRD 166
EEE R L DFS++ S RK +LGPA + NHDC N + V + V RD
Sbjct: 164 EEE-RDLDLKRKDFSIVMSSRKKTPSFFLGPARFANHDCNANGKLVTRGSEGMQVVATRD 222
Query: 167 IEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFA-QSSVEPQT 211
I GEEIT YG+D+FG NC C C TCE ++ +A EPQ+
Sbjct: 223 IYIGEEITVSYGDDYFGIDNCECLCLTCERLVRNGWAPHVPSEPQS 268
>gi|326478898|gb|EGE02908.1| histone-lysine N-methyltransferase set9 [Trichophyton equinum CBS
127.97]
Length = 639
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 57/161 (35%), Positives = 87/161 (54%), Gaps = 8/161 (4%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKW-EKNDNIAYLVGCIAELTK 106
+ H Y YLP E+ RY++ + A V+ RK+ + YL G + +TK
Sbjct: 101 RRHLRKYISMYLPDCPFEVTTTNRYTVLTHEAA--VSARKFIPSGHPVKYLSGTLVSITK 158
Query: 107 EEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRD 166
EEE L T DFS++ S RK A L+LGPA + NHDC N + V ++ V +D
Sbjct: 159 EEENDLDLTK-RDFSIVMSTRKKTASLFLGPARFANHDCNANAKLVTRGFESMEVVATQD 217
Query: 167 IEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSV 207
I+ G+EIT YG+++FG+ NC C C +CE+ + ++ +
Sbjct: 218 IDIGDEITVSYGDNYFGEDNCECLCHSCELAQRNGWSSPTA 258
>gi|326469576|gb|EGD93585.1| histone-lysine N-methyltransferase [Trichophyton tonsurans CBS
112818]
Length = 639
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 57/157 (36%), Positives = 86/157 (54%), Gaps = 8/157 (5%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKW-EKNDNIAYLVGCIAELTK 106
+ H Y YLP E+ RY++ + A V+ RK+ + YL G + +TK
Sbjct: 101 RRHLRKYISMYLPDCPFEVTTTNRYTVLTHEAA--VSARKFIPSGHPVKYLSGTLVSITK 158
Query: 107 EEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRD 166
EEE L T DFS++ S RK A L+LGPA + NHDC N + V ++ V +D
Sbjct: 159 EEENDLDLTK-RDFSIVMSTRKKTASLFLGPARFANHDCNANAKLVTRGFESMEVVATQD 217
Query: 167 IEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFA 203
I+ G+EIT YG+++FG+ NC C C +CE+ + ++
Sbjct: 218 IDIGDEITVSYGDNYFGEDNCECLCHSCELAQRNGWS 254
>gi|320588263|gb|EFX00738.1| set domain containing protein [Grosmannia clavigera kw1407]
Length = 750
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 65/183 (35%), Positives = 92/183 (50%), Gaps = 15/183 (8%)
Query: 54 HRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE 109
H Y YLP E+ RY++ + A I A R + ++ I YL G +T EEE
Sbjct: 106 HMRRYMQIYLPDCPYEVASTNRYTI-FQQEASISARRFIKSHETIKYLSGIQVVITPEEE 164
Query: 110 KRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQ 169
+L DFS++ S RK L++GPA + NHDC N R V T + + + I
Sbjct: 165 AQLSERK-KDFSIVISSRKKEVNLFMGPARFANHDCDANARLVTTGQSGIEIIATKPIGV 223
Query: 170 GEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFA--------QSSVEPQTTQPSR-YCLR 220
G+EIT YGE++FG+ NC C C TCE R+ +A + S+E + P+R Y R
Sbjct: 224 GDEITVTYGENYFGEDNCECLCKTCEGRLVNGWAPGEGTAPVKRSIEVGASTPTRGYSFR 283
Query: 221 DTD 223
D
Sbjct: 284 RRD 286
>gi|340959165|gb|EGS20346.1| methyltransferase-like protein [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 609
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 69/116 (59%), Gaps = 5/116 (4%)
Query: 82 IVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYI 141
+ A R +KN+ I YL G +T EEE R + DFS++ S R L++GPA +
Sbjct: 136 VTARRFIKKNETIKYLSGTQVTVTPEEE-REMALRKKDFSLIVSSRSKSTSLFMGPARFA 194
Query: 142 NHDCRPNCRFVATERDTACVKVL--RDIEQGEEITCFYGEDFFGDKNCLCECDTCE 195
NHDC N R V R A ++++ RDIE GEEIT YG+++FG+ NC C C TCE
Sbjct: 195 NHDCNANARLVT--RGPAGIEIVACRDIEVGEEITVTYGDNYFGENNCECLCATCE 248
>gi|295660192|ref|XP_002790653.1| histone-lysine N-methyltransferase set9 [Paracoccidioides sp.
'lutzii' Pb01]
gi|226281528|gb|EEH37094.1| histone-lysine N-methyltransferase set9 [Paracoccidioides sp.
'lutzii' Pb01]
Length = 639
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 58/161 (36%), Positives = 86/161 (53%), Gaps = 8/161 (4%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKW-EKNDNIAYLVGCIAELTK 106
+ H Y + YLP E+ RY++ + A V+ RK+ I L G + +T+
Sbjct: 73 RRHLRKYIVIYLPDCPFEVMTTNRYTITMHEAA--VSARKFIASGTTIKGLSGTLVPMTR 130
Query: 107 EEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRD 166
EEE L T DFS++ S RK ++LGPA + NHDC N R V ++ V RD
Sbjct: 131 EEEIDLDMT-RKDFSIVMSSRKKTPSIFLGPARFANHDCNANGRLVMRGPESMEVVATRD 189
Query: 167 IEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSV 207
I GEEIT YG+++FG+ NC C C +CE+ ++ + + V
Sbjct: 190 IVVGEEITVSYGDNYFGEDNCECLCHSCELALRNGWDPNGV 230
>gi|302656560|ref|XP_003020032.1| SET domain protein [Trichophyton verrucosum HKI 0517]
gi|291183812|gb|EFE39408.1| SET domain protein [Trichophyton verrucosum HKI 0517]
Length = 535
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 57/161 (35%), Positives = 86/161 (53%), Gaps = 8/161 (4%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKW-EKNDNIAYLVGCIAELTK 106
+ H Y YLP E+ RY++ + A V+ RK+ + YL G + +TK
Sbjct: 101 RRHLRKYISIYLPDCPFEVTTTNRYTVLTHEAA--VSARKFIPSGHPVKYLSGTLVSITK 158
Query: 107 EEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRD 166
EEE L T DFS++ S RK A L+LGPA + NHDC N + V ++ V +D
Sbjct: 159 EEENDLDLTK-RDFSIVMSTRKKTASLFLGPARFANHDCNANAKLVTRGFESMEVVATQD 217
Query: 167 IEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSV 207
I+ G+EIT YG+++F + NC C C +CE+ + ++ S
Sbjct: 218 IDIGDEITVSYGDNYFSEDNCECLCHSCEMAQRNGWSSPSA 258
>gi|327303388|ref|XP_003236386.1| histone-lysine N-methyltransferase [Trichophyton rubrum CBS 118892]
gi|326461728|gb|EGD87181.1| histone-lysine N-methyltransferase [Trichophyton rubrum CBS 118892]
Length = 639
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 78/243 (32%), Positives = 115/243 (47%), Gaps = 40/243 (16%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKW-EKNDNIAYLVGCIAELTK 106
+ H Y YLP E+ RY++ + A V+ RK+ + YL G + +TK
Sbjct: 101 RRHLRKYISIYLPDCPFEVTTTNRYTVLTHEAA--VSARKFIPSGHPVKYLSGTLVSITK 158
Query: 107 EEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRD 166
EEE L T DFS++ S RK A L+LGPA + NHDC N + V ++ V +D
Sbjct: 159 EEENDLDLTK-RDFSIVMSTRKKTASLFLGPARFANHDCNANAKLVTRGFESMEVVATQD 217
Query: 167 IEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGA------------------FAQSSVE 208
I+ G EIT YG+++FG+ NC C C +CE+ + FA +S
Sbjct: 218 IDIGNEITVSYGDNYFGEDNCECLCHSCEMAQRNGWSSPTASRVNSWVHSSSEFADASEH 277
Query: 209 PQTTQPSRYCLR---DTDMRLN------KRKLHKKLNRLLLASDKNDT---NSSDNSSKY 256
P ++ S C + +TD + KRK+ +K + L L NDT ++ N
Sbjct: 278 PASS--SNGCEKHELETDSESSDGFLGKKRKVERKSSNLRLQESVNDTVDVDAKSNQGSP 335
Query: 257 SLV 259
SLV
Sbjct: 336 SLV 338
>gi|336470258|gb|EGO58420.1| hypothetical protein NEUTE1DRAFT_82859 [Neurospora tetrasperma FGSC
2508]
gi|350290033|gb|EGZ71247.1| histone-lysine N-methyltransferase set-9 [Neurospora tetrasperma
FGSC 2509]
Length = 777
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 79/260 (30%), Positives = 115/260 (44%), Gaps = 49/260 (18%)
Query: 54 HRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE 109
H Y YLP E+ RY++ A I A R ++N+ I YL G +T EEE
Sbjct: 105 HLRRYMSIYLPDCPFEVNATNRYTIVTYE-ASITARRFIQRNETIKYLAGIQVVITPEEE 163
Query: 110 KRLLHTGL--NDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVL--R 165
L L DFS++ S R L++GPA + NHDC N R + R A ++++ R
Sbjct: 164 ---LEMSLRKKDFSLIVSSRSKSTSLFMGPARFANHDCNANARLIT--RGQAGIEIIACR 218
Query: 166 DIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKG--------AFAQSSVEPQTT----- 212
+IE GEEIT Y E +FG+ NC C C TCE ++ A Q S+E +
Sbjct: 219 NIEVGEEITVTYSESYFGENNCDCLCATCESNLRNGWRPVDGEAAVQKSIEDEQPTESST 278
Query: 213 ---------------QPSRYCLRDTDMRLNKRKLHKKLNRLLLASDKNDTNSSDNSSKYS 257
Q SR DMR R L K ++++L + T+S+
Sbjct: 279 PYSFRRKRRYGSTALQASRTPSVTPDMR--PRVLRKSQSQMMLGERTSTTDSAAQGQGA- 335
Query: 258 LVQWNNNSKENIFDLGTFPL 277
+ S++ ++GT P
Sbjct: 336 ----DGLSRKRALEMGTPPF 351
>gi|425766500|gb|EKV05109.1| Histone-lysine N-methyltransferase set9 [Penicillium digitatum Pd1]
gi|425775346|gb|EKV13624.1| Histone-lysine N-methyltransferase set9 [Penicillium digitatum
PHI26]
Length = 668
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 58/158 (36%), Positives = 80/158 (50%), Gaps = 10/158 (6%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
+ H Y YLP E+ RY + + A I A +K + I +L G + +T E
Sbjct: 107 RRHLRKYIQMYLPDCPFEVTTTNRYIITQHEAA-ICARSLIKKGEEIKHLSGTLVSMTHE 165
Query: 108 EEKRLLHTGLN--DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLR 165
EE L GL DFS++ S R+ +LGPA + NHDC N + V +R
Sbjct: 166 EE---LDLGLTRKDFSIVMSSRRKAPSFFLGPARFANHDCDANGSLTTRGNEGMSVVAMR 222
Query: 166 DIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFA 203
DI +GEEIT YGED+FG NC C C TCE ++ ++
Sbjct: 223 DIHEGEEITVSYGEDYFGVDNCECLCHTCERAVRNGWS 260
>gi|225678871|gb|EEH17155.1| histone-lysine N-methyltransferase set9 [Paracoccidioides
brasiliensis Pb03]
Length = 702
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 58/161 (36%), Positives = 86/161 (53%), Gaps = 8/161 (4%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKW-EKNDNIAYLVGCIAELTK 106
+ H Y + YLP E+ RY++ + A V+ RK+ I L G + +T+
Sbjct: 73 RRHLRKYIVIYLPDCPFEVMTTNRYTITMHEAA--VSARKFIASGTTIKGLRGTLVPMTR 130
Query: 107 EEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRD 166
EEE L T DFS++ S RK ++LGPA + NHDC N R V ++ V RD
Sbjct: 131 EEEIDLDMT-RKDFSIVMSSRKKTPSIFLGPARFANHDCNANGRLVMRGPESMEVVATRD 189
Query: 167 IEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSV 207
I GEEIT YG+++FG+ NC C C +CE+ ++ + + V
Sbjct: 190 IVVGEEITVSYGDNYFGEDNCECLCHSCELALRNGWDPNGV 230
>gi|258574215|ref|XP_002541289.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237901555|gb|EEP75956.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 664
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 49/124 (39%), Positives = 69/124 (55%), Gaps = 1/124 (0%)
Query: 80 ARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAA 139
A + A + D I +L G + +T EEE L T DFS++ S R+ L+LGPA
Sbjct: 132 AALSARKYIRTGDTIRHLSGTLVAITAEEEMNLDLT-RRDFSIVMSSRRKTPSLFLGPAR 190
Query: 140 YINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMK 199
+ NHDC PN R V + + +RDI EEIT YG+++FG NC C C TCE+ ++
Sbjct: 191 FSNHDCNPNARLVTKGSEGMEIVAIRDIAINEEITVSYGDNYFGIDNCECLCHTCELDLR 250
Query: 200 GAFA 203
+A
Sbjct: 251 NGWA 254
>gi|116208452|ref|XP_001230035.1| hypothetical protein CHGG_03519 [Chaetomium globosum CBS 148.51]
gi|121787538|sp|Q2H8D5.1|SET9_CHAGB RecName: Full=Histone-lysine N-methyltransferase SET9; AltName:
Full=SET domain protein 9
gi|88184116|gb|EAQ91584.1| hypothetical protein CHGG_03519 [Chaetomium globosum CBS 148.51]
Length = 734
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 58/148 (39%), Positives = 78/148 (52%), Gaps = 10/148 (6%)
Query: 54 HRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE 109
H Y YLP E+ R++++ A I A R +N+ I YL G +T EEE
Sbjct: 104 HLRRYISIYLPDCPFEVNATNRFTIDSYE-ASITARRPIRRNEAIKYLAGTQVTVTPEEE 162
Query: 110 KRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVL--RDI 167
+L DFS++ S R L++GPA + NHDC N R V R +A +++ RDI
Sbjct: 163 AQLALRK-KDFSLVVSSRSKLTSLFMGPARFANHDCAANARLVT--RGSAGIEIFACRDI 219
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCE 195
GEEIT Y E +FG+ NC C C TCE
Sbjct: 220 GLGEEITVTYSESYFGENNCDCLCQTCE 247
>gi|330937799|ref|XP_003305628.1| hypothetical protein PTT_18535 [Pyrenophora teres f. teres 0-1]
gi|311317250|gb|EFQ86274.1| hypothetical protein PTT_18535 [Pyrenophora teres f. teres 0-1]
Length = 661
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 64/186 (34%), Positives = 91/186 (48%), Gaps = 15/186 (8%)
Query: 30 AFCVQALLA----QRFNIGRILDMGTQH----HRYSYRIQYLPE----LKPCYRYSLEGN 77
A VQ LL +RF +G + D Q H Y Y+P+ + RY++ +
Sbjct: 67 AEAVQRLLQLPGLRRF-VGGLKDKNDQEQFKRHLRRYINIYMPDCQFDVTTTNRYTITEH 125
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGP 137
A I A + + I YL G +T+E+EK L DFS++ S RK L+LGP
Sbjct: 126 E-ASITARKDINPREEIKYLTGVQVAMTEEQEK-TLELARKDFSLVISSRKKTRSLFLGP 183
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIR 197
A + NHDC N + D + ++ I +G+EIT YGED+FGD N C C TCE
Sbjct: 184 ARFANHDCDANAKLTTKGYDGMQIVAVKPIYEGDEITVSYGEDYFGDNNEECLCSTCESL 243
Query: 198 MKGAFA 203
+ +A
Sbjct: 244 QQNGWA 249
>gi|367049352|ref|XP_003655055.1| hypothetical protein THITE_42348 [Thielavia terrestris NRRL 8126]
gi|347002319|gb|AEO68719.1| hypothetical protein THITE_42348 [Thielavia terrestris NRRL 8126]
Length = 622
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 60/150 (40%), Positives = 77/150 (51%), Gaps = 10/150 (6%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
+ H Y YLP E+ RY++ A I A R ++N+ I YL G +T E
Sbjct: 102 KAHLRRYMSMYLPDCPFEVNATNRYTISTYE-ASITARRPIKRNEVIKYLAGIHVTVTPE 160
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVL--R 165
EE +L G DFS++ S R L +GPA NHDC N R V R A + ++ R
Sbjct: 161 EEAQLAVWG-KDFSLVVSSRSKLTGLLMGPARLANHDCAANARLVT--RGQAGIDIVACR 217
Query: 166 DIEQGEEITCFYGEDFFGDKNCLCECDTCE 195
DI GEEIT Y E +FG+ NC C C TCE
Sbjct: 218 DIAVGEEITVTYSESYFGENNCECLCQTCE 247
>gi|336258710|ref|XP_003344163.1| hypothetical protein SMAC_08815 [Sordaria macrospora k-hell]
gi|380087391|emb|CCC14276.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 796
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/171 (36%), Positives = 86/171 (50%), Gaps = 22/171 (12%)
Query: 54 HRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE 109
H Y YLP E+ RY++ A I A R ++N+ I YL G +T EEE
Sbjct: 105 HLRRYMSIYLPDCPFEVNATNRYTIVTYE-ASITARRFIKRNETIKYLAGIQVVITPEEE 163
Query: 110 KRLLHTGL--NDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVL--R 165
L L DFS++ S R L++GPA + NHDC N R + R A ++++ R
Sbjct: 164 ---LEMSLRKKDFSLIVSSRSKSTSLFMGPARFANHDCNANARLIT--RGQAGIEIIACR 218
Query: 166 DIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKG--------AFAQSSVE 208
+IE GEEIT Y E +FG+ NC C C TCE ++ A Q S+E
Sbjct: 219 NIEVGEEITVTYSESYFGENNCDCLCATCESNLRNGWRPVDGEAVVQKSIE 269
>gi|121698296|ref|XP_001267775.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119395917|gb|EAW06349.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 635
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 58/161 (36%), Positives = 84/161 (52%), Gaps = 8/161 (4%)
Query: 52 QHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKW-EKNDNIAYLVGCIAELTK 106
+ H Y YLP E+ RY++ + A V RK+ ++ I YL G + +T+
Sbjct: 106 RRHLRKYIQMYLPDSPFEITTTNRYTITEHEAA--VCARKFIKQGQEIKYLSGTLVPMTR 163
Query: 107 EEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRD 166
EEE+ L DFS++ S RK +LGPA + NHDC N + V + V RD
Sbjct: 164 EEEQDL-DLKRKDFSIVMSSRKRTPSFFLGPARFANHDCNANGKLVTRGSEGMQVVATRD 222
Query: 167 IEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSV 207
I GEEIT YG+D+FG N C C TCE ++ ++Q++
Sbjct: 223 IYIGEEITVSYGDDYFGIDNRECLCLTCERLVRNGWSQNAA 263
>gi|170097834|ref|XP_001880136.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164644574|gb|EDR08823.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 1016
Score = 95.1 bits (235), Expect = 3e-17, Method: Composition-based stats.
Identities = 59/146 (40%), Positives = 76/146 (52%), Gaps = 8/146 (5%)
Query: 65 ELKPCYRYSLE-GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLN----- 118
E+ RYS G I+ATR I L G +A LT EE+K L TG+
Sbjct: 111 EIAHTSRYSHRTGKSELCILATRFLAPGTVITELKGSMANLTDEEDKELKRTGIKHSDIR 170
Query: 119 -DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
DFSV++S L+LGPA ++NHDC NC R +VLR I GEEIT Y
Sbjct: 171 RDFSVIHSKSMKKNHLFLGPARFVNHDCDNNCELFREGR-YITFRVLRPISIGEEITAHY 229
Query: 178 GEDFFGDKNCLCECDTCEIRMKGAFA 203
G+ +FG +N C C+TCE +G +A
Sbjct: 230 GDGYFGRRNRHCLCETCEKNGRGGYA 255
>gi|393228359|gb|EJD36007.1| SET domain-containing protein, partial [Auricularia delicata
TFB-10046 SS5]
Length = 223
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 52/127 (40%), Positives = 74/127 (58%), Gaps = 8/127 (6%)
Query: 82 IVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHT------GLN-DFSVMYSCRKNCAQLW 134
I+AT+ I L G +A+LT EE++ L G+ DFSV+YS ++N + L+
Sbjct: 98 IIATKPLAVGTVITDLKGSLADLTPEEDEALRRKDTRNSLGIRRDFSVIYSRQRNTSHLF 157
Query: 135 LGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTC 194
LGPA ++NHDC N + R +V++ IEQG+EIT YG +FG KN C C+TC
Sbjct: 158 LGPARFVNHDCEHNVQLFRDGR-YITFRVIKPIEQGQEITAHYGNSYFGRKNKDCLCETC 216
Query: 195 EIRMKGA 201
E + KG
Sbjct: 217 EKKGKGG 223
>gi|169847037|ref|XP_001830231.1| hypothetical protein CC1G_11834 [Coprinopsis cinerea okayama7#130]
gi|116508707|gb|EAU91602.1| hypothetical protein CC1G_11834 [Coprinopsis cinerea okayama7#130]
Length = 982
Score = 94.7 bits (234), Expect = 4e-17, Method: Composition-based stats.
Identities = 58/146 (39%), Positives = 75/146 (51%), Gaps = 8/146 (5%)
Query: 65 ELKPCYRYSLE-GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGL------ 117
E+ RYS G ++ATR I L G +A LT EE+K L G+
Sbjct: 111 EIAHTSRYSSHTGKSELCVLATRPLSPGMVITELKGSMANLTDEEDKELKRAGMVNSDIR 170
Query: 118 NDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
DFSV++S L+LGPA ++NHDC NC R +VLR I GEEIT Y
Sbjct: 171 RDFSVIHSKSMKKNHLFLGPARFVNHDCDNNCELFREGR-YITFRVLRPIAVGEEITAHY 229
Query: 178 GEDFFGDKNCLCECDTCEIRMKGAFA 203
G+ +FG KN C C+TCE +G +A
Sbjct: 230 GDGYFGKKNKHCLCETCEKNGRGGYA 255
>gi|345489616|ref|XP_001603972.2| PREDICTED: hypothetical protein LOC100120319, partial [Nasonia
vitripennis]
Length = 506
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 56/166 (33%), Positives = 84/166 (50%), Gaps = 20/166 (12%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGP 137
R ++VA R +++++ I L+G A+L+ E L + DFS+M+S R L LGP
Sbjct: 16 RQVKVVAKRDFKRSEKIECLIGYCAKLS-SENIAWLESHRKDFSIMHSSRSKHTLLLLGP 74
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIR 197
A+INHDC NC + ++ V I+ +EITCFYG D+F N C C +C+ +
Sbjct: 75 VAFINHDCHANCHYTLNDKGEVFVVADTHIKAEQEITCFYGIDYFRLNNLHCLCSSCDDK 134
Query: 198 MKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLHKKLNRLLLASD 243
GAF Q + D+ L+K+ N+LLL D
Sbjct: 135 HTGAFHQEQ-------------ENLDIDLSKK------NQLLLTKD 161
>gi|189206618|ref|XP_001939643.1| histone-lysine N-methyltransferase set9 [Pyrenophora
tritici-repentis Pt-1C-BFP]
gi|187975736|gb|EDU42362.1| histone-lysine N-methyltransferase set9 [Pyrenophora
tritici-repentis Pt-1C-BFP]
Length = 658
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 63/185 (34%), Positives = 89/185 (48%), Gaps = 13/185 (7%)
Query: 30 AFCVQALL---AQRFNIGRILDMGTQH----HRYSYRIQYLPE----LKPCYRYSLEGNR 78
A VQ LL A R + + D Q H Y Y+P+ + RY++ +
Sbjct: 67 AEAVQKLLQLTALRRFVSGLKDKNDQEQFKRHLRRYINIYMPDCQFDVTTTNRYTITEHE 126
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
A I A + + I YL G +T+E+EK L DFS++ S RK L+LGPA
Sbjct: 127 -ASITARKDINPREEIKYLTGVQVAMTEEQEKSL-ELARKDFSLVISSRKKTRSLFLGPA 184
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
+ NHDC N + D + ++ I +G+EIT YGED+FGD N C C TCE
Sbjct: 185 RFANHDCDANAKLTTKGYDGMQIVAVKPIYEGDEITVSYGEDYFGDNNEECLCSTCESLQ 244
Query: 199 KGAFA 203
+ +A
Sbjct: 245 QNGWA 249
>gi|402223890|gb|EJU03953.1| hypothetical protein DACRYDRAFT_20676, partial [Dacryopinax sp.
DJM-731 SS1]
Length = 723
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 61/162 (37%), Positives = 86/162 (53%), Gaps = 14/162 (8%)
Query: 54 HRYSYRIQYLP----ELKPCYRYS--LEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
H Y YLP E P RY+ E N A + AT +E I+ L G + LT E
Sbjct: 113 HANRYLQVYLPTSHFEFVPTERYTWHTEKNELA-VKATANFEVGKIISGLCGVLVALTNE 171
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACV--KVLR 165
E + L G DFS++ + + LGPA ++NHDC PN + +RD + +V+R
Sbjct: 172 ENEEL--RGGQDFSIVVNVHTKRFFILLGPARFVNHDCNPNVKM---KRDGKSLTFEVIR 226
Query: 166 DIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSV 207
I+ GEEI YGE++FG++NC C C+TCE G FA+ +
Sbjct: 227 PIKGGEEILSSYGENYFGEENCECLCETCEQNGTGLFAKEAA 268
>gi|449303136|gb|EMC99144.1| hypothetical protein BAUCODRAFT_393590 [Baudoinia compniacensis
UAMH 10762]
Length = 943
Score = 93.6 bits (231), Expect = 8e-17, Method: Composition-based stats.
Identities = 56/148 (37%), Positives = 77/148 (52%), Gaps = 7/148 (4%)
Query: 52 QHHRYSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKE 107
+ H Y YLP+ + RY++ A I A + K + I YL G E+T++
Sbjct: 106 ERHVRKYINIYLPDCPFDVGTTNRYTVMTAEAA-IYARKAIRKGEPIKYLSGIQVEMTEQ 164
Query: 108 EEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDI 167
EEK L DFS++ S R+ L+LGPA + NHDC N R + + +DI
Sbjct: 165 EEKELCSR--TDFSIVLSSRRKRPSLFLGPARFANHDCDSNARLNTSGPHGIHIVACKDI 222
Query: 168 EQGEEITCFYGEDFFGDKNCLCECDTCE 195
G+EIT YGED+FG +NC C C TCE
Sbjct: 223 AVGDEITVTYGEDYFGLENCECLCATCE 250
>gi|164661351|ref|XP_001731798.1| hypothetical protein MGL_1066 [Malassezia globosa CBS 7966]
gi|159105699|gb|EDP44584.1| hypothetical protein MGL_1066 [Malassezia globosa CBS 7966]
Length = 765
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 59/173 (34%), Positives = 91/173 (52%), Gaps = 23/173 (13%)
Query: 54 HRYS----YRIQYLPELKPCYRYS---LEGNRGARIVATRKWEKNDNIAYLVGCIAELTK 106
HRY R+Q+L + R+S + G ++A R ++ +D I Y + +LT
Sbjct: 16 HRYRTARLQRLQFLNTMSDEARFSAHLIPGRSDLCVMAVRSFQPDDIITYCTAALKDLTH 75
Query: 107 EEEKRL-----------LHTG----LNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRF 151
+E++ L L G DFS++ S R+ C+QL LGPA +INHDCRPN F
Sbjct: 76 QEDQALREEAAEAREADLRDGHLRPQRDFSIIRSSRRKCSQLLLGPARFINHDCRPNAEF 135
Query: 152 VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQ 204
+ ++ +R I+ EEIT +YG+++F +N C C TCE +G +AQ
Sbjct: 136 RRSGH-ILTIRCIRPIQCNEEITTYYGDNYFEWQNKECMCATCERLGQGFYAQ 187
>gi|336371682|gb|EGO00022.1| hypothetical protein SERLA73DRAFT_25237 [Serpula lacrymans var.
lacrymans S7.3]
Length = 221
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 54/126 (42%), Positives = 69/126 (54%), Gaps = 7/126 (5%)
Query: 82 IVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLN------DFSVMYSCRKNCAQLWL 135
I+ATR I L G +A LTKEE+K L T L DFSV++S + L+L
Sbjct: 97 ILATRPLAPGAVITELKGSMAHLTKEEDKELKRTDLRNTDIRRDFSVIHSRQMKKNHLFL 156
Query: 136 GPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCE 195
GPA ++NHDC NC E +V+R I GEEIT YG+ +FG +N C C+TCE
Sbjct: 157 GPARFVNHDCENNCELF-REGKYITFRVMRPIGIGEEITAHYGDGYFGYRNRHCLCETCE 215
Query: 196 IRMKGA 201
KG
Sbjct: 216 KNGKGG 221
>gi|453085120|gb|EMF13163.1| hypothetical protein SEPMUDRAFT_148543 [Mycosphaerella populorum
SO2202]
Length = 848
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 48/119 (40%), Positives = 65/119 (54%), Gaps = 2/119 (1%)
Query: 80 ARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAA 139
A +VA + K D I YL G E+T++EE+ L + DFS++ + RK L+LGPA
Sbjct: 138 ASVVARKNIRKGDQIRYLRGIQVEMTEQEEREL--SSGTDFSIVITSRKKRPSLFLGPAR 195
Query: 140 YINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRM 198
+ NHDC N + + R+I GEEIT YG D+FG NC C C TCE +
Sbjct: 196 FANHDCNSNAKLRTMGAHGIEIIAKRNIYTGEEITVTYGTDYFGVGNCECLCHTCETKQ 254
>gi|393217672|gb|EJD03161.1| SET domain-containing protein, partial [Fomitiporia mediterranea
MF3/22]
Length = 221
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/159 (37%), Positives = 81/159 (50%), Gaps = 12/159 (7%)
Query: 54 HRYSYRIQYLP----ELKPCYRYS-LEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEE 108
H Y YLP E+ RYS + G I+ATR I+ L G +AELT +E
Sbjct: 64 HASRYLELYLPNGSIEIAQTTRYSRITGKNELCILATRPMPAGFLISELKGSLAELTDDE 123
Query: 109 EKRL------LHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVK 162
+K L + G DFSV++S + L+LGPA ++NHDC NC R +
Sbjct: 124 DKELKNPSGGIDGGRRDFSVIHSKQLKKNHLFLGPARFVNHDCDNNCELFRDGR-YITFR 182
Query: 163 VLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGA 201
++ I G+E+T YGE +FG N C C+TCE R KG
Sbjct: 183 TIKPIALGQELTAHYGEGYFGKNNRHCLCETCERRGKGG 221
>gi|302680853|ref|XP_003030108.1| hypothetical protein SCHCODRAFT_235958 [Schizophyllum commune H4-8]
gi|300103799|gb|EFI95205.1| hypothetical protein SCHCODRAFT_235958 [Schizophyllum commune H4-8]
Length = 1114
Score = 92.0 bits (227), Expect = 3e-16, Method: Composition-based stats.
Identities = 53/144 (36%), Positives = 79/144 (54%), Gaps = 7/144 (4%)
Query: 65 ELKPCYRYSL-EGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLN----- 118
E+ RYSL G I+ATR I+ L G +A LT++E++ L T +N
Sbjct: 145 EIAHTSRYSLYTGKSELCILATRDLAPGAVISELKGSMANLTEDEDRALKETQINSAIRR 204
Query: 119 DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYG 178
DFSV++S + L+LGPA ++NHDC NC E +V++ I+ G+E+T YG
Sbjct: 205 DFSVIHSRQMKKNHLFLGPARFVNHDCDHNCELF-REGKYITFRVIKPIKLGQEVTAHYG 263
Query: 179 EDFFGDKNCLCECDTCEIRMKGAF 202
+ +FG N C C+TCE +G +
Sbjct: 264 DSYFGKGNKHCLCETCEKNGRGGY 287
>gi|134109489|ref|XP_776859.1| hypothetical protein CNBC3500 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50259539|gb|EAL22212.1| hypothetical protein CNBC3500 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 1117
Score = 92.0 bits (227), Expect = 3e-16, Method: Composition-based stats.
Identities = 46/120 (38%), Positives = 72/120 (60%), Gaps = 5/120 (4%)
Query: 103 ELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVK 162
E ++ ++R T DFS+++S K C QL+LGPA ++NHDC PN + + + +
Sbjct: 224 ESRRQGQRRSDRTKRRDFSIVWSGLKRCYQLFLGPARFLNHDCNPNVELL-RQGNYVTFR 282
Query: 163 VLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQTT-QPSRYCLRD 221
V+R + GEE+T FYGE++FG N C C TCE G F + +P+++ + SR+ RD
Sbjct: 283 VIRPVRIGEELTTFYGENYFGKGNVECLCLTCEKNHHGGF---TPKPESSRRTSRHLSRD 339
>gi|409080688|gb|EKM81048.1| hypothetical protein AGABI1DRAFT_119579 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1017
Score = 91.7 bits (226), Expect = 3e-16, Method: Composition-based stats.
Identities = 57/146 (39%), Positives = 76/146 (52%), Gaps = 8/146 (5%)
Query: 65 ELKPCYRYSLE-GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLN----- 118
E+ RYS + G I+ATR I L G +A LT EE++ L +
Sbjct: 111 EIAHTSRYSHKTGKSELCILATRYMAPGTVITELKGSMANLTDEEDRDLKRASMRNGDIR 170
Query: 119 -DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
DFSV++S L+LGPA ++NHDC NC R +VLR I GEEIT Y
Sbjct: 171 RDFSVIHSKSMKKNHLFLGPARFVNHDCNFNCELFREGR-YITFRVLRPINIGEEITAHY 229
Query: 178 GEDFFGDKNCLCECDTCEIRMKGAFA 203
G+ +FG KN C C+TCE + +G +A
Sbjct: 230 GDGYFGRKNRHCLCETCEKKGRGGYA 255
>gi|426197602|gb|EKV47529.1| hypothetical protein AGABI2DRAFT_185465 [Agaricus bisporus var.
bisporus H97]
Length = 1017
Score = 91.7 bits (226), Expect = 3e-16, Method: Composition-based stats.
Identities = 57/146 (39%), Positives = 76/146 (52%), Gaps = 8/146 (5%)
Query: 65 ELKPCYRYSLE-GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLN----- 118
E+ RYS + G I+ATR I L G +A LT EE++ L +
Sbjct: 111 EIAHTSRYSHKTGKSELCILATRYMAPGTVITELKGSMANLTDEEDRDLKRASMRNGDIR 170
Query: 119 -DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
DFSV++S L+LGPA ++NHDC NC R +VLR I GEEIT Y
Sbjct: 171 RDFSVIHSKSMKKNHLFLGPARFVNHDCNFNCELFREGR-YITFRVLRPINIGEEITAHY 229
Query: 178 GEDFFGDKNCLCECDTCEIRMKGAFA 203
G+ +FG KN C C+TCE + +G +A
Sbjct: 230 GDGYFGRKNRHCLCETCEKKGRGGYA 255
>gi|19075578|ref|NP_588078.1| histone lysine methyltransferase Set9 [Schizosaccharomyces pombe
972h-]
gi|74638869|sp|Q9USK2.1|SET9_SCHPO RecName: Full=Histone-lysine N-methyltransferase set9; AltName:
Full=Lysine N-methyltransferase 5; AltName: Full=SET
domain protein 9
gi|6434020|emb|CAB60686.1| histone lysine methyltransferase Set9 [Schizosaccharomyces pombe]
Length = 441
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 56/150 (37%), Positives = 85/150 (56%), Gaps = 10/150 (6%)
Query: 51 TQHHRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTK 106
T+H +Y Y YLP E+ +Y A ++A ++I L G I +L+
Sbjct: 94 TRHLKY-YLSLYLPSCKFEICSTNQYFSSSKPEACVIARESINAGEDITDLCGTIIKLSP 152
Query: 107 EEEKRLLHTGLN-DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLR 165
+EE+ + G+ DFS+++S R + L+LGPA ++NHDC NCRF T ++ +R
Sbjct: 153 KEERNI---GIGKDFSILHSSRLDSMCLFLGPARFVNHDCNANCRF-NTSGKRIWLRCVR 208
Query: 166 DIEQGEEITCFYGEDFFGDKNCLCECDTCE 195
DI+ GEEIT FY ++FG +NC C C +CE
Sbjct: 209 DIKPGEEITTFYSSNYFGLENCECLCVSCE 238
>gi|405119170|gb|AFR93943.1| hypothetical protein CNAG_02749 [Cryptococcus neoformans var.
grubii H99]
Length = 1115
Score = 91.7 bits (226), Expect = 4e-16, Method: Composition-based stats.
Identities = 49/122 (40%), Positives = 68/122 (55%), Gaps = 8/122 (6%)
Query: 105 TKEE-----EKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTA 159
TKEE ++R T DFS+++S K C QL+LGPA ++NHDC PN + + +
Sbjct: 216 TKEESRRQGQRRSDRTKRRDFSIVWSGLKRCYQLFLGPARFLNHDCNPNVELL-RQGNYV 274
Query: 160 CVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCL 219
+V+R + GEE+T FYGE++FG N C C TCE G F + T SR+
Sbjct: 275 TFRVIRPVRIGEELTTFYGENYFGKGNVECLCLTCEKNHHGGFTPKPENSRRT--SRHLS 332
Query: 220 RD 221
RD
Sbjct: 333 RD 334
>gi|409045100|gb|EKM54581.1| hypothetical protein PHACADRAFT_258533 [Phanerochaete carnosa
HHB-10118-sp]
Length = 1280
Score = 89.4 bits (220), Expect = 2e-15, Method: Composition-based stats.
Identities = 61/169 (36%), Positives = 82/169 (48%), Gaps = 15/169 (8%)
Query: 54 HRYSYRIQYLP----ELKPCYRYSLE-GNRGARIVATRKWEKNDNIAYLVGCIAELTKEE 108
H Y YLP E+ RYS G I+ATR I+ L G +A+LT+EE
Sbjct: 66 HASRYFELYLPNGSIEISHTSRYSHRTGKSELCILATRPLAPGTVISELKGSMADLTEEE 125
Query: 109 EKRLLHTG---------LNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTA 159
+K L TG DFSV++S + L+LGPA ++NHDC N R
Sbjct: 126 DKELKRTGERHAEGVGIRRDFSVIHSRQLKKNHLFLGPARFVNHDCDHNVELFREGR-YI 184
Query: 160 CVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVE 208
+V++ I GEE++ YG +FG KN C C TCE +G +A E
Sbjct: 185 TFRVIKPIGVGEEVSAHYGPGYFGRKNRHCLCATCEKNGRGGYAPQGSE 233
>gi|238600554|ref|XP_002395171.1| hypothetical protein MPER_04819 [Moniliophthora perniciosa FA553]
gi|215465455|gb|EEB96101.1| hypothetical protein MPER_04819 [Moniliophthora perniciosa FA553]
Length = 221
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 67/202 (33%), Positives = 92/202 (45%), Gaps = 17/202 (8%)
Query: 9 ILHVPIQLACHLLNKLNPSFWAFCVQALLAQRFNIGRILDMGTQHHRYSYRIQYLP--EL 66
++HV I+ A L L P + Q + +I T RY Q E+
Sbjct: 28 LVHVVIRQAVDELLSLMPIRYYL-------QNYTQKQINAFATHASRYFELYQPFGSIEI 80
Query: 67 KPCYRYSLEGNRGAR-IVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLN------D 119
RYS + I+AT+ I L G +A LT EE++ L T L D
Sbjct: 81 SHTSRYSHRTGKSELCILATKSLAPGTVITELKGSMANLTDEEDRELKRTDLRTSDIRRD 140
Query: 120 FSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGE 179
FSV++S L+LGPA ++NHDC NC R +V++ I GEEIT YG+
Sbjct: 141 FSVIHSKSMKKNHLFLGPARFVNHDCDNNCELFREGR-YITFRVIKPITIGEEITAHYGD 199
Query: 180 DFFGDKNCLCECDTCEIRMKGA 201
+FG KN C C+TCE +G
Sbjct: 200 GYFGKKNRQCLCETCEKGGRGG 221
>gi|384485384|gb|EIE77564.1| hypothetical protein RO3G_02268 [Rhizopus delemar RA 99-880]
Length = 355
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 57/185 (30%), Positives = 85/185 (45%), Gaps = 37/185 (20%)
Query: 54 HRYSYRIQYLP----ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE 109
H Y Y+P E+ RYS E A ++AT+ W++ D I+ L G I T
Sbjct: 32 HMKRYLNMYMPNAGYEISDTKRYS-EKKVEACLIATKDWQRGDEISLLSGIILSPT---- 86
Query: 110 KRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQ 169
L+LGPA + NHDC NCRF+ ++T + ++I+
Sbjct: 87 -----------------------LFLGPARFANHDCDSNCRFITQGQNTIVFQARKEIKC 123
Query: 170 GEEITCFYGEDFFGDKNCLCECDTCEIRMKGAF-----AQSSVEPQTTQPSRYCLRDTDM 224
GEE+T FYG +FG+ NC C+C +CE G F Q++ Q + S+ + + D
Sbjct: 124 GEELTVFYGHHYFGENNCKCKCVSCEKNGLGYFTVAHEVQTNPNIQQRRRSKRTMIEEDE 183
Query: 225 RLNKR 229
KR
Sbjct: 184 GYRKR 188
>gi|392569755|gb|EIW62928.1| hypothetical protein TRAVEDRAFT_17441 [Trametes versicolor
FP-101664 SS1]
Length = 1061
Score = 87.4 bits (215), Expect = 6e-15, Method: Composition-based stats.
Identities = 58/164 (35%), Positives = 82/164 (50%), Gaps = 15/164 (9%)
Query: 54 HRYSYRIQYLP----ELKPCYRYSLE-GNRGARIVATRKWEKNDNIAYLVGCIAELTKEE 108
H Y YLP E+ RYS G I+ATR I+ L G +A+LT EE
Sbjct: 118 HASRYFELYLPSGSIEIAHTSRYSHHTGKSELCILATRPLAPGTVISELKGSMADLTDEE 177
Query: 109 EKRLLHTGLN---------DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTA 159
+K L T + DFSV++S + L+LGPA ++NHDC N R
Sbjct: 178 DKELKGTDVGQADGAGIRRDFSVIHSKQLKKNHLFLGPARFVNHDCDHNVELFREGR-YI 236
Query: 160 CVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFA 203
+V++ I GEE+T YG+ +FG N C C++CE + +G +A
Sbjct: 237 TFRVIKHIGVGEEVTAHYGDGYFGRNNRHCLCESCEKKGRGGYA 280
>gi|406602645|emb|CCH45789.1| Histone-lysine N-methyltransferase [Wickerhamomyces ciferrii]
Length = 870
Score = 86.7 bits (213), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 83/165 (50%), Gaps = 17/165 (10%)
Query: 46 ILDMGTQHHRYSYRIQYLPE----LKPCYRYSLE-GNRGARIVATRKWEKNDNIAYLVGC 100
I++ Q RY YLPE + RY + G+ IVA +++ D I +L G
Sbjct: 89 IMEFKRQARRYFE--MYLPECGFEMMTSDRYEAKSGSTEGAIVAKKEFLPGDEIKFLTGT 146
Query: 101 IAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATE---RD 157
+EL EEE L +DFSV++ + L LGP+ +INHDC N +F +T+ R
Sbjct: 147 FSELKPEEEDDL---DKSDFSVIHMTSRANPCLMLGPSRFINHDCDSNAKFSSTKSGMRI 203
Query: 158 TACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAF 202
A K+L GEE+T Y +FG N C C TCE + G+F
Sbjct: 204 FATKKIL----VGEEVTVSYANSYFGKDNKDCLCATCERTLNGSF 244
>gi|297705973|ref|XP_002829830.1| PREDICTED: histone-lysine N-methyltransferase SUV420H2-like [Pongo
abelii]
Length = 303
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 43/81 (53%), Positives = 55/81 (67%), Gaps = 2/81 (2%)
Query: 150 RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEP 209
+FV + + ACVKVLRDIE G+E+TCFYGE FFG+KN CEC TCE + +GAF EP
Sbjct: 31 QFVPADGNAACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECHTCERKGEGAFRTRPREP 90
Query: 210 Q-TTQP-SRYCLRDTDMRLNK 228
+P +Y LR+T RL +
Sbjct: 91 ALPPRPLDKYQLRETKRRLQQ 111
>gi|328708992|ref|XP_003243843.1| PREDICTED: hypothetical protein LOC100574039 [Acyrthosiphon pisum]
Length = 386
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/113 (41%), Positives = 62/113 (54%), Gaps = 4/113 (3%)
Query: 82 IVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYI 141
+VAT + +K+ I LVG + EK ++ G NDFS++YS R L+LGPAA+
Sbjct: 101 VVATVQIKKDTVIQGLVGATYLI----EKEFIYPGENDFSLIYSGRLKKDMLFLGPAAFC 156
Query: 142 NHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTC 194
NH CR N + V R I GEEIT FYG ++F + N LC C TC
Sbjct: 157 NHACRSNAVLKCINKKRTGVVTTRQINVGEEITVFYGRNYFENNNTLCNCKTC 209
>gi|268579165|ref|XP_002644565.1| Hypothetical protein CBG14502 [Caenorhabditis briggsae]
Length = 439
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 54/178 (30%), Positives = 87/178 (48%), Gaps = 25/178 (14%)
Query: 68 PCYRYSLEGNRGARIVATRKWEKND-----NIAYLVGCIAELTKEEEKRLLHTGLNDFSV 122
PC+++S E + G +++AT+KWEK++ N L I LN++SV
Sbjct: 128 PCHQFSTEQHVGLKLIATKKWEKDEILNSHNFNVLFDTIP------------LELNEYSV 175
Query: 123 MYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFF 182
+WLGP++ +NH C PN + E+ T +KV+R IE GEEIT Y + +F
Sbjct: 176 SKPSVGQKEHIWLGPSSIVNHSCEPNTVYEHYEKKT-LLKVIRPIEIGEEITVNYSKKYF 234
Query: 183 GDKNCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDT---DMRLNKRKLHKKLNR 237
CECDTC+ + + + + T +R T + RL++ K+L +
Sbjct: 235 FK----CECDTCKKKPEESTVKRKRGDSTITSKNLIMRKTRASNTRLSQEDSDKELAK 288
>gi|395332802|gb|EJF65180.1| SET domain-containing protein, partial [Dichomitus squalens
LYAD-421 SS1]
Length = 224
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 57/162 (35%), Positives = 79/162 (48%), Gaps = 15/162 (9%)
Query: 54 HRYSYRIQYLP----ELKPCYRYSLEGNRGAR-IVATRKWEKNDNIAYLVGCIAELTKEE 108
H Y YLP E+ RYS + I+ATR I+ L G +A+LT+EE
Sbjct: 64 HASRYFELYLPSGSIEIAHTSRYSHRTGKSELCILATRPLVPGTVISELKGSMADLTEEE 123
Query: 109 EKRLLHTGLN---------DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTA 159
+K L T + DFSV++S + L+LGPA ++NHDC N R
Sbjct: 124 DKELKGTDIGRPESVGIRRDFSVIHSKQLKKNHLFLGPARFVNHDCDHNVELFREGR-YI 182
Query: 160 CVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGA 201
+V++ I GEE+T YG+ +FG N C C TCE +G
Sbjct: 183 TFRVIKHIGVGEEVTAHYGDGYFGRNNRHCLCQTCEFNGRGG 224
>gi|389737966|gb|EIM79172.1| SET domain-containing protein, partial [Stereum hirsutum FP-91666
SS1]
Length = 224
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 56/162 (34%), Positives = 80/162 (49%), Gaps = 15/162 (9%)
Query: 54 HRYSYRIQYLP----ELKPCYRYSLEGNRGAR-IVATRKWEKNDNIAYLVGCIAELTKEE 108
H Y YLP E+ RY+ + I+ATR + I L G +A+LT E
Sbjct: 64 HASRYFELYLPSGSIEIAHTGRYTFRTGKSELCILATRPLQPGAVITELKGSMADLTDAE 123
Query: 109 EKRLLHT--------GLN-DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTA 159
++ L T G+ DFSV++S + L+LGPA ++NHDC NC R
Sbjct: 124 DQELKRTDDLRHSYGGIRRDFSVIHSKQLKKNHLFLGPARFVNHDCDNNCELFREGR-YI 182
Query: 160 CVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGA 201
+V++ I GEE+T YG+ +FG NC C C +CE + G
Sbjct: 183 TFRVIKPIAVGEEVTAHYGDGYFGRGNCHCLCASCERKGIGG 224
>gi|242213749|ref|XP_002472701.1| predicted protein [Postia placenta Mad-698-R]
gi|220728209|gb|EED82108.1| predicted protein [Postia placenta Mad-698-R]
Length = 1337
Score = 84.3 bits (207), Expect = 6e-14, Method: Composition-based stats.
Identities = 59/173 (34%), Positives = 87/173 (50%), Gaps = 15/173 (8%)
Query: 54 HRYSYRIQYLP----ELKPCYRYSLE-GNRGARIVATRKWEKNDNIAYLVGCIAELTKEE 108
H Y YLP E+ RYS + G I+ATR I L G +A+LT EE
Sbjct: 73 HASRYFELYLPTGSIEIAHTSRYSHKTGKSELCILATRPLNTGTVITELKGSMADLTDEE 132
Query: 109 E---KRLLHTGLN------DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTA 159
+ KR+ T ++ DFSV++S + L+LGPA ++NHDC N R
Sbjct: 133 DMELKRIDATCVDGVQIRRDFSVIHSKQLKKNHLFLGPARFVNHDCDHNVELFREGR-YI 191
Query: 160 CVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQTT 212
+V++ I GEE+T YG+ +FG N C C+TCE +G +A + + + +
Sbjct: 192 TFRVIKPIGVGEEVTAHYGDGYFGRDNRHCLCETCEKLGRGGYATDAEDDEAS 244
>gi|403416629|emb|CCM03329.1| predicted protein [Fibroporia radiculosa]
Length = 1238
Score = 82.8 bits (203), Expect = 2e-13, Method: Composition-based stats.
Identities = 58/173 (33%), Positives = 90/173 (52%), Gaps = 15/173 (8%)
Query: 54 HRYSYRIQYLP----ELKPCYRYSLE-GNRGARIVATRKWEKNDNIAYLVGCIAELTKEE 108
H Y YLP E+ RY+ + G I+ATR I L G +A+LT+EE
Sbjct: 117 HASRYFELYLPTGSIEIAHTSRYTHKTGKSELCILATRPLMPGMVITELKGSMADLTEEE 176
Query: 109 E---KRLLHTGLN------DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTA 159
+ KR+ + ++ DFSV++S + L+LGPA ++NHDC N R
Sbjct: 177 DMELKRIDGSCVDGAQIRRDFSVIHSKQLKKNHLFLGPARFVNHDCDHNVELFREGR-YI 235
Query: 160 CVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQTT 212
+V++ I G+E+T YG+ +FG KN C C++CE KG +A ++ E + +
Sbjct: 236 TFRVIKPIAVGDEVTAHYGDGYFGRKNRHCLCESCERLGKGGYAPNASEDELS 288
>gi|390601862|gb|EIN11255.1| SET domain-containing protein, partial [Punctularia strigosozonata
HHB-11173 SS5]
Length = 223
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 48/128 (37%), Positives = 69/128 (53%), Gaps = 9/128 (7%)
Query: 82 IVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHT-------GLN-DFSVMYSCRKNCAQL 133
I+ATR I L G +A+LT+EE++ L T G+ DFSV++S + L
Sbjct: 97 ILATRPMAVGSVINELKGSMADLTEEEDRELKRTDRRNTDGGIRRDFSVIHSRQMKKNHL 156
Query: 134 WLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDT 193
+LGPA ++NHDC N R +V + I G+EIT YG+ +FG +N C C+T
Sbjct: 157 FLGPARFVNHDCDHNAELFRDGR-YITFRVRKPINVGDEITAHYGDSYFGRRNRYCLCET 215
Query: 194 CEIRMKGA 201
CE +G
Sbjct: 216 CEKNGRGG 223
>gi|328857780|gb|EGG06895.1| hypothetical protein MELLADRAFT_63110 [Melampsora larici-populina
98AG31]
Length = 858
Score = 81.3 bits (199), Expect = 4e-13, Method: Composition-based stats.
Identities = 50/127 (39%), Positives = 72/127 (56%), Gaps = 6/127 (4%)
Query: 82 IVATRKWEKNDNIAYLVGCIAELTKEEEKRL--LH--TGLNDFSVMYSCRKNCAQLWLGP 137
+ A+R + ND + G A+LT+EE+ RL LH G DFS + + R Q++ GP
Sbjct: 125 VFASRDYLVNDIVVGCEGSFADLTEEEDLRLRGLHPLDGHPDFSHLVNSRGK-FQVFCGP 183
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIR 197
A ++NHDCR N + E T KV++ I EEI YG ++FG+ NC C C TCE
Sbjct: 184 ARFVNHDCRNNVVLL-REGFTIKFKVVQPIRADEEIVTSYGANYFGEGNCECMCATCEQT 242
Query: 198 MKGAFAQ 204
+G +A+
Sbjct: 243 GRGFYAE 249
>gi|213403254|ref|XP_002172399.1| histone-lysine N-methyltransferase set9 [Schizosaccharomyces
japonicus yFS275]
gi|212000446|gb|EEB06106.1| histone-lysine N-methyltransferase set9 [Schizosaccharomyces
japonicus yFS275]
Length = 472
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 68/125 (54%), Gaps = 3/125 (2%)
Query: 80 ARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAA 139
A ++ + + + L G I +LT EEKRL DFSV++S R L+LGPA
Sbjct: 92 ASAISIKDLDAGYELHDLCGTIIKLTPAEEKRLGDD--KDFSVLHSSRLRSMCLFLGPAR 149
Query: 140 YINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMK 199
++NHDC NC F + R ++ +R + GEE+T +Y +FG N C C++CE +
Sbjct: 150 FVNHDCDANCMFRISGR-KVWLQTVRPVRAGEELTTYYSSRYFGPNNRDCLCESCERKGI 208
Query: 200 GAFAQ 204
G + +
Sbjct: 209 GGYVK 213
>gi|328700507|ref|XP_003241283.1| PREDICTED: hypothetical protein LOC100159175 [Acyrthosiphon pisum]
Length = 730
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 49/138 (35%), Positives = 70/138 (50%), Gaps = 8/138 (5%)
Query: 80 ARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAA 139
+IV T+ I L G A + EE + G+NDFS++ S R L+LGPAA
Sbjct: 573 VKIVTTQTISCGKVIYTLCGMTAPIKPEE----IKIGVNDFSILTSSRNKKDLLFLGPAA 628
Query: 140 YINHDCRPNCRFVATERDTA-CVKVLRDIEQGEEITCFYGEDFFGD--KNCLCECDTCEI 196
YINH C N +VA + C K ++ I G EIT YG+ +FG+ K+C C C E+
Sbjct: 629 YINHSCSNNTEWVAGRGEGVWCAKAIQHIRIGTEITTGYGDHYFGENQKDCECGCHPEEV 688
Query: 197 RMKGAFAQSSV-EPQTTQ 213
+ ++ PQ+ Q
Sbjct: 689 PVNDVPTSPNLNNPQSQQ 706
>gi|308512279|ref|XP_003118322.1| hypothetical protein CRE_00800 [Caenorhabditis remanei]
gi|308238968|gb|EFO82920.1| hypothetical protein CRE_00800 [Caenorhabditis remanei]
Length = 614
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 64/129 (49%), Gaps = 10/129 (7%)
Query: 68 PCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCR 127
PC+ + +E N G +++ATR W +A C L N +SV + +
Sbjct: 182 PCHDFDMEFNLGLKLIATRMWMPGQLMATYNLC-------SYFERLEPYFNPYSVSNAHK 234
Query: 128 KNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
K ++WLGP A +NH C PN + E T ++VL I GEEIT YG FF +
Sbjct: 235 KKSNKVWLGPCAIVNHSCNPNATYHTVENQT-IIQVLSPISIGEEITVNYGPTFF--PSG 291
Query: 188 LCECDTCEI 196
C+C TC++
Sbjct: 292 ACKCATCKL 300
>gi|331247328|ref|XP_003336293.1| hypothetical protein PGTG_18068 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309315283|gb|EFP91874.1| hypothetical protein PGTG_18068 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 923
Score = 77.0 bits (188), Expect = 8e-12, Method: Composition-based stats.
Identities = 50/146 (34%), Positives = 74/146 (50%), Gaps = 18/146 (12%)
Query: 82 IVATRKWEKNDNIAYLVGCIAELTKEEEKRLLH-----------TGL-----NDFSVMYS 125
+ A R + ND +A G A+LT+EE+ +L TG DFS + +
Sbjct: 234 VFAARDYAPNDIVAGCEGSYADLTEEEDLKLRSNAHLPAQADRSTGAAPRESTDFSHIVN 293
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
R Q++ GPA ++NHDC N + E T KV + I+ G+EI YG ++FG+K
Sbjct: 294 SRGK-FQVFCGPARFVNHDCDNNAVLL-REGFTIKFKVTKPIKAGQEILTSYGMNYFGEK 351
Query: 186 NCLCECDTCEIRMKGAFAQSSVEPQT 211
NC C C TCE KG ++Q + +
Sbjct: 352 NCECMCATCERNGKGFYSQHEADSDS 377
>gi|154274169|ref|XP_001537936.1| predicted protein [Ajellomyces capsulatus NAm1]
gi|150415544|gb|EDN10897.1| predicted protein [Ajellomyces capsulatus NAm1]
Length = 246
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 39/93 (41%), Positives = 53/93 (56%), Gaps = 1/93 (1%)
Query: 90 KNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNC 149
+ I L G + +T+EEE L T DFS++ S RK ++LGPA + NHDC N
Sbjct: 144 RGTTIRGLSGTLVPMTREEEIDLDLTR-KDFSIVMSSRKKTPSIFLGPARFANHDCNANA 202
Query: 150 RFVATERDTACVKVLRDIEQGEEITCFYGEDFF 182
R V ++ V +RDI GEEIT YGE++F
Sbjct: 203 RLVMRGPESMEVVAIRDIVVGEEITVSYGENYF 235
>gi|11527207|gb|AAG36937.1| CGI-85 [Homo sapiens]
Length = 265
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 42/81 (51%), Positives = 49/81 (60%), Gaps = 10/81 (12%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+ PC RYS E N GA IVAT++W++ND I LVGCIA+ K L T D
Sbjct: 187 EILPCXRYSSEQN-GAXIVATKEWKRNDXIXLLVGCIADFQK-----LXXTCYLDMEKTL 240
Query: 125 SC----RKNCAQLWLGPAAYI 141
C RKNCA LWLGPAA+I
Sbjct: 241 XCHVSTRKNCAXLWLGPAAFI 261
>gi|358060323|dbj|GAA93728.1| hypothetical protein E5Q_00374 [Mixia osmundae IAM 14324]
Length = 410
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 58/129 (44%), Gaps = 8/129 (6%)
Query: 74 LEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQL 133
++G G ++A R + + + I L G + LH NDFS Y +
Sbjct: 182 IDGKLGLTVLARRDFAEGEIIPGLDGPVFPHDPN-----LH---NDFSTFYDTSRQREMG 233
Query: 134 WLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDT 193
GPA +NHDC N RFV E + R I GEEI YG FG N C C T
Sbjct: 234 LCGPARLVNHDCNNNVRFVRLEGGRIGFRATRPIANGEEILTTYGPSSFGANNVGCCCQT 293
Query: 194 CEIRMKGAF 202
CE+ KGA+
Sbjct: 294 CELANKGAY 302
>gi|358056154|dbj|GAA97894.1| hypothetical protein E5Q_04574 [Mixia osmundae IAM 14324]
Length = 902
Score = 68.6 bits (166), Expect = 3e-09, Method: Composition-based stats.
Identities = 31/94 (32%), Positives = 49/94 (52%)
Query: 117 LNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCF 176
+ +FS+++ + L+ GP+ ++NHDC N ++ R + GEEIT
Sbjct: 535 VKNFSILFDPKTKKNSLFTGPSRFVNHDCNSNALLCRLPNGVVGYRLRRAVAYGEEITGS 594
Query: 177 YGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQ 210
YG ++FG N C C CEI KGA+ + S E +
Sbjct: 595 YGHEYFGPNNEDCLCQDCEIGEKGAYRKLSAEAK 628
>gi|341874374|gb|EGT30309.1| hypothetical protein CAEBREN_25850 [Caenorhabditis brenneri]
Length = 633
Score = 68.2 bits (165), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 62/125 (49%), Gaps = 13/125 (10%)
Query: 69 CYRYSLEGNRGARIVATRKWEKND--NIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSC 126
C Y+ + N G +++ TR W+ N+ ++ L E +E N +S
Sbjct: 143 CTMYTDDNNLGLKLICTRNWKPNEFLDLHNLSSPFNEAPQE---------TNTYSTSSIL 193
Query: 127 -RKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
+++ WLGP A INH C PN FV R T VK+L+ I+ GEE+ YG +FG
Sbjct: 194 PKRDKDGFWLGPLAIINHSCDPNAIFVNYGRKT-LVKILKPIKVGEEVFVNYGPKYFGLG 252
Query: 186 NCLCE 190
+C CE
Sbjct: 253 HCQCE 257
>gi|388579212|gb|EIM19539.1| hypothetical protein WALSEDRAFT_70560 [Wallemia sebi CBS 633.66]
Length = 358
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 68/134 (50%), Gaps = 7/134 (5%)
Query: 82 IVATRKWEKNDNIAYLVGCIAELTK--EEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAA 139
+VAT+ K+ I+ C+A L K ++ ++L ND+S++ S +++LGPA
Sbjct: 119 LVATKNIPKDTLIS---NCLATLIKISHDQDKVLRDSNNDWSILTSLTTG-TRIFLGPAR 174
Query: 140 YINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMK 199
+ NHDC N + T+ + + + I+ G+E+ YG +FG+ N C C TCE
Sbjct: 175 FANHDCDNNVQLERTQ-NHVYFRTNQWIKSGQEVLLNYGNHYFGENNENCFCLTCETNNN 233
Query: 200 GAFAQSSVEPQTTQ 213
GAF Q Q
Sbjct: 234 GAFKNLDYNTQDNQ 247
>gi|443917788|gb|ELU38428.1| MIP-T3 domain-containing protein [Rhizoctonia solani AG-1 IA]
Length = 1250
Score = 63.2 bits (152), Expect = 1e-07, Method: Composition-based stats.
Identities = 49/169 (28%), Positives = 77/169 (45%), Gaps = 17/169 (10%)
Query: 90 KNDNIAYLVGCIAELTKEEE---KRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYIN-HDC 145
K I L G IA+LT +E+ +R + DFSV+YS +K C QL+LGPA ++N + C
Sbjct: 162 KGQKIHELKGSIADLTNKEDDELRRSSGSASRDFSVIYSHQKGCNQLFLGPARFVNTYTC 221
Query: 146 RPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQS 205
V R + + ++ + I G +N C C TCE +G +A
Sbjct: 222 GRGDYCVVWARLLSVLFIVLGLALAHSI-------LVGKQNKHCLCATCEGAGRGGYAPP 274
Query: 206 SVEPQTTQPSRYCLRDTDMRLNKRKLHKKLNRLLLASDKNDTNSSDNSS 254
P+T+ R + KR+ ++ +SD T++SD S
Sbjct: 275 GTYPETSDSER------EKEFLKRQQKQQAEDANPSSDVQPTDTSDQPS 317
>gi|365898660|ref|ZP_09436605.1| putative histone-lysine N-methyltransferase with a SET domain
[Bradyrhizobium sp. STM 3843]
gi|365420571|emb|CCE09147.1| putative histone-lysine N-methyltransferase with a SET domain
[Bradyrhizobium sp. STM 3843]
Length = 222
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/164 (29%), Positives = 77/164 (46%), Gaps = 21/164 (12%)
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEE----EKRLLHTGLNDFSVMYSCRKNCAQLW 134
G + AT+ +K + I G + + KEE E + L +++ S R+N A+
Sbjct: 18 GLGLFATKPIKKGEKIIRYFGPLLDTKKEEDDAVENKYLFELDKRWTIDGSVRENIAR-- 75
Query: 135 LGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCFYGEDFFGD--KNCLCEC 191
YINH CRPN V + ++ +++IE GEEI YG D+F K C+C
Sbjct: 76 -----YINHSCRPNAESDVKPRKKKIFIRAIKNIEPGEEINYDYGTDYFKAYLKPIGCKC 130
Query: 192 DTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLHKKL 235
+ CE + K A++ E R L++ + R KRK K +
Sbjct: 131 EGCERKRKKKRAEARAE-------RIRLKEKEERKAKRKAEKSV 167
>gi|92118074|ref|YP_577803.1| nuclear protein SET [Nitrobacter hamburgensis X14]
gi|91800968|gb|ABE63343.1| nuclear protein SET [Nitrobacter hamburgensis X14]
Length = 197
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 47/154 (30%), Positives = 74/154 (48%), Gaps = 15/154 (9%)
Query: 63 LPELKPCYRYSLEGNR-GARIVATRKWEKNDNIAYLVGCIAELTKEE----EKRLLHTGL 117
+P + P Y + +R G + AT+ +K I G + + KE+ E + L
Sbjct: 1 MPVISPNKPYRIGRSRTGLGLFATKPIKKGTKIIRYSGPLLDCNKEKDDAVENKYLFQIT 60
Query: 118 NDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCF 176
N +++ S RKN A+ YINH C+PN V+ + ++ ++DI GEEI
Sbjct: 61 NRWTIDGSLRKNIAR-------YINHSCKPNAESDVSKIKRRVDIRAIKDIAPGEEINYD 113
Query: 177 YGEDFFGD--KNCLCECDTCEIRMKGAFAQSSVE 208
YG D+F + K C+CD CE + K A + E
Sbjct: 114 YGTDYFKEYLKPIGCKCDACEKKRKKLRAAAREE 147
>gi|115523596|ref|YP_780507.1| nuclear protein SET [Rhodopseudomonas palustris BisA53]
gi|115517543|gb|ABJ05527.1| nuclear protein SET [Rhodopseudomonas palustris BisA53]
Length = 217
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 47/153 (30%), Positives = 73/153 (47%), Gaps = 16/153 (10%)
Query: 63 LPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEE----EKRLLHTGLN 118
+P KP YR G + AT+ +K I G + + KE+ E + L
Sbjct: 4 IPASKP-YRVG-RAKTGLGLFATQPIKKGTKIIRYFGPLLDCHKEKDDAVENKYLFQLNK 61
Query: 119 DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCFY 177
+++ S RKN A+ YINH C+PN V + ++ +++IE GEEI Y
Sbjct: 62 RWTIDGSIRKNIAR-------YINHSCKPNAESDVNPRKKKIVIRAIKNIEPGEEINYDY 114
Query: 178 GEDFFGD--KNCLCECDTCEIRMKGAFAQSSVE 208
G D+F + K C+CD+CE + K A++ E
Sbjct: 115 GTDYFKEYLKPIGCKCDSCEKKRKKQRAEARAE 147
>gi|338975263|ref|ZP_08630618.1| TolA protein [Bradyrhizobiaceae bacterium SG-6C]
gi|338231862|gb|EGP06997.1| TolA protein [Bradyrhizobiaceae bacterium SG-6C]
Length = 230
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 45/154 (29%), Positives = 73/154 (47%), Gaps = 15/154 (9%)
Query: 63 LPELKPCYRYSLEGNR-GARIVATRKWEKNDNIAYLVGCIAELTKEE----EKRLLHTGL 117
+PE+ Y + ++ G + ATRK +K I G + + ++ + + L
Sbjct: 1 MPEISSSKPYRIGRSKTGLGLFATRKIKKGTKIIRYFGPLLDSKNKKHDDIDNKYLFEIN 60
Query: 118 NDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCF 176
N +++ S RKN A+ Y+NH CRPN V + + ++ ++ IE GEEI
Sbjct: 61 NRWTIDGSTRKNIAR-------YVNHACRPNAESDVNSRKRKVIIRAIKTIEPGEEINYD 113
Query: 177 YGEDFFGD--KNCLCECDTCEIRMKGAFAQSSVE 208
YG D+F K C+CD CE + AQ+ E
Sbjct: 114 YGTDYFKIFLKPIGCKCDHCEKKRAKKRAQARAE 147
>gi|75676346|ref|YP_318767.1| nuclear protein SET [Nitrobacter winogradskyi Nb-255]
gi|74421216|gb|ABA05415.1| Nuclear protein SET [Nitrobacter winogradskyi Nb-255]
Length = 196
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/161 (29%), Positives = 79/161 (49%), Gaps = 15/161 (9%)
Query: 63 LPELKPCYRYSLEGNR-GARIVATRKWEKNDNIAYLVGCIAELTKEE----EKRLLHTGL 117
+P + P Y + +R G + AT+ +K I VG + + TKE+ E + L
Sbjct: 1 MPVIPPNKPYRIGRSRTGLGLFATKPIKKGTKIIRYVGRLLDCTKEKDDAVENKYLFQIT 60
Query: 118 NDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCF 176
+ +++ S R N A+ YINH C+PN V + ++ ++DI GEEI
Sbjct: 61 SRWTIDGSMRSNIAR-------YINHSCKPNAESDVRKIKRRVDIRAIKDIAPGEEINYD 113
Query: 177 YGEDFFGD--KNCLCECDTCEIRMKGAFAQSSVEPQTTQPS 215
YG ++F + K C+CD+CE + K A + E + ++ S
Sbjct: 114 YGTEYFKEYLKPIGCKCDSCEKKRKKLRAAAREEKKHSRGS 154
>gi|91976192|ref|YP_568851.1| nuclear protein SET [Rhodopseudomonas palustris BisB5]
gi|91682648|gb|ABE38950.1| nuclear protein SET [Rhodopseudomonas palustris BisB5]
Length = 300
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 67/137 (48%), Gaps = 14/137 (10%)
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEE----EKRLLHTGLNDFSVMYSCRKNCAQLW 134
G + AT+ +K I G + + K++ E + L ++V S RKN A+
Sbjct: 17 GLGLFATQPIKKGAKIVRYFGPMLDCNKKKDDAVENKYLFQINKRWTVDGSVRKNIAR-- 74
Query: 135 LGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCFYGEDFFGD--KNCLCEC 191
YINH C+PN V+ + ++ +++IE GEEI YG D+F + K C+C
Sbjct: 75 -----YINHACKPNAESDVSVRKRKIIIRAIKNIEPGEEINYDYGTDYFKEYLKPIGCKC 129
Query: 192 DTCEIRMKGAFAQSSVE 208
D CE + K A++ E
Sbjct: 130 DACEKKRKKKSAEARAE 146
>gi|414167564|ref|ZP_11423792.1| hypothetical protein HMPREF9696_01647 [Afipia clevelandensis ATCC
49720]
gi|410889896|gb|EKS37697.1| hypothetical protein HMPREF9696_01647 [Afipia clevelandensis ATCC
49720]
Length = 230
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 73/154 (47%), Gaps = 15/154 (9%)
Query: 63 LPELKPCYRYSLEGNR-GARIVATRKWEKNDNIAYLVGCIAELTKEE----EKRLLHTGL 117
+PE+ Y + ++ G + ATRK +K I G + + ++ + + L
Sbjct: 1 MPEISSSKPYRIGRSKTGLGLFATRKIKKGTKIIRYFGPLLDSKNKKHDDIDNKYLFEIN 60
Query: 118 NDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCF 176
N +++ S RKN A+ Y+NH CRPN V + + ++ ++ IE GEEI
Sbjct: 61 NRWTIDGSTRKNIAR-------YVNHACRPNAESDVNSRKRKVIIRAIKTIEPGEEINYD 113
Query: 177 YGEDFFGD--KNCLCECDTCEIRMKGAFAQSSVE 208
YG D+F K C+CD CE + A++ E
Sbjct: 114 YGTDYFKIFLKPIGCKCDHCEKKRAKKRAEARAE 147
>gi|76156107|gb|AAX27342.2| SJCHGC08833 protein [Schistosoma japonicum]
Length = 178
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 25/49 (51%), Positives = 32/49 (65%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLL 113
E++PC+RY+ E + GA I AT+ W K I+ LVGCIAEL EE L
Sbjct: 117 EIRPCWRYASENHMGAAIFATKDWTKGSRISTLVGCIAELKHHEEAAFL 165
>gi|367472777|ref|ZP_09472353.1| putative histone-lysine N-methyltransferase with a SET domain
[Bradyrhizobium sp. ORS 285]
gi|365274960|emb|CCD84821.1| putative histone-lysine N-methyltransferase with a SET domain
[Bradyrhizobium sp. ORS 285]
Length = 231
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 44/140 (31%), Positives = 66/140 (47%), Gaps = 16/140 (11%)
Query: 63 LPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEE----EKRLLHTGLN 118
+P+ KP YR G + AT+ +K I G + + K E E + L N
Sbjct: 4 IPQTKP-YRIG-RSKTGLGLFATQPIKKGSKIIRYFGPLLDSKKAEDDAIENKYLFELDN 61
Query: 119 DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCFY 177
+++ S R+N A+ YINH C+PN V + ++ +++IE GEEI Y
Sbjct: 62 RWTIDGSIRENVAR-------YINHSCKPNAESDVKPRKKKIFIRAIKNIEPGEEINYDY 114
Query: 178 GEDFFGD--KNCLCECDTCE 195
G D+F K C+CD CE
Sbjct: 115 GTDYFKAYLKPIGCKCDACE 134
>gi|392579537|gb|EIW72664.1| hypothetical protein TREMEDRAFT_58833 [Tremella mesenterica DSM
1558]
Length = 1067
Score = 56.6 bits (135), Expect = 1e-05, Method: Composition-based stats.
Identities = 37/99 (37%), Positives = 48/99 (48%), Gaps = 11/99 (11%)
Query: 119 DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY- 177
DFS+++S KNC QL+LGPA +INHDC PN + R + R E G +
Sbjct: 227 DFSIVWSGLKNCFQLFLGPARFINHDCSPNVELL---RQAEMTILARITENGGFFISWLM 283
Query: 178 -GEDFFGDKNCLCECDTCEIRMKGAFA--QSSVEPQTTQ 213
D CL TCEI KG F +SS+ T+
Sbjct: 284 SSSKLTSDSLCL----TCEIAGKGGFTIKESSLSRSVTR 318
>gi|85715646|ref|ZP_01046626.1| Nuclear protein SET [Nitrobacter sp. Nb-311A]
gi|85697585|gb|EAQ35462.1| Nuclear protein SET [Nitrobacter sp. Nb-311A]
Length = 196
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 77/159 (48%), Gaps = 15/159 (9%)
Query: 63 LPELKPCYRYSLEGNR-GARIVATRKWEKNDNIAYLVGCIAELTKEE----EKRLLHTGL 117
+P + P Y + +R G + AT+ +K I G + + TKE+ E + L
Sbjct: 1 MPVISPNKPYRIGRSRTGLGLFATKPIKKGTKIIRYSGRMLDCTKEKDDAVENKYLFQIT 60
Query: 118 NDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCF 176
+ +++ S R N A+ YINH C+PN V + ++ ++DI GEEI
Sbjct: 61 SRWTIDGSMRSNVAR-------YINHSCKPNAESDVRKIKRRVDIRAIKDIAPGEEINYD 113
Query: 177 YGEDFFGD--KNCLCECDTCEIRMKGAFAQSSVEPQTTQ 213
YG ++F + K C+CD+CE + K A + E + ++
Sbjct: 114 YGTEYFKEYLKPIGCKCDSCEKKRKKLRAAAREEKKRSR 152
>gi|383770712|ref|YP_005449775.1| hypothetical protein S23_24500 [Bradyrhizobium sp. S23321]
gi|381358833|dbj|BAL75663.1| hypothetical protein S23_24500 [Bradyrhizobium sp. S23321]
Length = 188
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 54/105 (51%), Gaps = 10/105 (9%)
Query: 107 EEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTAC-VKVLR 165
E E + L N +++ S RKN A+ YINH CRPN R+ ++ ++
Sbjct: 50 EIENKYLFELNNRWTIDGSVRKNLAR-------YINHSCRPNAESDVRPRERKVFIRAIK 102
Query: 166 DIEQGEEITCFYGEDFFGD--KNCLCECDTCEIRMKGAFAQSSVE 208
+IE G+EI YG D+F K C+CD+CE + K A++ E
Sbjct: 103 NIEPGDEINYDYGTDYFKAYLKPIGCKCDSCEKKRKKLRAEARAE 147
>gi|384217618|ref|YP_005608784.1| hypothetical protein BJ6T_39220 [Bradyrhizobium japonicum USDA 6]
gi|354956517|dbj|BAL09196.1| hypothetical protein BJ6T_39220 [Bradyrhizobium japonicum USDA 6]
Length = 185
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 10/105 (9%)
Query: 107 EEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTAC-VKVLR 165
E E + L N +++ S RKN A+ YINH CRPN R+ ++ +R
Sbjct: 50 EIENKYLFELNNRWTIDGSVRKNLAR-------YINHSCRPNAESDVRPRERKVFIRAIR 102
Query: 166 DIEQGEEITCFYGEDFFGD--KNCLCECDTCEIRMKGAFAQSSVE 208
+IE G+EI YG D+F K C+C +CE + K A++ E
Sbjct: 103 NIEPGDEINYDYGTDYFKAYLKPIGCKCPSCETKRKKLRAEARAE 147
>gi|209884557|ref|YP_002288414.1| nuclear protein SET [Oligotropha carboxidovorans OM5]
gi|337741771|ref|YP_004633499.1| methyltransferase [Oligotropha carboxidovorans OM5]
gi|386030787|ref|YP_005951562.1| methyltransferase [Oligotropha carboxidovorans OM4]
gi|209872753|gb|ACI92549.1| nuclear protein SET [Oligotropha carboxidovorans OM5]
gi|336095855|gb|AEI03681.1| methyltransferase [Oligotropha carboxidovorans OM4]
gi|336099435|gb|AEI07258.1| methyltransferase [Oligotropha carboxidovorans OM5]
Length = 256
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 62/124 (50%), Gaps = 14/124 (11%)
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEE----EKRLLHTGLNDFSVMYSCRKNCAQLW 134
G + AT+ +K I G + + K++ E + L +++ S RKN A+
Sbjct: 18 GLGLFATQPIKKGTKIIRYFGPMLDCRKKKDDAIENKYLFAINGRWTIDGSVRKNIAR-- 75
Query: 135 LGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCFYGEDFFGD--KNCLCEC 191
YINH CRPN V T + T ++ +++I G+EI YG D+F + K C+C
Sbjct: 76 -----YINHSCRPNAESDVNTRKRTVFIRAIKNIAPGDEINYDYGTDYFKEYLKPIGCKC 130
Query: 192 DTCE 195
D+CE
Sbjct: 131 DSCE 134
>gi|374576576|ref|ZP_09649672.1| SET domain-containing protein [Bradyrhizobium sp. WSM471]
gi|374424897|gb|EHR04430.1| SET domain-containing protein [Bradyrhizobium sp. WSM471]
Length = 186
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 65/137 (47%), Gaps = 14/137 (10%)
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEE----EKRLLHTGLNDFSVMYSCRKNCAQLW 134
G + AT+ +K I G I + E E + L +++ S RKN A+
Sbjct: 18 GLGLFATKPIKKGTRIIRYYGPILDSRIPEHDDIENKYLFELNGRWTIDGSVRKNLAR-- 75
Query: 135 LGPAAYINHDCRPNCRFVATERDTAC-VKVLRDIEQGEEITCFYGEDFFGD--KNCLCEC 191
YINH CRPN R+ ++ +++IE G+EI YG D+F K C+C
Sbjct: 76 -----YINHSCRPNAESDVRPRERKVFIRAIKNIEPGDEINYDYGTDYFKAYLKPIGCKC 130
Query: 192 DTCEIRMKGAFAQSSVE 208
++CE + K A++ VE
Sbjct: 131 ESCEKKRKKLRAEARVE 147
>gi|421604165|ref|ZP_16046406.1| hypothetical protein BCCGELA001_36622 [Bradyrhizobium sp.
CCGE-LA001]
gi|404263723|gb|EJZ29162.1| hypothetical protein BCCGELA001_36622 [Bradyrhizobium sp.
CCGE-LA001]
Length = 188
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 60/128 (46%), Gaps = 14/128 (10%)
Query: 79 GARIVATRKWEKNDNIAYLVGCIAE----LTKEEEKRLLHTGLNDFSVMYSCRKNCAQLW 134
G + AT+ +K I G I + E E + L +++ S RKN A+
Sbjct: 18 GLGLFATKPIKKGTRIIRYFGPILDSRIPAHDEIENKYLFELNGRWTIDGSVRKNLAR-- 75
Query: 135 LGPAAYINHDCRPNCRFVATERDTAC-VKVLRDIEQGEEITCFYGEDFFGD--KNCLCEC 191
YINH CRPN R+ ++ +R+IE G+EI YG D+F K C+C
Sbjct: 76 -----YINHSCRPNAESDVRPRERKVFIRAIRNIEPGDEINYDYGTDYFKAYLKPIGCKC 130
Query: 192 DTCEIRMK 199
D+CE + K
Sbjct: 131 DSCEKKRK 138
>gi|90423049|ref|YP_531419.1| nuclear protein SET [Rhodopseudomonas palustris BisB18]
gi|90105063|gb|ABD87100.1| nuclear protein SET [Rhodopseudomonas palustris BisB18]
Length = 251
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/139 (30%), Positives = 65/139 (46%), Gaps = 16/139 (11%)
Query: 63 LPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEE----EKRLLHTGLN 118
+P KP YR G + ATR +K I G + + K++ E + L
Sbjct: 4 IPASKP-YRIG-RSKTGLGLFATRPIKKGSKIIRYFGPLLDCNKKKHDAIENKYLFQLNK 61
Query: 119 DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCFY 177
+++ S RKN A+ YINH C+PN V + ++ +++IE GEEI Y
Sbjct: 62 RWTIDGSVRKNIAR-------YINHACKPNAESDVNPRKKKIVIRAIKNIEPGEEINYDY 114
Query: 178 GEDFFGD--KNCLCECDTC 194
G D+F + K C+CD C
Sbjct: 115 GTDYFKEYLKPIGCKCDAC 133
>gi|414174637|ref|ZP_11429041.1| hypothetical protein HMPREF9695_02687 [Afipia broomeae ATCC 49717]
gi|410888466|gb|EKS36269.1| hypothetical protein HMPREF9695_02687 [Afipia broomeae ATCC 49717]
Length = 246
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 73/154 (47%), Gaps = 15/154 (9%)
Query: 63 LPELKPCYRYSLEGNR-GARIVATRKWEKNDNIAYLVGCIAELTKEE----EKRLLHTGL 117
+PE+ Y + ++ G + AT+K +K I G + + ++ + + L
Sbjct: 1 MPEISSNKPYRIGRSKTGLGLFATQKIKKGTKIIRYFGPMLDSKNKKHDDIDNKYLFEIN 60
Query: 118 NDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCF 176
N +++ S RKN A+ YINH CRPN V + + ++ ++ I+ GEEI
Sbjct: 61 NRWTIDGSVRKNIAR-------YINHACRPNAESDVNSRKRKVVIRAIKTIQPGEEINYD 113
Query: 177 YGEDFFGD--KNCLCECDTCEIRMKGAFAQSSVE 208
YG D+F K C+CD CE + A++ E
Sbjct: 114 YGTDYFKIFLKPIGCKCDHCEKKRAKKRAEARAE 147
>gi|365884644|ref|ZP_09423679.1| putative histone-lysine N-methyltransferase with a SET domain
[Bradyrhizobium sp. ORS 375]
gi|365286765|emb|CCD96210.1| putative histone-lysine N-methyltransferase with a SET domain
[Bradyrhizobium sp. ORS 375]
Length = 228
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 43/139 (30%), Positives = 64/139 (46%), Gaps = 16/139 (11%)
Query: 63 LPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEE----EKRLLHTGLN 118
+P KP YR G + AT+ +K I G + + K E E + L N
Sbjct: 4 IPSTKP-YRIG-RSKTGLGLFATQPIKKGSKIIRYFGPLLDSKKAEDDAIENKYLFELDN 61
Query: 119 DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCFY 177
+++ S R+N A+ YINH C+PN V + ++ +++IE GEEI Y
Sbjct: 62 RWTIDGSIRENVAR-------YINHSCKPNAESDVKPRKKKIFIRAIKNIEPGEEINYDY 114
Query: 178 GEDFFGD--KNCLCECDTC 194
G D+F K C+CD C
Sbjct: 115 GTDYFKAYLKPIGCKCDAC 133
>gi|398824672|ref|ZP_10582994.1| SET domain-containing protein [Bradyrhizobium sp. YR681]
gi|398224658|gb|EJN10958.1| SET domain-containing protein [Bradyrhizobium sp. YR681]
Length = 203
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 46/87 (52%), Gaps = 10/87 (11%)
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTAC-VKVLRDIEQGEEITCFYGEDFFG 183
S RKN A+ YINH CRPN R+ ++ +R+IE G+EI YG D+F
Sbjct: 82 SVRKNLAR-------YINHSCRPNAESDVRPRERKVFIRAIRNIEPGDEINYDYGTDYFK 134
Query: 184 D--KNCLCECDTCEIRMKGAFAQSSVE 208
K C+C++CE + K A++ E
Sbjct: 135 AYLKPIGCKCESCEKKRKKLRAEARAE 161
>gi|414161969|ref|ZP_11418216.1| hypothetical protein HMPREF9697_00117 [Afipia felis ATCC 53690]
gi|410879749|gb|EKS27589.1| hypothetical protein HMPREF9697_00117 [Afipia felis ATCC 53690]
Length = 235
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 64/137 (46%), Gaps = 14/137 (10%)
Query: 79 GARIVATRKWEKNDNIAYLVG----CIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLW 134
G + AT+ +K I G C + E E + L N +++ S RKN A+
Sbjct: 18 GLGLFATKPIKKGTQIIRYFGPMLDCRKKKDDEVENKYLFQINNRWTIDGSVRKNTAR-- 75
Query: 135 LGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCFYGEDFFGD--KNCLCEC 191
YINH CRPN V + + T ++ +++I GEEI YG ++F + K C+C
Sbjct: 76 -----YINHACRPNAESDVNSRKRTVHIRAIKNIAPGEEINYDYGTEYFKEYLKPIGCKC 130
Query: 192 DTCEIRMKGAFAQSSVE 208
CE + A++ E
Sbjct: 131 AACEKKRAKQRAEARAE 147
>gi|451847073|gb|EMD60381.1| hypothetical protein COCSADRAFT_174666 [Cochliobolus sativus
ND90Pr]
Length = 1123
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 47/168 (27%), Positives = 72/168 (42%), Gaps = 29/168 (17%)
Query: 68 PCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCR 127
P Y + +VA +++ I LVG L+ G+ VM S
Sbjct: 489 PVQVYRTPDGKNNGLVALTAFDRGTGIGELVG------------LITKGMRHLDVMDSST 536
Query: 128 KNCA-QLWLGPAA----YINHDCRPNCR-----FVATERDTACVKVLRDIEQGEEITCFY 177
++ + Q+W G +INH C+PN + ++ T+R V V + +E G EIT Y
Sbjct: 537 QSTSYQIWQGRVGNYTRFINHSCKPNVQSSTFTWLDTQR---VVLVSKGVEAGSEITLDY 593
Query: 178 GEDFFG--DKNCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTD 223
G+ ++ DK+CLC C GA Q +E Q+ R L D
Sbjct: 594 GDKYWAGLDKSCLCGESCCS--DSGASDQEILEAQSGARKRRRLSPAD 639
>gi|386401692|ref|ZP_10086470.1| SET domain-containing protein [Bradyrhizobium sp. WSM1253]
gi|385742318|gb|EIG62514.1| SET domain-containing protein [Bradyrhizobium sp. WSM1253]
Length = 186
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 64/137 (46%), Gaps = 14/137 (10%)
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEE----EKRLLHTGLNDFSVMYSCRKNCAQLW 134
G + AT+ +K I G I + E E + L +++ S RKN A+
Sbjct: 18 GLGLFATKPIKKGTRIIRYYGPILDSRIPEHDDIENKYLFELNGRWTIDGSVRKNLAR-- 75
Query: 135 LGPAAYINHDCRPNCRFVATERDTAC-VKVLRDIEQGEEITCFYGEDFFGD--KNCLCEC 191
YINH CRPN R+ ++ +++IE G+EI YG D+F K C+C
Sbjct: 76 -----YINHSCRPNAESDVRPRERKVFIRAIKNIEPGDEINYDYGTDYFKAYLKPIGCKC 130
Query: 192 DTCEIRMKGAFAQSSVE 208
++CE + K A++ E
Sbjct: 131 ESCEKKRKKLRAEARTE 147
>gi|365886265|ref|ZP_09425212.1| putative histone-lysine N-methyltransferase with a SET domain
[Bradyrhizobium sp. STM 3809]
gi|365338230|emb|CCD97743.1| putative histone-lysine N-methyltransferase with a SET domain
[Bradyrhizobium sp. STM 3809]
Length = 244
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 43/140 (30%), Positives = 64/140 (45%), Gaps = 16/140 (11%)
Query: 63 LPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEE----EKRLLHTGLN 118
+P KP YR G + AT+ +K I G + + K E E + L N
Sbjct: 4 IPSTKP-YRIG-RSKTGLGLFATQPIKKGTKIIRYFGPLLDSKKAEDDAIENKYLFELDN 61
Query: 119 DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCFY 177
+++ S R+N A+ YINH C+PN V + ++ +++I GEEI Y
Sbjct: 62 RWTIDGSVRENVAR-------YINHSCKPNAESDVKPRKKKIFIRAIKNIAPGEEINYDY 114
Query: 178 GEDFFGD--KNCLCECDTCE 195
G D+F K C+CD CE
Sbjct: 115 GTDYFKAYLKPIGCKCDACE 134
>gi|146341873|ref|YP_001206921.1| histone-lysine N-methyltransferase [Bradyrhizobium sp. ORS 278]
gi|146194679|emb|CAL78704.1| Putative histone-lysine N-methyltransferase with a SET domain
[Bradyrhizobium sp. ORS 278]
Length = 222
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 63/139 (45%), Gaps = 16/139 (11%)
Query: 63 LPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEE----EKRLLHTGLN 118
+P KP YR G + AT+ +K I G + + K E E + L N
Sbjct: 4 IPSTKP-YRIG-RSKTGLGLFATQPIKKGTKIIRYFGPLLDSKKAEDDAIENKYLFELDN 61
Query: 119 DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCFY 177
+++ S R+N A+ YINH C+PN V + ++ +++I GEEI Y
Sbjct: 62 RWTIDGSVRENVAR-------YINHSCKPNAESDVKPRKKKIFIRAIKNIAPGEEINYDY 114
Query: 178 GEDFFGD--KNCLCECDTC 194
G D+F K C+CD C
Sbjct: 115 GTDYFKAYLKPIGCKCDAC 133
>gi|27380898|ref|NP_772427.1| hypothetical protein bll5787 [Bradyrhizobium japonicum USDA 110]
gi|27354064|dbj|BAC51052.1| bll5787 [Bradyrhizobium japonicum USDA 110]
Length = 186
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 62/137 (45%), Gaps = 14/137 (10%)
Query: 79 GARIVATRKWEKNDNIAYLVGCIAE----LTKEEEKRLLHTGLNDFSVMYSCRKNCAQLW 134
G + AT+ +K I G I + E E + L +++ S RKN A+
Sbjct: 18 GLGLFATKPIKKGTRIIRYFGPILDSRIPAHDEIENKYLFELNGRWTIDGSVRKNLAR-- 75
Query: 135 LGPAAYINHDCRPNCRFVATERDTAC-VKVLRDIEQGEEITCFYGEDFFGD--KNCLCEC 191
YINH CRPN R+ ++ +++IE G+EI YG D+F K C+C
Sbjct: 76 -----YINHSCRPNAESDVRPRERKVFIRAIKNIEPGDEINYDYGTDYFKAYLKPIGCKC 130
Query: 192 DTCEIRMKGAFAQSSVE 208
+CE + K A+ E
Sbjct: 131 ASCETKRKKLRAEERAE 147
>gi|170105357|ref|XP_001883891.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164641071|gb|EDR05333.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 326
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/142 (28%), Positives = 67/142 (47%), Gaps = 5/142 (3%)
Query: 76 GN-RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLW 134
GN RG I +TR +++ + L+G + + K E RL + + ++
Sbjct: 155 GNERGFSITSTRDLVQHEVLYSLMGLMPDDAKAEHSRLSEIEPHPSQLHNKANAEKPRIL 214
Query: 135 LGPAAYINHDCRP-NCRFVATERDTA-CVKVLRDIEQGEEITCFYGEDFFGD-KNCLCEC 191
+GP +INH CR N +VA + A V+ + I+ GEEI YG+ +F D C C
Sbjct: 215 IGPIRFINHACRSYNAEYVAVKSKLAFVVQTTKAIKSGEEIFVNYGDTYFEDLPGGRCPC 274
Query: 192 DTCEIRMKGAFAQSSVEPQTTQ 213
C + A+S +E + ++
Sbjct: 275 RDC-VGEGSREAKSKIESEDSE 295
>gi|58264890|ref|XP_569601.1| hypothetical protein CNC03710 [Cryptococcus neoformans var.
neoformans JEC21]
gi|57225833|gb|AAW42294.1| hypothetical protein CNC03710 [Cryptococcus neoformans var.
neoformans JEC21]
Length = 1103
Score = 52.8 bits (125), Expect = 2e-04, Method: Composition-based stats.
Identities = 36/120 (30%), Positives = 58/120 (48%), Gaps = 21/120 (17%)
Query: 103 ELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVK 162
E ++ ++R T DFS+++S K C QL+LGPA ++N V
Sbjct: 224 ESRRQGQRRSDRTKRRDFSIVWSGLKRCYQLFLGPARFLN-----------------WVY 266
Query: 163 VLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQTT-QPSRYCLRD 221
+ + + IT GE++FG N C C TCE G F + +P+++ + SR+ RD
Sbjct: 267 ITFEWKPKLTITASSGENYFGKGNVECLCLTCEKNHHGGF---TPKPESSRRTSRHLSRD 323
>gi|299131755|ref|ZP_07024950.1| nuclear protein SET [Afipia sp. 1NLS2]
gi|298591892|gb|EFI52092.1| nuclear protein SET [Afipia sp. 1NLS2]
Length = 227
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 66/137 (48%), Gaps = 14/137 (10%)
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEE----EKRLLHTGLNDFSVMYSCRKNCAQLW 134
G + AT+ +K I G + + K+E E + L + +++ S RKN A+
Sbjct: 18 GLGLFATKPIKKGTKIIRYFGPMLDCRKKEDDAVENKYLFQINSRWTIDGSVRKNTAR-- 75
Query: 135 LGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCFYGEDFFGD--KNCLCEC 191
YINH CRPN V + + T ++ +++I GEEI YG ++F + K C+C
Sbjct: 76 -----YINHACRPNAESDVNSRKRTVYIRAIKNIAPGEEINYDYGTEYFKEYLKPIGCKC 130
Query: 192 DTCEIRMKGAFAQSSVE 208
CE + A++ E
Sbjct: 131 AACEKKRAKQRAEARAE 147
>gi|291232347|ref|XP_002736118.1| PREDICTED: myeloid/lymphoid or mixed-lineage leukemia 4-like, partial
[Saccoglossus kowalevskii]
Length = 3264
Score = 52.8 bits (125), Expect = 2e-04, Method: Composition-based stats.
Identities = 39/130 (30%), Positives = 59/130 (45%), Gaps = 9/130 (6%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + ATR EK+ + +G I + ++ N M+ + A L
Sbjct: 3134 QGLGLFATRDIEKHTMVIEYIGTIIRNEVANRREDIYEEANRGVYMFRINSDYVIDATLT 3193
Query: 135 LGPAAYINHDCRPNC--RFVATERDTACVKVLRD-IEQGEEITCFYGEDFFGDKN---CL 188
GPA Y+NH C PNC V +++ + + I +GEE+T Y DF DKN CL
Sbjct: 3194 GGPARYVNHSCSPNCVAEVVTFDKEQKIIIISNSRIAKGEELTYDYKFDFEDDKNKISCL 3253
Query: 189 CECDTCEIRM 198
C+ C+ M
Sbjct: 3254 CKAVNCKKWM 3263
>gi|328697445|ref|XP_003240336.1| PREDICTED: histone-lysine N-methyltransferase set9-like
[Acyrthosiphon pisum]
Length = 148
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/70 (41%), Positives = 41/70 (58%), Gaps = 4/70 (5%)
Query: 82 IVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYI 141
+VA R +KN + L A+L EK ++ G+NDFS++YS L LGP A++
Sbjct: 79 VVAKRHIKKNTKLKELS---AQLFPFAEKYVV-KGVNDFSMIYSQLHKQQCLLLGPLAFV 134
Query: 142 NHDCRPNCRF 151
NH C PNC+F
Sbjct: 135 NHSCTPNCKF 144
>gi|239612023|gb|EEQ89010.1| SET domain-containing protein [Ajellomyces dermatitidis ER-3]
Length = 457
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 4/99 (4%)
Query: 130 CAQLWLGPAAYINHDCRPNCRF---VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
C + + A ++NH C P+ F ++ + LRDIE EEIT YG+ ++ +K
Sbjct: 343 CPKWYGNWARFVNHSCEPSVEFRSRTIGKQIYMMMVALRDIEAFEEITVHYGKRYWREKQ 402
Query: 187 CLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMR 225
CLC C+ + K Q P T P R+ ++R
Sbjct: 403 CLCRSRACKDKKK-PIPQGEPHPIPTMPPNNPYRNWELR 440
>gi|261202006|ref|XP_002628217.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
gi|239590314|gb|EEQ72895.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
Length = 457
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 4/99 (4%)
Query: 130 CAQLWLGPAAYINHDCRPNCRF---VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
C + + A ++NH C P+ F ++ + LRDIE EEIT YG+ ++ +K
Sbjct: 343 CPKWYGNWARFVNHSCEPSVEFRSRTIGKQIYMMMVALRDIEAFEEITVHYGKRYWREKQ 402
Query: 187 CLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMR 225
CLC C+ + K Q P T P R+ ++R
Sbjct: 403 CLCRSRACKDKKK-PIPQGEPHPIPTMPPNNPYRNWELR 440
>gi|396493592|ref|XP_003844094.1| similar to SET domain containing protein [Leptosphaeria maculans
JN3]
gi|312220674|emb|CBY00615.1| similar to SET domain containing protein [Leptosphaeria maculans
JN3]
Length = 616
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 64/146 (43%), Gaps = 21/146 (14%)
Query: 60 IQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLND 119
+Q P YR +LEG IVA +E+ + I LVG I K + H + D
Sbjct: 481 VQRKPPPVQVYR-TLEGKNNG-IVALTSFERGNAIGELVGLIT-------KGVRHVDVMD 531
Query: 120 FSVMYSCRKNCAQLWLGPAA----YINHDCRPNCRFVA-TERDTA-CVKVLRDIEQGEEI 173
S + Q+W G +INH C+ N + T DT V V + IE G EI
Sbjct: 532 SSTPLANY----QIWQGQVGNFTRFINHSCKANAQTTTFTWLDTQRIVLVSKGIEAGSEI 587
Query: 174 TCFYGEDFFG--DKNCLCECDTCEIR 197
T YG+ ++ DK CLC C R
Sbjct: 588 TLDYGDKYWAGLDKTCLCGESCCRYR 613
>gi|239614648|gb|EEQ91635.1| SET domain-containing protein [Ajellomyces dermatitidis ER-3]
Length = 407
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 39/67 (58%), Gaps = 5/67 (7%)
Query: 140 YINHDCRPNCRFVAT---ERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCEC--DTC 194
Y+NH CR FV+T +R V+ +RDIE EEIT YG+++F ++ +C+C + C
Sbjct: 314 YMNHSCRAGVVFVSTVVGDRARVLVRAIRDIEMFEEITVDYGDEYFMPRHRVCKCGEEVC 373
Query: 195 EIRMKGA 201
+ G
Sbjct: 374 RFKEGGG 380
>gi|148256757|ref|YP_001241342.1| histone-lysine N-methyltransferase [Bradyrhizobium sp. BTAi1]
gi|146408930|gb|ABQ37436.1| Putative histone-lysine N-methyltransferase with a SET domain
[Bradyrhizobium sp. BTAi1]
Length = 230
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 38/138 (27%), Positives = 65/138 (47%), Gaps = 15/138 (10%)
Query: 63 LPELKPCYRYSLEGNR-GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGL---- 117
+P + P Y + ++ G + AT+ +K I G + + K +E + + L
Sbjct: 1 MPAIPPTKPYRIGRSKTGLGLFATKPIKKGTKIIRYFGPLLDSKKADEDAIENKYLFELD 60
Query: 118 NDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCF 176
+++ S R+N A+ YINH C+PN V + ++ ++DIE GEEI
Sbjct: 61 KRWTIDGSIRENVAR-------YINHSCKPNAESDVKPRKKKIFIRAIKDIEPGEEINYD 113
Query: 177 YGEDFFGD--KNCLCECD 192
YG D+F K C+C+
Sbjct: 114 YGTDYFKAYLKPIGCKCE 131
>gi|261196303|ref|XP_002624555.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
gi|239587688|gb|EEQ70331.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
Length = 407
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 39/67 (58%), Gaps = 5/67 (7%)
Query: 140 YINHDCRPNCRFVAT---ERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCEC--DTC 194
Y+NH CR FV+T +R V+ +RDIE EEIT YG+++F ++ +C+C + C
Sbjct: 314 YMNHSCRAGVVFVSTVVGDRACVLVRAIRDIEMFEEITVDYGDEYFMPRHRVCKCGEEVC 373
Query: 195 EIRMKGA 201
+ G
Sbjct: 374 RFKEGGG 380
>gi|327356616|gb|EGE85473.1| SET domain-containing protein [Ajellomyces dermatitidis ATCC 18188]
Length = 407
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 39/67 (58%), Gaps = 5/67 (7%)
Query: 140 YINHDCRPNCRFVAT---ERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCEC--DTC 194
Y+NH CR FV+T +R V+ +RDIE EEIT YG+++F ++ +C+C + C
Sbjct: 314 YMNHSCRAGVVFVSTVVGDRACVLVRAIRDIEMFEEITVDYGDEYFMPRHRVCKCGEEVC 373
Query: 195 EIRMKGA 201
+ G
Sbjct: 374 RFKEGGG 380
>gi|321253882|ref|XP_003192885.1| hypothetical protein CGB_C5580C [Cryptococcus gattii WM276]
gi|317459354|gb|ADV21098.1| hypothetical protein CNC03710 [Cryptococcus gattii WM276]
Length = 971
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 33/53 (62%), Gaps = 5/53 (9%)
Query: 105 TKEE-----EKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFV 152
TKEE ++R T DFS+++S K C QL+LGPA ++NHDC PN +
Sbjct: 221 TKEECRRQGQRRSDRTKRRDFSIVWSGLKRCYQLFLGPARFLNHDCNPNVELL 273
>gi|452989388|gb|EME89143.1| hypothetical protein MYCFIDRAFT_209999 [Pseudocercospora fijiensis
CIRAD86]
Length = 403
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 62/143 (43%), Gaps = 29/143 (20%)
Query: 70 YRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKN 129
YR N G ++A +E+ I VG I +G+ + VM
Sbjct: 268 YRTPTGKNNG--LLACCSFERGTAIGEFVGQIT------------SGVANLDVMVGQTDR 313
Query: 130 CA-QLWLGPAA----YINHDCRPNCRF-----VATERDTACVKVLRDIEQGEEITCFYGE 179
A Q+W G ++NH C+PN +F + T+R V V + IE GEEIT Y +
Sbjct: 314 AAYQIWQGKQGNYTRFVNHSCQPNSQFERFIWLGTQR---IVLVSKGIEAGEEITVDYSD 370
Query: 180 DFFG--DKNCLCECDTCEIRMKG 200
++ DK CLC+ C R +
Sbjct: 371 TYWKNLDKECLCDSAKCRYRRRA 393
>gi|336377066|gb|EGO05401.1| hypothetical protein SERLA73DRAFT_69013 [Serpula lacrymans var.
lacrymans S7.3]
gi|336390109|gb|EGO31252.1| hypothetical protein SERLADRAFT_432900 [Serpula lacrymans var.
lacrymans S7.9]
Length = 152
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 37/65 (56%), Gaps = 1/65 (1%)
Query: 131 AQLWLGPAAYINHDCRPNCRFVATERDTACVKV-LRDIEQGEEITCFYGEDFFGDKNCLC 189
++L L P + NHDC PNC+ ++ A + V LRDI+ G++IT Y +D + + C
Sbjct: 26 SRLILRPFRFANHDCNPNCQIYLVKKTHAAILVSLRDIKAGDQITVAYTKDGYYREGVKC 85
Query: 190 ECDTC 194
C C
Sbjct: 86 LCSAC 90
>gi|406992621|gb|EKE11949.1| Nuclear protein SET [uncultured bacterium]
Length = 145
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 3/60 (5%)
Query: 140 YINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD--KNCLCECDTCEIR 197
YINH C+PNC V R + LRDI +GEE++ YG ++F K C+C C++R
Sbjct: 82 YINHSCKPNCE-VQINRRMILIYALRDIAEGEELSYDYGAEYFLTLIKPNGCKCPYCQLR 140
>gi|225561043|gb|EEH09324.1| SET domain-containing protein [Ajellomyces capsulatus G186AR]
Length = 427
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 60/136 (44%), Gaps = 8/136 (5%)
Query: 60 IQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAE-LTKEEEKRLLHTGLN 118
IQ E P +RG + + R + I VG I E ++ LH L
Sbjct: 215 IQLADETNPLLELREYPDRGVGVRSLRSIKAGTYIGQYVGEIREPAARKGTTYALHHTLR 274
Query: 119 DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFV-ATERDTACV--KVLRDIEQGEEITC 175
+ S+ A ++ Y+NH CR FV A ++ACV + +RDIE EEIT
Sbjct: 275 NRSIGVID----AAIYGNWTRYMNHSCRTGVIFVSAVVGNSACVLVRAIRDIEIFEEITV 330
Query: 176 FYGEDFFGDKNCLCEC 191
YG +F + +C+C
Sbjct: 331 DYGAAYFAPRGRVCKC 346
>gi|86750868|ref|YP_487364.1| nuclear protein SET [Rhodopseudomonas palustris HaA2]
gi|86573896|gb|ABD08453.1| Nuclear protein SET [Rhodopseudomonas palustris HaA2]
Length = 250
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 59/123 (47%), Gaps = 14/123 (11%)
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEE----EKRLLHTGLNDFSVMYSCRKNCAQLW 134
G + AT+ +K I G + + K++ E + L ++V S RKN A+
Sbjct: 36 GLGLFATQPIKKGTKIIRYFGPMLDCNKKKDDAVENKYLFQISKRWTVDGSVRKNIAR-- 93
Query: 135 LGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCFYGEDFFGD--KNCLCEC 191
YINH C PN V + ++ +++IE G+EI YG D+F + K C+C
Sbjct: 94 -----YINHACNPNAESDVNVRKRKIIIRAIKNIEPGDEINYDYGTDYFKEYLKPIGCKC 148
Query: 192 DTC 194
++C
Sbjct: 149 ESC 151
>gi|192292758|ref|YP_001993363.1| nuclear protein SET [Rhodopseudomonas palustris TIE-1]
gi|192286507|gb|ACF02888.1| nuclear protein SET [Rhodopseudomonas palustris TIE-1]
Length = 245
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 38/73 (52%), Gaps = 10/73 (13%)
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCFYGEDFFG 183
S RKN A+ YINH C+PN V + ++ +++IE GEEI YG D+F
Sbjct: 67 SVRKNVAR-------YINHACKPNAESDVNPRKKRVIIRAIKNIEPGEEINYDYGTDYFK 119
Query: 184 D--KNCLCECDTC 194
K C+C +C
Sbjct: 120 AYLKPIGCKCVSC 132
>gi|170105259|ref|XP_001883842.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164641022|gb|EDR05284.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 612
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 54/112 (48%), Gaps = 2/112 (1%)
Query: 75 EGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLW 134
+G+ G I ATR + + + L+G +A ++ E RL + V + ++
Sbjct: 492 DGSGGFAIFATRDLVEGEILHPLMGLMASDSRAEHSRLSEIHPHPSQVKSEKGRMDPRVL 551
Query: 135 LGPAAYINHDCRP-NCRFVATERDTA-CVKVLRDIEQGEEITCFYGEDFFGD 184
+GP +INH C N FVA A V+ + I+ EE+ YG+D+F D
Sbjct: 552 VGPLRFINHACHSFNAEFVAIRSTLAFVVQTNKPIKFREEVFVDYGKDYFED 603
>gi|39936932|ref|NP_949208.1| nuclear protein SET [Rhodopseudomonas palustris CGA009]
gi|39650789|emb|CAE29312.1| Nuclear protein SET [Rhodopseudomonas palustris CGA009]
Length = 245
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 38/73 (52%), Gaps = 10/73 (13%)
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCFYGEDFFG 183
S RKN A+ YINH C+PN V + ++ +++IE GEEI YG D+F
Sbjct: 67 SVRKNVAR-------YINHACKPNAESDVNPRKKRVIIRAIKNIEPGEEINYDYGTDYFK 119
Query: 184 D--KNCLCECDTC 194
K C+C +C
Sbjct: 120 AYLKPIGCKCVSC 132
>gi|358368606|dbj|GAA85222.1| SET domain protein [Aspergillus kawachii IFO 4308]
Length = 720
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 34/58 (58%), Gaps = 3/58 (5%)
Query: 140 YINHDCRPNCRF---VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTC 194
+INH CRPN F R T V+V+R+I+ EE+T YG +++ + C+C + C
Sbjct: 499 FINHSCRPNTTFERRTIGNRATMTVEVVRNIQPFEELTINYGREYWKGRTCICGEEGC 556
>gi|124009625|ref|ZP_01694298.1| hypothetical protein M23134_05548 [Microscilla marina ATCC 23134]
gi|123984766|gb|EAY24746.1| hypothetical protein M23134_05548 [Microscilla marina ATCC 23134]
Length = 611
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 34/136 (25%), Positives = 54/136 (39%), Gaps = 19/136 (13%)
Query: 70 YRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKN 129
Y S GN+G ++AT+ + D + + G I +L G + +++
Sbjct: 454 YELSPMGNKGEGVIATKPFTSEDVV--MKGQIVKLLGGNHSHASQMGEHTWAIHE----- 506
Query: 130 CAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGE-----DFFGD 184
G +NH C PNC E + ++ I QGEE+T Y D+F
Sbjct: 507 ------GIIPKVNHSCAPNCGIRLNETGAHDIVAIKPIAQGEELTLDYAMRNYQIDYFPS 560
Query: 185 KNCLCECDTCEIRMKG 200
+ C C C R+ G
Sbjct: 561 Q-CQCGASECRTRITG 575
>gi|430745383|ref|YP_007204512.1| SET domain-containing protein [Singulisphaera acidiphila DSM 18658]
gi|430017103|gb|AGA28817.1| SET domain-containing protein [Singulisphaera acidiphila DSM 18658]
Length = 149
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/128 (30%), Positives = 54/128 (42%), Gaps = 10/128 (7%)
Query: 86 RKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDC 145
R+ E + + LV I EL + T L + ++S K+ L LG + NH
Sbjct: 28 REGEVIERVPVLVVPIDELKQGS----AWTSLGHYCFLWS--KDTVALALGYGSLYNHSF 81
Query: 146 RPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQS 205
RPN R+ + T LRDIE GEEIT Y GD + + KG Q
Sbjct: 82 RPNARYDDRGQRTKVFTSLRDIEPGEEITVNYN----GDPDDDSAVGFAVVEAKGDGDQP 137
Query: 206 SVEPQTTQ 213
P+T +
Sbjct: 138 GRSPKTAR 145
>gi|401411397|ref|XP_003885146.1| putative SET domain-containing protein [Neospora caninum Liverpool]
gi|325119565|emb|CBZ55118.1| putative SET domain-containing protein [Neospora caninum Liverpool]
Length = 1784
Score = 48.1 bits (113), Expect = 0.004, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 33/64 (51%), Gaps = 6/64 (9%)
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN---CLCECDTC 194
A YINH C PNC V + + LRD+ GEE+ FY + F DK+ CLC TC
Sbjct: 1724 ARYINHSCEPNCESVRMPHNAVAIVALRDLLPGEEL--FY-DYHFSDKSKEACLCGARTC 1780
Query: 195 EIRM 198
M
Sbjct: 1781 TGNM 1784
>gi|316932855|ref|YP_004107837.1| nuclear protein SET [Rhodopseudomonas palustris DX-1]
gi|315600569|gb|ADU43104.1| nuclear protein SET [Rhodopseudomonas palustris DX-1]
Length = 249
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 38/73 (52%), Gaps = 10/73 (13%)
Query: 125 SCRKNCAQLWLGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCFYGEDFFG 183
S RKN A+ YINH C+PN V + ++ +++IE GEEI YG D+F
Sbjct: 67 SVRKNVAR-------YINHACKPNAESDVNPRKKRVIIRAIKNIEPGEEINYDYGTDYFK 119
Query: 184 D--KNCLCECDTC 194
K C+C +C
Sbjct: 120 AYLKPIGCKCVSC 132
>gi|350595097|ref|XP_003134587.3| PREDICTED: histone-lysine N-methyltransferase MLL3, partial [Sus
scrofa]
Length = 3482
Score = 48.1 bits (113), Expect = 0.004, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 3352 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 3411
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 3412 GGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKFDFEDDQHKIPCH 3471
Query: 191 CDTCEIR 197
C R
Sbjct: 3472 CGAVNCR 3478
>gi|452847098|gb|EME49030.1| hypothetical protein DOTSEDRAFT_67914 [Dothistroma septosporum
NZE10]
Length = 622
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 46/94 (48%), Gaps = 15/94 (15%)
Query: 113 LHTGLNDFSVMYSCRKNCA-QLWLGPAA----YINHDCRPNCRF-----VATERDTACVK 162
+ GL + VM + A Q+W G ++NH C+PN +F + T+R V
Sbjct: 516 ITAGLTNMDVMVGQTERAAYQIWQGRRGNHTRFVNHSCQPNSQFERFAWLGTQR---VVL 572
Query: 163 VLRDIEQGEEITCFYGEDFFG--DKNCLCECDTC 194
V + IE GEE+T Y + ++ DK CLC C
Sbjct: 573 VSKGIEAGEEVTVDYSDTYWKNLDKECLCGQSRC 606
>gi|389646281|ref|XP_003720772.1| hypothetical protein MGG_02937 [Magnaporthe oryzae 70-15]
gi|351638164|gb|EHA46029.1| hypothetical protein MGG_02937 [Magnaporthe oryzae 70-15]
Length = 1015
Score = 47.8 bits (112), Expect = 0.005, Method: Composition-based stats.
Identities = 55/208 (26%), Positives = 86/208 (41%), Gaps = 29/208 (13%)
Query: 70 YRYSLE----GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
YR +E G RG + A+R +E I G I +T+EE +R ++ D Y
Sbjct: 519 YRVGVEVVHTGPRGFGVRASRCFEPGQIIMEYAGEI--ITEEECERRMNEVYKDNEAYYL 576
Query: 126 CRKNCAQLWLGP----AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFY 177
+ + A ++NH C PNCR V + A I GEE+T Y
Sbjct: 577 MSFDQNMILDATTGSIARFVNHSCSPNCRMIKWIVCGKPRMALFAGDNPIMTGEELTYDY 636
Query: 178 GEDFFGDKN---CLCECDTCEIRM-------KGAFAQSSVEPQTTQPSRYCLRDTDMRLN 227
D F KN CLC + C + K A A + + + + PS+ ++ +
Sbjct: 637 NFDPFSAKNVQKCLCGSENCRGVLGPRTRNDKAAKANGAGKGKNSPPSKKQTMKATVKAS 696
Query: 228 KRKLHKKLNRLLLASDKNDTNSSDNSSK 255
KRKL + LL +D ++ + +K
Sbjct: 697 KRKLKE-----LLHGTADDEKTATSPAK 719
>gi|453088797|gb|EMF16837.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
Length = 604
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 50/108 (46%), Gaps = 22/108 (20%)
Query: 101 IAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAA----YINHDCRPNCRF----- 151
I E+T E + + G D +V Q+W G + ++NH C PN +F
Sbjct: 496 IGEITSGVENKDVMIGQTDRAVY--------QIWQGKSGNYTRFVNHSCAPNSQFERFVW 547
Query: 152 VATERDTACVKVLRDIEQGEEITCFYGEDFFG--DKNCLCECDTCEIR 197
+ +R V V + +E GEEIT Y + ++ +K CLCE C R
Sbjct: 548 LGKQR---IVLVSKGVEAGEEITVDYSDTYWKNLNKKCLCESSKCRYR 592
>gi|427785301|gb|JAA58102.1| Putative histone-lysine n-methyltransferase mll3 [Rhipicephalus
pulchellus]
Length = 2867
Score = 47.8 bits (112), Expect = 0.006, Method: Composition-based stats.
Identities = 45/157 (28%), Positives = 68/157 (43%), Gaps = 14/157 (8%)
Query: 51 TQHHRYSYRIQYLPELKPCYRYSL----EGNRGARIVATRKWEKNDNIAYLVGCIAELTK 106
T+H +S QY +K +R ++ G +G + A R E + I +G +
Sbjct: 2707 TKHFVHSKSSQY-RRMKTEWRNNVYLARSGIQGLGLYAARDIECHTMIIEYIGQLIRNEI 2765
Query: 107 EEEKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYINHDCRPNC--RFVATERDTACV 161
E ++ N M+ +N A L G A YINH C PNC V +R+ +
Sbjct: 2766 AERNEAIYEAQNRGVYMFRLDENRVIDATLSGGLARYINHSCSPNCVAELVQIDRENKIL 2825
Query: 162 KVL-RDIEQGEEITCFYGEDFFGDKN---CLCECDTC 194
+ R I +GEE+T Y D+ D + CLC C
Sbjct: 2826 IIANRRITRGEELTYDYKFDYEDDGHKIPCLCSASNC 2862
>gi|427785297|gb|JAA58100.1| Putative histone-lysine n-methyltransferase mll3 [Rhipicephalus
pulchellus]
Length = 3936
Score = 47.8 bits (112), Expect = 0.006, Method: Composition-based stats.
Identities = 45/157 (28%), Positives = 68/157 (43%), Gaps = 14/157 (8%)
Query: 51 TQHHRYSYRIQYLPELKPCYRYSL----EGNRGARIVATRKWEKNDNIAYLVGCIAELTK 106
T+H +S QY +K +R ++ G +G + A R E + I +G +
Sbjct: 3776 TKHFVHSKSSQY-RRMKTEWRNNVYLARSGIQGLGLYAARDIECHTMIIEYIGQLIRNEI 3834
Query: 107 EEEKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYINHDCRPNC--RFVATERDTACV 161
E ++ N M+ +N A L G A YINH C PNC V +R+ +
Sbjct: 3835 AERNEAIYEAQNRGVYMFRLDENRVIDATLSGGLARYINHSCSPNCVAELVQIDRENKIL 3894
Query: 162 KVL-RDIEQGEEITCFYGEDFFGDKN---CLCECDTC 194
+ R I +GEE+T Y D+ D + CLC C
Sbjct: 3895 IIANRRITRGEELTYDYKFDYEDDGHKIPCLCSASNC 3931
>gi|431895735|gb|ELK05154.1| Histone-lysine N-methyltransferase MLL3 [Pteropus alecto]
Length = 4032
Score = 47.8 bits (112), Expect = 0.006, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 3902 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDSDHVIDATLT 3961
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 3962 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4021
Query: 191 CDTCEIR 197
C R
Sbjct: 4022 CGAVNCR 4028
>gi|440895698|gb|ELR47828.1| Histone-lysine N-methyltransferase MLL3, partial [Bos grunniens
mutus]
Length = 4905
Score = 47.8 bits (112), Expect = 0.006, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4775 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDSDHVIDATLT 4834
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4835 GGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKFDFEDDQHKIPCH 4894
Query: 191 CDTCEIR 197
C R
Sbjct: 4895 CGAVNCR 4901
>gi|351695440|gb|EHA98358.1| Histone-lysine N-methyltransferase MLL3 [Heterocephalus glaber]
Length = 4724
Score = 47.4 bits (111), Expect = 0.006, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4594 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4653
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4654 GGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKFDFEDDQHKIPCH 4713
Query: 191 CDTCEIR 197
C R
Sbjct: 4714 CGAVNCR 4720
>gi|449492124|ref|XP_002187267.2| PREDICTED: histone-lysine N-methyltransferase MLL3 [Taeniopygia
guttata]
Length = 4871
Score = 47.4 bits (111), Expect = 0.007, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4741 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRIDNDHVIDATLT 4800
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4801 GGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKFDFEDDQHKIPCH 4860
Query: 191 CDTCEIR 197
C R
Sbjct: 4861 CGAVNCR 4867
>gi|363729903|ref|XP_418542.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Gallus gallus]
Length = 4906
Score = 47.4 bits (111), Expect = 0.007, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4776 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRIDNDHVIDATLT 4835
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4836 GGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKFDFEDDQHKIPCH 4895
Query: 191 CDTCEIR 197
C R
Sbjct: 4896 CGAVNCR 4902
>gi|10568112|gb|AAF74766.2| ALR-like protein [Homo sapiens]
Length = 4025
Score = 47.4 bits (111), Expect = 0.007, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 3895 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 3954
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 3955 GGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKFDFEDDQHKIPCH 4014
Query: 191 CDTCEIR 197
C R
Sbjct: 4015 CGAVNCR 4021
>gi|443696187|gb|ELT96958.1| hypothetical protein CAPTEDRAFT_153177 [Capitella teleta]
Length = 844
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 58/126 (46%), Gaps = 9/126 (7%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A+R E++ + +G + E + ++ N M+ N A +
Sbjct: 714 QGLGLFASRDLERHTMVIEYIGELIRSEVAEARERVYDSQNRGVYMFRIDDNTVVDATMT 773
Query: 135 LGPAAYINHDCRPNC--RFVATERDT-ACVKVLRDIEQGEEITCFYGEDFFGDKN---CL 188
GPA YINH C PNC V E+D+ + R I +GEE+T Y DF D++ CL
Sbjct: 774 GGPARYINHSCAPNCVAEVVPFEKDSKIIIIARRRIARGEELTYDYKFDFEDDQHKIPCL 833
Query: 189 CECDTC 194
C C
Sbjct: 834 CGAPNC 839
>gi|402865476|ref|XP_003919660.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3, partial [Papio anubis]
Length = 3229
Score = 47.4 bits (111), Expect = 0.007, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 3099 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 3158
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 3159 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 3218
Query: 191 CDTCEIR 197
C R
Sbjct: 3219 CGAVNCR 3225
>gi|361125013|gb|EHK97075.1| putative Uncharacterized mitochondrial carrier [Glarea lozoyensis
74030]
Length = 990
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 52/204 (25%), Positives = 90/204 (44%), Gaps = 25/204 (12%)
Query: 70 YRYSLE----GNRGARIVATRKWEKNDNIAYLVGCIAELTKEE-EKRLLHTGLND--FSV 122
YR +E +RG + + R +E N I G I +T+EE ++R+ H N+ + +
Sbjct: 383 YRVGVEVLKTADRGYGVRSNRCFEANQIIVEYTGEI--ITEEECDRRMNHEYKNNECYYL 440
Query: 123 MYSCRKNCAQLWLGPAA-YINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFY 177
M + G A ++NH C+PNCR VA + A I G+E+T Y
Sbjct: 441 MSFDQNMILDGTKGSIARFVNHSCKPNCRMVKWVVAGKPRMALFAGDNPIMTGDELTYDY 500
Query: 178 GEDFFGDKN---CLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLHKK 234
D F KN C C D C +G +P+ +P + ++D ++ K+ +K
Sbjct: 501 NFDPFSAKNVQTCRCGSDNC----RGVLGP---KPKDQKPVKEAIKDA-VKATKKYGKRK 552
Query: 235 LNRLLLASDKNDTNSSDNSSKYSL 258
+ D +D++ + + K +
Sbjct: 553 FKEVFGVGDTDDSDLAPSPKKRKI 576
>gi|149031399|gb|EDL86389.1| rCG56742 [Rattus norvegicus]
Length = 4499
Score = 47.4 bits (111), Expect = 0.008, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4369 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4428
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4429 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4488
Query: 191 CDTCEIR 197
C R
Sbjct: 4489 CGAVNCR 4495
>gi|320592445|gb|EFX04875.1| histone-lysine n-methyltransferase [Grosmannia clavigera kw1407]
Length = 1246
Score = 47.4 bits (111), Expect = 0.008, Method: Composition-based stats.
Identities = 38/129 (29%), Positives = 60/129 (46%), Gaps = 13/129 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLG 136
+RG I A+R ++ + I +G I +T+EE R ++ + + Y + + + G
Sbjct: 625 DRGHGIRASRSFKPSQIIMEYIGEI--ITEEESDRRMNELYKNNACYYLMSFDQSLIIDG 682
Query: 137 P----AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN-- 186
A ++NH C PNCR V+ + A R I GEE+T Y D + KN
Sbjct: 683 TSGSIARFVNHSCSPNCRMIKWIVSGQPRIALFAGDRPIMTGEELTYDYNFDPYSSKNVQ 742
Query: 187 -CLCECDTC 194
CLC + C
Sbjct: 743 TCLCGSENC 751
>gi|345324243|ref|XP_003430797.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Ornithorhynchus anatinus]
Length = 4910
Score = 47.0 bits (110), Expect = 0.008, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4780 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRIDNDHVIDATLT 4839
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4840 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4899
Query: 191 CDTCEIR 197
C R
Sbjct: 4900 CGAVNCR 4906
>gi|456354242|dbj|BAM88687.1| putative histone-lysine N-methyltransferase with a SET domain
[Agromonas oligotrophica S58]
Length = 213
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 44/153 (28%), Positives = 69/153 (45%), Gaps = 16/153 (10%)
Query: 63 LPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEE----EKRLLHTGLN 118
+P KP YR G + AT+ +K I G + + K E E + L
Sbjct: 4 IPPTKP-YRIG-RSKTGLGLFATKPIKKGTKIVRYFGPLLDSKKAEDDAIENKYLFELDK 61
Query: 119 DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCFY 177
+++ S R+N A+ YINH C+PN V + ++ +++IE GEEI Y
Sbjct: 62 RWTIDGSVRENVAR-------YINHSCKPNAESDVKPRKKKIIIRAIKNIEPGEEINYDY 114
Query: 178 GEDFFGD--KNCLCECDTCEIRMKGAFAQSSVE 208
G D+F K C+C+ CE + A++ E
Sbjct: 115 GTDYFKAYLKPIGCKCEACEKKRVKKKAEARAE 147
>gi|301614673|ref|XP_002936809.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Xenopus
(Silurana) tropicalis]
Length = 1298
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 59/125 (47%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G ++ATR +K + + +G EL EEE R+ H ND + Y + ++
Sbjct: 1016 GKGWGLIATRDIKKGEFVNEYIG---ELIDEEECMYRIRHAQENDITHFYMLTIDKDRII 1072
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + +RDI GEE+T Y D G++
Sbjct: 1073 DAGPKGNFSRFMNHSCQPNCETQKWSVNGDTRVGLFAVRDIPAGEELTFNYNLDCLGNEK 1132
Query: 187 CLCEC 191
+C C
Sbjct: 1133 TICRC 1137
>gi|403276503|ref|XP_003929937.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Saimiri
boliviensis boliviensis]
Length = 4029
Score = 47.0 bits (110), Expect = 0.009, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 3899 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 3958
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 3959 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4018
Query: 191 CDTCEIR 197
C R
Sbjct: 4019 CGAVNCR 4025
>gi|332870121|ref|XP_519508.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Pan troglodytes]
Length = 4026
Score = 47.0 bits (110), Expect = 0.009, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 3896 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 3955
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 3956 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4015
Query: 191 CDTCEIR 197
C R
Sbjct: 4016 CGAVNCR 4022
>gi|21427632|gb|AAK00583.1| MLL3 [Homo sapiens]
Length = 4911
Score = 47.0 bits (110), Expect = 0.009, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4781 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4840
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4841 GGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKFDFEDDQHKIPCH 4900
Query: 191 CDTCEIR 197
C R
Sbjct: 4901 CGAVNCR 4907
>gi|91718902|ref|NP_733751.2| histone-lysine N-methyltransferase MLL3 [Homo sapiens]
gi|221222521|sp|Q8NEZ4.3|MLL3_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL3; AltName:
Full=Homologous to ALR protein; AltName: Full=Lysine
N-methyltransferase 2C; Short=KMT2C; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 3
Length = 4911
Score = 47.0 bits (110), Expect = 0.009, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4781 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4840
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4841 GGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKFDFEDDQHKIPCH 4900
Query: 191 CDTCEIR 197
C R
Sbjct: 4901 CGAVNCR 4907
>gi|148671129|gb|EDL03076.1| mCG113864 [Mus musculus]
Length = 4532
Score = 47.0 bits (110), Expect = 0.009, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4402 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4461
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4462 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4521
Query: 191 CDTCEIR 197
C R
Sbjct: 4522 CGAVNCR 4528
>gi|119574356|gb|EAW53971.1| hCG1990594, isoform CRA_a [Homo sapiens]
Length = 4911
Score = 47.0 bits (110), Expect = 0.009, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4781 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4840
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4841 GGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKFDFEDDQHKIPCH 4900
Query: 191 CDTCEIR 197
C R
Sbjct: 4901 CGAVNCR 4907
>gi|355561196|gb|EHH17882.1| hypothetical protein EGK_14365, partial [Macaca mulatta]
Length = 4575
Score = 47.0 bits (110), Expect = 0.009, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4445 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4504
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4505 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4564
Query: 191 CDTCEIR 197
C R
Sbjct: 4565 CGAVNCR 4571
>gi|410953278|ref|XP_003983299.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Felis catus]
Length = 4884
Score = 47.0 bits (110), Expect = 0.010, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4754 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDSDHVIDATLT 4813
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4814 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4873
Query: 191 CDTCEIR 197
C R
Sbjct: 4874 CGAVNCR 4880
>gi|395838450|ref|XP_003792128.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Otolemur
garnettii]
Length = 4945
Score = 47.0 bits (110), Expect = 0.010, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4815 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4874
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4875 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4934
Query: 191 CDTCEIR 197
C R
Sbjct: 4935 CGAVNCR 4941
>gi|367021898|ref|XP_003660234.1| hypothetical protein MYCTH_2298281 [Myceliophthora thermophila ATCC
42464]
gi|347007501|gb|AEO54989.1| hypothetical protein MYCTH_2298281 [Myceliophthora thermophila ATCC
42464]
Length = 942
Score = 47.0 bits (110), Expect = 0.010, Method: Composition-based stats.
Identities = 30/86 (34%), Positives = 39/86 (45%), Gaps = 11/86 (12%)
Query: 138 AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN---CLCE 190
A ++NH C PNCR VA + A R I GEE+T Y D F KN CLC
Sbjct: 582 ARFVNHSCSPNCRMIKWIVAGQPRMALFAGDRPIMTGEELTYDYNFDPFSAKNVQKCLCG 641
Query: 191 CDTCEIRMKGAFAQSSVEPQTTQPSR 216
C +G E +T +P++
Sbjct: 642 SPNC----RGVLGPKPKEVKTPKPAK 663
>gi|359321427|ref|XP_003639590.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3 [Canis lupus familiaris]
Length = 4874
Score = 47.0 bits (110), Expect = 0.010, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4744 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4803
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4804 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4863
Query: 191 CDTCEIR 197
C R
Sbjct: 4864 CGAVNCR 4870
>gi|301759361|ref|XP_002915551.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Ailuropoda melanoleuca]
Length = 4927
Score = 47.0 bits (110), Expect = 0.010, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4797 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4856
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4857 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4916
Query: 191 CDTCEIR 197
C R
Sbjct: 4917 CGAVNCR 4923
>gi|47228511|emb|CAG05331.1| unnamed protein product [Tetraodon nigroviridis]
Length = 3691
Score = 47.0 bits (110), Expect = 0.010, Method: Composition-based stats.
Identities = 43/158 (27%), Positives = 64/158 (40%), Gaps = 10/158 (6%)
Query: 50 GTQHHRYSYRIQYL---PELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTK 106
G R+S QY E K + +G + A R EK + +G I
Sbjct: 3530 GAHQGRHSKSSQYRRMKAEWKSNVYLARSRIQGLGLYAARDIEKCTMVIEYIGTIIRSEV 3589
Query: 107 EEEKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYINHDCRPNC--RFVATERDTA-C 160
K L+ N M+ + A + GPA YINH C PNC V+ E++
Sbjct: 3590 ANRKERLYESQNRGVYMFRIDNDYVIDATITGGPARYINHSCSPNCITEVVSVEKENKII 3649
Query: 161 VKVLRDIEQGEEITCFYGEDFFGDKNCL-CECDTCEIR 197
+ R I++GEE++ Y D D++ + C C R
Sbjct: 3650 ISSCRRIQRGEELSYDYKFDLEDDQHKIPCHCGAVNCR 3687
>gi|392347077|ref|XP_003749721.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Rattus
norvegicus]
Length = 4930
Score = 46.6 bits (109), Expect = 0.011, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4800 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4859
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4860 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4919
Query: 191 CDTCEIR 197
C R
Sbjct: 4920 CGAVNCR 4926
>gi|392339743|ref|XP_003753895.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Rattus
norvegicus]
Length = 4931
Score = 46.6 bits (109), Expect = 0.011, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4801 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4860
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4861 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4920
Query: 191 CDTCEIR 197
C R
Sbjct: 4921 CGAVNCR 4927
>gi|281339843|gb|EFB15427.1| hypothetical protein PANDA_003530 [Ailuropoda melanoleuca]
Length = 4780
Score = 46.6 bits (109), Expect = 0.011, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4650 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4709
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4710 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4769
Query: 191 CDTCEIR 197
C R
Sbjct: 4770 CGAVNCR 4776
>gi|240280400|gb|EER43904.1| SET domain-containing protein [Ajellomyces capsulatus H143]
Length = 392
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 60/136 (44%), Gaps = 8/136 (5%)
Query: 60 IQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAE-LTKEEEKRLLHTGLN 118
IQ E P +RG + + R + I VG I E ++ LH L
Sbjct: 215 IQLADETNPLLELHEYPDRGVGVRSLRSIKAGTYIGQYVGEIREPAARKGTTYALHHTLR 274
Query: 119 DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFV-ATERDTACV--KVLRDIEQGEEITC 175
+ S+ A ++ Y+NH CR + FV A ++ACV + +RD E EEIT
Sbjct: 275 NRSIGVID----AAIYGNWTRYMNHSCRASVIFVSAVVGNSACVLVRAIRDSEIFEEITV 330
Query: 176 FYGEDFFGDKNCLCEC 191
YG +F + +C+C
Sbjct: 331 DYGAAYFAPRGRVCKC 346
>gi|5630077|gb|AAD45822.1|AC006017_2 similar to ALR; similar to AAC51735 (PID:g2358287) [Homo sapiens]
Length = 1813
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 1683 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 1742
Query: 135 LGPAAYINHDCRPNC--RFVATERD-TACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 1743 GGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKFDFEDDQHKIPCH 1802
Query: 191 CDTCEIR 197
C R
Sbjct: 1803 CGAVNCR 1809
>gi|21739477|emb|CAD38780.1| hypothetical protein [Homo sapiens]
Length = 1033
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 903 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 962
Query: 135 LGPAAYINHDCRPNC--RFVATERD-TACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 963 GGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKFDFEDDQHKIPCH 1022
Query: 191 CDTCEIR 197
C R
Sbjct: 1023 CGAVNCR 1029
>gi|354478318|ref|XP_003501362.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Cricetulus
griseus]
Length = 4871
Score = 46.6 bits (109), Expect = 0.011, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4741 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4800
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4801 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4860
Query: 191 CDTCEIR 197
C R
Sbjct: 4861 CGAVNCR 4867
>gi|397469943|ref|XP_003806597.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3, partial [Pan paniscus]
Length = 4810
Score = 46.6 bits (109), Expect = 0.011, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4680 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4739
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4740 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4799
Query: 191 CDTCEIR 197
C R
Sbjct: 4800 CGAVNCR 4806
>gi|355748156|gb|EHH52653.1| hypothetical protein EGM_13123, partial [Macaca fascicularis]
Length = 4916
Score = 46.6 bits (109), Expect = 0.011, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4786 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4845
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4846 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4905
Query: 191 CDTCEIR 197
C R
Sbjct: 4906 CGAVNCR 4912
>gi|338724475|ref|XP_001495649.3| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Equus caballus]
Length = 4910
Score = 46.6 bits (109), Expect = 0.011, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4780 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4839
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4840 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4899
Query: 191 CDTCEIR 197
C R
Sbjct: 4900 CGAVNCR 4906
>gi|325096532|gb|EGC49842.1| SET domain-containing protein [Ajellomyces capsulatus H88]
Length = 392
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 60/136 (44%), Gaps = 8/136 (5%)
Query: 60 IQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAE-LTKEEEKRLLHTGLN 118
IQ E P +RG + + R + I VG I E ++ LH L
Sbjct: 215 IQLADETNPLLELHEYPDRGVGVRSLRSIKAGTYIGQYVGEIREPAARKGTTYALHHTLR 274
Query: 119 DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFV-ATERDTACV--KVLRDIEQGEEITC 175
+ S+ A ++ Y+NH CR + FV A ++ACV + +RD E EEIT
Sbjct: 275 NRSIGVID----AAIYGNWTRYMNHSCRASVIFVSAVVGNSACVLVRAIRDSEIFEEITV 330
Query: 176 FYGEDFFGDKNCLCEC 191
YG +F + +C+C
Sbjct: 331 DYGAAYFAPRGRVCKC 346
>gi|342887354|gb|EGU86868.1| hypothetical protein FOXB_02614 [Fusarium oxysporum Fo5176]
Length = 405
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 37/135 (27%), Positives = 59/135 (43%), Gaps = 7/135 (5%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCA--QLW 134
N G + AT +E + + LVG +A + + ND S C +
Sbjct: 268 NMGEGVWATSDYEMGEFLGELVGELAPIDYHTDGWAADLSRNDLSPKGDDVAWCQVYTRY 327
Query: 135 LGP-AAYINHDCRPNCRFVA---TERDTACVKVLRDIEQGEEITCFYGEDFFGD-KNCLC 189
+G +NH C PNC FV+ + R ++ R I++ I YG+D++ D + CLC
Sbjct: 328 MGNWVRKVNHSCEPNCAFVSFNISGRWRLMLQASRQIKRNSWILADYGQDYWTDGRRCLC 387
Query: 190 ECDTCEIRMKGAFAQ 204
D C R+ A+
Sbjct: 388 GSDQCISRLGADVAR 402
>gi|332243363|ref|XP_003270849.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Nomascus
leucogenys]
Length = 4856
Score = 46.6 bits (109), Expect = 0.012, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4726 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4785
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4786 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4845
Query: 191 CDTCEIR 197
C R
Sbjct: 4846 CGAVNCR 4852
>gi|291397406|ref|XP_002715125.1| PREDICTED: myeloid/lymphoid or mixed-lineage leukemia 3-like
[Oryctolagus cuniculus]
Length = 4865
Score = 46.6 bits (109), Expect = 0.012, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4735 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4794
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4795 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4854
Query: 191 CDTCEIR 197
C R
Sbjct: 4855 CGAVNCR 4861
>gi|296210171|ref|XP_002751860.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Callithrix
jacchus]
Length = 4909
Score = 46.6 bits (109), Expect = 0.012, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4779 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4838
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4839 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4898
Query: 191 CDTCEIR 197
C R
Sbjct: 4899 CGAVNCR 4905
>gi|37999865|sp|Q8BRH4.2|MLL3_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL3; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 3
homolog
Length = 4903
Score = 46.6 bits (109), Expect = 0.012, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4773 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4832
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4833 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4892
Query: 191 CDTCEIR 197
C R
Sbjct: 4893 CGAVNCR 4899
>gi|320593393|gb|EFX05802.1| set domain containing protein [Grosmannia clavigera kw1407]
Length = 573
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 35/78 (44%), Gaps = 11/78 (14%)
Query: 139 AYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCFYGEDFFGDK----------NC 187
A INHDCRPN + E T V RDIE+GEE+T Y D +
Sbjct: 350 AKINHDCRPNLSADIFGETMTMVVWANRDIEEGEELTISYLNDLLPSDRRSKVIKRRWDF 409
Query: 188 LCECDTCEIRMKGAFAQS 205
C+CD C + GA S
Sbjct: 410 QCQCDLCTGKADGALEAS 427
>gi|124487063|ref|NP_001074852.1| histone-lysine N-methyltransferase MLL3 [Mus musculus]
Length = 4904
Score = 46.6 bits (109), Expect = 0.012, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4774 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4833
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4834 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4893
Query: 191 CDTCEIR 197
C R
Sbjct: 4894 CGAVNCR 4900
>gi|426358564|ref|XP_004046577.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Gorilla gorilla
gorilla]
Length = 4782
Score = 46.6 bits (109), Expect = 0.012, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4652 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4711
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4712 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4771
Query: 191 CDTCEIR 197
C R
Sbjct: 4772 CGAVNCR 4778
>gi|410336273|gb|JAA37083.1| myeloid/lymphoid or mixed-lineage leukemia 3 [Pan troglodytes]
Length = 4912
Score = 46.6 bits (109), Expect = 0.012, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4782 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4841
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4842 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4901
Query: 191 CDTCEIR 197
C R
Sbjct: 4902 CGAVNCR 4908
>gi|297289715|ref|XP_001107669.2| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Macaca
mulatta]
Length = 4785
Score = 46.6 bits (109), Expect = 0.012, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4655 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4714
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4715 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4774
Query: 191 CDTCEIR 197
C R
Sbjct: 4775 CGAVNCR 4781
>gi|344276554|ref|XP_003410073.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Loxodonta africana]
Length = 4785
Score = 46.6 bits (109), Expect = 0.012, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4655 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4714
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4715 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4774
Query: 191 CDTCEIR 197
C R
Sbjct: 4775 CGAVNCR 4781
>gi|116193789|ref|XP_001222707.1| hypothetical protein CHGG_06612 [Chaetomium globosum CBS 148.51]
gi|88182525|gb|EAQ89993.1| hypothetical protein CHGG_06612 [Chaetomium globosum CBS 148.51]
Length = 907
Score = 46.6 bits (109), Expect = 0.013, Method: Composition-based stats.
Identities = 44/140 (31%), Positives = 61/140 (43%), Gaps = 17/140 (12%)
Query: 70 YRYSLE----GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLND---FSV 122
YR +E G+RG + + R +E N I G I +T+ E +R ++ D + +
Sbjct: 500 YRVGVEVVKTGDRGYGVRSNRCFEANQIIMEYTGEI--ITEAECERRMNEEYKDNECYYL 557
Query: 123 MYSCRKNCAQLWLGPAA-YINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFY 177
M + G A ++NH C PNCR VA + A R I GEE+T Y
Sbjct: 558 MSFDQNMIIDATTGSIARFVNHSCSPNCRMIKWIVAGQPRMALFAGDRPIMTGEELTYDY 617
Query: 178 GEDFFGDKN---CLCECDTC 194
D F KN CLC C
Sbjct: 618 NFDPFSAKNVQKCLCGSPNC 637
>gi|302415258|ref|XP_003005461.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
gi|261356530|gb|EEY18958.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
Length = 402
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 46/161 (28%), Positives = 69/161 (42%), Gaps = 36/161 (22%)
Query: 100 CIAELTKEEEKRLLH----TGLNDFSVMYSC----------RKNCAQLWLGP-AAYINHD 144
+++L K+ +KRL++ G + + MY N A + L P + +NH
Sbjct: 159 ALSQLPKDTQKRLVNLARSDGADPLTPMYDIVTTNTCGVFLGDNEAHIGLFPEVSRLNHA 218
Query: 145 CRPNCRFVATERD-TACVKVLRDIEQGEEITCFYG----------EDFFGDKNCLCECDT 193
C+PN F ++R T V RDI GEEIT Y + F + C C
Sbjct: 219 CKPNAFFRFSQRTLTMQVIAYRDIRAGEEITINYAPLGMPHKVRKKYLFDNYGFHCRCSL 278
Query: 194 CEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLHKK 234
C QSS E + +R +R ++R N RK H+K
Sbjct: 279 C---------QSSPEDRAISDARR-MRIVEVRGNMRKAHEK 309
>gi|389695592|ref|ZP_10183234.1| SET domain-containing protein [Microvirga sp. WSM3557]
gi|388584398|gb|EIM24693.1| SET domain-containing protein [Microvirga sp. WSM3557]
Length = 182
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 57/123 (46%), Gaps = 17/123 (13%)
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGL-----NDFSVMYSCRKNCAQL 133
G + AT EK I VG ++T EE +R +T + +++ S R N A+
Sbjct: 60 GLGLFATDVIEKGTFIIEYVG--PKITNEEVQRRRNTRYLFEISDRWTIDGSPRWNTAR- 116
Query: 134 WLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD--KNCLCEC 191
YINH CRPN R +K ++ I+ G+EIT YG+ +F K C C
Sbjct: 117 ------YINHGCRPNAE-ATVSRGRIRIKAIKRIKPGDEITYNYGKSYFETFIKPMGCRC 169
Query: 192 DTC 194
+C
Sbjct: 170 KSC 172
>gi|343960312|dbj|BAK64010.1| myeloid/lymphoid or mixed-lineage leukemia protein 3 homolog [Pan
troglodytes]
Length = 411
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 281 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 340
Query: 135 LGPAAYINHDCRPNC--RFVATERD-TACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 341 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 400
Query: 191 CDTCEIR 197
C R
Sbjct: 401 CGAVNCR 407
>gi|37360418|dbj|BAC98187.1| mKIAA1506 protein [Mus musculus]
Length = 1520
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 1390 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 1449
Query: 135 LGPAAYINHDCRPNC--RFVATERD-TACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 1450 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 1509
Query: 191 CDTCEIR 197
C R
Sbjct: 1510 CGAVNCR 1516
>gi|346320473|gb|EGX90073.1| histone-lysine N-methyltransferase (Ash1), putative [Cordyceps
militaris CM01]
Length = 962
Score = 46.2 bits (108), Expect = 0.016, Method: Composition-based stats.
Identities = 35/118 (29%), Positives = 54/118 (45%), Gaps = 17/118 (14%)
Query: 138 AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN---CLCE 190
A ++NH C PNCR V+ + A I G+E+T Y D F KN CLC
Sbjct: 671 ARFVNHSCNPNCRMIKWIVSGQPRMALFAGDSPIMTGDELTYDYNFDPFSAKNVQRCLCG 730
Query: 191 CDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDT---DMRLNKRKLHKKLNRLLLASDKN 245
D C +G P+ +P + L++ ++ KRKL + L + ++DK+
Sbjct: 731 ADNC----RGFLGP---RPREVKPPKTDLKNAVKGTIKAGKRKLKEMLGEEVSSNDKS 781
>gi|74218832|dbj|BAE37820.1| unnamed protein product [Mus musculus]
Length = 342
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 212 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 271
Query: 135 LGPAAYINHDCRPNC--RFVATERD-TACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 272 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 331
Query: 191 CDTCEIR 197
C R
Sbjct: 332 CGAVNCR 338
>gi|74193661|dbj|BAE22782.1| unnamed protein product [Mus musculus]
Length = 865
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 735 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 794
Query: 135 LGPAAYINHDCRPNC--RFVATERD-TACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 795 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 854
Query: 191 CDTCEIR 197
C R
Sbjct: 855 CGAVNCR 861
>gi|70998122|ref|XP_753791.1| SET domain protein [Aspergillus fumigatus Af293]
gi|66851427|gb|EAL91753.1| SET domain protein [Aspergillus fumigatus Af293]
Length = 440
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 33/58 (56%), Gaps = 3/58 (5%)
Query: 140 YINHDCRPNCRFV---ATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTC 194
Y+NH C+P+ F+ +R V+ +RDI+ EEIT YG ++ ++ CLC C
Sbjct: 371 YLNHSCKPSTTFLKMTVGKRTIMAVQAVRDIDIFEEITVDYGTGYWKNRKCLCGEPGC 428
>gi|348568065|ref|XP_003469819.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Cavia
porcellus]
Length = 4878
Score = 45.8 bits (107), Expect = 0.021, Method: Composition-based stats.
Identities = 39/127 (30%), Positives = 53/127 (41%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4748 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4807
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I+ GEE+ Y DF D++ + C
Sbjct: 4808 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQTGEELCYDYKFDFEDDQHKIPCH 4867
Query: 191 CDTCEIR 197
C R
Sbjct: 4868 CGAVNCR 4874
>gi|407010256|gb|EKE25201.1| hypothetical protein ACD_5C00253G0004 [uncultured bacterium]
Length = 138
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 25/56 (44%), Positives = 31/56 (55%), Gaps = 5/56 (8%)
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFF----GDKNCLC 189
A YINH CRPNC V R + L+DI +GEEI YG+++F CLC
Sbjct: 74 ARYINHCCRPNCE-VIINRKMILIYALKDIMKGEEINYDYGKEYFLTYIKPHGCLC 128
>gi|327354366|gb|EGE83223.1| SET domain-containing protein [Ajellomyces dermatitidis ATCC 18188]
Length = 464
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 36/69 (52%), Gaps = 3/69 (4%)
Query: 130 CAQLWLGPAAYINHDCRPNCRF---VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
C + + A ++NH C P+ F ++ + LRDIE EEIT YG+ ++ +K
Sbjct: 343 CPKWYGNWARFVNHSCEPSVEFRSRTIGKQIYMMMVALRDIEAFEEITVHYGKRYWREKQ 402
Query: 187 CLCECDTCE 195
CLC C+
Sbjct: 403 CLCRSRACK 411
>gi|183221524|ref|YP_001839520.1| SET domain-containing protein [Leptospira biflexa serovar Patoc
strain 'Patoc 1 (Paris)']
gi|189911607|ref|YP_001963162.1| lysine methyltransferase [Leptospira biflexa serovar Patoc strain
'Patoc 1 (Ames)']
gi|167776283|gb|ABZ94584.1| Lysine methyltransferase [Leptospira biflexa serovar Patoc strain
'Patoc 1 (Ames)']
gi|167779946|gb|ABZ98244.1| Putative SET domain-containing protein [Leptospira biflexa serovar
Patoc strain 'Patoc 1 (Paris)']
Length = 145
Score = 45.4 bits (106), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 52/112 (46%), Gaps = 12/112 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCI-----AELTKEEEKRLLHTGLNDFSVMYSCRKNCA 131
N G + + K D + Y +G I AE K + + L D+ + R++
Sbjct: 32 NIGMGLFTKQTLYKGDTVGYYMGKIITDDQAESNKYVDSKYLLWICKDWWIYGEGRESNY 91
Query: 132 QLWLGPAAYINHDCRPNCRFVATER-DTACVKVLRDIEQGEEITCFYGEDFF 182
YINH +PN + + R TA KVL+ I++GEEI YG+D++
Sbjct: 92 ------TRYINHSSKPNAELITSVRWKTARFKVLKTIKEGEEIFFDYGKDYW 137
>gi|301605820|ref|XP_002932540.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Xenopus
(Silurana) tropicalis]
Length = 5215
Score = 45.4 bits (106), Expect = 0.023, Method: Composition-based stats.
Identities = 38/127 (29%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ A L
Sbjct: 5085 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRIDNEHVIDATLT 5144
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V E+ + R I++GEE++ Y DF D++ + C
Sbjct: 5145 GGPARYINHSCAPNCVAEVVTFEKGHRIIISSNRRIQKGEELSYDYKFDFEDDQHKIPCH 5204
Query: 191 CDTCEIR 197
C R
Sbjct: 5205 CGAVNCR 5211
>gi|358369137|dbj|GAA85752.1| histone-lysine N-methyltransferase [Aspergillus kawachii IFO 4308]
Length = 852
Score = 45.4 bits (106), Expect = 0.027, Method: Composition-based stats.
Identities = 55/198 (27%), Positives = 80/198 (40%), Gaps = 26/198 (13%)
Query: 76 GNRGARIVATRKWEKNDNIAYLVGCIAELTK-EEEKRLLHTGLNDFSVMYSCRKNCAQLW 134
+RG + + R +E N I G I T+ E+ R ++ + +MY +
Sbjct: 486 ADRGYGVRSNRTFEPNQIIVEYTGEIITQTECEKRMRTIYKHNECYYLMYFDQNMIIDAT 545
Query: 135 LGPAA-YINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN--- 186
G A ++NH C PNCR VA + A R I GEE+T Y D + KN
Sbjct: 546 RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDRGIMTGEELTYDYNFDPYSQKNVQQ 605
Query: 187 CLCECDTCE----IRMKGAFAQSSVEPQTTQPSRYCLRDTDMRL--NKRKLHKKLNRLLL 240
C C C R K AQ + E + +P + R T+ + KRK
Sbjct: 606 CRCGSSNCRGILGPRPKEK-AQRAKEQKVEKPKKSASRRTNGKTAGTKRK---------- 654
Query: 241 ASDKNDTNSSDNSSKYSL 258
+ D D +SS + K L
Sbjct: 655 SGDALDVSSSRANKKQKL 672
>gi|294056202|ref|YP_003549860.1| nuclear protein SET [Coraliomargarita akajimensis DSM 45221]
gi|293615535|gb|ADE55690.1| nuclear protein SET [Coraliomargarita akajimensis DSM 45221]
Length = 127
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 49/102 (48%), Gaps = 16/102 (15%)
Query: 89 EKNDNIAYLVGCI---AELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAA----YI 141
E+ D I Y G + AE +E+ N + V++ CR + G + YI
Sbjct: 33 EEGDTIGYYTGKVMDEAEFYREDRP------FNAY-VLWVCRTHIIDG-EGEGSNYTRYI 84
Query: 142 NHDCRPNCRFVATER-DTACVKVLRDIEQGEEITCFYGEDFF 182
NHD PN + + R TA + LR IE GEEI YGED++
Sbjct: 85 NHDDEPNAFLIVSSRWKTARFEALRTIEPGEEIFFDYGEDYW 126
>gi|126341226|ref|XP_001372106.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Monodelphis
domestica]
Length = 4862
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4732 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRIDNDHVIDATLT 4791
Query: 135 LGPAAYINHDCRPNC--RFVATERD-TACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4792 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4851
Query: 191 CDTCEIR 197
C R
Sbjct: 4852 CGAVNCR 4858
>gi|346978073|gb|EGY21525.1| hypothetical protein VDAG_10507 [Verticillium dahliae VdLs.17]
Length = 399
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 45/161 (27%), Positives = 69/161 (42%), Gaps = 36/161 (22%)
Query: 100 CIAELTKEEEKRLLH----------TGLNDFSVMYSCR----KNCAQLWLGP-AAYINHD 144
+++L ++ +KRL++ T + D +C N A + L P + +NH
Sbjct: 156 ALSQLPRDTQKRLVNLARSDGADPLTPVYDIVTTNTCGVFLGDNVAHIGLFPEVSRLNHA 215
Query: 145 CRPNCRFVATERD-TACVKVLRDIEQGEEITCFYG----------EDFFGDKNCLCECDT 193
C+PN F ++R T V RDI GEEIT Y + F + C C
Sbjct: 216 CKPNAFFRFSQRTLTMQVIAYRDIHAGEEITINYAPLGMPHKVRKKYLFDNYGFHCRCSL 275
Query: 194 CEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLHKK 234
C QSS E + +R +R ++R N RK H+K
Sbjct: 276 C---------QSSPEDRAISDARR-MRIVEVRGNMRKAHEK 306
>gi|390335528|ref|XP_003724176.1| PREDICTED: uncharacterized protein LOC591084 isoform 2
[Strongylocentrotus purpuratus]
Length = 4860
Score = 45.4 bits (106), Expect = 0.029, Method: Composition-based stats.
Identities = 27/85 (31%), Positives = 44/85 (51%), Gaps = 13/85 (15%)
Query: 117 LNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNC--RFVATERDTACVKVL--RDIEQGEE 172
++D++V+ + R PA YINH C PNC V ++D + ++ R + +GEE
Sbjct: 4777 IDDYTVVDATRSG------NPARYINHSCNPNCVAEVVNFDKDQKKIIIISSRRLLKGEE 4830
Query: 173 ITCFYGEDFFGDKN---CLCECDTC 194
+T Y + D+N CLC+ C
Sbjct: 4831 LTYDYKFEIENDQNKIPCLCKAPNC 4855
>gi|390335530|ref|XP_795757.3| PREDICTED: uncharacterized protein LOC591084 isoform 3
[Strongylocentrotus purpuratus]
Length = 4856
Score = 45.4 bits (106), Expect = 0.029, Method: Composition-based stats.
Identities = 27/85 (31%), Positives = 44/85 (51%), Gaps = 13/85 (15%)
Query: 117 LNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNC--RFVATERDTACVKVL--RDIEQGEE 172
++D++V+ + R PA YINH C PNC V ++D + ++ R + +GEE
Sbjct: 4773 IDDYTVVDATRSG------NPARYINHSCNPNCVAEVVNFDKDQKKIIIISSRRLLKGEE 4826
Query: 173 ITCFYGEDFFGDKN---CLCECDTC 194
+T Y + D+N CLC+ C
Sbjct: 4827 LTYDYKFEIENDQNKIPCLCKAPNC 4851
>gi|390335526|ref|XP_003724175.1| PREDICTED: uncharacterized protein LOC591084 isoform 1
[Strongylocentrotus purpuratus]
Length = 4873
Score = 45.4 bits (106), Expect = 0.029, Method: Composition-based stats.
Identities = 27/85 (31%), Positives = 44/85 (51%), Gaps = 13/85 (15%)
Query: 117 LNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNC--RFVATERDTACVKVL--RDIEQGEE 172
++D++V+ + R PA YINH C PNC V ++D + ++ R + +GEE
Sbjct: 4790 IDDYTVVDATRSG------NPARYINHSCNPNCVAEVVNFDKDQKKIIIISSRRLLKGEE 4843
Query: 173 ITCFYGEDFFGDKN---CLCECDTC 194
+T Y + D+N CLC+ C
Sbjct: 4844 LTYDYKFEIENDQNKIPCLCKAPNC 4868
>gi|242813250|ref|XP_002486129.1| histone-lysine N-methyltransferase (Ash1), putative [Talaromyces
stipitatus ATCC 10500]
gi|218714468|gb|EED13891.1| histone-lysine N-methyltransferase (Ash1), putative [Talaromyces
stipitatus ATCC 10500]
Length = 865
Score = 45.1 bits (105), Expect = 0.031, Method: Composition-based stats.
Identities = 50/190 (26%), Positives = 76/190 (40%), Gaps = 25/190 (13%)
Query: 76 GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLND---FSVMYSCRKNCAQ 132
+RG + + R +E N I G I +T+ E +R + T + + +MY +
Sbjct: 513 ADRGYGVRSNRTFEPNQVIVEYTGEI--ITQSECERRMRTVYKNNECYYLMYFDQNMIID 570
Query: 133 LWLGPAA-YINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN- 186
G A ++NH C PNCR V + A R I GEE+T Y D + KN
Sbjct: 571 ATRGSIARFVNHSCAPNCRMEKWTVGGKPRMALFAGDRGIMTGEELTYDYNFDPYSQKNV 630
Query: 187 --CLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLHKKLNRLLLASDK 244
C C TC +G +P +R++ K L KK L+ + +
Sbjct: 631 QQCRCGAPTC----RGVLG--------PRPKGREIRESKAEQKKVALQKKAKSTLVGTKR 678
Query: 245 NDTNSSDNSS 254
N D S+
Sbjct: 679 KLGNVLDEST 688
>gi|432926624|ref|XP_004080920.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Oryzias
latipes]
Length = 4455
Score = 45.1 bits (105), Expect = 0.031, Method: Composition-based stats.
Identities = 36/127 (28%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK + +G I K L+ N M+ + A +
Sbjct: 4325 QGLGLYAARDIEKCTMVIEYIGAIIRSEVANRKERLYESQNRGVYMFRIDNDYVIDATIT 4384
Query: 135 LGPAAYINHDCRPNC--RFVATERDTA-CVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V E++ + R I++GEE++ Y D D++ + C
Sbjct: 4385 GGPARYINHSCAPNCITEVVNVEKENKIIISSCRRIQRGEELSYDYKFDLEDDQHKIPCH 4444
Query: 191 CDTCEIR 197
C R
Sbjct: 4445 CGAVNCR 4451
>gi|171676169|ref|XP_001903038.1| hypothetical protein [Podospora anserina S mat+]
gi|170936150|emb|CAP60810.1| unnamed protein product [Podospora anserina S mat+]
Length = 894
Score = 45.1 bits (105), Expect = 0.034, Method: Composition-based stats.
Identities = 51/195 (26%), Positives = 81/195 (41%), Gaps = 21/195 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLND---FSVMYSCRKNCAQL 133
+RG + +TR +E N I G I +T+EE +R ++ D + +M +
Sbjct: 502 DRGYGVRSTRCFEPNQIIMEYTGEI--ITEEECERRMNEKYKDNECYYLMSFDQNMIIDA 559
Query: 134 WLGPAA-YINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN-- 186
G A ++NH C PNCR V+ + A + I G+E+T Y D F KN
Sbjct: 560 TTGSMARFVNHSCSPNCRMIKWIVSGQPRMALFAGDKPIMTGDELTYDYNFDPFSAKNVQ 619
Query: 187 -CLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLHKKLNRLLLASDKN 245
CLC C + + Q + R + + +KL LL+ ++
Sbjct: 620 KCLCGSANCRGVLGPKPKEVKPPKPAKQEKK------TARGSIKAGKRKLKELLVGAE-- 671
Query: 246 DTNSSDNSSKYSLVQ 260
D S N++K VQ
Sbjct: 672 DEASGGNAAKKRKVQ 686
>gi|258514094|ref|YP_003190316.1| nuclear protein SET [Desulfotomaculum acetoxidans DSM 771]
gi|257777799|gb|ACV61693.1| nuclear protein SET [Desulfotomaculum acetoxidans DSM 771]
Length = 132
Score = 45.1 bits (105), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 52/109 (47%), Gaps = 6/109 (5%)
Query: 76 GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKN---CAQ 132
G +G ++A +K+++ + I + + E + T LND+ +++ +N
Sbjct: 8 GKKGRGVLAGKKFKRGEIIEQ--APVIVIPGSELYSIKSTVLNDYYFIWAKDENNLKTGA 65
Query: 133 LWLGPAAYINHDCRPNCRFVATERDTAC-VKVLRDIEQGEEITCFYGED 180
+ LG + NH +PN F +D K L DIE+GEEIT Y D
Sbjct: 66 IALGCGSLYNHSYKPNATFNKKHKDLIIEFKALSDIEEGEEITINYNGD 114
>gi|348681507|gb|EGZ21323.1| hypothetical protein PHYSODRAFT_493478 [Phytophthora sojae]
Length = 171
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 34/65 (52%), Gaps = 5/65 (7%)
Query: 136 GPAAYINHDCRPNCRFVAT---ERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECD 192
G ++NH C+P +FV R T V +DI +GEE+T YG+D + C CE D
Sbjct: 101 GLMRFVNHSCQPVAKFVEVANGRRTTVVVTSTQDIHRGEEVTVDYGDDLW--FVCRCELD 158
Query: 193 TCEIR 197
C R
Sbjct: 159 GCRHR 163
>gi|395539758|ref|XP_003771833.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Sarcophilus
harrisii]
Length = 4951
Score = 45.1 bits (105), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4821 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRIDNDHVIDATLT 4880
Query: 135 LGPAAYINHDCRPNC--RFVATERD-TACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 4881 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 4940
Query: 191 CDTCEIR 197
C R
Sbjct: 4941 CGAVNCR 4947
>gi|14626492|gb|AAK70214.1| MLL3-like protein [Mus musculus]
Length = 677
Score = 45.1 bits (105), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 53/121 (43%), Gaps = 7/121 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 556 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 615
Query: 135 LGPAAYINHDCRPNC--RFVATERD-TACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 616 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 675
Query: 191 C 191
C
Sbjct: 676 C 676
>gi|213402529|ref|XP_002172037.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
yFS275]
gi|212000084|gb|EEB05744.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
yFS275]
Length = 977
Score = 45.1 bits (105), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 55/122 (45%), Gaps = 10/122 (8%)
Query: 82 IVATRKWEKNDNIAYLVGCIAE--LTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLWLG 136
+ A +KND + VG I + E++ + G+ D S ++ K+ A
Sbjct: 852 LFAMENIDKNDMVIEYVGEIVRQRVADTRERKYVREGIGD-SYLFRIDKDAIVDATKKGN 910
Query: 137 PAAYINHDCRPNC--RFVATE-RDTACVKVLRDIEQGEEITCFYGEDFFGDK-NCLCECD 192
A +INH C PNC + + E + RDIE+GEE+T Y DK CLC
Sbjct: 911 IARFINHSCAPNCIAKIIRVEGHQKIVIYADRDIEEGEELTYDYKFPEEVDKIPCLCGAP 970
Query: 193 TC 194
TC
Sbjct: 971 TC 972
>gi|310796984|gb|EFQ32445.1| SET domain-containing protein [Glomerella graminicola M1.001]
Length = 583
Score = 45.1 bits (105), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 47/98 (47%), Gaps = 15/98 (15%)
Query: 112 LLHTGLNDFSVMYSCRKNCA-QLWLGPAA----YINHDCRPNCRF-----VATERDTACV 161
L+ GL + VM S A Q+W G ++NH C+ N +F ++T+R V
Sbjct: 485 LITKGLQNMDVMDSSAGMRAYQIWQGRQGNFTRFVNHSCKSNAQFQRFVWMSTQR---IV 541
Query: 162 KVLRDIEQGEEITCFYGEDFFG--DKNCLCECDTCEIR 197
V + IE G EIT Y ++ DKNCLC C R
Sbjct: 542 LVSKGIEAGHEITVDYSGSYWRGLDKNCLCGESCCRYR 579
>gi|402084863|gb|EJT79881.1| hypothetical protein GGTG_04964 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 1014
Score = 45.1 bits (105), Expect = 0.039, Method: Composition-based stats.
Identities = 42/140 (30%), Positives = 59/140 (42%), Gaps = 17/140 (12%)
Query: 70 YRYSLE----GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS 125
YR +E G+RG + A R +E I G I +T+EE R ++ D Y
Sbjct: 523 YRVGVEVYHTGDRGFGVRANRCFEPGQIIMEYAGEI--ITEEECDRRMNEVYKDKQCYYL 580
Query: 126 CRKNCAQLWLGP----AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFY 177
+ + A ++NH C PNCR V+ A R I+ G+E+T Y
Sbjct: 581 MSFDQNMILDATTGSIARFVNHSCSPNCRMIKWIVSGVPRMALFAGDRQIQTGDELTYDY 640
Query: 178 GEDFFGDKN---CLCECDTC 194
D F KN CLC + C
Sbjct: 641 NFDPFSAKNVQKCLCGSNNC 660
>gi|384248030|gb|EIE21515.1| hypothetical protein COCSUDRAFT_56730 [Coccomyxa subellipsoidea
C-169]
Length = 494
Score = 44.7 bits (104), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 26/40 (65%), Gaps = 1/40 (2%)
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
A+ +NH CRPNC V + V+ LRDIE+GEE+T Y
Sbjct: 430 ASLLNHSCRPNC-VVVRSMTSGSVRALRDIEEGEELTISY 468
>gi|406919276|gb|EKD57612.1| nuclear protein SET [uncultured bacterium]
Length = 191
Score = 44.7 bits (104), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 38/149 (25%), Positives = 64/149 (42%), Gaps = 22/149 (14%)
Query: 60 IQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLN- 118
I + P L Y + G + A R KN+ I ++G + E+ +L++
Sbjct: 21 IAHSPVLSLYYVHKAGCTHGTGLFAARNIYKNEEIIKVIG---PVVGEQVADILYSSYGI 77
Query: 119 DFSVMYSCRKNCAQLWLGPA---AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITC 175
D + +K W+ P+ +INH C PN F A + DI++GEE+T
Sbjct: 78 DVLIQVGTKK-----WILPSNETRFINHSCEPNLGFKA----AGMFVTMCDIKKGEELTW 128
Query: 176 FY------GEDFFGDKNCLCECDTCEIRM 198
Y G D+ +CLC+ +C ++
Sbjct: 129 DYSMCEINGSDYNWTIDCLCKTKSCRGKI 157
>gi|392593395|gb|EIW82720.1| hypothetical protein CONPUDRAFT_163799 [Coniophora puteana
RWD-64-598 SS2]
Length = 228
Score = 44.7 bits (104), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 42/85 (49%), Gaps = 7/85 (8%)
Query: 65 ELKPCYRYSLEGNRGAR-IVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGL------ 117
E+ RYS + + I+ATR I L G +A LT EE+K L T L
Sbjct: 131 EISHTSRYSHKTGKSELCILATRPLIPGQVITELKGSMAHLTDEEDKELKRTDLRNIDIR 190
Query: 118 NDFSVMYSCRKNCAQLWLGPAAYIN 142
DFSV++S L+LGPA ++N
Sbjct: 191 RDFSVIHSRSMKKNHLFLGPARFVN 215
>gi|426228657|ref|XP_004008414.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3 [Ovis aries]
Length = 4922
Score = 44.7 bits (104), Expect = 0.041, Method: Composition-based stats.
Identities = 38/129 (29%), Positives = 54/129 (41%), Gaps = 11/129 (8%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4792 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDSDHVIDATLT 4851
Query: 135 LGPAAYINHDCRPNC-----RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCL- 188
GPA YINH C PNC F A + + R I++GEE+ Y DF D++ +
Sbjct: 4852 GGPARYINHSCAPNCVAEVVAFRAGHK--IIISSSRRIQKGEELCYDYKFDFEDDQHKIP 4909
Query: 189 CECDTCEIR 197
C C R
Sbjct: 4910 CHCGAVNCR 4918
>gi|74145466|dbj|BAE36172.1| unnamed protein product [Mus musculus]
Length = 563
Score = 44.7 bits (104), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 433 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 492
Query: 135 LGPAAYINHDCRPNC--RFVATERD-TACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 493 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 552
Query: 191 CDTCEIR 197
C R
Sbjct: 553 CGAVNCR 559
>gi|451997634|gb|EMD90099.1| hypothetical protein COCHEDRAFT_1225655 [Cochliobolus
heterostrophus C5]
Length = 787
Score = 44.7 bits (104), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 81/202 (40%), Gaps = 32/202 (15%)
Query: 70 YRYSLE----GNRGARIVATRKWEKNDNIAYLVGCIAELTKEE-EKRLLHTGLND--FSV 122
Y Y +E +RG + A R +E + I G I +T+ E E+R+ D + +
Sbjct: 425 YDYGVEVLNTEDRGFGVRAMRTFEPHQIIVEYAGEI--ITQSECERRMKQVYKKDKCYYL 482
Query: 123 MYSCRKNCAQLWLGPAA-YINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFY 177
M K G A ++NH C PNC V E A R I GEE+T Y
Sbjct: 483 MSFDNKMIIDATRGTIARFVNHSCEPNCEMIKWTVGGEPRMALFAGSRGIMTGEELTYDY 542
Query: 178 GEDFFGDKN---CLCECDTCEIRMKGAFAQSSVEPQTTQP-SRYCLRDTDMRLNKRKLHK 233
D F KN C C ++C +G P+ +P L + KRKL
Sbjct: 543 NFDPFSQKNIQECRCGTESC----RGVLG-----PKPKKPVEEKSLTSALIAGTKRKLQD 593
Query: 234 KLNRLLLASDKNDTNSSDNSSK 255
LL S + + SS NS K
Sbjct: 594 -----LLGSKRGGSESSQNSPK 610
>gi|433776186|ref|YP_007306653.1| SET domain-containing protein [Mesorhizobium australicum WSM2073]
gi|433668201|gb|AGB47277.1| SET domain-containing protein [Mesorhizobium australicum WSM2073]
Length = 133
Score = 44.7 bits (104), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 45/94 (47%), Gaps = 14/94 (14%)
Query: 105 TKEEEKRLLHTGLNDFSVMYSCRKNCAQLWL--GPAAYINHDCRPNCRFVATERDTACVK 162
T E E + L+ +S Y R++ A + A Y+NHD PNC + E A
Sbjct: 45 TYESETGPVKNYLDRYS--YPDRRDPAYIVFEADDARYMNHDDEPNCDVSSPEETYA--- 99
Query: 163 VLRDIEQGEEITCFY------GEDFFGDKNCLCE 190
LRDI GEE+TC Y G DF GD++ E
Sbjct: 100 -LRDIAPGEEMTCNYNLFFETGFDFLGDRHLQSE 132
>gi|121704806|ref|XP_001270666.1| histone-lysine N-methyltransferase (Ash1), putative [Aspergillus
clavatus NRRL 1]
gi|119398812|gb|EAW09240.1| histone-lysine N-methyltransferase (Ash1), putative [Aspergillus
clavatus NRRL 1]
Length = 847
Score = 44.7 bits (104), Expect = 0.046, Method: Composition-based stats.
Identities = 38/127 (29%), Positives = 54/127 (42%), Gaps = 9/127 (7%)
Query: 77 NRGARIVATRKWEKNDNIAYLVG-CIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWL 135
+RG + + R +E N I G I ++ E+ R L+ + +MY +
Sbjct: 481 DRGYGVRSNRTFEPNQVIVEYTGEIITQVECEKRMRTLYKNNECYYLMYFDQNMIIDATR 540
Query: 136 GPAA-YINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN---C 187
G A ++NH C PNCR VA + A R I GEE+T Y D + KN C
Sbjct: 541 GSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDRGIMTGEELTYDYNFDPYSQKNVQQC 600
Query: 188 LCECDTC 194
C C
Sbjct: 601 RCGAPNC 607
>gi|405970535|gb|EKC35431.1| Histone-lysine N-methyltransferase MLL3 [Crassostrea gigas]
Length = 1927
Score = 44.7 bits (104), Expect = 0.046, Method: Composition-based stats.
Identities = 39/131 (29%), Positives = 60/131 (45%), Gaps = 19/131 (14%)
Query: 78 RGARIVATRKWEKN----DNIAYL----VGCIAELTKEEEKRLLHTGLNDFSVMYSCRKN 129
+G + A R EK+ + I YL V E+ EE+ R ++ D +
Sbjct: 1797 QGLGLFAARDLEKHTMVIEYIGYLIRNEVANRTEVVYEEQNRGVYMFRIDNETVVD---- 1852
Query: 130 CAQLWLGPAAYINHDCRPNC--RFVATERDTACVKVL-RDIEQGEEITCFYGEDFFGDKN 186
A + GPA YINH C PNC V ++++ + + R I +GEE+T Y DF +++
Sbjct: 1853 -ATMAGGPARYINHSCNPNCVAEVVPFDKESKIIIITNRRIPRGEELTYDYKFDFEDEQH 1911
Query: 187 ---CLCECDTC 194
C C C
Sbjct: 1912 KIPCCCGAPGC 1922
>gi|359790089|ref|ZP_09293005.1| nuclear protein SET [Mesorhizobium alhagi CCNWXJ12-2]
gi|359253999|gb|EHK57060.1| nuclear protein SET [Mesorhizobium alhagi CCNWXJ12-2]
Length = 424
Score = 44.7 bits (104), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 35/75 (46%), Gaps = 6/75 (8%)
Query: 134 WLGPAA---YINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGE---DFFGDKNC 187
W+ PA ++NH C PN V DT + ++DIE+G+E+T Y D + C
Sbjct: 296 WIDPAGTFKFLNHSCGPNVGAVTIAADTMVIIAMKDIEKGDELTLDYSTTECDPHWEMAC 355
Query: 188 LCECDTCEIRMKGAF 202
C C ++ F
Sbjct: 356 TCGSVRCRKIVRSIF 370
>gi|340372263|ref|XP_003384664.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Amphimedon queenslandica]
Length = 1171
Score = 44.7 bits (104), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 45/173 (26%), Positives = 71/173 (41%), Gaps = 13/173 (7%)
Query: 76 GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY--SCRKNC--- 130
GNRG + R D + VG I ++ +E RL T S Y + +N
Sbjct: 784 GNRGWGLKTMRSLSPGDFVIEYVGEIVDMAAVQE-RLKKTQEASVSSFYFLTLERNLIID 842
Query: 131 AQLWLGPAAYINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD--K 185
A++ A +INH C PNC ++ + ++DI++ E+T Y D G+ K
Sbjct: 843 ARVKSNHARFINHSCDPNCETQKWTVNGETRIGIFAIKDIKEDTELTFDYQFDCLGNEKK 902
Query: 186 NCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLHKKLNRL 238
CLC C + Q ++ + QP + T R R +K R+
Sbjct: 903 ACLCGAQNCSGFLGEKPKQEKLKQEKPQPPSSSRKGTGKRA--RSFNKDQARV 953
>gi|34610109|gb|AAN11291.1| mixed-lineage leukemia 3 protein [Mus musculus]
Length = 3396
Score = 44.7 bits (104), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 54/127 (42%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 3266 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 3325
Query: 135 LGPAAYINHDCRPNC--RFVATERD-TACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V ER + R I++GEE+ Y DF D++ + C
Sbjct: 3326 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHKIPCH 3385
Query: 191 CDTCEIR 197
C R
Sbjct: 3386 CGAVNCR 3392
>gi|407849311|gb|EKG04091.1| hypothetical protein TCSYLVIO_004851 [Trypanosoma cruzi]
Length = 937
Score = 44.7 bits (104), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 44/185 (23%), Positives = 67/185 (36%), Gaps = 25/185 (13%)
Query: 2 KSIVHNLILHVPIQLACHLLNKLNPSFWAFCVQALLAQRFNIGRILDMGTQHHRYSYRIQ 61
K+ + N + A L + +F + V + + + G TQ +
Sbjct: 424 KTEMQNNEMSTVTGAAIELFRRGGMTFISDAVTSQVVESLQSGAFFGEPTQAGEMWNALH 483
Query: 62 YLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFS 121
+ ++P R S+ G +IA LV C E + + + +FS
Sbjct: 484 LIAAMQPVDRNSIRTEDGF----------AHDIADLVRCWTE------RAIPFCPVEEFS 527
Query: 122 VMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTAC---------VKVLRDIEQGEE 172
+ + A +INH C PN FV E + AC V L DIE GEE
Sbjct: 528 IELEGAPGEGKGLYPFAGFINHSCSPNALFVFGEEEGACCSSGGGVLRVVALSDIEPGEE 587
Query: 173 ITCFY 177
IT Y
Sbjct: 588 ITVTY 592
>gi|367041896|ref|XP_003651328.1| hypothetical protein THITE_2111453 [Thielavia terrestris NRRL 8126]
gi|346998590|gb|AEO64992.1| hypothetical protein THITE_2111453 [Thielavia terrestris NRRL 8126]
Length = 950
Score = 44.3 bits (103), Expect = 0.052, Method: Composition-based stats.
Identities = 43/140 (30%), Positives = 61/140 (43%), Gaps = 17/140 (12%)
Query: 70 YRYSLE----GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLND---FSV 122
YR +E +RG + + R +E N I G I +T+EE +R ++ D + +
Sbjct: 504 YRVGVEVFKTPDRGYGVRSNRCFEPNQIIMEYTGEI--ITEEECERRMNEEYKDNECYYL 561
Query: 123 MYSCRKNCAQLWLGPAA-YINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFY 177
M + G A ++NH C PNCR V+ + A R I GEE+T Y
Sbjct: 562 MSFDQNMIIDATTGSIARFVNHSCSPNCRMIKWIVSGQPRMALFAGDRPIMTGEELTYDY 621
Query: 178 GEDFFGDKN---CLCECDTC 194
D F KN CLC C
Sbjct: 622 NFDPFSAKNVQKCLCGSANC 641
>gi|255949576|ref|XP_002565555.1| Pc22g16400 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211592572|emb|CAP98928.1| Pc22g16400 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 536
Score = 44.3 bits (103), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 34/59 (57%), Gaps = 3/59 (5%)
Query: 140 YINHDCRPNCRF---VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCE 195
YI+H C+P+ +F R V V+RDI EEIT YG +++G+K+ C C C+
Sbjct: 474 YISHSCQPSTQFKTRTVGNRVICTVDVIRDILPFEEITVNYGFNYWGNKDFDCLCGHCD 532
>gi|390571416|ref|ZP_10251658.1| hypothetical protein WQE_23703 [Burkholderia terrae BS001]
gi|389936625|gb|EIM98511.1| hypothetical protein WQE_23703 [Burkholderia terrae BS001]
Length = 160
Score = 44.3 bits (103), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 31/63 (49%), Gaps = 6/63 (9%)
Query: 116 GLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITC 175
GL D V+ + WL NH CRPNC + ER ++ LRD+ G+E++
Sbjct: 70 GLADGRVIDGSVGGNSARWL------NHSCRPNCEAIEDERGRVFIETLRDVMPGDELSI 123
Query: 176 FYG 178
YG
Sbjct: 124 AYG 126
>gi|348667102|gb|EGZ06928.1| hypothetical protein PHYSODRAFT_565808 [Phytophthora sojae]
Length = 1540
Score = 44.3 bits (103), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 47/112 (41%), Gaps = 15/112 (13%)
Query: 140 YINHDCRPNCRFVATERDTA------CVKVLRDIEQGEEITCFYGEDFFGDKN--CLCEC 191
++NH C PNC+ TE+ T V LRDIE GEE+T Y G + C C
Sbjct: 211 FVNHSCEPNCK---TEKWTVKGETRIAVIALRDIEVGEELTFDYQWKALGSRQIKCYCGA 267
Query: 192 DTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLHKKLNRLLLASD 243
+C KG + + + RD + R L K R+ L+ D
Sbjct: 268 PSC----KGVIGTQNDALKDAEAQTGYFRDPEKEDTGRALVGKRVRVFLSPD 315
>gi|406920297|gb|EKD58392.1| hypothetical protein ACD_56C00141G0022 [uncultured bacterium]
Length = 154
Score = 44.3 bits (103), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 32/60 (53%), Gaps = 3/60 (5%)
Query: 140 YINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFF--GDKNCLCECDTCEIR 197
YINH C+PNC V + + RDIE+GEE+ YG ++F K C+C C +
Sbjct: 91 YINHFCQPNCE-VQISKKMVLIYASRDIEKGEELGYDYGAEYFLAHIKTKGCKCPYCRFK 149
>gi|170050731|ref|XP_001861443.1| set domain protein [Culex quinquefasciatus]
gi|167872245|gb|EDS35628.1| set domain protein [Culex quinquefasciatus]
Length = 1181
Score = 44.3 bits (103), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 41/161 (25%), Positives = 70/161 (43%), Gaps = 35/161 (21%)
Query: 35 ALLAQRF-NIGRILDMGTQHHRYSYRIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDN 93
AL A+R N G L H+ + I+Y+ E+ + G AR + ++ +K++N
Sbjct: 952 ALAAKRIPNKGWGLVAQEDIHQGQFVIEYVGEV-------INGEELARRIKQKQEQKDEN 1004
Query: 94 IAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFV- 152
Y + +ELT + + N A+ +INH C PNC +
Sbjct: 1005 Y-YFLTVDSELT----------------IDAGPKGNLAR-------FINHSCEPNCETLL 1040
Query: 153 --ATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCEC 191
+ + L+D++ GEE+T Y + FGD+ +C C
Sbjct: 1041 WKVGGSQSVGLFALKDLKAGEELTFNYNFETFGDQKKICHC 1081
>gi|347829685|emb|CCD45382.1| similar to similar to SET domain containing protein [Botryotinia
fuckeliana]
Length = 584
Score = 44.3 bits (103), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 41/132 (31%), Positives = 59/132 (44%), Gaps = 29/132 (21%)
Query: 70 YRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYS-CRK 128
+R L N G I+A + + I VG L+ G+ VM S K
Sbjct: 453 FRTPLGMNNG--IIARCGFARGSAIGEFVG------------LITKGMEGKDVMQSKSTK 498
Query: 129 NCAQLWLGPAA----YINHDCRPNCRF-----VATERDTACVKVLRDIEQGEEITCFYGE 179
N Q++ G ++NH C PN +F + TER V V + IE G+EIT Y
Sbjct: 499 NQYQIYQGRMGNHTRFVNHSCAPNSQFQKFHWLGTER---IVLVSKGIEAGKEITVDYSH 555
Query: 180 DFFG--DKNCLC 189
+++ DK+CLC
Sbjct: 556 EYWKGLDKDCLC 567
>gi|159126473|gb|EDP51589.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 440
Score = 44.3 bits (103), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 35/138 (25%), Positives = 64/138 (46%), Gaps = 10/138 (7%)
Query: 64 PELKPCYRYSLE-GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEK--RLLH-TGLND 119
P L C+ ++ RG + A +++K D + VG + L +++ LLH + + +
Sbjct: 294 PTLAMCFVELVDYPQRGIGVRALARFKKGDILDEYVGELRPLDYTDDRVYALLHESKMTE 353
Query: 120 FSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFV---ATERDTACVKVLRDIEQGEEITCF 176
+ W ++NH C+P+ F+ +R V+ +RDI+ EEIT
Sbjct: 354 GHPLALISAKQHGNW---TRHLNHSCKPSTTFLKMTVGKRTIMAVQAVRDIDIFEEITVD 410
Query: 177 YGEDFFGDKNCLCECDTC 194
YG ++ ++ CLC C
Sbjct: 411 YGTGYWKNRKCLCGEPGC 428
>gi|350297194|gb|EGZ78171.1| hypothetical protein NEUTE2DRAFT_101826 [Neurospora tetrasperma
FGSC 2509]
Length = 1162
Score = 43.9 bits (102), Expect = 0.069, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 31/64 (48%), Gaps = 7/64 (10%)
Query: 138 AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN---CLCE 190
A ++NH C PNCR V+ + A R I+ GEE+T Y D F KN CLC
Sbjct: 769 ARFVNHSCSPNCRMIKWIVSGQPRMALFAGDRPIQTGEELTYDYNFDPFSAKNVQKCLCG 828
Query: 191 CDTC 194
C
Sbjct: 829 APNC 832
>gi|336463582|gb|EGO51822.1| hypothetical protein NEUTE1DRAFT_125465 [Neurospora tetrasperma
FGSC 2508]
Length = 1162
Score = 43.9 bits (102), Expect = 0.069, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 31/64 (48%), Gaps = 7/64 (10%)
Query: 138 AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN---CLCE 190
A ++NH C PNCR V+ + A R I+ GEE+T Y D F KN CLC
Sbjct: 769 ARFVNHSCSPNCRMIKWIVSGQPRMALFAGDRPIQTGEELTYDYNFDPFSAKNVQKCLCG 828
Query: 191 CDTC 194
C
Sbjct: 829 APNC 832
>gi|336264616|ref|XP_003347084.1| hypothetical protein SMAC_05383 [Sordaria macrospora k-hell]
gi|380093778|emb|CCC08742.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 1224
Score = 43.9 bits (102), Expect = 0.069, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 31/64 (48%), Gaps = 7/64 (10%)
Query: 138 AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN---CLCE 190
A ++NH C PNCR V+ + A R I+ GEE+T Y D F KN CLC
Sbjct: 839 ARFVNHSCSPNCRMIKWIVSGQPRMALFAGDRPIQTGEELTYDYNFDPFSAKNVQKCLCG 898
Query: 191 CDTC 194
C
Sbjct: 899 APNC 902
>gi|164429563|ref|XP_964116.2| hypothetical protein NCU01932 [Neurospora crassa OR74A]
gi|157073530|gb|EAA34880.2| predicted protein [Neurospora crassa OR74A]
Length = 1162
Score = 43.9 bits (102), Expect = 0.069, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 31/64 (48%), Gaps = 7/64 (10%)
Query: 138 AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN---CLCE 190
A ++NH C PNCR V+ + A R I+ GEE+T Y D F KN CLC
Sbjct: 769 ARFVNHSCSPNCRMIKWIVSGQPRMALFAGDRPIQTGEELTYDYNFDPFSAKNVQKCLCG 828
Query: 191 CDTC 194
C
Sbjct: 829 APNC 832
>gi|170112995|ref|XP_001887698.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164637336|gb|EDR01622.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 293
Score = 43.9 bits (102), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 54/112 (48%), Gaps = 2/112 (1%)
Query: 75 EGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLW 134
+G+ G I ATR + + + L+G +A ++ E RL + V + ++
Sbjct: 135 DGSGGFAIFATRDLVEGEILYPLMGLMASDSRAEHSRLSEIHPHPSQVKLEKGRMDPRVL 194
Query: 135 LGPAAYINHDCRP-NCRFVATERDTA-CVKVLRDIEQGEEITCFYGEDFFGD 184
+GP +INH C N FVA A V+ + I+ +E+ YG+D+F D
Sbjct: 195 VGPLRFINHTCHSFNAEFVAIRSALAFVVQTNKLIKFHDEVFVDYGKDYFED 246
>gi|451852041|gb|EMD65336.1| hypothetical protein COCSADRAFT_35392 [Cochliobolus sativus ND90Pr]
Length = 787
Score = 43.9 bits (102), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 58/199 (29%), Positives = 80/199 (40%), Gaps = 32/199 (16%)
Query: 70 YRYSLE----GNRGARIVATRKWEKNDNIAYLVGCIAELTKEE-EKRLLHTGLND--FSV 122
Y Y +E +RG + A R +E + I G I +T+ E E+R+ D + +
Sbjct: 425 YDYGVEVLNTEDRGFGVRAMRTFEPHQIIVEYAGEI--ITQSECERRMKQVYKKDKCYYL 482
Query: 123 MYSCRKNCAQLWLGPAA-YINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFY 177
M K G A ++NH C PNC V E A R I GEE+T Y
Sbjct: 483 MSFDNKMIIDATRGTIARFVNHSCEPNCEMIKWTVGGEPRMALFAGSRGIMTGEELTYDY 542
Query: 178 GEDFFGDKN---CLCECDTCEIRMKGAFAQSSVEPQTTQP-SRYCLRDTDMRLNKRKLHK 233
D F KN C C ++C +G P+ +P L T + KRKL
Sbjct: 543 NFDPFSQKNIQECRCGTESC----RGVLG-----PKPKKPVEEKSLTSTLIAGTKRKLQD 593
Query: 234 KLNRLLLASDKNDTNSSDN 252
LL S + + SS N
Sbjct: 594 -----LLGSKRGGSESSQN 607
>gi|94969255|ref|YP_591303.1| nuclear protein SET [Candidatus Koribacter versatilis Ellin345]
gi|94551305|gb|ABF41229.1| nuclear protein SET [Candidatus Koribacter versatilis Ellin345]
Length = 158
Score = 43.9 bits (102), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 25/69 (36%), Positives = 33/69 (47%), Gaps = 2/69 (2%)
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITC-FYGEDFFGDKNCLCECDTCEI 196
A ++NH C PNC A D ++ +RDIE GEE+ +Y D + C C TC
Sbjct: 87 AMFVNHSCDPNCE-TAEYDDQIWIQAMRDIEPGEELVYDYYLYDGEEEAPCYCGAKTCRG 145
Query: 197 RMKGAFAQS 205
M G S
Sbjct: 146 TMYGVSPSS 154
>gi|327274410|ref|XP_003221970.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Anolis carolinensis]
Length = 4817
Score = 43.9 bits (102), Expect = 0.078, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 35/71 (49%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ER + R I++GEE+ Y DF D++
Sbjct: 4743 ATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKFDFEDDQHK 4802
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 4803 IPCHCGAVNCR 4813
>gi|296413480|ref|XP_002836440.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295630261|emb|CAZ80631.1| unnamed protein product [Tuber melanosporum]
Length = 763
Score = 43.9 bits (102), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 42/96 (43%), Gaps = 24/96 (25%)
Query: 133 LWLGPAAYINHDCRPNC--RFVATERDTACVKVLRDIEQGEEITCFYGED---------- 180
LW+ P+ YINH C N F+ D V+ RD+ +GEEIT Y E+
Sbjct: 518 LWIMPS-YINHSCWQNSTRSFLG---DLLIVRAARDLSEGEEITINYMENESGVQKRQKA 573
Query: 181 FFGDKNCLCECDTCEIRMKGAFAQSSVEPQTTQPSR 216
F + C+C CEI + EPQ Q R
Sbjct: 574 FLSEWGFECKCTMCEI--------ETAEPQGVQDKR 601
>gi|326679526|ref|XP_001919281.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Danio rerio]
Length = 3915
Score = 43.9 bits (102), Expect = 0.079, Method: Composition-based stats.
Identities = 37/127 (29%), Positives = 52/127 (40%), Gaps = 7/127 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK + +G I K ++ N M+ A +
Sbjct: 3785 QGLGLYAARDIEKYTMVIEYIGTIIRSEVANRKEKMYEAQNRGVYMFRIDSEHVIDATIT 3844
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC VA ER + R I++GEE+ Y D D++ + C
Sbjct: 3845 GGPARYINHSCAPNCITEVVALERGHKIIISSNRRIQRGEELCYDYKFDLEDDQHKIPCH 3904
Query: 191 CDTCEIR 197
C R
Sbjct: 3905 CGAVNCR 3911
>gi|242209059|ref|XP_002470378.1| predicted protein [Postia placenta Mad-698-R]
gi|220730548|gb|EED84403.1| predicted protein [Postia placenta Mad-698-R]
Length = 1006
Score = 43.9 bits (102), Expect = 0.079, Method: Composition-based stats.
Identities = 25/67 (37%), Positives = 34/67 (50%), Gaps = 13/67 (19%)
Query: 142 NHDCRPNCRFVATERDTACVKV--LRDIEQGEEITCFYGEDFF----------GDKNCLC 189
NHDC PN F +E +T C ++ + DIE+GEEIT Y + G N +C
Sbjct: 822 NHDCTPNAHFHFSE-NTCCGRLVAMHDIEEGEEITVRYVDSLAPREERQSLLRGRYNFVC 880
Query: 190 ECDTCEI 196
C TC +
Sbjct: 881 TCRTCSL 887
>gi|328865276|gb|EGG13662.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 1418
Score = 43.9 bits (102), Expect = 0.087, Method: Composition-based stats.
Identities = 30/102 (29%), Positives = 44/102 (43%), Gaps = 10/102 (9%)
Query: 100 CIAELTKEE-EKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDT 158
C+ +T+ E EK L+ + + RK A +INH C PNC D
Sbjct: 1097 CLERMTESESEKYFYFLTLDRLECLDASRKG------NLARFINHSCDPNCETQKWNVDG 1150
Query: 159 AC---VKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIR 197
+ +RDI++GEE+T Y + FG +C C R
Sbjct: 1151 EVRIGIFAIRDIKRGEELTFDYNYERFGTSKQVCYCGAANCR 1192
>gi|261329776|emb|CBH12758.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 429
Score = 43.5 bits (101), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 5/57 (8%)
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDF-----FGDKNCLCE 190
+++NH C PNC FV T + VLRDI +GE++ Y D + D+ LC+
Sbjct: 358 SFLNHSCEPNCGFVGTNAMNRRLVVLRDIREGEQLLINYNADLTTCVSYEDRRALCK 414
>gi|429862858|gb|ELA37465.1| histone-lysine n-methyltransferase [Colletotrichum gloeosporioides
Nara gc5]
Length = 805
Score = 43.5 bits (101), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 54/199 (27%), Positives = 79/199 (39%), Gaps = 30/199 (15%)
Query: 70 YRYSLE----GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLND---FSV 122
YR +E +RG I A R + N I G I +T EE + T D + +
Sbjct: 445 YRVGVEVIKTADRGYGIRANRCFAPNQIIMEYTGEI--ITDEECSNRMETKYKDNKCYYL 502
Query: 123 MYSCRKNCAQLWLGPAA-YINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFY 177
M + G A ++NH C PNCR V+ + A + I GEE+T Y
Sbjct: 503 MSFDQNMIIDATTGSIARFVNHSCAPNCRMIKWIVSGQPRMALFAGDKPIMTGEELTYDY 562
Query: 178 GEDFFGDKN---CLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDT---DMRLNKRKL 231
F D+N CLC C +G E + +P + L+ M+ +KRK
Sbjct: 563 NFSPFSDENVQKCLCGAPNC----RGILGPKPQEVKQPKPPKSTLKAVVKGAMKASKRKF 618
Query: 232 HKKLNRLLLASDKNDTNSS 250
L D ++T S+
Sbjct: 619 E------TLVGDDDETGSA 631
>gi|308511877|ref|XP_003118121.1| hypothetical protein CRE_00036 [Caenorhabditis remanei]
gi|308238767|gb|EFO82719.1| hypothetical protein CRE_00036 [Caenorhabditis remanei]
Length = 301
Score = 43.5 bits (101), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 42/148 (28%), Positives = 60/148 (40%), Gaps = 20/148 (13%)
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGP- 137
G + A + D I G I KE EKR+L ++S Y+ + L++ P
Sbjct: 104 GNGLFAKEDINRGDQICIYAGPIIP-KKEHEKRVLKYVEKNYSDFYAYK--VGDLFVDPT 160
Query: 138 -----AAYINHDCRPNC---RFVATER----DTACVKVLRDIEQGEEITCFYGEDFFGDK 185
A + NH C PN R+ +R T R I +G+E+T YGE+ +
Sbjct: 161 ESGNLARFANHSCSPNMVAERWQIDKRFEGYRTIVFIAKRPIREGDELTVHYGENINISQ 220
Query: 186 NCLCECDTCEIRMKGAFAQSSVEPQTTQ 213
C C D C G Q EP T+
Sbjct: 221 ECRCGEDHCS----GWIGQKKEEPAATK 244
>gi|134079644|emb|CAK97070.1| unnamed protein product [Aspergillus niger]
Length = 885
Score = 43.5 bits (101), Expect = 0.089, Method: Composition-based stats.
Identities = 36/126 (28%), Positives = 54/126 (42%), Gaps = 14/126 (11%)
Query: 76 GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWL 135
+RG + + R +E N I G I T+ E++ N+ ++ + R + A+
Sbjct: 455 ADRGYGVRSNRTFEPNQIIVEYTGEIITQTECEKRMRTIYKHNENMIIDATRGSIAR--- 511
Query: 136 GPAAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN---CL 188
++NH C PNCR VA + A R I GEE+T Y D + KN C
Sbjct: 512 ----FVNHSCEPNCRMEKWTVAGKPRMALFAGDRGIMTGEELTYDYNFDPYSQKNVQQCR 567
Query: 189 CECDTC 194
C C
Sbjct: 568 CGSSNC 573
>gi|303276042|ref|XP_003057315.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461667|gb|EEH58960.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 439
Score = 43.5 bits (101), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 25/69 (36%), Positives = 34/69 (49%), Gaps = 5/69 (7%)
Query: 137 PAAYINHDCRPNCRFVATERDTACVKVL---RDIEQGEEITCFYG--EDFFGDKNCLCEC 191
P+ Y+NH C P C F T + V V+ RD+ G+E+T Y ED CLC
Sbjct: 97 PSMYMNHSCEPTCWFEPTRVEWDGVVVMTASRDLRPGDELTFDYATSEDAEQSWKCLCGA 156
Query: 192 DTCEIRMKG 200
+C R+ G
Sbjct: 157 PSCRGRVTG 165
Score = 37.7 bits (86), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 35/82 (42%), Gaps = 7/82 (8%)
Query: 140 YINHDCRPNCRFVATERDTAC-VKVLRDIEQGEEITCFY---GEDFF---GDKNCLCECD 192
++NH C PNC + A + RD+ GE IT Y ED G C C
Sbjct: 330 FLNHSCEPNCDVTVYPTNYAIKLTANRDVHPGEAITIDYEHTEEDLIMQGGAFECACGAP 389
Query: 193 TCEIRMKGAFAQSSVEPQTTQP 214
+C + G + S+ P+T P
Sbjct: 390 SCRGMIAGWKHRVSMSPKTPSP 411
>gi|72391782|ref|XP_846185.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62358390|gb|AAX78854.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70802721|gb|AAZ12626.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 429
Score = 43.5 bits (101), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 5/57 (8%)
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDF-----FGDKNCLCE 190
+++NH C PNC FV T + VLRDI +GE++ Y D + D+ LC+
Sbjct: 358 SFLNHSCEPNCGFVGTNAMNRRLVVLRDIREGEQLLINYNADLTTCVSYEDRRALCK 414
>gi|350631656|gb|EHA20027.1| hypothetical protein ASPNIDRAFT_179076 [Aspergillus niger ATCC
1015]
Length = 853
Score = 43.5 bits (101), Expect = 0.093, Method: Composition-based stats.
Identities = 38/128 (29%), Positives = 54/128 (42%), Gaps = 9/128 (7%)
Query: 76 GNRGARIVATRKWEKNDNIAYLVGCIAELTK-EEEKRLLHTGLNDFSVMYSCRKNCAQLW 134
+RG + + R +E N I G I T+ E+ R ++ + +MY +
Sbjct: 487 ADRGYGVRSNRTFEPNQIIVEYTGEIITQTECEKRMRTIYKHNECYYLMYFDQNMIIDAT 546
Query: 135 LGPAA-YINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN--- 186
G A ++NH C PNCR VA + A R I GEE+T Y D + KN
Sbjct: 547 RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDRGIMTGEELTYDYNFDPYSQKNVQQ 606
Query: 187 CLCECDTC 194
C C C
Sbjct: 607 CRCGSSNC 614
>gi|325093968|gb|EGC47278.1| SET domain-containing protein [Ajellomyces capsulatus H88]
Length = 496
Score = 43.5 bits (101), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 49/101 (48%), Gaps = 6/101 (5%)
Query: 130 CAQLWLGPAAYINHDCRPNCRFVA---TERDTACVKVLRDIEQGEEITCFYGEDFF--GD 184
C + + A ++NH C P+ FV+ +R ++ RDI+ EEIT YG+ ++
Sbjct: 373 CPKRYGNWARFVNHSCDPSVDFVSRTIGKRIYMMMETRRDIKAFEEITVDYGKQYWRSKQ 432
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMR 225
K CLC +C+ + K Q+ P P R+ ++R
Sbjct: 433 KQCLCRSQSCKDKDK-TIPQNVPHPVPAIPRDNPYRNWNLR 472
>gi|167533183|ref|XP_001748272.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773392|gb|EDQ87033.1| predicted protein [Monosiga brevicollis MX1]
Length = 456
Score = 43.5 bits (101), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 48/108 (44%), Gaps = 24/108 (22%)
Query: 138 AAYINHDCRPNCRFVATERD---TACVKVLRDIEQGEEITCFYGEDF---------FGDK 185
A +NH C PNC V T R VK LRDI GEE+ Y + + GD+
Sbjct: 213 GAILNHSCEPNC--VMTYRGPLHAQAVKALRDIAVGEELCHSYIDLYAPTGQRHSHLGDQ 270
Query: 186 NCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLHK 233
ECD C + + GA A+ P+ T +MRL + LH+
Sbjct: 271 YGF-ECD-CALYLDGALAELDALPEVTA--------AEMRLPGKMLHE 308
>gi|13476347|ref|NP_107917.1| hypothetical protein mll7640 [Mesorhizobium loti MAFF303099]
gi|14027108|dbj|BAB54062.1| mll7640 [Mesorhizobium loti MAFF303099]
Length = 139
Score = 43.5 bits (101), Expect = 0.100, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 44/90 (48%), Gaps = 14/90 (15%)
Query: 105 TKEEEKRLLHTGLNDFSVMYSCRKNCAQLWL--GPAAYINHDCRPNCRFVATERDTACVK 162
T E E + L+ +S Y R++ A + A Y+NHD PNC + E A
Sbjct: 55 TYEGESGPVKNYLDRYS--YPDRRDPAYIVFEADDARYMNHDDEPNCDVSSPEETYA--- 109
Query: 163 VLRDIEQGEEITCFY------GEDFFGDKN 186
LRDI GEE+TC Y G DF GD++
Sbjct: 110 -LRDIAPGEELTCNYNHFFEAGFDFLGDRH 138
>gi|213406581|ref|XP_002174062.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
yFS275]
gi|212002109|gb|EEB07769.1| histone-lysine N-methyltransferase [Schizosaccharomyces japonicus
yFS275]
Length = 779
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 43/86 (50%), Gaps = 4/86 (4%)
Query: 138 AAYINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCEC-DT 193
A + NH CRPNC ++V + + RDI++GEE+T Y D +G + C C +
Sbjct: 232 ARFCNHSCRPNCYVDKWVVGNKLRMGIFCKRDIQKGEELTFDYNVDRYGAQAQPCYCGED 291
Query: 194 CEIRMKGAFAQSSVEPQTTQPSRYCL 219
C + G Q+ +P+ + R L
Sbjct: 292 CCLGYIGGRTQTEAQPKLAENVREAL 317
>gi|324499811|gb|ADY39929.1| Histone-lysine N-methyltransferase trr [Ascaris suum]
Length = 2347
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 36/74 (48%), Gaps = 9/74 (12%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATERDTACVKVL----RDIEQGEEITCFYG---EDF 181
A + GPA YINH C PNC R VA+ + K++ R I GEE+T Y ED
Sbjct: 2270 ATMAGGPARYINHSCDPNCSTRIVASGPNGDDKKIIIIANRPISAGEELTYDYQFDIEDT 2329
Query: 182 FGDKNCLCECDTCE 195
CLC C+
Sbjct: 2330 ADKIACLCGAPNCQ 2343
>gi|308463011|ref|XP_003093784.1| hypothetical protein CRE_24784 [Caenorhabditis remanei]
gi|308249390|gb|EFO93342.1| hypothetical protein CRE_24784 [Caenorhabditis remanei]
Length = 243
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 63/163 (38%), Gaps = 32/163 (19%)
Query: 65 ELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMY 124
E+KP G+ G + A + D + G I KE EKR+L +F+ Y
Sbjct: 88 EVKP------SGDTGYGLFAKENINEVDQVCIYAGSIIP-KKEHEKRVLKYKKKNFTHFY 140
Query: 125 SCRKNCAQLWLGP------AAYINHDCRPNCRFVATER----------DTACVKVLRDIE 168
+ + L++ P A + NH C PN + ER T R I
Sbjct: 141 AYK--VGDLFVDPTESGNLARFANHSCSPN---MVAERWQIDRRFEGYRTIVFIAKRPIC 195
Query: 169 QGEEITCFYGEDFFGDKNCLCECDTCEIRMKGAFAQSSVEPQT 211
+G+E+T YGE + C C D C G Q EP T
Sbjct: 196 EGDELTVHYGEKLNVSQECRCGEDNCS----GWIGQKKKEPAT 234
>gi|389745696|gb|EIM86877.1| hypothetical protein STEHIDRAFT_131417 [Stereum hirsutum FP-91666
SS1]
Length = 170
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 34/67 (50%), Gaps = 5/67 (7%)
Query: 140 YINHDCRPNCRF--VATERDTACVKVLRDIEQGEEITCFYGEDFFGDK---NCLCECDTC 194
Y+NH C PN F A V+ LRDI +G+ +T FY + + +CLC TC
Sbjct: 74 YVNHSCEPNVAFDLSALNYKEWHVRALRDIHKGDTLTFFYPSTEWNMEQPFDCLCGSSTC 133
Query: 195 EIRMKGA 201
+KGA
Sbjct: 134 LRTIKGA 140
>gi|225554567|gb|EEH02864.1| SET domain-containing protein [Ajellomyces capsulatus G186AR]
Length = 511
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 49/101 (48%), Gaps = 6/101 (5%)
Query: 130 CAQLWLGPAAYINHDCRPNCRFVA---TERDTACVKVLRDIEQGEEITCFYGEDFF--GD 184
C + + A ++NH C P+ FV+ +R ++ RDI+ EEIT YG+ ++
Sbjct: 373 CPKRYGNWARFVNHSCDPSVDFVSRTIGKRIYMMMETRRDIKAFEEITVDYGKQYWRSKQ 432
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMR 225
K CLC +C+ + K Q+ P P R+ ++R
Sbjct: 433 KQCLCRSQSCKDKDK-TIPQNVPHPVPAIPRDNPYRNWNLR 472
>gi|258573585|ref|XP_002540974.1| predicted protein [Uncinocarpus reesii 1704]
gi|237901240|gb|EEP75641.1| predicted protein [Uncinocarpus reesii 1704]
Length = 727
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 47/141 (33%), Positives = 61/141 (43%), Gaps = 17/141 (12%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHT--GLND-FSVMYSCRKNCAQL 133
+RG + + R + N I G I LT+EE +R + T ND + +MY +
Sbjct: 425 DRGYGVRSNRAFNPNQIIVEYTGEI--LTQEECERRMRTVYKKNDCYYLMYFDQNMVIDA 482
Query: 134 WLGP-AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN-- 186
G A +INH C PNCR VA + A I GEE+T Y D + KN
Sbjct: 483 TRGSIARFINHSCEPNCRMEKWTVAGKPRMALFAGEDGIMTGEELTYDYNFDPYSQKNVQ 542
Query: 187 -CLCECDTCE----IRMKGAF 202
C C TC R K AF
Sbjct: 543 ECRCGAPTCRGVLGPRQKEAF 563
>gi|358391269|gb|EHK40673.1| hypothetical protein TRIATDRAFT_286297 [Trichoderma atroviride IMI
206040]
Length = 347
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 46/168 (27%), Positives = 71/168 (42%), Gaps = 31/168 (18%)
Query: 58 YRIQYLPELKPCYRYSLEGNRGARIVA------TRKWEKNDNIA--YLVGCIAELTKEEE 109
Y IQ +P + + +GARI++ E N + ++ + +L + ++
Sbjct: 20 YAIQAVPGKGISLIATTKILKGARILSELPIFKVSYAESNRQVLANHIANKLKDLDEAKQ 79
Query: 110 KRLLHT----GLNDFSVMYSCRKNCAQLWLGP--------AAYINHDCRPNCRFVATER- 156
+ LH GL+D + R N + L G AA INH CRPN E
Sbjct: 80 RGFLHLQNVYGLDDGLFLGIARSNVSPLGPGASEGGLFLDAARINHSCRPNAHKSWNENL 139
Query: 157 DTACVKVLRDIEQGEEITCFY-GEDF-FGDKNCL--------CECDTC 194
V +RDIE+G+EIT Y GE + ++ + C CD C
Sbjct: 140 QRLTVHAVRDIERGQEITISYLGETLSYIERQAILKQRFRFDCGCDLC 187
>gi|71649958|ref|XP_813687.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70878595|gb|EAN91836.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 501
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 26/41 (63%), Gaps = 1/41 (2%)
Query: 138 AAYINHDCRPNCRFVATE-RDTACVKVLRDIEQGEEITCFY 177
A INH C P+ RFV T R +A V LRDIE+GEEI Y
Sbjct: 429 ATKINHSCLPSVRFVPTHGRVSAVVVALRDIEKGEEIRSSY 469
>gi|427735004|ref|YP_007054548.1| SET domain-containing protein [Rivularia sp. PCC 7116]
gi|427370045|gb|AFY54001.1| SET domain-containing protein [Rivularia sp. PCC 7116]
Length = 159
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 32/130 (24%), Positives = 52/130 (40%), Gaps = 19/130 (14%)
Query: 76 GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWL 135
G +G + ATR ++ + + +VG ++ E G N F+ L
Sbjct: 11 GGKGEGMFATRAFKPGETV--MVGRFEKMLDENHSHASQIGQNKFA-----------LHA 57
Query: 136 GPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGE-----DFFGDKNCLCE 190
G + +NH C PNC E ++DI EE++ Y ++F K C+C
Sbjct: 58 GLISKVNHSCNPNCGIQINETGAHDFVAMKDISVEEELSFDYAMRNYTVNYFPPK-CMCG 116
Query: 191 CDTCEIRMKG 200
C R+ G
Sbjct: 117 SKDCRGRITG 126
>gi|407843445|gb|EKG01401.1| hypothetical protein TCSYLVIO_007600 [Trypanosoma cruzi]
Length = 504
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 26/41 (63%), Gaps = 1/41 (2%)
Query: 138 AAYINHDCRPNCRFVATE-RDTACVKVLRDIEQGEEITCFY 177
A INH C P+ RFV T R +A V LRDIE+GEEI Y
Sbjct: 432 ATKINHSCVPSVRFVPTHGRVSAVVVALRDIEKGEEIRSSY 472
>gi|412985970|emb|CCO17170.1| predicted protein [Bathycoccus prasinos]
Length = 1058
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 24/62 (38%), Positives = 33/62 (53%), Gaps = 3/62 (4%)
Query: 136 GPAAYINHDCRPNCRFVA-TERDTACVKV--LRDIEQGEEITCFYGEDFFGDKNCLCECD 192
G ++NH C PNC + C+ V LR+I+ GEEIT Y + +G+K C CD
Sbjct: 307 GVGRFLNHSCDPNCETQKWMVKGELCIGVFALREIQPGEEITIDYKFERYGEKPMRCFCD 366
Query: 193 TC 194
T
Sbjct: 367 TA 368
>gi|383758263|ref|YP_005437248.1| SET domain-containing protein [Rubrivivax gelatinosus IL144]
gi|381378932|dbj|BAL95749.1| SET domain protein [Rubrivivax gelatinosus IL144]
Length = 146
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 29/65 (44%), Gaps = 6/65 (9%)
Query: 133 LWLGPAAYINHDCRPNCRFVATERDTACVKV--LRDIEQGEEITCFYGEDFFGDK-NCLC 189
L P Y NH CRPN R V D V+ LR I GEEIT YGE G C C
Sbjct: 81 LSTDPMRYTNHSCRPNARLVL---DNGRVEFFSLRAIAPGEEITVDYGETHHGGTLPCRC 137
Query: 190 ECDTC 194
C
Sbjct: 138 GAPGC 142
>gi|145355325|ref|XP_001421914.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582153|gb|ABP00208.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 860
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 43/94 (45%), Gaps = 5/94 (5%)
Query: 103 ELTKEEEKRLLHTGLND--FSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVA-TERDTA 159
E KE ++R G + F + S A + ++NH C PNC R
Sbjct: 215 EAYKERKRRYQDEGRSHYYFMTLSSSETIDATIRGNEGRFLNHSCAPNCETQKWMVRGEL 274
Query: 160 CVKVL--RDIEQGEEITCFYGEDFFGDKNCLCEC 191
C+ + RDIE+GEE+T Y + FG+K C C
Sbjct: 275 CIGIFATRDIEEGEELTIDYKFERFGEKPSRCYC 308
>gi|195451627|ref|XP_002073006.1| GK13904 [Drosophila willistoni]
gi|194169091|gb|EDW83992.1| GK13904 [Drosophila willistoni]
Length = 267
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 63/134 (47%), Gaps = 25/134 (18%)
Query: 84 ATRKWEKNDNIAYL--VGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYI 141
+ R+ E ND++ Y+ V C+ E EE +++ +++ CR+ Y+
Sbjct: 151 SQRRIEVNDSLGYMNYVLCLKEFASEE--------ISEVTIVDPCRRG------NIGRYL 196
Query: 142 NHDCRPNCRFVATERDTACVKV----LRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIR 197
NH C+PNC+ +A + K+ RDI EE+ CF+ +G + + +C+I
Sbjct: 197 NHSCQPNCQIMAVRVECPIPKIGIFASRDIHALEEL-CFH----YGGEETRPKTGSCKIC 251
Query: 198 MKGAFAQSSVEPQT 211
+ G+ S P T
Sbjct: 252 LCGSLNCSGFMPNT 265
>gi|451998004|gb|EMD90469.1| hypothetical protein COCHEDRAFT_1179307 [Cochliobolus
heterostrophus C5]
Length = 1062
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 32/126 (25%), Positives = 55/126 (43%), Gaps = 25/126 (19%)
Query: 68 PCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCR 127
P Y + +VA +++ + LVG I + G+ V+ S
Sbjct: 435 PVQVYRTPDGKNNGLVALTSFDRGTGVGELVGLITK------------GMRHLDVIDSST 482
Query: 128 KNCA-QLWLGPAA----YINHDCRPNCR-----FVATERDTACVKVLRDIEQGEEITCFY 177
++ + Q+W G +INH C+PN + ++ T+R V V + IE G EIT Y
Sbjct: 483 ESTSYQIWQGRVGNYTRFINHSCKPNAQCSTFTWLDTQR---VVLVSKGIEAGSEITLDY 539
Query: 178 GEDFFG 183
G+ ++
Sbjct: 540 GDKYWA 545
>gi|225555248|gb|EEH03540.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 803
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 56/129 (43%), Gaps = 13/129 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLND---FSVMYSCRKNCAQL 133
+RG + + R + N I G I +T+EE +R + T D + +MY +
Sbjct: 437 DRGYGVRSNRTFAPNQIIVEYTGEI--ITQEECERRMRTVYKDNECYYLMYFDQNMIIDA 494
Query: 134 WLGP-AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN-- 186
G A ++NH C PNCR VA + A I GEE+T Y D + KN
Sbjct: 495 TRGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGENGIMTGEELTYDYNFDPYSQKNVQ 554
Query: 187 -CLCECDTC 194
C C TC
Sbjct: 555 QCRCGVPTC 563
>gi|170050212|ref|XP_001859676.1| mixed-lineage leukemia protein [Culex quinquefasciatus]
gi|167871728|gb|EDS35111.1| mixed-lineage leukemia protein [Culex quinquefasciatus]
Length = 2977
Score = 42.7 bits (99), Expect = 0.15, Method: Composition-based stats.
Identities = 36/130 (27%), Positives = 56/130 (43%), Gaps = 9/130 (6%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G + + E + + N M+ ++ A L
Sbjct: 2847 QGLGLYAARDLEKHTMVIEYIGEVIRVEVSELREKQYEARNRGIYMFRLDEDRVVDATLS 2906
Query: 135 LGPAAYINHDCRPNC--RFVATERDTACVKVL-RDIEQGEEITCFYGEDFFGDK---NCL 188
G A YINH C PNC V ER+ + R I +GEE++ Y D D +C+
Sbjct: 2907 GGLARYINHSCNPNCVTEIVEVEREVRIIIFAKRRINRGEELSYDYKFDIEDDAHKISCM 2966
Query: 189 CECDTCEIRM 198
C C+ M
Sbjct: 2967 CGAPNCKKWM 2976
>gi|71665802|ref|XP_819867.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70885187|gb|EAN98016.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 937
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 26/49 (53%), Gaps = 9/49 (18%)
Query: 138 AAYINHDCRPNCRFVATERDTAC---------VKVLRDIEQGEEITCFY 177
A +INH C PN FV +E + AC V L DIE GEEIT Y
Sbjct: 544 AGFINHSCSPNALFVFSEEEGACCSSGGGVLRVVALCDIEPGEEITVTY 592
>gi|337269869|ref|YP_004613924.1| nuclear protein SET [Mesorhizobium opportunistum WSM2075]
gi|336030179|gb|AEH89830.1| nuclear protein SET [Mesorhizobium opportunistum WSM2075]
Length = 129
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 44/90 (48%), Gaps = 14/90 (15%)
Query: 105 TKEEEKRLLHTGLNDFSVMYSCRKNCAQLWL--GPAAYINHDCRPNCRFVATERDTACVK 162
T E E + L+ +S Y R++ A + A Y+NHD PNC + E A
Sbjct: 45 TYEGETGPVKNYLDRYS--YPDRRDPAYIVFEADDARYMNHDDEPNCDVSSPEETYA--- 99
Query: 163 VLRDIEQGEEITCFY------GEDFFGDKN 186
LRDI GEE+TC Y G DF GD++
Sbjct: 100 -LRDIAAGEELTCNYNLFFETGFDFLGDRH 128
>gi|119490907|ref|ZP_01623190.1| hypothetical protein L8106_18562 [Lyngbya sp. PCC 8106]
gi|119453725|gb|EAW34884.1| hypothetical protein L8106_18562 [Lyngbya sp. PCC 8106]
Length = 161
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 37/65 (56%), Gaps = 6/65 (9%)
Query: 140 YINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY-GEDFFGDK--NCLCECDTCEI 196
YINH C PN F +R V L++IE GEE T FY ++ D+ +C+C+ + C
Sbjct: 76 YINHSCNPNVFFDTQKR---VVIALQNIEIGEEFTFFYPSTEWSMDRGFDCICQSENCLG 132
Query: 197 RMKGA 201
++GA
Sbjct: 133 YIQGA 137
>gi|212544714|ref|XP_002152511.1| histone-lysine N-methyltransferase (Ash1), putative [Talaromyces
marneffei ATCC 18224]
gi|210065480|gb|EEA19574.1| histone-lysine N-methyltransferase (Ash1), putative [Talaromyces
marneffei ATCC 18224]
Length = 865
Score = 42.7 bits (99), Expect = 0.16, Method: Composition-based stats.
Identities = 50/190 (26%), Positives = 75/190 (39%), Gaps = 25/190 (13%)
Query: 76 GNRGARIVATRKWEKNDNIAYLVGCIAELTKEE-EKRLLHTGLND--FSVMYSCRKNCAQ 132
+RG + + R +E N I G I +T+ E E+R+ N+ + +MY +
Sbjct: 516 ADRGYGVRSNRTFEPNQIIVEYTGEI--ITQNECERRMRSVYKNNECYYLMYFDQNMIID 573
Query: 133 LWLGPAA-YINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN- 186
G A ++NH C PNCR V + A R I GEE+T Y D + KN
Sbjct: 574 ATRGSIARFVNHSCAPNCRMEKWTVGGKPRMALFAGDRGIMTGEELTYDYNFDPYSQKNV 633
Query: 187 --CLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLHKKLNRLLLASDK 244
C C TC +G +P +R++ K L KK L + +
Sbjct: 634 QQCRCGAPTC----RGVLG--------PRPKEREIRESKAEQKKEALQKKAKGALAGTKR 681
Query: 245 NDTNSSDNSS 254
D S+
Sbjct: 682 KLGGVLDEST 691
>gi|321469515|gb|EFX80495.1| hypothetical protein DAPPUDRAFT_51465 [Daphnia pulex]
Length = 927
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 43/127 (33%), Positives = 57/127 (44%), Gaps = 12/127 (9%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCI--AELTKEEEKRLLHTGLNDFSVMYSC---RKNCAQ 132
+G + A R EK+ + +G + AEL + EKR + N M+ R A
Sbjct: 798 QGLGLYAARDIEKHTMVIEYIGEMIRAELAECREKR--YEAANRGIYMFRLDEQRVIDAT 855
Query: 133 LWLGPAAYINHDCRPNC--RFVATERDTACVKVL-RDIEQGEEITCFYGEDFFGDKN--C 187
L G A YINH C PNC V ERD + R I +GEE++ Y D D C
Sbjct: 856 LCGGLARYINHSCGPNCVAEAVEVERDLRIIIFASRRIARGEELSYDYKFDIEDDHKIPC 915
Query: 188 LCECDTC 194
LC +C
Sbjct: 916 LCGAASC 922
>gi|119479621|ref|XP_001259839.1| SET domain protein [Neosartorya fischeri NRRL 181]
gi|119407993|gb|EAW17942.1| SET domain protein [Neosartorya fischeri NRRL 181]
Length = 443
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 35/137 (25%), Positives = 65/137 (47%), Gaps = 8/137 (5%)
Query: 64 PELKPCYRYSLE-GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEK--RLLHTGLNDF 120
P L C+ ++ +RG + A +++K D + VG + + ++++ LLH +
Sbjct: 297 PTLAMCFVELVDYPHRGIGVRALARFKKGDILDEYVGELRPVDYKDDRVYALLHE--SKM 354
Query: 121 SVMYSCRKNCAQLWLGPAAYINHDCRPNCRFV---ATERDTACVKVLRDIEQGEEITCFY 177
S + A+ Y+NH C+P+ F+ +R V+ +RDI+ EEIT Y
Sbjct: 355 SEGHPLALISAKRHGNWTRYLNHSCKPSTTFLKMTVGKRTIMAVQAVRDIDIFEEITVDY 414
Query: 178 GEDFFGDKNCLCECDTC 194
G ++ ++ LC C
Sbjct: 415 GTGYWKNRKGLCGEPGC 431
>gi|408790805|ref|ZP_11202416.1| SET domain protein [Leptospira meyeri serovar Hardjo str. Went 5]
gi|408464834|gb|EKJ88558.1| SET domain protein [Leptospira meyeri serovar Hardjo str. Went 5]
Length = 145
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 50/112 (44%), Gaps = 12/112 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCI-----AELTKEEEKRLLHTGLNDFSVMYSCRKNCA 131
N G + + K D + Y +G I AE K + + L D+ + R++
Sbjct: 32 NIGMGLFTKQTLYKGDTVGYYMGKIITDEQAESNKYVDSKYLLWICKDWWIYGEGRESNY 91
Query: 132 QLWLGPAAYINHDCRPNCRFVATER-DTACVKVLRDIEQGEEITCFYGEDFF 182
YINH +PN + + R TA KVL+ I +G EI YG+D++
Sbjct: 92 ------TRYINHSSKPNAELITSVRWKTARFKVLKTIPEGGEIFFDYGKDYW 137
>gi|367032576|ref|XP_003665571.1| hypothetical protein MYCTH_54709 [Myceliophthora thermophila ATCC
42464]
gi|347012842|gb|AEO60326.1| hypothetical protein MYCTH_54709 [Myceliophthora thermophila ATCC
42464]
Length = 607
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 51/126 (40%), Gaps = 19/126 (15%)
Query: 133 LWLGPAAYINHDCRPNCRFVATER----DTACVKVLRDIEQGEEITCFYG--EDFFGDKN 186
LW+ AAY NH C PN TER D V+ LRDI GEE+ Y D+ G +
Sbjct: 460 LWIR-AAYFNHSCVPN-----TEREFIGDLIVVRALRDIAAGEELVQSYDVTGDYEGRRE 513
Query: 187 CL-------CECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLHKKLNRLL 239
L C C CE A + + + L+ +KR K RL+
Sbjct: 514 ALMTTWGFECNCALCEAERTDDVAVREKRSKLAKEADEFLKTVGTLASKRLAIAKARRLV 573
Query: 240 LASDKN 245
A D+
Sbjct: 574 AAIDET 579
>gi|198413251|ref|XP_002126315.1| PREDICTED: transcription factor protein, partial [Ciona intestinalis]
Length = 1130
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 53/126 (42%), Gaps = 9/126 (7%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A+R E + +G I E++ + N M+ + A +
Sbjct: 1000 QGLGLFASRDIEPGTMVIEYIGDIIRSEVAEKREKNYEAANRGVYMFRLDSDYIVDATVT 1059
Query: 135 LGPAAYINHDCRPNC--RFVATERDTACVKVL-RDIEQGEEITCFYGEDFFGDKN---CL 188
GPA YINH C PNC V E++ + + R I GEE+ Y DF + N CL
Sbjct: 1060 GGPARYINHSCNPNCVAEVVNFEKEKKIMIISNRHILSGEELNYDYKFDFEDEGNKIPCL 1119
Query: 189 CECDTC 194
C C
Sbjct: 1120 CGAINC 1125
>gi|156374111|ref|XP_001629652.1| predicted protein [Nematostella vectensis]
gi|156216657|gb|EDO37589.1| predicted protein [Nematostella vectensis]
Length = 463
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 55/126 (43%), Gaps = 9/126 (7%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R E + +G I +K ++ N M+ + A +
Sbjct: 333 QGLGLFANRDMEPGCMVIEYIGSIIRNEVANKKESIYESQNRGIYMFRIDSDSVIDATIA 392
Query: 135 LGPAAYINHDCRPNC--RFVATERD-TACVKVLRDIEQGEEITCFYGEDFFGDK---NCL 188
GPA YINH C PNC V E++ + R IE+GEE+T Y DF D+ +CL
Sbjct: 393 GGPARYINHSCMPNCVAEVVTFEKEQKIIIISSRKIEKGEELTYDYKFDFEDDEHKISCL 452
Query: 189 CECDTC 194
C C
Sbjct: 453 CGAPNC 458
>gi|357580553|sp|P0CY36.1|Y1806_PHANO RecName: Full=SET domain-containing protein SNOG_11806
Length = 605
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 39/153 (25%), Positives = 63/153 (41%), Gaps = 29/153 (18%)
Query: 60 IQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLND 119
++ P YR + N G ++A +E+ + LVG L+ G+
Sbjct: 468 VKRKPPPVQIYRTAEGKNNG--LIALTAFERGTAVGELVG------------LITNGVRH 513
Query: 120 FSVMYSCRKNCA-QLWLGPAA----YINHDCRPNCR-----FVATERDTACVKVLRDIEQ 169
VM Q+W G + NH C+ N + ++ T+R + V + IE
Sbjct: 514 LDVMEGSTPLAKYQIWQGREGNYTRFANHSCKANAQTSNFTWLDTQR---VILVSKGIEA 570
Query: 170 GEEITCFYGEDFFG--DKNCLCECDTCEIRMKG 200
G EIT YG+ ++ DK+CLC C + G
Sbjct: 571 GSEITVDYGDKYWAGLDKSCLCGETCCRYKRDG 603
>gi|358059417|dbj|GAA94823.1| hypothetical protein E5Q_01477 [Mixia osmundae IAM 14324]
Length = 202
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 33/66 (50%), Gaps = 4/66 (6%)
Query: 140 YINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCFYGE---DFFGDKNCLCECDTCE 195
Y+NH C PN F + +R V+ L DI QGE +T Y D +C+C C
Sbjct: 80 YVNHSCDPNVAFELPKDRSRWLVRALTDIPQGEALTFAYYSTEWDMAQPFDCICGTSKCL 139
Query: 196 IRMKGA 201
R++GA
Sbjct: 140 GRIQGA 145
>gi|157109807|ref|XP_001650833.1| mixed-lineage leukemia protein, mll [Aedes aegypti]
gi|108878935|gb|EAT43160.1| AAEL005380-PA [Aedes aegypti]
Length = 2874
Score = 42.7 bits (99), Expect = 0.19, Method: Composition-based stats.
Identities = 37/130 (28%), Positives = 55/130 (42%), Gaps = 9/130 (6%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G + E + + N M+ ++ A L
Sbjct: 2744 QGLGLYAARDLEKHTMVIEYIGEVIRTEVSELREKQYEARNRGIYMFRLDEDRVVDATLS 2803
Query: 135 LGPAAYINHDCRPNC--RFVATERDTACVKVL-RDIEQGEEITCFYGEDFFGDK---NCL 188
G A YINH C PNC V ERD + R I +GEE++ Y D D +C+
Sbjct: 2804 GGLARYINHSCNPNCVTETVEVERDLRIIIFAKRRINRGEELSYDYKFDIEDDAHKISCM 2863
Query: 189 CECDTCEIRM 198
C C+ M
Sbjct: 2864 CGAPNCKKWM 2873
>gi|406707491|ref|YP_006757843.1| SET domain-containing protein [alpha proteobacterium HIMB59]
gi|406653267|gb|AFS48666.1| SET domain protein [alpha proteobacterium HIMB59]
Length = 166
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 30/59 (50%), Gaps = 3/59 (5%)
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDF--FGDKNCLCECDTC 194
A YINH C PNC R + +++I++GEE+T YG D F D C C C
Sbjct: 85 AKYINHSCDPNCE-TDIVRGKIWINAIKNIKKGEELTYDYGYDMECFEDHPCRCGSKNC 142
>gi|19075312|ref|NP_587812.1| histone lysine methyltransferase Set1 [Schizosaccharomyces pombe
972h-]
gi|74698592|sp|Q9Y7R4.1|SET1_SCHPO RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component set1; AltName:
Full=Lysine N-methyltransferase 2; AltName: Full=SET
domain-containing protein 1; AltName: Full=Set1 complex
component set1; Short=Set1C component set1; AltName:
Full=Spset1
gi|4704279|emb|CAB41652.1| histone lysine methyltransferase Set1 [Schizosaccharomyces pombe]
Length = 920
Score = 42.4 bits (98), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 53/122 (43%), Gaps = 10/122 (8%)
Query: 82 IVATRKWEKNDNIAYLVGCI--AELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLWLG 136
+ A +KND + +G I + EK + G+ D S ++ ++ A
Sbjct: 795 LFAMENIDKNDMVIEYIGEIIRQRVADNREKNYVREGIGD-SYLFRIDEDVIVDATKKGN 853
Query: 137 PAAYINHDCRPNC--RFVATE-RDTACVKVLRDIEQGEEITCFYGEDFFGDK-NCLCECD 192
A +INH C PNC R + E + + RDI GEE+T Y DK CLC
Sbjct: 854 IARFINHSCAPNCIARIIRVEGKRKIVIYADRDIMHGEELTYDYKFPEEADKIPCLCGAP 913
Query: 193 TC 194
TC
Sbjct: 914 TC 915
>gi|348670644|gb|EGZ10465.1| hypothetical protein PHYSODRAFT_518318 [Phytophthora sojae]
Length = 437
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 53/134 (39%), Gaps = 20/134 (14%)
Query: 82 IVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGP-AAY 140
+++ K EK + A L +L E K N FSV + L P A
Sbjct: 148 LLSRYKAEKREKNATLDELHRDLELEILKLFGRVNCNAFSVANEFTNEAVGIGLYPEGAL 207
Query: 141 INHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY--------------GEDFFGDKN 186
NHDC PNC R+ V+V+RDIE EE+T Y E +F D
Sbjct: 208 FNHDCDPNCVVSFKGREMQ-VRVVRDIEVDEELTVSYVELLQSTKARRRELKESYFFD-- 264
Query: 187 CLCECDTCEIRMKG 200
CEC C+ G
Sbjct: 265 --CECKRCKAATNG 276
>gi|383848022|ref|XP_003699651.1| PREDICTED: uncharacterized protein LOC100881339 [Megachile rotundata]
Length = 4805
Score = 42.4 bits (98), Expect = 0.21, Method: Composition-based stats.
Identities = 41/127 (32%), Positives = 57/127 (44%), Gaps = 11/127 (8%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCI--AEL--TKEEEKRLLHTGLNDFSVMYSCRKNCAQL 133
+G + A R EK+ + +G I EL T+E++ + G+ F + R A L
Sbjct: 4675 QGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKQYEARNRGIYMFRLDEE-RVVDATL 4733
Query: 134 WLGPAAYINHDCRPNC--RFVATERDTACVKVL-RDIEQGEEITCFYGEDFFGDKN---C 187
G A YINH C PNC V ERD + R I +GEE+ Y D D++ C
Sbjct: 4734 CGGLARYINHSCNPNCVAEIVEVERDLRIIIFAKRRISRGEELAYDYKFDIEDDQHKIAC 4793
Query: 188 LCECDTC 194
C C
Sbjct: 4794 ACGAPNC 4800
>gi|350405219|ref|XP_003487363.1| PREDICTED: hypothetical protein LOC100745609 [Bombus impatiens]
Length = 5619
Score = 42.4 bits (98), Expect = 0.21, Method: Composition-based stats.
Identities = 41/127 (32%), Positives = 57/127 (44%), Gaps = 11/127 (8%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCI--AEL--TKEEEKRLLHTGLNDFSVMYSCRKNCAQL 133
+G + A R EK+ + +G I EL T+E++ + G+ F + R A L
Sbjct: 5489 QGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKQYEARNRGIYMFRLDEE-RVVDATL 5547
Query: 134 WLGPAAYINHDCRPNC--RFVATERDTACVKVL-RDIEQGEEITCFYGEDFFGDKN---C 187
G A YINH C PNC V ERD + R I +GEE+ Y D D++ C
Sbjct: 5548 CGGLARYINHSCNPNCVAEIVEVERDLRIIIFAKRRISRGEELAYDYKFDIEDDQHKIAC 5607
Query: 188 LCECDTC 194
C C
Sbjct: 5608 ACGAPNC 5614
>gi|348679810|gb|EGZ19626.1| hypothetical protein PHYSODRAFT_496640 [Phytophthora sojae]
Length = 170
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 32/61 (52%), Gaps = 5/61 (8%)
Query: 140 YINHDCRPNCRFVAT---ERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEI 196
++NH C+P +FV R T V +DI +GEE+T YG+D + C C D C
Sbjct: 104 FVNHSCQPVAKFVEVANGRRTTVAVASTQDIHRGEEVTVDYGDDLW--FVCSCGLDGCRH 161
Query: 197 R 197
R
Sbjct: 162 R 162
>gi|340726153|ref|XP_003401426.1| PREDICTED: hypothetical protein LOC100646364 [Bombus terrestris]
Length = 5622
Score = 42.4 bits (98), Expect = 0.21, Method: Composition-based stats.
Identities = 41/127 (32%), Positives = 57/127 (44%), Gaps = 11/127 (8%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCI--AEL--TKEEEKRLLHTGLNDFSVMYSCRKNCAQL 133
+G + A R EK+ + +G I EL T+E++ + G+ F + R A L
Sbjct: 5492 QGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKQYEARNRGIYMFRLDEE-RVVDATL 5550
Query: 134 WLGPAAYINHDCRPNC--RFVATERDTACVKVL-RDIEQGEEITCFYGEDFFGDKN---C 187
G A YINH C PNC V ERD + R I +GEE+ Y D D++ C
Sbjct: 5551 CGGLARYINHSCNPNCVAEIVEVERDLRIIIFAKRRISRGEELAYDYKFDIEDDQHKIAC 5610
Query: 188 LCECDTC 194
C C
Sbjct: 5611 ACGAPNC 5617
>gi|322792928|gb|EFZ16758.1| hypothetical protein SINV_09282 [Solenopsis invicta]
Length = 3429
Score = 42.4 bits (98), Expect = 0.21, Method: Composition-based stats.
Identities = 41/127 (32%), Positives = 57/127 (44%), Gaps = 11/127 (8%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCI--AEL--TKEEEKRLLHTGLNDFSVMYSCRKNCAQL 133
+G + A R EK+ + +G I EL T+E++ + G+ F + R A L
Sbjct: 3299 QGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKQYEARNRGIYMFRLDEE-RVVDATL 3357
Query: 134 WLGPAAYINHDCRPNC--RFVATERDTACVKVL-RDIEQGEEITCFYGEDFFGDKN---C 187
G A YINH C PNC V ERD + R I +GEE+ Y D D++ C
Sbjct: 3358 CGGLARYINHSCNPNCVAEIVEVERDLRIIIFAKRRISRGEELAYDYKFDIEDDQHKIAC 3417
Query: 188 LCECDTC 194
C C
Sbjct: 3418 ACGAPNC 3424
>gi|332710212|ref|ZP_08430163.1| SET domain protein [Moorea producens 3L]
gi|332350991|gb|EGJ30580.1| SET domain protein [Moorea producens 3L]
Length = 165
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 37/65 (56%), Gaps = 6/65 (9%)
Query: 140 YINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY-GEDFFGDK--NCLCECDTCEI 196
Y+NH C PN F ++R ++ + IE GEE+T FY ++ D+ +C+C+ C
Sbjct: 79 YLNHSCAPNVFFAVSDR---VLRAMTKIEIGEELTFFYPSTEWSMDRGFDCICQSKDCLG 135
Query: 197 RMKGA 201
++GA
Sbjct: 136 TIRGA 140
>gi|317029965|ref|XP_001391607.2| hypothetical protein ANI_1_1796064 [Aspergillus niger CBS 513.88]
Length = 730
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 30/58 (51%), Gaps = 3/58 (5%)
Query: 140 YINHDCRPNCRF---VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTC 194
+INH CR N F +R V+ LRDI+ EE+T YG ++ + C C + C
Sbjct: 534 FINHSCRANTTFERRTIGDRAMMTVEALRDIQPFEELTVNYGRAYWKGRTCECGEEGC 591
>gi|429853222|gb|ELA28312.1| TPR domain-containing protein [Colletotrichum gloeosporioides Nara
gc5]
Length = 643
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 36/68 (52%), Gaps = 8/68 (11%)
Query: 119 DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCR--FVATERDTACVKVLRDIEQGEEITCF 176
D S + K LW+ AAYINH C PN + FV D ++ LR I++GEEI F
Sbjct: 484 DDSHLGGVSKPSTGLWV-RAAYINHSCVPNAKREFVG---DLMVIRSLRKIKKGEEI--F 537
Query: 177 YGEDFFGD 184
+ D GD
Sbjct: 538 HSYDESGD 545
>gi|327348240|gb|EGE77097.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ATCC
18188]
Length = 1280
Score = 42.4 bits (98), Expect = 0.22, Method: Composition-based stats.
Identities = 41/149 (27%), Positives = 62/149 (41%), Gaps = 13/149 (8%)
Query: 59 RIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVG--CIAELTKEEEKRLLHTG 116
R L + K R++ + A ND I VG ++ E+R L +G
Sbjct: 1129 RFNQLKKRKKPVRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVANMRERRYLKSG 1188
Query: 117 LNDFSVMYSCRKNC---AQLWLGPAAYINHDCRPNCRFVATERDTA---CVKVLRDIEQG 170
+ S ++ +N A G A +INH C PNC + D + + LRDIE+
Sbjct: 1189 IGS-SYLFRIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERD 1247
Query: 171 EEITCFYGEDFFGDKN----CLCECDTCE 195
EE+T Y + D + CLC C+
Sbjct: 1248 EELTYDYKFEREWDSDDRIPCLCGSTGCK 1276
>gi|259482394|tpe|CBF76835.1| TPA: histone-lysine N-methyltransferase (Ash1), putative
(AFU_orthologue; AFUA_3G06480) [Aspergillus nidulans
FGSC A4]
Length = 812
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 57/127 (44%), Gaps = 18/127 (14%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEE-EKRLLHT-GLNDFSVMYSCRKNCAQLW 134
+RG + + R +E N I G I +T+ E EKR+ N+ ++ + R + A+
Sbjct: 479 DRGYGVRSNRTFEPNQIIVEYTGEI--ITQAECEKRMRTIYKKNENMIIDATRGSIAR-- 534
Query: 135 LGPAAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN---C 187
++NH C PNCR VA + A R I GEE+T Y D + KN C
Sbjct: 535 -----FVNHGCEPNCRMEKWTVAGKPRMALFAGDRGIMTGEELTYDYNFDPYSQKNVQQC 589
Query: 188 LCECDTC 194
C C
Sbjct: 590 RCGSSKC 596
>gi|339244151|ref|XP_003378001.1| putative SET domain protein [Trichinella spiralis]
gi|316973127|gb|EFV56754.1| putative SET domain protein [Trichinella spiralis]
Length = 1416
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 50/126 (39%), Gaps = 9/126 (7%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R E N + VG + E + + N M+ + A +
Sbjct: 1286 QGLGLFANRDVEMNAMVIEYVGEVIRNEVAERREKSYQKRNRGVYMFRLDSDHVIDATVA 1345
Query: 135 LGPAAYINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN---CL 188
GPA YINH C PNC R + R I +GEE+T Y DF + N CL
Sbjct: 1346 GGPARYINHSCDPNCIAERIDFDRESRIVIMSCRPICKGEELTYDYQFDFEDELNKLPCL 1405
Query: 189 CECDTC 194
C C
Sbjct: 1406 CRAPNC 1411
>gi|400598429|gb|EJP66146.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 638
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 46/103 (44%), Gaps = 20/103 (19%)
Query: 99 GCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCA-QLWLGPAA----YINHDCRPNCRF-- 151
G I E L+ + D VM K + Q+W G +INH C PN +F
Sbjct: 508 GAIGEFVG-----LVTKDIEDLDVMKCLTKGRSYQIWQGRQGNFTRFINHSCNPNAQFQH 562
Query: 152 ---VATERDTACVKVLRDIEQGEEITCFYGEDFFG--DKNCLC 189
++TER + V + I G EIT Y E ++ +K CLC
Sbjct: 563 FVWLSTER---IILVSKGIAAGSEITVDYSESYWSGLNKRCLC 602
>gi|134076084|emb|CAK39443.1| unnamed protein product [Aspergillus niger]
Length = 821
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 30/58 (51%), Gaps = 3/58 (5%)
Query: 140 YINHDCRPNCRF---VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTC 194
+INH CR N F +R V+ LRDI+ EE+T YG ++ + C C + C
Sbjct: 539 FINHSCRANTTFERRTIGDRAMMTVEALRDIQPFEELTVNYGRAYWKGRTCECGEEGC 596
>gi|115450985|ref|NP_001049093.1| Os03g0168700 [Oryza sativa Japonica Group]
gi|19071649|gb|AAL84316.1|AC073556_33 hypothetical protein [Oryza sativa Japonica Group]
gi|108706390|gb|ABF94185.1| TPR Domain containing protein [Oryza sativa Japonica Group]
gi|113547564|dbj|BAF11007.1| Os03g0168700 [Oryza sativa Japonica Group]
Length = 536
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 25/50 (50%), Positives = 28/50 (56%), Gaps = 3/50 (6%)
Query: 129 NCA-QLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
NC LW+ PA +INH C PN R D A V RDI+ GEEIT Y
Sbjct: 309 NCGVGLWILPA-FINHSCHPNARRTHVG-DHAIVHASRDIKAGEEITFAY 356
>gi|432114829|gb|ELK36567.1| Putative histone-lysine N-methyltransferase NSD2 [Myotis davidii]
Length = 1037
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 56/125 (44%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VATR K + + VG EL EEE R+ H ND + Y + ++
Sbjct: 686 GKGWGLVATRDIRKGEFVNEYVG---ELIDEEECMARIKHAQENDITHFYMLTIDKDRII 742
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 743 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 802
Query: 187 CLCEC 191
+C C
Sbjct: 803 TVCRC 807
>gi|302664384|ref|XP_003023822.1| histone-lysine N-methyltransferase (Ash1), putative [Trichophyton
verrucosum HKI 0517]
gi|291187840|gb|EFE43204.1| histone-lysine N-methyltransferase (Ash1), putative [Trichophyton
verrucosum HKI 0517]
Length = 709
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 81/182 (44%), Gaps = 33/182 (18%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTG--------LNDFS---VMYS 125
+RG + + R +E N I G I LT+ E +R + T L DF ++ +
Sbjct: 433 SRGYGVRSNRTFEPNQIIVEYTGEI--LTQIEAQRRMKTIYKKNECFYLMDFDQDMIIDA 490
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDF 181
R + A+ ++NH C PNC+ VA + A I GEE+T Y D
Sbjct: 491 TRGSIAR-------FVNHSCEPNCKMEKWIVAGKPRIALFAGDNGIMTGEELTYDYNFDP 543
Query: 182 FGDKN---CLCECDTCE----IRMKG--AFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLH 232
+ +KN C C +C R KG + +S+V+P +T+ R T+ + +K++
Sbjct: 544 YSNKNVQECRCGTPSCRGVLGPRPKGKDSKEKSTVKPASTRTKRKSQGVTNSQASKKQKT 603
Query: 233 KK 234
K
Sbjct: 604 TK 605
>gi|125542564|gb|EAY88703.1| hypothetical protein OsI_10178 [Oryza sativa Indica Group]
Length = 536
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 25/50 (50%), Positives = 28/50 (56%), Gaps = 3/50 (6%)
Query: 129 NCA-QLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
NC LW+ PA +INH C PN R D A V RDI+ GEEIT Y
Sbjct: 309 NCGVGLWILPA-FINHSCHPNARRTHVG-DHAIVHASRDIKAGEEITFAY 356
>gi|380493696|emb|CCF33689.1| SET domain-containing protein [Colletotrichum higginsianum]
Length = 833
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 47/102 (46%), Gaps = 8/102 (7%)
Query: 138 AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN---CLCE 190
A ++NH C PNCR V+ + A + I GEE+T Y F D+N CLC
Sbjct: 550 ARFVNHSCAPNCRMIKWIVSGQPRMALFAGDKPIMTGEELTYDYNFSPFSDENVQTCLCG 609
Query: 191 CDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLH 232
C + G Q +P+ + + + + ++ +KRK+
Sbjct: 610 APNCR-GILGPKPQEVKQPKPPKSTLKAVVKSAVKASKRKIE 650
>gi|281202822|gb|EFA77024.1| hypothetical protein PPL_09776 [Polysphondylium pallidum PN500]
Length = 1187
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 32/73 (43%), Gaps = 10/73 (13%)
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGE----------DFFGDKNC 187
AAY+NH C PN + + +RDI+ GEEIT Y + +
Sbjct: 940 AAYLNHSCEPNTVWHNDNNGMIAYRSIRDIKAGEEITTTYIDITKYKSTRQLNLLSQYAF 999
Query: 188 LCECDTCEIRMKG 200
LC+C C+ R G
Sbjct: 1000 LCQCARCQDRATG 1012
>gi|71664694|ref|XP_819325.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70884621|gb|EAN97474.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 504
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 25/41 (60%), Gaps = 1/41 (2%)
Query: 138 AAYINHDCRPNCRFVATE-RDTACVKVLRDIEQGEEITCFY 177
A INH C P+ RFV T R A V LRDIE+GEEI Y
Sbjct: 432 ATKINHSCVPSVRFVPTHGRVGAVVVALRDIEKGEEIRSSY 472
>gi|310794022|gb|EFQ29483.1| SET domain-containing protein [Glomerella graminicola M1.001]
Length = 828
Score = 42.4 bits (98), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 48/178 (26%), Positives = 74/178 (41%), Gaps = 18/178 (10%)
Query: 70 YRYSLE----GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLND---FSV 122
YR +E +RG I A R +E N I G I +T EE + D + +
Sbjct: 462 YRVGVEVIKTSDRGYGIRANRCFEPNQIIMEYTGEI--ITDEECSERMENKYKDSKCYYL 519
Query: 123 MYSCRKNCAQLWLGPAA-YINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFY 177
M + G A ++NH C PNCR V+ + A + I GEE+T Y
Sbjct: 520 MSFDQNMIIDATKGSIARFVNHSCAPNCRMIKWIVSGQPRMALFAGDKPIMTGEELTYDY 579
Query: 178 GEDFFGDKN---CLCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLH 232
F D+N CLC C + G Q +P+ + + + + ++ +KRK+
Sbjct: 580 NFSPFSDENVQKCLCGAPNCR-GILGPKPQEVKQPKPPKSTLKAVVKSAVKASKRKIE 636
>gi|407425989|gb|EKF39557.1| hypothetical protein MOQ_000211 [Trypanosoma cruzi marinkellei]
Length = 504
Score = 42.4 bits (98), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 26/41 (63%), Gaps = 1/41 (2%)
Query: 138 AAYINHDCRPNCRFVATE-RDTACVKVLRDIEQGEEITCFY 177
A INH C P+ RF+ T R +A V LRDIE+GEEI Y
Sbjct: 432 ATKINHSCVPSVRFIPTHGRVSAVVVALRDIEKGEEIRSSY 472
>gi|338738577|ref|YP_004675539.1| Nuclear protein SET [Hyphomicrobium sp. MC1]
gi|337759140|emb|CCB64967.1| Nuclear protein SET [Hyphomicrobium sp. MC1]
Length = 133
Score = 42.4 bits (98), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 1/45 (2%)
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFF 182
A +INH C PNC V + V LRDI GEE++ YGE++F
Sbjct: 74 ARFINHSCAPNCE-VDIHKRRVYVFALRDIAPGEELSYDYGEEYF 117
>gi|261201264|ref|XP_002627032.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
SLH14081]
gi|239592091|gb|EEQ74672.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis
SLH14081]
gi|239611745|gb|EEQ88732.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ER-3]
Length = 1259
Score = 42.4 bits (98), Expect = 0.25, Method: Composition-based stats.
Identities = 41/149 (27%), Positives = 62/149 (41%), Gaps = 13/149 (8%)
Query: 59 RIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVG--CIAELTKEEEKRLLHTG 116
R L + K R++ + A ND I VG ++ E+R L +G
Sbjct: 1108 RFNQLKKRKKPVRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSG 1167
Query: 117 LNDFSVMYSCRKNC---AQLWLGPAAYINHDCRPNCRFVATERDTA---CVKVLRDIEQG 170
+ S ++ +N A G A +INH C PNC + D + + LRDIE+
Sbjct: 1168 IGS-SYLFRIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERD 1226
Query: 171 EEITCFYGEDFFGDKN----CLCECDTCE 195
EE+T Y + D + CLC C+
Sbjct: 1227 EELTYDYKFEREWDSDDRIPCLCGSTGCK 1255
>gi|328776663|ref|XP_394941.4| PREDICTED: hypothetical protein LOC411466 [Apis mellifera]
Length = 4678
Score = 42.4 bits (98), Expect = 0.25, Method: Composition-based stats.
Identities = 41/127 (32%), Positives = 57/127 (44%), Gaps = 11/127 (8%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCI--AEL--TKEEEKRLLHTGLNDFSVMYSCRKNCAQL 133
+G + A R EK+ + +G I EL T+E++ + G+ F + R A L
Sbjct: 4548 QGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKQYEARNRGIYMFRLDEE-RVVDATL 4606
Query: 134 WLGPAAYINHDCRPNC--RFVATERDTACVKVL-RDIEQGEEITCFYGEDFFGDKN---C 187
G A YINH C PNC V ERD + R I +GEE+ Y D D++ C
Sbjct: 4607 CGGLARYINHSCNPNCVAEIVEVERDLRIIIFAKRRISRGEELAYDYKFDIEDDQHKIAC 4666
Query: 188 LCECDTC 194
C C
Sbjct: 4667 ACGAPNC 4673
>gi|238505954|ref|XP_002384179.1| histone-lysine N-methyltransferase (Ash1), putative [Aspergillus
flavus NRRL3357]
gi|220690293|gb|EED46643.1| histone-lysine N-methyltransferase (Ash1), putative [Aspergillus
flavus NRRL3357]
gi|391868614|gb|EIT77824.1| hypothetical protein Ao3042_05901 [Aspergillus oryzae 3.042]
Length = 789
Score = 42.4 bits (98), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 58/129 (44%), Gaps = 13/129 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEE-EKRLLHTGLND--FSVMYSCRKNCAQL 133
+RG + + R +E N I G I +T+ E EKR+ N+ + +MY +
Sbjct: 441 DRGYGVRSNRTFEPNQVIVEYTGEI--ITQAECEKRMRTIYKNNECYYLMYFDQNMIIDA 498
Query: 134 WLGP-AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN-- 186
G A ++NH C PNCR VA + A R I G+E+T Y D + KN
Sbjct: 499 TRGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDRGIMTGDELTYDYNFDPYSQKNVQ 558
Query: 187 -CLCECDTC 194
C C D C
Sbjct: 559 QCRCGSDRC 567
>gi|350635659|gb|EHA24020.1| hypothetical protein ASPNIDRAFT_180174 [Aspergillus niger ATCC
1015]
Length = 311
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 32/119 (26%), Positives = 48/119 (40%), Gaps = 3/119 (2%)
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
G + A ++EK D +A VG + + T V C +
Sbjct: 193 GVGVRALAQFEKGDILAEFVGPLHPPCYIGDPVYALTMQGKVRVKEEIALICPMEFGNFT 252
Query: 139 AYINHDCRPNCRF---VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTC 194
+INH CR N F +R V+ LRDI+ EE+T YG ++ + C C + C
Sbjct: 253 RFINHSCRANTTFERRTIGDRAMMTVEALRDIQPFEELTVNYGRAYWKGRTCECGEEGC 311
>gi|380024451|ref|XP_003696009.1| PREDICTED: uncharacterized protein LOC100866111 [Apis florea]
Length = 5713
Score = 42.0 bits (97), Expect = 0.26, Method: Composition-based stats.
Identities = 41/127 (32%), Positives = 57/127 (44%), Gaps = 11/127 (8%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCI--AEL--TKEEEKRLLHTGLNDFSVMYSCRKNCAQL 133
+G + A R EK+ + +G I EL T+E++ + G+ F + R A L
Sbjct: 5583 QGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKQYEARNRGIYMFRLDEE-RVVDATL 5641
Query: 134 WLGPAAYINHDCRPNC--RFVATERDTACVKVL-RDIEQGEEITCFYGEDFFGDKN---C 187
G A YINH C PNC V ERD + R I +GEE+ Y D D++ C
Sbjct: 5642 CGGLARYINHSCNPNCVAEIVEVERDLRIIIFAKRRISRGEELAYDYKFDIEDDQHKIAC 5701
Query: 188 LCECDTC 194
C C
Sbjct: 5702 ACGAPNC 5708
>gi|380511418|ref|ZP_09854825.1| hypothetical protein XsacN4_09403 [Xanthomonas sacchari NCPPB 4393]
Length = 155
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 27/48 (56%), Gaps = 7/48 (14%)
Query: 138 AAYINHDCRPNCRFVATE-------RDTACVKVLRDIEQGEEITCFYG 178
A +INH C PNC V E +D ++ LRDI+ GEE+T YG
Sbjct: 75 ARWINHSCAPNCEAVIVEAEGDDRRKDKVVIEALRDIKPGEELTYNYG 122
>gi|295646353|gb|ADG23062.1| myeloid/lymphoid or mixed-lineage leukemia 3, partial [Gadus
morhua]
Length = 117
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 37/71 (52%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATERDTA-CVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A + GPA Y+NH C PNC V ER+ + R I++GEE++ Y D D++
Sbjct: 43 ATITGGPARYVNHSCSPNCITEVVTVERENKIIISSCRRIQRGEELSYDYKFDLEDDQHK 102
Query: 188 L-CECDTCEIR 197
+ C C T R
Sbjct: 103 IPCHCGTLNCR 113
>gi|320036571|gb|EFW18510.1| histone-lysine N-methyltransferase [Coccidioides posadasii str.
Silveira]
Length = 742
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 56/129 (43%), Gaps = 13/129 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLND---FSVMYSCRKNCAQL 133
+RG + + R ++ N I G I LT+EE +R + T + +MY +
Sbjct: 408 DRGYGVRSNRSFDPNQIIVEYTGEI--LTQEECERRMRTVYKKNECYYLMYFDQNMVIDA 465
Query: 134 WLGP-AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN-- 186
G A +INH C PNCR VA + A I GEE+T Y D + KN
Sbjct: 466 TRGSIARFINHSCEPNCRMEKWTVAGKPRMALFAGEDGIMTGEELTYDYNFDPYSQKNVQ 525
Query: 187 -CLCECDTC 194
C C TC
Sbjct: 526 ECRCGAPTC 534
>gi|303313071|ref|XP_003066547.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
gi|240106209|gb|EER24402.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
Length = 742
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 56/129 (43%), Gaps = 13/129 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLND---FSVMYSCRKNCAQL 133
+RG + + R ++ N I G I LT+EE +R + T + +MY +
Sbjct: 408 DRGYGVRSNRSFDPNQIIVEYTGEI--LTQEECERRMRTVYKKNECYYLMYFDQNMVIDA 465
Query: 134 WLGP-AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN-- 186
G A +INH C PNCR VA + A I GEE+T Y D + KN
Sbjct: 466 TRGSIARFINHSCEPNCRMEKWTVAGKPRMALFAGEDGIMTGEELTYDYNFDPYSQKNVQ 525
Query: 187 -CLCECDTC 194
C C TC
Sbjct: 526 ECRCGAPTC 534
>gi|302666919|ref|XP_003025054.1| hypothetical protein TRV_00712 [Trichophyton verrucosum HKI 0517]
gi|291189136|gb|EFE44443.1| hypothetical protein TRV_00712 [Trichophyton verrucosum HKI 0517]
Length = 1376
Score = 42.0 bits (97), Expect = 0.28, Method: Composition-based stats.
Identities = 41/149 (27%), Positives = 62/149 (41%), Gaps = 13/149 (8%)
Query: 59 RIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVG--CIAELTKEEEKRLLHTG 116
R L + K R++ + A ND I VG ++ E+R L +G
Sbjct: 1225 RFNQLKKRKKPVRFARSAIHNWGLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSG 1284
Query: 117 LNDFSVMYSCRKNC---AQLWLGPAAYINHDCRPNCRFVATERDTA---CVKVLRDIEQG 170
+ S ++ +N A G A +INH C PNC + D + + LRDIE+
Sbjct: 1285 IGS-SYLFRIDENTVIDATKHGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERD 1343
Query: 171 EEITCFYGEDFFGDKN----CLCECDTCE 195
EE+T Y + D + CLC C+
Sbjct: 1344 EELTYDYKFEREWDSDDRIPCLCGSTGCK 1372
>gi|348672048|gb|EGZ11868.1| hypothetical protein PHYSODRAFT_250536 [Phytophthora sojae]
Length = 605
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 35/77 (45%), Gaps = 7/77 (9%)
Query: 140 YINHDCRPNCRF--VAT-ERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEI 196
++NH C P F VA R T V D+ QG+EIT YG+D + +C C +
Sbjct: 288 FVNHSCEPVATFHEVANGRRTTVVVATTEDVLQGQEITVDYGDDLW----FVCRCGSESC 343
Query: 197 RMKGAFAQSSVEPQTTQ 213
R + Q P T Q
Sbjct: 344 RHRAIQDQRDPRPLTVQ 360
>gi|154278862|ref|XP_001540244.1| hypothetical protein HCAG_04084 [Ajellomyces capsulatus NAm1]
gi|150412187|gb|EDN07574.1| hypothetical protein HCAG_04084 [Ajellomyces capsulatus NAm1]
Length = 1266
Score = 42.0 bits (97), Expect = 0.29, Method: Composition-based stats.
Identities = 41/149 (27%), Positives = 62/149 (41%), Gaps = 13/149 (8%)
Query: 59 RIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVG--CIAELTKEEEKRLLHTG 116
R L + K R++ + A ND I VG ++ E+R L +G
Sbjct: 1115 RFNQLKKRKKPVRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSG 1174
Query: 117 LNDFSVMYSCRKNC---AQLWLGPAAYINHDCRPNCRFVATERDTA---CVKVLRDIEQG 170
+ S ++ +N A G A +INH C PNC + D + + LRDIE+
Sbjct: 1175 IGS-SYLFRIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERD 1233
Query: 171 EEITCFYGEDFFGDKN----CLCECDTCE 195
EE+T Y + D + CLC C+
Sbjct: 1234 EELTYDYKFEREWDSDDRIPCLCGSTGCK 1262
>gi|301100332|ref|XP_002899256.1| histone-lysine N-methyltransferase, putative [Phytophthora
infestans T30-4]
gi|262104173|gb|EEY62225.1| histone-lysine N-methyltransferase, putative [Phytophthora
infestans T30-4]
Length = 1501
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 47/112 (41%), Gaps = 15/112 (13%)
Query: 140 YINHDCRPNCRFVATERDTA------CVKVLRDIEQGEEITCFYGEDFFGDK--NCLCEC 191
++NH C PNC+ TE+ T V LRDIE GEE+T Y G + C C
Sbjct: 200 FVNHSCEPNCK---TEKWTVKGETRIAVSALRDIEVGEELTFDYKWKALGSRQITCCCGA 256
Query: 192 DTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLHKKLNRLLLASD 243
+C KG + Q + RD + + L + R+ L+ +
Sbjct: 257 PSC----KGVIGTQNETLQNAEAQTGYFRDPEKQETGDALVGRRVRVFLSPE 304
>gi|389774090|ref|ZP_10192271.1| SET domain-containing protein [Rhodanobacter spathiphylli B39]
gi|388438708|gb|EIL95448.1| SET domain-containing protein [Rhodanobacter spathiphylli B39]
Length = 155
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 28/48 (58%), Gaps = 7/48 (14%)
Query: 138 AAYINHDCRPNCRFVATE-------RDTACVKVLRDIEQGEEITCFYG 178
A +INH C PNCR + E RD ++ +R+I+ GEE+T YG
Sbjct: 75 ARWINHSCAPNCRALVEESASGDPRRDKVVIEAIRNIKPGEELTYDYG 122
>gi|71406264|ref|XP_805685.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70869186|gb|EAN83834.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 937
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 25/49 (51%), Gaps = 9/49 (18%)
Query: 138 AAYINHDCRPNCRFVATERDTAC---------VKVLRDIEQGEEITCFY 177
A +INH C PN FV E + AC V L DIE GEEIT Y
Sbjct: 544 AGFINHSCSPNAVFVFGEEEGACCSSRGGVLRVVALCDIEPGEEITVTY 592
>gi|317151143|ref|XP_001824470.2| histone-lysine N-methyltransferase (Ash1) [Aspergillus oryzae
RIB40]
Length = 796
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 58/129 (44%), Gaps = 13/129 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEE-EKRLLHTGLND--FSVMYSCRKNCAQL 133
+RG + + R +E N I G I +T+ E EKR+ N+ + +MY +
Sbjct: 448 DRGYGVRSNRTFEPNQVIVEYTGEI--ITQAECEKRMRTIYKNNECYYLMYFDQNMIIDA 505
Query: 134 WLGP-AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN-- 186
G A ++NH C PNCR VA + A R I G+E+T Y D + KN
Sbjct: 506 TRGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDRGIMTGDELTYDYNFDPYSQKNVQ 565
Query: 187 -CLCECDTC 194
C C D C
Sbjct: 566 QCRCGSDRC 574
>gi|389793438|ref|ZP_10196605.1| nuclear protein SET [Rhodanobacter fulvus Jip2]
gi|388434184|gb|EIL91133.1| nuclear protein SET [Rhodanobacter fulvus Jip2]
Length = 159
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 27/48 (56%), Gaps = 7/48 (14%)
Query: 138 AAYINHDCRPNCRFVATE-------RDTACVKVLRDIEQGEEITCFYG 178
A +INH C PNCR + E RD ++ LR I+ GEE+T YG
Sbjct: 75 ARWINHSCAPNCRALVEENADGDRRRDKVVIEALRKIKPGEELTYDYG 122
>gi|357611223|gb|EHJ67375.1| hypothetical protein KGM_13830 [Danaus plexippus]
Length = 1798
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 39/84 (46%), Gaps = 6/84 (7%)
Query: 136 GPAAYINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFG---DKNCLC 189
G ++NH CRPNC ++ A + LRDIE EE+T Y F + C C
Sbjct: 1029 GDGRFVNHSCRPNCEMQKWTANGTFRMALFALRDIEPDEELTYDYNFSLFNPAVGQPCKC 1088
Query: 190 ECDTCEIRMKGAFAQSSVEPQTTQ 213
+ + C + G + + +P +Q
Sbjct: 1089 DSEDCRGVIGGKSQRITKQPVKSQ 1112
>gi|119192128|ref|XP_001246670.1| hypothetical protein CIMG_00441 [Coccidioides immitis RS]
gi|392864092|gb|EAS35106.2| histone-lysine N-methyltransferase [Coccidioides immitis RS]
Length = 742
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 56/129 (43%), Gaps = 13/129 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLND---FSVMYSCRKNCAQL 133
+RG + + R ++ N I G I LT+EE +R + T + +MY +
Sbjct: 408 DRGYGVRSNRSFDPNQIIVEYTGEI--LTQEECERRMRTVYKKNECYYLMYFDQNMVIDA 465
Query: 134 WLGP-AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN-- 186
G A +INH C PNCR VA + A I GEE+T Y D + KN
Sbjct: 466 TRGSIARFINHSCEPNCRMEKWTVAGKPRMALFAGEDGIMTGEELTYDYNFDPYSQKNVQ 525
Query: 187 -CLCECDTC 194
C C TC
Sbjct: 526 ECRCGAPTC 534
>gi|125585065|gb|EAZ25729.1| hypothetical protein OsJ_09564 [Oryza sativa Japonica Group]
Length = 509
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 25/50 (50%), Positives = 28/50 (56%), Gaps = 3/50 (6%)
Query: 129 NCA-QLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
NC LW+ PA +INH C PN R D A V RDI+ GEEIT Y
Sbjct: 282 NCGVGLWILPA-FINHSCHPNARRTHVG-DHAIVHASRDIKAGEEITFAY 329
>gi|330920531|ref|XP_003299046.1| hypothetical protein PTT_09957 [Pyrenophora teres f. teres 0-1]
gi|311327461|gb|EFQ92879.1| hypothetical protein PTT_09957 [Pyrenophora teres f. teres 0-1]
Length = 791
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 43/140 (30%), Positives = 59/140 (42%), Gaps = 17/140 (12%)
Query: 70 YRYSLE----GNRGARIVATRKWEKNDNIAYLVGCIAELTKEE-EKRLLHTGLND--FSV 122
Y Y +E +RG + A R +E + I G I +T+ E E+R+ D + +
Sbjct: 423 YDYGVEVLDTEDRGYGVRAMRTFEPHQIIVEYAGEI--ITQSECERRMKQVYKKDKCYYL 480
Query: 123 MYSCRKNCAQLWLGPAA-YINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFY 177
M K G A ++NH C PNC V E A R I GEE+T Y
Sbjct: 481 MSFDNKMIIDATRGTIARFVNHSCEPNCEMIKWTVGGEPRMALFAGPRGIMTGEELTYDY 540
Query: 178 GEDFFGDKN---CLCECDTC 194
D F KN C C ++C
Sbjct: 541 NFDPFSQKNIQECRCGTESC 560
>gi|189202688|ref|XP_001937680.1| SET domain containing protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187984779|gb|EDU50267.1| SET domain containing protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 646
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 43/140 (30%), Positives = 59/140 (42%), Gaps = 17/140 (12%)
Query: 70 YRYSLE----GNRGARIVATRKWEKNDNIAYLVGCIAELTKEE-EKRLLHTGLND--FSV 122
Y Y +E +RG + A R +E + I G I +T+ E E+R+ D + +
Sbjct: 393 YDYGVEVLDTEDRGYGVRAMRTFEPHQIIVEYAGEI--ITQSECERRMKQVYKKDKCYYL 450
Query: 123 MYSCRKNCAQLWLGPAA-YINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFY 177
M K G A ++NH C PNC V E A R I GEE+T Y
Sbjct: 451 MSFDNKMIIDATRGTIARFVNHSCEPNCEMIKWTVGGEPRMALFAGPRGIMTGEELTYDY 510
Query: 178 GEDFFGDKN---CLCECDTC 194
D F KN C C ++C
Sbjct: 511 NFDPFSQKNIQECRCGTESC 530
>gi|406868443|gb|EKD21480.1| SET domain-containing protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 796
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 50/118 (42%), Gaps = 12/118 (10%)
Query: 138 AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CECD 192
A ++NH C PNC+ V + A I G+E+T Y D F KN C C
Sbjct: 545 ARFVNHSCNPNCKMVKWIVGGKPRMALFAGDNPIMTGDELTYDYNFDPFSAKNVQECRCG 604
Query: 193 TCEIRMKGAFAQSSVEPQTTQPSRYC-LRD---TDMRLNKRKLHKKLNRLLLASDKND 246
+ R F +P +P Y L+D ++ + + +KLN LL D +D
Sbjct: 605 SSNCR---GFLGPKPKPNAKEPKVYKELKDNFKAGVKAGVKAVKRKLNELLGGDDTDD 659
>gi|352090580|ref|ZP_08954613.1| nuclear protein SET [Rhodanobacter sp. 2APBS1]
gi|389799056|ref|ZP_10202061.1| SET domain-containing protein [Rhodanobacter sp. 116-2]
gi|351676093|gb|EHA59248.1| nuclear protein SET [Rhodanobacter sp. 2APBS1]
gi|388443877|gb|EIM00011.1| SET domain-containing protein [Rhodanobacter sp. 116-2]
Length = 155
Score = 42.0 bits (97), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 27/48 (56%), Gaps = 7/48 (14%)
Query: 138 AAYINHDCRPNCRFVATE-------RDTACVKVLRDIEQGEEITCFYG 178
A +INH C PNCR V E RD ++ +R I+ GEE+T YG
Sbjct: 75 ARWINHSCAPNCRAVVEESASGDPRRDRVLIEAIRAIKPGEELTYDYG 122
>gi|67537188|ref|XP_662368.1| hypothetical protein AN4764.2 [Aspergillus nidulans FGSC A4]
gi|40741616|gb|EAA60806.1| hypothetical protein AN4764.2 [Aspergillus nidulans FGSC A4]
Length = 870
Score = 41.6 bits (96), Expect = 0.34, Method: Composition-based stats.
Identities = 35/126 (27%), Positives = 55/126 (43%), Gaps = 12/126 (9%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLG 136
+RG + + R +E N I G I + E++ N+ ++ + R + A+
Sbjct: 537 DRGYGVRSNRTFEPNQIIVEYTGEIITQAECEKRMRTIYKKNENMIIDATRGSIAR---- 592
Query: 137 PAAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNC-LCEC 191
++NH C PNCR VA + A R I GEE+T Y D + KN C C
Sbjct: 593 ---FVNHGCEPNCRMEKWTVAGKPRMALFAGDRGIMTGEELTYDYNFDPYSQKNVQQCRC 649
Query: 192 DTCEIR 197
+ + R
Sbjct: 650 GSSKCR 655
>gi|350294046|gb|EGZ75131.1| histone-lysine N-methyltransferase, H3 lysine-4 specific [Neurospora
tetrasperma FGSC 2509]
Length = 1313
Score = 41.6 bits (96), Expect = 0.35, Method: Composition-based stats.
Identities = 51/177 (28%), Positives = 72/177 (40%), Gaps = 37/177 (20%)
Query: 39 QRFNIGRILDMGTQHHRYSYRIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLV 98
QR N G+ D+ R L + K +++ + A K+D I V
Sbjct: 1150 QRKNFGQDSDV--------LRFNQLKKRKKPVKFARSAIHNWGLYAMENINKDDMIIEYV 1201
Query: 99 G-----CIAELTKEEEKRLLHTGL--------NDFSVMYSCRKNCAQLWLGPAAYINHDC 145
G IAEL E R L +G+ +D +V+ + +K G A +INH C
Sbjct: 1202 GEEVRQQIAEL---REARYLKSGIGSSYLFRIDDNTVIDATKKG------GIARFINHSC 1252
Query: 146 RPNC--RFVATERDTACV-KVLRDIEQGEEITCFYG-EDFFGDKN---CLCECDTCE 195
PNC + + E V LRDI Q EE+T Y E G + CLC C+
Sbjct: 1253 MPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFEREIGSTDRIPCLCGTAACK 1309
>gi|300024301|ref|YP_003756912.1| nuclear protein SET [Hyphomicrobium denitrificans ATCC 51888]
gi|299526122|gb|ADJ24591.1| nuclear protein SET [Hyphomicrobium denitrificans ATCC 51888]
Length = 133
Score = 41.6 bits (96), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 32/59 (54%), Gaps = 4/59 (6%)
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCL--CECDTC 194
A +INH C PNC + R V LR IE GEE++ YG+++F D + C C C
Sbjct: 74 ARFINHSCAPNCE-IEIRRRRIYVIALRAIEPGEELSYDYGQEYF-DMHLAGRCRCPKC 130
>gi|336472713|gb|EGO60873.1| hypothetical protein NEUTE1DRAFT_144212 [Neurospora tetrasperma FGSC
2508]
Length = 1282
Score = 41.6 bits (96), Expect = 0.35, Method: Composition-based stats.
Identities = 51/177 (28%), Positives = 72/177 (40%), Gaps = 37/177 (20%)
Query: 39 QRFNIGRILDMGTQHHRYSYRIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLV 98
QR N G+ D+ R L + K +++ + A K+D I V
Sbjct: 1119 QRKNFGQDSDV--------LRFNQLKKRKKPVKFARSAIHNWGLYAMENINKDDMIIEYV 1170
Query: 99 G-----CIAELTKEEEKRLLHTGL--------NDFSVMYSCRKNCAQLWLGPAAYINHDC 145
G IAEL E R L +G+ +D +V+ + +K G A +INH C
Sbjct: 1171 GEEVRQQIAEL---REARYLKSGIGSSYLFRIDDNTVIDATKKG------GIARFINHSC 1221
Query: 146 RPNC--RFVATERDTACV-KVLRDIEQGEEITCFYG-EDFFGDKN---CLCECDTCE 195
PNC + + E V LRDI Q EE+T Y E G + CLC C+
Sbjct: 1222 MPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFEREIGSTDRIPCLCGTAACK 1278
>gi|440804218|gb|ELR25095.1| SET and MYND domain containing 3, putative [Acanthamoeba
castellanii str. Neff]
Length = 365
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 26/62 (41%), Positives = 31/62 (50%), Gaps = 1/62 (1%)
Query: 118 NDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
N+F+V K PAA INH C PNC + R V+ LRDI + EEIT Y
Sbjct: 115 NNFTVSDGELKPLGLGIYPPAALINHSCDPNCVIIFEGRQ-CTVRSLRDITKDEEITFNY 173
Query: 178 GE 179
E
Sbjct: 174 VE 175
>gi|434407260|ref|YP_007150145.1| SET domain-containing protein [Cylindrospermum stagnale PCC 7417]
gi|428261515|gb|AFZ27465.1| SET domain-containing protein [Cylindrospermum stagnale PCC 7417]
Length = 122
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 47/101 (46%), Gaps = 5/101 (4%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGP 137
+G I A + + K + I C+ + KE+ K + HT L F+ ++ + LG
Sbjct: 10 KGRGIFAHKDFAKGEVIEK--ACVIVIPKEQVKLITHTVL--FNYYFNWHGESGAIALGL 65
Query: 138 AAYINHDCRPNCRFVAT-ERDTACVKVLRDIEQGEEITCFY 177
A+ NH PN +V + V +DI +GEE+T Y
Sbjct: 66 ASLFNHSYHPNALYVKNFAKRVIEVIAYQDIAEGEEVTVNY 106
>gi|285017825|ref|YP_003375536.1| hypothetical protein XALc_1033 [Xanthomonas albilineans GPE PC73]
gi|283473043|emb|CBA15548.1| conserved hypothetical protein [Xanthomonas albilineans GPE PC73]
Length = 155
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 26/48 (54%), Gaps = 7/48 (14%)
Query: 138 AAYINHDCRPNCRFVATE-------RDTACVKVLRDIEQGEEITCFYG 178
A +INH C PNC V E +D ++ LRDI GEE+T YG
Sbjct: 75 ARWINHSCAPNCEAVIVEAEGKDRRKDKVVIEALRDIAPGEELTYNYG 122
>gi|327349992|gb|EGE78849.1| histone-lysine N-methyltransferase [Ajellomyces dermatitidis ATCC
18188]
Length = 775
Score = 41.6 bits (96), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 56/130 (43%), Gaps = 13/130 (10%)
Query: 76 GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLND---FSVMYSCRKNCAQ 132
+RG + + R + N I G I +T+EE +R + T + + +MY R
Sbjct: 437 ADRGYGVRSNRTFAPNQIIVEYTGEI--VTQEECERRMRTVYKNNECYYLMYFDRNMIID 494
Query: 133 LWLGPAA-YINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN- 186
G A ++NH C PNC+ VA + A I GEE+T Y D + KN
Sbjct: 495 ATRGSIARFVNHSCEPNCKIEKWTVAGKPRMALFAGEHGIMTGEELTYDYNFDPYSQKNV 554
Query: 187 --CLCECDTC 194
C C TC
Sbjct: 555 QECRCGAPTC 564
>gi|348675505|gb|EGZ15323.1| hypothetical protein PHYSODRAFT_506382 [Phytophthora sojae]
Length = 136
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 34/65 (52%), Gaps = 5/65 (7%)
Query: 136 GPAAYINHDCRPNCRF--VATERDTACVKV-LRDIEQGEEITCFYGEDFFGDKNCLCECD 192
G ++NH C P+ RF V T++ V V +RDI GEE+T FYG + C C
Sbjct: 62 GNPRFMNHACNPSARFHEVQTDQRLTVVAVSIRDIYPGEEVTVFYGNKLW--LVCRCGWS 119
Query: 193 TCEIR 197
C+ R
Sbjct: 120 GCQHR 124
>gi|255938628|ref|XP_002560084.1| Pc14g00900 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211584705|emb|CAP74231.1| Pc14g00900 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 1202
Score = 41.6 bits (96), Expect = 0.38, Method: Composition-based stats.
Identities = 41/149 (27%), Positives = 62/149 (41%), Gaps = 13/149 (8%)
Query: 59 RIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVG--CIAELTKEEEKRLLHTG 116
R L + K R++ + A ND I VG ++ E+R L +G
Sbjct: 1051 RFNQLKKRKKPVRFARSAIHNWGLYAEENITANDMIIEYVGEKVRQQVADMRERRYLKSG 1110
Query: 117 LNDFSVMYSCRKNC---AQLWLGPAAYINHDCRPNCRFVATERDTA---CVKVLRDIEQG 170
+ S ++ +N A G A +INH C PNC + D + + LRDIE+
Sbjct: 1111 IGS-SYLFRIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERD 1169
Query: 171 EEITCFYGEDFFGDKN----CLCECDTCE 195
EE+T Y + D + CLC C+
Sbjct: 1170 EELTYDYKFEREWDSDDRIPCLCGSTGCK 1198
>gi|74697791|sp|Q8X0S9.1|SET1_NEUCR RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|18376303|emb|CAD21415.1| related to regulatory protein SET1 [Neurospora crassa]
Length = 1313
Score = 41.6 bits (96), Expect = 0.38, Method: Composition-based stats.
Identities = 51/177 (28%), Positives = 72/177 (40%), Gaps = 37/177 (20%)
Query: 39 QRFNIGRILDMGTQHHRYSYRIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLV 98
QR N G+ D+ R L + K +++ + A K+D I V
Sbjct: 1150 QRKNFGQDSDV--------LRFNQLKKRKKPVKFARSAIHNWGLYAMENINKDDMIIEYV 1201
Query: 99 G-----CIAELTKEEEKRLLHTGL--------NDFSVMYSCRKNCAQLWLGPAAYINHDC 145
G IAEL E R L +G+ +D +V+ + +K G A +INH C
Sbjct: 1202 GEEVRQQIAEL---REARYLKSGIGSSYLFRIDDNTVIDATKKG------GIARFINHSC 1252
Query: 146 RPNC--RFVATERDTACV-KVLRDIEQGEEITCFYG-EDFFGDKN---CLCECDTCE 195
PNC + + E V LRDI Q EE+T Y E G + CLC C+
Sbjct: 1253 MPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFEREIGSTDRIPCLCGTAACK 1309
>gi|134084734|emb|CAK43391.1| unnamed protein product [Aspergillus niger]
Length = 1079
Score = 41.6 bits (96), Expect = 0.39, Method: Composition-based stats.
Identities = 41/149 (27%), Positives = 62/149 (41%), Gaps = 13/149 (8%)
Query: 59 RIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVG--CIAELTKEEEKRLLHTG 116
R L + K R++ + A ND I VG ++ E+R L +G
Sbjct: 928 RFNQLKKRKKPVRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSG 987
Query: 117 LNDFSVMYSCRKNC---AQLWLGPAAYINHDCRPNCRFVATERDTA---CVKVLRDIEQG 170
+ S ++ +N A G A +INH C PNC + D + + LRDIE+
Sbjct: 988 IGS-SYLFRIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERD 1046
Query: 171 EEITCFYGEDFFGDKN----CLCECDTCE 195
EE+T Y + D + CLC C+
Sbjct: 1047 EELTYDYKFEREWDSDDRIPCLCGSTGCK 1075
>gi|429862241|gb|ELA36898.1| histone-lysine n-methyltransferase [Colletotrichum gloeosporioides
Nara gc5]
Length = 1270
Score = 41.6 bits (96), Expect = 0.40, Method: Composition-based stats.
Identities = 38/123 (30%), Positives = 56/123 (45%), Gaps = 23/123 (18%)
Query: 90 KNDNIAYLVG--CIAELTKEEEKRLLHTGL--------NDFSVMYSCRKNCAQLWLGPAA 139
K+D I VG +++ EKR L +G+ +D +V+ + +K G A
Sbjct: 1150 KDDMIIEYVGEQVRQSISEIREKRYLKSGMGSSYLFRIDDNTVIDATKKG------GIAR 1203
Query: 140 YINHDCRPNCRFVATERDTA---CVKVLRDIEQGEEITCFYG-EDFFGDKN---CLCECD 192
+INH C PNC + D + + LRDI Q EE+T Y E G + CLC
Sbjct: 1204 FINHSCMPNCTAKIIKVDGSKRIVIYALRDIAQHEELTYDYKFEREIGSLDRIPCLCGTA 1263
Query: 193 TCE 195
C+
Sbjct: 1264 ACK 1266
>gi|358373521|dbj|GAA90119.1| SET domain protein [Aspergillus kawachii IFO 4308]
Length = 1239
Score = 41.6 bits (96), Expect = 0.40, Method: Composition-based stats.
Identities = 41/149 (27%), Positives = 62/149 (41%), Gaps = 13/149 (8%)
Query: 59 RIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVG--CIAELTKEEEKRLLHTG 116
R L + K R++ + A ND I VG ++ E+R L +G
Sbjct: 1088 RFNQLKKRKKPVRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSG 1147
Query: 117 LNDFSVMYSCRKNC---AQLWLGPAAYINHDCRPNCRFVATERDTA---CVKVLRDIEQG 170
+ S ++ +N A G A +INH C PNC + D + + LRDIE+
Sbjct: 1148 IGS-SYLFRIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERD 1206
Query: 171 EEITCFYGEDFFGDKN----CLCECDTCE 195
EE+T Y + D + CLC C+
Sbjct: 1207 EELTYDYKFEREWDSDDRIPCLCGSTGCK 1235
>gi|322694149|gb|EFY85986.1| histone-lysine N-methyltransferase (Ash1), putative [Metarhizium
acridum CQMa 102]
Length = 760
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 48/104 (46%), Gaps = 17/104 (16%)
Query: 138 AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN---CLCE 190
A ++NH C PNCR V+ + A R I G+E+T Y D F KN CLC
Sbjct: 500 ARFVNHSCNPNCRMIKWIVSGQPRMALFAGDRPIMTGDELTYDYNFDPFSAKNVQKCLCG 559
Query: 191 CDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDT---DMRLNKRKL 231
C +G +P+ +P++ ++ T ++ KRKL
Sbjct: 560 EHNC----RGVLGP---KPRDVKPAKADIKKTVKATVKAGKRKL 596
>gi|164426120|ref|XP_961572.2| hypothetical protein NCU01206 [Neurospora crassa OR74A]
gi|157071206|gb|EAA32336.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 1150
Score = 41.6 bits (96), Expect = 0.40, Method: Composition-based stats.
Identities = 51/177 (28%), Positives = 72/177 (40%), Gaps = 37/177 (20%)
Query: 39 QRFNIGRILDMGTQHHRYSYRIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLV 98
QR N G+ D+ R L + K +++ + A K+D I V
Sbjct: 987 QRKNFGQDSDV--------LRFNQLKKRKKPVKFARSAIHNWGLYAMENINKDDMIIEYV 1038
Query: 99 G-----CIAELTKEEEKRLLHTGL--------NDFSVMYSCRKNCAQLWLGPAAYINHDC 145
G IAEL E R L +G+ +D +V+ + +K G A +INH C
Sbjct: 1039 GEEVRQQIAEL---REARYLKSGIGSSYLFRIDDNTVIDATKKG------GIARFINHSC 1089
Query: 146 RPNC--RFVATERDTACV-KVLRDIEQGEEITCFYG-EDFFGDKN---CLCECDTCE 195
PNC + + E V LRDI Q EE+T Y E G + CLC C+
Sbjct: 1090 MPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFEREIGSTDRIPCLCGTAACK 1146
>gi|389806004|ref|ZP_10203146.1| SET domain-containing protein [Rhodanobacter thiooxydans LCS2]
gi|388446376|gb|EIM02413.1| SET domain-containing protein [Rhodanobacter thiooxydans LCS2]
Length = 155
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 28/48 (58%), Gaps = 7/48 (14%)
Query: 138 AAYINHDCRPNCRFVATE-------RDTACVKVLRDIEQGEEITCFYG 178
A +INH C PNCR + E RD ++ +R+I+ GEE+T YG
Sbjct: 75 ARWINHSCAPNCRALVEESASGDPRRDRVLIEAIRNIKPGEELTYDYG 122
>gi|336259450|ref|XP_003344526.1| hypothetical protein SMAC_07534 [Sordaria macrospora k-hell]
gi|380093240|emb|CCC08898.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 1314
Score = 41.6 bits (96), Expect = 0.41, Method: Composition-based stats.
Identities = 51/177 (28%), Positives = 72/177 (40%), Gaps = 37/177 (20%)
Query: 39 QRFNIGRILDMGTQHHRYSYRIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLV 98
QR N G+ D+ R L + K +++ + A K+D I V
Sbjct: 1151 QRKNFGQDSDV--------LRFNQLKKRKKPVKFARSAIHNWGLYAMENINKDDMIIEYV 1202
Query: 99 G-----CIAELTKEEEKRLLHTGL--------NDFSVMYSCRKNCAQLWLGPAAYINHDC 145
G IAEL E R L +G+ +D +V+ + +K G A +INH C
Sbjct: 1203 GEEVRQQIAEL---REARYLKSGIGSSYLFRIDDNTVIDATKKG------GIARFINHSC 1253
Query: 146 RPNC--RFVATERDTACV-KVLRDIEQGEEITCFYG-EDFFGDKN---CLCECDTCE 195
PNC + + E V LRDI Q EE+T Y E G + CLC C+
Sbjct: 1254 MPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFEREIGSTDRIPCLCGTAACK 1310
>gi|400595512|gb|EJP63307.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 824
Score = 41.6 bits (96), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 50/117 (42%), Gaps = 17/117 (14%)
Query: 138 AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN---CLCE 190
A ++NH C PNCR V+ + A I G+E+T Y D F KN CLC
Sbjct: 541 ARFVNHSCNPNCRMIKWIVSGQPRMALFAGDNPIMTGDELTYDYNFDPFSAKNVQKCLCG 600
Query: 191 CDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDT---DMRLNKRKLHKKLNRLLLASDK 244
C +G P+ +P + L+ ++ KRKL + L + +DK
Sbjct: 601 AHNC----RGVLGP---RPREVKPPKTDLKKAVKGTIKAGKRKLKEMLGDEVSGNDK 650
>gi|270001730|gb|EEZ98177.1| hypothetical protein TcasGA2_TC000606 [Tribolium castaneum]
Length = 5215
Score = 41.6 bits (96), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 57/128 (44%), Gaps = 13/128 (10%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCI--AELTKEEEKRLLHTGLNDFSVMYSC---RKNCAQ 132
+G + A R EK+ + +G I EL + EK+ + N M+ R A
Sbjct: 5085 QGLGLYAARDLEKHTMVIEYIGEIIRTELAETREKK--YEAKNRGIYMFRLDEERVVDAT 5142
Query: 133 LWLGPAAYINHDCRPNC--RFVATERD-TACVKVLRDIEQGEEITCFYGEDFFGDK---N 186
L G A YINH C PNC V +RD + R I++GEE+ Y D D+ +
Sbjct: 5143 LCGGLARYINHSCNPNCVAETVEVDRDYRIIIFAKRRIQRGEELAYDYKFDIEDDQHKIS 5202
Query: 187 CLCECDTC 194
C+C C
Sbjct: 5203 CMCGAPNC 5210
>gi|84997445|ref|XP_953444.1| hypothetical protein [Theileria annulata strain Ankara]
gi|65304440|emb|CAI76819.1| hypothetical protein, conserved [Theileria annulata]
Length = 1083
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 41/93 (44%), Gaps = 6/93 (6%)
Query: 138 AAYINHDCRPNCRFVATE-RDTACVKVL--RDIEQGEEITCFYG---EDFFGDKNCLCEC 191
A +INH C PNC V R T + V R I+QGEE+T YG + G C C
Sbjct: 847 ARFINHSCDPNCASVPINVRGTYRMGVFAQRKIKQGEEVTYNYGFTSKGVGGGFRCRCRA 906
Query: 192 DTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDM 224
C + A S + + S+Y + D+
Sbjct: 907 KNCRGIIGSQLAHSPESLMSIEASKYSGAEADV 939
>gi|301121168|ref|XP_002908311.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262103342|gb|EEY61394.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 424
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 23/37 (62%)
Query: 141 INHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
INH C+PNC + V+ LRDIE+GEEIT Y
Sbjct: 172 INHSCQPNCTWSNAGDSVMEVRALRDIEEGEEITLSY 208
>gi|444724233|gb|ELW64844.1| Histone-lysine N-methyltransferase MLL3 [Tupaia chinensis]
Length = 4664
Score = 41.6 bits (96), Expect = 0.43, Method: Composition-based stats.
Identities = 32/103 (31%), Positives = 45/103 (43%), Gaps = 6/103 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4408 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4467
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEIT 174
GPA YINH C PNC V ER + R I++GEE++
Sbjct: 4468 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEEVS 4510
>gi|255644430|gb|ACU22720.1| unknown [Glycine max]
Length = 381
Score = 41.6 bits (96), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 27/94 (28%), Positives = 46/94 (48%), Gaps = 20/94 (21%)
Query: 138 AAYINHDCRPN-CRFVAT-------ERDTA--CVKVLRDIEQGEEITCFY---GEDFFGD 184
A + NHDC PN CRF ER++ ++++ D+++G+E+ Y G D+
Sbjct: 214 ATFFNHDCVPNACRFDYVDSTNDDYERNSTDIVIRLIEDVDEGKEVCISYFRIGRDYCTR 273
Query: 185 KNCL-------CECDTCEIRMKGAFAQSSVEPQT 211
K L C CD C +++ G +++V T
Sbjct: 274 KRILMEDYGFTCGCDRCTLKLIGMVKKTTVTFHT 307
>gi|341901991|gb|EGT57926.1| hypothetical protein CAEBREN_15025 [Caenorhabditis brenneri]
Length = 882
Score = 41.2 bits (95), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 34/71 (47%), Gaps = 9/71 (12%)
Query: 119 DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVAT-ERDTA--------CVKVLRDIEQ 169
DF VM CA A ++NH+C PNC FV T R+ A CV LRDI+
Sbjct: 775 DFKVMNPNFILCAGKIGNVARFLNHNCDPNCAFVETHSRELASDLLIPRICVYALRDIKV 834
Query: 170 GEEITCFYGED 180
GE + Y D
Sbjct: 835 GETVNISYWGD 845
>gi|408398587|gb|EKJ77717.1| hypothetical protein FPSE_02215 [Fusarium pseudograminearum CS3096]
Length = 467
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 59/139 (42%), Gaps = 19/139 (13%)
Query: 58 YRIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGL 117
+ + LP +R S N G + A +E+ I VG I + +E++ G
Sbjct: 324 FGVPRLPIAVQVFRTSNGRNNG--LQALVAFERGTAIGEFVGLITKDIEEQDVMDSQVGG 381
Query: 118 NDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRF-----VATERDTACVKVLRDIEQGEE 172
+ + + N + + NH C+PN +F + T+ + V + IE G E
Sbjct: 382 TRYQIWQGKQGNFTR-------FANHSCKPNAQFEKFVWLGTQH---ILMVSKGIEAGAE 431
Query: 173 ITCFYGEDFFG--DKNCLC 189
IT Y E ++ DK CLC
Sbjct: 432 ITVDYTESYWRGLDKKCLC 450
>gi|116199091|ref|XP_001225357.1| hypothetical protein CHGG_07701 [Chaetomium globosum CBS 148.51]
gi|121922631|sp|Q2GWF3.1|SET1_CHAGB RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|88178980|gb|EAQ86448.1| hypothetical protein CHGG_07701 [Chaetomium globosum CBS 148.51]
Length = 1076
Score = 41.2 bits (95), Expect = 0.47, Method: Composition-based stats.
Identities = 42/126 (33%), Positives = 56/126 (44%), Gaps = 29/126 (23%)
Query: 90 KNDNIAYLVG-----CIAELTKEEEKRLLHTGL--------NDFSVMYSCRKNCAQLWLG 136
K+D I VG IAEL E R L +G+ +D +V+ + +K G
Sbjct: 956 KDDMIIEYVGEEVRQQIAEL---RENRYLKSGIGSSYLFRIDDNTVIDATKKG------G 1006
Query: 137 PAAYINHDCRPNC--RFVATERDTACV-KVLRDIEQGEEITCFYG-EDFFGDKN---CLC 189
A +INH C PNC + + E V LRDI Q EE+T Y E G + CLC
Sbjct: 1007 IARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERELGSTDRIPCLC 1066
Query: 190 ECDTCE 195
C+
Sbjct: 1067 GTAACK 1072
>gi|345565974|gb|EGX48921.1| hypothetical protein AOL_s00079g142 [Arthrobotrys oligospora ATCC
24927]
Length = 1227
Score = 41.2 bits (95), Expect = 0.47, Method: Composition-based stats.
Identities = 38/133 (28%), Positives = 57/133 (42%), Gaps = 11/133 (8%)
Query: 74 LEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQL 133
L ++G + A R + N I +G I +T++E + LH D Y + + +
Sbjct: 362 LTQDKGYGLRACRSFGPNQIIVEYIGEI--ITQDECEERLHGPYKDNESYYLMEFDNSLI 419
Query: 134 WLGP----AAYINHDCRPNCR----FVATERDTACVKVLRDIEQGEEITCFYGEDFF-GD 184
A ++NH C PNC+ VA + A IE GEE+T Y +F G
Sbjct: 420 IDATRGSLARFVNHSCSPNCKMEKWMVAGQPRMALFAGDEGIEVGEELTYDYNFSWFSGG 479
Query: 185 KNCLCECDTCEIR 197
LC C T + R
Sbjct: 480 STQLCRCGTEQCR 492
>gi|328869415|gb|EGG17793.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 363
Score = 41.2 bits (95), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 22/67 (32%), Positives = 32/67 (47%), Gaps = 10/67 (14%)
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGE---DFFGDKNCL------ 188
+++ NH C PN TA + LRDI++GEE+ YG+ D F + L
Sbjct: 284 SSFFNHSCDPNVYMAFPHDKTAVITALRDIKKGEELFISYGDSEKDMFDRQTHLFDNYGF 343
Query: 189 -CECDTC 194
C+C C
Sbjct: 344 NCDCPKC 350
>gi|110598362|ref|ZP_01386636.1| Nuclear protein SET [Chlorobium ferrooxidans DSM 13031]
gi|110340060|gb|EAT58561.1| Nuclear protein SET [Chlorobium ferrooxidans DSM 13031]
Length = 118
Score = 41.2 bits (95), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 20/67 (29%), Positives = 33/67 (49%), Gaps = 3/67 (4%)
Query: 122 VMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTA---CVKVLRDIEQGEEITCFYG 178
V Y R+N + +G NH PN + + D + LRDIE+GEE+ YG
Sbjct: 50 VFYGSRENSRLVVMGNGMLFNHSFTPNVAYYLEQTDIGHELILYALRDIEKGEELFYNYG 109
Query: 179 EDFFGDK 185
++++ +
Sbjct: 110 DEWWATR 116
>gi|358342811|dbj|GAA39455.2| histone-lysine N-methyltransferase MLL3 [Clonorchis sinensis]
Length = 1443
Score = 41.2 bits (95), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 61/130 (46%), Gaps = 16/130 (12%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEE---EKRLLHTGLNDFSVMYSCRKNC---A 131
+G + A R K+ A+++ + EL + E + L+ N M+ ++ A
Sbjct: 1312 QGLGLYAARNLSKH---AFIIEYLGELIRNEVGNRRERLYELQNRGIYMFRVDEDSIVDA 1368
Query: 132 QLWLGPAAYINHDCRPNC--RFVATERDTACVKVL-RDIEQGEEITCFYG---EDFFGDK 185
+ G A YINH C PNC + E + + + RDIE+GEE+T Y E+ GD+
Sbjct: 1369 TMCGGLARYINHSCEPNCTAEVLNCENGSHIIIIASRDIEKGEELTYDYKFDIEEERGDR 1428
Query: 186 -NCLCECDTC 194
CLC C
Sbjct: 1429 IPCLCGAPNC 1438
>gi|156083827|ref|XP_001609397.1| SET domain containing protein [Babesia bovis T2Bo]
gi|154796648|gb|EDO05829.1| SET domain containing protein [Babesia bovis]
Length = 799
Score = 41.2 bits (95), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 31/62 (50%), Gaps = 5/62 (8%)
Query: 138 AAYINHDCRPNCRFVATER--DTACVKV--LRDIEQGEEITCFYGEDFFGDKNCLCECDT 193
A ++NH C PN + R D C+ V +RDI GE +T YG + CLC D
Sbjct: 733 ARFLNHSCDPNVEVITVWRGDDFPCIAVYAIRDIPAGEALTYCYGSQ-YKSIPCLCGTDK 791
Query: 194 CE 195
C+
Sbjct: 792 CK 793
>gi|402081815|gb|EJT76960.1| histone-lysine N-methyltransferase [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 1319
Score = 41.2 bits (95), Expect = 0.50, Method: Composition-based stats.
Identities = 39/123 (31%), Positives = 53/123 (43%), Gaps = 23/123 (18%)
Query: 90 KNDNIAYLVG--CIAELTKEEEKRLLHTGL--------NDFSVMYSCRKNCAQLWLGPAA 139
K+D I VG + K E R L +G+ +D +V+ + +K G A
Sbjct: 1199 KDDMIIEYVGEEVRPSVAKVREARYLKSGIGSTYLFRIDDEAVIDATKKG------GIAR 1252
Query: 140 YINHDCRPNC--RFVATERDTACV-KVLRDIEQGEEITCFYG----EDFFGDKNCLCECD 192
+INH C PNC + + E V LRDI Q EE+T Y ED CLC
Sbjct: 1253 FINHSCMPNCTAKIIKVEGSKRIVIYALRDIGQNEELTYDYKFEPEEDQKDRVPCLCGTT 1312
Query: 193 TCE 195
C+
Sbjct: 1313 ACK 1315
>gi|449305119|gb|EMD01126.1| hypothetical protein BAUCODRAFT_100872 [Baudoinia compniacensis
UAMH 10762]
Length = 449
Score = 41.2 bits (95), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 47/99 (47%), Gaps = 15/99 (15%)
Query: 113 LHTGLNDFSVMYS-CRKNCAQLWLGPAA----YINHDCRPNCRFV-----ATERDTACVK 162
+ +GL++ VM + Q+W G ++NH C PN ++ T+R V
Sbjct: 342 ITSGLSNMDVMIGQTNRATYQIWQGRQGNYTRFVNHSCEPNSQYERFQWRGTQR---IVL 398
Query: 163 VLRDIEQGEEITCFYGEDFFG--DKNCLCECDTCEIRMK 199
V + IE G+E+T Y + ++ DK CLC C + +
Sbjct: 399 VSKGIEAGDEVTVDYSDTYWRNLDKVCLCGAAQCRYKER 437
>gi|367035324|ref|XP_003666944.1| hypothetical protein MYCTH_2312118 [Myceliophthora thermophila ATCC
42464]
gi|347014217|gb|AEO61699.1| hypothetical protein MYCTH_2312118 [Myceliophthora thermophila ATCC
42464]
Length = 234
Score = 41.2 bits (95), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 58/126 (46%), Gaps = 11/126 (8%)
Query: 140 YINHDCRPNCRFVATERDTACVKVL---RDIEQGEEITCFYG--EDFFGDK-NCLCECDT 193
YINH C P+ F DT+ + V+ + ++ G+E+T FY E + +CLC T
Sbjct: 77 YINHSCEPSLIF-----DTSSMTVIAGPKGLKPGDELTFFYPSTEWYMAQPFDCLCGAPT 131
Query: 194 CEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLHKKLNRLLLASDKNDTNSSDNS 253
C R+ GA +S + + +R+ + R K ++ N N T+ +D +
Sbjct: 132 CRGRIAGARDMTSAQLEGMWLNRHIRDLLEERDAGLKAKQEANGTDGTDGTNGTDGTDGT 191
Query: 254 SKYSLV 259
+ + V
Sbjct: 192 GRVNGV 197
>gi|156061849|ref|XP_001596847.1| hypothetical protein SS1G_03070 [Sclerotinia sclerotiorum 1980]
gi|154700471|gb|EDO00210.1| hypothetical protein SS1G_03070 [Sclerotinia sclerotiorum 1980 UF-70]
Length = 1264
Score = 41.2 bits (95), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 54/120 (45%), Gaps = 19/120 (15%)
Query: 91 NDNIAYLVG-----CIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYIN 142
ND I VG +A+L E R L +G+ S ++ +N A G A +IN
Sbjct: 1145 NDMIIEYVGEKVRQQVADL---RENRYLKSGIGS-SYLFRIDENTVIDATKKGGIARFIN 1200
Query: 143 HDCRPNC--RFVATERDTACV-KVLRDIEQGEEITCFYG-EDFFGDKN---CLCECDTCE 195
H C PNC + + E+ V LRDI Q EE+T Y E G + CLC C+
Sbjct: 1201 HSCMPNCTAKIITVEKSKRIVIYALRDIAQNEELTYDYKFEREIGSTDRIPCLCGTPACK 1260
>gi|389722005|ref|ZP_10188703.1| SET domain-containing protein [Rhodanobacter sp. 115]
gi|388444781|gb|EIM00877.1| SET domain-containing protein [Rhodanobacter sp. 115]
Length = 155
Score = 41.2 bits (95), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 35/69 (50%), Gaps = 13/69 (18%)
Query: 117 LNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATE-------RDTACVKVLRDIEQ 169
LN+ ++ + RK W INH C PNCR + E +D ++ +R+I+
Sbjct: 60 LNEHYLIDANRKGNTARW------INHSCAPNCRALVEESTSGDPRKDKVVIEAIRNIKP 113
Query: 170 GEEITCFYG 178
GEE+T YG
Sbjct: 114 GEELTYDYG 122
>gi|62147647|emb|CAI72344.1| set domain-containing protein, putative [Phytophthora infestans]
gi|63086927|emb|CAI72253.1| SET-domain containing protein, putative [Phytophthora infestans]
Length = 529
Score = 41.2 bits (95), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 32/59 (54%), Gaps = 9/59 (15%)
Query: 140 YINHDCRPNCRFVATERDTAC---VKVLRDIEQGEEITCFYG-EDFFGDKNCLCECDTC 194
+I+H C PN FV ++ VK++RD++ G EIT YG E +F C CD C
Sbjct: 465 FISHSCDPNAAFVEQSNRSSVKVLVKMIRDVKAGAEITVHYGNERWFK-----CACDEC 518
>gi|260794338|ref|XP_002592166.1| hypothetical protein BRAFLDRAFT_125138 [Branchiostoma floridae]
gi|229277381|gb|EEN48177.1| hypothetical protein BRAFLDRAFT_125138 [Branchiostoma floridae]
Length = 1891
Score = 41.2 bits (95), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 34/70 (48%), Gaps = 6/70 (8%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATERDTACVKV-LRDIEQGEEITCFYGEDFFGDKN- 186
A L GPA YINH C PNC V E++ + + R + +GEE+T Y D D+
Sbjct: 1817 ATLAGGPARYINHSCNPNCVAEVVNFEKEQKIIIISSRRLSKGEELTYDYKFDIEDDEQK 1876
Query: 187 --CLCECDTC 194
C C C
Sbjct: 1877 IPCCCGAPNC 1886
>gi|71000549|ref|XP_754958.1| histone-lysine N-methyltransferase (Ash1) [Aspergillus fumigatus
Af293]
gi|66852595|gb|EAL92920.1| histone-lysine N-methyltransferase (Ash1), putative [Aspergillus
fumigatus Af293]
gi|159127971|gb|EDP53086.1| histone-lysine N-methyltransferase (Ash1), putative [Aspergillus
fumigatus A1163]
Length = 845
Score = 41.2 bits (95), Expect = 0.52, Method: Composition-based stats.
Identities = 36/128 (28%), Positives = 52/128 (40%), Gaps = 9/128 (7%)
Query: 76 GNRGARIVATRKWEKNDNIAYLVG-CIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLW 134
+RG + + R +E N I G I + E+ R ++ + +MY +
Sbjct: 478 ADRGYGVRSNRTFEPNQIIVEYTGEIITQAECEKRMRTIYKNNECYYLMYFDQNMIIDAT 537
Query: 135 LGPAA-YINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN--- 186
G A ++NH C PNCR VA + A I GEE+T Y D + KN
Sbjct: 538 RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDHGIMTGEELTYDYNFDPYSQKNVQQ 597
Query: 187 CLCECDTC 194
C C C
Sbjct: 598 CRCGASNC 605
>gi|326518712|dbj|BAJ92517.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 166
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 37/138 (26%), Positives = 58/138 (42%), Gaps = 18/138 (13%)
Query: 50 GTQHHRYS--------YRIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCI 101
G HH Y Y + +L L+P + + +V T++W N V
Sbjct: 6 GRTHHLYPFISLFQPLYHLVFLDLLQPHFH---DKVSLTVLVLTKQWYINVLARIRVNAF 62
Query: 102 -AELTKEEEKRLLHTGLNDFSVMYSCRKNCAQ-LWLGPAAYINHDCRPNCRFVATERDTA 159
EL + LL + + + C + +++ P+ Y NHDC PN V E A
Sbjct: 63 RIELVASSYEDLLSSAVASVT----CDASVGNAVYMLPSFY-NHDCDPNTHIVWLENADA 117
Query: 160 CVKVLRDIEQGEEITCFY 177
+ LRDI++GEE+ Y
Sbjct: 118 KLNTLRDIDEGEELRICY 135
>gi|119488913|ref|ZP_01621875.1| hypothetical protein L8106_20188 [Lyngbya sp. PCC 8106]
gi|119455074|gb|EAW36216.1| hypothetical protein L8106_20188 [Lyngbya sp. PCC 8106]
Length = 164
Score = 41.2 bits (95), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 37/65 (56%), Gaps = 6/65 (9%)
Query: 140 YINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY-GEDFFGDK--NCLCECDTCEI 196
YINH C PN F T+ V LR I+ GE++T FY ++ D+ +CLC+ + C
Sbjct: 77 YINHSCTPNVGFDTTD---WVVTALRRIKIGEQLTFFYPSTEWSMDRGFDCLCKSEACLG 133
Query: 197 RMKGA 201
++GA
Sbjct: 134 YIQGA 138
>gi|70568862|dbj|BAE06306.1| transcription factor protein [Ciona intestinalis]
Length = 589
Score = 41.2 bits (95), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 53/126 (42%), Gaps = 9/126 (7%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A+R E + +G I E++ + N M+ + A +
Sbjct: 459 QGLGLFASRDIEPGTMVIEYIGDIIRSEVAEKREKNYEAANRGVYMFRLDSDYIVDATVT 518
Query: 135 LGPAAYINHDCRPNC--RFVATERDTACVKVL-RDIEQGEEITCFYGEDFFGDKN---CL 188
GPA YINH C PNC V E++ + + R I GEE+ Y DF + N CL
Sbjct: 519 GGPARYINHSCNPNCVAEVVNFEKEKKIMIISNRHILSGEELNYDYKFDFEDEGNKIPCL 578
Query: 189 CECDTC 194
C C
Sbjct: 579 CGAINC 584
>gi|347827280|emb|CCD42977.1| similar to histone-lysine N-methyltransferase [Botryotinia
fuckeliana]
Length = 1265
Score = 41.2 bits (95), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 54/120 (45%), Gaps = 19/120 (15%)
Query: 91 NDNIAYLVG-----CIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYIN 142
ND I VG +A+L E R L +G+ S ++ +N A G A +IN
Sbjct: 1146 NDMIIEYVGEKVRQQVADL---RENRYLKSGIGS-SYLFRIDENTVIDATKKGGIARFIN 1201
Query: 143 HDCRPNC--RFVATERDTACV-KVLRDIEQGEEITCFYG-EDFFGDKN---CLCECDTCE 195
H C PNC + + E+ V LRDI Q EE+T Y E G + CLC C+
Sbjct: 1202 HSCMPNCTAKIITVEKSKRIVIYALRDIAQNEELTYDYKFEREIGSTDRIPCLCGTPACK 1261
>gi|119493172|ref|XP_001263805.1| histone-lysine N-methyltransferase (Ash1), putative [Neosartorya
fischeri NRRL 181]
gi|119411965|gb|EAW21908.1| histone-lysine N-methyltransferase (Ash1), putative [Neosartorya
fischeri NRRL 181]
Length = 839
Score = 41.2 bits (95), Expect = 0.56, Method: Composition-based stats.
Identities = 36/128 (28%), Positives = 52/128 (40%), Gaps = 9/128 (7%)
Query: 76 GNRGARIVATRKWEKNDNIAYLVG-CIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLW 134
+RG + + R +E N I G I + E+ R ++ + +MY +
Sbjct: 478 ADRGYGVRSNRTFEPNQIIVEYTGEIITQAECEKRMRTIYKNNECYYLMYFDQNMIIDAT 537
Query: 135 LGPAA-YINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN--- 186
G A ++NH C PNCR VA + A I GEE+T Y D + KN
Sbjct: 538 RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDHGIMTGEELTYDYNFDPYSQKNVQQ 597
Query: 187 CLCECDTC 194
C C C
Sbjct: 598 CRCGASNC 605
>gi|380494835|emb|CCF32851.1| SET domain-containing protein [Colletotrichum higginsianum]
Length = 1257
Score = 40.8 bits (94), Expect = 0.57, Method: Composition-based stats.
Identities = 38/123 (30%), Positives = 56/123 (45%), Gaps = 23/123 (18%)
Query: 90 KNDNIAYLVG--CIAELTKEEEKRLLHTGL--------NDFSVMYSCRKNCAQLWLGPAA 139
K+D I VG +++ EKR L +G+ +D +V+ + +K G A
Sbjct: 1137 KDDMIIEYVGEQVRQSISEIREKRYLKSGMGSSYLFRIDDNTVIDATKKG------GIAR 1190
Query: 140 YINHDCRPNCRFVATERDTA---CVKVLRDIEQGEEITCFYG-EDFFGDKN---CLCECD 192
+INH C PNC + D + + LRDI Q EE+T Y E G + CLC
Sbjct: 1191 FINHSCMPNCTAKIIKVDGSKRIVIYALRDIGQHEELTYDYKFEREIGSLDRIPCLCGTA 1250
Query: 193 TCE 195
C+
Sbjct: 1251 ACK 1253
>gi|310792530|gb|EFQ28057.1| SET domain-containing protein [Glomerella graminicola M1.001]
Length = 1262
Score = 40.8 bits (94), Expect = 0.57, Method: Composition-based stats.
Identities = 38/123 (30%), Positives = 56/123 (45%), Gaps = 23/123 (18%)
Query: 90 KNDNIAYLVG--CIAELTKEEEKRLLHTGL--------NDFSVMYSCRKNCAQLWLGPAA 139
K+D I VG +++ EKR L +G+ +D +V+ + +K G A
Sbjct: 1142 KDDMIIEYVGEQVRQSISEIREKRYLKSGMGSSYLFRIDDNTVIDATKKG------GIAR 1195
Query: 140 YINHDCRPNCRFVATERDTA---CVKVLRDIEQGEEITCFYG-EDFFGDKN---CLCECD 192
+INH C PNC + D + + LRDI Q EE+T Y E G + CLC
Sbjct: 1196 FINHSCMPNCTAKIIKVDGSKRIVIYALRDIGQHEELTYDYKFEREIGSLDRIPCLCGTA 1255
Query: 193 TCE 195
C+
Sbjct: 1256 ACK 1258
>gi|409081585|gb|EKM81944.1| hypothetical protein AGABI1DRAFT_98514 [Agaricus bisporus var.
burnettii JB137-S8]
gi|426196823|gb|EKV46751.1| hypothetical protein AGABI2DRAFT_70795 [Agaricus bisporus var.
bisporus H97]
Length = 172
Score = 40.8 bits (94), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 58/132 (43%), Gaps = 15/132 (11%)
Query: 75 EGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLW 134
+G+ + + AT+ ++ D +A L EL K + +T SV Y N
Sbjct: 22 DGDFNSCLRATKTFKAGDYVAPL-----ELGKLTKGPKTYT-----SVQYGPGSNDHIEL 71
Query: 135 LGPAAYINHDCRPNCRFVATERDTA--CVKVLRDIEQGEEITCFY-GEDFFGDKNCLCEC 191
YINH C PN F + D + V+ LR IE G +T FY ++ D+ CEC
Sbjct: 72 NSDLVYINHSCDPNVAFDLSSSDPSEWHVRALRRIETGGSVTFFYPSTEWDMDQPFACEC 131
Query: 192 --DTCEIRMKGA 201
TC + GA
Sbjct: 132 RAKTCLGSIYGA 143
>gi|396494195|ref|XP_003844247.1| hypothetical protein LEMA_P018980.1 [Leptosphaeria maculans JN3]
gi|312220827|emb|CBY00768.1| hypothetical protein LEMA_P018980.1 [Leptosphaeria maculans JN3]
Length = 358
Score = 40.8 bits (94), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 8/88 (9%)
Query: 140 YINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFG---DKNCLCECD 192
+INH C P+C F V R V+ +R+I G E+T YG ++G +K C C
Sbjct: 268 FINHSCAPHCDFRVRRVGMMR-IMTVEAIRNIPAGVELTVSYGLSYYGPESNKICYCGTK 326
Query: 193 TCEIRMKGAFAQSSVEPQTTQPSRYCLR 220
C R + A + P+ + + C R
Sbjct: 327 KCVSRDRKAKTEDRPRPKEKRRVKKCKR 354
>gi|328768420|gb|EGF78466.1| hypothetical protein BATDEDRAFT_90401 [Batrachochytrium
dendrobatidis JAM81]
Length = 578
Score = 40.8 bits (94), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 23/87 (26%), Positives = 35/87 (40%), Gaps = 10/87 (11%)
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDF----------FGDKNC 187
A +NH C PNC FV ++ V+ L + G E+ Y + F K
Sbjct: 253 VAMLNHSCAPNCAFVTSQSGQMLVRTLTQVSPGTELCVGYIDLFTPRWERRGKLLTTKLF 312
Query: 188 LCECDTCEIRMKGAFAQSSVEPQTTQP 214
C C C R + ++++P T P
Sbjct: 313 WCTCIRCGPRAEETVVLTTLDPPTMDP 339
>gi|169619746|ref|XP_001803285.1| hypothetical protein SNOG_13071 [Phaeosphaeria nodorum SN15]
gi|160703895|gb|EAT79398.2| hypothetical protein SNOG_13071 [Phaeosphaeria nodorum SN15]
Length = 381
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 21/40 (52%), Positives = 24/40 (60%), Gaps = 1/40 (2%)
Query: 139 AYINHDCRPNCRFVATE-RDTACVKVLRDIEQGEEITCFY 177
A +NHDCRPN + E T V LRDI+ GEEIT Y
Sbjct: 186 AMMNHDCRPNAAYFWDEDMMTHYVHALRDIQPGEEITITY 225
>gi|170105942|ref|XP_001884183.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164640955|gb|EDR05218.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 369
Score = 40.8 bits (94), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 35/142 (24%), Positives = 56/142 (39%), Gaps = 7/142 (4%)
Query: 75 EGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLW 134
+ G I ATR + + L+G +A + E L + V ++
Sbjct: 196 DDTHGFSIRATRDLVAEEMLYPLIGLLASDSTAEHSHLSELRPHLSQVQEQAETEKPRVL 255
Query: 135 LGPAAYINHDCRP-NCRFVATERDTACVK-VLRDIEQGEEITCFYGEDFFGDKNCLCECD 192
+GP +INH C N +F+ + V + I+ G+E+ YGE +F D C C
Sbjct: 256 IGPLRFINHACNSFNAKFIPINSTLSFVVCTTKAIKSGQEVFVNYGEKYFEDLG-QCPCR 314
Query: 193 TCEIRMKGAFAQSSVEPQTTQP 214
C KG ++ T P
Sbjct: 315 DC----KGESSEDVRSEDATSP 332
>gi|71274753|ref|ZP_00651041.1| Nuclear protein SET [Xylella fastidiosa Dixon]
gi|170730005|ref|YP_001775438.1| hypothetical protein Xfasm12_0820 [Xylella fastidiosa M12]
gi|182681286|ref|YP_001829446.1| nuclear protein SET [Xylella fastidiosa M23]
gi|386084790|ref|YP_006001072.1| nuclear protein SET [Xylella fastidiosa subsp. fastidiosa GB514]
gi|417558283|ref|ZP_12209265.1| Protein containing SET domain [Xylella fastidiosa EB92.1]
gi|71164485|gb|EAO14199.1| Nuclear protein SET [Xylella fastidiosa Dixon]
gi|71729035|gb|EAO31163.1| Nuclear protein SET [Xylella fastidiosa Ann-1]
gi|71732543|gb|EAO34596.1| Nuclear protein SET [Xylella fastidiosa subsp. sandyi Ann-1]
gi|167964798|gb|ACA11808.1| conserved hypothetical protein [Xylella fastidiosa M12]
gi|182631396|gb|ACB92172.1| nuclear protein SET [Xylella fastidiosa M23]
gi|307579737|gb|ADN63706.1| nuclear protein SET [Xylella fastidiosa subsp. fastidiosa GB514]
gi|338179087|gb|EGO82051.1| Protein containing SET domain [Xylella fastidiosa EB92.1]
Length = 155
Score = 40.8 bits (94), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 35/77 (45%), Gaps = 17/77 (22%)
Query: 138 AAYINHDCRPNCRFVATE-------RDTACVKVLRDIEQGEEITCFYGEDFFGDKN---- 186
A +INH C PNC V E +D ++ +RDI GEE+T YG +++
Sbjct: 75 ARWINHSCSPNCEAVVEEDTGGNRRKDKIFIQAIRDIASGEELTYNYG-IVLAERHTARL 133
Query: 187 -----CLCECDTCEIRM 198
CLC D C M
Sbjct: 134 KKIWACLCGSDHCTHTM 150
>gi|374584404|ref|ZP_09657496.1| nuclear protein SET [Leptonema illini DSM 21528]
gi|373873265|gb|EHQ05259.1| nuclear protein SET [Leptonema illini DSM 21528]
Length = 299
Score = 40.8 bits (94), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 34/73 (46%), Gaps = 12/73 (16%)
Query: 133 LWLGP--------AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
L++GP A ++NH C PN F R + +RDI GEE+T Y
Sbjct: 70 LFMGPESLDAVEEAEFVNHSCDPNLGF----RGQVTLVAMRDIMPGEELTVDYAMSEARQ 125
Query: 185 KNCLCECDTCEIR 197
+N CEC + + R
Sbjct: 126 QNFACECGSRQCR 138
>gi|348500783|ref|XP_003437952.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Oreochromis
niloticus]
Length = 4872
Score = 40.8 bits (94), Expect = 0.64, Method: Composition-based stats.
Identities = 26/66 (39%), Positives = 30/66 (45%), Gaps = 4/66 (6%)
Query: 136 GPAAYINHDCRPNC--RFVATERD-TACVKVLRDIEQGEEITCFYG-EDFFGDKNCLCEC 191
G A YINH C PNC V ER + +R IE+GEE+ Y E G C C
Sbjct: 4803 GIARYINHSCAPNCVAEVVTFERGYKIIISCIRRIEKGEELCFNYQFESVEGQHKIACHC 4862
Query: 192 DTCEIR 197
E R
Sbjct: 4863 GAPECR 4868
>gi|367024877|ref|XP_003661723.1| SET1-like protein [Myceliophthora thermophila ATCC 42464]
gi|347008991|gb|AEO56478.1| SET1-like protein [Myceliophthora thermophila ATCC 42464]
Length = 1260
Score = 40.8 bits (94), Expect = 0.64, Method: Composition-based stats.
Identities = 42/126 (33%), Positives = 56/126 (44%), Gaps = 29/126 (23%)
Query: 90 KNDNIAYLVG-----CIAELTKEEEKRLLHTGL--------NDFSVMYSCRKNCAQLWLG 136
K+D I VG IAEL E R L +G+ +D +V+ + +K G
Sbjct: 1140 KDDMIIEYVGEEVRQQIAEL---REHRYLKSGIGSSYLFRIDDNTVIDATKKG------G 1190
Query: 137 PAAYINHDCRPNC--RFVATERDTACV-KVLRDIEQGEEITCFYG-EDFFGDKN---CLC 189
A +INH C PNC + + E V LRDI Q EE+T Y E G + CLC
Sbjct: 1191 IARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERELGSTDRIPCLC 1250
Query: 190 ECDTCE 195
C+
Sbjct: 1251 GTAACK 1256
>gi|116199379|ref|XP_001225501.1| hypothetical protein CHGG_07845 [Chaetomium globosum CBS 148.51]
gi|88179124|gb|EAQ86592.1| hypothetical protein CHGG_07845 [Chaetomium globosum CBS 148.51]
Length = 245
Score = 40.8 bits (94), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 36/68 (52%), Gaps = 11/68 (16%)
Query: 140 YINHDCRPNCRFVATERDTACVKVL---RDIEQGEEITCFYG--EDFFGDK-NCLCECDT 193
YINH C P+ F DTA + V+ + ++ G+E+T FY E + +CLC T
Sbjct: 77 YINHSCEPSLIF-----DTASMNVIAGPKGLQPGDELTFFYPSTEWYMAQPFDCLCGTPT 131
Query: 194 CEIRMKGA 201
C R+ GA
Sbjct: 132 CRGRIAGA 139
>gi|340506260|gb|EGR32441.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 479
Score = 40.8 bits (94), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 26/55 (47%), Gaps = 5/55 (9%)
Query: 129 NCAQLWLGPAAYINHDCRPNCRFVAT-----ERDTACVKVLRDIEQGEEITCFYG 178
N + +INH +PNCR T CVK ++DI GEE+T YG
Sbjct: 233 NSVVIMTPVVDWINHSFQPNCRVTGTYFEHENESYVCVKAIKDILPGEELTLNYG 287
>gi|312384477|gb|EFR29200.1| hypothetical protein AND_02075 [Anopheles darlingi]
Length = 5086
Score = 40.8 bits (94), Expect = 0.65, Method: Composition-based stats.
Identities = 36/130 (27%), Positives = 53/130 (40%), Gaps = 9/130 (6%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSC---RKNCAQLW 134
+G + A R EK+ + +G + E + + N M+ R A L
Sbjct: 4956 QGLGLYAARDLEKHTMVIEYIGEVIRTEVSELREKQYEAKNRGIYMFRLDEERVVDATLS 5015
Query: 135 LGPAAYINHDCRPNCRFVATERDTACVKVL---RDIEQGEEITCFYGEDFFGDK---NCL 188
G A YINH C PNC E D ++ R I +GEE++ Y D D +C+
Sbjct: 5016 GGLARYINHSCNPNCVTETVEVDRELRIIIFAKRRINRGEELSYDYKFDIEDDAHKISCM 5075
Query: 189 CECDTCEIRM 198
C C+ M
Sbjct: 5076 CGAPNCKKWM 5085
>gi|170105116|ref|XP_001883771.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164641406|gb|EDR05667.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 173
Score = 40.8 bits (94), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 37/68 (54%), Gaps = 5/68 (7%)
Query: 139 AYINHDCRPNCRFVATERDTAC--VKVLRDIEQGEEITCFY-GEDFFGDKNCLCECDT-- 193
Y+NH C PN F + DT+ ++ LR IE GE +T FY ++ D+ C+C T
Sbjct: 73 VYVNHSCEPNVAFDLSSPDTSNWHLRALRRIEPGETLTFFYPSTEWEMDQPFDCQCGTPS 132
Query: 194 CEIRMKGA 201
C ++GA
Sbjct: 133 CLGTIQGA 140
>gi|57997471|emb|CAE45854.2| hypothetical protein [Homo sapiens]
Length = 116
Score = 40.8 bits (94), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 35/71 (49%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATERD-TACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ER + R I++GEE+ Y DF D++
Sbjct: 42 ATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEELCYDYKFDFEDDQHK 101
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 102 IPCHCGAVNCR 112
>gi|324523879|gb|ADY48320.1| Histone-lysine N-methyltransferase ASH1L, partial [Ascaris suum]
Length = 287
Score = 40.8 bits (94), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 38/86 (44%), Gaps = 8/86 (9%)
Query: 116 GLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTAC---VKVLRDIEQGEE 172
G+ + Y +NC + + +NH C PNC+ A D V L+DI G+E
Sbjct: 57 GMQITNEFYVDARNCGNM----SRSVNHSCEPNCKVNAVTVDGVYRLKVSALKDIAAGDE 112
Query: 173 ITCFYGEDFF-GDKNCLCECDTCEIR 197
+T YG + + G C C T R
Sbjct: 113 LTYDYGTELWSGMVGMRCRCGTAGCR 138
>gi|303279855|ref|XP_003059220.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459056|gb|EEH56352.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 591
Score = 40.8 bits (94), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 20/40 (50%)
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
A+ INH C PN D A LR IE GEEIT Y
Sbjct: 246 ASAINHACAPNAAVAVAGEDVATAYALRAIEPGEEITVSY 285
>gi|158287424|ref|XP_309452.4| AGAP011192-PA [Anopheles gambiae str. PEST]
gi|157019644|gb|EAA05242.4| AGAP011192-PA [Anopheles gambiae str. PEST]
Length = 2808
Score = 40.8 bits (94), Expect = 0.68, Method: Composition-based stats.
Identities = 36/130 (27%), Positives = 53/130 (40%), Gaps = 9/130 (6%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSC---RKNCAQLW 134
+G + A R EK+ + +G + E + + N M+ R A L
Sbjct: 2678 QGLGLYAARDLEKHTMVIEYIGEVIRTEVSEMREKQYEARNRGIYMFRLDEERVVDATLS 2737
Query: 135 LGPAAYINHDCRPNCRFVATERDTACVKVL---RDIEQGEEITCFYGEDFFGDK---NCL 188
G A YINH C PNC E D ++ R I +GEE++ Y D D +C+
Sbjct: 2738 GGLARYINHSCNPNCVTEIVEVDRELRIIIFAKRRINRGEELSYDYKFDIEDDAHKISCM 2797
Query: 189 CECDTCEIRM 198
C C+ M
Sbjct: 2798 CGAPNCKKWM 2807
>gi|357023315|ref|ZP_09085517.1| hypothetical protein MEA186_01563 [Mesorhizobium amorphae
CCNWGS0123]
gi|355544737|gb|EHH13811.1| hypothetical protein MEA186_01563 [Mesorhizobium amorphae
CCNWGS0123]
Length = 174
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 32/90 (35%), Positives = 42/90 (46%), Gaps = 14/90 (15%)
Query: 105 TKEEEKRLLHTGLNDFSVMYSCRKNCAQLWL--GPAAYINHDCRPNCRFVATERDTACVK 162
T E E + L+ +S Y R+N + A Y+NH PNC + E A
Sbjct: 90 TYEGESGPVKGYLDRYS--YPDRRNPGYIVFEADDARYMNHADEPNCDVSSPEETYA--- 144
Query: 163 VLRDIEQGEEITCFY------GEDFFGDKN 186
LRDI GEE+TC Y G DF GD++
Sbjct: 145 -LRDIAPGEELTCDYNHFFDSGFDFLGDRH 173
>gi|119574357|gb|EAW53972.1| hCG1990594, isoform CRA_b [Homo sapiens]
Length = 4884
Score = 40.8 bits (94), Expect = 0.69, Method: Composition-based stats.
Identities = 32/102 (31%), Positives = 44/102 (43%), Gaps = 6/102 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4781 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4840
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEI 173
GPA YINH C PNC V ER + R I++GEE+
Sbjct: 4841 GGPARYINHSCAPNCVAEVVTFERGHKIIISSSRRIQKGEEV 4882
>gi|392953197|ref|ZP_10318751.1| nuclear protein SET [Hydrocarboniphaga effusa AP103]
gi|391858712|gb|EIT69241.1| nuclear protein SET [Hydrocarboniphaga effusa AP103]
Length = 161
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 29/48 (60%), Gaps = 7/48 (14%)
Query: 138 AAYINHDCRPNCRFVATE-------RDTACVKVLRDIEQGEEITCFYG 178
A +INH C PNC+ V E +D ++ LR+I++GEE+T YG
Sbjct: 77 ARWINHSCDPNCQAVLYENQNGDRKKDRVVIETLREIKRGEELTYDYG 124
>gi|332023034|gb|EGI63299.1| Histone-lysine N-methyltransferase trr [Acromyrmex echinatior]
Length = 3474
Score = 40.8 bits (94), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 41/127 (32%), Positives = 58/127 (45%), Gaps = 11/127 (8%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCI--AEL--TKEEEKRLLHTGLNDFSVMYSCRKNCAQL 133
+G + A R EK+ + +G I +EL T+E++ + G+ F + R A L
Sbjct: 3344 QGLGLYAARDLEKHTMVIEYIGEIIRSELAETREKQYEARNRGIYMFRLDEE-RVIDATL 3402
Query: 134 WLGPAAYINHDCRPNC--RFVATERD-TACVKVLRDIEQGEEITCFYGEDFFGDKN---C 187
G A YINH C PNC V ERD + R I +GEE+ Y D D++ C
Sbjct: 3403 CGGLARYINHSCNPNCVAEIVEVERDFRIIIFAKRRISRGEELAYDYKFDIEDDQHKIAC 3462
Query: 188 LCECDTC 194
C C
Sbjct: 3463 ACGAPNC 3469
>gi|302144054|emb|CBI23159.3| unnamed protein product [Vitis vinifera]
Length = 503
Score = 40.8 bits (94), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 26/45 (57%), Gaps = 2/45 (4%)
Query: 133 LWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
LW+ PA +INH C PN R + D V RD++ GEEIT Y
Sbjct: 294 LWILPA-FINHSCNPNARRLHVG-DNVIVHTSRDVKAGEEITFAY 336
>gi|121713188|ref|XP_001274205.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119402358|gb|EAW12779.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 448
Score = 40.8 bits (94), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 29/58 (50%), Gaps = 3/58 (5%)
Query: 140 YINHDCRPNCRFV---ATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTC 194
+I+H C P F +R V+V+RDI EEIT YG ++ + CLC C
Sbjct: 384 FISHSCDPLTIFTRRTIGKRTMTVVEVIRDISPFEEITVDYGRAYWKSRTCLCGEPRC 441
>gi|225444416|ref|XP_002265832.1| PREDICTED: uncharacterized protein LOC100253788 [Vitis vinifera]
Length = 550
Score = 40.8 bits (94), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 26/45 (57%), Gaps = 2/45 (4%)
Query: 133 LWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
LW+ PA +INH C PN R + D V RD++ GEEIT Y
Sbjct: 341 LWILPA-FINHSCNPNARRLHVG-DNVIVHTSRDVKAGEEITFAY 383
>gi|194364781|ref|YP_002027391.1| nuclear protein SET [Stenotrophomonas maltophilia R551-3]
gi|194347585|gb|ACF50708.1| nuclear protein SET [Stenotrophomonas maltophilia R551-3]
Length = 155
Score = 40.8 bits (94), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 26/48 (54%), Gaps = 7/48 (14%)
Query: 138 AAYINHDCRPNCRFVATER-------DTACVKVLRDIEQGEEITCFYG 178
A +INH C PNC V E D ++ LRDI+ GEE+T YG
Sbjct: 75 ARWINHSCDPNCEAVIEEDEDGDSRGDKVFIEALRDIQAGEELTYNYG 122
>gi|357416835|ref|YP_004929855.1| nuclear protein SET [Pseudoxanthomonas spadix BD-a59]
gi|355334413|gb|AER55814.1| nuclear protein SET [Pseudoxanthomonas spadix BD-a59]
Length = 155
Score = 40.4 bits (93), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 33/69 (47%), Gaps = 13/69 (18%)
Query: 117 LNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATE-------RDTACVKVLRDIEQ 169
LND V+ R + W INH C PNC + E +D ++ LR+I+
Sbjct: 60 LNDKYVIDGTRGGNSARW------INHSCDPNCEALIEEHDGPDRRKDKVVIEALRNIKP 113
Query: 170 GEEITCFYG 178
GEE+T YG
Sbjct: 114 GEELTYNYG 122
>gi|15838079|ref|NP_298767.1| hypothetical protein XF1478 [Xylella fastidiosa 9a5c]
gi|28198603|ref|NP_778917.1| hypothetical protein PD0695 [Xylella fastidiosa Temecula1]
gi|9106502|gb|AAF84287.1|AE003977_10 hypothetical protein XF_1478 [Xylella fastidiosa 9a5c]
gi|28056687|gb|AAO28566.1| conserved hypothetical protein [Xylella fastidiosa Temecula1]
Length = 139
Score = 40.4 bits (93), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 35/77 (45%), Gaps = 17/77 (22%)
Query: 138 AAYINHDCRPNCRFVATE-------RDTACVKVLRDIEQGEEITCFYGEDFFGDKN---- 186
A +INH C PNC V E +D ++ +RDI GEE+T YG +++
Sbjct: 59 ARWINHSCSPNCEAVVEEDTGGNRRKDKIFIQAIRDIASGEELTYNYGI-VLAERHTARL 117
Query: 187 -----CLCECDTCEIRM 198
CLC D C M
Sbjct: 118 KKIWACLCGSDHCTHTM 134
>gi|406863955|gb|EKD17001.1| lysine methyltransferase [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 448
Score = 40.4 bits (93), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 45/148 (30%), Positives = 65/148 (43%), Gaps = 27/148 (18%)
Query: 60 IQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAY-LVGCIAELTKEEEKRLLHTGLN 118
+++LPE + R SL+ R +R ++ ++ Y L G E E LL G
Sbjct: 183 MRHLPETE---RLSLQ-----REAVSRLPRRSQSLFYDLAGHFG--VDETEDVLLTNG-- 230
Query: 119 DFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERD--TACVKVLRDIEQGEEITCF 176
FS ++ + + AA +NHDCRPN RF A +R T V R I G+EIT
Sbjct: 231 -FSAAVGKSQHGFGIVVPEAARLNHDCRPNARF-AFDRTSLTHRVHATRAIHPGDEITVS 288
Query: 177 Y--GEDFFGDKNCL--------CECDTC 194
Y G+ F + + C C C
Sbjct: 289 YIDGKQDFAARQAVIHAHWGFQCRCSLC 316
>gi|407409897|gb|EKF32551.1| hypothetical protein MOQ_003596 [Trypanosoma cruzi marinkellei]
Length = 941
Score = 40.4 bits (93), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 26/46 (56%), Gaps = 9/46 (19%)
Query: 138 AAYINHDCRPNCRFVATERDTAC---------VKVLRDIEQGEEIT 174
A++INH C PN F+ +E + AC V L DIE GEEIT
Sbjct: 548 ASFINHSCSPNALFIFSEEEGACCSSGGGVLRVVALCDIEPGEEIT 593
>gi|385810419|ref|YP_005846815.1| SET domain-containing protein [Ignavibacterium album JCM 16511]
gi|383802467|gb|AFH49547.1| SET domain protein [Ignavibacterium album JCM 16511]
Length = 137
Score = 40.4 bits (93), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 22/62 (35%), Positives = 32/62 (51%), Gaps = 3/62 (4%)
Query: 140 YINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEIRMK 199
YINH C NC E + +DIE GEE+T YG D ++ C C +C+ +++
Sbjct: 78 YINHSCNCNCYIDEDESGNLILVASKDIEAGEELTIDYGYDEIYEE---CSCKSCDNKLE 134
Query: 200 GA 201
A
Sbjct: 135 SA 136
>gi|295663144|ref|XP_002792125.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226279300|gb|EEH34866.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 816
Score = 40.4 bits (93), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 11/129 (8%)
Query: 76 GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLND--FSVMYSCRKNCAQL 133
+RG + + R +E N I G I KE E+R+ N+ + +MY +
Sbjct: 434 ADRGYGVRSNRTFEPNQIIVEYTGEIV-TQKECERRMRTVYKNNECYYLMYFDQNMIIDA 492
Query: 134 WLGP-AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN-- 186
G A ++NH C PNC VA + A I GEE+T Y D + KN
Sbjct: 493 TRGSIARFVNHSCEPNCEMEKWTVAGKPRMALFAGKNGITTGEELTYDYNFDPYSQKNVQ 552
Query: 187 -CLCECDTC 194
C C +TC
Sbjct: 553 ECRCGAETC 561
>gi|350630881|gb|EHA19253.1| hypothetical protein ASPNIDRAFT_56859 [Aspergillus niger ATCC 1015]
Length = 1101
Score = 40.4 bits (93), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 53/117 (45%), Gaps = 13/117 (11%)
Query: 91 NDNIAYLVG--CIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYINHDC 145
ND I VG ++ E+R L +G+ S ++ +N A G A +INH C
Sbjct: 982 NDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLFRIDENTVIDATKRGGIARFINHSC 1040
Query: 146 RPNCRFVATERDTA---CVKVLRDIEQGEEITCFYGEDFFGDKN----CLCECDTCE 195
PNC + D + + LRDIE+ EE+T Y + D + CLC C+
Sbjct: 1041 TPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGCK 1097
>gi|162458873|ref|NP_001105199.1| SET domain protein 123 [Zea mays]
gi|27466907|gb|AAO12860.1| SET domain protein 123 [Zea mays]
gi|194705040|gb|ACF86604.1| unknown [Zea mays]
gi|195624178|gb|ACG33919.1| SET domain protein 123 [Zea mays]
gi|414871456|tpg|DAA50013.1| TPA: SET domain protein 123 [Zea mays]
Length = 303
Score = 40.4 bits (93), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 21/61 (34%), Positives = 30/61 (49%)
Query: 117 LNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCF 176
L+ + SC + ++ NHDC PN V + A +K LRDIE+GEE+
Sbjct: 212 LSSAAASVSCDSSVGNAVYMLPSFYNHDCDPNAHIVWLQNADAKLKALRDIEEGEELCIC 271
Query: 177 Y 177
Y
Sbjct: 272 Y 272
>gi|414871461|tpg|DAA50018.1| TPA: hypothetical protein ZEAMMB73_470058 [Zea mays]
Length = 173
Score = 40.4 bits (93), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 1/45 (2%)
Query: 133 LWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
+++ P+ Y NHDC PN V + A +K LRDIE+GEE+ Y
Sbjct: 99 VYMLPSFY-NHDCDPNAHIVWLQNADAKLKALRDIEEGEELCICY 142
>gi|350420879|ref|XP_003492658.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
isoform 1 [Bombus impatiens]
Length = 1230
Score = 40.4 bits (93), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 49/107 (45%), Gaps = 10/107 (9%)
Query: 140 YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD--KNCLCECDTC 194
++NH C PNC ++ + L DIE+GEE+T Y G+ K CLC C
Sbjct: 911 FMNHSCSPNCETQKWTVNGDTRIGLFALCDIERGEELTFNYNLACDGETRKPCLCGAPNC 970
Query: 195 EIRMKGAFAQSSVEPQTTQPSRYCLRDTDM-RLNKRKLHKKLNRLLL 240
G +PQ T PS ++ +++K K K+L + +L
Sbjct: 971 ----SGFIGLKVQKPQVTTPSTTGIQQKKFDKIDKMKRQKRLRKHVL 1013
>gi|171692915|ref|XP_001911382.1| hypothetical protein [Podospora anserina S mat+]
gi|170946406|emb|CAP73207.1| unnamed protein product [Podospora anserina S mat+]
Length = 1083
Score = 40.4 bits (93), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 56/126 (44%), Gaps = 29/126 (23%)
Query: 90 KNDNIAYLVG-----CIAELTKEEEKRLLHTGL--------NDFSVMYSCRKNCAQLWLG 136
K+D I VG IAEL E R L +G+ +D +V+ + +K G
Sbjct: 963 KDDMIIEYVGEEVRQVIAEL---REARYLKSGIGSSYLFRIDDNTVIDATKKG------G 1013
Query: 137 PAAYINHDCRPNC--RFVATERDTACV-KVLRDIEQGEEITCFYG-EDFFGDKN---CLC 189
A +INH C PNC + + E V LRDI Q EE+T Y E G + CLC
Sbjct: 1014 IARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFEREIGATDRIPCLC 1073
Query: 190 ECDTCE 195
C+
Sbjct: 1074 GTAACK 1079
>gi|119185079|ref|XP_001243361.1| hypothetical protein CIMG_07257 [Coccidioides immitis RS]
gi|121936913|sp|Q1DR06.1|SET1_COCIM RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|392866240|gb|EAS28850.2| histone-lysine N-methyltransferase, H3 lysine-4 specific
[Coccidioides immitis RS]
Length = 1271
Score = 40.4 bits (93), Expect = 0.84, Method: Composition-based stats.
Identities = 40/149 (26%), Positives = 62/149 (41%), Gaps = 13/149 (8%)
Query: 59 RIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVG--CIAELTKEEEKRLLHTG 116
R L + K R++ + A ND I VG ++ E+R L +G
Sbjct: 1120 RFNQLKKRKKPVRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERRYLKSG 1179
Query: 117 LNDFSVMYSCRKNC---AQLWLGPAAYINHDCRPNCRFVATERDTA---CVKVLRDIEQG 170
+ S ++ +N A G A +INH C PNC + D + + LRDI++
Sbjct: 1180 IGS-SYLFRIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIDRD 1238
Query: 171 EEITCFYGEDFFGDKN----CLCECDTCE 195
EE+T Y + D + CLC C+
Sbjct: 1239 EELTYDYKFEREWDSDDRIPCLCGSAGCK 1267
>gi|414871458|tpg|DAA50015.1| TPA: hypothetical protein ZEAMMB73_470058 [Zea mays]
Length = 299
Score = 40.4 bits (93), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 21/61 (34%), Positives = 30/61 (49%)
Query: 117 LNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCF 176
L+ + SC + ++ NHDC PN V + A +K LRDIE+GEE+
Sbjct: 208 LSSAAASVSCDSSVGNAVYMLPSFYNHDCDPNAHIVWLQNADAKLKALRDIEEGEELCIC 267
Query: 177 Y 177
Y
Sbjct: 268 Y 268
>gi|171679842|ref|XP_001904867.1| hypothetical protein [Podospora anserina S mat+]
gi|170939547|emb|CAP64774.1| unnamed protein product [Podospora anserina S mat+]
Length = 581
Score = 40.4 bits (93), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 37/86 (43%), Gaps = 12/86 (13%)
Query: 140 YINHDCRPNCRFVATERDTA------CVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDT 193
+INH CRPN VA +T K LR IE+G EI YG +F C C+
Sbjct: 252 FINHHCRPN---VACSPETIGGIRVLAFKALRPIEEGHEIYVNYGRSYFDKFKMQCRCNL 308
Query: 194 CEIRMKGAFAQSSVEPQTTQPSRYCL 219
I K A+ + T +P L
Sbjct: 309 YPIAHK---AEGHADTATNKPRTRAL 331
>gi|367054834|ref|XP_003657795.1| hypothetical protein THITE_2123835 [Thielavia terrestris NRRL 8126]
gi|347005061|gb|AEO71459.1| hypothetical protein THITE_2123835 [Thielavia terrestris NRRL 8126]
Length = 271
Score = 40.4 bits (93), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 34/68 (50%), Gaps = 11/68 (16%)
Query: 140 YINHDCRPNCRFVATERDTACVKVL---RDIEQGEEITCFYGE---DFFGDKNCLCECDT 193
YINH C P+ F DTA + ++ + ++ G+E+T FY D +C C T
Sbjct: 77 YINHSCEPSLIF-----DTANMVIIAGPKGLQPGDELTFFYPSTEWDMAQPFDCFCGAPT 131
Query: 194 CEIRMKGA 201
C R+ GA
Sbjct: 132 CRGRIAGA 139
>gi|378725927|gb|EHY52386.1| histone-lysine N-methyltransferase SETD1 [Exophiala dermatitidis
NIH/UT8656]
Length = 1277
Score = 40.4 bits (93), Expect = 0.87, Method: Composition-based stats.
Identities = 26/69 (37%), Positives = 34/69 (49%), Gaps = 11/69 (15%)
Query: 136 GPAAYINHDCRPNC-----RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN---- 186
G A +INH C PNC R T+R + LRDIE+ EE+T Y + D +
Sbjct: 1207 GIARFINHSCSPNCTAKIIRVGGTKR--IVIYALRDIEKDEELTYDYKFEREIDSDDRIP 1264
Query: 187 CLCECDTCE 195
CLC C+
Sbjct: 1265 CLCGSAVCK 1273
>gi|242216987|ref|XP_002474297.1| predicted protein [Postia placenta Mad-698-R]
gi|220726589|gb|EED80534.1| predicted protein [Postia placenta Mad-698-R]
Length = 1061
Score = 40.4 bits (93), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 34/67 (50%), Gaps = 13/67 (19%)
Query: 142 NHDCRPNCRFVATERDTACVKV--LRDIEQGEEITCFYGEDFF----------GDKNCLC 189
NHDC PN F +E +T C ++ + DIE+GEEIT Y + G N +C
Sbjct: 875 NHDCTPNAHFHFSE-NTCCGRLVAMHDIEEGEEITVRYVDSLAPREERQSLLRGRYNFVC 933
Query: 190 ECDTCEI 196
C TC +
Sbjct: 934 TCRTCSL 940
>gi|432114497|gb|ELK36345.1| Histone-lysine N-methyltransferase MLL2 [Myotis davidii]
Length = 1545
Score = 40.4 bits (93), Expect = 0.88, Method: Composition-based stats.
Identities = 40/132 (30%), Positives = 59/132 (44%), Gaps = 12/132 (9%)
Query: 70 YRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKN 129
Y + +RGA +V K + G +L +EE + + +N+ V+
Sbjct: 1418 YEEQVWADRGAIVVHGPHSAKEQGAVW--GSGGKLWEEENRGIYMFRINNEHVID----- 1470
Query: 130 CAQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 1471 -ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQH 1529
Query: 187 CL-CECDTCEIR 197
+ C C R
Sbjct: 1530 KIPCHCGAWNCR 1541
>gi|350420881|ref|XP_003492659.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
isoform 2 [Bombus impatiens]
Length = 1239
Score = 40.4 bits (93), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 49/107 (45%), Gaps = 10/107 (9%)
Query: 140 YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD--KNCLCECDTC 194
++NH C PNC ++ + L DIE+GEE+T Y G+ K CLC C
Sbjct: 920 FMNHSCSPNCETQKWTVNGDTRIGLFALCDIERGEELTFNYNLACDGETRKPCLCGAPNC 979
Query: 195 EIRMKGAFAQSSVEPQTTQPSRYCLRDTDM-RLNKRKLHKKLNRLLL 240
G +PQ T PS ++ +++K K K+L + +L
Sbjct: 980 ----SGFIGLKVQKPQVTTPSTTGIQQKKFDKIDKMKRQKRLRKHVL 1022
>gi|340519773|gb|EGR50011.1| predicted protein [Trichoderma reesei QM6a]
Length = 654
Score = 40.4 bits (93), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 30/64 (46%), Gaps = 7/64 (10%)
Query: 138 AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN---CLCE 190
A ++NH C PNCR V+ + A R I GEE+T Y D F KN CLC
Sbjct: 533 ARFVNHSCNPNCRMIKWIVSGQPRMALFAGDRPIMTGEELTYDYNFDPFSAKNVQKCLCG 592
Query: 191 CDTC 194
C
Sbjct: 593 SPNC 596
>gi|223935219|ref|ZP_03627137.1| nuclear protein SET [bacterium Ellin514]
gi|223896103|gb|EEF62546.1| nuclear protein SET [bacterium Ellin514]
Length = 160
Score = 40.4 bits (93), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 29/60 (48%), Gaps = 3/60 (5%)
Query: 137 PAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDF--FGDKNCLCECDTC 194
PA ++NH C PNC E + RDI+QGEEIT Y D + + C C C
Sbjct: 72 PARFLNHSCAPNCE-AQDEEGRIWIVASRDIKQGEEITFNYNYDLEDYKEHPCCCGAPKC 130
>gi|319784585|ref|YP_004144061.1| nuclear protein SET [Mesorhizobium ciceri biovar biserrulae
WSM1271]
gi|317170473|gb|ADV14011.1| nuclear protein SET [Mesorhizobium ciceri biovar biserrulae
WSM1271]
Length = 135
Score = 40.4 bits (93), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 32/63 (50%), Gaps = 11/63 (17%)
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY------GEDFFGDKNCLCEC 191
A Y+NH PNC + E A LRDI GEE+TC Y G DF GD++ L
Sbjct: 78 ARYMNHADEPNCDVSSPEESFA----LRDIAPGEELTCNYNHFFETGFDFLGDRH-LSSA 132
Query: 192 DTC 194
D+
Sbjct: 133 DSI 135
>gi|317037780|ref|XP_001399137.2| histone-lysine N-methyltransferase, H3 lysine-4 specific [Aspergillus
niger CBS 513.88]
Length = 1239
Score = 40.4 bits (93), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 53/117 (45%), Gaps = 13/117 (11%)
Query: 91 NDNIAYLVG--CIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYINHDC 145
ND I VG ++ E+R L +G+ S ++ +N A G A +INH C
Sbjct: 1120 NDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLFRIDENTVIDATKRGGIARFINHSC 1178
Query: 146 RPNCRFVATERDTA---CVKVLRDIEQGEEITCFYGEDFFGDKN----CLCECDTCE 195
PNC + D + + LRDIE+ EE+T Y + D + CLC C+
Sbjct: 1179 TPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGCK 1235
>gi|302801428|ref|XP_002982470.1| hypothetical protein SELMODRAFT_421873 [Selaginella moellendorffii]
gi|300149569|gb|EFJ16223.1| hypothetical protein SELMODRAFT_421873 [Selaginella moellendorffii]
Length = 1285
Score = 40.4 bits (93), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 47/102 (46%), Gaps = 16/102 (15%)
Query: 103 ELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATER----DT 158
E ++ +K LN V+ +CRK LG +INH C PNC+ TE+
Sbjct: 620 EYARQRQKHFYFMTLNSSEVIDACRKGN----LG--RFINHSCEPNCQ---TEKWCVNGE 670
Query: 159 ACVKV--LRDIEQGEEITCFYG-EDFFGDKNCLCECDTCEIR 197
C+ + +RD+ + EEIT Y E +G C C + R
Sbjct: 671 ICIGLFAIRDVAKNEEITFNYNFERLYGAAAKKCHCGSAHCR 712
>gi|308482418|ref|XP_003103412.1| hypothetical protein CRE_28789 [Caenorhabditis remanei]
gi|308259833|gb|EFP03786.1| hypothetical protein CRE_28789 [Caenorhabditis remanei]
Length = 645
Score = 40.4 bits (93), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 37/150 (24%), Positives = 58/150 (38%), Gaps = 23/150 (15%)
Query: 77 NRGARIVATRKWEKNDNIAYLVG-CIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWL 135
+RG + AT + N+ VG I E+ K + +++ + + Y+ R ++
Sbjct: 445 DRGYGVFATENIKSETNLVEYVGEVIDEVEKNKRLEIINASGDLEASHYNMRMKKVHFYM 504
Query: 136 GP------AAYINHDCRPNCR------FVATERDTAC---------VKVLRDIEQGEEIT 174
A Y+NH C PN +V + + V+ R I + EEIT
Sbjct: 505 DSTRAGNIARYLNHSCEPNAEVLEVGVYVPNQSKSKTKLKFEPRILVRTTRPIRKNEEIT 564
Query: 175 CFYGEDFFGDKNCLCECDTCEIRMKGAFAQ 204
Y D + CLC C RMK Q
Sbjct: 565 IDYDMDMV-NHVCLCGSSKCSGRMKAGAVQ 593
>gi|195404288|ref|XP_002060445.1| GJ19853 [Drosophila virilis]
gi|194156330|gb|EDW71514.1| GJ19853 [Drosophila virilis]
Length = 1720
Score = 40.4 bits (93), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 51/126 (40%), Gaps = 9/126 (7%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSC---RKNCAQLW 134
+G + A R EK+ I +G + E + + N M+ R A L
Sbjct: 1590 QGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQYESKNRGIYMFRLDEDRVVDATLS 1649
Query: 135 LGPAAYINHDCRPNC--RFVATERDTACVKVL-RDIEQGEEITCFYGEDFFGDKN---CL 188
G A YINH C PNC V +RD + R I +GEE++ Y D D + C
Sbjct: 1650 GGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDDAHKIPCA 1709
Query: 189 CECDTC 194
C C
Sbjct: 1710 CGAPNC 1715
>gi|308811675|ref|XP_003083145.1| Predicted histone tail methylase containing SET domain (ISS)
[Ostreococcus tauri]
gi|116055024|emb|CAL57420.1| Predicted histone tail methylase containing SET domain (ISS)
[Ostreococcus tauri]
Length = 459
Score = 40.4 bits (93), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 32/68 (47%), Gaps = 13/68 (19%)
Query: 141 INHDCRPNCRFVATERDT--ACVKVLR-DIEQGEEITCFY----------GEDFFGDKNC 187
NHDC PN + E+D ACV V R DI +GEEIT Y +D D
Sbjct: 382 FNHDCDPNAEPMKGEQDITGACVIVARKDIAEGEEITISYLDDDSKSRDERQDALADYGF 441
Query: 188 LCECDTCE 195
+C C CE
Sbjct: 442 VCRCARCE 449
>gi|348681060|gb|EGZ20876.1| hypothetical protein PHYSODRAFT_313346 [Phytophthora sojae]
Length = 494
Score = 40.4 bits (93), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 24/37 (64%)
Query: 141 INHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
INH C+PNC + A V+ LRDI++G+EIT Y
Sbjct: 218 INHSCQPNCTWSNAGDGIAEVRALRDIKEGDEITLSY 254
>gi|358385026|gb|EHK22623.1| hypothetical protein TRIVIDRAFT_212935 [Trichoderma virens Gv29-8]
Length = 738
Score = 40.4 bits (93), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 30/64 (46%), Gaps = 7/64 (10%)
Query: 138 AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN---CLCE 190
A ++NH C PNCR V+ + A R I GEE+T Y D F KN CLC
Sbjct: 521 ARFVNHSCNPNCRMIKWIVSGQPRMALFAGDRPIMTGEELTYDYNFDPFSAKNVQKCLCG 580
Query: 191 CDTC 194
C
Sbjct: 581 QPNC 584
>gi|378733973|gb|EHY60432.1| histone-lysine N-methyltransferase ASH1L [Exophiala dermatitidis
NIH/UT8656]
Length = 780
Score = 40.4 bits (93), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 58/129 (44%), Gaps = 13/129 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEE-EKRL--LHTGLNDFSVMYSCRKNCAQL 133
+RG + + R +E N I G I +T+EE E+R+ ++ G + +M +
Sbjct: 471 DRGYGVRSNRTFEPNQIIVEYTGEI--ITQEECERRMNTMYKGNECYYLMLFDQNMIIDA 528
Query: 134 WLGPAA-YINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN-- 186
G A ++NH C PNCR V + A R I GEE+T Y D + KN
Sbjct: 529 TRGSIARFVNHSCAPNCRMEKWTVNGKPRMALFAGDRGIMTGEELTYDYNFDPYSQKNVQ 588
Query: 187 -CLCECDTC 194
C C + C
Sbjct: 589 VCRCGAENC 597
>gi|296805347|ref|XP_002843498.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
gi|238844800|gb|EEQ34462.1| histone-lysine N-methyltransferase [Arthroderma otae CBS 113480]
Length = 1344
Score = 40.0 bits (92), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 53/117 (45%), Gaps = 13/117 (11%)
Query: 91 NDNIAYLVG--CIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYINHDC 145
ND I VG ++ E+R L +G+ S ++ +N A G A +INH C
Sbjct: 1225 NDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLFRIDENTVIDATKHGGIARFINHSC 1283
Query: 146 RPNCRFVATERDTA---CVKVLRDIEQGEEITCFYGEDFFGDKN----CLCECDTCE 195
PNC + D + + LRDIE+ EE+T Y + D + CLC C+
Sbjct: 1284 TPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGCK 1340
>gi|255945051|ref|XP_002563293.1| Pc20g07700 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211588028|emb|CAP86099.1| Pc20g07700 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 788
Score = 40.0 bits (92), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 40/130 (30%), Positives = 59/130 (45%), Gaps = 11/130 (8%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEE-EKRLLHTGLND--FSVMYSCRKNCAQL 133
+RG + + R +E N I G I LT+ E EKR+ N+ + +MY +
Sbjct: 450 DRGYGVRSNRSFEPNQIIVEYTGEI--LTQLECEKRMRTVYKNNDCYYLMYFDQNMIIDA 507
Query: 134 WLGP-AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCL 188
G A ++NH C PNCR VA + A R + GEE++ Y D + +KN
Sbjct: 508 TRGSIARFVNHACEPNCRMEKWTVAGKPRMALFAGDRGVSTGEELSYDYNFDPYSNKNVQ 567
Query: 189 -CECDTCEIR 197
C C + R
Sbjct: 568 ECRCGSANCR 577
>gi|408823304|ref|ZP_11208194.1| nuclear protein SET [Pseudomonas geniculata N1]
Length = 155
Score = 40.0 bits (92), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 26/48 (54%), Gaps = 7/48 (14%)
Query: 138 AAYINHDCRPNCRFVATER-------DTACVKVLRDIEQGEEITCFYG 178
A +INH C PNC V E D ++ LRDI+ GEE+T YG
Sbjct: 75 ARWINHSCDPNCEAVIEEDEDGDSRGDKVFIEALRDIKAGEELTYNYG 122
>gi|328725324|ref|XP_003248429.1| PREDICTED: hypothetical protein LOC100165877 isoform 2
[Acyrthosiphon pisum]
Length = 1384
Score = 40.0 bits (92), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 42/141 (29%), Positives = 59/141 (41%), Gaps = 19/141 (13%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAE-------LTKEEEKRLLH-TGLNDFSVMYSCRK 128
N+G I A K I+ VG + + LTK+E K L + LN ++ S K
Sbjct: 515 NKGYGICAVEDIPKGALISEYVGEVIDYNEMCNRLTKKEYKNLNYMVQLNPDEIIDSTSK 574
Query: 129 NCAQLWLGPAAYINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
+INH C PN ++ + R IE+GEEIT Y FGD
Sbjct: 575 GNV------TRFINHSCDPNSVGEKWHVLGQSRIGFFSTRHIEKGEEITFDYSFQIFGDG 628
Query: 186 NCLCECDTCEIRMKGAFAQSS 206
+C C + + R G +SS
Sbjct: 629 AQICYCGSSKCR--GYINKSS 647
>gi|147810952|emb|CAN74411.1| hypothetical protein VITISV_025097 [Vitis vinifera]
Length = 588
Score = 40.0 bits (92), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 26/45 (57%), Gaps = 2/45 (4%)
Query: 133 LWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
LW+ PA +INH C PN R + D V RD++ GEEIT Y
Sbjct: 379 LWILPA-FINHSCNPNARRLHVG-DHVIVHASRDVKAGEEITFAY 421
>gi|428170891|gb|EKX39812.1| hypothetical protein GUITHDRAFT_143201 [Guillardia theta CCMP2712]
Length = 496
Score = 40.0 bits (92), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 29/72 (40%), Positives = 36/72 (50%), Gaps = 17/72 (23%)
Query: 139 AYINHDCRPNCRFVA-TERDTACVKVLRDIEQGEEITCFY-GEDFFGD------------ 184
A +NH CRPN F + TE + +R IE+GEE+T Y GED GD
Sbjct: 169 AKVNHSCRPNVMFSSQTEDGKLRLIAMRRIERGEEVTFSYLGED--GDVMSREERRERMR 226
Query: 185 -KNCLCECDTCE 195
K+ LC C CE
Sbjct: 227 GKDFLCSCARCE 238
>gi|326477398|gb|EGE01408.1| histone-lysine N-methyltransferase [Trichophyton equinum CBS 127.97]
Length = 1331
Score = 40.0 bits (92), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 53/117 (45%), Gaps = 13/117 (11%)
Query: 91 NDNIAYLVG--CIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYINHDC 145
ND I VG ++ E+R L +G+ S ++ +N A G A +INH C
Sbjct: 1212 NDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLFRIDENTVIDATKHGGIARFINHSC 1270
Query: 146 RPNCRFVATERDTA---CVKVLRDIEQGEEITCFYGEDFFGDKN----CLCECDTCE 195
PNC + D + + LRDIE+ EE+T Y + D + CLC C+
Sbjct: 1271 TPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGCK 1327
>gi|190573184|ref|YP_001971029.1| hypothetical protein Smlt1159 [Stenotrophomonas maltophilia K279a]
gi|386717496|ref|YP_006183822.1| proteins containing SET domain [Stenotrophomonas maltophilia D457]
gi|190011106|emb|CAQ44715.1| conserved hypothetical protein [Stenotrophomonas maltophilia K279a]
gi|384077058|emb|CCH11644.1| Proteins containing SET domain [Stenotrophomonas maltophilia D457]
Length = 155
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 26/48 (54%), Gaps = 7/48 (14%)
Query: 138 AAYINHDCRPNCRFVATER-------DTACVKVLRDIEQGEEITCFYG 178
A +INH C PNC V E D ++ LRDI+ GEE+T YG
Sbjct: 75 ARWINHSCDPNCEAVIEEDEDGDSRGDKVFIEALRDIKAGEELTYNYG 122
>gi|72393353|ref|XP_847477.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62175123|gb|AAX69271.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70803507|gb|AAZ13411.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 545
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/43 (51%), Positives = 24/43 (55%), Gaps = 1/43 (2%)
Query: 138 AAYINHDCRPNCRFVATERDT-ACVKVLRDIEQGEEITCFYGE 179
AA INH C P+ RFV T A V LRDI GEE+ Y E
Sbjct: 469 AAKINHSCAPSVRFVPTHGGVKAVVVALRDIPSGEEVRTSYIE 511
>gi|326472906|gb|EGD96915.1| histone-lysine N-methyltransferase [Trichophyton tonsurans CBS
112818]
Length = 1330
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 53/117 (45%), Gaps = 13/117 (11%)
Query: 91 NDNIAYLVG--CIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYINHDC 145
ND I VG ++ E+R L +G+ S ++ +N A G A +INH C
Sbjct: 1211 NDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLFRIDENTVIDATKHGGIARFINHSC 1269
Query: 146 RPNCRFVATERDTA---CVKVLRDIEQGEEITCFYGEDFFGDKN----CLCECDTCE 195
PNC + D + + LRDIE+ EE+T Y + D + CLC C+
Sbjct: 1270 TPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGCK 1326
>gi|327304525|ref|XP_003236954.1| histone-lysine N-methyltransferase [Trichophyton rubrum CBS 118892]
gi|326459952|gb|EGD85405.1| histone-lysine N-methyltransferase [Trichophyton rubrum CBS 118892]
Length = 1337
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 53/117 (45%), Gaps = 13/117 (11%)
Query: 91 NDNIAYLVG--CIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYINHDC 145
ND I VG ++ E+R L +G+ S ++ +N A G A +INH C
Sbjct: 1218 NDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLFRIDENTVIDATKHGGIARFINHSC 1276
Query: 146 RPNCRFVATERDTA---CVKVLRDIEQGEEITCFYGEDFFGDKN----CLCECDTCE 195
PNC + D + + LRDIE+ EE+T Y + D + CLC C+
Sbjct: 1277 TPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGCK 1333
>gi|407918366|gb|EKG11637.1| hypothetical protein MPH_11130 [Macrophomina phaseolina MS6]
Length = 980
Score = 40.0 bits (92), Expect = 1.0, Method: Composition-based stats.
Identities = 51/192 (26%), Positives = 87/192 (45%), Gaps = 18/192 (9%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLND---FSVMYSCRKNCAQL 133
+RG + A R + ++ + +G I +T++E R ++ D F +M K
Sbjct: 513 DRGHGVRAMRPFHQDQIVVEYIGEI--ITQQESDRRVNEVYKDHKAFYLMNFYDKLIIDG 570
Query: 134 WLGPAA-YINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFG-DKNC 187
+ G A ++NH C PNCR V E+ A R+I GEE+T Y + +G ++ C
Sbjct: 571 YRGNVARFVNHSCDPNCRMEKWTVNGEQRMALF-ANRNIMTGEELTWHYNFESYGKEQPC 629
Query: 188 LCECDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKR-KLHKKLNRLLLASDKND 246
C C G + + +P+ R + + +R K +K+NRL+ A+ N
Sbjct: 630 YCGAWNC----AGVIGRRNRFKLPKEPAAGNKRKSPDDVEERPKKKQKVNRLVEAA-SNA 684
Query: 247 TNSSDNSSKYSL 258
S + +K SL
Sbjct: 685 GKSLISKTKESL 696
>gi|456735440|gb|EMF60191.1| Hypothetical protein EPM1_2683 [Stenotrophomonas maltophilia EPM1]
Length = 155
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 27/53 (50%), Gaps = 7/53 (13%)
Query: 138 AAYINHDCRPNCRFVATER-------DTACVKVLRDIEQGEEITCFYGEDFFG 183
A +INH C PNC V E D ++ LRDI+ GEE+T YG G
Sbjct: 75 ARWINHSCDPNCEAVIEEDEDGDSRGDKVFIEALRDIKPGEELTYNYGITLAG 127
>gi|443917442|gb|ELU38162.1| histone methyltransferase [Rhizoctonia solani AG-1 IA]
Length = 270
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 46/104 (44%), Gaps = 7/104 (6%)
Query: 87 KWEKNDNIAYLVGCIAELT--KEEEKRLLHTGLNDFSVMYSCRKNC--AQLWLGPAAYIN 142
K KND + VG + T ++ + G+ F M R+ A G + N
Sbjct: 148 KTSKNDFVYEYVGDVVNETVLRKRMREYAEEGIQHFYFMMLQREQYIDATKRGGKGRFAN 207
Query: 143 HDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFG 183
H C PNC ++V +R + RDIE+ EE+T Y D +G
Sbjct: 208 HSCNPNCYVAKWVVGKRIRMGIFAKRDIEENEELTFNYNVDRYG 251
>gi|383421363|gb|AFH33895.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
mulatta]
gi|384949270|gb|AFI38240.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
mulatta]
gi|387540940|gb|AFJ71097.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
mulatta]
Length = 1365
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R K + + VG EL EEE R+ H ND + Y + ++
Sbjct: 1072 GKGWGLVAKRDIRKGEFVNEYVG---ELIDEEECMARIKHAHENDITHFYMLTIDKDRII 1128
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 1129 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 1188
Query: 187 CLCEC 191
+C C
Sbjct: 1189 TVCRC 1193
>gi|355557406|gb|EHH14186.1| Putative histone-lysine N-methyltransferase NSD2 [Macaca mulatta]
Length = 1365
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R K + + VG EL EEE R+ H ND + Y + ++
Sbjct: 1072 GKGWGLVAKRDIRKGEFVNEYVG---ELIDEEECMARIKHAHENDITHFYMLTIDKDRII 1128
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 1129 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 1188
Query: 187 CLCEC 191
+C C
Sbjct: 1189 TVCRC 1193
>gi|315045626|ref|XP_003172188.1| histone-lysine N-methyltransferase [Arthroderma gypseum CBS 118893]
gi|311342574|gb|EFR01777.1| histone-lysine N-methyltransferase [Arthroderma gypseum CBS 118893]
Length = 1334
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 53/117 (45%), Gaps = 13/117 (11%)
Query: 91 NDNIAYLVG--CIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYINHDC 145
ND I VG ++ E+R L +G+ S ++ +N A G A +INH C
Sbjct: 1215 NDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLFRIDENTVIDATKHGGIARFINHSC 1273
Query: 146 RPNCRFVATERDTA---CVKVLRDIEQGEEITCFYGEDFFGDKN----CLCECDTCE 195
PNC + D + + LRDIE+ EE+T Y + D + CLC C+
Sbjct: 1274 TPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGCK 1330
>gi|261330732|emb|CBH13717.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 545
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/43 (51%), Positives = 24/43 (55%), Gaps = 1/43 (2%)
Query: 138 AAYINHDCRPNCRFVATERDT-ACVKVLRDIEQGEEITCFYGE 179
AA INH C P+ RFV T A V LRDI GEE+ Y E
Sbjct: 469 AAKINHSCAPSVRFVPTHGGVKAVVVALRDIPSGEEVRTSYIE 511
>gi|242036801|ref|XP_002465795.1| hypothetical protein SORBIDRAFT_01g045950 [Sorghum bicolor]
gi|241919649|gb|EER92793.1| hypothetical protein SORBIDRAFT_01g045950 [Sorghum bicolor]
Length = 543
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 28/50 (56%), Gaps = 3/50 (6%)
Query: 129 NCA-QLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
NC LW+ P+ +INH C PN R D A V RDI+ GEEIT Y
Sbjct: 318 NCGVGLWVLPS-FINHSCHPNARRTHIG-DHAIVHASRDIKAGEEITFPY 365
>gi|322788177|gb|EFZ13959.1| hypothetical protein SINV_06678 [Solenopsis invicta]
Length = 1093
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 35/77 (45%), Gaps = 7/77 (9%)
Query: 140 YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFY--GEDFFGDKNCLCECDTC 194
++NH C PNC ++ + LRDIE GEE+T Y D K CLC C
Sbjct: 966 FMNHSCAPNCETQKWTVNGDTRIGLFALRDIESGEELTFNYNLASDGETRKACLCGASNC 1025
Query: 195 E--IRMKGAFAQSSVEP 209
I +K Q S+ P
Sbjct: 1026 SGFIGLKAQKQQLSITP 1042
>gi|297672976|ref|XP_002814554.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Pongo abelii]
Length = 1365
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R K + + VG EL EEE R+ H ND + Y + ++
Sbjct: 1072 GKGWGLVAKRDIRKGEFVNEYVG---ELIDEEECMARIKHAHENDITHFYMLTIDKDRII 1128
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 1129 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 1188
Query: 187 CLCEC 191
+C C
Sbjct: 1189 TVCRC 1193
>gi|169849847|ref|XP_001831622.1| hypothetical protein CC1G_05693 [Coprinopsis cinerea okayama7#130]
gi|116507260|gb|EAU90155.1| hypothetical protein CC1G_05693 [Coprinopsis cinerea okayama7#130]
Length = 176
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 39/74 (52%), Gaps = 5/74 (6%)
Query: 140 YINHDCRPNCRFVATERDTAC--VKVLRDIEQGEEITCFY-GEDFFGDKNCLCECD--TC 194
YINH C PN F + RD + ++ L+DI+ GE + FY ++ D+ CEC+ +C
Sbjct: 75 YINHSCDPNVAFDLSSRDPSQWHLRALKDIKAGETLGFFYPSTEWDMDQPFKCECNARSC 134
Query: 195 EIRMKGAFAQSSVE 208
+ GA S E
Sbjct: 135 LGTVAGAKYLSKTE 148
>gi|348683454|gb|EGZ23269.1| hypothetical protein PHYSODRAFT_481357 [Phytophthora sojae]
Length = 169
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 32/131 (24%), Positives = 54/131 (41%), Gaps = 12/131 (9%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLN-------DFSVMYSCRKN 129
R +VA E + + +G + L+ R ++G + ++ C
Sbjct: 36 TRSLGVVAGEDIEAGEVLGQYLGEMEHLSVSRTTRPRNSGYRLVMRQRPERPMLPVCMAI 95
Query: 130 CAQLWLGPAAYINHDCRPNCRFVAT---ERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
A+ G ++ H C+ +FV R T V ++DI +GEE+T YG+D +
Sbjct: 96 SAENMGGLMRFVKHSCQQVAKFVEVANGRRTTVVVASMQDIHRGEEVTVDYGDDMW--FV 153
Query: 187 CLCECDTCEIR 197
C C D C R
Sbjct: 154 CRCGLDVCRHR 164
>gi|355744804|gb|EHH49429.1| Putative histone-lysine N-methyltransferase NSD2 [Macaca
fascicularis]
Length = 1365
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R K + + VG EL EEE R+ H ND + Y + ++
Sbjct: 1072 GKGWGLVAKRDIRKGEFVNEYVG---ELIDEEECMARIKHAHENDITHFYMLTIDKDRII 1128
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 1129 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 1188
Query: 187 CLCEC 191
+C C
Sbjct: 1189 TVCRC 1193
>gi|171060912|ref|YP_001793261.1| nuclear protein SET [Leptothrix cholodnii SP-6]
gi|170778357|gb|ACB36496.1| nuclear protein SET [Leptothrix cholodnii SP-6]
Length = 139
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 2/59 (3%)
Query: 137 PAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFF-GDKNCLCECDTC 194
P + NH C PN + + + +R IE GEEITC YGE G + C C C
Sbjct: 78 PLRFTNHSCSPNA-VLRIRQGRIEIYAMRAIEVGEEITCDYGETHHAGQRPCGCGAPNC 135
>gi|429850088|gb|ELA25393.1| set domain protein [Colletotrichum gloeosporioides Nara gc5]
Length = 590
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 46/98 (46%), Gaps = 15/98 (15%)
Query: 112 LLHTGLNDFSVMYSCRKNCA-QLWLGPAA----YINHDCRPNCRF-----VATERDTACV 161
L+ L D VM S A Q+W G ++NH C+ N +F ++T+R +
Sbjct: 488 LITKDLRDMDVMDSSTGVRAYQIWQGRQGNFTRFVNHSCKSNAQFQQFVWMSTQR---II 544
Query: 162 KVLRDIEQGEEITCFYGEDFFG--DKNCLCECDTCEIR 197
V + IE G+E+T Y ++ DK CLC C R
Sbjct: 545 LVSKGIEAGQEVTVDYSGSYWRGLDKECLCGEACCRYR 582
>gi|426343599|ref|XP_004038381.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Gorilla gorilla gorilla]
Length = 1365
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R K + + VG EL EEE R+ H ND + Y + ++
Sbjct: 1072 GKGWGLVAKRDIRKGEFVNEYVG---ELIDEEECMARIKHAHENDITHFYMLTIDKDRII 1128
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 1129 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 1188
Query: 187 CLCEC 191
+C C
Sbjct: 1189 TVCRC 1193
>gi|328725322|ref|XP_001947915.2| PREDICTED: hypothetical protein LOC100165877 isoform 3
[Acyrthosiphon pisum]
Length = 1568
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 42/141 (29%), Positives = 59/141 (41%), Gaps = 19/141 (13%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAE-------LTKEEEKRLLH-TGLNDFSVMYSCRK 128
N+G I A K I+ VG + + LTK+E K L + LN ++ S K
Sbjct: 515 NKGYGICAVEDIPKGALISEYVGEVIDYNEMCNRLTKKEYKNLNYMVQLNPDEIIDSTSK 574
Query: 129 NCAQLWLGPAAYINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
+INH C PN ++ + R IE+GEEIT Y FGD
Sbjct: 575 GNV------TRFINHSCDPNSVGEKWHVLGQSRIGFFSTRHIEKGEEITFDYSFQIFGDG 628
Query: 186 NCLCECDTCEIRMKGAFAQSS 206
+C C + + R G +SS
Sbjct: 629 AQICYCGSSKCR--GYINKSS 647
>gi|114592860|ref|XP_001146084.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 6
[Pan troglodytes]
gi|114592864|ref|XP_001146248.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 7
[Pan troglodytes]
gi|114592866|ref|XP_001146323.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 8
[Pan troglodytes]
gi|114592870|ref|XP_001146473.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
10 [Pan troglodytes]
gi|397483594|ref|XP_003812984.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Pan
paniscus]
gi|410227780|gb|JAA11109.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
gi|410259494|gb|JAA17713.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
gi|410299310|gb|JAA28255.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
gi|410334709|gb|JAA36301.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
Length = 1365
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R K + + VG EL EEE R+ H ND + Y + ++
Sbjct: 1072 GKGWGLVAKRDIRKGEFVNEYVG---ELIDEEECMARIKHAHENDITHFYMLTIDKDRII 1128
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 1129 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 1188
Query: 187 CLCEC 191
+C C
Sbjct: 1189 TVCRC 1193
>gi|194767856|ref|XP_001966030.1| GF19475 [Drosophila ananassae]
gi|190622915|gb|EDV38439.1| GF19475 [Drosophila ananassae]
Length = 2490
Score = 40.0 bits (92), Expect = 1.1, Method: Composition-based stats.
Identities = 36/126 (28%), Positives = 52/126 (41%), Gaps = 9/126 (7%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ I +G + E + + N M+ ++ A L
Sbjct: 2360 QGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQYESKNRGIYMFRLDEDRVVDATLS 2419
Query: 135 LGPAAYINHDCRPNC--RFVATERDTACVKVL-RDIEQGEEITCFYGEDFFGDKN---CL 188
G A YINH C PNC V +RD + R I +GEE++ Y D D + C
Sbjct: 2420 GGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDDAHKIPCA 2479
Query: 189 CECDTC 194
C C
Sbjct: 2480 CGAPNC 2485
>gi|115400872|ref|XP_001216024.1| hypothetical protein ATEG_07403 [Aspergillus terreus NIH2624]
gi|114189965|gb|EAU31665.1| hypothetical protein ATEG_07403 [Aspergillus terreus NIH2624]
Length = 1230
Score = 40.0 bits (92), Expect = 1.1, Method: Composition-based stats.
Identities = 40/149 (26%), Positives = 62/149 (41%), Gaps = 13/149 (8%)
Query: 59 RIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVG--CIAELTKEEEKRLLHTG 116
R L + K R++ + A ND I VG ++ E++ L +G
Sbjct: 1079 RFNQLKKRKKPVRFARSAIHNWGLYAEENISANDMIIEYVGEKVRQQVADMRERQYLKSG 1138
Query: 117 LNDFSVMYSCRKNC---AQLWLGPAAYINHDCRPNCRFVATERDTA---CVKVLRDIEQG 170
+ S ++ +N A G A +INH C PNC + D + + LRDIE+
Sbjct: 1139 IGS-SYLFRIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERD 1197
Query: 171 EEITCFYGEDFFGDKN----CLCECDTCE 195
EE+T Y + D + CLC C+
Sbjct: 1198 EELTYDYKFEREWDSDDRIPCLCGSTGCK 1226
>gi|19913348|ref|NP_579877.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|19913350|ref|NP_579878.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|19913358|ref|NP_579890.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|109633019|ref|NP_001035889.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|74706096|sp|O96028.1|NSD2_HUMAN RecName: Full=Histone-lysine N-methyltransferase NSD2; AltName:
Full=Multiple myeloma SET domain-containing protein;
Short=MMSET; AltName: Full=Nuclear SET domain-containing
protein 2; Short=NSD2; AltName: Full=Protein trithorax-5;
AltName: Full=Wolf-Hirschhorn syndrome candidate 1
protein; Short=WHSC1
gi|3249713|gb|AAC24150.1| MMSET type II [Homo sapiens]
gi|4378019|gb|AAD19343.1| putative WHSC1 protein [Homo sapiens]
gi|4521954|gb|AAD21770.1| putative WHSC1 protein [Homo sapiens]
gi|4521955|gb|AAD21771.1| putative WHSC1 protein [Homo sapiens]
gi|5123789|emb|CAB45386.1| TRX5 protein [Homo sapiens]
gi|6683809|gb|AAF23370.1| MMSET type II [Homo sapiens]
gi|119602958|gb|EAW82552.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
gi|119602959|gb|EAW82553.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
gi|119602962|gb|EAW82556.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
gi|168273154|dbj|BAG10416.1| histone-lysine N-methyltransferase NSD2 [synthetic construct]
gi|187252511|gb|AAI66668.1| Wolf-Hirschhorn syndrome candidate 1 [synthetic construct]
Length = 1365
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R K + + VG EL EEE R+ H ND + Y + ++
Sbjct: 1072 GKGWGLVAKRDIRKGEFVNEYVG---ELIDEEECMARIKHAHENDITHFYMLTIDKDRII 1128
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 1129 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 1188
Query: 187 CLCEC 191
+C C
Sbjct: 1189 TVCRC 1193
>gi|406863054|gb|EKD16102.1| SET domain-containing protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 1280
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 57/126 (45%), Gaps = 29/126 (23%)
Query: 90 KNDNIAYLVG-----CIAELTKEEEKRLLHTGL--------NDFSVMYSCRKNCAQLWLG 136
+ND I VG +A+L E R L +G+ ++ +V+ + +K G
Sbjct: 1160 QNDMIIEYVGEKVRQTVADL---REHRYLKSGIGSSYLFRIDESTVIDATKKG------G 1210
Query: 137 PAAYINHDCRPNC--RFVATERDTACV-KVLRDIEQGEEITCFYG-EDFFGDKN---CLC 189
A +INH C PNC + + E+ V LRDI Q EE+T Y E G + CLC
Sbjct: 1211 IARFINHSCMPNCTAKIITVEKGKRIVIYALRDIAQNEELTYDYKFEREIGSTDRIPCLC 1270
Query: 190 ECDTCE 195
C+
Sbjct: 1271 GTAACK 1276
>gi|424667456|ref|ZP_18104481.1| hypothetical protein A1OC_01033 [Stenotrophomonas maltophilia
Ab55555]
gi|401069070|gb|EJP77593.1| hypothetical protein A1OC_01033 [Stenotrophomonas maltophilia
Ab55555]
Length = 170
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 26/48 (54%), Gaps = 7/48 (14%)
Query: 138 AAYINHDCRPNCRFVATER-------DTACVKVLRDIEQGEEITCFYG 178
A +INH C PNC V E D ++ LRDI+ GEE+T YG
Sbjct: 90 ARWINHSCDPNCEAVIEEDEDGDSRGDKVFIEALRDIKAGEELTYNYG 137
>gi|325089235|gb|EGC42545.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus H88]
Length = 1267
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 53/117 (45%), Gaps = 13/117 (11%)
Query: 91 NDNIAYLVG--CIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYINHDC 145
ND I VG ++ E+R L +G+ S ++ +N A G A +INH C
Sbjct: 1148 NDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLFRIDENTVIDATKRGGIARFINHSC 1206
Query: 146 RPNCRFVATERDTA---CVKVLRDIEQGEEITCFYGEDFFGDKN----CLCECDTCE 195
PNC + D + + LRDIE+ EE+T Y + D + CLC C+
Sbjct: 1207 TPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGCK 1263
>gi|26346432|dbj|BAC36867.1| unnamed protein product [Mus musculus]
Length = 101
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 35/71 (49%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATERD-TACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ER + R I++GEE+ Y DF D++
Sbjct: 27 ATLTGGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEELCYDYKFDFEDDQHK 86
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 87 IPCHCGAVNCR 97
>gi|254523335|ref|ZP_05135390.1| nuclear protein SET [Stenotrophomonas sp. SKA14]
gi|344206415|ref|YP_004791556.1| nuclear protein SET [Stenotrophomonas maltophilia JV3]
gi|219720926|gb|EED39451.1| nuclear protein SET [Stenotrophomonas sp. SKA14]
gi|343777777|gb|AEM50330.1| nuclear protein SET [Stenotrophomonas maltophilia JV3]
Length = 170
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 26/48 (54%), Gaps = 7/48 (14%)
Query: 138 AAYINHDCRPNCRFVATER-------DTACVKVLRDIEQGEEITCFYG 178
A +INH C PNC V E D ++ LRDI+ GEE+T YG
Sbjct: 90 ARWINHSCDPNCEAVIEEDEDGDSRGDKVFIEALRDIKAGEELTYNYG 137
>gi|388852139|emb|CCF54145.1| uncharacterized protein [Ustilago hordei]
Length = 1338
Score = 40.0 bits (92), Expect = 1.2, Method: Composition-based stats.
Identities = 26/78 (33%), Positives = 32/78 (41%), Gaps = 23/78 (29%)
Query: 140 YINHDCRPNCR---FVATERDTAC-------VKVLRDIEQGEEITCFYGEDFFGDKN--- 186
+ NH C PNCR F+ D A + LRDIE GEE+T YG F ++
Sbjct: 795 FANHSCEPNCRIEKFIICGTDEALSAEFQIGLFALRDIEAGEELTYNYGWSAFQPRDITG 854
Query: 187 ----------CLCECDTC 194
CLC C
Sbjct: 855 APTEEVPPEQCLCGAANC 872
>gi|390360513|ref|XP_785219.3| PREDICTED: histone-lysine N-methyltransferase NSD3-like
[Strongylocentrotus purpuratus]
Length = 1736
Score = 40.0 bits (92), Expect = 1.2, Method: Composition-based stats.
Identities = 35/127 (27%), Positives = 55/127 (43%), Gaps = 17/127 (13%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE-----KRLLHTGLNDFSVMYSCRKNCA 131
+RG +VA +K D + VG EL EEE K+ + DF + +
Sbjct: 1242 HRGWGLVAMVDIKKGDFVNEYVG---ELVDEEECRRRIKQAHEENITDFYFLTLDKDRII 1298
Query: 132 QLWLGP----AAYINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD 184
GP + ++NH C+PNC ++ + +R+I G EI+ Y D G+
Sbjct: 1299 DA--GPKGNLSRFMNHSCQPNCETQKWTVNGDTRVGLFAIRNIAAGNEISFNYNLDCLGN 1356
Query: 185 KNCLCEC 191
+ CEC
Sbjct: 1357 EKKRCEC 1363
>gi|225554361|gb|EEH02660.1| histone-lysine N-methyltransferase [Ajellomyces capsulatus G186AR]
Length = 1267
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 53/117 (45%), Gaps = 13/117 (11%)
Query: 91 NDNIAYLVG--CIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYINHDC 145
ND I VG ++ E+R L +G+ S ++ +N A G A +INH C
Sbjct: 1148 NDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLFRIDENTVIDATKRGGIARFINHSC 1206
Query: 146 RPNCRFVATERDTA---CVKVLRDIEQGEEITCFYGEDFFGDKN----CLCECDTCE 195
PNC + D + + LRDIE+ EE+T Y + D + CLC C+
Sbjct: 1207 TPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGCK 1263
>gi|425778490|gb|EKV16615.1| Histone-lysine N-methyltransferase (Ash1), putative [Penicillium
digitatum PHI26]
gi|425784214|gb|EKV22005.1| Histone-lysine N-methyltransferase (Ash1), putative [Penicillium
digitatum Pd1]
Length = 770
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 40/130 (30%), Positives = 59/130 (45%), Gaps = 11/130 (8%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEE-EKRLLHTGLND--FSVMYSCRKNCAQL 133
+RG + + R ++ N I G I LT+ E EKR+ N+ + +MY +
Sbjct: 450 DRGYGVRSNRSFDPNQIIVEYTGEI--LTQLECEKRMRTVYKNNECYYLMYFDQNMIIDA 507
Query: 134 WLGP-AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNC- 187
G A ++NH C PNCR VA + A R I GEE++ Y D + +KN
Sbjct: 508 TRGSIARFVNHACEPNCRMEKWTVAGKPRMALFAGDRGISTGEELSYDYNFDPYSNKNVQ 567
Query: 188 LCECDTCEIR 197
C C + R
Sbjct: 568 QCRCGSANCR 577
>gi|242016925|ref|XP_002428945.1| hypothetical protein Phum_PHUM411800 [Pediculus humanus corporis]
gi|212513774|gb|EEB16207.1| hypothetical protein Phum_PHUM411800 [Pediculus humanus corporis]
Length = 6073
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 39/128 (30%), Positives = 57/128 (44%), Gaps = 13/128 (10%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIA--ELTKEEEKRLLHTGLNDFSVMYSC---RKNCAQ 132
+G + A R E++ + +G I EL++ EK+ + N M+ R A
Sbjct: 5943 QGLGLYAARDLERHTMVIEYIGEIIRHELSESREKQ--YEARNRGIYMFRLDEERVIDAT 6000
Query: 133 LWLGPAAYINHDCRPNC--RFVATERDTACVK-VLRDIEQGEEITCFYGEDFFGDKN--- 186
+ G A YINH C PNC V +RD + R I +GEE+ Y D D++
Sbjct: 6001 ICGGLARYINHSCNPNCVAEIVEVDRDLRIIIFAKRRITRGEELAYDYKFDIEDDQHKIP 6060
Query: 187 CLCECDTC 194
CLC C
Sbjct: 6061 CLCGAPNC 6068
>gi|154311696|ref|XP_001555177.1| hypothetical protein BC1G_06307 [Botryotinia fuckeliana B05.10]
Length = 451
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 54/120 (45%), Gaps = 19/120 (15%)
Query: 91 NDNIAYLVG-----CIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYIN 142
ND I VG +A+L E R L +G+ S ++ +N A G A +IN
Sbjct: 332 NDMIIEYVGEKVRQQVADL---RENRYLKSGIGS-SYLFRIDENTVIDATKKGGIARFIN 387
Query: 143 HDCRPNC--RFVATERDTACV-KVLRDIEQGEEITCFYG-EDFFGDKN---CLCECDTCE 195
H C PNC + + E+ V LRDI Q EE+T Y E G + CLC C+
Sbjct: 388 HSCMPNCTAKIITVEKSKRIVIYALRDIAQNEELTYDYKFEREIGSTDRIPCLCGTPACK 447
>gi|428186593|gb|EKX55443.1| hypothetical protein GUITHDRAFT_62776, partial [Guillardia theta
CCMP2712]
Length = 84
Score = 40.0 bits (92), Expect = 1.2, Method: Composition-based stats.
Identities = 22/62 (35%), Positives = 29/62 (46%), Gaps = 7/62 (11%)
Query: 140 YINHDCRPNCRFVATERDTACVKVL----RDIEQGEEITCFYGED---FFGDKNCLCECD 192
++NH C PN D A ++L RDI GEE+T YGE G C+C +
Sbjct: 6 FVNHSCDPNMEAKPVRIDAALPRILFCASRDILAGEELTISYGEGSDAHGGGSKCVCGAE 65
Query: 193 TC 194
C
Sbjct: 66 AC 67
>gi|406706812|ref|YP_006757165.1| SET domain-containing protein [alpha proteobacterium HIMB5]
gi|406652588|gb|AFS47988.1| SET domain protein [alpha proteobacterium HIMB5]
Length = 165
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 31/62 (50%), Gaps = 8/62 (12%)
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN-----CLCECD 192
A INH C NC + VK +RDI++GEE TC YG F D+N C C+
Sbjct: 75 AGLINHSCDNNCDYDGKGL-KIWVKAIRDIKKGEEFTCDYGFGF--DENYKQFPCKCKSK 131
Query: 193 TC 194
C
Sbjct: 132 NC 133
>gi|444722051|gb|ELW62755.1| putative histone-lysine N-methyltransferase NSD2 [Tupaia chinensis]
Length = 1421
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R K + + VG EL EEE R+ + ND + Y + ++
Sbjct: 915 GKGWGLVAKRDIRKGEFVNEYVG---ELIDEEECMARIKYAHENDITHFYMLTIDKDRII 971
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + L DI G E+T Y D G++
Sbjct: 972 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFALCDIPAGTELTFNYNLDCLGNEK 1031
Query: 187 CLCEC 191
+C C
Sbjct: 1032 TVCRC 1036
>gi|379730569|ref|YP_005322765.1| nuclear protein SET [Saprospira grandis str. Lewin]
gi|378576180|gb|AFC25181.1| nuclear protein SET [Saprospira grandis str. Lewin]
Length = 128
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/107 (25%), Positives = 50/107 (46%), Gaps = 5/107 (4%)
Query: 72 YSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCA 131
Y + +G R + T + D + + + L+ E+ +++ T L+D+ ++ A
Sbjct: 8 YFKDSPKGGRGIFTADYIPKDTLIEICPVLV-LSNEDREKIHQTFLHDYYFLWDKEGKQA 66
Query: 132 QLWLGPAAYINHDCRPNCRFVATERDTACVKVL--RDIEQGEEITCF 176
+ LG + NH +PN + +D + V DIE G+EI CF
Sbjct: 67 AIALGYGSLYNHSYQPNA-YYQMNKDGQSIDVYAGEDIEPGQEI-CF 111
>gi|169624389|ref|XP_001805600.1| hypothetical protein SNOG_15453 [Phaeosphaeria nodorum SN15]
gi|160705160|gb|EAT77118.2| hypothetical protein SNOG_15453 [Phaeosphaeria nodorum SN15]
Length = 408
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 35/67 (52%), Gaps = 7/67 (10%)
Query: 140 YINHDCRPNCRFV----ATERDTACVKVLRDIEQGEEITCFYGEDFF-GDKN-CLC-ECD 192
Y+NH C PNC F V R I +GEE+T YG ++F G K+ CLC C
Sbjct: 225 YLNHSCEPNCDFYEGRCGEHYRLVWVSTNRAISKGEELTVNYGPEWFKGPKDRCLCAACS 284
Query: 193 TCEIRMK 199
T +I+ +
Sbjct: 285 TSKIKTE 291
>gi|296413387|ref|XP_002836395.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295630213|emb|CAZ80586.1| unnamed protein product [Tuber melanosporum]
Length = 901
Score = 40.0 bits (92), Expect = 1.2, Method: Composition-based stats.
Identities = 41/150 (27%), Positives = 61/150 (40%), Gaps = 17/150 (11%)
Query: 76 GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLL---HTGLNDFSVMYSCRKNCAQ 132
N G + ATR + N I G I +T+EE +R + + ++ +M +
Sbjct: 456 ANCGHGLRATRGFMPNQIIVEYTGEI--ITQEESERRMVEVYKDNKNYYLMLFHQNMILD 513
Query: 133 LWLGPAA-YINHDCRPNCR----FVATERDTACVKVLRDIEQGEEITCFYGEDFF---GD 184
G A ++NH C PNCR V A IE GEE+T Y ++F
Sbjct: 514 ATRGSVARFVNHSCDPNCRMEKWLVEGRPRMALFAGDDGIEAGEELTYDYNFNWFTGVSQ 573
Query: 185 KNCLCECDTCEIRMKGAFAQSSVEPQTTQP 214
+ C C D C +GA + + Q P
Sbjct: 574 QTCHCGADNC----RGALGKKADGFQRASP 599
>gi|195392728|ref|XP_002055009.1| GJ19139 [Drosophila virilis]
gi|194149519|gb|EDW65210.1| GJ19139 [Drosophila virilis]
Length = 2531
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 51/126 (40%), Gaps = 9/126 (7%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSC---RKNCAQLW 134
+G + A R EK+ I +G + E + + N M+ R A L
Sbjct: 2401 QGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQYESKNRGIYMFRLDEDRVVDATLS 2460
Query: 135 LGPAAYINHDCRPNC--RFVATERDTACVK-VLRDIEQGEEITCFYGEDFFGDKN---CL 188
G A YINH C PNC V +RD + R I +GEE++ Y D D + C
Sbjct: 2461 GGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDDAHKIPCA 2520
Query: 189 CECDTC 194
C C
Sbjct: 2521 CGAPNC 2526
>gi|212276262|ref|NP_001130593.1| uncharacterized protein LOC100191692 [Zea mays]
gi|194689580|gb|ACF78874.1| unknown [Zea mays]
gi|414865032|tpg|DAA43589.1| TPA: hypothetical protein ZEAMMB73_306282 [Zea mays]
Length = 544
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 28/50 (56%), Gaps = 3/50 (6%)
Query: 129 NCA-QLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
NC LW+ P+ +INH C PN R D A V RDI+ GEEIT Y
Sbjct: 319 NCGVGLWVLPS-FINHSCHPNARRTHIG-DHAIVHASRDIKAGEEITFPY 366
>gi|308812131|ref|XP_003083373.1| Predicted histone tail methylase containing SET domain (ISS)
[Ostreococcus tauri]
gi|116055253|emb|CAL57649.1| Predicted histone tail methylase containing SET domain (ISS)
[Ostreococcus tauri]
Length = 190
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 24/45 (53%)
Query: 133 LWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
+ LG A+ INH C P+C A V RDI +GEEIT Y
Sbjct: 97 VLLGSASNINHSCEPSCEVAFIHDARAHVIATRDITKGEEITISY 141
>gi|319786707|ref|YP_004146182.1| nuclear protein SET [Pseudoxanthomonas suwonensis 11-1]
gi|317465219|gb|ADV26951.1| nuclear protein SET [Pseudoxanthomonas suwonensis 11-1]
Length = 155
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 26/48 (54%), Gaps = 7/48 (14%)
Query: 138 AAYINHDCRPNCRFVATE-------RDTACVKVLRDIEQGEEITCFYG 178
A +INH C PNC V E +D ++ +RDI GEE+T YG
Sbjct: 75 ARWINHSCDPNCEAVVVEADGDDRSKDRIFIEAIRDIRPGEELTYNYG 122
>gi|340975580|gb|EGS22695.1| histone-lysine N-methyltransferase-like protein [Chaetomium
thermophilum var. thermophilum DSM 1495]
Length = 964
Score = 39.7 bits (91), Expect = 1.3, Method: Composition-based stats.
Identities = 25/64 (39%), Positives = 29/64 (45%), Gaps = 7/64 (10%)
Query: 138 AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN---CLCE 190
A ++NH C PNCR V + A R I GEE+T Y D F KN CLC
Sbjct: 578 ARFVNHSCSPNCRMIKWIVKGQPRMALFAGDRPITTGEELTYDYNFDPFCAKNVQKCLCG 637
Query: 191 CDTC 194
C
Sbjct: 638 SANC 641
>gi|400593481|gb|EJP61426.1| SET domain-containing protein 5 [Beauveria bassiana ARSEF 2860]
Length = 633
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 25/41 (60%), Gaps = 1/41 (2%)
Query: 138 AAYINHDCRPNCRFVATER-DTACVKVLRDIEQGEEITCFY 177
A+ INH CR N + ER + + V RDIE+GEEIT Y
Sbjct: 440 ASRINHSCRHNAQNTWNERLNKITIHVFRDIEEGEEITISY 480
>gi|383316103|ref|YP_005376945.1| SET domain-containing protein [Frateuria aurantia DSM 6220]
gi|379043207|gb|AFC85263.1| SET domain-containing protein [Frateuria aurantia DSM 6220]
Length = 180
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 26/48 (54%), Gaps = 7/48 (14%)
Query: 138 AAYINHDCRPNCRFVA-------TERDTACVKVLRDIEQGEEITCFYG 178
A +INH C PNC+ V RD ++ +RDI GEEIT YG
Sbjct: 87 ARWINHSCMPNCQAVIEISAGGDPRRDRILIEAVRDIAAGEEITYDYG 134
>gi|291223879|ref|XP_002731935.1| PREDICTED: Wolf-Hirschhorn syndrome candidate 1 protein-like
[Saccoglossus kowalevskii]
Length = 1787
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 70/149 (46%), Gaps = 15/149 (10%)
Query: 52 QHHRYSYRIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIA--ELTKEEE 109
Q+ R+ ++QY PE P +R +G RG + T+ +K D + VG + E +E
Sbjct: 1313 QNQRFQ-KMQY-PETIP-FRTDEKG-RGWGLKTTQDIKKGDFVHEYVGELVDEETCRERI 1368
Query: 110 KRLLHTGLNDFSVMYSCRKNCAQLWLGP----AAYINHDCRPNC---RFVATERDTACVK 162
K+ +++F ++ + + + GP A ++NH C PNC ++ +
Sbjct: 1369 KKCQQLDIDNFYMLTIDKDHV--IDAGPKGNLARFMNHSCDPNCETMKWTILPDTRVGLF 1426
Query: 163 VLRDIEQGEEITCFYGEDFFGDKNCLCEC 191
RDI G E+T Y D G++ CEC
Sbjct: 1427 AKRDITAGSELTFNYNLDCLGNEKKKCEC 1455
>gi|195132807|ref|XP_002010834.1| GI21761 [Drosophila mojavensis]
gi|193907622|gb|EDW06489.1| GI21761 [Drosophila mojavensis]
Length = 2748
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 51/126 (40%), Gaps = 9/126 (7%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSC---RKNCAQLW 134
+G + A R EK+ I +G + E + + N M+ R A L
Sbjct: 2618 QGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQYESKNRGIYMFRLDEDRVVDATLS 2677
Query: 135 LGPAAYINHDCRPNC--RFVATERDTACVK-VLRDIEQGEEITCFYGEDFFGDKN---CL 188
G A YINH C PNC V +RD + R I +GEE++ Y D D + C
Sbjct: 2678 GGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDDAHKIPCA 2737
Query: 189 CECDTC 194
C C
Sbjct: 2738 CGAPNC 2743
>gi|432097048|gb|ELK27546.1| Histone-lysine N-methyltransferase MLL3 [Myotis davidii]
Length = 4785
Score = 39.7 bits (91), Expect = 1.3, Method: Composition-based stats.
Identities = 32/101 (31%), Positives = 43/101 (42%), Gaps = 6/101 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ + +G I K L+ N M+ + A L
Sbjct: 4631 QGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLT 4690
Query: 135 LGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEE 172
GPA YINH C PNC V ER + R I++GEE
Sbjct: 4691 GGPARYINHSCAPNCVAEVVTFERGHKIIISSNRRIQKGEE 4731
>gi|168702682|ref|ZP_02734959.1| hypothetical protein GobsU_24356 [Gemmata obscuriglobus UQM 2246]
Length = 129
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 27/54 (50%), Gaps = 1/54 (1%)
Query: 128 KNCAQLWLGPAAYINHDCRPNCRFVATE-RDTACVKVLRDIEQGEEITCFYGED 180
+N + LG + NH PN RF E R ++ +RDI GE+I YG D
Sbjct: 63 ENGRAIALGYGSLYNHSASPNARFTPRESRTDIVIRAMRDITAGEQIMIDYGWD 116
>gi|452982319|gb|EME82078.1| hypothetical protein MYCFIDRAFT_77623 [Pseudocercospora fijiensis
CIRAD86]
Length = 709
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 26/49 (53%), Gaps = 2/49 (4%)
Query: 133 LWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDF 181
LW+ +YINH C PN + D VK R+I QGEEIT Y D
Sbjct: 476 LWIK-TSYINHSCIPNAE-PSVIGDLMVVKATRNITQGEEITISYFNDI 522
>gi|380487861|emb|CCF37765.1| SET domain-containing protein [Colletotrichum higginsianum]
Length = 665
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 32/137 (23%), Positives = 58/137 (42%), Gaps = 17/137 (12%)
Query: 68 PCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCR 127
P + R ++A +E+ I VG + + + + TG+ + + +
Sbjct: 535 PVQVFRTPDGRNNGLMALCPFERGAAIGEFVGLVTKDLQNMDVMDSSTGVRAYQIWQGRQ 594
Query: 128 KNCAQLWLGPAAYINHDCRPNCRF-----VATERDTACVKVLRDIEQGEEITCFYGEDFF 182
N + ++NH C+ N +F ++T+R V V + IE G E+T Y ++
Sbjct: 595 GNFTR-------FVNHSCKANAQFQQFVWMSTQR---IVLVSKGIEAGREVTVDYSGSYW 644
Query: 183 G--DKNCLCECDTCEIR 197
DK+CLC C R
Sbjct: 645 RGLDKDCLCGESCCRYR 661
>gi|403333784|gb|EJY66014.1| hypothetical protein OXYTRI_13823 [Oxytricha trifallax]
Length = 1936
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 32/64 (50%), Gaps = 9/64 (14%)
Query: 138 AAYINHDCRPNCRFVATER----DTACVKV--LRDIEQGEEITCFYGEDFFGDKNCLCEC 191
A +INH C PNC T++ CV + LRDI + EE++ Y DFF C C
Sbjct: 232 ARFINHSCEPNC---ETQKWNVLGEVCVGIFSLRDIHENEELSFDYQFDFFKTPFTKCYC 288
Query: 192 DTCE 195
T +
Sbjct: 289 GTSK 292
>gi|367037743|ref|XP_003649252.1| lysine methyltransferase enzyme-like protein [Thielavia terrestris
NRRL 8126]
gi|346996513|gb|AEO62916.1| lysine methyltransferase enzyme-like protein [Thielavia terrestris
NRRL 8126]
Length = 1286
Score = 39.7 bits (91), Expect = 1.4, Method: Composition-based stats.
Identities = 38/123 (30%), Positives = 55/123 (44%), Gaps = 23/123 (18%)
Query: 90 KNDNIAYLVG--CIAELTKEEEKRLLHTGL--------NDFSVMYSCRKNCAQLWLGPAA 139
K+D I VG ++ + E R L +G+ +D +V+ + +K G A
Sbjct: 1166 KDDMIIEYVGEEVRQQIAEIREHRYLKSGIGSSYLFRIDDNTVIDATKKG------GIAR 1219
Query: 140 YINHDCRPNC--RFVATERDTACV-KVLRDIEQGEEITCFYG-EDFFGDKN---CLCECD 192
+INH C PNC + + E V LRDI Q EE+T Y E G + CLC
Sbjct: 1220 FINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERELGSTDRIPCLCGTA 1279
Query: 193 TCE 195
C+
Sbjct: 1280 ACK 1282
>gi|386331753|ref|YP_006027922.1| set domain protein [Ralstonia solanacearum Po82]
gi|334194201|gb|AEG67386.1| set domain protein [Ralstonia solanacearum Po82]
Length = 188
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 27/70 (38%), Positives = 33/70 (47%), Gaps = 11/70 (15%)
Query: 138 AAYINHDCRPNCRFVATERDTAC-VKVLRDIEQGEEITCFYGEDFFGDKN--------CL 188
A +INH C+PNC A E+D + LRDIE GEE+ YG G + C
Sbjct: 111 ARWINHACKPNCE--AREKDGRVFIHALRDIEAGEELFYDYGLVIEGRQTKALKAQFACH 168
Query: 189 CECDTCEIRM 198
C TC M
Sbjct: 169 CGAKTCRGTM 178
>gi|317033153|ref|XP_001394944.2| histone-lysine N-methyltransferase (Ash1) [Aspergillus niger CBS
513.88]
Length = 825
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 54/128 (42%), Gaps = 9/128 (7%)
Query: 76 GNRGARIVATRKWEKNDNIAYLVGCIAELTK-EEEKRLLHTGLNDFSVMYSCRKNCAQLW 134
+RG + + R +E N I G I T+ E+ R ++ + +MY +
Sbjct: 459 ADRGYGVRSNRTFEPNQIIVEYTGEIITQTECEKRMRTIYKHNECYYLMYFDQNMIIDAT 518
Query: 135 LGP-AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN--- 186
G A ++NH C PNCR VA + A R I GEE+T Y D + KN
Sbjct: 519 RGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDRGIMTGEELTYDYNFDPYSQKNVQQ 578
Query: 187 CLCECDTC 194
C C C
Sbjct: 579 CRCGSSNC 586
>gi|149239909|ref|XP_001525830.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146449953|gb|EDK44209.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 822
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 30/57 (52%), Gaps = 3/57 (5%)
Query: 138 AAYINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCEC 191
A ++NH C PN ++V +R + RDI GEEIT Y D +G ++ C C
Sbjct: 184 ARFVNHSCSPNAFVDKWVVADRLRMGIFAKRDIMAGEEITFDYNVDRYGAQSQPCYC 240
>gi|440801440|gb|ELR22460.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 545
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%)
Query: 141 INHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
+NH C PNC+ E A V LRDI GEE++ Y
Sbjct: 416 LNHSCEPNCQVAYLEDGQALVFALRDIAAGEELSISY 452
>gi|374373101|ref|ZP_09630761.1| nuclear protein SET [Niabella soli DSM 19437]
gi|373234074|gb|EHP53867.1| nuclear protein SET [Niabella soli DSM 19437]
Length = 127
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 40/88 (45%), Gaps = 1/88 (1%)
Query: 104 LTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRF-VATERDTACVK 162
++ E K L T L+D+ + N + LG NH NC + + E ++ +K
Sbjct: 38 MSARERKLLDQTLLHDYIFEWGFDGNRCAMALGWVPVYNHSFLSNCEYEMDYETNSIQIK 97
Query: 163 VLRDIEQGEEITCFYGEDFFGDKNCLCE 190
+RDI GEE+ YG + DK E
Sbjct: 98 TVRDIAAGEELFINYGGSWDSDKQVWFE 125
>gi|340057686|emb|CCC52033.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 981
Score = 39.7 bits (91), Expect = 1.5, Method: Composition-based stats.
Identities = 29/78 (37%), Positives = 35/78 (44%), Gaps = 19/78 (24%)
Query: 119 DFSVMYSCRKNCAQLWLGP-AAYINHDCRPNCRFVATE--------RD--------TACV 161
+F + C Q L P A+ INH C PN TE RD + CV
Sbjct: 532 EFKIGQGLGMGCRQRALCPTASLINHSCTPNALMFVTEAGEFAKPIRDNGHGSCTGSMCV 591
Query: 162 KV--LRDIEQGEEITCFY 177
V LRD+E+GEEIT Y
Sbjct: 592 SVVLLRDVEEGEEITVSY 609
>gi|452979022|gb|EME78785.1| hypothetical protein MYCFIDRAFT_178889 [Pseudocercospora fijiensis
CIRAD86]
Length = 511
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 39/144 (27%), Positives = 57/144 (39%), Gaps = 17/144 (11%)
Query: 52 QHHRYSYRIQYLPELKPCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE-K 110
+ R + +Y P+++P G+ G TR D YL + L ++
Sbjct: 267 EEFRRRHFFKYKPQIQP------SGSCGYGAFNTRSPIPADT--YLSEYLGNLIPSDDAP 318
Query: 111 RLLHTGLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVAT---ERDTACVKVLRDI 167
R L+ L D ++S A W +IN C PN A +R K LR I
Sbjct: 319 RSLY--LLDIKGLFSIDAAKAGNW---TRFINSSCEPNVAAEAAMLGKRHVLVFKTLRRI 373
Query: 168 EQGEEITCFYGEDFFGDKNCLCEC 191
GEE+T +YG +F C C
Sbjct: 374 LPGEELTFYYGRKYFQQAGFKCSC 397
>gi|198418893|ref|XP_002124393.1| PREDICTED: similar to SET domain containing 2 [Ciona intestinalis]
Length = 2228
Score = 39.7 bits (91), Expect = 1.5, Method: Composition-based stats.
Identities = 21/61 (34%), Positives = 29/61 (47%), Gaps = 3/61 (4%)
Query: 140 YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEI 196
+INH C PNC ++ R +RDI +GEEIT Y +G + C C +
Sbjct: 1139 FINHSCDPNCETQKWTVNGRLRVGFFTMRDINKGEEITFDYQFQRYGKEAQACYCGSSNC 1198
Query: 197 R 197
R
Sbjct: 1199 R 1199
>gi|352094405|ref|ZP_08955576.1| nuclear protein SET [Synechococcus sp. WH 8016]
gi|351680745|gb|EHA63877.1| nuclear protein SET [Synechococcus sp. WH 8016]
Length = 123
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 31/70 (44%), Gaps = 6/70 (8%)
Query: 136 GPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGE-----DFFGDKNCLCE 190
G + +NH C P+C R IE+GEEIT Y D+F D CLC
Sbjct: 29 GLISKVNHSCNPSCGISLNASGAHDFIAFRVIEKGEEITFDYAMRNYSIDYFPDP-CLCG 87
Query: 191 CDTCEIRMKG 200
TC + + G
Sbjct: 88 EVTCRVHVNG 97
>gi|313212115|emb|CBY16137.1| unnamed protein product [Oikopleura dioica]
Length = 149
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 49/112 (43%), Gaps = 23/112 (20%)
Query: 90 KNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAA-YINHDCRPN 148
+ND + Y++ +AE+ EK ++H LG AA + NH C PN
Sbjct: 44 ENDKMNYIL-SLAEVFGNGEKEIVHIDAGK---------------LGNAARFANHSCSPN 87
Query: 149 CRF--VATERDTACVKVL--RDIEQGEEITCFYG--EDFFGDKNCLCECDTC 194
+ V E D A + + R IE GEEIT YG E ++ C C C
Sbjct: 88 SKLYPVRVENDIARIAIFAERFIEPGEEITYDYGSAESTLSERKCQCGSRCC 139
>gi|17552316|ref|NP_498041.1| Protein SET-2, isoform b [Caenorhabditis elegans]
gi|351058301|emb|CCD65735.1| Protein SET-2, isoform b [Caenorhabditis elegans]
Length = 739
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 33/61 (54%), Gaps = 4/61 (6%)
Query: 138 AAYINHDCRPNC--RFVATERDTACVKVLRD-IEQGEEITCFYGEDFFGDK-NCLCECDT 193
A +INH C+PNC + + E + V R I++GEEIT Y DK +CLC T
Sbjct: 674 ARFINHSCQPNCYAKVLTIEGEKRIVIYSRTIIKKGEEITYDYKFPIEDDKIDCLCGAKT 733
Query: 194 C 194
C
Sbjct: 734 C 734
>gi|430813766|emb|CCJ28924.1| unnamed protein product [Pneumocystis jirovecii]
Length = 809
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 42/83 (50%), Gaps = 6/83 (7%)
Query: 136 GPAAYINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD--KNCLCE 190
G A ++NH C PNC ++V + + R+I +GEE+T Y D +G+ + C CE
Sbjct: 279 GLARFLNHSCSPNCYVDKWVVGTKLRMGIFCKRNILKGEELTFDYNVDRYGNTAQPCYCE 338
Query: 191 CDTCEIRMKGAFAQSSVEPQTTQ 213
C I G Q+ ++ + Q
Sbjct: 339 EPGC-IGFIGGKTQTELDSKIPQ 360
>gi|367048473|ref|XP_003654616.1| hypothetical protein THITE_51688, partial [Thielavia terrestris
NRRL 8126]
gi|347001879|gb|AEO68280.1| hypothetical protein THITE_51688, partial [Thielavia terrestris
NRRL 8126]
Length = 122
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 22/76 (28%), Positives = 35/76 (46%), Gaps = 15/76 (19%)
Query: 140 YINHDCRPNCRFVATERDTACVKVLR-----DIEQGEEITCFYGEDFFGDKNCLCECDTC 194
++N CRPN AT V+++ DI GE++ YG D+F ++ +C C+
Sbjct: 57 FVNSSCRPN--VAATPEQIGKVRIVAFRAETDIAAGEQVLINYGRDYFAERGIMCCCEVS 114
Query: 195 EIRMKGAFAQSSVEPQ 210
AQ + PQ
Sbjct: 115 --------AQPHLPPQ 122
>gi|334322119|ref|XP_003340188.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Monodelphis domestica]
Length = 441
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 36/71 (50%), Gaps = 11/71 (15%)
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDF---------FGDKNCL- 188
+ +NH C PNC + E + ++ +RDI+QGEE+T Y + ++ C
Sbjct: 215 SLLNHSCDPNC-VIVFEGPSLFLRAVRDIQQGEELTICYLDVLMPSAERQKQLKEQYCFD 273
Query: 189 CECDTCEIRMK 199
C+C CE R K
Sbjct: 274 CDCPGCETRSK 284
>gi|410958014|ref|XP_003985618.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Felis
catus]
Length = 1300
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R K + + VG EL EEE R+ H ND + Y + ++
Sbjct: 1007 GKGWGLVAKRDIRKGEFVNEYVG---ELIDEEECMARIKHAHENDITHFYMLTIDKDRII 1063
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 1064 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 1123
Query: 187 CLCEC 191
+C C
Sbjct: 1124 TVCRC 1128
>gi|68071179|ref|XP_677503.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56497646|emb|CAH98690.1| conserved hypothetical protein [Plasmodium berghei]
Length = 1325
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 40/75 (53%), Gaps = 7/75 (9%)
Query: 131 AQLWLGPAAYINHDCRPN--CRFVATERDTACVKVL--RDIEQGEEITCFY--GEDFFGD 184
A W + +INH C PN C+ V+ +++ + + +DI EEIT Y G + G+
Sbjct: 1250 ATKWGNASRFINHSCEPNCFCKIVSCDQNLKHIVIFAKKDILPHEEITYDYQFGVESEGE 1309
Query: 185 K-NCLCECDTCEIRM 198
K CLC +TC RM
Sbjct: 1310 KLICLCGSNTCLGRM 1324
>gi|357617692|gb|EHJ70932.1| hypothetical protein KGM_14792 [Danaus plexippus]
Length = 147
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 55/126 (43%), Gaps = 9/126 (7%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCI--AELTKEEEKRLLHTGLNDFSVMYSCRKNC-AQLW 134
+G + A R EK+ + +G I +EL++ EK+ + R+ A L
Sbjct: 17 QGLGLYAARDLEKHTMVIEYIGEIIRSELSEIREKKYEAKNRGIYMFRLDERRVVDATLC 76
Query: 135 LGPAAYINHDCRPNCRFVATERDTACVKVL---RDIEQGEEITCFYGEDFFGDKN---CL 188
G A YINH C+PNC E D ++ R I +GEE+ Y D D + C+
Sbjct: 77 GGLARYINHSCQPNCVAETVEVDRHLRIIIFAKRRIARGEELAYDYKFDIEDDAHKIMCM 136
Query: 189 CECDTC 194
C C
Sbjct: 137 CGAPNC 142
>gi|186683761|ref|YP_001866957.1| nuclear protein SET [Nostoc punctiforme PCC 73102]
gi|186466213|gb|ACC82014.1| nuclear protein SET [Nostoc punctiforme PCC 73102]
Length = 157
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 55/129 (42%), Gaps = 16/129 (12%)
Query: 75 EGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLW 134
E +G + AT+K+ K + + +VG E E +R +++ DF++ + +
Sbjct: 20 ETAKGRGVFATKKFAKGETV--VVGIPIE---EVPQRTIYSFQMDFNLYVNLDE------ 68
Query: 135 LGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGE---DFFGDKNCLCEC 191
PA INH C PN + L DIE GEEIT Y + CLC+
Sbjct: 69 --PAVVINHSCDPNTGVSNNQFGGYDFVALGDIEVGEEITWDYETTEYESIAVSRCLCKS 126
Query: 192 DTCEIRMKG 200
C + G
Sbjct: 127 LFCRGKTLG 135
>gi|320593249|gb|EFX05658.1| set domain containing protein [Grosmannia clavigera kw1407]
Length = 1450
Score = 39.7 bits (91), Expect = 1.6, Method: Composition-based stats.
Identities = 27/67 (40%), Positives = 33/67 (49%), Gaps = 7/67 (10%)
Query: 136 GPAAYINHDCRPNC--RFVATERDTACV-KVLRDIEQGEEITC---FYGEDFFGDK-NCL 188
G A +INH C PNC + + E V LRDI Q EE+T F ED D+ CL
Sbjct: 1380 GIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIGQNEELTYDYKFEPEDNPEDRVPCL 1439
Query: 189 CECDTCE 195
C C+
Sbjct: 1440 CGTTACK 1446
>gi|198467813|ref|XP_001354516.2| GA17728 [Drosophila pseudoobscura pseudoobscura]
gi|198146119|gb|EAL31569.2| GA17728 [Drosophila pseudoobscura pseudoobscura]
Length = 2566
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 51/126 (40%), Gaps = 9/126 (7%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSC---RKNCAQLW 134
+G + A R EK+ I +G + E + + N M+ R A L
Sbjct: 2436 QGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQYESKNRGIYMFRLDEDRVVDATLS 2495
Query: 135 LGPAAYINHDCRPNC--RFVATERDTACVK-VLRDIEQGEEITCFYGEDFFGDKN---CL 188
G A YINH C PNC V +RD + R I +GEE++ Y D D + C
Sbjct: 2496 GGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDDAHKIPCA 2555
Query: 189 CECDTC 194
C C
Sbjct: 2556 CGAPNC 2561
>gi|195170125|ref|XP_002025864.1| GL18348 [Drosophila persimilis]
gi|194110717|gb|EDW32760.1| GL18348 [Drosophila persimilis]
Length = 1046
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 51/126 (40%), Gaps = 9/126 (7%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSC---RKNCAQLW 134
+G + A R EK+ I +G + E + + N M+ R A L
Sbjct: 916 QGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQYESKNRGIYMFRLDEDRVVDATLS 975
Query: 135 LGPAAYINHDCRPNC--RFVATERDTACVKVL-RDIEQGEEITCFYGEDFFGDKN---CL 188
G A YINH C PNC V +RD + R I +GEE++ Y D D + C
Sbjct: 976 GGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDDAHKIPCA 1035
Query: 189 CECDTC 194
C C
Sbjct: 1036 CGAPNC 1041
>gi|396457972|ref|XP_003833599.1| hypothetical protein LEMA_P063600.1 [Leptosphaeria maculans JN3]
gi|312210147|emb|CBX90234.1| hypothetical protein LEMA_P063600.1 [Leptosphaeria maculans JN3]
Length = 1372
Score = 39.7 bits (91), Expect = 1.6, Method: Composition-based stats.
Identities = 25/67 (37%), Positives = 31/67 (46%), Gaps = 7/67 (10%)
Query: 136 GPAAYINHDCRPNCRFVATERDTA---CVKVLRDIEQGEEITCFYGEDFFGDKN----CL 188
G A +INH C PNC DT+ + LRDI Q EE+T Y + D CL
Sbjct: 1302 GIARFINHSCTPNCTAKIIRVDTSKRIVIYALRDISQDEELTYDYKFEREMDATDRIPCL 1361
Query: 189 CECDTCE 195
C C+
Sbjct: 1362 CGSIGCK 1368
>gi|25395700|pir||H88444 protein C26E6.12 [imported] - Caenorhabditis elegans
Length = 1802
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 33/61 (54%), Gaps = 4/61 (6%)
Query: 138 AAYINHDCRPNC--RFVATERDTACVKVLRD-IEQGEEITCFYGEDFFGDK-NCLCECDT 193
A +INH C+PNC + + E + V R I++GEEIT Y DK +CLC T
Sbjct: 1737 ARFINHSCQPNCYAKVLTIEGEKRIVIYSRTIIKKGEEITYDYKFPIEDDKIDCLCGAKT 1796
Query: 194 C 194
C
Sbjct: 1797 C 1797
>gi|367042716|ref|XP_003651738.1| hypothetical protein THITE_68287 [Thielavia terrestris NRRL 8126]
gi|346999000|gb|AEO65402.1| hypothetical protein THITE_68287 [Thielavia terrestris NRRL 8126]
Length = 562
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 25/39 (64%), Gaps = 1/39 (2%)
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
A +NH C PN F+ +R TA ++ R I++GEEIT Y
Sbjct: 226 AMVNHSCVPNA-FIGFDRRTAVLRAERPIQEGEEITISY 263
>gi|170581736|ref|XP_001895813.1| F/Y-rich N-terminus family protein [Brugia malayi]
gi|158597106|gb|EDP35332.1| F/Y-rich N-terminus family protein [Brugia malayi]
Length = 2144
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 34/74 (45%), Gaps = 9/74 (12%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATERDTACVKVL----RDIEQGEEITCFYG---EDF 181
A + GPA YINH C PNC R V + K++ R I GEE+T Y ED
Sbjct: 2067 ATMAGGPARYINHSCDPNCSTRLVDSGPCGDDKKIIIIANRPISAGEELTYDYQFDIEDV 2126
Query: 182 FGDKNCLCECDTCE 195
CLC C+
Sbjct: 2127 ADKIPCLCGAPNCQ 2140
>gi|171684729|ref|XP_001907306.1| hypothetical protein [Podospora anserina S mat+]
gi|170942325|emb|CAP67977.1| unnamed protein product [Podospora anserina S mat+]
Length = 559
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 30/46 (65%), Gaps = 2/46 (4%)
Query: 133 LWLGPA-AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
++L PA A +NH C PN F+ ++ TA ++ R I++GEEIT Y
Sbjct: 217 IFLDPALARVNHSCVPNA-FIGFDKRTATLRAERPIKEGEEITISY 261
>gi|407001765|gb|EKE18683.1| hypothetical protein ACD_9C00262G0005 [uncultured bacterium]
Length = 127
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 32/57 (56%), Gaps = 3/57 (5%)
Query: 140 YINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD--KNCLCECDTC 194
YINH C PNC V + + +++I+ GEE+ YG+++F + + C+C C
Sbjct: 70 YINHSCNPNCE-VRIKSARILIYAIKNIQPGEELGYDYGKEYFEEFIRPAGCKCKKC 125
>gi|302894745|ref|XP_003046253.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256727180|gb|EEU40540.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 642
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 32/58 (55%), Gaps = 5/58 (8%)
Query: 141 INHDCRPNCRFVATERDTA----CVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTC 194
+NH CR + R V R + ++ +RDI GE+IT +G+ F ++ CEC+ C
Sbjct: 582 LNHHCRASAR-VRPLRVSGHWICGIEAVRDIRHGEQITVNFGKRFLRNQGLRCECEAC 638
>gi|40789042|dbj|BAA83042.2| KIAA1090 protein [Homo sapiens]
Length = 715
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R K + + VG EL EEE R+ H ND + Y + ++
Sbjct: 422 GKGWGLVAKRDIRKGEFVNEYVG---ELIDEEECMARIKHAHENDITHFYMLTIDKDRII 478
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 479 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 538
Query: 187 CLCEC 191
+C C
Sbjct: 539 TVCRC 543
>gi|213624868|gb|AAI71696.1| Wolf-Hirschhorn syndrome candidate 1 [Danio rerio]
Length = 1461
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 56/126 (44%), Gaps = 13/126 (10%)
Query: 76 GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQL 133
+G +++ R +K + + VG EL EEE R+ H ND + Y + ++
Sbjct: 1162 AGKGWGLISLRDIKKGEFVNEYVG---ELIDEEECRSRIRHAQENDITHFYMLTIDKDRI 1218
Query: 134 W-LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 1219 IDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNE 1278
Query: 186 NCLCEC 191
+C C
Sbjct: 1279 KTVCRC 1284
>gi|67515875|ref|XP_657823.1| hypothetical protein AN0219.2 [Aspergillus nidulans FGSC A4]
gi|40746936|gb|EAA66092.1| hypothetical protein AN0219.2 [Aspergillus nidulans FGSC A4]
gi|259489570|tpe|CBF89950.1| TPA: SET domain protein (AFU_orthologue; AFUA_4G09180) [Aspergillus
nidulans FGSC A4]
Length = 598
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 46/102 (45%), Gaps = 17/102 (16%)
Query: 112 LLHTGLNDFSVMYSCRK-NCAQLWLGPAA----YINHDCRPNCRFV-----ATERDTACV 161
+ +G+ VM K Q++ G +INH CRPN +F ER +
Sbjct: 499 FITSGITGVDVMIGGSKARPYQIFQGGMGNFTRFINHSCRPNSQFQRFYWRGKER---II 555
Query: 162 KVLRDIEQGEEITCFYGEDFFG--DKNCLCECDTCEIRMKGA 201
V R + G EIT Y E ++ +K+CLC C R +GA
Sbjct: 556 VVSRGVPAGAEITVDYSEGYWRELEKSCLCGEACC--RFRGA 595
>gi|425781518|gb|EKV19478.1| hypothetical protein PDIG_02750 [Penicillium digitatum PHI26]
gi|425782798|gb|EKV20685.1| hypothetical protein PDIP_14010 [Penicillium digitatum Pd1]
Length = 330
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 20/41 (48%)
Query: 141 INHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDF 181
NH C PNC+ R+ RDIE GEEI Y DF
Sbjct: 125 FNHSCIPNCKVPTVSREVISSFATRDIEVGEEINLCYYSDF 165
>gi|17552320|ref|NP_498039.1| Protein SET-2, isoform c [Caenorhabditis elegans]
gi|351058302|emb|CCD65736.1| Protein SET-2, isoform c [Caenorhabditis elegans]
Length = 1510
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 33/61 (54%), Gaps = 4/61 (6%)
Query: 138 AAYINHDCRPNC--RFVATERDTACVKVLRD-IEQGEEITCFYGEDFFGDK-NCLCECDT 193
A +INH C+PNC + + E + V R I++GEEIT Y DK +CLC T
Sbjct: 1445 ARFINHSCQPNCYAKVLTIEGEKRIVIYSRTIIKKGEEITYDYKFPIEDDKIDCLCGAKT 1504
Query: 194 C 194
C
Sbjct: 1505 C 1505
>gi|322711625|gb|EFZ03198.1| Histone Lysine Methyltransferase [Metarhizium anisopliae ARSEF 23]
Length = 359
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 34/82 (41%), Gaps = 18/82 (21%)
Query: 131 AQLWLGPAAYINHDCRPNCRFVATERDTA-------CVKVLRDIEQGEEITCFYGEDFFG 183
+ GP +INH C PN R A D A + ++DI++GEE+T Y + G
Sbjct: 273 GEFMSGPTRFINHSCDPNMRIFARVGDHADKHIHDLALFAIKDIQKGEELTFDYVDGASG 332
Query: 184 D-----------KNCLCECDTC 194
D CLC C
Sbjct: 333 DFEELEGKVEDMTKCLCGSSKC 354
>gi|440792261|gb|ELR13489.1| tetratricopeptide repeat domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 764
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 22/40 (55%)
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
AA +NH C PN + CV+ +R IE GEE+T Y
Sbjct: 367 AALLNHSCLPNANWSVDGEGRLCVRAVRPIEAGEEVTVAY 406
>gi|17552318|ref|NP_498040.1| Protein SET-2, isoform a [Caenorhabditis elegans]
gi|30173238|sp|Q18221.2|SET2_CAEEL RecName: Full=Probable histone-lysine N-methyltransferase set-2;
AltName: Full=SET domain-containing protein 2
gi|351058300|emb|CCD65734.1| Protein SET-2, isoform a [Caenorhabditis elegans]
Length = 1507
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 33/61 (54%), Gaps = 4/61 (6%)
Query: 138 AAYINHDCRPNC--RFVATERDTACVKVLRD-IEQGEEITCFYGEDFFGDK-NCLCECDT 193
A +INH C+PNC + + E + V R I++GEEIT Y DK +CLC T
Sbjct: 1442 ARFINHSCQPNCYAKVLTIEGEKRIVIYSRTIIKKGEEITYDYKFPIEDDKIDCLCGAKT 1501
Query: 194 C 194
C
Sbjct: 1502 C 1502
>gi|297282129|ref|XP_002802212.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Macaca mulatta]
Length = 713
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R K + + VG EL EEE R+ H ND + Y + ++
Sbjct: 420 GKGWGLVAKRDIRKGEFVNEYVG---ELIDEEECMARIKHAHENDITHFYMLTIDKDRII 476
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 477 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 536
Query: 187 CLCEC 191
+C C
Sbjct: 537 TVCRC 541
>gi|221487471|gb|EEE25703.1| SET domain-containing protein, putative [Toxoplasma gondii GT1]
Length = 1579
Score = 39.3 bits (90), Expect = 1.9, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 22/40 (55%), Gaps = 2/40 (5%)
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
A YINH C PNC V + + LRD+ GEE+ FY
Sbjct: 1539 ARYINHSCEPNCESVRMPHNAVAIVALRDLLPGEEL--FY 1576
>gi|322693004|gb|EFY84883.1| Histone Lysine Methyltransferase [Metarhizium acridum CQMa 102]
Length = 299
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 34/82 (41%), Gaps = 18/82 (21%)
Query: 131 AQLWLGPAAYINHDCRPNCRFVATERDTA-------CVKVLRDIEQGEEITCFYGEDFFG 183
+ GP +INH C PN R A D A + ++DI++GEE+T Y + G
Sbjct: 213 GEFMSGPTRFINHSCDPNMRIFARVGDHADKHIHDLALFAIKDIQKGEELTFDYVDGASG 272
Query: 184 D-----------KNCLCECDTC 194
D CLC C
Sbjct: 273 DFEELEGKIEDMTKCLCRSSKC 294
>gi|237830201|ref|XP_002364398.1| SET domain-containing protein [Toxoplasma gondii ME49]
gi|211962062|gb|EEA97257.1| SET domain-containing protein [Toxoplasma gondii ME49]
gi|221507269|gb|EEE32873.1| SET domain-containing protein, putative [Toxoplasma gondii VEG]
Length = 1582
Score = 39.3 bits (90), Expect = 1.9, Method: Composition-based stats.
Identities = 18/40 (45%), Positives = 22/40 (55%), Gaps = 2/40 (5%)
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
A YINH C PNC V + + LRD+ GEE+ FY
Sbjct: 1542 ARYINHSCEPNCESVRMPHNAVAIVALRDLLPGEEL--FY 1579
>gi|156230137|gb|AAI52413.1| WHSC1 protein [Homo sapiens]
Length = 713
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R K + + VG EL EEE R+ H ND + Y + ++
Sbjct: 420 GKGWGLVAKRDIRKGEFVNEYVG---ELIDEEECMARIKHAHENDITHFYMLTIDKDRII 476
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 477 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 536
Query: 187 CLCEC 191
+C C
Sbjct: 537 TVCRC 541
>gi|389629206|ref|XP_003712256.1| hypothetical protein MGG_12994 [Magnaporthe oryzae 70-15]
gi|351644588|gb|EHA52449.1| hypothetical protein MGG_12994 [Magnaporthe oryzae 70-15]
gi|440470135|gb|ELQ39221.1| hypothetical protein OOU_Y34scaffold00511g11 [Magnaporthe oryzae
Y34]
gi|440480105|gb|ELQ60800.1| hypothetical protein OOW_P131scaffold01234g11 [Magnaporthe oryzae
P131]
Length = 264
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 36/77 (46%), Gaps = 19/77 (24%)
Query: 140 YINHDCRPNCRFVATERDTACVKVL---RDIEQGEEITCFY-------GEDFFGDKNCLC 189
YINH C P+ F DTA + +L + I GEE+T FY + F CLC
Sbjct: 77 YINHSCDPSLIF-----DTANLAILVGPKGIRAGEELTFFYPSTEWRMAQPF----ACLC 127
Query: 190 ECDTCEIRMKGAFAQSS 206
TC + GA A S+
Sbjct: 128 GAATCRGTISGAEAMST 144
>gi|296004740|ref|XP_966279.2| SET domain protein, putative [Plasmodium falciparum 3D7]
gi|263429753|sp|C6KTD2.1|HKNMT_PLAF7 RecName: Full=Putative histone-lysine N-methyltransferase PFF1440w
gi|225631776|emb|CAG25109.2| SET domain protein, putative [Plasmodium falciparum 3D7]
Length = 6753
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 38/75 (50%), Gaps = 7/75 (9%)
Query: 131 AQLWLGPAAYINHDCRPN--CRFVATERDTACVKVL--RDIEQGEEITCFY--GEDFFGD 184
A W + +INH C PN C+ V+ +++ + + RDI EEIT Y G + G
Sbjct: 6678 ATKWGNVSRFINHSCEPNCFCKIVSCDQNLKHIVIFAKRDIAAHEEITYDYQFGVESEGK 6737
Query: 185 K-NCLCECDTCEIRM 198
K CLC TC RM
Sbjct: 6738 KLICLCGSSTCLGRM 6752
>gi|128485462|ref|NP_001076020.1| probable histone-lysine N-methyltransferase NSD2 [Danio rerio]
Length = 1461
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 56/126 (44%), Gaps = 13/126 (10%)
Query: 76 GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQL 133
+G +++ R +K + + VG EL EEE R+ H ND + Y + ++
Sbjct: 1162 AGKGWGLISLRDIKKGEFVNEYVG---ELIDEEECRSRIRHAQENDITHFYMLTIDKDRI 1218
Query: 134 W-LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 1219 IDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNE 1278
Query: 186 NCLCEC 191
+C C
Sbjct: 1279 KTVCRC 1284
>gi|402852477|ref|XP_003890948.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Papio anubis]
Length = 1013
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R K + + VG EL EEE R+ H ND + Y + ++
Sbjct: 720 GKGWGLVAKRDIRKGEFVNEYVG---ELIDEEECMARIKHAHENDITHFYMLTIDKDRII 776
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 777 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 836
Query: 187 CLCEC 191
+C C
Sbjct: 837 TVCRC 841
>gi|346327338|gb|EGX96934.1| TPR domain protein [Cordyceps militaris CM01]
Length = 756
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 38/85 (44%), Gaps = 23/85 (27%)
Query: 129 NCAQLWLGPAAYINHDCRPNCR--FVATERDTACVKVLRDIEQGEEITCFY--------- 177
+ A +W+ A+Y+NH C NCR F+ D V+ RD+E G E+ Y
Sbjct: 494 STAGVWIK-ASYVNHSCIGNCRRSFIG---DMQIVRATRDLEAGTELLFAYRAAQELDSY 549
Query: 178 -----GEDFFGDKNCLCECDTCEIR 197
G F+G C+C CE R
Sbjct: 550 EDVQSGPKFWG---FTCDCALCEAR 571
>gi|255078050|ref|XP_002502605.1| set domain protein [Micromonas sp. RCC299]
gi|226517870|gb|ACO63863.1| set domain protein [Micromonas sp. RCC299]
Length = 546
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 31/75 (41%), Gaps = 13/75 (17%)
Query: 142 NHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK------------NCLC 189
NH CRPN V+ D + LR IE GEEIT YG +C
Sbjct: 215 NHSCRPNA-AVSVSDDLVTLYSLRRIEAGEEITVSYGTSLLWLPLQMRRTQLARVWGFVC 273
Query: 190 ECDTCEIRMKGAFAQ 204
+C CE + A A+
Sbjct: 274 KCQRCETDLHRAMAE 288
>gi|313246452|emb|CBY35359.1| unnamed protein product [Oikopleura dioica]
Length = 192
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 49/112 (43%), Gaps = 23/112 (20%)
Query: 90 KNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPAA-YINHDCRPN 148
+ND + Y++ +AE+ EK ++H LG AA + NH C PN
Sbjct: 87 ENDKMNYILS-LAEVFGNGEKEIVHIDAGK---------------LGNAARFANHSCSPN 130
Query: 149 CRF--VATERDTACVKVL--RDIEQGEEITCFYG--EDFFGDKNCLCECDTC 194
+ V E D A + + R IE GEEIT YG E ++ C C C
Sbjct: 131 SKLYPVRVENDIARIAIFAERFIEPGEEITYDYGSAESTLSERKCQCGSRCC 182
>gi|443693726|gb|ELT95020.1| hypothetical protein CAPTEDRAFT_183922 [Capitella teleta]
Length = 666
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 28/48 (58%), Gaps = 1/48 (2%)
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
AA +NH C P V + A V+ +RDI++ EEI+ YG F+ D+
Sbjct: 480 AALLNHSCNPEA-IVCYYGNKAVVRAIRDIDKNEEISIAYGVTFYDDE 526
>gi|330843375|ref|XP_003293631.1| hypothetical protein DICPUDRAFT_90323 [Dictyostelium purpureum]
gi|325076008|gb|EGC29833.1| hypothetical protein DICPUDRAFT_90323 [Dictyostelium purpureum]
Length = 343
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 34/74 (45%), Gaps = 12/74 (16%)
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGE----------DFFGDKNC 187
++INH C PN TA +++L+ I GEEIT Y + F +
Sbjct: 265 TSFINHSCDPNAYVQFPNDHTAEIRLLKPINPGEEITISYADTSKDIIDRRSQLFENYGF 324
Query: 188 LCECDTC--EIRMK 199
CEC C E+++K
Sbjct: 325 NCECPKCLNELKLK 338
>gi|441664377|ref|XP_003279042.2| PREDICTED: histone-lysine N-methyltransferase NSD2-like [Nomascus
leucogenys]
Length = 780
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R K + + VG EL EEE R+ H ND + Y + ++
Sbjct: 567 GKGWGLVAKRDIRKGEFVNEYVG---ELIDEEECMARIKHAHENDITHFYMLTIDKDRII 623
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 624 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 683
Query: 187 CLCEC 191
+C C
Sbjct: 684 TVCRC 688
>gi|424841770|ref|ZP_18266395.1| SET domain-containing protein [Saprospira grandis DSM 2844]
gi|395319968|gb|EJF52889.1| SET domain-containing protein [Saprospira grandis DSM 2844]
Length = 128
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 27/107 (25%), Positives = 50/107 (46%), Gaps = 5/107 (4%)
Query: 72 YSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCA 131
Y + +G R V T + D + + + L+ ++ +++ T L+D+ ++ A
Sbjct: 8 YFKDSPKGGRGVFTADYIPKDTLIEICPVLV-LSNDDREKIHQTFLHDYYFLWDKEGKQA 66
Query: 132 QLWLGPAAYINHDCRPNCRFVATERDTACVKVL--RDIEQGEEITCF 176
+ LG + NH +PN + +D + V DIE G+EI CF
Sbjct: 67 AISLGYGSLYNHSYQPNA-YYQMNKDGQSIDVYAGEDIEPGQEI-CF 111
>gi|341038847|gb|EGS23839.1| hypothetical protein CTHT_0005460 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 303
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 35/68 (51%), Gaps = 11/68 (16%)
Query: 140 YINHDCRPNCRFVATERDTACVKVL---RDIEQGEEITCFYGE---DFFGDKNCLCECDT 193
YINH C P+ F DT+ + V+ + ++ G+E+T FY + +CLC T
Sbjct: 79 YINHSCEPSLIF-----DTSSMLVIAGPKGLKPGDELTFFYPSTEWEMAQPFDCLCGAPT 133
Query: 194 CEIRMKGA 201
C R+ GA
Sbjct: 134 CRGRIAGA 141
>gi|299115343|emb|CBN74163.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 579
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 22/67 (32%), Positives = 32/67 (47%), Gaps = 11/67 (16%)
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY----------GEDFFGDKNCL 188
A INH CRPNC FV + V+ L + G E++ +Y ++ K+ L
Sbjct: 301 AMINHACRPNCTFVYFGGNLE-VRTLEPVSAGAELSVYYIDLLQSTAARRQELLTSKHFL 359
Query: 189 CECDTCE 195
C+C CE
Sbjct: 360 CKCSRCE 366
>gi|345565262|gb|EGX48214.1| hypothetical protein AOL_s00081g77 [Arthrobotrys oligospora ATCC
24927]
Length = 169
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 31/65 (47%), Gaps = 6/65 (9%)
Query: 140 YINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGE---DFFGDKNCLCECDTCEI 196
Y NH C PN F + D V+ +R+I++GE IT Y D C C D C
Sbjct: 77 YANHSCNPNVVF---DTDKGEVRTVRNIQEGEPITFNYLSTEWDMAQAFKCECGSDNCLG 133
Query: 197 RMKGA 201
++GA
Sbjct: 134 EIQGA 138
>gi|322703611|gb|EFY95217.1| histone-lysine N-methyltransferase (Ash1), putative [Metarhizium
anisopliae ARSEF 23]
Length = 806
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 47/104 (45%), Gaps = 17/104 (16%)
Query: 138 AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN---CLCE 190
A ++NH C PNCR V+ + A R I G+E+T Y D F KN CLC
Sbjct: 546 ARFVNHSCNPNCRMIKWIVSGQPRMALFAGDRPIMTGDELTYDYNFDPFSAKNVQKCLCG 605
Query: 191 CDTCEIRMKGAFAQSSVEPQTTQPSRYCLRDT---DMRLNKRKL 231
C +G +P+ + S+ ++ T ++ KRKL
Sbjct: 606 EHNC----RGVLGP---KPRDVKSSKTDIKKTVKATVKAGKRKL 642
>gi|344254290|gb|EGW10394.1| Histone-lysine N-methyltransferase MLL2 [Cricetulus griseus]
Length = 1475
Score = 39.3 bits (90), Expect = 2.1, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 1401 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 1460
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 1461 IPCHCGAWNCR 1471
>gi|332531177|ref|ZP_08407090.1| nuclear protein SET [Hylemonella gracilis ATCC 19624]
gi|332039284|gb|EGI75697.1| nuclear protein SET [Hylemonella gracilis ATCC 19624]
Length = 159
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 26/48 (54%)
Query: 131 AQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYG 178
A++ + +INH C PNC T+ +K LR I+ GEE+ YG
Sbjct: 79 AKVGGNASRWINHSCNPNCEADETDEGRVFIKALRKIKTGEELNYDYG 126
>gi|159473232|ref|XP_001694743.1| histone methyltransferase [Chlamydomonas reinhardtii]
gi|158276555|gb|EDP02327.1| histone methyltransferase [Chlamydomonas reinhardtii]
Length = 1105
Score = 39.3 bits (90), Expect = 2.1, Method: Composition-based stats.
Identities = 35/124 (28%), Positives = 52/124 (41%), Gaps = 20/124 (16%)
Query: 122 VMYSCRKNCAQLWLGPAAYINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYG 178
V+ +CRK + +INH C PNC +++ + +RDI + E+T Y
Sbjct: 263 VIDACRKG------NISRFINHSCEPNCETQKWLVHGELAIGLFAVRDIPKDTELTFDYN 316
Query: 179 EDFFGDKN--CLCECDTCEIRMKGAF---------AQSSVEPQTTQPSRYCLRDTDMRLN 227
+ +GDK C C+ C + G A +VE T L +TDM N
Sbjct: 317 FERYGDKPMRCYCKSGGCRKFIGGTQDNFDVSLLPAVETVEDATHDWPPIMLGETDMDPN 376
Query: 228 KRKL 231
R L
Sbjct: 377 MRML 380
>gi|403287002|ref|XP_003934751.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Saimiri
boliviensis boliviensis]
Length = 1368
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R K + + VG E+ EEE R+ H ND + Y + ++
Sbjct: 1072 GKGWGLVAKRDIRKGEFVNEYVG---EVIDEEECMARIKHAHENDITHFYMLTIDKDRII 1128
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 1129 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 1188
Query: 187 CLCEC 191
+C C
Sbjct: 1189 TVCRC 1193
>gi|390461098|ref|XP_003732596.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 2
[Callithrix jacchus]
Length = 1400
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R K + + VG E+ EEE R+ H ND + Y + ++
Sbjct: 1107 GKGWGLVAKRDIRKGEFVNEYVG---EVIDEEECMARIKHAHENDITHFYMLTIDKDRII 1163
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 1164 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 1223
Query: 187 CLCEC 191
+C C
Sbjct: 1224 TVCRC 1228
>gi|326919530|ref|XP_003206033.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Meleagris gallopavo]
Length = 1348
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 57/125 (45%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R +K + + VG EL EEE R+ + ND + Y + ++
Sbjct: 1056 GKGWGLVAKRDIKKGEFVNEYVG---ELIDEEECMARIKYAHENDITHFYMLTIDKDRII 1112
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 1113 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 1172
Query: 187 CLCEC 191
+C+C
Sbjct: 1173 TVCKC 1177
>gi|302686772|ref|XP_003033066.1| hypothetical protein SCHCODRAFT_67062 [Schizophyllum commune H4-8]
gi|300106760|gb|EFI98163.1| hypothetical protein SCHCODRAFT_67062 [Schizophyllum commune H4-8]
Length = 433
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 20/65 (30%), Positives = 32/65 (49%), Gaps = 10/65 (15%)
Query: 141 INHDCRPNCRFVATERD-TACVKVLRDIEQGEEITCFYG---------EDFFGDKNCLCE 190
+NH C PN R+ ++ T+ ++ +RDI+ GEEIT Y + +C
Sbjct: 283 VNHSCSPNARYTFNKQTFTSRLRAVRDIKAGEEITITYSRLDVPSADRQKDLAPYGFVCT 342
Query: 191 CDTCE 195
CD C+
Sbjct: 343 CDACK 347
>gi|118090799|ref|XP_420839.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Gallus
gallus]
Length = 1369
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 57/125 (45%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R +K + + VG EL EEE R+ + ND + Y + ++
Sbjct: 1077 GKGWGLVAKRDIKKGEFVNEYVG---ELIDEEECMARIKYAHENDITHFYMLTIDKDRII 1133
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 1134 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 1193
Query: 187 CLCEC 191
+C+C
Sbjct: 1194 TVCKC 1198
>gi|12642795|gb|AAK00344.1|AF330040_1 IL-5 promoter REII-region-binding protein [Homo sapiens]
gi|119602961|gb|EAW82555.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_g [Homo sapiens]
gi|133777178|gb|AAH94825.2| Wolf-Hirschhorn syndrome candidate 1 [Homo sapiens]
Length = 584
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R K + + VG EL EEE R+ H ND + Y + ++
Sbjct: 291 GKGWGLVAKRDIRKGEFVNEYVG---ELIDEEECMARIKHAHENDITHFYMLTIDKDRII 347
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 348 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 407
Query: 187 CLCEC 191
+C C
Sbjct: 408 TVCRC 412
>gi|348535504|ref|XP_003455240.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Oreochromis niloticus]
Length = 2122
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 50/107 (46%), Gaps = 10/107 (9%)
Query: 95 AYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW-LGP----AAYINHDCRP 147
A++ + E+ EEE R+ H ND Y + ++ GP A ++NH C+P
Sbjct: 1670 AFISEYVGEVIDEEECRARIRHAQENDICNFYMLTLDKDRIIDAGPKGNQARFMNHSCQP 1729
Query: 148 NC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCEC 191
NC ++ + L+D+ +GEE+T Y + G+ C+C
Sbjct: 1730 NCETQKWTVNGDTRVGLFALQDVPKGEELTFNYNLECRGNGKTACKC 1776
>gi|171910458|ref|ZP_02925928.1| Nuclear protein SET [Verrucomicrobium spinosum DSM 4136]
Length = 241
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 28/59 (47%), Gaps = 2/59 (3%)
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDF--FGDKNCLCECDTC 194
A +NH C PN + LRDI++GEE+T YG D + D C C +C
Sbjct: 89 ARLMNHSCEPNVEAQTWDEQEIWFVALRDIKKGEELTFNYGFDLECWEDHPCRCGTASC 147
>gi|342884988|gb|EGU85104.1| hypothetical protein FOXB_04383 [Fusarium oxysporum Fo5176]
Length = 344
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 36/82 (43%), Gaps = 18/82 (21%)
Query: 131 AQLWLGPAAYINHDCRPNCRFVATERDTA-------CVKVLRDIEQGEEITCFY------ 177
+ GP ++NH C PN R A D A + ++DI +GEE+T Y
Sbjct: 258 GEFMSGPTRFVNHSCDPNMRIFARVGDHADKHIHDLALFAIKDIPEGEELTFDYVDGVSH 317
Query: 178 -GEDFFGDKN----CLCECDTC 194
GE+ GD + CLC C
Sbjct: 318 EGEETGGDIDHMTRCLCGSKKC 339
>gi|296197020|ref|XP_002746091.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform 1
[Callithrix jacchus]
Length = 1365
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R K + + VG E+ EEE R+ H ND + Y + ++
Sbjct: 1072 GKGWGLVAKRDIRKGEFVNEYVG---EVIDEEECMARIKHAHENDITHFYMLTIDKDRII 1128
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 1129 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 1188
Query: 187 CLCEC 191
+C C
Sbjct: 1189 TVCRC 1193
>gi|359687401|ref|ZP_09257402.1| lysine methyltransferase [Leptospira licerasiae serovar Varillal
str. MMD0835]
gi|418750259|ref|ZP_13306545.1| SET domain protein [Leptospira licerasiae str. MMD4847]
gi|418756365|ref|ZP_13312553.1| SET domain protein [Leptospira licerasiae serovar Varillal str. VAR
010]
gi|384116036|gb|EIE02293.1| SET domain protein [Leptospira licerasiae serovar Varillal str. VAR
010]
gi|404272862|gb|EJZ40182.1| SET domain protein [Leptospira licerasiae str. MMD4847]
Length = 143
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 17/44 (38%), Positives = 28/44 (63%), Gaps = 1/44 (2%)
Query: 140 YINHDCRPNCRFVATER-DTACVKVLRDIEQGEEITCFYGEDFF 182
YINH +PN + V + R TA + +R I+ GEE+ YG++++
Sbjct: 89 YINHSSKPNVKLVVSTRWKTARFEAMRKIKAGEELFFDYGDEYW 132
>gi|302921594|ref|XP_003053314.1| hypothetical protein NECHADRAFT_123121 [Nectria haematococca mpVI
77-13-4]
gi|256734254|gb|EEU47601.1| hypothetical protein NECHADRAFT_123121 [Nectria haematococca mpVI
77-13-4]
Length = 776
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 30/64 (46%), Gaps = 7/64 (10%)
Query: 138 AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN---CLCE 190
A ++NH C PNCR V+ + A + I GEE+T Y D F KN CLC
Sbjct: 530 ARFVNHSCNPNCRMIKWIVSGQPRMALFAGDKPITTGEELTYDYNFDPFSAKNVQKCLCG 589
Query: 191 CDTC 194
C
Sbjct: 590 EPNC 593
>gi|406863858|gb|EKD16905.1| TPR domain protein [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 739
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 27/49 (55%), Gaps = 6/49 (12%)
Query: 133 LWLGPAAYINHDCRPNCR--FVATERDTACVKVLRDIEQGEEITCFYGE 179
LW+ A+YINH C PNCR F+ D V+ RD+ EIT Y E
Sbjct: 510 LWIL-ASYINHSCEPNCRRAFIG---DVQIVRAARDMPADTEITLSYIE 554
>gi|359687226|ref|ZP_09257227.1| SET family protein [Leptospira licerasiae serovar Varillal str.
MMD0835]
gi|418751084|ref|ZP_13307370.1| SET domain protein [Leptospira licerasiae str. MMD4847]
gi|418756288|ref|ZP_13312476.1| SET domain protein [Leptospira licerasiae serovar Varillal str. VAR
010]
gi|384115959|gb|EIE02216.1| SET domain protein [Leptospira licerasiae serovar Varillal str. VAR
010]
gi|404273687|gb|EJZ41007.1| SET domain protein [Leptospira licerasiae str. MMD4847]
Length = 176
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 25/103 (24%), Positives = 46/103 (44%), Gaps = 6/103 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTK--EEEKRLLHTGLNDFSVMYSCRKNCAQLWL 135
+G + + +W +++ + V ++E + EEEK + D + + +
Sbjct: 22 KGTLLFSYSEWIEDEEFGWKVLSVSEADELPEEEKEIFMKYGYDVDFGLVTGPSGPEFVI 81
Query: 136 GPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYG 178
+ ++NH C PN + T+ A RDIE GEE+ YG
Sbjct: 82 NHSNFMNHSCDPNMWYDQTDNIIAK----RDIEVGEELNIDYG 120
>gi|341899936|gb|EGT55871.1| hypothetical protein CAEBREN_19232 [Caenorhabditis brenneri]
Length = 280
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 22/67 (32%), Positives = 32/67 (47%), Gaps = 7/67 (10%)
Query: 140 YINHDCRPNCRFVATERDT-------ACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECD 192
+INH C PN + D C + I++GEE+T YGE ++ +K C C
Sbjct: 207 FINHSCSPNVKVANVSWDYDEIQLIHMCFYTDKLIKKGEELTIDYGEAWWTNKKFACMCG 266
Query: 193 TCEIRMK 199
+ E R K
Sbjct: 267 SAECRYK 273
>gi|301114847|ref|XP_002999193.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262111287|gb|EEY69339.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 295
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 30/65 (46%), Gaps = 6/65 (9%)
Query: 136 GPAAYINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK---NCLC 189
G A +INH C PNC ++ + + LRDI GEE+T Y + F CLC
Sbjct: 72 GWARFINHSCNPNCGVEKWDVNGEERCAIFALRDIVAGEELTFDYKFESFSKAEITECLC 131
Query: 190 ECDTC 194
C
Sbjct: 132 GAPNC 136
>gi|327277055|ref|XP_003223281.1| PREDICTED: hypothetical protein LOC100554175 [Anolis carolinensis]
Length = 5261
Score = 38.9 bits (89), Expect = 2.3, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5187 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5246
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5247 IPCHCGAWNCR 5257
>gi|195393214|ref|XP_002055249.1| GJ18896 [Drosophila virilis]
gi|194149759|gb|EDW65450.1| GJ18896 [Drosophila virilis]
Length = 506
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 37/76 (48%), Gaps = 12/76 (15%)
Query: 139 AYINHDCRPNCRFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGD----------KNC 187
A +NHDC PN + E+ + V+ DI +GEEIT Y + F G+ K
Sbjct: 214 AVMNHDCIPNSYYTFEEKTNNMIVRAAVDIAEGEEITTTYTKLFTGNIARHLYLKMKKGF 273
Query: 188 LCECDTC-EIRMKGAF 202
C+C C + KGA+
Sbjct: 274 TCKCPRCSDPTEKGAY 289
>gi|336468088|gb|EGO56251.1| hypothetical protein NEUTE1DRAFT_64645 [Neurospora tetrasperma FGSC
2508]
gi|350289671|gb|EGZ70896.1| hypothetical protein NEUTE2DRAFT_92703 [Neurospora tetrasperma FGSC
2509]
Length = 280
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 34/68 (50%), Gaps = 11/68 (16%)
Query: 140 YINHDCRPNCRFVATERDTACVKVL---RDIEQGEEITCFYGEDFFGDK---NCLCECDT 193
YINH C P+ F DT + V+ + I+ GEE+T FY + +CLC T
Sbjct: 77 YINHSCEPSLIF-----DTGNMNVIAGPKGIQIGEELTFFYPSTEWTMAQPFDCLCAKPT 131
Query: 194 CEIRMKGA 201
C R+ GA
Sbjct: 132 CRGRISGA 139
>gi|115402571|ref|XP_001217362.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114189208|gb|EAU30908.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 790
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 57/130 (43%), Gaps = 13/130 (10%)
Query: 76 GNRGARIVATRKWEKNDNIAYLVGCIAELTKEE-EKRLLHTGLND--FSVMYSCRKNCAQ 132
+RG + + R ++ N I G I +T+ E EKR+ N+ + +MY +
Sbjct: 450 ADRGYGVRSNRTFDPNQIIVEYTGEI--ITQSECEKRMRTIYKNNECYYLMYFDQNMIID 507
Query: 133 LWLGP-AAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN- 186
G A ++NH C PNCR VA + A R + GEE+T Y D + KN
Sbjct: 508 ATRGSIARFVNHSCEPNCRMEKWTVAGKPRMALFAGDRGVMTGEELTYDYNFDPYSQKNV 567
Query: 187 --CLCECDTC 194
C C C
Sbjct: 568 QQCRCGSANC 577
>gi|62088568|dbj|BAD92731.1| myeloid/lymphoid or mixed-lineage leukemia 2 variant [Homo sapiens]
Length = 2704
Score = 38.9 bits (89), Expect = 2.3, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 2630 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 2689
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 2690 IPCHCGAWNCR 2700
>gi|224050217|ref|XP_002195834.1| PREDICTED: histone-lysine N-methyltransferase NSD2 [Taeniopygia
guttata]
Length = 1339
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 57/125 (45%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R +K + + VG EL EEE R+ + ND + Y + ++
Sbjct: 1076 GKGWGLVAKRDIKKGEFVNEYVG---ELIDEEECMARIKYAHENDITHFYMLTIDKDRII 1132
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 1133 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 1192
Query: 187 CLCEC 191
+C+C
Sbjct: 1193 TVCKC 1197
>gi|221057732|ref|XP_002261374.1| SET-domain protein [Plasmodium knowlesi strain H]
gi|194247379|emb|CAQ40779.1| SET-domain protein, putative [Plasmodium knowlesi strain H]
Length = 6442
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 38/75 (50%), Gaps = 7/75 (9%)
Query: 131 AQLWLGPAAYINHDCRPN--CRFVATERDTACVKVL--RDIEQGEEITCFY--GEDFFGD 184
A W + +INH C PN C+ V+ +++ + + RDI EEIT Y G + G
Sbjct: 6367 ATKWGNVSRFINHSCEPNCFCKIVSCDQNLKHIVIFAKRDIVAHEEITYDYQFGVESEGK 6426
Query: 185 K-NCLCECDTCEIRM 198
K CLC TC RM
Sbjct: 6427 KLICLCGSSTCLGRM 6441
>gi|410916601|ref|XP_003971775.1| PREDICTED: N-lysine methyltransferase SMYD2-B-like [Takifugu
rubripes]
Length = 432
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 51/112 (45%), Gaps = 26/112 (23%)
Query: 135 LGPAAY-----INHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFG--DKN- 186
LG A Y INH CRP+ V +A V+ LRD++ G+E+ Y + + D+N
Sbjct: 193 LGTAVYPDVALINHSCRPSV-IVTYSGTSAHVRALRDMKPGDEVLISYIDVLYPTEDRNN 251
Query: 187 -------CLCECDTCEIR------MKGAFAQSSVEPQT----TQPSRYCLRD 221
C+C+ CE R +K +EP+ + +R C+R+
Sbjct: 252 RLRESYYFTCQCEECESRSMDQAKLKARKRSEPIEPEAISNMVRYARKCIRE 303
>gi|398343713|ref|ZP_10528416.1| lysine methyltransferase [Leptospira inadai serovar Lyme str. 10]
Length = 143
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 140 YINHDCRPNCRFVATER-DTACVKVLRDIEQGEEITCFYGEDFF 182
YINH +PN + V + R TA + LR I GEE+ YG++++
Sbjct: 89 YINHSTKPNIKLVVSTRWKTARFEALRKIRAGEELFFDYGDEYW 132
>gi|281343718|gb|EFB19302.1| hypothetical protein PANDA_017001 [Ailuropoda melanoleuca]
Length = 4932
Score = 38.9 bits (89), Expect = 2.4, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 4858 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 4917
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 4918 IPCHCGAWNCR 4928
>gi|426372409|ref|XP_004053116.1| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 2 [Gorilla
gorilla gorilla]
Length = 5284
Score = 38.9 bits (89), Expect = 2.4, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5210 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5269
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5270 IPCHCGAWNCR 5280
>gi|395744200|ref|XP_002823221.2| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 3 [Pongo
abelii]
Length = 5293
Score = 38.9 bits (89), Expect = 2.4, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5219 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5278
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5279 IPCHCGAWNCR 5289
>gi|308511463|ref|XP_003117914.1| hypothetical protein CRE_00035 [Caenorhabditis remanei]
gi|308238560|gb|EFO82512.1| hypothetical protein CRE_00035 [Caenorhabditis remanei]
Length = 363
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 45/198 (22%), Positives = 84/198 (42%), Gaps = 25/198 (12%)
Query: 76 GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWL 135
G++G+ + A + +K +L +L +E+ D + + ++
Sbjct: 120 GDKGSGVYALEELKKG---RFLCAYFGDLISPDERNRRKELYKDEGLPHDYTFQAGSFFI 176
Query: 136 GP------AAYINHDCRPNCRFVATERDTACVKVL-------RDIEQGEEITCFYGEDFF 182
P A Y NH C PN + + D ++ R+I GEE+T Y D+
Sbjct: 177 DPTKRGNIAKYANHSCIPNMVAITYKLDGHHKNLMVIGYFAARNIGIGEELTVAYNFDYD 236
Query: 183 GDKN--CLCECDTCEIRM-KGAFAQSSV-EPQTTQPSRYC-----LRDTDMRLNKRKLHK 233
+ + CLC +C+ M KG + V E PS+ +R+T K+++ K
Sbjct: 237 PENSQICLCGATSCKGFMGKGPKKEDMVAEIPVRVPSKKTGVIGRVRNTKNVFKKQQVPK 296
Query: 234 KLNRLLLASDKNDTNSSD 251
+ N+ + + +ND N++D
Sbjct: 297 QANQPSVEALENDENNAD 314
>gi|119578438|gb|EAW58034.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_a [Homo
sapiens]
Length = 4539
Score = 38.9 bits (89), Expect = 2.4, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 4465 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 4524
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 4525 IPCHCGAWNCR 4535
>gi|17861882|gb|AAL39418.1| GM10003p [Drosophila melanogaster]
Length = 421
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 51/126 (40%), Gaps = 9/126 (7%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSC---RKNCAQLW 134
+G + A R EK+ I +G + E + + N M+ R A L
Sbjct: 291 QGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQYESKNRGIYMFRLDEDRVVDATLS 350
Query: 135 LGPAAYINHDCRPNC--RFVATERDTACVKVL-RDIEQGEEITCFYGEDFFGDKN---CL 188
G A YINH C PNC V +RD + R I +GEE++ Y D + + C
Sbjct: 351 GGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDESHKIPCA 410
Query: 189 CECDTC 194
C C
Sbjct: 411 CGAPNC 416
>gi|395540930|ref|XP_003772403.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Sarcophilus
harrisii]
Length = 5047
Score = 38.9 bits (89), Expect = 2.4, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 4973 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5032
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5033 IPCHCGAWNCR 5043
>gi|393908177|gb|EJD74941.1| F/Y-rich family protein [Loa loa]
Length = 2288
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 34/74 (45%), Gaps = 9/74 (12%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATERDTACVKVL----RDIEQGEEITCFYG---EDF 181
A + GPA YINH C PNC R V + K++ R I GEE+T Y ED
Sbjct: 2211 ATMAGGPARYINHSCDPNCSTRLVDSGPCGDDKKIIIIANRPISAGEELTYDYQFDIEDA 2270
Query: 182 FGDKNCLCECDTCE 195
CLC C+
Sbjct: 2271 ADKIPCLCGAPNCQ 2284
>gi|413959799|ref|ZP_11399030.1| nuclear protein [Burkholderia sp. SJ98]
gi|413939749|gb|EKS71717.1| nuclear protein [Burkholderia sp. SJ98]
Length = 146
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 34/74 (45%), Gaps = 13/74 (17%)
Query: 138 AAYINHDCRPNCRFVATERDTACV--KVLRDIEQGEEITCFYGEDFFGDKN--------C 187
A ++NH C PNC E+D + V LRDI+ GEE+ YG G K C
Sbjct: 75 ARWVNHSCAPNCE---AEQDGSRVFFLALRDIKPGEELFIDYGLIVEGRKTAALKKLYAC 131
Query: 188 LCECDTCEIRMKGA 201
C DTC M A
Sbjct: 132 RCGADTCRGTMLAA 145
>gi|119578440|gb|EAW58036.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_c [Homo
sapiens]
Length = 5265
Score = 38.9 bits (89), Expect = 2.4, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5191 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5250
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5251 IPCHCGAWNCR 5261
>gi|395841650|ref|XP_003793647.1| PREDICTED: uncharacterized protein LOC100944849 [Otolemur garnettii]
Length = 5488
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5414 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5473
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5474 IPCHCGAWNCR 5484
>gi|67539250|ref|XP_663399.1| hypothetical protein AN5795.2 [Aspergillus nidulans FGSC A4]
gi|74680884|sp|Q5B0Y5.1|SET1_EMENI RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|40743698|gb|EAA62888.1| hypothetical protein AN5795.2 [Aspergillus nidulans FGSC A4]
gi|259484715|tpe|CBF81174.1| TPA: Histone-lysine N-methyltransferase, H3 lysine-4 specific (EC
2.1.1.43)(COMPASS component SET1)(SET domain-containing
protein 1) [Source:UniProtKB/Swiss-Prot;Acc:Q5B0Y5]
[Aspergillus nidulans FGSC A4]
Length = 1220
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 46/97 (47%), Gaps = 11/97 (11%)
Query: 109 EKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYINHDCRPNCRFVATERDTA---CVK 162
E+R L +G+ S ++ +N A G A +INH C PNC + D + +
Sbjct: 1121 ERRYLKSGIGS-SYLFRIDENTVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIY 1179
Query: 163 VLRDIEQGEEITCFYGEDFFGDKN----CLCECDTCE 195
LRDIE+ EE+T Y + D + CLC C+
Sbjct: 1180 ALRDIERDEELTYDYKFEREWDSDDRIPCLCGSAGCK 1216
>gi|426372407|ref|XP_004053115.1| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 1 [Gorilla
gorilla gorilla]
Length = 5550
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5476 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5535
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5536 IPCHCGAWNCR 5546
>gi|410964289|ref|XP_003988688.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Felis catus]
Length = 5559
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5485 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5544
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5545 IPCHCGAWNCR 5555
>gi|403297007|ref|XP_003939383.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Saimiri
boliviensis boliviensis]
Length = 5498
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5424 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5483
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5484 IPCHCGAWNCR 5494
>gi|398348743|ref|ZP_10533446.1| lysine methyltransferase [Leptospira broomii str. 5399]
Length = 143
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 140 YINHDCRPNCRFVATER-DTACVKVLRDIEQGEEITCFYGEDFF 182
YINH +PN + V + R TA + LR I GEE+ YG++++
Sbjct: 89 YINHSTKPNIKLVVSTRWKTARFEALRKIRAGEELFFDYGDEYW 132
>gi|397510996|ref|XP_003846168.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Pan paniscus]
Length = 5373
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5299 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5358
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5359 IPCHCGAWNCR 5369
>gi|359718904|ref|NP_001028448.3| histone-lysine N-methyltransferase MLL2 [Mus musculus]
Length = 5588
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5514 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5573
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5574 IPCHCGAWNCR 5584
>gi|348580193|ref|XP_003475863.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Cavia
porcellus]
Length = 5577
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5503 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5562
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5563 IPCHCGAWNCR 5573
>gi|345792161|ref|XP_543684.3| PREDICTED: histone-lysine N-methyltransferase MLL2 [Canis lupus
familiaris]
Length = 5552
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5478 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5537
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5538 IPCHCGAWNCR 5548
>gi|332206905|ref|XP_003252537.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Nomascus leucogenys]
Length = 5407
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5333 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5392
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5393 IPCHCGAWNCR 5403
>gi|297691727|ref|XP_002823219.1| PREDICTED: histone-lysine N-methyltransferase MLL2 isoform 1 [Pongo
abelii]
Length = 5559
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5485 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5544
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5545 IPCHCGAWNCR 5555
>gi|313471390|sp|Q6PDK2.2|MLL2_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL2; AltName:
Full=Lysine N-methyltransferase 2D; Short=KMT2D
Length = 5588
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5514 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5573
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5574 IPCHCGAWNCR 5584
>gi|194666944|ref|XP_583302.4| PREDICTED: histone-lysine N-methyltransferase MLL2 [Bos taurus]
gi|297474553|ref|XP_002687353.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Bos taurus]
gi|296487853|tpg|DAA29966.1| TPA: myeloid/lymphoid or mixed-lineage leukemia 2-like [Bos taurus]
Length = 5503
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5429 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5488
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5489 IPCHCGAWNCR 5499
>gi|164423494|ref|XP_963144.2| hypothetical protein NCU08475 [Neurospora crassa OR74A]
gi|16944512|emb|CAB92029.2| hypothetical protein [Neurospora crassa]
gi|157070117|gb|EAA33908.2| predicted protein [Neurospora crassa OR74A]
Length = 328
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 34/68 (50%), Gaps = 11/68 (16%)
Query: 140 YINHDCRPNCRFVATERDTACVKVL---RDIEQGEEITCFYGEDFFGDK---NCLCECDT 193
YINH C P+ F DT + V+ + I+ GEE+T FY + +CLC T
Sbjct: 77 YINHSCEPSLIF-----DTGNMNVIAGPKGIQIGEELTFFYPSTEWTMAQPFDCLCAKPT 131
Query: 194 CEIRMKGA 201
C R+ GA
Sbjct: 132 CRGRISGA 139
>gi|426226681|ref|XP_004007467.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Ovis aries]
Length = 5387
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5313 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5372
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5373 IPCHCGAWNCR 5383
>gi|392355921|ref|XP_002729900.2| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Rattus
norvegicus]
Length = 5543
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5469 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5528
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5529 IPCHCGAWNCR 5539
>gi|392341685|ref|XP_001062568.3| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Rattus norvegicus]
Length = 5543
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5469 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5528
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5529 IPCHCGAWNCR 5539
>gi|355564192|gb|EHH20692.1| hypothetical protein EGK_03605 [Macaca mulatta]
Length = 5538
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5464 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5523
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5524 IPCHCGAWNCR 5534
>gi|350583914|ref|XP_003481621.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Sus scrofa]
Length = 5154
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5082 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5141
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5142 IPCHCGAWNCR 5152
>gi|148762969|ref|NP_003473.3| histone-lysine N-methyltransferase MLL2 [Homo sapiens]
gi|313104132|sp|O14686.2|MLL2_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL2; AltName:
Full=ALL1-related protein; AltName: Full=Lysine
N-methyltransferase 2D; Short=KMT2D; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 2
gi|119578439|gb|EAW58035.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_b [Homo
sapiens]
Length = 5537
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5463 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5522
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5523 IPCHCGAWNCR 5533
>gi|70994120|ref|XP_751907.1| SET domain protein [Aspergillus fumigatus Af293]
gi|66849541|gb|EAL89869.1| SET domain protein [Aspergillus fumigatus Af293]
gi|159125178|gb|EDP50295.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 367
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 34/134 (25%), Positives = 54/134 (40%), Gaps = 25/134 (18%)
Query: 68 PCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCR 127
P +S + +VA +++ I VG L+ G++ VM
Sbjct: 237 PVQIFSTASGKNNGVVARVNFDRGAAIGEFVG------------LITNGISGVDVMVGGT 284
Query: 128 KN-CAQLWLGPAA----YINHDCRPNCRFV-----ATERDTACVKVLRDIEQGEEITCFY 177
K+ Q++ G +INH CRPN +F ER + V R + G EIT Y
Sbjct: 285 KSRTYQIYQGEMGNFTRFINHSCRPNSQFQRFYWRGQER---IIVVSRGVVAGREITVDY 341
Query: 178 GEDFFGDKNCLCEC 191
+ ++ N +C C
Sbjct: 342 SDHYWKQLNKVCLC 355
>gi|402885854|ref|XP_003919662.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Papio anubis]
Length = 5547
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5473 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5532
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5533 IPCHCGAWNCR 5543
>gi|390467630|ref|XP_002807137.2| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2 [Callithrix jacchus]
Length = 5289
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5215 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5274
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5275 IPCHCGAWNCR 5285
>gi|341896007|gb|EGT51942.1| hypothetical protein CAEBREN_26218 [Caenorhabditis brenneri]
Length = 1670
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 25/61 (40%), Positives = 32/61 (52%), Gaps = 4/61 (6%)
Query: 138 AAYINHDCRPNC--RFVATERDTACVKVLRD-IEQGEEITCFYGEDFFGDK-NCLCECDT 193
A +INH C+PNC + + E + V R I +GEEIT Y DK +CLC T
Sbjct: 1605 ARFINHSCQPNCYAKVLTIEGEKRIVIYSRSTINKGEEITYDYKFPIEEDKIDCLCGAKT 1664
Query: 194 C 194
C
Sbjct: 1665 C 1665
>gi|301783643|ref|XP_002927255.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Ailuropoda
melanoleuca]
Length = 5483
Score = 38.9 bits (89), Expect = 2.5, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5409 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 5468
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5469 IPCHCGAWNCR 5479
>gi|226294579|gb|EEH49999.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
Length = 808
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 36/127 (28%), Positives = 57/127 (44%), Gaps = 18/127 (14%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGL--NDFSVMYSCRKNCAQLW 134
+RG + + R + N I G I +T++E +R + T N+ ++ + R + A+
Sbjct: 435 DRGYGVRSNRTFAPNQIIVEYTGEI--ITQKECERRMRTVYKNNENMIIDATRGSIAR-- 490
Query: 135 LGPAAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN---C 187
++NH C PNC VA + A I GEE+T Y D + KN C
Sbjct: 491 -----FVNHSCEPNCEMEKWTVAGKPRMALFAGKNGITTGEELTYDYNFDPYSQKNVQEC 545
Query: 188 LCECDTC 194
C +TC
Sbjct: 546 RCGAETC 552
>gi|86278478|gb|ABC88477.1| Wolf-Hirschhorn syndrome candidate 1 protein [Danio rerio]
Length = 1366
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 56/126 (44%), Gaps = 13/126 (10%)
Query: 76 GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQL 133
+G +++ R +K + + VG EL EEE R+ H ND + Y + ++
Sbjct: 1067 AGKGWGLISLRDIKKGEFVNEYVG---ELIDEEECRSRIRHAQENDITHFYMLTIDKDRI 1123
Query: 134 W-LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 1124 IDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNE 1183
Query: 186 NCLCEC 191
+C C
Sbjct: 1184 KTVCRC 1189
>gi|134116841|ref|XP_772647.1| hypothetical protein CNBK0210 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50255265|gb|EAL18000.1| hypothetical protein CNBK0210 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 537
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 30/70 (42%), Gaps = 16/70 (22%)
Query: 141 INHDCRPNCRFV-----ATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN--------- 186
I+ CRPN RF+ E A + + DI G E+ YG FFGD+N
Sbjct: 307 ISKACRPNVRFIYLDPHNPEVTIAYLVAIDDIPAGHELFIRYGPTFFGDRNQRQRLLIQE 366
Query: 187 --CLCECDTC 194
C+C C
Sbjct: 367 TGSKCQCKDC 376
>gi|66571150|gb|AAY51540.1| IP01448p [Drosophila melanogaster]
Length = 275
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 27/64 (42%), Gaps = 9/64 (14%)
Query: 140 YINHDCRPNCRFVATERDTACVKV----LRDIEQGEEITCFYGED-----FFGDKNCLCE 190
Y+NH C PNC A D K+ RDI EE+ YG + G K CLC
Sbjct: 201 YLNHSCEPNCHIAAVRIDCPIPKIGIFAARDIAAKEELCFHYGGEGQYKKMTGGKTCLCG 260
Query: 191 CDTC 194
C
Sbjct: 261 ASKC 264
>gi|161078181|ref|NP_001097743.1| CG4565 [Drosophila melanogaster]
gi|158030216|gb|AAF54563.2| CG4565 [Drosophila melanogaster]
gi|220943364|gb|ACL84225.1| CG4565-PB [synthetic construct]
gi|220953330|gb|ACL89208.1| CG4565-PB [synthetic construct]
Length = 269
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 27/64 (42%), Gaps = 9/64 (14%)
Query: 140 YINHDCRPNCRFVATERDTACVKV----LRDIEQGEEITCFYGED-----FFGDKNCLCE 190
Y+NH C PNC A D K+ RDI EE+ YG + G K CLC
Sbjct: 195 YLNHSCEPNCHIAAVRIDCPIPKIGIFAARDIAAKEELCFHYGGEGQYKKMTGGKTCLCG 254
Query: 191 CDTC 194
C
Sbjct: 255 ASKC 258
>gi|359492362|ref|XP_002284621.2| PREDICTED: uncharacterized protein LOC100245350 [Vitis vinifera]
Length = 2184
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 41/80 (51%), Gaps = 14/80 (17%)
Query: 130 CAQLWLGPAAYINHDCRPNCRFVATER----DTACVKV--LRDIEQGEEITCFYG-EDFF 182
CA+ LG +INH C PNCR TE+ C+ + LRDI++GEE+T Y F
Sbjct: 1317 CAKGNLG--RFINHSCDPNCR---TEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVF 1371
Query: 183 G--DKNCLCECDTCEIRMKG 200
G K C+C C + G
Sbjct: 1372 GAAAKKCVCGSPQCRGYIGG 1391
>gi|398398285|ref|XP_003852600.1| hypothetical protein MYCGRDRAFT_93271 [Zymoseptoria tritici IPO323]
gi|339472481|gb|EGP87576.1| hypothetical protein MYCGRDRAFT_93271 [Zymoseptoria tritici IPO323]
Length = 344
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 21/65 (32%), Positives = 31/65 (47%), Gaps = 3/65 (4%)
Query: 120 FSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATE--RDTACVKVLRDIEQGEEITCFY 177
+S M+ R + G A+ NH C PNC + C+ + RD+ GE++T Y
Sbjct: 97 YSNMFHIRGQPREGCFGHASRFNHSCAPNCAITTNGQWQQQQCLTI-RDVSVGEDLTFSY 155
Query: 178 GEDFF 182
EDF
Sbjct: 156 TEDFL 160
>gi|300702470|ref|YP_003744070.1| histone-lysine N-methyltransferase fragment, partial [Ralstonia
solanacearum CFBP2957]
gi|299070131|emb|CBJ41421.1| Putative histone-lysine N-methyltransferase fragment [Ralstonia
solanacearum CFBP2957]
Length = 179
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 26/42 (61%), Gaps = 3/42 (7%)
Query: 138 AAYINHDCRPNCRFVATERDTAC-VKVLRDIEQGEEITCFYG 178
A +INH C+PNC A E+D + LRDIE GEE+ YG
Sbjct: 102 ARWINHACKPNCE--AREKDGRVFIHALRDIEAGEELFYDYG 141
>gi|449510125|ref|XP_004176585.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL2-like, partial [Taeniopygia guttata]
Length = 4299
Score = 38.9 bits (89), Expect = 2.7, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 4225 APLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 4284
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 4285 IPCHCGAWNCR 4295
>gi|346978461|gb|EGY21913.1| SET domain-containing protein [Verticillium dahliae VdLs.17]
Length = 680
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 32/140 (22%), Positives = 58/140 (41%), Gaps = 17/140 (12%)
Query: 68 PCYRYSLEGNRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCR 127
P + R ++A +E+ I VG + + + + TG+ + + +
Sbjct: 549 PVQVFRTTDGRNNGLLALCAFERGAAIGEFVGLVTKDLQNLDVMDSSTGVRGYQIWQGRQ 608
Query: 128 KNCAQLWLGPAAYINHDCRPNCRF-----VATERDTACVKVLRDIEQGEEITCFYGEDFF 182
N + + NH C+ N +F ++T+R + V + IE G EIT Y ++
Sbjct: 609 GNYTR-------FTNHSCKANAQFQHFVWMSTQR---IILVSKGIEAGHEITVDYSGSYW 658
Query: 183 G--DKNCLCECDTCEIRMKG 200
DK+CLC C + G
Sbjct: 659 RGLDKDCLCGEACCRYKNSG 678
>gi|395543169|ref|XP_003773493.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Sarcophilus harrisii]
Length = 1464
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 56/125 (44%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R +K + + VG EL EEE R+ + ND + Y + ++
Sbjct: 1073 GKGWGLVAKRDIKKGEFVNEYVG---ELIDEEECMARIKYAHENDITHFYMLTIDKDRII 1129
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 1130 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 1189
Query: 187 CLCEC 191
+C C
Sbjct: 1190 TVCRC 1194
>gi|346973210|gb|EGY16662.1| histone-lysine N-methyltransferase [Verticillium dahliae VdLs.17]
Length = 354
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 35/83 (42%), Gaps = 19/83 (22%)
Query: 131 AQLWLGPAAYINHDCRPNCRFVATERDTA-------CVKVLRDIEQGEEITCFY--GEDF 181
+ GP ++NH C PN R A D A + LRDI +GEE+T Y G
Sbjct: 267 GEFMSGPTRFVNHSCAPNLRIFARVGDHADKHIHDIAMFALRDIPRGEELTFDYVSGMVH 326
Query: 182 FGDKN----------CLCECDTC 194
GD+ CLC D C
Sbjct: 327 EGDEKDEQKRDHMTRCLCGSDKC 349
>gi|401888851|gb|EJT52799.1| histone-lysine N-methyltransferase [Trichosporon asahii var. asahii
CBS 2479]
Length = 905
Score = 38.9 bits (89), Expect = 2.7, Method: Composition-based stats.
Identities = 19/56 (33%), Positives = 29/56 (51%), Gaps = 3/56 (5%)
Query: 136 GPAAYINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCL 188
G + NH C PNC ++V R + RD+++ EEIT Y + G+ NC+
Sbjct: 312 GIGRFANHSCNPNCEVQKWVVGRRMRMGIFTKRDVKKDEEITFNYNICYCGEPNCV 367
>gi|421889781|ref|ZP_16320788.1| Putative histone-lysine N-methyltransferase fragment [Ralstonia
solanacearum K60-1]
gi|378964862|emb|CCF97536.1| Putative histone-lysine N-methyltransferase fragment [Ralstonia
solanacearum K60-1]
Length = 179
Score = 38.9 bits (89), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 26/42 (61%), Gaps = 3/42 (7%)
Query: 138 AAYINHDCRPNCRFVATERDTAC-VKVLRDIEQGEEITCFYG 178
A +INH C+PNC A E+D + LRDIE GEE+ YG
Sbjct: 102 ARWINHACKPNCE--AREKDGRVFIHALRDIEAGEELFYDYG 141
>gi|348686237|gb|EGZ26052.1| hypothetical protein PHYSODRAFT_479714 [Phytophthora sojae]
Length = 170
Score = 38.9 bits (89), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 32/70 (45%), Gaps = 5/70 (7%)
Query: 131 AQLWLGPAAYINHDCRPNCRF---VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
AQ G ++NH C P F R T V D+ QG+EIT YG+D + C
Sbjct: 95 AQHLGGLMRFVNHSCEPVATFHEVANGRRTTVVVATTEDVLQGQEITVDYGDDLW--FVC 152
Query: 188 LCECDTCEIR 197
C ++C R
Sbjct: 153 RCGSESCRHR 162
>gi|195130539|ref|XP_002009709.1| GI15088 [Drosophila mojavensis]
gi|193908159|gb|EDW07026.1| GI15088 [Drosophila mojavensis]
Length = 492
Score = 38.9 bits (89), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 37/76 (48%), Gaps = 12/76 (15%)
Query: 139 AYINHDCRPNCRFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGD----------KNC 187
A +NHDC PN + E+ + V+ DI +GEEIT Y + F G+ K
Sbjct: 214 AVMNHDCIPNSYYTFEEKTNNMIVRAAVDIAEGEEITTTYTKLFTGNIARHLFLKMKKGF 273
Query: 188 LCECDTC-EIRMKGAF 202
C+C C + KG+F
Sbjct: 274 TCKCPRCLDPTEKGSF 289
>gi|398392173|ref|XP_003849546.1| hypothetical protein MYCGRDRAFT_48051 [Zymoseptoria tritici IPO323]
gi|339469423|gb|EGP84522.1| hypothetical protein MYCGRDRAFT_48051 [Zymoseptoria tritici IPO323]
Length = 633
Score = 38.9 bits (89), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 36/80 (45%), Gaps = 15/80 (18%)
Query: 127 RKNCAQLWLGPAAYINHDCRPNCR--FVATERDTACVKVLRDIEQGEEITCFY--GEDFF 182
RK LWL AAYINH C PN F+ D V+ R+I G EIT Y D+
Sbjct: 481 RKASTGLWL-MAAYINHSCLPNAEKSFLG---DLMVVRATRNITAGSEITHSYDSSSDYD 536
Query: 183 GDKNCL-------CECDTCE 195
+ L C C+ C+
Sbjct: 537 ARQEALMKTWGFRCRCELCK 556
>gi|2358285|gb|AAC51734.1| ALR [Homo sapiens]
Length = 5262
Score = 38.5 bits (88), Expect = 2.9, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 5188 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHE 5247
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 5248 IPCHCGAWNCR 5258
>gi|432881031|ref|XP_004073771.1| PREDICTED: histone-lysine N-methyltransferase ASH1L-like [Oryzias
latipes]
Length = 2798
Score = 38.5 bits (88), Expect = 2.9, Method: Composition-based stats.
Identities = 34/141 (24%), Positives = 61/141 (43%), Gaps = 9/141 (6%)
Query: 138 AAYINHDCRPNCRFVATERDTAC---VKVLRDIEQGEEITCFYGEDFFGDKN---CLCEC 191
A +INH C PNC + + L+D+ G E+T Y F + C C
Sbjct: 2027 ARFINHSCDPNCEMQKWSVNGVYRIGLFALKDVSSGTELTYDYNFHSFNTEEQQACKCGS 2086
Query: 192 DTCEIRMKGAFAQSSVEPQTTQPSRYCLRDTDMRLNKRKLHKKLNRLLLASDKNDTNSSD 251
++C + G + + P T +R R + R +K + K+++ + + +SSD
Sbjct: 2087 ESCRGIIGGKSQRINGLPGKTGGARRLGRLKEKRKSKHQFKKRVSG---SCSDAEEDSSD 2143
Query: 252 NSSKYSLVQWNNNSKENIFDL 272
N+ Y + +++E F L
Sbjct: 2144 NNKFYPHLMKPMSNRERNFVL 2164
>gi|119602957|gb|EAW82551.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_d [Homo sapiens]
Length = 742
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R K + + VG EL EEE R+ H ND + Y + ++
Sbjct: 449 GKGWGLVAKRDIRKGEFVNEYVG---ELIDEEECMARIKHAHENDITHFYMLTIDKDRII 505
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 506 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 565
Query: 187 CLCEC 191
+C C
Sbjct: 566 TVCRC 570
>gi|83749539|ref|ZP_00946526.1| Zinc finger protein [Ralstonia solanacearum UW551]
gi|207744855|ref|YP_002261247.1| set domain protein [Ralstonia solanacearum IPO1609]
gi|421896098|ref|ZP_16326497.1| set domain protein [Ralstonia solanacearum MolK2]
gi|83723775|gb|EAP70966.1| Zinc finger protein [Ralstonia solanacearum UW551]
gi|206587263|emb|CAQ17847.1| set domain protein [Ralstonia solanacearum MolK2]
gi|206596265|emb|CAQ63192.1| set domain protein [Ralstonia solanacearum IPO1609]
Length = 188
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 26/42 (61%), Gaps = 3/42 (7%)
Query: 138 AAYINHDCRPNCRFVATERDTAC-VKVLRDIEQGEEITCFYG 178
A +INH C+PNC A E+D + LRDIE GEE+ YG
Sbjct: 111 ARWINHACKPNCE--AREKDGRVFIHALRDIEAGEELFYDYG 150
>gi|402080897|gb|EJT76042.1| histone-lysine N-methyltransferase [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 338
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 34/74 (45%), Gaps = 15/74 (20%)
Query: 136 GPAAYINHDCRPNCRFVATERDTA-------CVKVLRDIEQGEEITCFYGEDFFGDK--- 185
GP+ +INH C PN R A D A + +RDI +GEE+T Y + DK
Sbjct: 260 GPSRFINHSCDPNMRIFARVGDHADKHMHDLALFAIRDIPKGEELTFDYVDGVDIDKASK 319
Query: 186 -----NCLCECDTC 194
CLC + C
Sbjct: 320 TDGHTKCLCGSNKC 333
>gi|344168416|emb|CCA80699.1| putative histone-lysine N-methyltransferase fragment [blood disease
bacterium R229]
Length = 178
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 26/42 (61%), Gaps = 3/42 (7%)
Query: 138 AAYINHDCRPNCRFVATERDTAC-VKVLRDIEQGEEITCFYG 178
A +INH C+PNC A E+D + LRDIE GEE+ YG
Sbjct: 101 ARWINHACKPNCE--AREKDGRVFIHALRDIEAGEELFYDYG 140
>gi|2358287|gb|AAC51735.1| ALR [Homo sapiens]
Length = 4957
Score = 38.5 bits (88), Expect = 3.0, Method: Composition-based stats.
Identities = 28/71 (39%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 4883 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHE 4942
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 4943 IPCHCGAWNCR 4953
>gi|156101223|ref|XP_001616305.1| SET domain containing protein [Plasmodium vivax Sal-1]
gi|148805179|gb|EDL46578.1| SET domain containing protein [Plasmodium vivax]
Length = 6587
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 38/75 (50%), Gaps = 7/75 (9%)
Query: 131 AQLWLGPAAYINHDCRPN--CRFVATERDTACVKVL--RDIEQGEEITCFY--GEDFFGD 184
A W + +INH C PN C+ V+ +++ + + RDI EEIT Y G + G
Sbjct: 6512 ATKWGNVSRFINHSCEPNCFCKIVSCDQNLKHIVIFAKRDIVAHEEITYDYQFGVESEGK 6571
Query: 185 K-NCLCECDTCEIRM 198
K CLC TC RM
Sbjct: 6572 KLICLCGSSTCLGRM 6586
>gi|290992999|ref|XP_002679121.1| set domain-containing protein [Naegleria gruberi]
gi|284092736|gb|EFC46377.1| set domain-containing protein [Naegleria gruberi]
Length = 153
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 31/63 (49%), Gaps = 3/63 (4%)
Query: 138 AAYINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTC 194
A +INH C PN ++ + V LRDIE+GEE+T Y + +G C C +
Sbjct: 79 ARFINHSCDPNSVLQKWTVGHQSRIGVFALRDIEKGEEVTFDYAMECYGVSFQKCYCGSA 138
Query: 195 EIR 197
R
Sbjct: 139 NCR 141
>gi|157278865|gb|AAI15212.1| Whsc1 protein [Danio rerio]
Length = 486
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 56/126 (44%), Gaps = 13/126 (10%)
Query: 76 GNRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQL 133
+G +++ R +K + + VG EL EEE R+ H ND + Y + ++
Sbjct: 187 AGKGWGLISLRDIKKGEFVNEYVG---ELIDEEECRSRIRHAQENDITHFYMLTIDKDRI 243
Query: 134 W-LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK 185
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 244 IDAGPKGNYSRFMNHSCQPNCETQKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNE 303
Query: 186 NCLCEC 191
+C C
Sbjct: 304 KTVCRC 309
>gi|70950103|ref|XP_744403.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56524343|emb|CAH80265.1| SET-domain protein, putative [Plasmodium chabaudi chabaudi]
Length = 870
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 39/75 (52%), Gaps = 7/75 (9%)
Query: 131 AQLWLGPAAYINHDCRPN--CRFVATERDTACVKVL--RDIEQGEEITCFY--GEDFFGD 184
A W + +INH C PN C+ V+ +++ + + +DI EEIT Y G + G+
Sbjct: 795 ATKWGNVSRFINHSCEPNCFCKIVSCDQNLKHIVIFAKKDILPHEEITYDYQFGVESEGE 854
Query: 185 K-NCLCECDTCEIRM 198
K CLC TC RM
Sbjct: 855 KLICLCGSSTCLGRM 869
>gi|348680817|gb|EGZ20633.1| hypothetical protein PHYSODRAFT_488582 [Phytophthora sojae]
Length = 170
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 32/65 (49%), Gaps = 5/65 (7%)
Query: 136 GPAAYINHDCRPNCRFVAT---ERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECD 192
G ++NH C+P +FV R T V +DI +G+E+T Y +D + C C D
Sbjct: 100 GLMRFVNHSCQPVAKFVEVANGRRTTVVVASTQDIHRGDEVTVDYSDDLW--FVCRCGLD 157
Query: 193 TCEIR 197
C R
Sbjct: 158 GCRHR 162
>gi|294463315|gb|ADE77193.1| unknown [Picea sitchensis]
Length = 288
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 28/47 (59%), Gaps = 1/47 (2%)
Query: 131 AQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
+ +++ P+ Y NHDC PN + E A +K LR+IE GEE+ Y
Sbjct: 212 SAVYMLPSMY-NHDCDPNAHILWIENAKARLKALRNIEPGEELRICY 257
>gi|121709862|ref|XP_001272547.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119400697|gb|EAW11121.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 1232
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 53/117 (45%), Gaps = 13/117 (11%)
Query: 91 NDNIAYLVG--CIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYINHDC 145
ND I VG ++ E++ L +G+ S ++ +N A G A +INH C
Sbjct: 1113 NDMIIEYVGEKVRQQVADMRERQYLKSGIGS-SYLFRIDENTVIDATKRGGIARFINHSC 1171
Query: 146 RPNCRFVATERDTA---CVKVLRDIEQGEEITCFYGEDFFGDKN----CLCECDTCE 195
PNC + D + + LRDIE+ EE+T Y + D + CLC C+
Sbjct: 1172 TPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGCK 1228
>gi|116195712|ref|XP_001223668.1| hypothetical protein CHGG_04454 [Chaetomium globosum CBS 148.51]
gi|88180367|gb|EAQ87835.1| hypothetical protein CHGG_04454 [Chaetomium globosum CBS 148.51]
Length = 820
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 26/49 (53%), Gaps = 7/49 (14%)
Query: 136 GPAAYINHDCRPNCRFVATERDTA-------CVKVLRDIEQGEEITCFY 177
GP +INH C+PN R A D A + +RDI +GEE+T Y
Sbjct: 737 GPTRFINHSCQPNMRIFARVGDHADKHIHDLALFAIRDIPEGEELTFDY 785
>gi|156056575|ref|XP_001594211.1| hypothetical protein SS1G_04018 [Sclerotinia sclerotiorum 1980]
gi|154701804|gb|EDO01543.1| hypothetical protein SS1G_04018 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 763
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 29/64 (45%), Gaps = 7/64 (10%)
Query: 138 AAYINHDCRPNCRFVA----TERDTACVKVLRDIEQGEEITCFYGEDFFGDKN---CLCE 190
A ++NH CRPNCR V + A I G+E+T Y D F KN C C
Sbjct: 516 ARFVNHSCRPNCRMVKWIVEGKPRMALFAGDNPIMTGDELTYDYNFDPFSAKNVQACRCG 575
Query: 191 CDTC 194
D C
Sbjct: 576 SDNC 579
>gi|336272855|ref|XP_003351183.1| hypothetical protein SMAC_08199 [Sordaria macrospora k-hell]
gi|380087872|emb|CCC14032.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 320
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 35/70 (50%), Gaps = 11/70 (15%)
Query: 138 AAYINHDCRPNCRFVATERDT-ACVKVLRDIEQGEEITCFYG---------EDFFGDKNC 187
A+ INH C+PN + ER + + +RDI++GEEIT Y + F DK
Sbjct: 112 ASRINHACKPNTQNSWNERISRETIHAVRDIKKGEEITISYMGHFASYDERQAFLKDKFK 171
Query: 188 L-CECDTCEI 196
C C+ C +
Sbjct: 172 FDCACEVCSL 181
>gi|118379402|ref|XP_001022867.1| conserved hypothetical protein [Tetrahymena thermophila]
gi|89304634|gb|EAS02622.1| conserved hypothetical protein [Tetrahymena thermophila SB210]
Length = 418
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 32/65 (49%), Gaps = 10/65 (15%)
Query: 140 YINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGE-----DF---FGDKN--CLC 189
Y+NH C PN V + V +RDIEQGEEI Y + DF F +N LC
Sbjct: 162 YMNHSCTPNVICVFNKLPQVRVIAIRDIEQGEEIMNSYIDTKKDLDFRRRFLKQNYFFLC 221
Query: 190 ECDTC 194
EC C
Sbjct: 222 ECKRC 226
>gi|380797995|gb|AFE70873.1| putative histone-lysine N-methyltransferase NSD2 isoform 1, partial
[Macaca mulatta]
Length = 421
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R K + + VG EL EEE R+ H ND + Y + ++
Sbjct: 128 GKGWGLVAKRDIRKGEFVNEYVG---ELIDEEECMARIKHAHENDITHFYMLTIDKDRII 184
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 185 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 244
Query: 187 CLCEC 191
+C C
Sbjct: 245 TVCRC 249
>gi|24639197|ref|NP_525040.2| trithorax-related, isoform C [Drosophila melanogaster]
gi|22831528|gb|AAF45684.2| trithorax-related, isoform C [Drosophila melanogaster]
Length = 2410
Score = 38.5 bits (88), Expect = 3.3, Method: Composition-based stats.
Identities = 35/126 (27%), Positives = 52/126 (41%), Gaps = 9/126 (7%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ I +G + E + + N M+ ++ A L
Sbjct: 2280 QGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQYESKNRGIYMFRLDEDRVVDATLS 2339
Query: 135 LGPAAYINHDCRPNC--RFVATERDTACVKVL-RDIEQGEEITCFYGEDFFGDKN---CL 188
G A YINH C PNC V +RD + R I +GEE++ Y D + + C
Sbjct: 2340 GGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDESHKIPCA 2399
Query: 189 CECDTC 194
C C
Sbjct: 2400 CGAPNC 2405
>gi|28571451|ref|NP_726773.2| trithorax-related, isoform D [Drosophila melanogaster]
gi|74865454|sp|Q8IRW8.2|TRR_DROME RecName: Full=Histone-lysine N-methyltransferase trr; AltName:
Full=Lysine N-methyltransferase 2C; AltName:
Full=Trithorax-related protein
gi|28381556|gb|AAN09063.2| trithorax-related, isoform D [Drosophila melanogaster]
Length = 2431
Score = 38.5 bits (88), Expect = 3.3, Method: Composition-based stats.
Identities = 35/126 (27%), Positives = 52/126 (41%), Gaps = 9/126 (7%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ I +G + E + + N M+ ++ A L
Sbjct: 2301 QGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQYESKNRGIYMFRLDEDRVVDATLS 2360
Query: 135 LGPAAYINHDCRPNC--RFVATERDTACVKVL-RDIEQGEEITCFYGEDFFGDKN---CL 188
G A YINH C PNC V +RD + R I +GEE++ Y D + + C
Sbjct: 2361 GGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDESHKIPCA 2420
Query: 189 CECDTC 194
C C
Sbjct: 2421 CGAPNC 2426
>gi|406697594|gb|EKD00852.1| histone-lysine N-methyltransferase [Trichosporon asahii var. asahii
CBS 8904]
Length = 843
Score = 38.5 bits (88), Expect = 3.3, Method: Composition-based stats.
Identities = 19/56 (33%), Positives = 29/56 (51%), Gaps = 3/56 (5%)
Query: 136 GPAAYINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCL 188
G + NH C PNC ++V R + RD+++ EEIT Y + G+ NC+
Sbjct: 312 GIGRFANHSCNPNCEVQKWVVGRRMRMGIFTKRDVKKDEEITFNYNICYCGEPNCV 367
>gi|302141761|emb|CBI18964.3| unnamed protein product [Vitis vinifera]
Length = 1958
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 41/80 (51%), Gaps = 14/80 (17%)
Query: 130 CAQLWLGPAAYINHDCRPNCRFVATER----DTACVKV--LRDIEQGEEITCFYG-EDFF 182
CA+ LG +INH C PNCR TE+ C+ + LRDI++GEE+T Y F
Sbjct: 1333 CAKGNLG--RFINHSCDPNCR---TEKWMVNGEICIGLFALRDIKKGEEVTFDYNYVRVF 1387
Query: 183 G--DKNCLCECDTCEIRMKG 200
G K C+C C + G
Sbjct: 1388 GAAAKKCVCGSPQCRGYIGG 1407
>gi|194912727|ref|XP_001982563.1| GG12672 [Drosophila erecta]
gi|190648239|gb|EDV45532.1| GG12672 [Drosophila erecta]
Length = 2406
Score = 38.5 bits (88), Expect = 3.3, Method: Composition-based stats.
Identities = 35/126 (27%), Positives = 52/126 (41%), Gaps = 9/126 (7%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ I +G + E + + N M+ ++ A L
Sbjct: 2276 QGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQYESKNRGIYMFRLDEDRVVDATLS 2335
Query: 135 LGPAAYINHDCRPNC--RFVATERDTACVKVL-RDIEQGEEITCFYGEDFFGDKN---CL 188
G A YINH C PNC V +RD + R I +GEE++ Y D + + C
Sbjct: 2336 GGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDESHKIPCA 2395
Query: 189 CECDTC 194
C C
Sbjct: 2396 CGAPNC 2401
>gi|3256105|emb|CAA15944.1| EG:63B12.3 [Drosophila melanogaster]
Length = 2422
Score = 38.5 bits (88), Expect = 3.3, Method: Composition-based stats.
Identities = 35/126 (27%), Positives = 52/126 (41%), Gaps = 9/126 (7%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A R EK+ I +G + E + + N M+ ++ A L
Sbjct: 2292 QGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQYESKNRGIYMFRLDEDRVVDATLS 2351
Query: 135 LGPAAYINHDCRPNC--RFVATERDTACVKVL-RDIEQGEEITCFYGEDFFGDKN---CL 188
G A YINH C PNC V +RD + R I +GEE++ Y D + + C
Sbjct: 2352 GGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDESHKIPCA 2411
Query: 189 CECDTC 194
C C
Sbjct: 2412 CGAPNC 2417
>gi|126332220|ref|XP_001374612.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2
[Monodelphis domestica]
Length = 1366
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 56/125 (44%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R +K + + VG EL EEE R+ + ND + Y + ++
Sbjct: 1073 GKGWGLVAKRDIKKGEFVNEYVG---ELIDEEECMARIKYAHENDITHFYMLTIDKDRII 1129
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 1130 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 1189
Query: 187 CLCEC 191
+C C
Sbjct: 1190 TVCRC 1194
>gi|347963044|ref|XP_566389.3| AGAP000041-PA [Anopheles gambiae str. PEST]
gi|333467386|gb|EAL41307.3| AGAP000041-PA [Anopheles gambiae str. PEST]
Length = 522
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 10/68 (14%)
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFG----------DKNC 187
A +NH CRPN R+V V R I++GE+I Y + +G KN
Sbjct: 258 GALMNHCCRPNVRYVFDGELRMRVHASRPIKKGEQIMNNYSKILWGSQHRIIHLCFSKNF 317
Query: 188 LCECDTCE 195
LC CD C+
Sbjct: 318 LCCCDRCK 325
>gi|118394814|ref|XP_001029767.1| SET domain containing protein [Tetrahymena thermophila]
gi|89284034|gb|EAR82104.1| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 2437
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 51/119 (42%), Gaps = 9/119 (7%)
Query: 85 TRKWEKNDNIA--YLVGCIAE-LTKEEEKRLLHTGLNDFSVMYSCRKNC--AQLWLGPAA 139
+ + K D+I YL I + L EK G D + +C A A
Sbjct: 2314 AKTYFKQDDIVVEYLGETIRQVLADYREKIYKQRGFGDCYMFKACPDKIIDATFKGNEAR 2373
Query: 140 YINHDCRPNCR--FVATERDTA-CVKVLRDIEQGEEITCFYGEDFFGDK-NCLCECDTC 194
Y+NH C PNC + E+D+ + RDI+ GEE+T Y D +K NC C C
Sbjct: 2374 YLNHSCNPNCSSLVIEYEKDSKIIIYAKRDIKPGEELTYDYCFDIEEEKINCNCNDPNC 2432
>gi|47228685|emb|CAG07417.1| unnamed protein product [Tetraodon nigroviridis]
Length = 4301
Score = 38.5 bits (88), Expect = 3.4, Method: Composition-based stats.
Identities = 27/71 (38%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA Y+NH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 4227 ATLTGGPARYVNHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 4286
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 4287 IPCHCGAWNCR 4297
>gi|348684130|gb|EGZ23945.1| hypothetical protein PHYSODRAFT_413098 [Phytophthora sojae]
Length = 57
Score = 38.5 bits (88), Expect = 3.5, Method: Composition-based stats.
Identities = 19/58 (32%), Positives = 29/58 (50%), Gaps = 5/58 (8%)
Query: 140 YINHDCRPNCRFVATERD---TACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTC 194
++NH C NC+F + T V + DI G E+T YG++ + CLC + C
Sbjct: 2 FMNHSCEANCKFYEVQNRRFVTVVVVAMEDIGAGSEVTVDYGDELW--FTCLCGSEDC 57
>gi|367020696|ref|XP_003659633.1| hypothetical protein MYCTH_99686 [Myceliophthora thermophila ATCC
42464]
gi|347006900|gb|AEO54388.1| hypothetical protein MYCTH_99686 [Myceliophthora thermophila ATCC
42464]
Length = 573
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 25/39 (64%), Gaps = 1/39 (2%)
Query: 139 AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
A +NH C PN F+ ++ TA ++ R I++GEEIT Y
Sbjct: 237 AMVNHSCVPNA-FIGFDKRTAILRAERPIQEGEEITISY 274
>gi|452822785|gb|EME29801.1| myeloid/lymphoid or mixed-lineage leukemia protein 3 [Galdieria
sulphuraria]
Length = 969
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 23/66 (34%), Positives = 33/66 (50%), Gaps = 5/66 (7%)
Query: 138 AAYINHDCRPNCR--FVATERDTACVKVL--RDIEQGEEITCFYGEDFFGDK-NCLCECD 192
A +INH C PNCR + + D + + R+I GEE+T Y + FG+ C C
Sbjct: 903 ARFINHSCEPNCRSKIITVDGDKQVIGIFAKRNIAAGEELTYDYQFEEFGETIPCNCGAP 962
Query: 193 TCEIRM 198
C +M
Sbjct: 963 NCRGKM 968
>gi|340378403|ref|XP_003387717.1| PREDICTED: histone-lysine N-methyltransferase SETD2-like
[Amphimedon queenslandica]
Length = 862
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 29/60 (48%), Gaps = 5/60 (8%)
Query: 140 YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD--KNCLCECDTC 194
+INH C PNC ++ R LR I GEE+T Y FG+ + C C +TC
Sbjct: 199 FINHSCEPNCETQKWTVNGRLRVGFFALRHIPAGEELTFDYQFQRFGESVQKCYCGSETC 258
>gi|324505555|gb|ADY42386.1| Histone-lysine N-methyltransferase Mes-4 [Ascaris suum]
Length = 743
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 38/86 (44%), Gaps = 8/86 (9%)
Query: 116 GLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTAC---VKVLRDIEQGEE 172
G+ + Y +NC + + +NH C PNC+ A D V L+DI G+E
Sbjct: 513 GMQITNEFYVDARNCGNM----SRSVNHSCEPNCKVNAVTVDGVYRLKVSALKDIAAGDE 568
Query: 173 ITCFYGEDFF-GDKNCLCECDTCEIR 197
+T YG + + G C C T R
Sbjct: 569 LTYDYGTELWSGMVGMRCRCGTAGCR 594
>gi|428163995|gb|EKX33039.1| hypothetical protein GUITHDRAFT_120775 [Guillardia theta CCMP2712]
Length = 446
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 22/38 (57%), Gaps = 1/38 (2%)
Query: 141 INHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYG 178
+NHD PNCR + D C + RDIE GEE+T Y
Sbjct: 181 LNHDREPNCRLI-QHGDRICAEACRDIEAGEELTHPYA 217
>gi|348687184|gb|EGZ26998.1| hypothetical protein PHYSODRAFT_473675 [Phytophthora sojae]
Length = 170
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 27/46 (58%), Gaps = 3/46 (6%)
Query: 140 YINHDCRPNCRFVAT---ERDTACVKVLRDIEQGEEITCFYGEDFF 182
+++H C+P +FV R T V +DI +GEE+T YG+D +
Sbjct: 104 FVHHPCQPVAKFVEVANGRRTTVVVASTQDIHRGEEVTVDYGDDLW 149
>gi|440733254|ref|ZP_20913014.1| hypothetical protein A989_16703 [Xanthomonas translucens DAR61454]
gi|440363577|gb|ELQ00739.1| hypothetical protein A989_16703 [Xanthomonas translucens DAR61454]
Length = 155
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 26/48 (54%), Gaps = 7/48 (14%)
Query: 138 AAYINHDCRPNCRFVATE-------RDTACVKVLRDIEQGEEITCFYG 178
A +INH C PNC V E +D ++ LR I+ GEE+T YG
Sbjct: 75 ARWINHSCAPNCEAVIIEAEGDDRRQDKVVIETLRAIKPGEELTYNYG 122
>gi|402082242|gb|EJT77387.1| hypothetical protein GGTG_07299 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 403
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 30/60 (50%), Gaps = 11/60 (18%)
Query: 128 KNCAQLWLGP----------AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
+N LW+G AA INHDCRP+ + + T +RDI+ GEEI+ Y
Sbjct: 182 RNAFTLWIGARGKHMAVYPQAALINHDCRPSTTYRLSNL-THITTAVRDIQPGEEISLSY 240
>gi|301611266|ref|XP_002935167.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Xenopus
(Silurana) tropicalis]
Length = 6019
Score = 38.5 bits (88), Expect = 3.6, Method: Composition-based stats.
Identities = 27/71 (38%), Positives = 34/71 (47%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC + D + R I +GEE+T Y DF D++
Sbjct: 5945 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 6004
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 6005 IPCHCGAWNCR 6015
>gi|424792343|ref|ZP_18218581.1| hypothetical protein XTG29_01521 [Xanthomonas translucens pv.
graminis ART-Xtg29]
gi|433678359|ref|ZP_20510230.1| hypothetical protein BN444_02427 [Xanthomonas translucens pv.
translucens DSM 18974]
gi|422797114|gb|EKU25501.1| hypothetical protein XTG29_01521 [Xanthomonas translucens pv.
graminis ART-Xtg29]
gi|430816544|emb|CCP40704.1| hypothetical protein BN444_02427 [Xanthomonas translucens pv.
translucens DSM 18974]
Length = 155
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 26/48 (54%), Gaps = 7/48 (14%)
Query: 138 AAYINHDCRPNCRFVATE-------RDTACVKVLRDIEQGEEITCFYG 178
A +INH C PNC V E +D ++ LR I+ GEE+T YG
Sbjct: 75 ARWINHSCAPNCEAVIIEAEGDDRRQDKVVIETLRAIKPGEELTYNYG 122
>gi|384250973|gb|EIE24451.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 606
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 27/98 (27%), Positives = 40/98 (40%), Gaps = 24/98 (24%)
Query: 131 AQLWLGPA-AYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY----------GE 179
+ L L PA + +NH C PNC F + D V+ +R + GE++T Y
Sbjct: 320 SALGLYPALSMLNHSCLPNCVFASCGSDMH-VRAIRPVAAGEQLTVTYINIMEPRRIRAR 378
Query: 180 DFFGDKNCLCECDTCEIRMKGAFAQSSVEPQTTQPSRY 217
+ K+ C C+ C V P T P R+
Sbjct: 379 ELMDTKHFACACERC------------VSPLETHPDRF 404
>gi|407847593|gb|EKG03260.1| hypothetical protein TCSYLVIO_005710 [Trypanosoma cruzi]
Length = 1444
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 41/83 (49%), Gaps = 4/83 (4%)
Query: 136 GPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY---GEDFFGDKNCLCECD 192
G + +I H C+PN R +A E D+ LR IE GE++T Y D +C C D
Sbjct: 173 GGSEFIRHACQPNLR-LAIEGDSIHGIALRPIEGGEQLTYNYLCTEWDIAEPFHCACNTD 231
Query: 193 TCEIRMKGAFAQSSVEPQTTQPS 215
+C ++G + ++ E PS
Sbjct: 232 SCYRFIRGFYYLNAEEKALLFPS 254
>gi|406867181|gb|EKD20220.1| TPR domain protein [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 709
Score = 38.5 bits (88), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 23/40 (57%)
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
AA NHDCRPN R T+ + R+I +GEE+T Y
Sbjct: 500 AARFNHDCRPNVRRGFTKENCIAFVTAREITRGEELTISY 539
>gi|195438882|ref|XP_002067361.1| GK16377 [Drosophila willistoni]
gi|194163446|gb|EDW78347.1| GK16377 [Drosophila willistoni]
Length = 2510
Score = 38.5 bits (88), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 51/126 (40%), Gaps = 9/126 (7%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSC---RKNCAQLW 134
+G + A R EK+ I +G + E + + N M+ R A L
Sbjct: 2380 QGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQYEAKNRGIYMFRLDEDRVVDATLS 2439
Query: 135 LGPAAYINHDCRPNC--RFVATERDTACVK-VLRDIEQGEEITCFYGEDFFGDKN---CL 188
G A YINH C PNC V +RD + R I +GEE++ Y D + + C
Sbjct: 2440 GGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDESHKIPCA 2499
Query: 189 CECDTC 194
C C
Sbjct: 2500 CGAPNC 2505
>gi|34782989|gb|AAH39197.1| MLL2 protein, partial [Homo sapiens]
Length = 395
Score = 38.1 bits (87), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 53/121 (43%), Gaps = 7/121 (5%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLW 134
+G + A + EK+ + +G I + ++ N M+ A L
Sbjct: 265 QGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGIYMFRINNEHVIDATLT 324
Query: 135 LGPAAYINHDCRPNCRF-VAT--ERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCL-CE 190
GPA YINH C PNC V T + D + R I +GEE+T Y DF D++ + C
Sbjct: 325 GGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHKIPCH 384
Query: 191 C 191
C
Sbjct: 385 C 385
>gi|391863483|gb|EIT72791.1| histone H3 (Lys4) methyltransferase complex, subunit SET1
[Aspergillus oryzae 3.042]
Length = 1223
Score = 38.1 bits (87), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 53/117 (45%), Gaps = 13/117 (11%)
Query: 91 NDNIAYLVG--CIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYINHDC 145
ND I VG ++ E++ L +G+ S ++ +N A G A +INH C
Sbjct: 1104 NDMIIEYVGEKVRQQVADMRERQYLKSGIGS-SYLFRIDENTVIDATKRGGIARFINHSC 1162
Query: 146 RPNCRFVATERDTA---CVKVLRDIEQGEEITCFYGEDFFGDKN----CLCECDTCE 195
PNC + D + + LRDIE+ EE+T Y + D + CLC C+
Sbjct: 1163 TPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGCK 1219
>gi|299116669|emb|CBN74814.1| Whsc1 protein [Ectocarpus siliculosus]
Length = 1551
Score = 38.1 bits (87), Expect = 3.7, Method: Composition-based stats.
Identities = 25/88 (28%), Positives = 34/88 (38%), Gaps = 7/88 (7%)
Query: 136 GPAAYINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGD----KNCL 188
G A + NH C PNC ++ + RDI GEE+T Y D G + CL
Sbjct: 412 GEARFANHSCSPNCLMQKWSVLGETRVVLVAARDISVGEELTYNYQADTLGGFVERQKCL 471
Query: 189 CECDTCEIRMKGAFAQSSVEPQTTQPSR 216
C C + G E + S+
Sbjct: 472 CGEPQCSGFIGGELVAVKAEEWQRRASK 499
>gi|168027193|ref|XP_001766115.1| trithorax-like protein, histone-lysine N-methyltransferase
[Physcomitrella patens subsp. patens]
gi|162682758|gb|EDQ69174.1| trithorax-like protein, histone-lysine N-methyltransferase
[Physcomitrella patens subsp. patens]
Length = 514
Score = 38.1 bits (87), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 39/90 (43%), Gaps = 9/90 (10%)
Query: 138 AAYINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTC 194
A INH C PNC A+ D + RD+E GEE+T Y F L TC
Sbjct: 419 AHLINHSCEPNCYSRTVTASGEDRIIIFAKRDLEIGEELT--YDYRFMSKTEVL----TC 472
Query: 195 EIRMKGAFAQSSVEPQTTQPSRYCLRDTDM 224
G +V+ + P++ C+ +D+
Sbjct: 473 YCGSAGCRGSVNVQDEDGDPTKLCIPLSDL 502
>gi|440484183|gb|ELQ64299.1| hypothetical protein OOW_P131scaffold00655g1, partial [Magnaporthe
oryzae P131]
Length = 668
Score = 38.1 bits (87), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 25/41 (60%), Gaps = 7/41 (17%)
Query: 141 INHDCRPNCRFVATERDTAC----VKVLRDIEQGEEITCFY 177
+NH+CRPN + E DT V LRDIEQGEE+T Y
Sbjct: 520 VNHNCRPNVYW---EYDTKTMAQEVVALRDIEQGEELTHSY 557
>gi|384499018|gb|EIE89509.1| hypothetical protein RO3G_14220 [Rhizopus delemar RA 99-880]
Length = 962
Score = 38.1 bits (87), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 42/87 (48%), Gaps = 6/87 (6%)
Query: 138 AAYINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECD-- 192
A +INH C PNC ++V + + R I+ GEE+T Y + +G + +C C
Sbjct: 240 ARFINHSCNPNCVTQKWVVGKNMRIGIFTNRGIKAGEELTFDYKFERYGAQAQVCYCGEF 299
Query: 193 TCEIRMKGAFAQSSVEPQTTQPSRYCL 219
C+ + G+F S P+T S L
Sbjct: 300 ACKGFIGGSFKNSD-SPRTKTVSSKIL 325
>gi|209880293|ref|XP_002141586.1| SET domain-containing protein [Cryptosporidium muris RN66]
gi|209557192|gb|EEA07237.1| SET domain-containing protein [Cryptosporidium muris RN66]
Length = 517
Score = 38.1 bits (87), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 64/152 (42%), Gaps = 16/152 (10%)
Query: 98 VGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYINHDCRPNCR---- 150
VG I L + EE++ + + L ++S A A +INH C PN
Sbjct: 147 VGEIINLKQLEERKKMTSKLTKHIYVFSLGNETYIDATYKGNLARFINHSCEPNLVAQKW 206
Query: 151 FVATERDTACVKVLRDIEQGEEITCFY--GEDFFGDK--NCLCECDTCEIRMKGAFAQSS 206
FV ++ L+DI+ G+E+T Y G GD+ C+C CE R+ S
Sbjct: 207 FVGSDIKVGLFS-LKDIKAGDELTFDYRFGTSISGDQPFECMCGSKLCEKRISNT-KDSR 264
Query: 207 VEPQTTQPSRYCLRDTDMRLNKRKLHKKLNRL 238
+ + T + + + LN + L K + R
Sbjct: 265 IYNEITNQYKMTISNF---LNVKSLSKPMQRF 293
>gi|407408342|gb|EKF31820.1| hypothetical protein MOQ_004336 [Trypanosoma cruzi marinkellei]
Length = 1444
Score = 38.1 bits (87), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 4/68 (5%)
Query: 136 GPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY---GEDFFGDKNCLCECD 192
G + +I H C+PN R +A E D+ LR IE+GE++T Y D +C C D
Sbjct: 173 GGSEFIRHACQPNLR-LAIEGDSIHGIALRPIEEGEQLTYNYLCTEWDIAKPFHCACNTD 231
Query: 193 TCEIRMKG 200
+C ++G
Sbjct: 232 SCYRFIRG 239
>gi|396469794|ref|XP_003838493.1| hypothetical protein LEMA_P114290.1 [Leptosphaeria maculans JN3]
gi|312215061|emb|CBX95014.1| hypothetical protein LEMA_P114290.1 [Leptosphaeria maculans JN3]
Length = 880
Score = 38.1 bits (87), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 41/140 (29%), Positives = 59/140 (42%), Gaps = 17/140 (12%)
Query: 70 YRYSLE----GNRGARIVATRKWEKNDNIAYLVGCIAELTKEE-EKRLLHTGLND--FSV 122
Y Y +E +RG + A R +E + I G I +T+ E E+R+ D + +
Sbjct: 427 YDYGVEVVDTHDRGFGVRAMRTFEPHQIIVEYAGEI--ITQWECERRMKQVYKKDKCYYL 484
Query: 123 MYSCRKNCAQLWLGPAA-YINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFY 177
M K G A ++NH C PNC V E A R + G+E+T Y
Sbjct: 485 MSFDNKMIIDATRGTIARFVNHSCEPNCEMIKWTVGGEPRMALFAGSRGVMTGDELTYDY 544
Query: 178 GEDFFGDKN---CLCECDTC 194
D F KN C C ++C
Sbjct: 545 NFDPFSQKNIQVCRCGTESC 564
>gi|37590100|gb|AAH58659.1| Mll2 protein [Mus musculus]
Length = 1250
Score = 38.1 bits (87), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 35/65 (53%), Gaps = 4/65 (6%)
Query: 131 AQLWLGPAAYINHDCRPNCRF-VAT--ERDTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA YINH C PNC V T + D + R I +GEE+T Y DF D++
Sbjct: 1176 ATLTGGPARYINHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 1235
Query: 188 L-CEC 191
+ C C
Sbjct: 1236 IPCHC 1240
>gi|169769549|ref|XP_001819244.1| histone-lysine N-methyltransferase, H3 lysine-4 specific [Aspergillus
oryzae RIB40]
gi|121933328|sp|Q2UMH3.1|SET1_ASPOR RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific; AltName: Full=COMPASS component SET1; AltName:
Full=SET domain-containing protein 1
gi|83767103|dbj|BAE57242.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 1229
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 53/117 (45%), Gaps = 13/117 (11%)
Query: 91 NDNIAYLVG--CIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYINHDC 145
ND I VG ++ E++ L +G+ S ++ +N A G A +INH C
Sbjct: 1110 NDMIIEYVGEKVRQQVADMRERQYLKSGIGS-SYLFRIDENTVIDATKRGGIARFINHSC 1168
Query: 146 RPNCRFVATERDTA---CVKVLRDIEQGEEITCFYGEDFFGDKN----CLCECDTCE 195
PNC + D + + LRDIE+ EE+T Y + D + CLC C+
Sbjct: 1169 TPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREWDSDDRIPCLCGSTGCK 1225
>gi|303313714|ref|XP_003066866.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
gi|240106533|gb|EER24721.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
Length = 1271
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 53/117 (45%), Gaps = 13/117 (11%)
Query: 91 NDNIAYLVG--CIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYINHDC 145
ND I VG ++ E+R L +G+ S ++ +N A G A +INH C
Sbjct: 1152 NDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLFRIDENTVIDATKRGGIARFINHSC 1210
Query: 146 RPNCRFVATERDTA---CVKVLRDIEQGEEITCFYGEDFFGDKN----CLCECDTCE 195
PNC + D + + LRDI++ EE+T Y + D + CLC C+
Sbjct: 1211 TPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREWDSDDRIPCLCGSAGCK 1267
>gi|351704076|gb|EHB06995.1| Putative histone-lysine N-methyltransferase NSD2 [Heterocephalus
glaber]
Length = 1372
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 55/125 (44%), Gaps = 13/125 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW 134
+G +VA R K + + VG EL EEE R+ + ND + Y + ++
Sbjct: 1078 GKGWGLVAKRDIRKGEFVNEYVG---ELIDEEECMARIKYAHENDITHFYMLTIDKDRII 1134
Query: 135 -LGPAA----YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
GP ++NH C+PNC ++ + + DI G E+T Y D G++
Sbjct: 1135 DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYNLDCLGNEK 1194
Query: 187 CLCEC 191
+C C
Sbjct: 1195 TVCRC 1199
>gi|347840183|emb|CCD54755.1| hypothetical protein [Botryotinia fuckeliana]
Length = 806
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 29/64 (45%), Gaps = 7/64 (10%)
Query: 138 AAYINHDCRPNCRFVA----TERDTACVKVLRDIEQGEEITCFYGEDFFGDKN---CLCE 190
A ++NH CRPNCR V + A I G+E+T Y D F KN C C
Sbjct: 536 ARFVNHSCRPNCRMVKWIVEGKPRMALFAGDNPIMTGDELTYDYNFDPFSAKNVQACRCG 595
Query: 191 CDTC 194
D C
Sbjct: 596 SDNC 599
>gi|320032561|gb|EFW14513.1| histone-lysine N-methyltransferase [Coccidioides posadasii str.
Silveira]
Length = 1271
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 53/117 (45%), Gaps = 13/117 (11%)
Query: 91 NDNIAYLVG--CIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYINHDC 145
ND I VG ++ E+R L +G+ S ++ +N A G A +INH C
Sbjct: 1152 NDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLFRIDENTVIDATKRGGIARFINHSC 1210
Query: 146 RPNCRFVATERDTA---CVKVLRDIEQGEEITCFYGEDFFGDKN----CLCECDTCE 195
PNC + D + + LRDI++ EE+T Y + D + CLC C+
Sbjct: 1211 TPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREWDSDDRIPCLCGSAGCK 1267
>gi|321260843|ref|XP_003195141.1| histone-lysine N-methyltransferase [Cryptococcus gattii WM276]
gi|317461614|gb|ADV23354.1| Histone-lysine N-methyltransferase, putative [Cryptococcus gattii
WM276]
Length = 833
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 28/59 (47%), Gaps = 3/59 (5%)
Query: 136 GPAAYINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCEC 191
G + NH C PNC ++V R + RD+ +GEEIT Y D +G C C
Sbjct: 253 GIGRFANHSCNPNCEVQKWVVGRRLRMGIFTKRDVVKGEEITFNYNVDRYGHDAQTCYC 311
>gi|170036329|ref|XP_001846017.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167878894|gb|EDS42277.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 574
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 34/73 (46%), Gaps = 6/73 (8%)
Query: 134 WLGPAAYI-NHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCL---- 188
+L P A + NH C PN + V VLR I +GE++T YG ++ N
Sbjct: 421 FLSPIATVFNHSCDPNVLIIPRASRIVTV-VLRPIRKGEQLTSSYGPTWWDIPNLPLTFD 479
Query: 189 CECDTCEIRMKGA 201
C C C + KG+
Sbjct: 480 CRCVVCVLGKKGS 492
>gi|398392059|ref|XP_003849489.1| hypothetical protein MYCGRDRAFT_95879 [Zymoseptoria tritici IPO323]
gi|339469366|gb|EGP84465.1| hypothetical protein MYCGRDRAFT_95879 [Zymoseptoria tritici IPO323]
Length = 254
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 27/56 (48%), Gaps = 3/56 (5%)
Query: 140 YINHDCRPNCRFVAT---ERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECD 192
++N C+PN R A +R + R+I GEE+ YG +F LC CD
Sbjct: 176 FVNSSCKPNVRARAATVGKRHAILFQAARNIGPGEELRINYGGMYFQQAGLLCMCD 231
>gi|332028801|gb|EGI68830.1| Putative histone-lysine N-methyltransferase NSD2 [Acromyrmex
echinatior]
Length = 1304
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 36/82 (43%), Gaps = 7/82 (8%)
Query: 140 YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFY--GEDFFGDKNCLCECDTC 194
++NH C PNC ++ + LRDIE GEE+T Y D K CLC C
Sbjct: 1024 FMNHSCAPNCETQKWTVNGDTRIGLFALRDIESGEELTFNYNLASDGETRKACLCGAPNC 1083
Query: 195 E--IRMKGAFAQSSVEPQTTQP 214
I +K Q+ V + P
Sbjct: 1084 SGFIGLKAQKQQTPVTQKQQMP 1105
>gi|225685245|gb|EEH23529.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 756
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 56/129 (43%), Gaps = 13/129 (10%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLND---FSVMYSCRKNCAQL 133
+RG + + R + N I G I +T++E +R + T + + +MY +
Sbjct: 374 DRGYGVRSNRTFAPNQIIVEYTGEI--ITQKECERRMRTVYKNNECYYLMYFDQNMIIDA 431
Query: 134 WLGPAA-YINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN-- 186
G A ++NH C PNC VA + A I GEE+T Y D + KN
Sbjct: 432 TRGSIARFVNHSCEPNCEMEKWTVAGKPRMALFAGKNGITTGEELTYDYNFDPYSQKNVQ 491
Query: 187 -CLCECDTC 194
C C +TC
Sbjct: 492 ECRCGAETC 500
>gi|289664564|ref|ZP_06486145.1| nuclear protein SET [Xanthomonas campestris pv. vasculorum NCPPB
702]
Length = 155
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 26/48 (54%), Gaps = 7/48 (14%)
Query: 138 AAYINHDCRPNCRFVATE-------RDTACVKVLRDIEQGEEITCFYG 178
A +INH C PNC V E +D ++ RDI+ GEE+T YG
Sbjct: 75 ARWINHSCDPNCEAVIEEAEGDDRRKDKIFIEAKRDIKPGEELTYNYG 122
>gi|78048633|ref|YP_364808.1| hypothetical protein XCV3077 [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|78037063|emb|CAJ24807.1| conserved hypothetical protein containing SET domain [Xanthomonas
campestris pv. vesicatoria str. 85-10]
Length = 155
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 26/48 (54%), Gaps = 7/48 (14%)
Query: 138 AAYINHDCRPNCRFVATE-------RDTACVKVLRDIEQGEEITCFYG 178
A +INH C PNC V E +D ++ RDI+ GEE+T YG
Sbjct: 75 ARWINHSCDPNCEAVIEEAEGDDRRKDKIFIEAKRDIKPGEELTYNYG 122
>gi|418516642|ref|ZP_13082814.1| SET domain-containing protein [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
gi|418519556|ref|ZP_13085608.1| SET domain-containing protein [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410705000|gb|EKQ63479.1| SET domain-containing protein [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410706658|gb|EKQ65116.1| SET domain-containing protein [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
Length = 155
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 26/48 (54%), Gaps = 7/48 (14%)
Query: 138 AAYINHDCRPNCRFVATE-------RDTACVKVLRDIEQGEEITCFYG 178
A +INH C PNC V E +D ++ RDI+ GEE+T YG
Sbjct: 75 ARWINHSCDPNCEAVIEEAEGDDRRKDKIFIEAKRDIKPGEELTYNYG 122
>gi|308495530|ref|XP_003109953.1| CRE-SET-11 protein [Caenorhabditis remanei]
gi|308244790|gb|EFO88742.1| CRE-SET-11 protein [Caenorhabditis remanei]
Length = 277
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 21/67 (31%), Positives = 32/67 (47%), Gaps = 7/67 (10%)
Query: 140 YINHDCRPNCRFVATERDT-------ACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECD 192
+INH C PN + D C + I++GEE+T YGE ++ +K C C
Sbjct: 204 FINHSCSPNVKVANVSWDYDEIQLIHMCFYTDKLIKKGEELTIDYGEAWWANKKFACMCG 263
Query: 193 TCEIRMK 199
+ E R +
Sbjct: 264 SSECRYQ 270
>gi|91079710|ref|XP_969181.1| PREDICTED: similar to SET and MYND domain containing 4 [Tribolium
castaneum]
Length = 393
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 27/48 (56%), Gaps = 5/48 (10%)
Query: 138 AAYINHDCRPNCR--FVATERDTACVKVLRDIEQGEEITCFYGEDFFG 183
+ +NH CRPN F++ DT VK L DI Q EEI YG D+ G
Sbjct: 133 VSIMNHSCRPNVTNYFMS---DTIVVKALEDIAQNEEIFNCYGIDYRG 177
>gi|58581030|ref|YP_200046.1| hypothetical protein XOO1407 [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|188577994|ref|YP_001914923.1| nuclear protein SET [Xanthomonas oryzae pv. oryzae PXO99A]
gi|289671189|ref|ZP_06492264.1| nuclear protein SET [Xanthomonas campestris pv. musacearum NCPPB
4381]
gi|294626273|ref|ZP_06704876.1| nuclear protein SET [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 11122]
gi|294664778|ref|ZP_06730104.1| nuclear protein SET [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 10535]
gi|346725747|ref|YP_004852416.1| SET domain-containing protein [Xanthomonas axonopodis pv. citrumelo
F1]
gi|381172326|ref|ZP_09881456.1| SET domain protein [Xanthomonas citri pv. mangiferaeindicae LMG
941]
gi|384418465|ref|YP_005627825.1| nuclear protein SET [Xanthomonas oryzae pv. oryzicola BLS256]
gi|390989910|ref|ZP_10260203.1| SET domain protein [Xanthomonas axonopodis pv. punicae str. LMG
859]
gi|58425624|gb|AAW74661.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|188522446|gb|ACD60391.1| nuclear protein SET [Xanthomonas oryzae pv. oryzae PXO99A]
gi|292599419|gb|EFF43553.1| nuclear protein SET [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 11122]
gi|292605465|gb|EFF48790.1| nuclear protein SET [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 10535]
gi|346650494|gb|AEO43118.1| Protein containing SET domain [Xanthomonas axonopodis pv. citrumelo
F1]
gi|353461379|gb|AEQ95658.1| nuclear protein SET [Xanthomonas oryzae pv. oryzicola BLS256]
gi|372555354|emb|CCF67178.1| SET domain protein [Xanthomonas axonopodis pv. punicae str. LMG
859]
gi|380687170|emb|CCG37943.1| SET domain protein [Xanthomonas citri pv. mangiferaeindicae LMG
941]
Length = 155
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 26/48 (54%), Gaps = 7/48 (14%)
Query: 138 AAYINHDCRPNCRFVATE-------RDTACVKVLRDIEQGEEITCFYG 178
A +INH C PNC V E +D ++ RDI+ GEE+T YG
Sbjct: 75 ARWINHSCDPNCEAVIEEAEGDDRRKDKIFIEAKRDIKPGEELTYNYG 122
>gi|348521556|ref|XP_003448292.1| PREDICTED: histone-lysine N-methyltransferase MLL2-like [Oreochromis
niloticus]
Length = 4907
Score = 38.1 bits (87), Expect = 4.2, Method: Composition-based stats.
Identities = 27/71 (38%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA Y+NH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 4833 ATLTGGPARYVNHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 4892
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 4893 IPCHCGAWNCR 4903
>gi|307111585|gb|EFN59819.1| hypothetical protein CHLNCDRAFT_18588, partial [Chlorella
variabilis]
Length = 380
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 23/66 (34%), Positives = 31/66 (46%), Gaps = 5/66 (7%)
Query: 140 YINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDK--NCLCECDTC 194
+INH C PNC ++V + L DI G E+T Y + +GDK CLC C
Sbjct: 254 FINHSCEPNCETQKWVVHGELAIGLFTLEDISAGTELTFDYNFERYGDKPMKCLCGSKNC 313
Query: 195 EIRMKG 200
+ G
Sbjct: 314 RGVIGG 319
>gi|219109809|ref|XP_002176658.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411193|gb|EEC51121.1| predicted protein, partial [Phaeodactylum tricornutum CCAP 1055/1]
Length = 139
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 2/43 (4%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATERDTACVKVLRDIEQGE 171
A L GPA YINH+C PNC + V+ E + C+K + I Q E
Sbjct: 60 ATLKGGPARYINHNCTPNCFAKIVSGEPPSPCLKRVIIIAQQE 102
>gi|46117750|ref|XP_384827.1| hypothetical protein FG04651.1 [Gibberella zeae PH-1]
Length = 812
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 43/92 (46%), Gaps = 21/92 (22%)
Query: 125 SCRK----NCAQLWLGPAAYINHDCRPNCR--FVATERDTACVKVLRDIEQGEEITCFY- 177
SCR+ A +WL AA INH C NCR F+ D V+ RDI G E+T Y
Sbjct: 556 SCRERPLFTTAGVWLL-AARINHSCVGNCRRSFIG---DIQIVRAARDIPAGTELTFPYC 611
Query: 178 ----GEDFFGDKNCL------CECDTCEIRMK 199
E + +N L C+C+ C+ R K
Sbjct: 612 PTGDSETYQDVQNKLAKWGFTCDCELCKDRRK 643
>gi|407404730|gb|EKF30079.1| hypothetical protein MOQ_006118 [Trypanosoma cruzi marinkellei]
Length = 359
Score = 38.1 bits (87), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 38/90 (42%), Gaps = 15/90 (16%)
Query: 141 INHDCRPNCRFVATER--DTACVKVLRDIEQGEEITCFY----------GEDFFGDKNCL 188
NH C PN + V +E +T V LRDIE GEE+ Y ++ + +
Sbjct: 267 FNHSCEPNLQVVYSESGDETLSVVALRDIEPGEELCISYIDESLSYPERQQELYEHYLFV 326
Query: 189 CECDTCEIRMKGAFAQSSVEPQTTQPSRYC 218
C C CE + A A +EP S C
Sbjct: 327 CHCPKCE---REAAAWEPIEPPEEALSSAC 353
>gi|195059316|ref|XP_001995609.1| GH17848 [Drosophila grimshawi]
gi|193896395|gb|EDV95261.1| GH17848 [Drosophila grimshawi]
Length = 2535
Score = 38.1 bits (87), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 51/126 (40%), Gaps = 9/126 (7%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSC---RKNCAQLW 134
+G + A R EK+ I +G + E + + N M+ R A L
Sbjct: 2405 QGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQYESKNRGIYMFRLDEDRVVDATLS 2464
Query: 135 LGPAAYINHDCRPNC--RFVATERDTACVK-VLRDIEQGEEITCFYGEDFFGDKN---CL 188
G A YINH C PNC V +RD + R I +GEE++ Y D + + C
Sbjct: 2465 GGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDESHKIPCA 2524
Query: 189 CECDTC 194
C C
Sbjct: 2525 CGAPNC 2530
>gi|356533089|ref|XP_003535101.1| PREDICTED: histone-lysine N-methyltransferase ATXR4-like [Glycine
max]
Length = 313
Score = 38.1 bits (87), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 27/45 (60%), Gaps = 1/45 (2%)
Query: 133 LWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFY 177
++L P+ Y NHDC PN + + A +K LRDI +GEE+ Y
Sbjct: 239 VYLLPSFY-NHDCDPNAHIIWIDNADAKLKALRDIVEGEELRICY 282
>gi|326476825|gb|EGE00835.1| histone-lysine N-methyltransferase [Trichophyton tonsurans CBS
112818]
Length = 628
Score = 38.1 bits (87), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 59/136 (43%), Gaps = 27/136 (19%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHT--------GLNDFS---VMYS 125
+RG + + R +E N I G I LT+ E +R + T L DF ++ +
Sbjct: 352 SRGYGVRSNRTFEPNQIIVEYTGEI--LTQIEAQRRMKTIYKKNECFYLMDFDQDMIIDA 409
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDF 181
R + A+ ++NH C PNC+ VA + A I GEE+T Y D
Sbjct: 410 TRGSIAR-------FVNHSCEPNCKMEKWIVAGKPRIALFAGDNGIMTGEELTYDYNFDP 462
Query: 182 FGDKN---CLCECDTC 194
+ +KN C C +C
Sbjct: 463 YSNKNVQECRCGTPSC 478
>gi|269861803|ref|XP_002650585.1| hypothetical protein EBI_25893 [Enterocytozoon bieneusi H348]
gi|220065922|gb|EED43472.1| hypothetical protein EBI_25893 [Enterocytozoon bieneusi H348]
Length = 310
Score = 38.1 bits (87), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 50/100 (50%), Gaps = 12/100 (12%)
Query: 97 LVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYINHDCRPNCRFVA 153
++G I ++ + EK + G+N M++ K+ A L A Y+NH C+PNC ++
Sbjct: 207 IIGKI--MSDKREKNYIKNGINSI-YMFALGKDKIIDATLVGNQARYVNHSCKPNC--LS 261
Query: 154 TE-RDTACVKV--LRDIEQGEEITCFY-GEDFFGDKNCLC 189
E DT +K +R I EEIT Y + KNC C
Sbjct: 262 HECLDTNTIKYYSIRTIYPDEEITIDYCNVNIINPKNCNC 301
>gi|348676143|gb|EGZ15961.1| hypothetical protein PHYSODRAFT_334153 [Phytophthora sojae]
Length = 245
Score = 38.1 bits (87), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 21/61 (34%), Positives = 29/61 (47%), Gaps = 5/61 (8%)
Query: 140 YINHDCRPNCRFVATE---RDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTCEI 196
++ H C P +FV R T + DI QG+E+T YG+D + C C D C
Sbjct: 127 FVYHSCEPLTKFVEVSNGRRTTVVMASTEDIRQGQEMTVDYGDDLW--FICRCGSDRCRH 184
Query: 197 R 197
R
Sbjct: 185 R 185
>gi|294055775|ref|YP_003549433.1| nuclear protein SET [Coraliomargarita akajimensis DSM 45221]
gi|293615108|gb|ADE55263.1| nuclear protein SET [Coraliomargarita akajimensis DSM 45221]
Length = 180
Score = 38.1 bits (87), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 20/60 (33%), Positives = 30/60 (50%), Gaps = 3/60 (5%)
Query: 137 PAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDF--FGDKNCLCECDTC 194
PA Y+NH C NC + + + + +D+ +G+E+ YG D F D C C D C
Sbjct: 81 PARYMNHSCDGNCEAINYDGEI-WITARKDVRKGDELVYDYGYDMEHFMDHPCRCGADNC 139
>gi|348668507|gb|EGZ08331.1| hypothetical protein PHYSODRAFT_254134 [Phytophthora sojae]
Length = 320
Score = 38.1 bits (87), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 23/45 (51%), Gaps = 3/45 (6%)
Query: 140 YINHDCRPNCRFVATE---RDTACVKVLRDIEQGEEITCFYGEDF 181
++ H C P +FV R T V DI QG+E+T YG+D
Sbjct: 254 FVKHSCEPLAKFVEVSNGRRTTVVVASTEDIRQGQEVTVDYGDDL 298
>gi|348533938|ref|XP_003454461.1| PREDICTED: hypothetical protein LOC100700132 [Oreochromis niloticus]
Length = 2924
Score = 38.1 bits (87), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 42/140 (30%), Positives = 59/140 (42%), Gaps = 24/140 (17%)
Query: 70 YRYSLEGNRGARIVATRKWEKNDNIAYLVGCI--AELTKEEEKRLLHTGL-------NDF 120
YR ++ G RG + R E + + G + + LT + EK G+ +DF
Sbjct: 2789 YRSAIHG-RG--LFCKRNIEAGEMVIEYAGIVIRSVLTDKREKYYDGKGIGCYMFRIDDF 2845
Query: 121 SVMYSCRKNCAQLWLGPAAYINHDCRPNC--RFVATE-RDTACVKVLRDIEQGEEITCFY 177
V+ + A A +INH C PNC R + E R + LR I +GEE+T Y
Sbjct: 2846 DVVDATMHGNA------ARFINHSCEPNCYSRVINVEGRKHIVIFALRKIYRGEELTYDY 2899
Query: 178 G---EDFFGDKNCLCECDTC 194
ED NC C C
Sbjct: 2900 KFPIEDASNKLNCNCGARRC 2919
>gi|312089370|ref|XP_003146221.1| histone methyltransferase [Loa loa]
Length = 689
Score = 38.1 bits (87), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 34/77 (44%), Gaps = 9/77 (11%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVAT----ERDTACVKVLRDIEQGEEITCFYG---EDF 181
A + GPA YINH C PNC R V + + + R I GEE+T Y ED
Sbjct: 612 ATMAGGPARYINHSCDPNCSTRLVDSGPCGDDKKIIIIANRPISAGEELTYDYQFDIEDA 671
Query: 182 FGDKNCLCECDTCEIRM 198
CLC C+ M
Sbjct: 672 ADKIPCLCGAPNCQKWM 688
>gi|145347416|ref|XP_001418163.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578392|gb|ABO96456.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 639
Score = 38.1 bits (87), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 25/45 (55%), Gaps = 1/45 (2%)
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGEDFF 182
A+ NH PN + V + T VK LR+I GEEIT YGE +
Sbjct: 244 ASLFNHSSTPNAQ-VMFKGKTLVVKTLREIAVGEEITISYGEQYM 287
>gi|83286297|ref|XP_730100.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23489726|gb|EAA21665.1| similar to KIAA0304 gene product-related [Plasmodium yoelii yoelii]
Length = 1137
Score = 38.1 bits (87), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 39/75 (52%), Gaps = 7/75 (9%)
Query: 131 AQLWLGPAAYINHDCRPN--CRFVATERDTACVKVL--RDIEQGEEITCFY--GEDFFGD 184
A W + +INH C PN C+ V+ +++ + + +DI EEIT Y G + G+
Sbjct: 1062 ATKWGNVSRFINHSCEPNCFCKIVSCDQNLKHIVIFAKKDILPHEEITYDYQFGVESEGE 1121
Query: 185 K-NCLCECDTCEIRM 198
K CLC TC RM
Sbjct: 1122 KLICLCGSSTCLGRM 1136
>gi|432866237|ref|XP_004070753.1| PREDICTED: uncharacterized protein LOC101172242 [Oryzias latipes]
Length = 4897
Score = 38.1 bits (87), Expect = 4.5, Method: Composition-based stats.
Identities = 27/71 (38%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA Y+NH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 4823 ATLTGGPARYVNHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 4882
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 4883 IPCHCGAWNCR 4893
>gi|58269858|ref|XP_572085.1| histone-lysine N-methyltransferase [Cryptococcus neoformans var.
neoformans JEC21]
gi|338819405|sp|P0CO28.1|SET2_CRYNJ RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|57228321|gb|AAW44778.1| histone-lysine N-methyltransferase, putative [Cryptococcus
neoformans var. neoformans JEC21]
Length = 834
Score = 38.1 bits (87), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 28/59 (47%), Gaps = 3/59 (5%)
Query: 136 GPAAYINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCEC 191
G + NH C PNC ++V R + RD+ +GEEIT Y D +G C C
Sbjct: 253 GIGRFANHSCNPNCEVQKWVVGRRLRMGIFTKRDVIKGEEITFNYNVDRYGHDAQTCYC 311
>gi|195564658|ref|XP_002105931.1| GD16408 [Drosophila simulans]
gi|194203296|gb|EDX16872.1| GD16408 [Drosophila simulans]
Length = 1076
Score = 38.1 bits (87), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 51/126 (40%), Gaps = 9/126 (7%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSC---RKNCAQLW 134
+G + A R EK+ I +G + E + + N M+ R A L
Sbjct: 946 QGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQYESKNRGIYMFRLDEDRVVDATLS 1005
Query: 135 LGPAAYINHDCRPNC--RFVATERDTACVKVL-RDIEQGEEITCFYGEDFFGDKN---CL 188
G A YINH C PNC V +RD + R I +GEE++ Y D + + C
Sbjct: 1006 GGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDESHKIPCA 1065
Query: 189 CECDTC 194
C C
Sbjct: 1066 CGAPNC 1071
>gi|134113727|ref|XP_774448.1| hypothetical protein CNBG0940 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|338819404|sp|P0CO29.1|SET2_CRYNB RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36
specific; AltName: Full=SET domain-containing protein 2
gi|50257086|gb|EAL19801.1| hypothetical protein CNBG0940 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 834
Score = 38.1 bits (87), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 28/59 (47%), Gaps = 3/59 (5%)
Query: 136 GPAAYINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCEC 191
G + NH C PNC ++V R + RD+ +GEEIT Y D +G C C
Sbjct: 253 GIGRFANHSCNPNCEVQKWVVGRRLRMGIFTKRDVIKGEEITFNYNVDRYGHDAQTCYC 311
>gi|432909101|ref|XP_004078112.1| PREDICTED: uncharacterized protein LOC101174945 [Oryzias latipes]
Length = 3692
Score = 38.1 bits (87), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 42/140 (30%), Positives = 59/140 (42%), Gaps = 24/140 (17%)
Query: 70 YRYSLEGNRGARIVATRKWEKNDNIAYLVGCI--AELTKEEEKRLLHTGL-------NDF 120
YR ++ G RG + R E + + G + + LT + EK G+ +DF
Sbjct: 3557 YRSAIHG-RG--LFCKRNIEAGEMVIEYAGIVIRSVLTDKREKYYDGKGIGCYMFRIDDF 3613
Query: 121 SVMYSCRKNCAQLWLGPAAYINHDCRPNC--RFVATE-RDTACVKVLRDIEQGEEITCFY 177
V+ + A A +INH C PNC R + E R + LR I +GEE+T Y
Sbjct: 3614 DVVDATMHGNA------ARFINHSCEPNCYSRVINVEGRKHIVIFALRKIYRGEELTYDY 3667
Query: 178 G---EDFFGDKNCLCECDTC 194
ED NC C C
Sbjct: 3668 KFPIEDASNKLNCNCGARRC 3687
>gi|3540281|gb|AAC34383.1| All-1 related protein [Takifugu rubripes]
Length = 4823
Score = 38.1 bits (87), Expect = 4.6, Method: Composition-based stats.
Identities = 27/71 (38%), Positives = 36/71 (50%), Gaps = 4/71 (5%)
Query: 131 AQLWLGPAAYINHDCRPNC--RFVATER-DTACVKVLRDIEQGEEITCFYGEDFFGDKNC 187
A L GPA Y+NH C PNC V ++ D + R I +GEE+T Y DF D++
Sbjct: 4749 ATLTGGPARYVNHSCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFEDDQHK 4808
Query: 188 L-CECDTCEIR 197
+ C C R
Sbjct: 4809 IPCHCGAWNCR 4819
>gi|145540026|ref|XP_001455703.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124423511|emb|CAK88306.1| unnamed protein product [Paramecium tetraurelia]
Length = 1371
Score = 38.1 bits (87), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 33/66 (50%), Gaps = 11/66 (16%)
Query: 138 AAYINHDCRPNCRFVATER----DTACVKV--LRDIEQGEEITCFYGEDFFGD--KNCLC 189
A +INH C PNC TE+ C+ + +RDI + EE+T Y D F CLC
Sbjct: 203 ARFINHSCEPNC---ITEKWNVLGEVCIGIFAIRDINEDEELTFDYQFDVFHTPLTKCLC 259
Query: 190 ECDTCE 195
+ C+
Sbjct: 260 GANKCK 265
>gi|354548180|emb|CCE44916.1| hypothetical protein CPAR2_407180 [Candida parapsilosis]
Length = 810
Score = 38.1 bits (87), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 31/57 (54%), Gaps = 3/57 (5%)
Query: 138 AAYINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCEC 191
A ++NH C PN ++V ++ + R+I +GEEIT Y D +G ++ C C
Sbjct: 183 ARFVNHSCNPNAFVDKWVVGDKLRMGIFAKRNIAKGEEITFDYNVDRYGAQSQPCYC 239
>gi|332662681|ref|YP_004445469.1| nuclear protein SET [Haliscomenobacter hydrossis DSM 1100]
gi|332331495|gb|AEE48596.1| nuclear protein SET [Haliscomenobacter hydrossis DSM 1100]
Length = 132
Score = 38.1 bits (87), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 52/104 (50%), Gaps = 4/104 (3%)
Query: 79 GARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNCAQLWLGPA 138
G R V + + D++ + IA KE+ + T L+D+ ++ + + LG
Sbjct: 15 GGRGVFSTEPIAADSLVEICPVIA-CPKEDLPVIHKTFLHDYYFLWGENDDECAIALGFG 73
Query: 139 AYINHDCRPNCRFVATERDTACVKV--LRDIEQGEEITCFYGED 180
+ NH +PN R+ + + A +++ +RDIE GEEIT Y D
Sbjct: 74 SLYNHSRQPNARY-WCDPEAAIIEIWTIRDIEAGEEITVNYNGD 116
>gi|405121647|gb|AFR96415.1| histone-lysine N-methyltransferase [Cryptococcus neoformans var.
grubii H99]
Length = 836
Score = 38.1 bits (87), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 28/59 (47%), Gaps = 3/59 (5%)
Query: 136 GPAAYINHDCRPNC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCEC 191
G + NH C PNC ++V R + RD+ +GEEIT Y D +G C C
Sbjct: 253 GIGRFANHSCNPNCEVQKWVVGRRLRMGIFTKRDVIKGEEITFNYNVDRYGHDAQTCYC 311
>gi|403366787|gb|EJY83200.1| TPR repeat-containing protein [Oxytricha trifallax]
Length = 760
Score = 38.1 bits (87), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 54/122 (44%), Gaps = 25/122 (20%)
Query: 72 YSLEGNRGARIVATRKWEKNDN----------------IAYLVGCIAELTKEEEKRLLHT 115
Y G+ G+ V+ + EKN+ I L GC LT + E +L T
Sbjct: 487 YLFNGSNGSLQVSIQDLEKNNQHPQSFSQRITFFEMQKIIKLNGCNI-LTLQNELDMLKT 545
Query: 116 GLNDFSVMYSCRKNCAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITC 175
ND YS + LW+ +++ NHDC NC + D V RDI++GEEIT
Sbjct: 546 --ND----YSKLEFKDALWV-ISSFFNHDCYGNCSRYSI-GDVLFVVANRDIQEGEEITQ 597
Query: 176 FY 177
Y
Sbjct: 598 QY 599
>gi|116192273|ref|XP_001221949.1| hypothetical protein CHGG_05854 [Chaetomium globosum CBS 148.51]
gi|88181767|gb|EAQ89235.1| hypothetical protein CHGG_05854 [Chaetomium globosum CBS 148.51]
Length = 337
Score = 38.1 bits (87), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 24/35 (68%), Gaps = 3/35 (8%)
Query: 160 CVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTC 194
CV+ RDI +GE++T YG+ +K +CECDTC
Sbjct: 294 CVQARRDIREGEQLTWDYGKR---EKGFVCECDTC 325
>gi|18027368|gb|AAL55766.1|AF289582_1 unknown [Homo sapiens]
Length = 129
Score = 38.1 bits (87), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 20/39 (51%), Positives = 25/39 (64%), Gaps = 5/39 (12%)
Query: 54 HRYSYRIQYLPE----LKPCYRYSLEGNRGARIVATRKW 88
H Y Y +LPE + PC RYS+E N GA+IV+TR W
Sbjct: 92 HVYRYLRAFLPESGFTILPCTRYSMETN-GAKIVSTRAW 129
>gi|21243660|ref|NP_643242.1| hypothetical protein XAC2933 [Xanthomonas axonopodis pv. citri str.
306]
gi|84622947|ref|YP_450319.1| hypothetical protein XOO_1290 [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|325925545|ref|ZP_08186933.1| SET domain-containing protein [Xanthomonas perforans 91-118]
gi|21109238|gb|AAM37778.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
gi|84366887|dbj|BAE68045.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|325544049|gb|EGD15444.1| SET domain-containing protein [Xanthomonas perforans 91-118]
Length = 139
Score = 38.1 bits (87), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 26/48 (54%), Gaps = 7/48 (14%)
Query: 138 AAYINHDCRPNCRFVATE-------RDTACVKVLRDIEQGEEITCFYG 178
A +INH C PNC V E +D ++ RDI+ GEE+T YG
Sbjct: 59 ARWINHSCDPNCEAVIEEAEGDDRRKDKIFIEAKRDIKPGEELTYNYG 106
>gi|328874806|gb|EGG23171.1| hypothetical protein DFA_05303 [Dictyostelium fasciculatum]
Length = 473
Score = 38.1 bits (87), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 23/40 (57%), Gaps = 1/40 (2%)
Query: 135 LGPAAYINHDCRPNCRFVATE-RDTACVKVLRDIEQGEEI 173
G A INH C PNC V + A +K LRDI+ GEE+
Sbjct: 389 FGLQAMINHSCEPNCLVVFDNGSNFAHIKALRDIQAGEEL 428
>gi|327306067|ref|XP_003237725.1| hypothetical protein TERG_02440 [Trichophyton rubrum CBS 118892]
gi|326460723|gb|EGD86176.1| hypothetical protein TERG_02440 [Trichophyton rubrum CBS 118892]
Length = 711
Score = 37.7 bits (86), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 59/136 (43%), Gaps = 27/136 (19%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHT--------GLNDFS---VMYS 125
+RG + + R +E N I G I LT+ E +R + T L DF ++ +
Sbjct: 436 SRGYGVRSNRTFEPNQIIVEYTGEI--LTQIEAQRRMKTIYKKNECFYLMDFDQDMIIDA 493
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDF 181
R + A+ ++NH C PNC+ VA + A I GEE+T Y D
Sbjct: 494 TRGSIAR-------FVNHSCEPNCKMEKWIVAGKPRIALFAGDNGIMTGEELTYDYNFDP 546
Query: 182 FGDKN---CLCECDTC 194
+ +KN C C +C
Sbjct: 547 YSNKNVQECRCGTPSC 562
>gi|315047188|ref|XP_003172969.1| SUV39H [Arthroderma gypseum CBS 118893]
gi|311343355|gb|EFR02558.1| SUV39H [Arthroderma gypseum CBS 118893]
Length = 703
Score = 37.7 bits (86), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 59/136 (43%), Gaps = 27/136 (19%)
Query: 77 NRGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHT--------GLNDFS---VMYS 125
+RG + + R +E N I G I LT+ E +R + T L DF ++ +
Sbjct: 435 SRGYGVRSNRTFEPNQIIVEYTGEI--LTQIEAQRRMKTIYKKNECFYLMDFDQDMIIDA 492
Query: 126 CRKNCAQLWLGPAAYINHDCRPNCRF----VATERDTACVKVLRDIEQGEEITCFYGEDF 181
R + A+ ++NH C PNC+ VA + A I GEE+T Y D
Sbjct: 493 TRGSIAR-------FVNHSCEPNCKMEKWIVAGKPRIALFAGDNGIMTGEELTYDYNFDP 545
Query: 182 FGDKN---CLCECDTC 194
+ +KN C C +C
Sbjct: 546 YSNKNVQECRCGTPSC 561
>gi|281212379|gb|EFA86539.1| hypothetical protein PPL_00338 [Polysphondylium pallidum PN500]
Length = 468
Score = 37.7 bits (86), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 26/47 (55%), Gaps = 1/47 (2%)
Query: 135 LGPAAYINHDCRPNCRFVATERDT-ACVKVLRDIEQGEEITCFYGED 180
G A INH C PNC V + + A +K LRDI GEE+ Y E+
Sbjct: 377 FGLQAMINHSCVPNCLVVFEQGSSLAYIKALRDIVPGEELFHSYIEE 423
>gi|412985398|emb|CCO18844.1| predicted protein [Bathycoccus prasinos]
Length = 528
Score = 37.7 bits (86), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 30/67 (44%), Gaps = 13/67 (19%)
Query: 142 NHDCRPNCRFVATERDT--ACVKVLRD-IEQGEEITCFYGE----------DFFGDKNCL 188
NHDC PNC E D +CV V R I +GEE+T Y E D D +
Sbjct: 448 NHDCDPNCHPFKDETDINGSCVLVARKPIRKGEELTISYLEDDQLDWSRRQDALSDYGFV 507
Query: 189 CECDTCE 195
C C CE
Sbjct: 508 CRCARCE 514
>gi|108756776|ref|YP_633846.1| methyltransferase [Myxococcus xanthus DK 1622]
gi|108460656|gb|ABF85841.1| putative methyltransferase [Myxococcus xanthus DK 1622]
Length = 136
Score = 37.7 bits (86), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 26/53 (49%), Gaps = 1/53 (1%)
Query: 135 LGPAAYINHDCRPNCRF-VATERDTACVKVLRDIEQGEEITCFYGEDFFGDKN 186
LG NH PN + RD C LRDI GE++ YGED++ ++
Sbjct: 79 LGYGMIYNHAREPNTSYHREATRDVFCYHALRDIHPGEQLCISYGEDWWASRD 131
>gi|255543292|ref|XP_002512709.1| protein with unknown function [Ricinus communis]
gi|223548670|gb|EEF50161.1| protein with unknown function [Ricinus communis]
Length = 379
Score = 37.7 bits (86), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 35/73 (47%), Gaps = 13/73 (17%)
Query: 138 AAYINHDCRPN-CRF--VATERDTACVKVLRDIEQGEEITCFYGEDFFG----------D 184
AA NHDC PN CRF V T+ ++++ D+ QG EI Y + D
Sbjct: 214 AALFNHDCLPNACRFDYVDTQDTDLIIRMIHDVPQGREICLSYFPVNYDYSTRQKRLRED 273
Query: 185 KNCLCECDTCEIR 197
+C+CD C++
Sbjct: 274 YGFICDCDRCKVE 286
>gi|432879768|ref|XP_004073538.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Oryzias latipes]
Length = 2321
Score = 37.7 bits (86), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 51/107 (47%), Gaps = 10/107 (9%)
Query: 95 AYLVGCIAELTKEEE--KRLLHTGLNDFSVMYSCRKNCAQLW-LGP----AAYINHDCRP 147
A++ + E+ EEE R+ H +D Y + ++ GP A ++NH C+P
Sbjct: 1889 AFVSEYVGEVIDEEECRARIRHAQEHDICNFYMLTLDKDRVIDAGPKGNQARFMNHSCQP 1948
Query: 148 NC---RFVATERDTACVKVLRDIEQGEEITCFYGEDFFGDKNCLCEC 191
NC ++ + L+DI +GEE+T Y + G+ +C+C
Sbjct: 1949 NCETQKWTVNGDTRVGLFALQDIAKGEELTFNYNLECRGNGKTVCKC 1995
>gi|189200969|ref|XP_001936821.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187983920|gb|EDU49408.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 189
Score = 37.7 bits (86), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 20/65 (30%), Positives = 32/65 (49%), Gaps = 3/65 (4%)
Query: 140 YINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGE---DFFGDKNCLCECDTCEI 196
YINH C P+ F + + +++G+E+T FY D +CLC+ + C
Sbjct: 97 YINHSCAPSLVFDMARWEIRVSDKGQGLKEGDELTFFYPSTEWDMAQPFDCLCKDEVCVG 156
Query: 197 RMKGA 201
R+ GA
Sbjct: 157 RVGGA 161
>gi|325187666|emb|CCA22203.1| histonelysine Nmethyltransferase putative [Albugo laibachii Nc14]
Length = 1414
Score = 37.7 bits (86), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 27/58 (46%), Gaps = 3/58 (5%)
Query: 140 YINHDCRPNCRFV--ATERDT-ACVKVLRDIEQGEEITCFYGEDFFGDKNCLCECDTC 194
++NH C PNC+ E DT V LR IE G E+T Y G + C C+
Sbjct: 423 FVNHSCDPNCKTEKWTVEGDTRIAVVALRHIEFGTELTFDYQWKSLGSRQLACHCNAA 480
>gi|169806539|ref|XP_001828014.1| hypothetical protein EBI_25885 [Enterocytozoon bieneusi H348]
gi|161779154|gb|EDQ31179.1| hypothetical protein EBI_25885 [Enterocytozoon bieneusi H348]
Length = 310
Score = 37.7 bits (86), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 50/100 (50%), Gaps = 12/100 (12%)
Query: 97 LVGCIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYINHDCRPNCRFVA 153
++G I ++ + EK + G+N M++ K+ A L A Y+NH C+PNC ++
Sbjct: 207 IIGKI--MSDKREKNYIKNGINSI-YMFALGKDKIIDATLVGNQARYVNHSCKPNC--LS 261
Query: 154 TE-RDTACVKV--LRDIEQGEEITCFY-GEDFFGDKNCLC 189
E DT +K +R I EEIT Y + KNC C
Sbjct: 262 HECLDTNTIKYYSIRTIYPDEEITIDYCNVNIINPKNCNC 301
>gi|154313916|ref|XP_001556283.1| predicted protein [Botryotinia fuckeliana B05.10]
Length = 380
Score = 37.7 bits (86), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 23/78 (29%), Positives = 35/78 (44%), Gaps = 12/78 (15%)
Query: 138 AAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEIT---CFYGEDFFGDKNC------L 188
A+ NH C+ NC + ++ +RDI +GEEIT YG F N +
Sbjct: 171 ASGTNHACKTNCSRAFNSEHSISIRAMRDIRKGEEITHNYGAYGTASFRAGNIAERWKFI 230
Query: 189 CECDTCEIRMKGAFAQSS 206
C C+ C +KG + +
Sbjct: 231 CTCNAC---IKGEYVPET 245
>gi|326469547|gb|EGD93556.1| hypothetical protein TESG_01099 [Trichophyton tonsurans CBS 112818]
gi|326478926|gb|EGE02936.1| hypothetical protein TEQG_01974 [Trichophyton equinum CBS 127.97]
Length = 337
Score = 37.7 bits (86), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 31/75 (41%), Gaps = 10/75 (13%)
Query: 130 CAQLWLGPAAYINHDCRPNCRFVATERDTACVKVLRDIEQGEEITCFYGE--DFFGDK-- 185
C L P NH C NC+ + C +RDI GEE+T Y FF +
Sbjct: 117 CGMFILQP--RFNHSCTANCKAPFNGMEAICTYAVRDIAAGEELTLNYDARVTFFPPRER 174
Query: 186 ----NCLCECDTCEI 196
+C+C C+I
Sbjct: 175 HASLGFVCDCPACDI 189
>gi|242786320|ref|XP_002480782.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
gi|218720929|gb|EED20348.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
Length = 1155
Score = 37.7 bits (86), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 50/117 (42%), Gaps = 13/117 (11%)
Query: 91 NDNIAYLVG--CIAELTKEEEKRLLHTGLNDFSVMYSCRKNC---AQLWLGPAAYINHDC 145
ND I VG ++ E+R L +G+ S ++ +N A G A +INH C
Sbjct: 1036 NDMIIEYVGEKVRQQVADMRERRYLKSGIGS-SYLFRIDENAVIDATKRGGIARFINHSC 1094
Query: 146 RPNCRFVATERDTA---CVKVLRDIEQGEEITCFYGEDFFGDKN----CLCECDTCE 195
PNC D + + LRDI + EE+T Y + D CLC C+
Sbjct: 1095 TPNCTAKIIRVDGSKRIVIYALRDISKDEELTYDYKFEREWDSEDRIPCLCGSAGCK 1151
>gi|21064853|gb|AAM29656.1| SD13650p [Drosophila melanogaster]
Length = 1019
Score = 37.7 bits (86), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 51/126 (40%), Gaps = 9/126 (7%)
Query: 78 RGARIVATRKWEKNDNIAYLVGCIAELTKEEEKRLLHTGLNDFSVMYSC---RKNCAQLW 134
+G + A R EK+ I +G + E + + N M+ R A L
Sbjct: 889 QGLGLYAARDIEKHTMIIEYIGEVIRTEVSEIREKQYESKNRGIYMFRLDEDRVVDATLS 948
Query: 135 LGPAAYINHDCRPNC--RFVATERDTACVKVL-RDIEQGEEITCFYGEDFFGDKN---CL 188
G A YINH C PNC V +RD + R I +GEE++ Y D + + C
Sbjct: 949 GGLARYINHSCNPNCVTEIVEVDRDVRIIIFAKRKIYRGEELSYDYKFDIEDESHKIPCA 1008
Query: 189 CECDTC 194
C C
Sbjct: 1009 CGAPNC 1014
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.323 0.137 0.435
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,370,917,070
Number of Sequences: 23463169
Number of extensions: 173446431
Number of successful extensions: 391267
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 385
Number of HSP's successfully gapped in prelim test: 1230
Number of HSP's that attempted gapping in prelim test: 389490
Number of HSP's gapped (non-prelim): 1839
length of query: 278
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 138
effective length of database: 9,074,351,707
effective search space: 1252260535566
effective search space used: 1252260535566
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 76 (33.9 bits)